#+SETUPFILE: ~/.emacs.d/org-templates/projects.org #+EXPORT_FILE_NAME: index #+TITLE: Draco Draco is a script to convert reddit thread to Org document. It accepts a url & prints the Org document to STDOUT. It'll also print comments along with their replies. | Project Home | [[https://andinus.nand.sh/draco/][Draco]] | | Source Code | [[https://git.tilde.institute/andinus/draco/][Andinus / Draco]] | | GitHub (Mirror) | [[https://github.com/andinus/draco/][Draco - GitHub]] | * Why? I reference things from the web in my Journal & don't want those links to break so I save them locally. Previously I used to manually archive the whole thread, this automates it. * Demo This was recorded with =asciinema(1)=. [[https://asciinema.org/a/373860][https://asciinema.org/a/373860.png]] + Draco v0.1.2: https://asciinema.org/a/373860 + Draco 2020-11-19: https://asciinema.org/a/373851 + alt-links (download) - v0.1.2: https://andinus.nand.sh/static/draco/v0.1.2.cast - 2020-11-19: https://andinus.nand.sh/static/draco/2020-11-19.cast * Installation Follow these instructions to get draco & then install the dependencies, they're listed below. All dependencies are in Debian & Fedora repositories. Check the /News/ section before updating or downloading latest release. ** Release Release archives are generated by cgit/GitHub. 1. Download the release: - https://git.tilde.institute/andinus/draco - https://github.com/andinus/draco/releases 2. Extract the file. 3. =cd= into the directory. 4. Run =make install= as root. 5. Install dependencies. ** From Source All commits will be signed by my [[https://andinus.nand.sh/static/D9AE4AEEE1F1B3598E81D9DFB67D55D482A799FD.asc][PGP Key]]. #+BEGIN_SRC sh # Clone the project. git clone https://git.tilde.institute/andinus/draco cd draco # Install draco. Use `sudo' if `doas' is not present. doas make install # Install dependencies. See the section below. #+END_SRC * Dependencies ** OpenBSD #+BEGIN_SRC sh doas pkg_add p5-Unicode-LineBreak p5-JSON-MaybeXS cpan install HTTP::Tiny #+END_SRC ** Debian (apt) #+BEGIN_SRC sh sudo apt install libunicode-linebreak-perl libjson-maybexs-perl \ libhttp-tiny-perl #+END_SRC ** Fedora (dnf) #+BEGIN_SRC sh sudo dnf install perl-JSON-MaybeXS perl-HTTP-Tiny perl-Unicode-LineBreak #+END_SRC * News ** v0.3.1 - 2020-11-25 Minor improvement. + Put author name in code block if it begins & ends with "_". Org underlines headings that begin & end with "_". ** v0.3.0 - 2020-11-24 This version adds code to fetch all the comments in a thread. Now users can archive the whole thread. Everyone should get this update, the code has become a lot more complex since v0.1.3. If you don't want to update then you can get the patches for small changes from the git history. + Fetch all the comments. + Add debug message for HTTP calls. It'll print a "." for every HTTP call. Users will be able to tell when the script is making HTTP calls. ** v0.2.2 - 2020-11-24 This version is mostly structural changes, it'll now be easier to add code to fetch comments hidden behind "continue this thread". + Add more debug information. ** v0.2.1 - 2020-11-24 + Previously fetching comments hidden under "load more comments" would fail if the url passed by user ends in "/". This has been fixed in this release. ** v0.2.0 - 2020-11-23 This version makes the script lot more complex. If you download only small threads then this update is not required. Previous version (v0.1.3) might throw some errors on threads that have comments hidden behind "load more comments" but the rest of thread will be saved. This version will load all those comments hidden behind "load more comments". But not those hidden behind "continue this thread". This is a known bug. + Add "[S]" after submitter's comments. + Print comments hidden under "load more comments". + Document environment variables in manual. + Add "limit=500" & "sort=top" to all posts/comments. + Print more information when debug is on. + Add help option.