blob: 0becfb5c05c00a98fb86c008041ba7aa77246ce6 (
plain) (
blame)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
|
#+SETUPFILE: ~/.emacs.d/org-templates/projects.org
#+EXPORT_FILE_NAME: index
#+TITLE: Draco
Draco is a script to convert reddit thread to Org document. It accepts a
url & prints the Org document to STDOUT. It'll also print comments along
with their replies.
| Project Home | [[https://andinus.nand.sh/draco/][Draco]] |
| Source Code | [[https://git.tilde.institute/andinus/draco/][Andinus / Draco]] |
| GitHub (Mirror) | [[https://github.com/andinus/draco/][Draco - GitHub]] |
* Why?
I reference things from the web in my Journal & don't want those links
to break so I save them locally. Previously I used to manually archive
the whole thread, this automates it.
* Demo
This was recorded with =asciinema(1)=.
[[https://asciinema.org/a/373860][https://asciinema.org/a/373860.png]]
+ Draco v0.1.2: https://asciinema.org/a/373860
+ Draco 2020-11-19: https://asciinema.org/a/373851
+ alt-links (download)
- v0.1.2: https://andinus.nand.sh/static/draco/v0.1.2.cast
- 2020-11-19: https://andinus.nand.sh/static/draco/2020-11-19.cast
* Installation
Follow these instructions to get draco & then install the dependencies,
they're listed below. All dependencies are in Debian & Fedora
repositories.
Check the /News/ section before updating or downloading latest release.
** Release
Release archives are generated by cgit/GitHub.
1. Download the release:
- https://git.tilde.institute/andinus/draco
- https://github.com/andinus/draco/releases
2. Extract the file.
3. =cd= into the directory.
4. Run =make install= as root.
5. Install dependencies.
** From Source
All commits will be signed by my [[https://andinus.nand.sh/static/D9AE4AEEE1F1B3598E81D9DFB67D55D482A799FD.asc][PGP Key]].
#+BEGIN_SRC sh
# Clone the project.
git clone https://git.tilde.institute/andinus/draco
cd draco
# Install draco. Use `sudo' if `doas' is not present.
doas make install
# Install dependencies. See the section below.
#+END_SRC
* Dependencies
** OpenBSD
#+BEGIN_SRC sh
doas pkg_add p5-Unicode-LineBreak p5-JSON-MaybeXS
cpan install HTTP::Tiny
#+END_SRC
** Debian (apt)
#+BEGIN_SRC sh
sudo apt install libunicode-linebreak-perl libjson-maybexs-perl \
libhttp-tiny-perl
#+END_SRC
** Fedora (dnf)
#+BEGIN_SRC sh
sudo dnf install perl-JSON-MaybeXS perl-HTTP-Tiny perl-Unicode-LineBreak
#+END_SRC
* News
** v0.2.2 - 2020-11-24
This version is mostly structural changes, it'll now be easier to add
code to fetch comments hidden behind "continue this thread".
+ Add more debug information.
** v0.2.1 - 2020-11-24
+ Previously fetching comments hidden under "load more comments" would
fail if the url passed by user ends in "/". This has been fixed in
this release.
** v0.2.0 - 2020-11-23
This version makes the script lot more complex. If you download only
small threads then this update is not required.
Previous version (v0.1.3) might throw some errors on threads that have
comments hidden behind "load more comments" but the rest of thread will
be saved.
This version will load all those comments hidden behind "load more
comments". But not those hidden behind "continue this thread". This is a
known bug.
+ Add "[S]" after submitter's comments.
+ Print comments hidden under "load more comments".
+ Document environment variables in manual.
+ Add "limit=500" & "sort=top" to all posts/comments.
+ Print more information when debug is on.
+ Add help option.
|