TODO
ffnet category and property. Append in brackets to title?
fetcher_args: minimum length of forum post to consider a chapter
"update" operation. Fetch new chapters.
verbose mode (or dry-run mode?) that prints all the config settings at the beginning of execution (and then exits?)
spacebattles: category for all posts by OP, even if not threadmarked
serialize json: omit null values? and in deserialization, assume null for missing fields?
mention in the EPUB which comment mode was used during fetching
make the option have effect when reading from the archive, too
collapse double space after period? or general html sanitizer
https://pypi.org/project/html-sanitizer/
Python 3.8
functools.cached_property
centralize console output in some quietable mechanism
test coverage wordpress models
images
retry around network fetches? Does requests already do that? And praw?
options to set metadata in output epub. essentially passthrough.
IP in metadata for Calibre
Plain-text output suitable for text-to-speech
TextWriter class --writer [epub, text, none] instead of --write_epub
Readme: text output for TTS
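Sketch for the "serialize json: omit null values?" question above. This is only one possible approach, assuming the models are plain dataclasses with None defaults; the Chapter class and both helper names are hypothetical, not taken from the codebase.

```python
import json
from dataclasses import dataclass, asdict, fields
from typing import Optional


@dataclass
class Chapter:  # hypothetical model, for illustration only
    title: str
    author: Optional[str] = None
    published: Optional[str] = None


def to_json(obj) -> str:
    # Drop top-level keys whose value is None so they never reach the output.
    # Nested None values are left alone in this sketch.
    data = {k: v for k, v in asdict(obj).items() if v is not None}
    return json.dumps(data, sort_keys=True)


def from_json(cls, text: str):
    raw = json.loads(text)
    # Missing keys fall back to the dataclass defaults (None here);
    # keys the model does not know about are ignored.
    known = {f.name for f in fields(cls)}
    return cls(**{k: v for k, v in raw.items() if k in known})
```

Round trip: to_json(Chapter(title="One")) writes only the title, and from_json(Chapter, ...) restores author and published as None. One trade-off: a field explicitly set to null and a field that was never written become indistinguishable.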
SB:
category for "all posts by author in a given thread longer than configurable minimum length".
Different URL, though... can the categories be classes?
multi-thread stories
categories uniq
Comments unique ID. Use it to prevent duplicates?
genericize to work for SV
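Sketch for "Comments unique ID. Use it to prevent duplicates?" above. It assumes each comment is a mapping with a hashable "id" entry; that shape and the function name are assumptions, not the project's actual model.

```python
from typing import Hashable, Iterable, Iterator, Mapping, Set


def unique_comments(comments: Iterable[Mapping], key: str = "id") -> Iterator[Mapping]:
    """Yield comments in their original order, skipping repeated IDs."""
    seen: Set[Hashable] = set()
    for comment in comments:
        comment_id = comment[key]
        if comment_id in seen:
            continue  # same ID already emitted, e.g. fetched from two pages
        seen.add(comment_id)
        yield comment
```

The first occurrence wins, so an edited repost with the same ID would be dropped; keeping the last occurrence instead would need a dict keyed by ID.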
wordpress.Fetcher move that link_or_whitespace method into a new subclass of bs4.Element ?
Royal Road:
handle polls
AO3 "show more comments". See Crimson ch 103.
multiple comment rules (author or regex match). Handle "click to show more comments" deep chains. Example https://archiveofourown.org/works/11310144/chapters/30565317#workskin
postprocessors
download images
comment pruner
normalize HTML?
download last page twice (second time forced) to bust cache
document the force-fetch option, and remove it from my config
split story into books. take start chapter of each book. external script? One of several in bin/ that import common modules.
compress local http cache and archive
flaresolver-assistant integration
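Sketch for "compress local http cache and archive" above. It assumes the cache stores one file per response on disk, which may not match the real layout; compress_file and read_cached are hypothetical helper names. Only the standard-library gzip module is used.

```python
import gzip
from pathlib import Path


def compress_file(path: Path) -> Path:
    """Replace path with a gzip-compressed sibling named <name>.gz."""
    target = path.with_name(path.name + ".gz")
    target.write_bytes(gzip.compress(path.read_bytes()))
    path.unlink()  # remove the uncompressed original
    return target


def read_cached(path: Path) -> bytes:
    """Read a cache entry whether or not it has been compressed yet."""
    gz = path.with_name(path.name + ".gz")
    if gz.exists():
        return gzip.decompress(gz.read_bytes())
    return path.read_bytes()
```

Reading through read_cached lets old uncompressed entries and new compressed ones coexist, so the cache could be migrated gradually.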