Tools for working with LaTeX files, including automatic publication preparation, reference formatting and reference extraction.
Python 3.6 or later is required, for formatted string literals (f-strings) and for straightforward handling of Unicode strings.
External dependencies are bs4, requests and html5lib.
Run the unit tests with
python -m unittest discover tests/ -b -v -c
Automatically extract author names from the references in a tex file, using each reference's DOI or arXiv identifier, and open a Google search page for each name. Only references from 2000 or later are considered, and each author name is kept once. References that could not be processed are listed at the end of the run so they can be checked manually.
python reference_scraper.py latex_file
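The filtering and de-duplication described above can be sketched roughly as follows. This is a minimal illustration, not the code in reference_scraper.py: the function names, the dictionary layout of a reference (`year`, `authors`), and the case-insensitive name comparison are all assumptions made for the example.

```python
from urllib.parse import quote_plus

MIN_YEAR = 2000  # cutoff year taken from the description above


def unique_recent_authors(references, min_year=MIN_YEAR):
    """Collect author names from references published in min_year or later.

    `references` is a hypothetical list of dicts with "year" and "authors"
    keys. Each name is returned once, in first-seen order.
    """
    seen = set()
    authors = []
    for ref in references:
        if ref["year"] < min_year:
            continue  # skip references older than the cutoff
        for name in ref["authors"]:
            key = name.casefold()  # assumption: compare names case-insensitively
            if key not in seen:
                seen.add(key)
                authors.append(name)
    return authors


def google_search_url(name):
    """Build a Google search URL for one author name."""
    return "https://www.google.com/search?q=" + quote_plus(name)
```

Each resulting URL could then be passed to `webbrowser.open` from the standard library to open the search page.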
Automatically format references correctly. The system relies on DOIs and arXiv identifiers to extract information from the correct databases. If a paper has been published and the arXiv page updated with the DOI, the scraper will prefer the DOI data over the arXiv data.
python reference_formatter.py latex_file
There is also an option --add_arxiv, if you want to add arXiv references as well.
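The rule of preferring DOI data over arXiv data can be illustrated with a small sketch. The regular expressions, the function names and the `arxiv_record_doi` parameter are assumptions for this example and are not taken from reference_formatter.py; the patterns only cover common DOI forms and new-style arXiv identifiers such as 1501.00001.

```python
import re

# Assumed patterns: DOIs of the form 10.NNNN/suffix, and new-style arXiv IDs.
DOI_RE = re.compile(r"\b10\.\d{4,9}/[^\s{}]+")
ARXIV_RE = re.compile(r"arXiv:\s*(\d{4}\.\d{4,5})(?:v\d+)?", re.IGNORECASE)


def find_identifiers(bibitem_text):
    """Return the first DOI and arXiv identifier found in a bibitem, if any."""
    doi = DOI_RE.search(bibitem_text)
    arxiv = ARXIV_RE.search(bibitem_text)
    return {
        # Trailing sentence punctuation is not part of the DOI.
        "doi": doi.group(0).rstrip(".,;") if doi else None,
        "arxiv": arxiv.group(1) if arxiv else None,
    }


def choose_source(identifiers, arxiv_record_doi=None):
    """Pick which identifier to look up, preferring a DOI when one exists.

    `arxiv_record_doi` stands for a DOI listed on the arXiv page of a
    paper that has since been published (hypothetical parameter).
    """
    doi = identifiers["doi"] or arxiv_record_doi
    if doi:
        return ("doi", doi)
    if identifiers["arxiv"]:
        return ("arxiv", identifiers["arxiv"])
    return (None, None)  # nothing usable: leave for manual checking
```

With this layout, a reference that only carries an arXiv identifier is still resolved through its DOI once the arXiv record reports one.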
- Handle multiple references in the same bibitem
- Add option to format single reference