Load articles from other file formats #257

Open
Jenselme opened this issue Oct 4, 2024 · 2 comments
Labels
enhancement New feature or request

Comments

@Jenselme
Owner

Jenselme commented Oct 4, 2024

We could load data from:

  • epub: should be easily doable, it’s HTML in a ZIP. It would be nice to offer several import modes: all the HTML pages merged into one article, each page as an independent article, or each page as an article in a group of related articles. The only issue is loading the ZIP while staying protected from ZIP bombs.
  • odt/docx: it’s XML in a ZIP. I don’t think converting to HTML can be part of this project. We could have a service based on LibreOffice that does the conversion to HTML for us.
  • pdf: probably the hardest one.
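
One possible way to read the HTML pages of an epub while guarding against ZIP bombs is sketched below. This is only an illustration built on Python's standard `zipfile` module, not code from this project: the function name `read_epub_html_pages`, the `UnsafeArchiveError` exception, and all three limits are assumptions picked for the example and would need tuning.

```python
import io
import zipfile

# Hypothetical limits, chosen only for illustration.
MAX_ENTRIES = 2000                    # refuse archives with too many members
MAX_TOTAL_BYTES = 50 * 1024 * 1024    # cap on total decompressed HTML
MAX_RATIO = 100                       # cap on declared compression ratio
HTML_SUFFIXES = (".html", ".xhtml", ".htm")


class UnsafeArchiveError(Exception):
    """Raised when the archive looks like a ZIP bomb."""


def read_epub_html_pages(data: bytes) -> dict[str, bytes]:
    """Return {archive path: content} for the HTML pages of an epub."""
    pages: dict[str, bytes] = {}
    total = 0
    with zipfile.ZipFile(io.BytesIO(data)) as archive:
        infos = archive.infolist()
        if len(infos) > MAX_ENTRIES:
            raise UnsafeArchiveError("too many entries")
        for info in infos:
            if info.is_dir() or not info.filename.lower().endswith(HTML_SUFFIXES):
                continue
            # Cheap early check based on the sizes declared in the archive.
            if info.compress_size > 0 and info.file_size / info.compress_size > MAX_RATIO:
                raise UnsafeArchiveError(f"suspicious ratio for {info.filename}")
            # Stream the member in chunks and count what is actually produced,
            # so memory use stays bounded even if the declared sizes are wrong.
            chunks = []
            with archive.open(info) as member:
                while chunk := member.read(64 * 1024):
                    total += len(chunk)
                    if total > MAX_TOTAL_BYTES:
                        raise UnsafeArchiveError("decompressed data too large")
                    chunks.append(chunk)
            pages[info.filename] = b"".join(chunks)
    return pages
```

Nothing is extracted to disk here, which also sidesteps path traversal through crafted member names. Nested archives are not opened at all, so recursive bombs are not expanded.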
@Jenselme Jenselme added the enhancement New feature or request label Oct 4, 2024
@Jenselme Jenselme self-assigned this Oct 17, 2024
@Jenselme Jenselme added this to the December 2024 milestone milestone Oct 17, 2024
@Jenselme
Owner Author

Jenselme commented Oct 17, 2024

Let’s try to support:

  • Direct HTML upload.
  • Direct MHTML upload.
  • epub upload (if I can manage to guard against ZIP bombs).
  • Link that responds with an epub.

Other formats are too unusual to support directly. For those, let’s just check that we can save the link so it can be kept around with tags. Users will be able to convert the document to HTML themselves if they want to save its content.
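
To tell the supported upload types apart, the content itself can be inspected instead of trusting the file extension. The sketch below is an assumption about how that could look, using only the standard library. The function name `sniff_upload_format` and the returned labels are hypothetical. It relies on two format facts: an epub is a ZIP whose `mimetype` entry contains `application/epub+zip`, and an MHTML file is a MIME message whose content type is `multipart/related`.

```python
import io
import zipfile
from email.parser import BytesParser


def sniff_upload_format(data: bytes) -> str:
    """Return "epub", "mhtml", "html" or "unknown" for uploaded bytes."""
    if zipfile.is_zipfile(io.BytesIO(data)):
        try:
            with zipfile.ZipFile(io.BytesIO(data)) as archive:
                info = archive.getinfo("mimetype")
                # The mimetype entry is tiny, so refuse to read a large one.
                if info.file_size <= 64 and archive.read(info).strip() == b"application/epub+zip":
                    return "epub"
        except (KeyError, zipfile.BadZipFile):
            pass  # no mimetype entry or corrupt archive: not an epub
        return "unknown"

    # MHTML files start with MIME headers; only the headers are parsed here.
    headers = BytesParser().parsebytes(data, headersonly=True)
    if headers.get_content_type() == "multipart/related":
        return "mhtml"

    # Very rough HTML check on the beginning of the file.
    head = data[:2048].lstrip().lower()
    if head.startswith((b"<!doctype html", b"<html")):
        return "html"
    return "unknown"
```

The HTML check is deliberately naive: a page that starts with a byte order mark or a comment would be reported as `"unknown"`, so a real implementation would likely need something more tolerant.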

@Jenselme
Owner Author

Notes:

  • Currently, all articles must have a link. If we allow direct upload, this won’t be the case.
  • We would also need a way to store the images associated with the epub and update the HTML to point to the images we saved. This would require extra work and some changes to the deploy configuration to set where the images are stored.
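
The HTML update mentioned in the last note could look roughly like the sketch below. It is an illustration only, written with the standard `html.parser` module. The class `ImageSrcRewriter`, the `stored_urls` mapping, and the `/media/...` URL are hypothetical names. It assumes the images were already saved somewhere and that we know, for each path inside the epub, the URL it was saved under.

```python
import posixpath
from html import escape
from html.parser import HTMLParser


class ImageSrcRewriter(HTMLParser):
    """Re-emit a page, pointing <img src> at the URLs of the saved images."""

    def __init__(self, page_path: str, stored_urls: dict[str, str]):
        # Keep character references untouched in text so they are re-emitted as-is.
        super().__init__(convert_charrefs=False)
        self._page_dir = posixpath.dirname(page_path)
        self._stored_urls = stored_urls  # archive path -> saved URL (assumed)
        self._out: list[str] = []

    def _img(self, attrs, closing: str) -> str:
        parts = ["img"]
        for name, value in attrs:
            if name == "src" and value is not None:
                # src is relative to the page, so resolve it inside the archive.
                key = posixpath.normpath(posixpath.join(self._page_dir, value))
                value = self._stored_urls.get(key, value)
            parts.append(name if value is None else f'{name}="{escape(value, quote=True)}"')
        return "<" + " ".join(parts) + closing

    def handle_starttag(self, tag, attrs):
        self._out.append(self._img(attrs, ">") if tag == "img" else self.get_starttag_text())

    def handle_startendtag(self, tag, attrs):
        self._out.append(self._img(attrs, " />") if tag == "img" else self.get_starttag_text())

    def handle_endtag(self, tag):
        self._out.append(f"</{tag}>")

    def handle_data(self, data):
        self._out.append(data)

    def handle_entityref(self, name):
        self._out.append(f"&{name};")

    def handle_charref(self, name):
        self._out.append(f"&#{name};")

    def handle_comment(self, data):
        self._out.append(f"<!--{data}-->")

    def handle_decl(self, decl):
        self._out.append(f"<!{decl}>")

    def result(self) -> str:
        return "".join(self._out)


rewriter = ImageSrcRewriter(
    "OEBPS/text/ch1.xhtml",
    {"OEBPS/images/cat.png": "/media/42/cat.png"},
)
rewriter.feed('<p>Hi <img alt="A cat" src="../images/cat.png" /></p>')
rewriter.close()
rewritten_page = rewriter.result()
```

Images that are not in the mapping keep their original `src`. The sketch ignores percent-encoded paths, `srcset`, and images referenced from CSS, all of which a real implementation would have to handle.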

@Jenselme Jenselme removed their assignment Nov 8, 2024