Skip to content

Commit

Permalink
Create Wiki
Browse files Browse the repository at this point in the history
  • Loading branch information
JingQunCui authored Dec 5, 2023
1 parent 9e63ed9 commit 2abee4c
Showing 1 changed file with 10 additions and 0 deletions.
10 changes: 10 additions & 0 deletions ACMAS/app/ACMAS_Web/Wiki
Original file line number Diff line number Diff line change
@@ -0,0 +1,10 @@
Overview:
OCR is implemented with the usage of a python module PyMuPDF, which parses a pdf upload and extracts text from it with very high accuracy.
OCR should be called during each non-text file uploads. Along with the original uploaded file, upload a text file with the same pre-extension name that has the extracted text in it.
Currently the OCR functionality is incomplete.

Issues:
Had some troubles installing and testing PyMuPDF with the python alpine version used.

Solution:
Installed the module from source with custom made wheels to import the package

0 comments on commit 2abee4c

Please sign in to comment.