Commit

feat: add HTR data sets via search in Hugging Face
mhucka committed Mar 5, 2024
1 parent 1d6a014 commit 5a8e7b0
Showing 1 changed file with 2 additions and 1 deletion.
3 changes: 2 additions & 1 deletion README.md
@@ -180,7 +180,8 @@ Note: datasets for training and testing are listed in a [separate section](#data

- [Datasets from the National Library of Sweden](https://huggingface.co/KBLab) – available on Hugging Face
- [Gensim datasets](https://github.com/piskvorky/gensim-data#readme) – repository of datasets for unstructured text processing
- [HTR datasets in Zenodo](https://zenodo.org/search?q=metadata.subjects.subject%3A%22handwritten%20text%20recognition%22&l=list&p=1&s=10&sort=bestmatch) – based on subject search in Zenodo
- [HTR datasets in Hugging Face](https://huggingface.co/search/full-text?q=Handwritten+Text+Recognition&type=dataset) – subject search in Hugging Face
- [HTR datasets in Zenodo](https://zenodo.org/search?q=metadata.subjects.subject%3A%22handwritten%20text%20recognition%22&l=list&p=1&s=10&sort=bestmatch) – subject search in Zenodo
- [HTR-United](https://htr-united.github.io) – datasets for training transcription or segmentation models
- [Kaggle datasets](https://www.kaggle.com/datasets)
- [nlp-datasets](https://github.com/niderhoff/nlp-datasets#readme) – free/public domain datasets with text data for use in NLP
