Skip to content

Commit

Permalink
feat: add data sets from Nat. Lib of Scotland
Browse files Browse the repository at this point in the history
  • Loading branch information
mhucka committed Feb 21, 2024
1 parent ab89320 commit 6bacd5a
Showing 1 changed file with 2 additions and 1 deletion.
3 changes: 2 additions & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -179,12 +179,13 @@ Note: datasets for training and testing are listed in a [separate section](#data

- [Datasets from the National Library of Sweden](https://huggingface.co/KBLab) – available on Hugging Face
- [Gensim datasets](https://github.com/piskvorky/gensim-data#readme) – repository of datasets for unstructured text processing
- [HTR datasets in Zenodo](https://zenodo.org/search?q=metadata.subjects.subject%3A%22handwritten%20text%20recognition%22&l=list&p=1&s=10&sort=bestmatch) – based on subject search in Zenodo
- [HTR-United](https://htr-united.github.io) – datasets for training transcription or segmentation models
- [Kaggle datasets](https://www.kaggle.com/datasets)
- [nlp-datasets](https://github.com/niderhoff/nlp-datasets#readme) – free/public domain datasets with text data for use in NLP
- [Open data collections from the National Library of Scotland](https://data.nls.uk/)
- [Open Library data dumps](https://openlibrary.org/developers/dumps) – from the Internet Archive
- [Registry of Open Data on AWS](https://registry.opendata.aws) – datasets tagged by topic
- [HTR datasets in Zenodo](https://zenodo.org/search?q=metadata.subjects.subject%3A%22handwritten%20text%20recognition%22&l=list&p=1&s=10&sort=bestmatch) – based on subject search in Zenodo


## Projects, Initiatives, and Case Studies<a title="Suggest an addition to the list!" href="https://forms.gle/aPA41GT5AmbxrTwq5"><img alt="Click button to suggest an addition" align="right" src="https://raw.githubusercontent.com/AI4LAM/awesome-ai4lam/main/.graphics/suggest-addition-small.svg"></a>
Expand Down

0 comments on commit 6bacd5a

Please sign in to comment.