KCTErrorExtractor

Extracts error types from the Karslruhe Childrens' Texts corpus. This code was used for the extraction of errors in the article Weiss & Meurers (2019).

annotated_document.py reads documents in the format of the KCT corpus (Lavalley et al. 2015) or the H1/H2 corpus (Berkling 2016, 2018) and retrieves information on different error types from it. You may output the target hypothesis and the original writing as two sperate plain text files and additionally extract statistics on different error types saved in a separate csv.

See main_extract_h1_or_h2.py and main_extract_kct.py for use examples.

References

Berkling, K. (2018). A 2nd Longitudinal Corpus for Children's Writing with Enhanced Output for Specific Spelling Patterns. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC-2018).
Berkling, K. (2016). Corpus for Children's Writing with Enhanced Output for Specific Spelling Patterns (2nd and 3rd Grade). In LREC.
Lavalley, R., Berkling, K., & Stüker, S. (2015). Preparing children's writing database for automated processing. In LTLT@ SLaTE (pp. 9-15).
Weiss, Z., Meurers, D. (2019). Analyzing Linguistic Complexity and Accuracy in Academic Language Development of German across Elementary and Secondary School. In Proceedings of the 14th Workshop on Innovative Use of NLP for Building Educational Applications.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
KCT_Long_Reader.py		KCT_Long_Reader.py
KCT_Reader.py		KCT_Reader.py
LICENSE		LICENSE
README.md		README.md
annotated_document.py		annotated_document.py
main_extract_h1_or_h2.py		main_extract_h1_or_h2.py
main_extract_kct.py		main_extract_kct.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

KCTErrorExtractor

References

About

Releases

Packages

Languages

License

zweiss/KCTErrorExtractor

Folders and files

Latest commit

History

Repository files navigation

KCTErrorExtractor

References

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages