This work is done by four Techniche Universität Graz Erasmus+ students during the Winter Semester 2022. The work is based on the materials and lectures of the Advanced Information Retrieval course in TU Graz.
In the project we used dataset from Hugging Face. Whole the dataset could be downloaded here.
All of the python code is provided via .ipynb notebooks, which can be opened with some collaborative web-tool like Google Colab or Kaggle or locally e.g with Jyputer.
Structure of the whole project is following: