ylivuoto/AIR-project

Content Analysis of Twitter Corpus

This work was done by four Erasmus+ students at Technische Universität Graz during the winter semester of 2022. It is based on the materials and lectures of the Advanced Information Retrieval course at TU Graz.

Dataset

The project uses a dataset from Hugging Face. The whole dataset can be downloaded here.

Code

All of the Python code is provided as .ipynb notebooks, which can be opened in a collaborative web tool such as Google Colab or Kaggle, or locally, e.g. with Jupyter.
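If none of those tools is at hand, the code can still be read directly, because an .ipynb file is a JSON document whose `cells` list holds the notebook content. The following is a minimal sketch using only the Python standard library. The function name `read_code_cells` is a hypothetical helper chosen for illustration and is not part of this repository.

```python
import json
from pathlib import Path


def read_code_cells(path):
    """Return the source text of every code cell in a .ipynb file, in order.

    Hypothetical helper for illustration; not part of this repository.
    """
    notebook = json.loads(Path(path).read_text(encoding="utf-8"))
    cells = []
    for cell in notebook.get("cells", []):
        # Skip markdown and raw cells; only code cells are of interest here.
        if cell.get("cell_type") != "code":
            continue
        source = cell.get("source", "")
        # The notebook format stores source either as a single string
        # or as a list of lines, so normalise to one string.
        if isinstance(source, list):
            source = "".join(source)
        cells.append(source)
    return cells
```

Calling `read_code_cells("some_notebook.ipynb")`, where the file name is a placeholder, returns a list of strings that can be printed or searched without starting a notebook server.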

Structure

The project is structured as follows:

Bias Analysis

Toxicity Analysis

Emotion Analysis

Positivity Analysis

Overrepresented Words

Similarity Measures
