In this notebook, I classify emotion in speech from audio files using different machine learning techniques.
For this purpose, I use the Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS). I use only the speech audio data, which consists of 1440 files. The database contains recordings of 24 professional actors (12 female, 12 male) vocalizing two lexically matched statements in a neutral North American accent. Speech includes calm, happy, sad, angry, fearful, surprise, and disgust expressions, and song contains calm, happy, sad, angry, and fearful emotions. Each expression is produced at two levels of emotional intensity (normal, strong), with an additional neutral expression. The data is freely available for download here: https://zenodo.org/record/1188976#.YUuSMC0es1I
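Since the emotion labels are not stored separately, they can be parsed from the file names: each RAVDESS filename encodes seven hyphen-separated fields (modality, vocal channel, emotion, intensity, statement, repetition, actor). Below is a minimal sketch of extracting the label for each file, assuming the archive has been extracted into a local `ravdess/` directory (a hypothetical path):

```python
from pathlib import Path

# RAVDESS filenames look like "03-01-06-01-02-01-12.wav"; the third
# hyphen-separated field is the emotion code.
EMOTIONS = {
    "01": "neutral", "02": "calm", "03": "happy", "04": "sad",
    "05": "angry", "06": "fearful", "07": "disgust", "08": "surprised",
}

def label_from_filename(path: Path) -> str:
    """Return the emotion label encoded in a RAVDESS filename."""
    emotion_code = path.stem.split("-")[2]
    return EMOTIONS[emotion_code]

# Collect (file, label) pairs for all audio files.
# "ravdess/" is a hypothetical directory holding the extracted archive.
data = [(f, label_from_filename(f)) for f in Path("ravdess").rglob("*.wav")]
```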
Additionally, I provide a small survey of human accuracy on this database, which is available for download.