- Amazon review dataset
- Yelp Dataset Challenge At this stage we are testing different models on digital music category of Amazon review dataset.
- Bag of Words
- word2vec
- Random Forest
- SVM
- TF/TF-IDF + Naïve bayes
- Kaggle's tutorial on text classification gave us a solid start on this project.