This is a sentiment classification project using NLP for my course. The project uses a simple feedforward neural network (FNN) to classify sentences into a negative (0) or positive (1) class.
- train.txt: used for training; each line follows the pattern
[label] [sentence]
- dev.txt: used for evaluation (testing); each line follows the pattern
[label] [sentence]
- glove file: contains a pre-trained embedding for each word; each line follows the pattern
[word] [embeddings]
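For illustration, each line of these files can be parsed by splitting on whitespace; the helper names below are hypothetical and the exact delimiter handling is an assumption:

```python
# Minimal sketch of parsing one line from each file format.

def parse_labeled_line(line):
    """'[label] [sentence]' -> (int label, str sentence)."""
    label, sentence = line.strip().split(" ", 1)
    return int(label), sentence

def parse_glove_line(line):
    """'[word] [embeddings]' -> (str word, list of float values)."""
    parts = line.strip().split()
    return parts[0], [float(x) for x in parts[1:]]

label, sentence = parse_labeled_line("1 this movie was great")
word, vector = parse_glove_line("great 0.013 -0.24 0.51")
```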
- Data is prepared by first loading the GloVe file into a dictionary that maps each word to its embedding.
- The training & dev files are split into lines, normalized to lower case, and stripped of all special symbols.
- The lines are then tokenized using WhiteSpaceTokenizer() (a sketch of these steps follows below).
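A minimal sketch of this preparation step, assuming the GloVe file is plain text and that WhiteSpaceTokenizer() refers to NLTK's WhitespaceTokenizer (plain whitespace splitting); the function names are illustrative:

```python
import re
from nltk.tokenize import WhitespaceTokenizer

_tokenizer = WhitespaceTokenizer()

def load_glove(path):
    # Build the word -> embedding dictionary used to map words later.
    glove = {}
    with open(path, encoding="utf-8") as f:
        for line in f:
            parts = line.rstrip().split()
            glove[parts[0]] = [float(x) for x in parts[1:]]
    return glove

def normalize_and_tokenize(sentence):
    # Lower-case, drop special symbols, then split on whitespace.
    cleaned = re.sub(r"[^a-z0-9\s]", "", sentence.lower())
    return _tokenizer.tokenize(cleaned)
```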
A custom dataset was created using torch.utils.data.Dataset. It implements the __len__ and __getitem__ functions; __getitem__ uses the GloVe dictionary to map the input tokens to embeddings and returns torch tensors of both the input features and the labels. A DataLoader is then used to load the train and dev data for training and evaluation.
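A sketch of the dataset and loaders based on the description above; averaging the word embeddings into one fixed-size vector (so the FNN receives a fixed-length input), the 50-dimensional embedding size, and the batch size of 1 are assumptions, as are the variable names train_examples, dev_examples, and glove carried over from the preparation step:

```python
import torch
from torch.utils.data import Dataset, DataLoader

class SentimentDataset(Dataset):
    # examples: list of (label, tokens) pairs from the preprocessing step;
    # glove: the word -> embedding dictionary.
    def __init__(self, examples, glove, dim=50):
        self.examples = examples
        self.glove = glove
        self.dim = dim  # embedding size (assumed 50-dimensional GloVe)

    def __len__(self):
        return len(self.examples)

    def __getitem__(self, idx):
        label, tokens = self.examples[idx]
        # Map each token to its GloVe vector; words not in GloVe are skipped.
        vectors = [self.glove[t] for t in tokens if t in self.glove]
        if vectors:
            features = torch.tensor(vectors).mean(dim=0)  # average into one vector
        else:
            features = torch.zeros(self.dim)
        return features, torch.tensor(label, dtype=torch.float32)

train_loader = DataLoader(SentimentDataset(train_examples, glove), batch_size=1)
dev_loader = DataLoader(SentimentDataset(dev_examples, glove), batch_size=1)
```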
The model has a simple architecture: the inputs are first passed through a Linear layer, which transforms them via a matrix multiplication, followed by a ReLU activation. A second Linear layer then transforms the hidden-layer outputs, and a sigmoid function maps the result to a probability for the positive class.
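A minimal sketch of that architecture; the 50-dimensional input (matching the assumed GloVe size) and the hidden size of 64 are assumptions:

```python
import torch.nn as nn

class FNN(nn.Module):
    def __init__(self, input_dim=50, hidden_dim=64):
        super().__init__()
        self.fc1 = nn.Linear(input_dim, hidden_dim)  # matrix transform of the input
        self.relu = nn.ReLU()                        # activation on the hidden layer
        self.fc2 = nn.Linear(hidden_dim, 1)          # transform the hidden outputs
        self.sigmoid = nn.Sigmoid()                  # map to a value between 0 and 1

    def forward(self, x):
        return self.sigmoid(self.fc2(self.relu(self.fc1(x))))
```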
The model is trained with the Adam optimizer using a learning rate of 1e-4.
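A sketch of the training loop; only Adam and the 1e-4 learning rate come from the description above, while the BCELoss criterion (a natural pairing with the sigmoid output) and the epoch count are assumptions:

```python
import torch

# FNN and train_loader come from the sketches above.
model = FNN()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)
criterion = torch.nn.BCELoss()  # binary cross-entropy; this pairing is an assumption

for epoch in range(10):  # number of epochs is an assumption
    model.train()
    for features, labels in train_loader:
        optimizer.zero_grad()
        preds = model(features).squeeze(1)  # (batch, 1) -> (batch,)
        loss = criterion(preds, labels)
        loss.backward()
        optimizer.step()
```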
Possible improvements:
- A collate function could be used to pad each batch so that all sentence lengths are covered (see the sketch after this list).
- Further text preprocessing to remove stop words and recognize named entities.
- Loading the training data in batches (batch size > 1).
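For the first item, a padding collate function might look like the sketch below; it assumes __getitem__ would be changed to return a variable-length [sentence_length x embedding_dim] tensor of word embeddings instead of a single pooled vector:

```python
import torch
from torch.nn.utils.rnn import pad_sequence

def pad_collate(batch):
    # batch: list of (embedding sequence tensor [len x dim], label) pairs
    # of varying length, per the assumption above.
    sequences, labels = zip(*batch)
    padded = pad_sequence(sequences, batch_first=True)  # pad to the longest sentence
    lengths = torch.tensor([len(s) for s in sequences])
    return padded, lengths, torch.stack(labels)

# Example usage (batch_size and collate_fn would replace the loaders above):
# loader = DataLoader(dataset, batch_size=32, collate_fn=pad_collate)
```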