News Category Classifier

Using only PyTorch's matrix multiplications, and no other built-in APIs to understand the math behind!

A self project to learn in depth about machine learning, its pitfalls, why the architectures are designed the way they are, and how to improve them.

Description

This project is an extension for the NewsSwipe app I built in January 2024. (checkout the app here). The main goal of this project is to categorize news articles into different categories. (e.g. sports, politics, technology, etc.)

I'm using different machine learning techniques and deep learning to achieve this goal.

I've only implemented a simple RNN model and a LSTM model from scratch (using only pytorch) till now, and achieving accuracy of 83% and 86% respectively on the test dataset.

Planning to implement more complex models and techniques in the future while investigating the pitfalls of each model and documenting them.

LSTM Model (crux)

Initialization

Training Loop

Dataset

The dataset used for this project is the "AG News" dataset. It is a collection of news articles from the AG's corpus of news articles on the web. The dataset contains 120,000 training samples and 7,600 testing samples from 42 different classes.

The dataset can be found here, in which orignal authors implemented a ConvNet on character-level inputs. The paper can be found here.

Requirements

Python 3.6+
Pytorch
Numpy

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
datasets		datasets
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
gradients.ipynb		gradients.ipynb
model.pth		model.pth
ncc_v1_lstm_torch_ag_dataset.ipynb		ncc_v1_lstm_torch_ag_dataset.ipynb
ncc_v1_rnn_np.ipynb		ncc_v1_rnn_np.ipynb
ncc_v2_rnn_torch.ipynb		ncc_v2_rnn_torch.ipynb
ncc_v2_rnn_torch_ag_dataset.ipynb		ncc_v2_rnn_torch_ag_dataset.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

News Category Classifier

Description

LSTM Model (crux)

Initialization

Training Loop

Dataset

Requirements

About

Releases

Packages

Languages

anshmehtamm/lstm-pytorch

Folders and files

Latest commit

History

Repository files navigation

News Category Classifier

Description

LSTM Model (crux)

Initialization

Training Loop

Dataset

Requirements

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages