A content-based anime recommendation system uses cosine similarity to recommend animes. Built systems use only genres and synopsis(summary) for a recommendation.
This dataset is taken from ::point_right: https://www.kaggle.com/datasets/hernan4444/anime-recommendation-database-2020
Dataset contains 16206 animes
- MAL_ID
- Name
- Score
- Genres
- sypnopsis
This type of recommendation system takes in an anime for instance that a user currently likes as input. Then it analyzes the contents of the anime to find other animes that have similar content. Then it recommends the most relevant animes according to their ranks.
Cosine similarity is a metric used to measure how similar the documents are irrespective of their size. Mathematically, it measures the cosine of the angle between two vectors projected in a multi-dimensional space.The smaller the angle, higher the cosine similarity. [1]
We use cosine similarity between these vectors to find out how similar they are. We can calculate this using cosine_similarity()
function from sklearn.metrics.pairwise library.