Skip to content

KusRb/Wine-Quality-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 

Repository files navigation

Wine Quality Data Analysis

This project provides complete exploratory data analysis of red wine data. The main purpose of the project is to uncover insights about the factor influencing the quality of red wine.

Content

Description of Dataset

The datasets are related to red variants of the Portuguese "Vinho Verde" wine. The classes are ordered and not balanced (e.g. there are many more normal wines than excellent or poor ones). Due to privacy and logistic issues, only physicochemical (inputs) and sensory (the output) variables are available (e.g. there is no data about grape types, wine brand, wine selling price, etc.).

Source of Dataset

UCI Machine Learning Repository

EDA pipeline

  1. Importing necessary librabries
  2. Loading datasets
  3. Data Analysis
  4. Data cleaning
  5. Data visualization

Data visualization chart

  • Histogram
  • Correlation matrix
  • Count plot
  • Pair plot
  • Box plot

Data cleaning steps

  • Missing data
  • Duplicate checking
  • Removing duplicates

Installation requirement

Installing required packages

pip install numpy pandas matplotlib

Clone the project

git clone https://github.com/KusRb/Wine-Quality-Analysis.git
cd Wine-Quality-Analysis

Red wine data analysis

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published