This project focuses on performing a detailed regression analysis of the "meta" dataset. The primary aim is to apply regression techniques to understand the characteristics of literature datasets processed by a learning algorithm.
- Load the dataset using scikit-learn's
fetch_openml
. - Organize data into a pandas DataFrame and display the first few rows.
- Remove nominal features and handle missing values.
- Create two datasets: D1 (missing values removed) and D2 (interpolated values).
- Apply standardization.
- Split datasets into training and test sets.
- Apply regression models (Linear Regression, Logistic Regression, etc.).
- Report results using MAE, MAPE, and SMAPE.
- Include appropriate graphs for analysis.