The answers are written as self-explanatory Jupyter notebooks (*.ipynb).
To view and re-run the results, install Jupyter and start it from a terminal in the current directory:
jupyter notebook
Library dependencies:
- Python 3
- scikit-learn
- NumPy
- Pandas
- Seaborn
- Tweepy (Twitter Library)
- XGBoost
- Imbalanced-Learn
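
Before opening the notebooks, it may help to confirm that the libraries above are importable. The following is a minimal sketch, not part of the original answers: the helper name `find_missing` and the `REQUIRED_MODULES` mapping are illustrative, and the import names are the ones these packages normally expose (for example, scikit-learn is imported as `sklearn`).

```python
import importlib.util

# Display name -> import name. The import names are assumptions based on how
# these packages are normally imported; they differ from the names in the list
# above for scikit-learn (sklearn) and imbalanced-learn (imblearn).
REQUIRED_MODULES = {
    "scikit-learn": "sklearn",
    "NumPy": "numpy",
    "Pandas": "pandas",
    "Seaborn": "seaborn",
    "Tweepy": "tweepy",
    "XGBoost": "xgboost",
    "Imbalanced-Learn": "imblearn",
}


def find_missing(modules):
    """Return the display names whose top-level module cannot be located."""
    return [
        name
        for name, module in modules.items()
        if importlib.util.find_spec(module) is None
    ]


if __name__ == "__main__":
    missing = find_missing(REQUIRED_MODULES)
    if missing:
        print("Missing libraries:", ", ".join(missing))
    else:
        print("All listed libraries are importable.")
```

The check only looks up each module without importing it, so it is quick and has no side effects; it does not verify library versions.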
The questions can be found in the file "Associates_in_Data_Science-_Test.docx".
The source code can be found in the code folder.
The result for question 5 can be found in the result folder.
The answers are as follows:
- GENERAL QUESTIONS: 1_general_question.ipynb
- DATA EXPLORATION CHALLENGE: 2_data_exploration.ipynb
- DATA SCRAPING AND TEXT ANALYSIS: 3_data_scraping.ipynb
- TIME SERIES ANALYSIS & FORECASTING (Not finished): 4_time_series_analysis.ipynb
- CODING CHALLENGE – FEATURE ENGINEERING & MACHINE LEARNING: 5_credit_card_default.ipynb