Learn Decision Tree algorithms
Read the basics of classifications
Read the basics of regressions
- Decision Trees in Machine Learning
- Classification And Regression Trees for Machine Learning
- Decision trees explained - video
- Understanding Gini index
- Decision tree classifier tutorial in scikit learn
- Decision tree regressor tutorial
- Scikitlearn documentation on trees
- Visualizing trees in scikit learn
- Ch. 8 "Decision Trees" in Introduction to Statistical Learning
- What problem can decision trees solve? Classification, regression, both?
- What are the strengths and weaknesses of Decision Trees?
- What is 'greedy algorithm'?
- How can we stop the tree from further dividing?
- Name some 'stopping criteria' to stop tree dividing further
- What is 'pruning'?
- What is a Gini index
★☆☆ - Easy
★★☆ - Medium
★★★ - Challenging
★★★★ - Bonus
Use Scikit's make_blobs or make_classification to generate some sample data.
Try to separate them using DT
- Here is Bank marketing dataset
- You may want to encode variables
- Use DT to predict yes/no binary decision
- Visualize the tree
- Create a confusion matrix
- What is the accuracy of the model
- Run Cross Validation to gauge the accuracy of this model
Use Scikit's make_regression to generate some sample data.
Use DTRegressor to solve this
- Use Bike sharing data
- Use DTRegressor to predict bike demand
- Visualize the tree
- Use RMSE, R2 to evaluate the model
- Use Cross Validation to thoroughly test the model performance