This is a simple stroke prediction model built on historical clinical data from patients in a hospital. The project covers the following steps:
- Import and preprocess the data (data loading, data cleaning, data transformation, feature engineering, missing data imputation)
- Build prediction models, including logistic regression, support vector machine, decision tree, random forest, and XGBoost
- Evaluate and select prediction models based on evaluation metrics, including accuracy, sensitivity (recall), F-score, and AUC
- Deploy the best prediction model using R
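The steps above can be sketched end to end in base R. This is a minimal illustration under stated assumptions, not the project's actual pipeline: the `patients` data frame is simulated, its column names (`age`, `hypertension`, `bmi`, `stroke`) and effect sizes are invented, median imputation stands in for whatever imputation method the project uses, and `evaluate()` is a hypothetical helper. Only logistic regression is shown, since the other listed models need additional packages.

```r
# Hypothetical toy data: column names and effect sizes are invented for illustration.
set.seed(42)
n <- 500
patients <- data.frame(
  age          = round(runif(n, 20, 90)),
  hypertension = rbinom(n, 1, 0.2),
  bmi          = rnorm(n, mean = 28, sd = 5)
)
# Simulate the outcome so that risk rises with age and hypertension.
risk <- plogis(-6 + 0.07 * patients$age + 0.8 * patients$hypertension)
patients$stroke <- rbinom(n, 1, risk)
# Remove some BMI values to mimic missing data.
patients$bmi[sample(n, 25)] <- NA

# Missing data imputation (assumed method): fill gaps with the observed median.
patients$bmi[is.na(patients$bmi)] <- median(patients$bmi, na.rm = TRUE)

# Hold out 30% of the rows for evaluation.
test_idx <- sample(n, size = round(0.3 * n))
train <- patients[-test_idx, ]
test  <- patients[test_idx, ]

# Logistic regression with base R's glm().
fit  <- glm(stroke ~ age + hypertension + bmi, data = train, family = binomial())
prob <- predict(fit, newdata = test, type = "response")

# Hypothetical helper: metrics from 0/1 labels and predicted probabilities.
evaluate <- function(actual, prob, threshold = 0.5) {
  pred <- as.integer(prob >= threshold)
  tp <- sum(pred == 1 & actual == 1)
  tn <- sum(pred == 0 & actual == 0)
  fp <- sum(pred == 1 & actual == 0)
  fn <- sum(pred == 0 & actual == 1)
  # Sensitivity and recall are the same quantity: TP / (TP + FN).
  sensitivity <- tp / (tp + fn)
  # F1 written directly in terms of counts: 2TP / (2TP + FP + FN).
  f1_denom <- 2 * tp + fp + fn
  f1 <- if (f1_denom > 0) 2 * tp / f1_denom else NA_real_
  # AUC via the rank-sum (Mann-Whitney) identity; ties get average ranks.
  r <- rank(prob)
  n_pos <- sum(actual == 1)
  n_neg <- sum(actual == 0)
  auc <- (sum(r[actual == 1]) - n_pos * (n_pos + 1) / 2) / (n_pos * n_neg)
  c(accuracy = (tp + tn) / length(actual),
    sensitivity = sensitivity, f1 = f1, auc = auc)
}

metrics <- evaluate(test$stroke, prob)
print(round(metrics, 3))
```

Because stroke is a relatively rare outcome, accuracy alone can look high even when few true cases are caught, so sensitivity and AUC are usually the more informative numbers when comparing models. The 0.5 threshold is also only a default; lowering it trades more false alarms for higher sensitivity.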
In this project, you will step into the shoes of an entry-level health data analyst at a leading health organization, helping to build and deploy a stroke prediction model to enhance clinical decision-making.
A leading healthcare organization has noticed that a growing number of its patients are being diagnosed with strokes. To address this problem, the organization has decided to launch a project aimed at predicting the likelihood of a patient having a stroke based on a variety of health factors. The hospital has access to a large amount of patient data, including medical history and demographic information, which can be used to build the predictive model.
Once the predictive model is validated and tested, the healthcare organization plans to integrate it into its clinical decision-making process. The model will be used to identify patients who are at high risk of having a stroke so that they can receive early intervention and preventive care. Additionally, the model will be used to track the progress of high-risk patients and monitor the impact of preventive measures on reducing the incidence of stroke.
The success of this project will not only help the healthcare organization reduce the number of strokes in its patient population, but it will also position the organization as a leader in the use of advanced analytics and machine learning to improve patient outcomes. The predictive model will be a valuable tool for healthcare providers and patients alike, providing insight into a patient's risk of having a stroke and the steps that can be taken to prevent it.
- Explore the dataset to identify the most important patient and/or clinical characteristics.
- Build a well-validated stroke prediction model for clinical use.
- Deploy the model to enhance the organization's clinical decision-making.
In this project, we'll use data containing 11 clinical features for predicting stroke events.