Decision Tree Class Project
For this project, I had to extract the data from a CSV file dataset and analyze it to predict the outcome. For my dataset, It focuses on predicting the risk of getting diabetes through certain variables/features. I choose my outcome column to be obesity, because one of the main causes of having diabetes usually comes when a person is obese or not. In this project, I had to test out different values to find the most optimal maximum depth and minimum split to make sure that the data is accurate while avoiding overfitting to reduce complexity when reading the data. I also had to determine which node had the highest and lowest probability.
Source/Link: https://www.kaggle.com/datasets/rcratos/diabetes-risk-prediction/code
# Submitted Through Canvas