Decision Tree Class Project

For this project, I had to extract data from a CSV dataset and analyze it to predict an outcome. My dataset focuses on predicting the risk of diabetes from a set of variables (features). I chose obesity as my outcome column because obesity is one of the main risk factors for diabetes. I tested different values to find the best maximum depth and minimum split, so that the model stays accurate while avoiding overfitting and the tree stays simple enough to read. I also had to determine which nodes had the highest and lowest probability.
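The tuning step described above can be sketched roughly as follows. This is only an illustration, not the code or tool used for the class project: it is a small hand-written CART-style tree using Gini impurity, run on synthetic data rather than the Kaggle dataset, and every name in it (`build`, `best_split`, `make_rows`, the two numeric features, the candidate parameter values) is an assumption made for the example.

```python
import random

def gini(rows):
    """Gini impurity of (features, label) rows with 0/1 labels."""
    p = sum(label for _, label in rows) / len(rows)
    return 2 * p * (1 - p)

def partition(rows, f, t):
    """Split rows on feature f at threshold t."""
    left = [r for r in rows if r[0][f] <= t]
    right = [r for r in rows if r[0][f] > t]
    return left, right

def best_split(rows):
    """Return the (feature, threshold) that lowers weighted Gini most, or None."""
    best, best_score = None, gini(rows)
    for f in range(len(rows[0][0])):
        for t in sorted({x[f] for x, _ in rows}):
            left, right = partition(rows, f, t)
            if not left or not right:
                continue
            score = (len(left) * gini(left) + len(right) * gini(right)) / len(rows)
            if score < best_score:
                best, best_score = (f, t), score
    return best

def build(rows, max_depth, min_split, depth=0):
    """Grow a tree; each leaf stores the share of positive rows it holds."""
    split = None
    if depth < max_depth and len(rows) >= min_split:
        split = best_split(rows)
    if split is None:
        return {"prob": sum(y for _, y in rows) / len(rows), "n": len(rows)}
    left, right = partition(rows, *split)
    return {"feature": split[0], "threshold": split[1],
            "left": build(left, max_depth, min_split, depth + 1),
            "right": build(right, max_depth, min_split, depth + 1)}

def predict_prob(node, x):
    """Follow the splits down to a leaf and return its probability."""
    while "prob" not in node:
        go_left = x[node["feature"]] <= node["threshold"]
        node = node["left"] if go_left else node["right"]
    return node["prob"]

def accuracy(tree, rows):
    return sum((predict_prob(tree, x) >= 0.5) == bool(y) for x, y in rows) / len(rows)

def leaves(node):
    if "prob" in node:
        return [node]
    return leaves(node["left"]) + leaves(node["right"])

def make_rows(n):
    """Synthetic stand-in data: the label depends noisily on feature 0 only."""
    rows = []
    for _ in range(n):
        x = [random.uniform(0, 10), random.uniform(0, 10)]
        rows.append((x, 1 if x[0] + random.gauss(0, 1.5) > 5 else 0))
    return rows

random.seed(0)
train, valid = make_rows(200), make_rows(100)

# Try each (maximum depth, minimum split) pair and score it on held-out rows.
results = {}
for max_depth in (1, 2, 3, 5, 8):
    for min_split in (2, 10, 30):
        tree = build(train, max_depth, min_split)
        results[(max_depth, min_split)] = accuracy(tree, valid)

best_params = max(results, key=results.get)
best_tree = build(train, *best_params)
leaf_list = leaves(best_tree)
highest = max(leaf_list, key=lambda leaf: leaf["prob"])
lowest = min(leaf_list, key=lambda leaf: leaf["prob"])

print("best (max_depth, min_split):", best_params)
print("validation accuracy:", results[best_params])
print("highest-probability leaf:", highest)
print("lowest-probability leaf:", lowest)
```

Scoring each setting on rows the tree did not see during training is what guards against overfitting here: a very deep tree with a tiny minimum split can fit the training rows almost perfectly yet do worse on the validation rows than a shallower one.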

 

Source/Link: https://www.kaggle.com/datasets/rcratos/diabetes-risk-prediction/code

# Submitted Through Canvas

