annotated-diabetes_dataset.csv
annotated-Propointsproject.docx (1)
To complete this assignment, I sourced a dataset detailing patients and their health attributes that may or may not be correlated with having diabetes. This dataset proved more difficult for me to find a fitting minimum split than past ones have, but I was able to settle on using 50 because it maximized visibility as well as accuracy. I created the four scenarios with differing attributes to try and get an idea of both sides of the spectrum as far as likelihood of having diabetes. These scenarios were based on glucose levels, age, BMI, Diabetes pedigree function, and blood pressure. After analysis of the data, it became clear that certain attributes, like high blood pressure, BMI, and glucose levels, could help indicate a patient’s chance of having diabetes.