ADS Capstone Chronicles Revised


Table 1 Patient Data Structure

4.2.1.1 Univariate Analysis

To understand the general distribution of the different data types, univariate analyses were performed using descriptive statistics for numerical and ordinal variables and category counts for binary features. Figure 9 below presents histograms of all numeric and ordinal features: body mass index (BMI), Age, Education, Income, and Glucose Values. This visualization reveals that the numeric and ordinal variables are not equivalently distributed. Such imbalances will be addressed during data preprocessing, as they could affect model training and the accuracy of predictions.
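
As an illustration of the univariate step described here, the following minimal sketch computes descriptive statistics for numeric and ordinal columns and category counts for a binary column. It is not the authors' code: the toy records, the `HighBP` column name, and the helper names `describe` and `summarize` are hypothetical and chosen only for demonstration.

```python
from collections import Counter
from statistics import mean, median, stdev

# Hypothetical toy records; column names and values are illustrative only
# and are not taken from the study's dataset.
records = [
    {"BMI": 22.0, "Age": 3, "HighBP": 0},
    {"BMI": 31.0, "Age": 9, "HighBP": 1},
    {"BMI": 27.0, "Age": 7, "HighBP": 1},
    {"BMI": 40.0, "Age": 11, "HighBP": 1},
]


def describe(values):
    """Return basic descriptive statistics for one numeric or ordinal column."""
    return {
        "count": len(values),
        "mean": mean(values),
        "median": median(values),
        "std": stdev(values),  # sample standard deviation
        "min": min(values),
        "max": max(values),
    }


def summarize(rows, numeric_cols, binary_cols):
    """Describe numeric/ordinal columns and count categories of binary columns."""
    summary = {col: describe([row[col] for row in rows]) for col in numeric_cols}
    counts = {col: Counter(row[col] for row in rows) for col in binary_cols}
    return summary, counts


summary, counts = summarize(records, ["BMI", "Age"], ["HighBP"])
print(summary["BMI"])
print(dict(counts["HighBP"]))
```

In a data frame workflow, pandas `DataFrame.describe()` and `Series.value_counts()` would give comparable summaries. A strongly uneven count in a binary column is the kind of imbalance that would then be handled during preprocessing.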
