ADS Capstone Chronicles Revised
Figure 4.10 Accidents by Hour of Day and Weather Condition
Figure 4.11 presented a correlation heatmap for all of the numerical variables in the dataset. The heatmap visually represented the relationships between the variables, with darker colors indicating stronger correlations. Several pairs of variables showed high correlation with one another, suggesting potential redundancy in the dataset. Upon further inspection, it was noted that these highly correlated variables were duplicates or highly similar measures of the same underlying concept. These duplicated variables were addressed during the feature selection and engineering phase of the project. By removing or consolidating these redundant features, the efficiency of the model was improved, and the risk of multicollinearity, which could negatively affect the performance and interpretability of the model, was reduced.
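The redundancy check described above can be illustrated with a minimal sketch. This is not the project's actual procedure, which the text does not specify. The column names, the toy values, and the 0.95 threshold are all hypothetical, and the greedy keep-first rule is only one possible way to consolidate highly correlated features.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences.

    Assumes neither sequence is constant (a zero variance would divide by zero).
    """
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def drop_redundant(columns, threshold=0.95):
    """Greedily keep a column only if its absolute correlation with every
    already-kept column is below the threshold (threshold is an assumption)."""
    kept = []
    for name in columns:
        if all(abs(pearson(columns[name], columns[k])) < threshold for k in kept):
            kept.append(name)
    return kept


# Hypothetical toy data: two near-duplicate temperature measures and one
# unrelated variable. These are illustrative values, not the project's data.
data = {
    "temperature_f": [30.0, 45.0, 60.0, 75.0, 90.0],
    "wind_chill_f": [25.0, 41.0, 58.0, 74.0, 90.0],
    "humidity_pct": [80.0, 35.0, 70.0, 40.0, 65.0],
}

selected = drop_redundant(data)
print(selected)
```

In this toy example the second temperature-like column is dropped because it tracks the first almost perfectly, while the unrelated column is retained. Which member of a correlated pair to keep is a modeling decision that depends on the dataset.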