ADS Capstone Chronicles Revised

‭18‬

‭Figure 4.11‬ ‭Correlation Heatmap‬

‭4.3‬ ‭ Data Quality‬ ‭Ensuring‬‭data‬‭quality‬‭was‬‭a‬‭critical‬‭aspect‬‭of‬‭this‬ ‭study‬ ‭to‬ ‭guarantee‬ ‭the‬ ‭validity‬ ‭and‬ ‭reliability‬ ‭of‬ ‭the‬ ‭results.‬ ‭Several‬ ‭dimensions‬ ‭of‬ ‭data‬ ‭quality,‬ ‭including‬ ‭completeness,‬ ‭consistency,‬ ‭accuracy,‬ ‭timeliness‬‭and‬‭relevance‬‭were‬‭evaluated.‬‭Despite‬ ‭the‬ ‭dataset’s‬ ‭comprehensive‬ ‭coverage‬ ‭of‬ ‭traffic‬ ‭accidents,‬‭weather‬‭conditions‬‭and‬‭traffic‬‭patterns,‬ ‭gaps were identified.‬

‭4.3.1 Temporal Coverage and Missing Data‬ ‭It‬ ‭was‬ ‭identified‬ ‭that‬ ‭the‬ ‭temporal‬ ‭coverage‬ ‭for‬ ‭reporting‬‭periods‬‭was‬‭inconsistent,‬‭with‬‭2‬‭months‬ ‭of‬ ‭2016‬ ‭and‬‭9‬‭months‬‭of‬‭2023‬‭missing.‬‭This‬‭has‬ ‭the‬ ‭potential‬ ‭to‬ ‭introduce‬ ‭bias‬ ‭in‬ ‭the‬ ‭trend‬ ‭analysis.‬ ‭Additionally,‬ ‭a‬ ‭few‬ ‭variables‬ ‭were‬ ‭found‬ ‭to‬ ‭have‬ ‭a‬ ‭significant‬ ‭amount‬ ‭of‬ ‭missing‬ ‭data.‬ ‭Therefore,‬ ‭these‬ ‭columns‬ ‭were‬ ‭dropped‬ ‭from‬ ‭the‬ ‭analysis.‬ ‭It‬ ‭should‬ ‭be‬ ‭noted‬‭that‬‭sparse‬ ‭records‬ ‭in‬ ‭specific‬ ‭weather‬ ‭conditions,‬ ‭such‬ ‭as‬ ‭snowfall,‬ ‭could‬ ‭limit‬‭the‬‭study’s‬‭ability‬‭to‬‭assess‬ ‭rare events.‬

258

Made with FlippingBook - Online Brochure Maker