M.S. Applied Data Science - Capstone Chronicles 2025

18

Figure 10 Feature Correlation with Target Variable

classification logistic regression, decision trees, random forest, XGBoost, and a MLP—are employed to determine the most effective predictive approach. Model performance is evaluated using key metrics such as F1-score, recall, and precision to ensure a balanced assessment of classification effectiveness. An iterative experimentation and models—including

4.5 Modeling This section implements advanced machine learning techniques to classify product recalls using an imbalanced dataset. The target variable represents distinct recall categories, requiring strategies such as class weighting and sampling adjustments to address class imbalance. Multiple

22

Made with FlippingBook flipbook maker