AAI_2025_Capstone_Chronicles_Combined
MENTAL HEALTH RISK DETECTION USING ML
12
minimizing overfitting. A Tabular Neural Network is constructed with dense layers, batch normalization, and gated linear units (GLUs) to model non-linear interactions. These models are planned to be combined using a Soft voting Ensemble to enhance prediction stability across risk categories. The rationale for each model is summarized in Table 2 .
Table 3 Suitability of Selected Machine Learning Models
Method
Strengths
Why is it suitable for this project
Logistic Regression Interpretable, efficient, clinically trusted
Ideal for establishing a transparent and explainable baseline
Tabular Neural Network
Captures complex non-linear interactions in tabular data Balances interpretability and predictive power
Learns patterns in behavioral and environmental indicators
Soft Voting Ensemble
Improves stability and accuracy across risk categories
For Logistic Regression, we enhanced the model by adding interaction terms, new features based on pairs of existing variables, to help capture more complex relationships between factors. For example, the combined effect of stress and work hours may reveal patterns not apparent when considered independently. To avoid overfitting while still using all features, we applied elastic net regularization, which combines L1 (lasso) and L2 (ridge) penalties in equal proportion. This allows the model to remain accurate and generalizable without becoming too complex. We used the SAGA optimizer, which is specifically designed to work efficiently with elastic net and large datasets. Since our dataset was imbalanced across mental health risk categories, we used class weighting to ensure the model didn’t favor the majority class during
212
Made with FlippingBook - Share PDF online