AAI_2025_Capstone_Chronicles_Combined

2

Introduction

Our capstone project aims to develop a machine learning model to predict in-hospital mortality

for ICU patients using the PhysioNet 2012 Challenge dataset (msafi04, 2021). This real-world dataset

contains multivariate time-series data from over 12,000 ICU stays, including vital signs, lab results, and

demographic information recorded during the first 48 hours of admission.

We seek to answer: Can temporal patterns in physiological data accurately predict ICU mortality?

This question is both clinically important and technically complex. Early mortality prediction can guide

timely interventions, improve patient outcomes, and optimize resource allocation. It also helps hospitals

manage ICU staffing, bed capacity, and prioritization during high-demand periods like public health

emergencies (Yeh et al., 2024).

The end users of our AI model include ICU physicians, hospital administrators, and healthcare

data scientists. These stakeholders rely on accurate, real-time information to make critical decisions in

fast-paced clinical environments.

The dataset supports two modeling strategies: aggregated features for simpler models (e.g.,

logistic regression, XGBoost), and full temporal dynamics to capture key patterns like declining MAP or

heart rate spikes. While aggregation improves interpretability via SHAP analysis (NHANES I Survival

Model — SHAP Latest Documentation, 2018), it may obscure essential prognostic signals. To evaluate

both approaches, we will implement models ranging from logistic regression to XGBoost, as well as deep

learning architectures including multilayer perceptrons (MLPs), long short-term memory networks

(LSTMs), one-dimensional convolutional neural networks (1D CNNs), and Transformer-based models —

well-suited for capturing temporal dependencies in clinical time series (Harutyunyan et al., 2019; Deng et

al., 2025).

148

Made with FlippingBook - Share PDF online