AAI_2025_Capstone_Chronicles_Combined

The end users of this AI-powered diagnostic tool are medical professionals, particularly

radiologists and physicians, who can utilize the model to assist in pneumonia diagnostics. By

integrating synthetic image generation via GANs, the project aims to create a more accurate and

robust pneumonia detection system, addressing the limitations posed by data scarcity in

medical AI applications. This approach demonstrates that augmenting real medical datasets

with GAN-generated images can serve as a useful tool for developing high-accuracy diagnostic

models, contributing to improved patient outcomes and optimized healthcare workflows.

Data Summary

The dataset contains 5,856 grayscale X-ray images divided into training (5,216 images),

testing (624 images), and validation (16 images) subsets. The data is pre-sorted into

“PNEUMONIA” and “NORMAL” folders, which serve as class labels. The dataset is designed to

support the development of a CNN classification model to differentiate normal chest X-rays

from those showing pneumonia. It also serves as the basis for generating synthetic images using

a GAN to supplement the CNN model and improve performance metrics such as accuracy and

F1 score.

Variables and Data Characteristics

The dataset consists of X-ray images varying in size and resolution, depicting the chest

cavities of patients (Figure 1). Differences in patient posture and positioning are present,

including arms positioned above or below the head, slight arm rotations, and tilted or off-center

96

Made with FlippingBook - Share PDF online