AAI_2025_Capstone_Chronicles_Combined
and converted into a compatible format using a provided script. Images were saved as
uncompressed PNGs to ensure lossless quality.
Training was conducted using a batch size of 32 and a gamma value of 6.6. StyleGAN3
training proceeds in "ticks," with each tick representing the generation and processing of
1,000 images. Model snapshots, including generator, discriminator, and optimizer states,
were saved every 10 ticks. Internally, the generator uses a non-saturating logistic loss and
path-length regularization, while the discriminator uses a logistic loss with R1
regularization (Karras et al., 2020). Overall model evaluation includes Frechet Inception
Distance, Precision and Recall, and Perceptual Path Length, but for this study, Generator
Loss and Discriminator Loss were used as primary performance measures. Regularization
terms, such as path-length penalty (Loss/pl_penalty) and R1 penalty (Loss/r1_penalty),
were also referenced to assess model stability and constraint.
The training process was repeated separately for each image category, resulting in one
StyleGAN model for NORMAL images and another for PNEUMONIA images. After training,
1,000 synthetic images from each model were generated for use in the CNN classification
task.
Results
The following section outlines two aspects of the results: image generation using
GANs and CNN model performance. The first part describes how well the GAN models
converged to generate believable images. The second part evaluates the CNN
111
Made with FlippingBook - Share PDF online