AAI_2025_Capstone_Chronicles_Combined

4

The dataset includes detailed imaging metadata - such as PixelSpacing, slice thickness, ImagePositionPatient/Orientation vectors, and scanner-specific acquisition parameters - which enables accurate mapping of voxel indices into physical space. This geometric information is essential for reconstructing 3D masks from the RTSTRUCT contours using a consistent reference frame, and it allows tumor volumes to be computed in true patient-space units rather than relying on approximations or assumptions about slice spacing. The GTVs were delineated by clinical experts as part of radiotherapy treatment planning, providing a clinically meaningful and widely adopted reference standard for evaluating segmentation algorithms (Van De Steene et al., 2002; Das et al., 2021). Variables and Data Types The primary imaging data consist of 3D CT volumes, stored as voxel intensities in Hounsfield Units. These serve as the input to the segmentation models. The RTSTRUCT files

contain lists of closed polygon contours for each axial slice where the tumor is present, represented in millimeter-accurate patient coordinates. From these contours, a voxelized 3D binary mask is reconstructed, aligned precisely to the CT grid; this mask constitutes the supervised training target and the baseline for evaluating segmentation accuracy and derived tumor volumes. Spatial metadata such as PixelSpacing (Δx, Δy) and slice spacing Δz - derived from ImagePositionPatient and ImageOrientationPatient - provide the necessary factors for converting voxel counts into physical volumes. Acquisition-related categorical variables (e.g., reconstruction kernel, scanner vendor) are also available and will support stratified analyses examining how protocol differences influence segmentation and volumetry performance (Choe et al., 2019; Wang et al., 2024). A detailed summary of all variables and their roles in the project is provided in Table 2.1.

Table 2.1. Variables and Data Types Used in This Project Data Component Type Description

Use in Project

CT voxel intensities

3D float tensor (HU)

Raw CT imaging volume

Model input for segmentation and volume inference

RTSTRUCT GTV contours

Polygon coordinate lists (mm)

Clinically delineated tumor boundaries

Ground-truth reference for segmentation

Reconstructed GTV mask PixelSpacing (Δx, Δy)

3D binary tensor (z, y, x)

Voxelized tumor region aligned to CT

Supervised training target; evaluation baseline Needed for physical volume computation

Float pair

In-plane resolution (mm)

401

Made with FlippingBook - Share PDF online