M.S. Applied Data Science - Capstone Chronicles 2025


Figure 2 Distribution of Product Types

Note. This figure shows the distribution of product types.

Further analysis revealed that the Center variable has a perfect one-to-one correspondence with Product Type: each product type maps to exactly one FDA center. Because of this redundancy, the Center variable was excluded to enhance interpretability. The analysis of the Distribution Pattern variable also revealed notable trends, with certain distribution types appearing more frequently in high-risk recalls.

The pairing of product types and FDA centers is presented in Table 1. By center, the Center for Devices and Radiological Health (devices) accounted for the largest share of recalls at 37.3%, followed by the Center for Food Safety and Applied Nutrition (food/cosmetics, 28.9%), the Center for Drug Evaluation and Research (drugs, 17.6%), the Center for Biologics Evaluation and Research (biologics, 12.7%), the Center for Veterinary Medicine (veterinary, 3.6%), and the Center for Tobacco Products (tobacco, <1%). This breakdown matches the distribution of product types exactly, further confirming the perfect correspondence between product type and center.

Additionally, an analysis of products per recall event revealed considerable variation, with the number of products per event ranging from 1 to 470, a mean of 2.77, and a median of 1.0. As each product type is uniquely associated with an FDA center, only the Product Type variable will be retained for modeling to eliminate redundancy and preserve interpretability.

Table 1 Product Type and Its Center Association

Product type      Center
Devices           Center for Devices and Radiological Health (CDRH)
Food/cosmetics    Center for Food Safety and Applied Nutrition (CFSAN)
Drugs             Center for Drug Evaluation and Research (CDER)
Biologics         Center for Biologics Evaluation and Research (CBER)
Veterinary        Center for Veterinary Medicine (CVM)
Tobacco           Center for Tobacco Products (CTP)
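
The one-to-one correspondence between product type and center can be verified mechanically. The following is a minimal sketch in standard-library Python, not the procedure used in this study; the function name is_one_to_one and the sample rows are hypothetical and serve only to illustrate the check.

```python
from collections import defaultdict


def is_one_to_one(pairs):
    """Return True if each left value maps to exactly one right value and vice versa."""
    left_to_right = defaultdict(set)
    right_to_left = defaultdict(set)
    for left, right in pairs:
        left_to_right[left].add(right)
        right_to_left[right].add(left)
    # A one-to-one mapping has exactly one partner in both directions.
    return (all(len(v) == 1 for v in left_to_right.values())
            and all(len(v) == 1 for v in right_to_left.values()))


# Hypothetical (product type, center) rows; these are not the study's records.
rows = [
    ("Devices", "CDRH"),
    ("Food/cosmetics", "CFSAN"),
    ("Devices", "CDRH"),
    ("Drugs", "CDER"),
]

one_to_one = is_one_to_one(rows)
# Adding a second center for "Drugs" breaks the correspondence.
broken = is_one_to_one(rows + [("Drugs", "CBER")])
```

When the check passes, one of the two variables carries no additional information, which is the rationale given here for retaining only Product Type.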

The dataset also included several ID variables, such as Event ID, Product ID, FEI Number, and Recall Details, which were excluded from the analysis because their values are unique to individual records and therefore carry no analytical value (Kuhn & Johnson, 2013).
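
As an illustration of how identifier-like variables might be flagged before modeling, the sketch below marks columns whose share of distinct values is at or near 1. It is an assumption-laden example, not the screening used in this study: the helper names, the min_unique_ratio threshold of 0.95, and the sample records are all hypothetical.

```python
def identifier_like_columns(records, min_unique_ratio=0.95):
    """Return column names whose distinct-value ratio meets the (hypothetical) threshold."""
    if not records:
        return []
    n = len(records)
    flagged = []
    for column in records[0]:
        distinct = len({row[column] for row in records})
        if distinct / n >= min_unique_ratio:
            flagged.append(column)
    return flagged


def drop_columns(records, to_drop):
    """Return copies of the records without the listed columns."""
    drop = set(to_drop)
    return [{k: v for k, v in row.items() if k not in drop} for row in records]


# Hypothetical records; column names and values are illustrative only.
records = [
    {"event_id": 101, "product_id": 9001, "product_type": "Devices"},
    {"event_id": 102, "product_id": 9002, "product_type": "Drugs"},
    {"event_id": 103, "product_id": 9003, "product_type": "Devices"},
]

id_cols = identifier_like_columns(records)
cleaned = drop_columns(records, id_cols)
```

A uniqueness ratio is only a heuristic. An identifier shared by several rows, such as an event ID covering multiple products, may fall below the threshold and still need to be excluded by inspection.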

