AAI_2025_Capstone_Chronicles_Combined
Cinema Analytics and Prediction System
29
Revenue Prediction and Success Classification
For the regression task, the quality of predictions is represented in Figure 28, where
actual vs. predicted revenues are plotted alongside the
budget. The model captured overall revenue trends but
struggled with extreme cases, underestimating
blockbusters and overestimating some low-revenue
films due to their rarity in the training data. Still, it
Figure 28: Actual vs Predicted revenue
outperformed mean and median baselines across all
error metrics (Figure 29) like MAE, MedAE, and MAPE. Its relative errors (Figure 30) were much
lower, and it achieved an R-squared score near 0.7, showing strong performance compared to
the negative R-squared of baseline models .
Figure 30: Error metrics comparison
Figure 29: Error metrics comparison
The binary XGBoost model for success classification performed well, with balanced
precision, recall, and F1-scores around 0.75, showing good generalization. In contrast, the
multiclass XGBoost model showed mixed performance. The model's predictions varied
significantly across categories. It performed reasonably well on the Flop and Superhit classes,
achieving F1-scores of 0.78 and 0.65, respectively. However, it struggled to classify the Hit
category, with an F1-score of just 0.09, suggesting potential issues such as class imbalance or
ambiguous feature representations for that class.
196
Internal
Made with FlippingBook - Share PDF online