AAI_2025_Capstone_Chronicles_Combined

Cinema Analytics and Prediction System

9

Figure 7: Cinema Analytics and Prediction System Architecture

Application Research and Machine Learning Method

Application Research

The project’s analytical strategy is divided into two main components: Natural Language

Processing (NLP) tasks and predictive modeling tasks. The NLP component involves text-based

analyses such as genre classification and content-based movie recommendation, focusing on

extracting and interpreting semantic features from movie descriptions. The predictive modeling

component focuses on revenue prediction and success classification. Revenue prediction is

framed as a regression problem aimed at estimating a continuous target variable, while success

classification is approached as a classification problem to determine whether a movie is likely to

be a hit or a flop. Together, these components provide a comprehensive and holistic framework

for analyzing the movie industry from both linguistic and numerical perspectives.

The genre classification task involves predicting one or more genres (multi-label)

associated with a movie using a combination of its textual descriptions and numeric metadata.

The recommendation task is designed to suggest similar or relevant movies to users based on

the semantic content of the movie.

There are several well-established methods and technologies used to address similar

problems in real-world and academic contexts. Commercial systems such as Netflix, IMDb, and

176

Internal

Made with FlippingBook - Share PDF online