AAI_2025_Capstone_Chronicles_Combined

Cinema Analytics and Prediction System

10

Amazon Prime Video often implement hybrid recommendation engines that combine

collaborative filtering based on user-item interaction history with content-based filtering that

uses movie metadata (Introduction to Recommender Systems, Deep Learning, n.d.).

Similar research efforts have also leveraged modern language models to improve

recommendation systems. Netflix has published studies highlighting the effectiveness of hybrid

recommender systems that use natural language embeddings alongside user behavior data

(Netflix, 2024). Additionally, platforms such as HuggingFace provide tutorials on fine-tuning

BERT for classification tasks using movie-related datasets like IMDb reviews, showcasing the

growing integration of transformer models in applied recommendation systems (Text

Classification, n.d).

Machine Learning Method Research

Various machine learning and natural language processing (NLP) techniques have been

developed to address tasks like genre classification and movie recommendation. A commonly

used baseline approach is Term Frequency – Inverse Document Frequency (TF-IDF), this is a

statistical method introduced to evaluate the importance of words within a document relative

to a corpus (Salton & Buckley, 1988). TF-IDF creates sparse vector representations based on

word frequency, allowing for similarity comparisons using metrics such as cosine similarity. In

the context of movie recommendation, TF-IDF can be used to compare plot descriptions and

retrieve similar content. However, it struggles to capture semantic relationships or contextual

meaning, which can lead to misinterpretations when different vocabularies are used to describe

similar themes. Prior studies have shown that TF-IDF fails to relate movies like The Dark Knight

and Joker or Interstellar and The Martian despite shared themes (Tamanna, 2023).

177

Internal

Made with FlippingBook - Share PDF online