AAI_2025_Capstone_Chronicles_Combined
Cinema Analytics and Prediction System
24
Figure 23: LSTM-based movie clusters table
The results indicate that while the LSTM + GloVe model captures some genre and
stylistic groupings, it struggles with deeper semantic coherence, especially for hybrid films.
Using five clusters ensured consistency with other models but may not fully reflect the
LSTM embedding structure. The project confirmed that transformer-based embeddings like
BERT outperform TF-IDF in capturing nuanced semantics. LSTM + GloVe adds supervised insight
but requires careful tuning. For deployment, BERT offers the best balance of robustness and
quality. Future work could fine-tune BERT on movie-specific data, combine embeddings with
user metadata, implement scalable embedding and fast similarity search (e.g., FAISS), and
integrate genre classification to enhance personalization. This work lays a strong foundation for
semantic-aware, personalized movie recommendation systems.
Genre classification
The genre classification task involved multi-label movie genre prediction from text.
Models tested included logistic regression with BERT embeddings, an LSTM with GloVe
191
Internal
Made with FlippingBook - Share PDF online