AAI_2025_Capstone_Chronicles_Combined
19
Table 2 Comparison of clustering model results
Cluster 1
Cluster 2
Cluster 3
Cluster 4
Accuracy Run Time
DEC
Step-By-Step Guided Prac tice
Addressing Errors and Building Un derstanding
Extended Practice Across Topics
Diverse Problem Solving Approaches
55% 2m13s
K-Means
Practice Prob lems
Geometry
Trigonometry Calculus
76% 3s
LLM-as-Classifier Arithmetic
Geometry
Algebra
Calculus
87% 15m32s
The “Process-Oriented Engaged Learners” cluster groups together 62 users who are focused on understanding mathematical processes and concepts. “Foundational Skills Learners” consisted of 45 users who were working on basic mathematical concepts with positive attitudes. The remaining conversations were captured in “Mixed Engagement with Learning Challenges.” These users displayed varied emotional states while actively participating in learning. Conclusions Ultimately, deep embedded clustering presented significant challenges for our use case. The DEC model was much more difficult to design and tune than the k-means and LLM-as-classifier models, required far more computational resources, and yielded less useful results. However, it did often create interesting and nuanced clusters, and was likely inhibited by the small dataset relative to the typically large data requirements of deep learning models. On large datasets, DEC could be a valuable option when deep contextual clustering is needed. Although the DEC model was not reliable enough for use in the AI tutoring application, the use of clustering in this context still showed great promise, particularly when combined with large language models. The LLM-as-classifier model was able to significantly outperform both the k-means and DEC models, and the generated clusters were able to be incorporated into the tutoring application in a way that created an intuitive data explorer for users. The use of unsupervised learning presents major advantages for application maintenance over supervised techniques, as it allows the system to automatically generate labels based on the course content without any human intervention or manual labeling. The incorporation of facet extraction also
243
Made with FlippingBook - Share PDF online