M.S. AAI Capstone Chronicles 2024

A.S.LINGUIST

16

chatbot answers and make the entire project impossible to be carried out. Thus, considering our

project goal, the CNN model performance we obtained is absolutely desirable and does not

suggest the necessity to implement further model modifications.

Fine-Tuned Flan-T5-Base Chatbot Model

The learning task was properly learned by the chatbot model, as resulting from the

behavior of the training and validation loss, which both converge to zero after about 90 steps

(Figure 7).

Figure 7

Representation of training (blue) and validation (red) loss for the Flan-T5-Base chatbot model

The performance of the chatbot model was qualitatively evaluated through an initial list

of 20 questions on different topics, which should be further expanded in the future for a more

rigorous evaluation. The answers provided by the model to each questionnaire are reported in

Table 1. As one can see from the latter, model answers are relevant and consistent with

corresponding questions, although they are not always right and could be more precise. For

196

Made with FlippingBook - professional solution for displaying marketing and sales documents online