M.S. AAI Capstone Chronicles 2024

References

Beddiar, R., & Oussalah, M. (2023). Chapter 12 - Explainability in medical image captioning. In J. Benois-Pineau, R. Bourqui, D. Petkovic, & G. Quénot (Eds.), Explainable Deep Learning AI (pp. 239–261). Academic Press. https://doi.org/10.1016/B978-0-32-396098-4.00018-1 Young, P., Lai, A., Hodosh, M., & Hockenmaier, J. (2014). From image descriptions to visual denotations: New similarity metrics for semantic inference over event descriptions. Transactions of the Association for Computational Linguistics , 2 , 67–78. https://doi.org/10.1162/tacl_a_00166 nlphuji. (2023). Flickr30k [Data set]. https://huggingface.co/datasets/nlphuji/flickr30k Abadi, M., Agarwal, A., Barham, P., Brevdo, E., Chen, Z., Citro, C., Corrado, G. S., Davis, A., Dean, J., Devin, M., Ghemawat, S., Goodfellow, I., Harp, A., Irving, G., Isard, M., Jia, Y., Jozefowicz, R., Kaiser, L., Kudlur, M., … Zheng, X. (2015). TensorFlow: Large-scale machine learning on heterogeneous systems [Software]. Available from https://www.tensorflow.org/ Papers with Code - Image Captioning . (n.d.). Retrieved December 7, 2024, from https://paperswithcode.com/task/image-captioning Hossain, M. Z., Sohel, F., Shiratuddin, M. F., & Laga, H. (2019). A comprehensive survey of deep learning for image captioning. ACM Computing Surveys (CsUR) , 51 (6), 1-36. Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., Gelly, S., Uszkoreit, J., & Houlsby, N. (2021). An Image

is Worth 16x16 Words: Transformers for Image Recognition at Scale (No. arXiv:2010.11929). arXiv. https://doi.org/10.48550/arXiv.2010.11929

231

Made with FlippingBook - professional solution for displaying marketing and sales documents online