Article
Multimodal Sentiment Analysis in Quick Commerce: LSTM Networks for Text, Image, Video Feedback in FMCG Platforms


This work is licensed under a Creative Commons Attribution 4.0 International License.
Copyright
The authors shall retain the copyright of their work but allow the Publisher to publish, copy, distribute, and convey the work.
License
Digital Technologies Research and Applications (DTRA) publishes accepted manuscripts under Creative Commons Attribution 4.0 International (CC BY 4.0). Authors who submit their papers for publication by DTRA agree to have the CC BY 4.0 license applied to their work, and that anyone is allowed to reuse the article or part of it free of charge for any purpose, including commercial use. As long as the author and original source are properly cited, anyone may copy, redistribute, reuse, and transform the content.
This research explores the application of Long Short-Term Memory (LSTM) networks for performing sentiment analysis on customer reviews gathered from prominent FMCG e-commerce platforms such as Blinkit, Zepto, and JioMart. In the fiercely competitive landscape of online retail, accurately interpreting customer sentiment is essential for sustaining customer satisfaction and achieving strategic growth. These platforms accumulate massive amounts of unstructured data—ranging from feedback on product quality to delivery efficiency and overall user experience—which are challenging to process using traditional manual methods. To address this, the study leverages advanced Natural Language Processing (NLP) techniques, with a particular focus on LSTM networks due to their superior ability to model sequential dependencies and retain contextual meaning across review texts. To further enhance performance, pretrained word embeddings are used, enabling the model to understand nuanced language and improve accuracy across varying review structures. Beyond analyzing textual data, the research also integrates visual components into a multimodal sentiment classification framework, offering a holistic understanding of consumer emotions. This dual-modality approach captures subtle sentiments that may not be evident in text alone. The findings yield practical insights for enhancing customer service, optimizing product selections, and improving overall brand engagement. Ultimately, this study empowers data-driven strategies that elevate user experience and market responsiveness in the dynamic FMCG e-commerce industry.
Keywords:
FMCG Platforms Natural Language Processing (NLP) Deep Learning Aspect-Based Sentiment Analysis (ABSA) Sentiment Classification Cross-Platform Comparison Customer Experience AnalyticsReferences
- Yu, Y.; Si, X.; Hu, C.; et al. A review of recurrent neural networks: LSTM cells and network architectures. Neural Comput. 2019, 31, 1235–1270. DOI: https://doi.org/10.1162/neco_a_01199
- Sherstinsky, A. Fundamentals of recurrent neural network (RNN) and long short-term memory (LSTM) network. Physica D 2020, 404, 132306. DOI: https://doi.org/10.1016/j.physd.2019.132306
- Bansal, N.; Singla, B. Predictive modeling for event-free survival in allogeneic hematopoietic stem cell transplantation patients using LSTM networks. Foundry 2025, 28, 55–61.
- Chen, K.; Wang, R.; Utiyama, M.; et al. Content Word Aware Neural Machine Translation. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Stroudsburg, PA, United States, 5–10 July 2020. pp. 358–364. DOI: https://doi.org/10.18653/v1/2020.acl-main.34
- Tang, D.; Qin, B.; Liu, T. Document modeling with gated recurrent neural network for sentiment classification. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, Lisbon, Portugal, 17–21 September 2015. pp. 1422–1432. DOI: https://doi.org/10.18653/v1/D15-1167
- Wankhade, M.; Rao, A.C.S.; Kulkarni, C.; et al. A survey on sentiment analysis methods, applications, and challenges. Artif. Intell. Rev. 2022, 55, 5731–5780. DOI: https://doi.org/10.1007/s10462-022-10144-1
- Hussein, D.M.E.D.M. A survey on sentiment analysis challenges. J. King Saud Univ.-Eng. Sci. 2018, 30, 330–338. DOI: https://doi.org/10.1016/j.jksues.2016.04.002
- Zhu, L., Zhu, Z., Zhang, C., Xu, Y., & Kong, X. (2023). Multimodal sentiment analysis based on fusion methods: A survey. Information Fusion, 95, 306-325.DOI:https://doi.org/10.1016/j.inffus.2023.02.028
- Ma, Y.; Peng, H.; Khan, T.; et al. Sentic LSTM: a hybrid network for targeted aspect-based sentiment analysis. Cogn. Comput. 2018, 10, 639–650. DOI: https://doi.org/10.1007/s12559-018-9549-x
- Mikolov, T.; Chen, K.; Corrado, G.; et al. Efficient estimation of word representations in vector space. arXiv 2013, revised. arXiv:1301.3781 [cs.CL]. https://doi.org/10.48550/arXiv.1301.3781
- Mikolov, T.; Sutskever, I.; Chen, K.; et al. Distributed representations of words and phrases and their compositionality. Adv. Neural Inf. Process. Syst. 2013, 26, 3111–3119.
- Pennington, J.; Socher, R.; Manning, C. GloVe: global vectors for word representation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Doha, Qatar, 25–29 October 2014. pp. 1532–1543. DOI: https://doi.org/10.3115/v1/D14-1162
- Anoop, V. S., & Asharaf, S. (2019). Aspect-oriented sentiment analysis: A topic modeling-powered approach. Journal of Intelligent Systems, 29(1), 1166-1178.DOI:https://doi.org/10.1515/jisys-2018-0299
- Wu, C., Ma, B., Zhang, Z., Deng, N., He, Y., & Xue, Y. (2024). Evaluating Zero-Shot Multilingual Aspect-Based Sentiment Analysis with Large Language Models. arXiv preprint arXiv:2412.12564.
- Shen, L.; Feng, Y.; Zhan, H. Modeling Semantic Relationship in Multi-turn Conversations with Hierarchical Latent Variables. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Florence, Italy, July 28th–August 2nd, 2019. pp. 5497–5502. DOI: https://doi.org/10.18653/v1/P19-1549
- Singh, A.; Thakur, N.; Sharma, A. A review of supervised machine learning algorithms. In 2016 3rd International Conference on Computing for Sustainable Global Development (INDIACom), New Delhi, India, 16–18 March 2016. pp. 1310–1315.
- Osisanwo, F.Y.; Akinsola, J.E.T.; Awodele, O.; et al. Supervised machine learning algorithms: classification and comparison. Int. J. Comput. Trends Technol. 2017, 48, 128–138.
- Poria, S.; Cambria, E.; Hazarika, D.; et al. A Deeper Look into Sarcastic Tweets Using Deep Convolutional Neural Networks. In Proceedings of the COLING 2016, the 26th International Conference on Computational Linguistics, Osaka, Japan, 11–16 December 2016. pp. 1601–1612.
- Zadeh, A.; Zellers, R.; Pincus, E.; et al. Multimodal sentiment intensity analysis in videos: Facial gestures and verbal messages. IEEE Intell. Syst. 2016, 31, 82–88. DOI: https://doi.org/10.1109/MIS.2016.94
- Singla, B. Fine-tuning FaceNet for celebrity face Recognition: Enhancing Real vs. Fake Image Detection on the '105_Classes_Pins' Dataset. Degres 2024, 9, 117–129.
- Singla, B.; Verma, A.K. Application of Machine Learning for the Tracing of Jellyfish Attack in MANETs. In Science and Technology: Developments and Applications Vol. 2; Jakóbczak, D.J. Ed.; BP International: London, United Kingdom, 2025. pp. 50–65. DOI: https://doi.org/10.9734/bpi/stda/v2/3679
- Zhou, C.; Sun, C.; Liu, Z.; et al. A C-LSTM neural network for text classification. arXiv 2015, revised. arXiv preprint arXiv:1511.08630. DOI: https://doi.org/10.48550/arXiv.1511.08630
- Huang, H., Jin, Y., Rao, R. Sentiment-aware transformer using joint training. In 2020 IEEE 32nd International Conference on Tools with Artificial Intelligence (ICTAI), Baltimore, MD, USA, 09–11 November 2020. DOI: https://doi.org/10.1109/ICTAI50040.2020.00175
- Liu, Y.; Ott, M.; Goyal, N.; et al. RoBERTa: A robustly optimized BERT pretraining approach. arXiv 2019, Submitted. arXiv preprint arXiv:1907.11692. DOI: https://doi.org/10.48550/arXiv.1907.11692
- Han, Y., Moghaddam, M. Design knowledge as attention emphasizer in large language model-based sentiment analysis. J. Comput. Inf. Sci. Eng. 2025, 25(2),021007. DOI: https://doi.org/10.1115/1.4067212
- Wang, Y.; Huang, M.; Zhu, X.; et al. Attention-based LSTM for aspect-level sentiment classification. In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing; 1–5 November 2016, Austin, Texas, USA. pp. 606–615. DOI: https://doi.org/10.18653/v1/D16-1058

Download
