Multimodal Sentiment Analysis in Quick Commerce: LSTM Networks for Text, Image, Video Feedback in FMCG Platforms-Scilight

Digital Technologies Research and Applications

Article

Multimodal Sentiment Analysis in Quick Commerce: LSTM Networks for Text, Image, Video Feedback in FMCG Platforms

Downloads

Bansal, N., & Singla, B. (2025). Multimodal Sentiment Analysis in Quick Commerce: LSTM Networks for Text, Image, Video Feedback in FMCG Platforms. Digital Technologies Research and Applications, 4(1), 85–105. https://doi.org/10.54963/dtra.v4i1.979

Authors

  • Neha Bansal

    School of Computer Science and Engineering, Geeta University, Panipat 132103, India
  • Bhawna Singla

    School of Computer Science and Engineering, Geeta University, Panipat 132103, India

This research explores the application of Long Short-Term Memory (LSTM) networks for performing sentiment analysis on customer reviews gathered from prominent FMCG e-commerce platforms such as Blinkit, Zepto, and JioMart. In the fiercely competitive landscape of online retail, accurately interpreting customer sentiment is essential for sustaining customer satisfaction and achieving strategic growth. These platforms accumulate massive amounts of unstructured data—ranging from feedback on product quality to delivery efficiency and overall user experience—which are challenging to process using traditional manual methods. To address this, the study leverages advanced Natural Language Processing (NLP) techniques, with a particular focus on LSTM networks due to their superior ability to model sequential dependencies and retain contextual meaning across review texts. To further enhance performance, pretrained word embeddings are used, enabling the model to understand nuanced language and improve accuracy across varying review structures. Beyond analyzing textual data, the research also integrates visual components into a multimodal sentiment classification framework, offering a holistic understanding of consumer emotions. This dual-modality approach captures subtle sentiments that may not be evident in text alone. The findings yield practical insights for enhancing customer service, optimizing product selections, and improving overall brand engagement. Ultimately, this study empowers data-driven strategies that elevate user experience and market responsiveness in the dynamic FMCG e-commerce industry.

Keywords:

FMCG Platforms Natural Language Processing (NLP) Deep Learning Aspect-Based Sentiment Analysis (ABSA) Sentiment Classification Cross-Platform Comparison Customer Experience Analytics

References

  1. Yu, Y.; Si, X.; Hu, C.; et al. A review of recurrent neural networks: LSTM cells and network architectures. Neural Comput. 2019, 31, 1235–1270. DOI: https://doi.org/10.1162/neco_a_01199
  2. Sherstinsky, A. Fundamentals of recurrent neural network (RNN) and long short-term memory (LSTM) network. Physica D 2020, 404, 132306. DOI: https://doi.org/10.1016/j.physd.2019.132306
  3. Bansal, N.; Singla, B. Predictive modeling for event-free survival in allogeneic hematopoietic stem cell transplantation patients using LSTM networks. Foundry 2025, 28, 55–61.
  4. Chen, K.; Wang, R.; Utiyama, M.; et al. Content Word Aware Neural Machine Translation. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Stroudsburg, PA, United States, 5–10 July 2020. pp. 358–364. DOI: https://doi.org/10.18653/v1/2020.acl-main.34
  5. Tang, D.; Qin, B.; Liu, T. Document modeling with gated recurrent neural network for sentiment classification. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, Lisbon, Portugal, 17–21 September 2015. pp. 1422–1432. DOI: https://doi.org/10.18653/v1/D15-1167
  6. Wankhade, M.; Rao, A.C.S.; Kulkarni, C.; et al. A survey on sentiment analysis methods, applications, and challenges. Artif. Intell. Rev. 2022, 55, 5731–5780. DOI: https://doi.org/10.1007/s10462-022-10144-1
  7. Hussein, D.M.E.D.M. A survey on sentiment analysis challenges. J. King Saud Univ.-Eng. Sci. 2018, 30, 330–338. DOI: https://doi.org/10.1016/j.jksues.2016.04.002
  8. Zhu, L., Zhu, Z., Zhang, C., Xu, Y., & Kong, X. (2023). Multimodal sentiment analysis based on fusion methods: A survey. Information Fusion, 95, 306-325.DOI:https://doi.org/10.1016/j.inffus.2023.02.028
  9. Ma, Y.; Peng, H.; Khan, T.; et al. Sentic LSTM: a hybrid network for targeted aspect-based sentiment analysis. Cogn. Comput. 2018, 10, 639–650. DOI: https://doi.org/10.1007/s12559-018-9549-x
  10. Mikolov, T.; Chen, K.; Corrado, G.; et al. Efficient estimation of word representations in vector space. arXiv 2013, revised. arXiv:1301.3781 [cs.CL]. https://doi.org/10.48550/arXiv.1301.3781
  11. Mikolov, T.; Sutskever, I.; Chen, K.; et al. Distributed representations of words and phrases and their compositionality. Adv. Neural Inf. Process. Syst. 2013, 26, 3111–3119.
  12. Pennington, J.; Socher, R.; Manning, C. GloVe: global vectors for word representation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Doha, Qatar, 25–29 October 2014. pp. 1532–1543. DOI: https://doi.org/10.3115/v1/D14-1162
  13. Anoop, V. S., & Asharaf, S. (2019). Aspect-oriented sentiment analysis: A topic modeling-powered approach. Journal of Intelligent Systems, 29(1), 1166-1178.DOI:https://doi.org/10.1515/jisys-2018-0299
  14. Wu, C., Ma, B., Zhang, Z., Deng, N., He, Y., & Xue, Y. (2024). Evaluating Zero-Shot Multilingual Aspect-Based Sentiment Analysis with Large Language Models. arXiv preprint arXiv:2412.12564.
  15. Shen, L.; Feng, Y.; Zhan, H. Modeling Semantic Relationship in Multi-turn Conversations with Hierarchical Latent Variables. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Florence, Italy, July 28th–August 2nd, 2019. pp. 5497–5502. DOI: https://doi.org/10.18653/v1/P19-1549
  16. Singh, A.; Thakur, N.; Sharma, A. A review of supervised machine learning algorithms. In 2016 3rd International Conference on Computing for Sustainable Global Development (INDIACom), New Delhi, India, 16–18 March 2016. pp. 1310–1315.
  17. Osisanwo, F.Y.; Akinsola, J.E.T.; Awodele, O.; et al. Supervised machine learning algorithms: classification and comparison. Int. J. Comput. Trends Technol. 2017, 48, 128–138.
  18. Poria, S.; Cambria, E.; Hazarika, D.; et al. A Deeper Look into Sarcastic Tweets Using Deep Convolutional Neural Networks. In Proceedings of the COLING 2016, the 26th International Conference on Computational Linguistics, Osaka, Japan, 11–16 December 2016. pp. 1601–1612.
  19. Zadeh, A.; Zellers, R.; Pincus, E.; et al. Multimodal sentiment intensity analysis in videos: Facial gestures and verbal messages. IEEE Intell. Syst. 2016, 31, 82–88. DOI: https://doi.org/10.1109/MIS.2016.94
  20. Singla, B. Fine-tuning FaceNet for celebrity face Recognition: Enhancing Real vs. Fake Image Detection on the '105_Classes_Pins' Dataset. Degres 2024, 9, 117–129.
  21. Singla, B.; Verma, A.K. Application of Machine Learning for the Tracing of Jellyfish Attack in MANETs. In Science and Technology: Developments and Applications Vol. 2; Jakóbczak, D.J. Ed.; BP International: London, United Kingdom, 2025. pp. 50–65. DOI: https://doi.org/10.9734/bpi/stda/v2/3679
  22. Zhou, C.; Sun, C.; Liu, Z.; et al. A C-LSTM neural network for text classification. arXiv 2015, revised. arXiv preprint arXiv:1511.08630. DOI: https://doi.org/10.48550/arXiv.1511.08630
  23. Huang, H., Jin, Y., Rao, R. Sentiment-aware transformer using joint training. In 2020 IEEE 32nd International Conference on Tools with Artificial Intelligence (ICTAI), Baltimore, MD, USA, 09–11 November 2020. DOI: https://doi.org/10.1109/ICTAI50040.2020.00175
  24. Liu, Y.; Ott, M.; Goyal, N.; et al. RoBERTa: A robustly optimized BERT pretraining approach. arXiv 2019, Submitted. arXiv preprint arXiv:1907.11692. DOI: https://doi.org/10.48550/arXiv.1907.11692
  25. Han, Y., Moghaddam, M. Design knowledge as attention emphasizer in large language model-based sentiment analysis. J. Comput. Inf. Sci. Eng. 2025, 25(2),021007. DOI: https://doi.org/10.1115/1.4067212
  26. Wang, Y.; Huang, M.; Zhu, X.; et al. Attention-based LSTM for aspect-level sentiment classification. In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing; 1–5 November 2016, Austin, Texas, USA. pp. 606–615. DOI: https://doi.org/10.18653/v1/D16-1058