Original Article
Advances in Deep Learning for Head and Neck Cancer: Datasets and Applied Methods
Downloads
- Download
This work is licensed under a Creative Commons Attribution 4.0 International License.
Head and neck cancers (HNCs) include malignancies of the oral cavity, salivary glands, thyroid, oropharynx, and nasopharynx, with risk factors such as tobacco use, alcohol consumption, viral infections, and environmental exposures contributing to over half a million global cases annually. Despite treatment advances, poor prognosis underscores the need for accurate diagnosis and continuous monitoring. Medical imaging plays a critical role in HNC evaluation but is often limited by the complexity of anatomy and tumor biology. Recent advances in artificial intelligence (AI), particularly deep learning, offer opportunities to enhance diagnostic accuracy and optimize treatment strategies. This study reviews the application of deep learning in HNC imaging, evaluating different architectures and addressing challenges like limited annotated datasets, high computational demands, and ethical concerns. Overcoming these challenges will revolutionize HNC diagnostics, redefine precision oncology, and improve patient care. The future integration of explainable AI models and multimodal data will be crucial in advancing diagnostic precision, ensuring clinical applicability, and addressing ethical and resource challenges. As AI progresses, its effective integration into clinical workflows will not only enhance healthcare delivery but also reduce inequalities, accelerating significant advancements in HNC management and transforming patient outcomes.
Keywords:
Deep Learning Head and Neck Cancer Histopathology Images Attention Mechanisms Imaging Modal‑ ities Survival Prediction Healthcare Decision‑MakingReferences
- Barsouk, A.; Aluru, J.S.; Rawla, P.; et al. Epidemiology, Risk Factors, and Prevention of Head and Neck Squamous Cell Carcinoma. Med. Sci. 2023, 11, 42.
- Giaquinto, A.N.; Sung, H.; Miller, K.D.; et al. Breast Cancer Statistics, 2022. CA: Cancer J. Clin. 2022, 72, 524–541.
- Dai, X.; Park, J.H.; Yoo, S.; et al. Survival Analysis of Localized Prostate Cancer with Deep Learning. Sci. Rep. 2022, 12, 17821.
- Boucai, L.; Zafereo, M.; Cabanillas, M.E. Thyroid Cancer: A Review. JAMA 2024, 331, 425–435.
- Cheng, L.; Albers, P.; Berney, D.M.; et al. Testicular Cancer. Nat. Rev. Dis. Primers 2018, 4, 29.
- Song, X.; Zhao, Z.; Barber, B.; et al. Overall Survival in Patients with Metastatic Melanoma. Curr. Med. Res. Opin. 2015, 31, 987–991.
- Tolstikhin, I.O.; Houlsby, N.; Kolesnikov, A.; et al. MLP-Mixer: An All-MLP Architecture for Vision. Adv. Neural Inf. Process. Syst. 2021, 34, 24261–24272.
- Devlin, J.; Chang, M.-W.; Lee, K.; et al. Pre-training of Deep Bidirectional Transformers for Language Understanding. 2018, arXiv preprint.
- Boulesteix, A.-L.; Wright, M. Artificial Intelligence in Genomics. Hum. Genet. 2022, 141, 1449–1450.
- Kumar, Y. Artificial Intelligence & Robotics–Synthetic Brain in Action. 2018, SSRN 3325115 .
- Bhatia, S.; Sinha, Y.; Goel, L. Lung cancer detection: a deep learning approach. In InSoft Computing for Problem Solving: SocProS 2017. Springer: Singapore, 2019; Volume 2, pp. 699–705.
- Guo, R.; Lu, G.; Qin, B.; et al. Ultrasound Imaging Technologies for Breast Cancer Detection and Management: A Review. Ultrasound Med. Biol. 2018, 44, 37–70.
- Sailasya, G., Kumari, G.L. Analyzing the performance of stroke prediction using ML classification algorithms. Int. J. Adv. Comput. Sci. Appl. 2021, 12.
- Sajid, S.; Hussain, S.; Sarwar, A. Brain Tumor Detection and Segmentation in MR Images Using Deep Learning. Arabian J. Sci. Eng. 2019, 44, 9249–9261.
- Khalid, S.; Goldenberg, M.; Grantcharov, T.; et al. Evaluation of Deep Learning Models for Identifying Surgical Actions and Measuring Performance. JAMA Netw. Open 2020, 3, 201664.
- Roy, M.; Kong, J.; Kashyap, S.; et al. Convolutional Autoencoder Based Model Histocae for Segmentation of Viable Tumor Regions in Liver Whole-Slide Images. Sci. Rep. 2011, 11, 139.
- Chen, H.; Zhang, Y.; Zhang, W.; et al. Low-Dose CT via Convolutional Neural Network. Biomed. Opt. Express 2017, 8, 679–694.
- Goodfellow, I.; Pouget-Abadie, J.; Mirza, M.; et al. Generative Adversarial Networks. Commun. ACM 2020, 63, 139–144.
- Michelutti, L.; Tel, A.; Zeppieri, M.; et al. Generative Adversarial Networks (GANs) in the Field of Head and Neck Surgery: Current Evidence and Prospects for the Future—A Systematic Review. J. Clin. Med. 2024. 13, 3556.
- Son, J.; Park, S.J.; Jung, K.-H. Towards Accurate Segmentation of Retinal Vessels and the Optic Disc in Fundoscopic Images with Generative Adversarial Networks. J. Digit. Imaging 2019, 32, 499–512.
- Wolterink, J.M.; Leiner, T.; Viergever, M.A.; et al. Generative Adversarial Networks for Noise Reduction in Low-Dose CT. IEEE Trans. Med. Imaging 2017, 36, 2536–2545.
- Pan, Z.; Yu, W.; Yi, X.; et al. Recent progress on generative adversarial networks (GANs): A survey. IEEE Access. 2019, 7, 36322–36333.
- Han, K.; Wang, Y.; Chen, H.; et al. A survey on vision transformer. IEEE Trans. Pattern Anal. Mach. Intell. 2022, 45, 87–110.
- Li, X.; Zhang, L.; Yang, J.; et al. Role of Artificial Intelligence in Medical Image Analysis: A Review of Current Trends and Future Directions. J. Med. Biol. Eng. 2024, 1–3.
- Atabansi, C.C.; Nie, J.; Liu, H.; et al. A survey of Transformer Applications for Histopathological Image Analysis: New Developments and Future Directions. BioMed. Eng. OnLine 2023, 22, 96.
- Majeed, T.; Masoodi, T.A.; Macha, M.A., et al. Addressing data imbalance challenges in oral cavity histopathological whole slide images with advanced deep learning techniques. Int. J. Syst. Assur. Eng. Manag. 2024, 26, 1–9.
- Kavyashree, C.; Vimala, H.S.; Shreyas, J. Improving oral cancer detection using pretrained model. In Proceedings of the 2022 IEEE 6th conference on information and communication technology (CICT), Gwalior, India, 18–20 November 2022; pp. 1–5.
- You, Z.; Han, B.; Shi, Z.; et al. Vocal Cord Leukoplakia Classification Using Siamese Network Under Small Samples of White Light Endoscopy Images. Otolaryngol. Head Neck Surg. 2024, 170, 1099–1108.
- Peng, J.; Zhou, Z.; Han, X.; et al. Multi-Level Fusion Graph Neural Network: Application to PET and CT Imaging for Risk Stratification of Head And Neck Cancer. Biomed. Signal Process. Control 2024, 92, 106137.
- Al Ajmi, E.; Forghani, B.; Reinhold, C.; et al. Spectral Multi- Energy CT Texture Analysis with Machine Learning for Tissue Classification: An Investigation Using Classification of Benign Parotid Tumours as a Testing Paradigm. Eur. Radiol. 2018, 28, 2604–2611.
- Ranjbar, S.; Ning, S.; Zwart, C.M.; et al. Computed Tomography-Based Texture Analysis to Determine Human Papillomavirus Status of Oropharyngeal Squamous Cell Carcinoma. J. Comput. Assist. Tomogr. 2018, 42, 299–305.
- Wu, B.; Khong, P.-L.; Chan, T. Automatic Detection and Classification of Nasopharyngeal Carcinoma on PET/CT with Support Vector Machine. Int. J. Comput. Assist. Radiol. Surg. 2012, 7, 635–646.
- Siebers, S.; Zenk, J.; Bozzato, A.; et al. Computer Aided Diagnosis of Parotid Gland Lesions Using Ultrasonic Multi-Feature Tissue Characterization. Ultrasound Med. Biol. 2010, 36, 1525–1534.
- Huang, R., Zhou, Z., Wang, X., et al. Magnetic resonance imaging features on deep learning algorithm for the diagnosis of nasopharyngeal carcinoma. Contrast Media Mol. Imaging 2022, 3790269.
- Ramkumar, S.; Ranjbar, S.; Ning, S.; et al. Mri-Based Texture Analysis to Differentiate Sinonasal Squamous Cell Carcinoma from Inverted Papilloma. Am. J. Neuroradiol. 2017, 38, 1019–1025.
- Li, C.; Jing, B.; Ke, L.; et al. Development and Validation of an Endoscopic Images-Based Deep Learning Model for Detection with Nasopharyngeal Malignancies. Cancer Commun. 2018, 38, 1–11.
- Al-Ma’aitah, M.; AlZubi, A.A. Enhanced Computational Model for Gravitational Search Optimized Echo State Neural Networks Based Oral Cancer Detection. J. Med. Syst. 2018, 42, 1–7.
- Moccia, S.; De Momi, E.; Guarnaschelli, M.; et al. Confident Texture-Based Laryngeal Tissue Classification for Early Stage Diagnosis Support. J. Med. Imaging 2017, 4, 034502.
- Song, B.; Sunny, S.; Uthoff, R.D.; et al. Automatic Classification of Dual-Modalilty, Smartphone-Based Oral Dysplasia and Malignancy Images Using Deep Learning. Biomed. Opt. Express 2018, 9, 5318–5329.
- Roblyer, D.; Kurachi, C.; Stepanek, V.; et al. Comparison of Multispectral Wide-Field Optical Imaging Modalities to Maximize Image Contrast for Objective Discrimination of Oral Neoplasia. J. Biomed. Opt. 2010, 15, 066017.
- Quang, T.; Tran, E.Q.; Schwarz, R.A.; et al. Prospective Evaluation of Multimodal Optical Imaging with Automated Image Analysis to Detect Oral Neoplasia in Vivomultimodal Optical Imaging to Detect Oral Neoplasia In Vivo Cancer Prev. Res. 2017, 10, 563–570 .
- Kang, E.Y.-C.; Yeung, L.; Lee, Y.-L.; et al. A Multimodal Imaging–Based Deep Learning Model for Detecting Treatment-Requiring Retinal Vascular Diseases: Model Development and Validation Study. JMIR Med. Inform. 2021, 9, 28868.
- Halicek, M.; Little, J.V.; Wang, X.; et al. Optical biopsy of head and neck cancer using hyperspectral imaging and convolutional neural networks. In Optical Imaging, Therapeutics, and Advanced Technology in Head and Neck Surgery and Otolaryngology; SPIE: Bellingham, WA, USA. 2018; Volume 10469, pp. 8–16.
- Jeyaraj, P.R.; Samuel Nadar, E.R. Computer-Assisted Medical Image Classification for Early Diagnosis of Oral Cancer Employing Deep Learning Algorithm. J. Cancer Res. Clin. Oncol. 2019, 145, 829–837.
- Ma, L.; Lu, G.; Wang, D.; et al. Adaptive Deep Learning for Head and Neck Cancer Detection Using Hyperspectral Imaging. Vis. Comput. Ind. Biomed. Art 2019, 2, 1–12.
- Halicek, M.; Shahedi, M.; Little, J.V.; et al. Head and Neck Cancer Detection in Digitized Whole-Slide Histology Using Convolutional Neural Networks. Sci. Rep. 2019, 9, 14043.
- He, Y.; Cheng, Y.; Huang, Z.; et al. A deep convolutional neural network-based method for laryngeal squamous cell carcinoma diagnosis. Ann. Transl. Med. 2021, 9.
- Rahman, T.Y.; Mahanta, L.B.; Das, A.K.; et al. Automated Oral Squamous Cell Carcinoma Identification Using Shape, Texture and Color Features of Whole Image Strips. Tissue Cell 2020, 63, 101322.
- Rahman, T.; Mahanta, L.; Chakraborty, C.; et al. Textural Pattern Classification for Oral Squamous Cell Carcinoma. J. Microsc. 2018, 269, 85–93.
- Rodner, E.; Bocklitz, T.; Eggeling, F.; et al. Fully Convolutional Networks in Multimodal Nonlinear Microscopy Images for Automated Detection of Head and Neck Carcinoma: Pilot Study. Head Neck 2019, 41, 116–121.
- Tang, H.; Li, G.; Liu, C.; et al. Diagnosis of Lymph Node Metastasis in Head and Neck Squamous Cell Carcinoma Using Deep Learning. Laryngoscope Investig. Otolaryngol. 2022, 7, 161–169.
- Ariji, Y.; Fukuda, M.; Kise, Y.; et al. Contrast-Enhanced Computed Tomography Image Assessment of Cervical Lymph Node Metastasis in Patients with Oral Cancer by Using a Deep Learning System of Artificial Intelligence. Oral Surg. Oral Med. Oral Pathol. Oral Radiol. 2019, 127, 458–463.
- Abram, T.J.; Floriano, P.N.; James, R.; et al. Development of a Cytology-Based Multivariate Analytical Risk Index for Oral Cancer. Oral Oncol. 2019. 92, 6–11.
- Tapak, L.; Shirmohammadi-Khorram, N.; Amini, P.; et al. Prediction of Survival and Metastasis in Breast Cancer Patients Using Machine Learning Classifiers. Clin. Epidemiol. Glob. Health 2019, 7, 293–299.
- Zhang, Q.; Xiao, Y.; Dai, W.; et al. Deep Learning Based Classification of Breast Tumors with Shear-Wave Elastography. Ultrasonics 2016, 72, 150–157.
- Chen, R.J.; Lu, M.Y.; Wang, J.; et al. Pathomic Fusion: An Integrated Framework for Fusing Histopathology and Genomic Features for Cancer Diagnosis and Prognosis. IEEE Trans. Med. Imaging 2020, 41, 757–770.
- Goode, A.; Gilbert, B.; Harkes, J.; et al. Openslide: A Vendor-Neutral Software Foundation for Digital Pathology. J. Pathol. Inf. 2013, 4, 27.