Lung Cancer Detection Using Deep Learning

Published 12 Apr 2026 in cs.CV, cs.AI, and cs.LG | (2604.10765v1)

Abstract: Lung cancer, the second leading cause of cancer-related deaths, is primarily linked to long-term tobacco smoking (85% of cases). Surprisingly, 10-15% of cases occur in non-smokers. In 2020, approximately 2 million people were affected globally, resulting in 1.5 million deaths. The survival rate, at around 20%, lags behind other cancers, partly due to late-stage symptom manifestation. Necessitates early and accurate detection for effective treatment. Performance metrics such as accuracy, precision, recall (sensitivity), and F1-score are computed to provide a comprehensive evaluation of each model's capabilities. By comparing these metrics, this study offers insights into the strengths and limitations of each approach, contributing to the advancement of lung cancer detection techniques. In this paper, we are going to discuss the methodologies of lung cancer detection using different deep learning algorithms - InceptionV3, MobileNetV2, VGG16, ResNet152 - are explored for their efficacy in classifying lung cancer cases. Our Proposed Model algorithm based is a 16 layers architecture based on CNN model. Our Proposed model exhibits several key highlights that contribute to its novelty. By integrating multiple layer types such as convolutional, pooling, flatten, dropout, fully connected and dense layers, the model leverages the strengths of each layer to enhance its predictive capabilities. Novelty of our proposed model is that its accuracy is increasing consistently with the increasing no of epochs. We have tested the model performance up to epoch no 30. Our proposed model also overcome the overfitting problem.

Abstract PDF Upgrade to Chat

Authors (2)

Summary

The paper introduces a custom 16-layer CNN architecture that achieves 84.44% test accuracy, outperforming established pre-trained models.
It outlines a structured design with stacked convolutional blocks, dropout regularization, and a softmax layer for multi-class CT scan classification.
The study emphasizes model interpretability, computational efficiency, and viability for clinical integration in resource-constrained settings.

Deep Learning-Based Detection of Lung Cancer from Chest CT Images

Background and Motivation

The persistent high mortality associated with lung cancer is fundamentally linked to diagnostic latency and the limitations of conventional human evaluation, particularly in low-resource environments. Existing studies have demonstrated the utility of deep learning (DL) in augmenting the sensitivity and specificity of cancer detection tasks by automating the analysis of complex radiological data. However, robust performance in real-world scenarios necessitates architectures that balance accurate feature extraction, overfitting mitigation, and computational efficiency, especially amid modest-sized domain-specific datasets.

Model Architecture and Justification

The work introduces and rigorously evaluates a custom convolutional neural network (CNN) with 16 layers, focusing on chest CT-scan image classification across clinically relevant lung cell types. The architecture comprises:

Five stacked convolutional blocks with ReLU activations, each followed by MaxPooling layers, enabling effective feature abstraction and dimensionality reduction.
A flatten layer transitioning the high-dimensional feature maps into a vectorized format for classification.
Two dense fully connected layers, each regularized with dropout to combat overfitting.
A final Dense (Softmax) layer for multi-class output, supporting flexible adaptation to both classification and regression tasks.

Key arguments for this bespoke architecture include enhanced interpretability, computational efficiency tailored for small to medium datasets, and structural flexibility supporting multi-output prediction. The model architecture diverges from conventional transfer learning paradigms (e.g., VGG16, ResNet152 pre-trained on ImageNet [15]), thereby better capturing domain-specific discriminative features. Dropout integration and the architectural restraint circumvent the excessive capacity that plagues larger pretrained models with limited medical data.

Experimental Protocol and Quantitative Performance

A Kaggle-derived chest CT image corpus underpins the experimental comparison among several canonical architectures (InceptionV3, MobileNetV2, VGG16, ResNet152) and the proposed model. Principal evaluation metrics include accuracy, precision, recall, F1-score, and RMSE, analyzed at increasing numbers of training epochs.

Salient empirical findings:

The proposed model attains a test accuracy of 84.44% at 30 epochs, outperforming ResNet152 (77.78%), VGG16 (75.22%), MobileNetV2 (61.27%), and InceptionV3 (42.86%).
Training accuracy for the proposed approach aligns with top performances (99.85%), indicating robust convergence without apparent overfitting, as the increase in epoch count does not degrade test accuracy—a property absent from most baselines.
Stability and consistent accuracy improvement are observed with further epochs, distinguishing the proposed model from the fluctuating performance of canonical deep models on the same dataset.
F1-scores, precision, and recall also peak for the proposed model, reflecting balanced predictive performance across all classes.

This consistent superiority evidences the advantage of custom-tuned, domain-aware architectures over transfer learning baselines, especially where dataset size and label granularity depart from generic image classification settings.

Implications and Future Directions

The findings underscore several theoretical and practical implications for lung cancer diagnostic pipelines:

Model Generalization and Interpretability: By leveraging modular, interpretable blocks, the design enhances the potential for clinical adoption, bolstering user trust and facilitating regulatory scrutiny. The architectural transparency and reduced parameter count mitigate the risk of spurious correlations prevalent in over-parameterized transfer learning models.
Small/Medium Dataset Viability: The study provides empirical substantiation that custom CNNs can systematically outperform large-scale pre-trained models in medical image analysis when faced with moderate sample sizes, typical in specialized healthcare applications.
Computational Resource Management: Beyond accuracy gains, the custom model's lightweight nature renders it deployable in resource-constrained clinical settings, facilitating translational impact beyond tertiary care centers.

However, several limitations must be acknowledged. The study relies on a single annotated CT data source; broader generalization assessment requires cross-cohort validation and robustness testing across diverse imaging modalities. Furthermore, model interpretability, already enhanced relative to opaque transfer networks, would benefit from post hoc explainability methods tailored to critical clinical decision points.

Future research should address:

Hyperparameter Tuning and Data Augmentation: Systematic optimization targeting further improvements in accuracy and calibration under limited data regimes.
Integration with Multimodal Data: Extending the architecture to fuse radiological images with clinical metadata (cf. [11]), supporting a more comprehensive risk assessment.
Prospective Clinical Trials: Rigorous clinical validation to bridge the translational gap and quantify real-world diagnostic and prognostic utility.
Ethical and Regulatory Considerations: Formalizing workflows for privacy, fairness, and transparency, ensuring responsible AI deployment in healthcare.

Conclusion

This work presents a domain-specialized CNN architecture for lung cancer detection on CT images, achieving superior evaluation metrics over established pre-trained models under equivalent experimental conditions. The model's consistent accuracy, resilience against overfitting, and computational efficiency substantiate its relevance as a deployable candidate in clinical workflows. Further research should focus on larger-scale validation, cross-modal learning, and integration of explainability mechanisms, enabling trustworthy and effective translation into medical practice.

Reference:

Lung Cancer Detection Using Deep Learning (2604.10765)

Markdown Report Issue