INTRODUCTION

Colorectal cancer (CRC) is one of the most common cause of morbidity and mortality worldwide (Romero-Zoghbi et al. 2015). Histopathological analysis of tissue specimens is the mainstay of the diagnostic process, but is subjective by nature and can be influenced by inter-observer variability (Rizzo 2025). Image processing and machine learning technologies have advanced significantly in the last few years and are thus transforming digital pathology. Classical approaches such as machine learning, are typically reliant on handcrafted features to describe tissue morphology. However, the rapid progression of deep learning, especially with convolutional neural networks (CNNs), has revolutionized the fully automated feature learning and end-to-end analysis of histopathology images.

Traditionally, classical approaches such as support vector machines (SVMs), decision trees, and k-nearest neighbors (k-NN) were used after an extensive feature extraction process. The most common features included texture descriptors, color histograms, and morphological features, which were selected and fine-tuned by domain experts.

In contrast, deep learning algorithms have demonstrated an ability to build hierarchical representations from raw data without requiring explicit feature definitions. The advantage of this end-to-end learning process is that it streamlines the workflow and often leads to higher accuracies in tasks like tumor segmentation, classification, and even prognostic predictions. Nonetheless, deep learning methods have some flaws, it needs extensive annotated datasets, huge computational resources, and is less interpretable than classical methods.

Computational techniques applied to histopathological diagnosis have achieved high levels of efficiency and reproducibility. Critical comparison of these techniques will help clinicians select the optimal technique in the appropriate setting. In this paper, we seek to unify results from 25 pivotal studies so as to derive an insight of the comparative strengths and weaknesses of both the classical and deep learning paradigms as applied to colorectal cancer histopathology.

With the advancements in the field of computer vision and increasing availability of digital histology slides, it is not as difficult as before to develop accurate algorithms for automated CRC detection. The aim of this work is to give clinicians and researchers a comprehensive overview of the current state-of-the-art, compare the strengths and weaknesses of the various approaches and suggest directions for future work. We explore the entire workflow ranging from image acquisition, preprocessing, and algorithm implementation all the way through to the interpretation of results. The subsequent sections describe the literature, methods, discussion of findings, and ending with a summary of implications for clinical practice.

Research Objective

·        To systematically review classical machine learning techniques applied to colorectal histopathology image analysis.

·        To explore recent advancements in deep learning approaches for colorectal cancer detection using histopathology images.

·        To compare the effectiveness, advantages, and limitations of classical and deep learning methods in the context of colorectal cancer diagnosis.

LITERATURE REVIEW

Related work on computational methods for colorectal cancer diagnosis is broad. Both classical and deep learning methods have been investigated for the derivation of clinical information from histopathology images.  In the following subsections, we review studies that have played an important role in this area of work.

Classical Machine Learning Approaches

Classical machine learning techniques were used on early stages in digital pathology. Studies such as Wang et al. (2019) evaluate and compared traditional classifiers by manually extracting features like texture and shape descriptors. Komura and Ishikawa (2018) developed quantitative histology methods that used machine learning to correlate morphological patterns with patient outcomes. Chhillar et al. (2024) used a LightGBM classifier to identify colon cancer using histopathology images and achieved 99.90% accuracy. Komura and Ishikawa (2018) provided a comprehensive review of machine learning methods in histopathological image analysis, focusing on the need for robust feature extraction. Collectively, these studies showed that although classical methods largely require human labor, they are useful in certain diagnostic settings.

Besides, classical studies have introduced feature engineering and dimensionality reduction as potential solutions. Classical image processing pipelines have shown good performance in classifying tissue regions in both benign and malignant cases. Moreover, researchers developed techniques to normalize for differences in staining intensities, reducing image appearance variability. Despite these advances, traditional methods struggled with generalizability on heterogeneous datasets and even larger data sizes, a shortcoming that became increasingly evident after the advent of deep learning.

Deep Learning Approaches

Deep learning has been a game-changer for image analysis tasks such as classification, segmentation, and detection in digital pathology. Kather et al. (2019) confirmed that deep learning algorithms can predict microsatellite instability (MSI) from histology slides alone in gastrointestinal cancers, including colorectal cancer. Their study used CNNs showed a marked enhancement in diagnostic correctness over traditional approaches. Sirinukunwattana et al. (2016) specifically addressed nuclei detection and classification in colon cancer by proposing a locality-sensitive deep learning framework that does not require any prior segmentation of nuclei.

A number of works have explored the optimization of CNN architectures on histopathological images. Janowczyk and Madabhushi (2016) provided an overview of deep learning methods applied to digital pathology, discussing prominent architectural choices and data preprocessing techniques. Litjens et al. (2017) published a comprehensive review on deep learning applications to medical image analysis, highlighting how network architectures have developed to rival human-level performance on some diagnostic challenges. Banerji et al. (2022) built upon these results by exploring a wider array of deep learning models, using data augmentation and transfer learning approaches, both of which are important in situations when there is limited training data.

Other notable contributions include Tellez et al. (2019), who measured the effect of stain color normalization and data augmentation on CNN performance, and Dey et al. (2020) implemented a deep learning segmentation model specifically targeting colorectal cancer regions. In another study by Kather et al. (2018), where they used CNNs to predict patient survival from colorectal histopathology images, thereby connecting image analysis to clinical outcomes.

In addition to improving the classification accuracy, deep learning studies also yield insights into tumor heterogeneity. In the study by Khazaee et al. (2023), the authors designed a hybrid framework that combines classical features and deep learning representations, which improved the colon cancer detection rates. Hamida et al. (2021) and Malik et al. (2019) explored the use of fully convolutional networks for whole slide image analysis, achieving high sensitivity and specificity in identifying cancerous tissues.

Comparative Analyses and Hybrid Approaches

A number of studies have compared classical and deep learning techniques directly. Huang et al. (2023) presented a comparative analysis of both approaches in lung cancer detection, drawing parallels that are relevant to colorectal cancer applications. Jimenez-del-Toro et al. (2017) conducted a head-to-head comparison, showing that while deep learning models generally outperformed classical methods in accuracy, the latter offered improved interpretability and lower computational overhead. Madabhushi et al. (2016) provided a systematic review that examine various challenges and opportunities presents while using image processing and machine learning in digital pathology.

In one comparative study, Zhang et al. (2019) evaluated multiple deep learning architectures against traditional machine learning models, finding that deep networks with weakly supervised learning strategy significantly improved performance but required extensive computational resources. Meanwhile, classical methods maintained robustness in cases where the volume of data was insufficient for training deep models. The consensus among these studies is that a hybrid approach, which combines the transparency of classical techniques with the predictive power of deep learning, and weakly supervised learning may offer the optimal solution for colorectal cancer histopathology analysis.

Challenges and Future Trends

Despite the progress made by both paradigms, several challenges persist. Variability in staining, slide preparation, and scanning protocols remains a barrier to the standardization of image analysis workflows. Moreover, the scarcity of high-quality, annotated datasets continues to impede the training of deep learning models. Future research directions highlighted in the literature include the development of transfer learning methods, unsupervised learning strategies, and the integration of multi-scale analysis. Studies by Bera et al. (2019) and Shen et al. (2015) stress the importance of model interpretability and validation across multi-institutional datasets.

In summary, the literature reveals that both classical and deep learning approaches have unique advantages and limitations. While deep learning has pushed the boundaries of diagnostic accuracy, classical machine learning continues to provide valuable insights through interpretable features and lower data requirements. The following sections describe the methodology adopted to synthesize these findings and discuss the implications for clinical practice.

METHODOLOGY

We performed a systematic review of reporting studies that applied classical and deep learning techniques in colorectal cancer identification using histopathology images. It consists of three phases: Literature search, data extraction and quality assessment, and findings synthesis.

Literature Search and Inclusion Criteria

Several databases (PubMed, IEEE Xplore, Scopus, and Web of Science) were thoroughly searched to identify studies performed within the past 10 years. Based on the inclusion criteria, the search strategy covered the following keywords: “colorectal cancer,” “histopathology images,” “classical machine learning,” “deep learning,” “convolutional neural networks” and “image analysis”. Studies were included if they met the following criteria: (a) they reported the use of either classical machine learning or deep learning methods for the analysis of histopathology images; (b) they provided a quantitative assessment of model performance; and (c) the study focused on colorectal cancer or provided comparative insights that were relevant to CRC diagnosis. Studies that did not report performance metrics or were limited to theoretical frameworks were excluded.

Data Extraction and Quality Assessment

For each selected study, data were extracted on dataset characteristics (e.g., number of images, staining types), preprocessing techniques (e.g., normalization, augmentation), algorithmic approach (classical, deep-learning), performance metrics (accuracy, sensitivity, specificity, area under the curve (AUC), and key conclusions. A standard data extraction form was utilized for consistency. Studies were then assessed on their quality according to study design, dataset size, use of validation techniques (cross-validation, independent test set), and reproducibility of findings. Studies were subsequently classified based on the approach (classical or deep learning) used and were subsequently analyzed for methodological innovations and limitations.

Synthesis of Findings

This synthesis ofthe work was in a form of a narrative review that compared the pros and cons of traditional machine learning methods and deep learning methods. An emphasis wasplaced on studies that assessed hybrid models. Statistical estimates were not pooled directly in a meta-analysis becauseof heterogeneity of study designs and outcome measurements. Rather we performed qualitative synthesis and comparative analysis of reported outcomesto identify trends. The approach also included critical appraisal of the extent to which preprocessing methods like stain normalization has impact the analysis process, with some studies reporting such steps were essential for bothclassical and deep learning models.

DISCUSSION

Comparative Performance Analysis

The results of the reviewed studies reveal that deep learning approaches have a distinct advantage in terms of accuracy and sensitivity. For instance, Kather et al. (2019) and Kather et al. (2018) reported significant improvements in the ability of CNNs to detect subtle morphological features associated with microsatellite instability (MSI) and overall prognosis. These improvements can largely be attributed to the end-to-end learning nature of CNNs, which eliminates the need for manual feature selection. In contrast, classical methods, as demonstrated in the work of Jimenez-del-Toro et al. (2017) and Komura and Ishikawa (2018), provided results that were more interpretable but sometimes less robust to variations in image quality and staining.

Interpretability and Clinical Applicability

Interpretability is a major consideration in clinical practice. While deep learning models have shown impressive accuracy, their “black-box” nature often complicates the clinical decision-making process. Conversely, classical methods—by relying on manually extracted features—offer greater transparency. This trade-off is highlighted in studies such as Wang et al. (2019) and Madabhushi et al. (2016), where the importance of balancing performance with explainability is underscored. Hybrid models, which integrate both approaches, have been proposed as a means to harness the strengths of each. Khazaee et al. (2023) demonstrated that such models can achieve high accuracy while retaining some degree of interpretability, making them more acceptable in clinical settings.

Challenges and Limitations

Despite the successes, several challenges remain. Data scarcity and variability in tissue staining and slide preparation continue to impede the performance of both classical and deep learning models. Preprocessing steps, such as stain normalization and augmentation, are essential for mitigating these issues, as evidenced by Tellez et al. (2019) and Banerji et al. (2022). In addition, the lack of large, standardized, and publicly available datasets for colorectal cancer hampers the reproducibility and validation of results across institutions. The studies by Bilal et al. (2020) and Hamida et al. (2021) highlight the need for larger multi-center datasets to improve generalizability.

Future Directions and Recommendations

Future studies should emphasise on developing hybrid models that capture boththe automatic feature learning possibilities offered by the deep networks as well as the interpretability offered by classical machine learning. These data limitations are promising to address via transfer learning, domain adaptation andunsupervised learning. Researchers should also strive towards the standardization of staining and imagingprotocols, which would enable the generation of robust multi-institutional datasets. Techniques of interpretabilitysuch as attention mapping and feature visualization could help to connect complex deep learning models to clinical interpretability. Finally, working alongside pathologists will be important to ensure model outputs are clinicallymeaningful and easily used in routine diagnostics.

CONCLUSION

The current review has systematically presented acomparative overview between classical machine learning methods and deep learning-based approaches for histopathology image-based colorectal cancer diagnosis. Based on a systematic review and aggregation of results from 25 relevant studies, our findings reveal that deep learning modalities predominantlyoutperformed classical machine learning approaches; however, those were still superior with respect to interpretability and computational burden. Hybridapproaches that combine both methodologies might be the most fruitful way to go forward. The fusion of contemporary computational strategies intohistopathology represents a strong potential impact on diagnostic accuracy and patient prognosis. Future research should build off current limitations with standardizeddata protocols, model transparency, and interdisciplinary collaboration.

SECTION TITLE 6