One in every three people is expected to have cancer in their lifetime, making it a major health concern for mankind. A crucial indicator of the outcome of cancer is its tumor microsatellite status-whether it is stable or unstable. It refers to how stable the DNA is in tumors with respect to the number of mutations within microsatellites. The tumor microsatellite status has important clinical value because patients with microsatellite instability-high (MSI-H) cancers usually have more promising outcomes compared to patients with microsatellite stable tumors. Furthermore, tumors deficient in mismatch repair proteins-these are cells with mutations in specific genes that are involved in correcting mistakes made when DNA is copied in a cell-respond well to immune checkpoint inhibitors (ICIs) and not necessarily to chemotherapeutics.
Therefore, health practitioners and experts suggest MSI testing for newly diagnosed gastric and colorectal cancers. In recent years, artificial intelligence (AI) has made significant strides in this field and its incorporation in clinical workflow is expected to provide cost-efficient and highly accessible MSI testing. While several studies have utilized deep learning methods such as convolutional neural networks and vision-transformer-based techniques for MSI status prediction, they fail to capture the uncertainty in the prediction. Moreover, most of them do not provide key insights into ICI responsiveness, restricting their clinical applications.
Addressing these shortcomings, in a recent breakthrough, a team of researchers from the USA and Korea, including Jae-Ho Cheong from Yonsei University College of Medicine and Jeonghyun Kang from Gangnam Severance Hospital, Yonsei University College of Medicine proposed MSI-SEER. This innovative deep Gaussian process-based Bayesian model analyzes hematoxylin and eosin-stained whole-slide images in weakly-supervised learning to predict microsatellite status in gastric and colorectal cancers. A central and distinctive contribution of this study lies in the integration of uncertainty prediction and quantification into the AI model. Specifically, the model is equipped with the capability to self-assess its confidence by estimating predictive variance via Monte Carlo dropout. This variance is then transformed into a Bayesian Confidence Score (BCS), which quantifies the reliability of each prediction. As a result, the AI model is able to recognize instances where its predictions carry high uncertainty-effectively "knowing what it does not know." In such cases, the system, termed MSI-SEER, automatically flags these high-uncertainty slides for secondary review by human pathologists, rather than making autonomous decisions. The novel findings were made available online and published in the journal npj digital medicine on 19 May 2025.
"This study provides a vital blueprint for how an AI model that 'knows what it doesn't know,' which in turn increases the system's overall reliability, can create an AI-Human collaboration clinical framework for safer, more reliable, and more useful in real-world clinical environments," Prof. Cheong said.
According to Prof. Cheong, "We performed extensive validation using multiple large datasets comprising patients from diverse racial backgrounds and found that MSI-SEER achieved state-of-the-art performance with MSI prediction by integrating uncertainty prediction."
In addition, the model proved to be highly accurate for ICI responsiveness prediction by integrating tumor MSI status and stroma-to-tumor ratio. Furthermore, the tile-level predictions by MSI-SEER provided key insights into the contribution of spatial distribution of MSI-H regions in the tumor microenvironment and ICI response.
"We believe our technology already has potential for real-world application as a form of prospective cohort surveillance, or a kind of Phase IV clinical trials. The longer-term implication of this study is that it is not about a single specific predictive AI model. Rather, it has a broader implication of how AI algorithm can analyze clinical multi-modal data and create clinically usable models for precision cancer medicine," expands Prof. Cheong on the possibilities of their innovation.
Overall, this work showcases the utilization of an AI model to devise clinically usable algorithm to predict the responsiveness to ICIs for patients with cancer.
Source:
Journal reference:
Park, S., et al. (2025) Deep Gaussian process with uncertainty estimation for microsatellite instability and immunotherapy response prediction from histology. npj Digital Medicine. doi.org/10.1038/s41746-025-01580-8.