Using explainable AI framework for gut microbiome-based colorectal cancer classification

NewsGuard 100/100 Score

The gut microbiome comprises a complex population of different bacterial species that are essential to human health. In recent years, scientists across several fields have found that changes in the gut microbiome can be linked to a wide variety of diseases, notably colorectal cancer (CRC). Multiple studies have revealed that a higher abundance of certain bacteria, such as Fusobacterium nucleatum and Parvimonas micra, is typically associated with CRC progression.

Based on these findings, researchers have developed various artificial intelligence (AI) models to help them analyze which bacterial species are useful as CRC biomarkers. However, most of these models rely on what is known as "global explanations," meaning that they can only consider the entirety of the input data to make predictions. As a result, such models are unable to identify bacterial species that could be relevant CRC biomarkers for smaller, less-representative groups of patients.

Against this backdrop, a research team from Tokyo Institute of Technology (Tokyo Tech), Japan, decided to adopt a different approach capable of addressing this limitation. As outlined in their paper, which was recently published in Genome Biology, the team employed an explainable AI framework that provides local, rather than global, explanations for its CRC predictions.

Local explanation techniques make it possible to discover the most contributing bacteria for each individual CRC patient, enabling us to examine inter-individual differences between subjects within a disease group."

Associate Professor Takuji Yamada, main author of the study

The team used a framework called "Shapley additive explanations" (SHAP), which originated from a concept in game theory called the Shapley value. Put simply, the Shapley value tells us how a payout should be distributed among the players of a coalition or group. Similarly, in their study, the team used SHAP to calculate the contribution of each bacterial species to each individual CRC prediction.

Using this approach along with data from five CRC datasets, the researchers discovered that projecting the SHAP values into a two-dimensional (2D) space allowed them to see a clear separation between healthy and CRC subjects. Clustering this 2D information resulted in four distinct subgroups of CRC subjects, each differing in the CRC probability and the associated bacteria. In addition, the team found that subjects in the CRC subgroups with the highest CRC probability always had an enriched population of bacteria typically associated with CRC. Most remarkably, the results were consistent across the five datasets, showcasing the wide applicability of this method.

With these promising results, the team anticipates their approach to make solid contributions in the gut microbiome research community. "Considering the increasing use of machine learning in microbiome–disease association studies, our novel method could be beneficial for a more personalized microbiome data exploration as well as help uncover potential disease subgroups along with their potential associated biomarkers," speculates Dr. Yamada. Further, the technique is also applicable to other diseases with known links to the gut microbiome, such as ulcerative colitis, Chron's disease, and diabetes.

Hopefully, explainable AI will reveal more such secrets of the gut microbiome in the near future, so stay tuned!

Source:
Journal reference:

Rynazal, R., et al. (2023). Leveraging explainable AI for gut microbiome-based colorectal cancer classification. Genome Biology. doi.org/10.1186/s13059-023-02858-4

Comments

The opinions expressed here are the views of the writer and do not necessarily reflect the views and opinions of News Medical.
Post a new comment
Post

While we only use edited and approved content for Azthena answers, it may on occasions provide incorrect responses. Please confirm any data provided with the related suppliers or authors. We do not provide medical advice, if you search for medical information you must always consult a medical professional before acting on any information provided.

Your questions, but not your email details will be shared with OpenAI and retained for 30 days in accordance with their privacy principles.

Please do not ask questions that use sensitive or confidential information.

Read the full Terms & Conditions.

You might also like...
Study unveils novel bladder cancer diagnostic model based on key mitochondrial genes