Statistical model builds SARS-CoV-2 interactome with miRNAs, protein-coding genes, and co-infecting microbes

NewsGuard 100/100 Score

The unprecedented global healthcare crises caused by the outbreak of the severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) has driven extensive research focused on understanding the virus’ infection mechanism, which will help develop effective therapeutics.

Although previous studies have reported protein-protein interaction linkages during the viral infection life cycle, a more comprehensive understanding of the full interactome containing human micro ribonucleic acids (miRNAs), protein-coding genes, and co-infecting microbes is crucial.

Study: Constructing a multiple-layer interactome for SARS-CoV-2 in the context of lung disease: Linking the virus with human genes and co-infecting microbes. Image Credit: Connect world / Shutterstock.com

*Important notice: bioRxiv publishes preliminary scientific reports that are not peer-reviewed and, therefore, should not be regarded as conclusive, guide clinical practice/health-related behavior, or treated as established information.

About the study

In a recent study published on the preprint server bioRxiv*, a team of researchers recently developed a statistical modeling method known as multiple-layer crosstalk (MLCrosstalk).

MLCrosstalk is an advanced statistical model based on Latent Dirichlet Allocation (LDA) that links multiple data types to build the entire interactome for SARS-CoV-2. MLCrosstalk can integrate samples with multiple information layers, ensure a consistent topic distribution on all types of data, and deduce individual-level linkages that can differ between patients.

The researchers also implemented a secondary refinement with network propagation to enable the microbe-gene linkages to focus on larger network structures. They first evaluated the trained model by analyzing the clustering of sample topic distributions. The model groups individuals with the coronavirus disease 2019 (COVID-19), healthy individuals, and those with community-acquired pneumonia (CAP) into distinct clusters.

The authors used multiple known gene sets such as the Kyoto Encyclopedia of Genes and Genomes (KEGG), virus-host protein-protein interaction (PPI), WikiPathways, and COVID-19-related gene sets to annotate the functions of top-weighted genes.

MLCrosstalk workflow. We transform gene expression, microbe abundance, and (pre)miRNA expression data, which are then input into the MLCrosstalk model. After training, we apply network propagation to refine the linkages. Multiple layer comparison and network tracing can identify shared and specific pathways and connections.

Study findings

The MLCrosstalk model had three major advantages in the integration analysis of multiple data types. To this end, this modeling approach handled sparse and noisy data using the Dirichlet distribution of hyperparameter, unifies topic distribution for all patient/samples, thus facilitating linkage identification across various types of data, and can be easily extended to many data types with missing samples.

Upon analysis of COVID-19 datasets, MLCrosstalk extracted dimensionally reduced patterns to show a detailed link between host protein-coding genes, non-coding genes, and microbes. MLCrosstalk constructed a comprehensive interactome for the gene-microbe-miRNA network, which was further refined through network propagation to integrate pathway data and link host-pathogen interactions with biological relevance.

The researchers used the Kullback-Leibler divergence between topic distributions and compared it to a random background to find that Topic 9 is the most interesting topic and differed this from the background distribution. Using multiple known gene sets, the authors annotated the functions of the top-weighted genes part of Topic 9 to discover that these genes are highly enriched in heat-shock response proteins and immune-related pathways.

The researchers analyzed and compared microbes with possible associations with SARS-CoV-2 and found that Rothia mucilaginosa, Prevotella melaninogenica, and Haemophilus parainfluenzae showed reduced relative abundance in patients with COVID-19. The results also showed that genes involved in the Notch signaling pathway like NOTCH4, HDAC2, PSEN1 are upregulated significantly in the lungs during COVID-19. Other bacteria like Escherichia coli, Staphylococcus aureus, and Klebsiella pneumoniae were also found to be highly associated with COVID-19, although many of these, especially gram-negative bacteria, could be nosocomial in some contexts.

The researchers inferred COVID-19-specific linked genes by comparing their occurrences in healthy individuals and those with COVID-19. They found that VEGFA-VEGFR2 and cytoplasmic ribosomal protein was associated with COVID-19.

Top-ranked pathways were identified using a random walk with restart (RWR) approach, whierein the findings identified the VEGFA-VEGFR2 as well as the immune pathway. MLCrosstalk identified genes like IFNAR1, IFNAR2, and STAT associated with the viral entry.

Conclusions

The MLCrosstalk statistical model developed by the researchers overcame three challenges including heterogeneity and noise in data, integration of multiple data types, and personalized linkage identification. Using MLCrosstalk, a list of genes and microbes associated with SARS-CoV-2 were identified, latent patterns of multiple data sets were retrieved, and sample-specific linkages showing biological evidence were identified.

The team identified the microbial co-infections associated with COVID-19, with some microbes showing synergistic and antagonistic effects with COVID-19. The linked genes of R. mucilaginosa had a high representation in COVID-19 patients, while those of P. melaninogenica had a lower representation in the aforementioned pathway creating an opposite pattern. Such distinct patterns were also seen between these two microbe groups for other pathways like immune-response, type II interferon signaling, and Notch signaling pathways.

*Important notice: bioRxiv publishes preliminary scientific reports that are not peer-reviewed and, therefore, should not be regarded as conclusive, guide clinical practice/health-related behavior, or treated as established information.

Journal reference:
Susha Cheriyedath

Written by

Susha Cheriyedath

Susha is a scientific communication professional holding a Master's degree in Biochemistry, with expertise in Microbiology, Physiology, Biotechnology, and Nutrition. After a two-year tenure as a lecturer from 2000 to 2002, where she mentored undergraduates studying Biochemistry, she transitioned into editorial roles within scientific publishing. She has accumulated nearly two decades of experience in medical communication, assuming diverse roles in research, writing, editing, and editorial management.

Citations

Please use one of the following formats to cite this article in your essay, paper or report:

  • APA

    Cheriyedath, Susha. (2021, December 12). Statistical model builds SARS-CoV-2 interactome with miRNAs, protein-coding genes, and co-infecting microbes. News-Medical. Retrieved on May 12, 2024 from https://www.news-medical.net/news/20211212/Statistical-model-builds-SARS-CoV-2-interactome-with-miRNAs-protein-coding-genes-and-co-infecting-microbes.aspx.

  • MLA

    Cheriyedath, Susha. "Statistical model builds SARS-CoV-2 interactome with miRNAs, protein-coding genes, and co-infecting microbes". News-Medical. 12 May 2024. <https://www.news-medical.net/news/20211212/Statistical-model-builds-SARS-CoV-2-interactome-with-miRNAs-protein-coding-genes-and-co-infecting-microbes.aspx>.

  • Chicago

    Cheriyedath, Susha. "Statistical model builds SARS-CoV-2 interactome with miRNAs, protein-coding genes, and co-infecting microbes". News-Medical. https://www.news-medical.net/news/20211212/Statistical-model-builds-SARS-CoV-2-interactome-with-miRNAs-protein-coding-genes-and-co-infecting-microbes.aspx. (accessed May 12, 2024).

  • Harvard

    Cheriyedath, Susha. 2021. Statistical model builds SARS-CoV-2 interactome with miRNAs, protein-coding genes, and co-infecting microbes. News-Medical, viewed 12 May 2024, https://www.news-medical.net/news/20211212/Statistical-model-builds-SARS-CoV-2-interactome-with-miRNAs-protein-coding-genes-and-co-infecting-microbes.aspx.

Comments

The opinions expressed here are the views of the writer and do not necessarily reflect the views and opinions of News Medical.
Post a new comment
Post

While we only use edited and approved content for Azthena answers, it may on occasions provide incorrect responses. Please confirm any data provided with the related suppliers or authors. We do not provide medical advice, if you search for medical information you must always consult a medical professional before acting on any information provided.

Your questions, but not your email details will be shared with OpenAI and retained for 30 days in accordance with their privacy principles.

Please do not ask questions that use sensitive or confidential information.

Read the full Terms & Conditions.

You might also like...
Do SARS-CoV-2 infections cause long-term loss of smell and taste?