Genome-wide covariation analysis sheds light on the evolution of SARS-CoV-2

Download PDF Copy

Add News Medical on Googleas a preferred source

By Sally Robertson, B.Sc.Mar 10 2021Revised

Researchers in the United States have computed genome-wide covariation within severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) to investigate interactions that could be of considerable importance in the prevention, diagnosis, and treatment of coronavirus disease 2019 (COVID-19).

When the researchers considered the level of variability within both the full genome and different virus clades, they found nucleotide variability differed between encoding regions of the full genome and between different clades.

Evan Cresswell-Clay and Vipul Periwal from The National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK) in Bethesda, Maryland, say future extensions of this analysis will provide several avenues of investigation.

As the database of SARS-CoV-2 genomes grows, increasing variability will yield further insights into important interactions within the genome.

Furthermore, the availability of this data over time will allow for chronological compartmentalization of genome datasets that could be used to study the temporal evolution of the virus.

The analysis can also be applied to other diseases for which more data is becoming available, says the team.

A pre-print version of the research paper is available on the bioRxiv* server, while the article undergoes peer review.

Study: Genome-Wide Covariation in SARS-CoV-2. Image Credit: NIAID

This news article was a review of a preliminary scientific report that had not undergone peer-review at the time of publication. Since its initial publication, the scientific report has now been peer reviewed and accepted for publication in a Scientific Journal. Links to the preliminary and peer-reviewed reports are available in the Sources section at the bottom of this article. View Sources

More about the SARS-CoV-2 genome

The genome of the SARS-CoV-2 virus – the agent responsible for the COVID-19-pandemic – was first characterized in December 2019.

The genome is around 30 kilobases in length and contains several open reading frames, including ORF1ab, ORF3a, ORF6, ORF7a, ORF7b, ORF8, and ORF10. These ORFs encode for non-structural proteins, while specific genomic regions encode four structural proteins, of which the spike protein is the largest.

The spike protein is the surface structure SARS-CoV-2 uses to bind to and infect host cells. The other three structural proteins include the envelope (E) and membrane (M) proteins that form the viral envelope and the nucleocapsid (N) protein that is involved in viral assembly.

The EpiCoV database

The design of vaccines and therapies depends on the structure and mutational stability of proteins encoded in the ORFs of the genome.

While the reference genome is used for most studies, a growing body of available data can be used to monitor variations in the genome and analyze the virus's evolution.

This data – assembled by GISAID (Global Initiative on Sharing Avian Influenza Data) – has enabled different SARS-CoV-2 strains to be documented in a new database called EpiCoV.

Since the first viral strain was entered on 10^th January 2020, the database has grown to include 292,000 submissions.

Now, Cresswell-Clay and Periwal have used 137,636 of these documented strains to analyze the evolution of SARS-CoV-2.

"The variation of the virus's genetic structure is of considerable medical and biological importance for prevention, diagnosis, and therapy," writes the team.

Comparative RNA sequence analysis has long been used to study co-evolution via covariance of nucleotide mutations. However, separating the indirect and direct interactions that lead to such covariation has been challenging, say the researchers.

What did the current study involve?

The team used an optimization method called Expectation Reflection together with Direct Coupling Analysis to compute the genome-wide covariation within SARS-CoV-2 and infer direct interactions within the viral genome.

"These interactions may also provide information on protein-protein interaction," writes the team. "Additionally, this analysis could be useful in vaccine development, aiding in efforts to mitigate 'escape pathways' for the virus to use in future strains."

The team identified genome interactions both within individual encoding regions and between different encoding regions throughout the genome.

The ORF1ab and Spike regions showed the most significant variability within the dataset.

Genome-wide interaction maps also expressed determinant positions of all clades available, while interaction maps of individual clades revealed clade-specific co-evolution of nucleotide positions.

Nucleotide variability was different both between encoding regions of the full genome and between different clades. Region-specific incidences were not consistent between clades, with different variability expressed in individual regions of different clades.

The analysis could help future research

Cresswell-Clay and Periwal say future extensions of this analysis could provide several research opportunities.

"First, as the database of SARS CoV-2 genomes grows, the incidence and overall variability will increase, yielding further insights into genome interactions," writes the team.

The increased availability of data over time will enable chronological compartmentalization of genome datasets and comparison of interaction maps across the temporal evolution of the virus.

"Second, this analysis can also be applied to diseases for which there is more data available, as the importance of genome interactions is not SARS-CoV-2 specific," say the researchers.

Journal references:

Preliminary scientific report. Cresswell-Clay E and Periwal V Genome-Wide Covariation in SARS-CoV-2. bioRxiv, 2021. doi: https://doi.org/10.1101/2021.03.08.434363, https://www.biorxiv.org/content/10.1101/2021.03.08.434363v2
Peer reviewed and published scientific report. Cresswell-Clay, Evan, and Vipul Periwal. 2021. “Genome-Wide Covariation in SARS-CoV-2.” Mathematical Biosciences, August, 108678. https://doi.org/10.1016/j.mbs.2021.108678. https://www.sciencedirect.com/science/article/pii/S0025556421001024.

Article Revisions

May 18 2023 - The preprint preliminary research paper that this article was based upon was accepted for publication in a peer-reviewed Scientific Journal. This article was edited accordingly to include a link to the final peer-reviewed paper, now shown in the sources section.

Posted in: Medical Research News | Disease/Infection News

Comments (0)

Written by

Sally Robertson

Sally first developed an interest in medical communications when she took on the role of Journal Development Editor for BioMed Central (BMC), after having graduated with a degree in biomedical science from Greenwich University.

Download PDF Copy

Citations

Please use one of the following formats to cite this article in your essay, paper or report:

APA
Robertson, Sally. (2023, May 18). Genome-wide covariation analysis sheds light on the evolution of SARS-CoV-2. News-Medical. Retrieved on June 24, 2026 from https://www.news-medical.net/news/20210310/Genome-wide-covariation-analysis-sheds-light-on-the-evolution-of-SARS-CoV-2.aspx.
MLA
Robertson, Sally. "Genome-wide covariation analysis sheds light on the evolution of SARS-CoV-2". News-Medical. 24 June 2026. <https://www.news-medical.net/news/20210310/Genome-wide-covariation-analysis-sheds-light-on-the-evolution-of-SARS-CoV-2.aspx>.
Chicago
Robertson, Sally. "Genome-wide covariation analysis sheds light on the evolution of SARS-CoV-2". News-Medical. https://www.news-medical.net/news/20210310/Genome-wide-covariation-analysis-sheds-light-on-the-evolution-of-SARS-CoV-2.aspx. (accessed June 24, 2026).
Harvard
Robertson, Sally. 2023. Genome-wide covariation analysis sheds light on the evolution of SARS-CoV-2. News-Medical, viewed 24 June 2026, https://www.news-medical.net/news/20210310/Genome-wide-covariation-analysis-sheds-light-on-the-evolution-of-SARS-CoV-2.aspx.

Comments

The opinions expressed here are the views of the writer and do not necessarily reflect the views and opinions of News Medical.

Post a new comment

(Logout)

Post

Sign in to keep reading

We're committed to providing free access to quality science. By registering and providing insight into your preferences you're joining a community of over 1m science interested individuals and help us to provide you with insightful content whilst keeping our service free.