The origin of SARS-CoV-2 furin cleavage site remains a mystery

The ongoing pandemic of coronavirus disease 2019 (COVID-19), caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), has largely defied attempts to contain its spread by non-pharmaceutical interventions (NPIs). With the massive loss of life and economic damage, the only way out, in the absence of specific antiviral therapeutics, has been the development of vaccines to achieve population immunity.

A new study on the Preprints server discusses the origin of the furin cleavage site on the SARS-CoV-2 spike protein, which is considered responsible for the virus’s relatively high infectivity compared with its relatives in the sarbecovirus subgenus of the betacoronaviruses.

Study: SARS-CoV-2 and the secret of the furin site. Image Credit: Juan Gaertner / Shutterstock

*Important notice: Preprints publishes preliminary scientific reports that are not peer-reviewed and, therefore, should not be regarded as conclusive, used to guide clinical practice or health-related behavior, or treated as established information.

The furin cleavage site

SARS-CoV-2 is a betacoronavirus and is most closely related to the bat SARS-related coronavirus (SARSr-CoV) represented by the genome sequence RaTG13, with which it shares 96% identity. This has made the bat virus the most probable precursor of the virus in current circulation.

The origin of this strain is linked to the emergence of the novel furin cleavage site in the viral spike glycoprotein. Furin is a serine protease, widely expressed in human cells, that cleaves the SARS-CoV-2 spike at the interface of its two subunits. Furin itself is encoded by a gene on human chromosome 15.

Furin acts on substrates with single or paired basic residues during the processing of proteins within cells. Such polybasic furin cleavage sites are found in various proteins from many viruses, including betacoronaviruses of the Embecovirus and Merbecovirus subgenera. However, within the sarbecoviruses (betacoronavirus lineage B), this type of site is unique to SARS-CoV-2.

To identify the origin of the furin cleavage site, the study took a bioinformatic approach based on the genomic data available in the National Center for Biotechnology Information (NCBI) databases.

Same ancestor

The researchers found three groups of coronaviruses that were very similar to SARS-CoV-2 at the genomic level. These are the Pangolin-CoVs (2017, 2019), the Bat-SARS-like viruses (CoVZC45, CoVZXC21) and bat RaTG13.

Three genomic fingerprints were used to identify these matches: fingerprint 1, in the orf1a RNA polymerase gene, including the nsp2 and nsp3 genes; fingerprint 2, at the beginning of the S gene, covering the part encoding the N-terminal domain and the receptor-binding domain (RBD), which mediates attachment to the host cell receptor, angiotensin-converting enzyme 2 (ACE2); and fingerprint 3, the orf8 gene.

These fingerprints are distinctive to the three closely related coronaviruses only at the RNA level; the amino acid sequences of the translated proteins are similar to those of other sarbecoviruses.
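A difference that shows up at the RNA level but not at the protein level is what synonymous codons produce: different triplets can encode the same amino acid. As a minimal sketch, and not the preprint's actual method, the Python snippet below translates two short sequences with the standard genetic code and compares their identity at both levels. The sequences `seq_a` and `seq_b` and the helper names are hypothetical, invented purely for illustration.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), built from the
# conventional T, C, A, G ordering of codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq):
    """Translate an in-frame DNA or RNA sequence into one-letter amino acids."""
    seq = seq.upper().replace("U", "T")
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def identity(a, b):
    """Fraction of matching positions between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x == y for x, y in zip(a, b)) / len(a)


# Hypothetical toy sequences (NOT real viral data): same peptide, different codons.
seq_a = "CGGCTGAGCGGA"
seq_b = "AGATTATCTGGT"

nucleotide_identity = identity(seq_a, seq_b)
protein_identity = identity(translate(seq_a), translate(seq_b))
print(translate(seq_a), translate(seq_b))
print(round(nucleotide_identity, 2), protein_identity)
```

In this toy case the two sequences match at only 4 of 12 nucleotide positions yet translate to the same four residues, which is the kind of pattern the fingerprints are described as showing.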

These shared genomic sequences indicate a common ancestry, which is further supported by other short sequence features: one deletion and three insertions. All three strains show the same deletion-insertion pattern at the same four locations in the spike gene.

Spike gene recombination in a common ancestor

Phylogenetic analysis of these three strains showed that the pangolin coronavirus was the first to diverge, with RaTG13 being the closest to SARS-CoV-2. However, when only the spike gene is analyzed, the pangolin CoV, RaTG13 and SARS-CoV-2 are highly similar.

This may indicate that recombination occurred between the ancestors of Pangolin-CoV (2017) and RaTG13, followed by the shift of the pangolin CoV to pangolin hosts.

Phylogenetic tree of the closely related SARS-CoV-2 coronaviruses based on complete genomes.

Unique codons encoding arginines in the furin cleavage site

The furin cleavage site arises from four inserted amino acids, PRRA (proline, arginine, arginine, alanine), which are encoded by 12 inserted nucleotides in the S gene. A characteristic feature of this site is an arginine doublet.
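The arithmetic here is simply that 12 nucleotides make four codons, one per amino acid. The short Python sketch below splits a 12-nucleotide insert into codons and translates it. The CGG codons for the two arginines are the ones this article reports. The flanking codons CCT (proline) and GCA (alanine) are the commonly cited form of the insert but are not stated in this article, so they should be treated as an assumption. The names `INSERT` and `split_codons` are hypothetical.

```python
# Only the codons needed for this illustration (standard genetic code).
CODON_TO_AA = {"CCT": "P", "CGG": "R", "GCA": "A"}

# Assumed 12-nucleotide insert: CGG CGG is reported in the article;
# the flanking CCT and GCA are an assumption, not taken from it.
INSERT = "CCTCGGCGGGCA"


def split_codons(seq):
    """Split an in-frame nucleotide sequence into consecutive triplets."""
    return [seq[i:i + 3] for i in range(0, len(seq), 3)]


codons = split_codons(INSERT)
peptide = "".join(CODON_TO_AA[c] for c in codons)
print(len(INSERT), codons, peptide)
```

Because the insert length is a multiple of three, it adds whole amino acids without shifting the reading frame of the downstream spike sequence.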

This insertion could have arisen by a random insertion mutation, by recombination or by laboratory insertion. The researchers say the probability of random insertion is too low to explain the origin of this motif.

Surprisingly, the CGGCGG codons encoding the two arginines of the doublet in SARS-CoV-2 are not found in any of the furin sites in other viral proteins expressed by a wide range of viruses.

Even within SARS-CoV-2, although arginine can be encoded by any of six codons, only a minority of arginine residues are encoded by the CGG codon. Only two of the 42 arginines in the SARS-CoV-2 spike are encoded by this codon, and these are the two in the PRRA motif.
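Codon usage of this kind can be tallied directly from a coding sequence. The sketch below assumes an in-frame DNA or RNA coding sequence and counts how often each of the six arginine codons appears. The sequence `toy_cds` is made up for illustration and is not the real spike gene, and the function name `arginine_codon_usage` is hypothetical.

```python
from collections import Counter

# The six arginine codons of the standard genetic code (DNA alphabet).
ARGININE_CODONS = {"CGT", "CGC", "CGA", "CGG", "AGA", "AGG"}


def arginine_codon_usage(cds):
    """Count each arginine codon in an in-frame coding sequence."""
    cds = cds.upper().replace("U", "T")
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    codons = (cds[i:i + 3] for i in range(0, usable, 3))
    return Counter(c for c in codons if c in ARGININE_CODONS)


# Hypothetical toy sequence (NOT viral data): ATG CGG CGG AGA CGT TAA
toy_cds = "ATGCGGCGGAGACGTTAA"
usage = arginine_codon_usage(toy_cds)
total = sum(usage.values())
for codon, count in sorted(usage.items()):
    print(codon, count, f"{count / total:.0%}")
```

Run on a real spike coding sequence retrieved from a public database, the same tally would let a reader check the reported count of CGG codons among the spike's arginines.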

For recombination to occur, there must be a donor, from another furin site and probably from another virus. In the absence of a known virus containing this arginine doublet encoded by the CGGCGG codons, the researchers discount the recombination theory as the mechanism underlying the emergence of PRRA in SARS-CoV-2.

Time of acquisition

The second question is when this shift occurred. The RaTG13 virus was isolated in 2013, which could indicate that this site was acquired after that, giving rise to the current SARS-CoV-2 strain. This could mean it occurred within the bat host before it leaped the species barrier, or within the human host itself.

The first scenario is supported by the finding of the same RBD modifications in bat and human viruses, with three O-linked glycans around the furin site having been acquired in both. Both viruses also show completely identical sequences around this site.

To add weight to this hypothesis, however, a crucial piece of evidence is missing. RaTG13 sequences sampled in 2021 would need to be analyzed to find out whether the furin site was acquired by this virus as well as by the SARS-CoV-2 ancestor in the bat.

Conclusion

Describing this mystery site as “a furin site that has changed the world,” the researchers sum up:

“All these lines of evidence and reasoning show that the acquisition of the polybasic furin cleavage site by SARS-CoV-2 is a “missing link” in our understanding of its evolutionary history, that can only be addressed through the discovery of new viruses.”



Written by

Dr. Liji Thomas

Dr. Liji Thomas is an OB-GYN, who graduated from the Government Medical College, University of Calicut, Kerala, in 2001. Liji practiced as a full-time consultant in obstetrics/gynecology in a private hospital for a few years following her graduation. She has counseled hundreds of patients facing issues from pregnancy-related problems and infertility, and has been in charge of over 2,000 deliveries, striving always to achieve a normal delivery rather than operative.

Citations

Please use one of the following formats to cite this article in your essay, paper or report:

  • APA

    Thomas, Liji. (2021, February 17). The origin of SARS-CoV-2 furin cleavage site remains a mystery. News-Medical. Retrieved on December 05, 2024 from https://www.news-medical.net/news/20210217/The-origin-of-SARS-CoV-2-furin-cleavage-site-remains-a-mystery.aspx.

  • MLA

    Thomas, Liji. "The origin of SARS-CoV-2 furin cleavage site remains a mystery". News-Medical. 05 December 2024. <https://www.news-medical.net/news/20210217/The-origin-of-SARS-CoV-2-furin-cleavage-site-remains-a-mystery.aspx>.

  • Chicago

    Thomas, Liji. "The origin of SARS-CoV-2 furin cleavage site remains a mystery". News-Medical. https://www.news-medical.net/news/20210217/The-origin-of-SARS-CoV-2-furin-cleavage-site-remains-a-mystery.aspx. (accessed December 05, 2024).

  • Harvard

    Thomas, Liji. 2021. The origin of SARS-CoV-2 furin cleavage site remains a mystery. News-Medical, viewed 05 December 2024, https://www.news-medical.net/news/20210217/The-origin-of-SARS-CoV-2-furin-cleavage-site-remains-a-mystery.aspx.

Comments

  1. John Farmer (United States) says:

    I find it odd that this is not getting more attention. It is the origin of this site that can explain the greatest mystery of our age. Was SARS-CoV-2 natural or lab-based? Anyway, thanks for the great writing.

    • julius ceasar (Canada) says:

      If it came from nature, they most assuredly would not have destroyed the genome. If it came from nature, it is the only unidentified host in the history of viruses.

  2. Pipa Luce (Italy) says:

    Ebola and HIV also have a polybasic cleavage site, and their origin is natural.

    But all highly pathogenic viruses have the same background: poor health conditions (Spanish flu: war; Wuhan: pollution; Ebola and HIV: wars and violence).

    My question: "May weakened animals have an extracellular environment that favors the formation of the polybasic cleavage site?"

  3. USugo (Italy) says:

    Fun fact: the authors of the preprint on which this article is based appear to be mysterious.
    Their affiliation is not clear, and authors with matching names on PubMed have few publications, none related to virology or genomics.
    It smells fishy.

  4. Jeff Reynolds (United States) says:

    It would seem that the obvious conclusion from this excellent summary of the mystery of the origins of the FCS is that this novel FCS is the result of laboratory manipulation. I’m not sure why that point wasn’t made more strongly as the implications of this are far-reaching, to say the least. The relevant sections of your article that make this abundantly clear are the following:
    “The researchers say the possibility of random insertion is too low to explain the origin of this motif. Surprisingly, the CGGCGG codons encoding the two arginines of the doublet in SARS-CoV-2 are not found in any of the furin sites in other viral proteins expressed by a wide range of viruses….For recombination to occur, there must be a donor, from another furin site and probably from another virus. In the absence of a known virus containing this arginine doublet encoded by the CGGCGG codons, the researchers discount the recombination theory as the mechanism underlying the emergence of PRRA in SARS-CoV-2.”

  5. Dan Nesmith (United States) says:

    "The SARS-CoV-2 is a betacoronavirus, and is most closely related to the bat SARS-related coronavirus (SARSr-CoV) represented by the genome sequence RaTG13, which shares 96% identity with the former. This has made the bat virus the most probable precursor of the virus in current circulation."
    If overlap of genome is all that matters, then are humans descended from chimps or vice versa? Just because RaTG13 is the known closest (currently) doesn't mean it's even a precursor to SARS-CoV-2, does it? I didn't realize we knew that much about betacoronaviruses.

  6. erik brown (Canada) says:

    LMAO... you call a 0.02% death rate (global) a "massive loss". *tupid or deliberate. Either way, your conscience should keep you up all night.

The opinions expressed here are the views of the writer and do not necessarily reflect the views and opinions of News Medical.