Machine learning method for determining impact of mutations in the SARS-CoV-2 RBD on binding and antibody escape

Download PDF Copy

Add News Medical on Googleas a preferred source

By Colin Lightfoot, M.Sc. Infection and ImmunityReviewed by Danielle Ellis, B.Sc.Dec 14 2021Revised

As of late 2021, severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) variants associated with increased transmissibility and/or immune evasion (antibody escape) had nearly completely supplanted the original founder strain (Wu-Hu-1). Emerging variants frequently have at least one mutation in the receptor-binding domain (RBD), which can affect binding to angiotensin-converting enzyme 2 (ACE2). For example, alpha (B.1.1.7), beta, and gamma variants have the N501Y mutation, which is associated with higher affinity binding to ACE2, implying that this could be a selective pressure for variant emergence.

Study: Predictive profiling of SARS-CoV-2 variants by deep mutational learning. Image Credit: Lightspring/Shutterstock Study: Predictive profiling of SARS-CoV-2 variants by deep mutational learning. Image Credit: Lightspring/Shutterstock

Previous investigations used yeast surface display and deep mutational scanning (DMS) to examine the impact of single-position mutations on binding to ACE2 and monoclonal or serum antibodies on the complete 201 amino acid RBD of SARS-CoV-2. Several widely circulating variants (e.g., beta, gamma, and delta) as well as newly developing variants (e.g., mu (B.1621) and lambda (C.37) have numerous mutations in the RBD, which are associated with improved ACE2 binding and/or multi-class antibody escape.

This news article was a review of a preliminary scientific report that had not undergone peer-review at the time of publication. Since its initial publication, the scientific report has now been peer reviewed and accepted for publication in a Scientific Journal. Links to the preliminary and peer-reviewed reports are available in the Sources section at the bottom of this article. View Sources

The recent emergence of the omicron variant with 15 RBD mutations, which poses a significant danger of immune evasion, highlights the urgent need to understand the impact of combinatorial mutations. However, as the number of mutations and amino acid diversity increase, combinatorial sequence space expands exponentially, rapidly exceeding the capabilities of experimental screening procedures. For example, theoretical sequence space greatly exceeds what can be screened by yeast display libraries while focused only on a subset of twenty RBD residues directly involved in ACE2 binding.

Deep mutational learning (DML) is a technique developed by researchers from multiple institutions that combines experimental yeast display screening of RBD mutagenesis libraries with deep sequencing and machine learning. DML allows for a complete analysis of combinatorial RBD mutations and their impact on ACE2 binding and antibody escape, allowing for SARS-CoV-2 variant predictive profiling.

A preprint version of the study is available on the bioRxiv* server while the article undergoes peer review.

The study

The authors examined their classification performance on defined variants, followed by experimental validation and structural modeling, after establishing that ACE2 binding and antibody escape machine learning models can produce highly accurate predictions on test data. To replicate realistic evolutionary routes, synthetic lineages were created in silico, with variants lacking anticipated ACE2-binding intermediates at each mutational stage being discarded. The lineages were created to contain mutations from the original Wu-Hu-1 RBD sequence at edit distance 3 (ED3), ED5, and ED7 (nucleotide and amino acid). The sequences were also chosen to establish lineages with mutations found in circulating variations.

A consensus model was used to predict ACE2 binding, in which a given RBD sequence is projected to bind ACE2 if both the RF and RNN models provide P > 0.5; otherwise, they are anticipated to be non-binders. The 46 synthetic lineage variants were chosen for their ACE2 binding prediction variety (36 predicted binders, ten predicted non-binders). Additionally, predictions for escape from each of the four therapeutic antibodies were established using a similar consensus model technique for the synthetic variations (RBD sequence escapes an antibody when both RF and RNN outputs are P 0.5).

Each synthetic RBD variation was independently produced on the surface of yeast cells and tested for ACE2 binding and antibody escape after all machine learning predictions were completed. The consensus model accurately predicted ACE2 binding for 91.67 % of the synthetic variations, with a non-binding prediction accuracy of 100 %, yielding a prediction accuracy of 93.48 % overall. The cumulative accuracy of antibody escape predictions across all four therapeutic antibodies was 93.94 % for the 33 correctly predicted ACE2-binding variants.

In addition, consensus models predicted ACE2 binding and escape from all four therapeutic antibodies in three variations that were just ED3 (nucleotide and amino acid) from the Wu-Hu-1 RBD. Mutations were found in one of these variations at locations 493, 498, and 501, which are all mutated in the omicron variant. Following yeast display studies, the machine learning predictions of antibody escape from all four therapeutic antibodies, including the often mutation resistant REGN10987, were confirmed. AlphaFold2 was used to perform structural modeling on eight synthetic RBD variants. According to structural predictions, several non-binding ACE2 variants did not differ significantly from the original Wu-Hu-1 RBD. The ACE2-binding variations, on the other hand, displayed a wide range of potential structural conformations.

Implications

According to evidence, other endemic coronavirus receptor-binding domains may be undergoing adaptive evolution to avoid human antibody reactions. As a result, combining DML with phylogenetic models of viral evolution to predict SARS-CoV-2 escape from polyclonal antibodies present in the serum of vaccinated or convalescent individuals may enable the identification of future variants with the highest likelihood of emergence and thus support vaccine development for coronavirus disease 2019 (COVID-19).

Journal references:

Preliminary scientific report.
Taft, J. et al. (2021) "Predictive profiling of SARS-CoV-2 variants by deep mutational learning". bioRxiv. doi: 10.1101/2021.12.07.471580. https://www.biorxiv.org/content/10.1101/2021.12.07.471580v1
Peer reviewed and published scientific report. Taft, Joseph M., Cédric R. Weber, Beichen Gao, Roy A. Ehling, Jiami Han, Lester Frei, Sean W. Metcalfe, et al. 2022. “Deep Mutational Learning Predicts ACE2 Binding and Antibody Escape to Combinatorial Mutations in the SARS-CoV-2 Receptor-Binding Domain.” Cell 185 (21): 4008-4022.e14. https://doi.org/10.1016/j.cell.2022.08.024. https://www.cell.com/cell/fulltext/S0092-8674(22)01119-9.

Article Revisions

May 9 2023 - The preprint preliminary research paper that this article was based upon was accepted for publication in a peer-reviewed Scientific Journal. This article was edited accordingly to include a link to the final peer-reviewed paper, now shown in the sources section.

Posted in: Medical Science News | Medical Research News | Disease/Infection News

Comments (0)

Written by

Colin Lightfoot

Colin graduated from the University of Chester with a B.Sc. in Biomedical Science in 2020. Since completing his undergraduate degree, he worked for NHS England as an Associate Practitioner, responsible for testing inpatients for COVID-19 on admission.

Download PDF Copy

Citations

Please use one of the following formats to cite this article in your essay, paper or report:

APA
Lightfoot, Colin. (2023, May 09). Machine learning method for determining impact of mutations in the SARS-CoV-2 RBD on binding and antibody escape. News-Medical. Retrieved on July 20, 2026 from https://www.news-medical.net/news/20211214/Machine-learning-method-for-determining-impact-of-mutations-in-the-SARS-CoV-2-RBD-on-binding-and-antibody-escape.aspx.
MLA
Lightfoot, Colin. "Machine learning method for determining impact of mutations in the SARS-CoV-2 RBD on binding and antibody escape". News-Medical. 20 July 2026. <https://www.news-medical.net/news/20211214/Machine-learning-method-for-determining-impact-of-mutations-in-the-SARS-CoV-2-RBD-on-binding-and-antibody-escape.aspx>.
Chicago
Lightfoot, Colin. "Machine learning method for determining impact of mutations in the SARS-CoV-2 RBD on binding and antibody escape". News-Medical. https://www.news-medical.net/news/20211214/Machine-learning-method-for-determining-impact-of-mutations-in-the-SARS-CoV-2-RBD-on-binding-and-antibody-escape.aspx. (accessed July 20, 2026).
Harvard
Lightfoot, Colin. 2023. Machine learning method for determining impact of mutations in the SARS-CoV-2 RBD on binding and antibody escape. News-Medical, viewed 20 July 2026, https://www.news-medical.net/news/20211214/Machine-learning-method-for-determining-impact-of-mutations-in-the-SARS-CoV-2-RBD-on-binding-and-antibody-escape.aspx.

Comments

The opinions expressed here are the views of the writer and do not necessarily reflect the views and opinions of News Medical.

Post a new comment

(Logout)

Post

Sign in to keep reading

We're committed to providing free access to quality science. By registering and providing insight into your preferences you're joining a community of over 1m science interested individuals and help us to provide you with insightful content whilst keeping our service free.