Literature detail

Machine Learning Approach Effectively Predicts Binding Between SARS-CoV-2 Spike and ACE2 Across Mammalian Species - Worldwide, 2021.

Yue Ma1 Yu Hu1,2 Binbin Xia1 Pei Du1 Lili Wu1 Mifang Liang3 Qian Chen1,4 Huan Yan5 George F Gao1 Qihui Wang1 Jun Wang1
Affiliations 5 institutions
  1. CAS Key Laboratory of Pathogen Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China.
  2. School of Life Sciences, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, Anhui, China.
  3. State Key Laboratory for Molecular Virology and Genetic Engineering, National Institute for Viral Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing, China.
  4. Institute of Physical Science and Information, Anhui University, Hefei, Anhui, China.
  5. State Key Laboratory of Virology, Modern Virology Research Center, College of Life Sciences, Wuhan University, Wuhan, Hubei, China.
PMID 34804629 2021 China CDC Wkly eng ppublish
PubMed DOI Browse context

Article

Publication summary

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is a recently emergent coronavirus of natural origin and caused the coronavirus disease (COVID-19) pandemic. The study of its natural origin and host range is of particular importance for source tracing, monitoring of this virus, and prevention of recurrent infections. One major approach is to test the binding ability of the viral receptor gene ACE2 from various hosts to SARS-CoV-2 spike protein, but it is time-consuming and labor-intensive to cover a large collection of species. In this paper, we applied state-of-the-art machine learning approaches and created a pipeline reaching >87% accuracy in predicting binding between different ACE2 and SARS-CoV-2 spike. We further validated our prediction pipeline using 2 independent test sets involving >50 bat species and achieved >78% accuracy. A large-scale screening of 204 mammal species revealed 144 species (or 61%) were susceptible to SARS-CoV-2 infections, highlighting the importance of intensive monitoring and studies in mammalian species. In short, our study employed machine learning models to create an important tool for predicting potential hosts of SARS-CoV-2 and achieved the highest precision to our knowledge in experimental validation. This study also predicted that a wide range of mammals were capable of being infected by SARS-CoV-2.

ACE2 machine learning SARS-CoV-2

Structured evidence records

Evidence records

3 total
1 records
Extraction confidence 0.80
Key finding

The study experimentally validated binding between SARS-CoV-2 spike and ACE2 proteins from over 50 bat species, demonstrating cross-species susceptibility.

Virus
Host
Location
Not specified
Supporting text

We applied state-of-the-art machine learning approaches and created a pipeline reaching >87% accuracy in predicting binding between different ACE2 and SARS-CoV-2 spike. We further validated our prediction pipeline using 2 independent test sets involving >50 bat species and achieved >78% accuracy. A large-scale screening of 204 mammal species revealed 144 species (or 61%) were susceptible to SARS-CoV-2 infections.

Method
receptor-binding assay; experimental validation; prediction validation
Experimental system
in vitro cell culture
1 records
Extraction confidence 0.95
Key finding

Machine learning accurately predicted SARS-CoV-2 spike protein binding to ACE2 receptors from multiple mammalian species.

Virus
Host
Location
Not specified
Supporting text

We applied state-of-the-art machine learning approaches and created a pipeline reaching >87% accuracy in predicting binding between different ACE2 and SARS-CoV-2 spike.

Method
machine learning prediction; binding validation
Receptors
ACE2
1 records
Extraction confidence 0.70
Key finding

A large-scale computational surveillance identified 144 mammal species as potentially susceptible to SARS-CoV-2 infection based on predicted ACE2–spike binding.

Virus
Host
Location
Supporting text

A large-scale screening of 204 mammal species revealed 144 species (or 61%) were susceptible to SARS-CoV-2 infections, highlighting the importance of intensive monitoring and studies in mammalian species.

Method
machine learning
Geographic raw
Worldwide