Literature detail

Evolutionary origins of the SARS-CoV-2 sarbecovirus lineage responsible for the COVID-19 pandemic.

Maciej F Boni1 Philippe Lemey2 Xiaowei Jiang3 Tommy Tsan-Yuk Lam4 Blair W Perry5 Todd A Castoe5 Andrew Rambaut6 David L Robertson7
Affiliations 7 institutions
  1. Center for Infectious Disease Dynamics, Department of Biology, Pennsylvania State University, University Park, PA, USA. [email protected].
  2. Department of Microbiology, Immunology and Transplantation, KU Leuven, Rega Institute, Leuven, Belgium. [email protected].
  3. Department of Biological Sciences, Xi'an Jiaotong-Liverpool University, Suzhou, China.
  4. State Key Laboratory of Emerging Infectious Diseases, School of Public Health, The University of Hong Kong, Hong Kong SAR, China.
  5. Department of Biology, University of Texas Arlington, Arlington, TX, USA.
  6. Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, UK. [email protected].
  7. MRC-University of Glasgow Centre for Virus Research, Glasgow, UK. [email protected].
PMID 32724171 2020 Nat Microbiol eng ppublish
PubMed DOI Browse context

Article

Publication summary

There are outstanding evolutionary questions on the recent emergence of human coronavirus SARS-CoV-2 including the role of reservoir species, the role of recombination and its time of divergence from animal viruses. We find that the sarbecoviruses-the viral subgenus containing SARS-CoV and SARS-CoV-2-undergo frequent recombination and exhibit spatially structured genetic diversity on a regional scale in China. SARS-CoV-2 itself is not a recombinant of any sarbecoviruses detected to date, and its receptor-binding motif, important for specificity to human ACE2 receptors, appears to be an ancestral trait shared with bat viruses and not one acquired recently via recombination. To employ phylogenetic dating methods, recombinant regions of a 68-genome sarbecovirus alignment were removed with three independent methods. Bayesian evolutionary rate and divergence date estimates were shown to be consistent for these three approaches and for two different prior specifications of evolutionary rates based on HCoV-OC43 and MERS-CoV. Divergence dates between SARS-CoV-2 and the bat sarbecovirus reservoir were estimated as 1948 (95% highest posterior density (HPD): 1879-1999), 1969 (95% HPD: 1930-2000) and 1982 (95% HPD: 1948-2009), indicating that the lineage giving rise to SARS-CoV-2 has been circulating unnoticed in bats for decades.

Angiotensin-Converting Enzyme 2 Animals Bayes Theorem Betacoronavirus China Chiroptera Coronavirus Infections COVID-19 Evolution, Molecular Genetic Variation Genome, Viral Humans Pandemics Peptidyl-Dipeptidase A Phylogeny Pneumonia, Viral Recombination, Genetic SARS-CoV-2

Structured evidence records

Evidence records

5 total
2 records
Extraction confidence 0.96
Key finding

Sarbecoviruses frequently undergo recombination, but SARS-CoV-2 is not a recombinant of sampled sarbecoviruses.

Virus
Host
Not specified
Location
Not specified
Supporting text

We find that the sarbecoviruses—the viral subgenus containing SARS-CoV and SARS-CoV-2—undergo frequent recombination and exhibit spatially structured genetic diversity on a regional scale in China. SARS-CoV-2 itself is not a recombinant of any sarbecoviruses detected to date.

Event type
recombination
Extraction confidence 0.90
Key finding

SARS-CoV-2 is not a recombinant of known sarbecoviruses, and its receptor-binding motif was not recently acquired via recombination.

Virus
Host
Not specified
Location
Not specified
Supporting text

SARS-CoV-2 itself is not a recombinant of any sarbecoviruses detected to date, and its receptor-binding motif, important for specificity to human ACE2 receptors, appears to be an ancestral trait shared with bat viruses and not one acquired recently via recombination.

Event type
recombination
1 records
Extraction confidence 0.98
Key finding

Phylogenetic and Bayesian analyses of 68 sarbecovirus genomes estimated that SARS-CoV-2 diverged from bat sarbecoviruses decades ago, indicating long-term circulation in bats and no recent recombination origin.

Virus
Host
Location
Not specified
Supporting text

To employ phylogenetic dating methods, recombinant regions of a 68-genome sarbecovirus alignment were removed with three independent methods. Bayesian evolutionary rate and divergence date estimates were shown to be consistent for these three approaches and for two different prior specifications of evolutionary rates based on HCoV-OC43 and MERS-CoV.

Analysis methods
phylogenetic analysis; Bayesian evolutionary dating
1 records
Extraction confidence 0.85
Key finding

The receptor-binding motif conferring human ACE2 specificity in SARS-CoV-2 is an ancestral feature shared with bat sarbecoviruses, suggesting a long-term molecular adaptation predating recent recombination events.

Virus
Host
Not specified
Location
Not specified
Supporting text

SARS-CoV-2 itself is not a recombinant of any sarbecoviruses detected to date, and its receptor-binding motif, important for specificity to human ACE2 receptors, appears to be an ancestral trait shared with bat viruses and not one acquired recently via recombination.

Genes or proteins
receptor-binding motif
Receptors
ACE2
Mechanism types
receptor_binding; host_range; molecular_adaptation
1 records
Extraction confidence 0.82
Key finding

SARS-CoV-2's ancestral lineage has circulated undetected in bats for decades, supporting bats in China as a long-term reservoir for sarbecoviruses.

Virus
Host
Location
Supporting text

Divergence dates between SARS-CoV-2 and the bat sarbecovirus reservoir were estimated as 1948 (95% HPD: 1879–1999), 1969 (95% HPD: 1930–2000) and 1982 (95% HPD: 1948–2009), indicating that the lineage giving rise to SARS-CoV-2 has been circulating unnoticed in bats for decades.

Method
phylogenetic dating; Bayesian evolutionary analysis
Geographic raw
China
Country inferred
China