Literature detail

Evolutionary history, potential intermediate animal host, and cross-species analyses of SARS-CoV-2.

Xingguang Li1 Junjie Zai2 Qiang Zhao3 Qing Nie4 Yi Li1 Brian T Foley5 Antoine Chaillon6
Affiliations 6 institutions
  1. Hubei Engineering Research Center of Viral Vector, Wuhan University of Bioengineering, Wuhan, China.
  2. Immunology Innovation Team, School of Medicine, Ningbo University, Ningbo, China.
  3. Precision Cancer Center Airport Center, Tianjin Cancer Hospital Airport Hospital, Tianjin, China.
  4. Department of Microbiology, Weifang Center for Disease Control and Prevention, Weifang, China.
  5. HIV Databases, Theoretical Biology and Biophysics Group, Los Alamos National Laboratory, Los Alamos, New Mexico.
  6. Department of Medicine, University of California San Diego, La Jolla, California.
PMID 32104911 2020 J Med Virol eng ppublish
PubMed DOI Browse context

Article

Publication summary

To investigate the evolutionary history of the recent outbreak of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) in China, a total of 70 genomes of virus strains from China and elsewhere with sampling dates between 24 December 2019 and 3 February 2020 were analyzed. To explore the potential intermediate animal host of the SARS-CoV-2 virus, we reanalyzed virome data sets from pangolins and representative SARS-related coronaviruses isolates from bats, with particular attention paid to the spike glycoprotein gene. We performed phylogenetic, split network, transmission network, likelihood-mapping, and comparative analyses of the genomes. Based on Bayesian time-scaled phylogenetic analysis using the tip-dating method, we estimated the time to the most recent common ancestor and evolutionary rate of SARS-CoV-2, which ranged from 22 to 24 November 2019 and 1.19 to 1.31 × 10<sup>-3</sup> substitutions per site per year, respectively. Our results also revealed that the BetaCoV/bat/Yunnan/RaTG13/2013 virus was more similar to the SARS-CoV-2 virus than the coronavirus obtained from the two pangolin samples (SRR10168377 and SRR10168378). We also identified a unique peptide (PRRA) insertion in the human SARS-CoV-2 virus, which may be involved in the proteolytic cleavage of the spike protein by cellular proteases, and thus could impact host range and transmissibility. Interestingly, the coronavirus carried by pangolins did not have the RRAR motif. Therefore, we concluded that the human SARS-CoV-2 virus, which is responsible for the recent outbreak of COVID-19, did not come directly from pangolins.

COVID-19 cross-species transmission evolutionary rate potential intermediate animal host SARS-CoV-2 TMRCA Genome, Viral Pandemics Amino Acid Sequence Animals Betacoronavirus Chiroptera Coronavirus Infections COVID-19 Eutheria Evolution, Molecular Host Specificity Humans

Structured evidence records

Evidence records

4 total
2 records
Extraction confidence 0.95
Key finding

Phylogenetic and comparative genomics of 70 SARS-CoV-2 genomes showed a recent common ancestor in late November 2019, evolutionary rates around 1.2×10^-3 substitutions/site/year, and closest genomic similarity to the bat coronavirus BetaCoV/bat/Yunnan/RaTG13/2013 rather than pangolin viruses.

Virus
Location
Not specified
Supporting text

We performed phylogenetic, split network, transmission network, likelihood-mapping, and comparative analyses of the genomes. Based on Bayesian time-scaled phylogenetic analysis using the tip-dating method, we estimated the time to the most recent common ancestor and evolutionary rate of SARS-CoV-2, which ranged from 22 to 24 November 2019 and 1.19 to 1.31×10^-3 substitutions per site per year, respectively. Our results also revealed that the BetaCoV/bat/Yunnan/RaTG13/2013 virus was more similar to the SARS-CoV-2 virus than the coronavirus obtained from the two pangolin samples.

Genes or proteins
spike glycoprotein
Analysis methods
phylogenetic analysis; comparative genomics; split network analysis; Bayesian time-scaled phylogenetic analysis
Extraction confidence 0.95
Key finding

Comparative genomic analysis showed that the bat coronavirus BetaCoV/bat/Yunnan/RaTG13/2013 is most similar to SARS-CoV-2, whereas pangolin coronaviruses lack the RRAR motif in the spike, indicating they are not the direct source of human SARS-CoV-2.

Virus
Host
Location
Not specified
Supporting text

Our results also revealed that the BetaCoV/bat/Yunnan/RaTG13/2013 virus was more similar to the SARS-CoV-2 virus than the coronavirus obtained from the two pangolin samples (SRR10168377 and SRR10168378). Interestingly, the coronavirus carried by pangolins did not have the RRAR motif.

Genes or proteins
spike glycoprotein
Analysis methods
comparative genomics
1 records
Extraction confidence 0.90
Key finding

The study compared SARS-related coronaviruses from bats and pangolins, demonstrating animal-to-animal cross-species relationships among these hosts.

Virus
Location
Not specified
Supporting text

We reanalyzed virome data sets from pangolins and representative SARS-related coronaviruses isolates from bats, with particular attention paid to the spike glycoprotein gene.

Method
phylogenetic analysis; virome reanalysis; comparative genomic analysis
Study design
phylogenetic analysis
Transmission direction
animal-to-animal
1 records
Extraction confidence 0.90
Key finding

A PRRA insertion in the SARS-CoV-2 spike glycoprotein may enable proteolytic cleavage and influence host range and transmission efficiency.

Virus
Host
Not specified
Location
Not specified
Supporting text

We also identified a unique peptide (PRRA) insertion in the human SARS-CoV-2 virus, which may be involved in the proteolytic cleavage of the spike protein by cellular proteases, and thus could impact host range and transmissibility.

Genes or proteins
spike glycoprotein
Host factors
cellular proteases
Mutations
PRRA insertion
Mechanism types
proteolytic_cleavage; host_range; transmission_fitness