Skip to main content

Molecular phylogeny of coronaviruses and host receptors among domestic and close-contact animals reveals subgenome-level conservation, crossover, and divergence



Coronaviruses have the potential to cross species barriers. To learn the molecular intersections among the most common coronaviruses of domestic and close-contact animals, we analyzed representative coronavirus genera infecting mouse, rat, rabbit, dog, cat, cattle, white-tailed deer, swine, ferret, mink, alpaca, Rhinolophus bat, dolphin, whale, chicken, duck and turkey hosts; reference or complete genome sequences were available for most of these coronavirus genera. Protein sequence alignments and phylogenetic trees were built for the spike (S), envelope (E), membrane (M) and nucleocapsid (N) proteins. The host receptors and enzymes aminopeptidase N (APN), angiotensin converting enzyme 2 (ACE2), sialic acid synthase (SAS), transmembrane serine protease 2 (TMPRSS2), dipeptidyl peptidase 4 (DPP4), cathepsin L (and its analogs) and furin were also compared.


Overall, the S, E, M, and N proteins segregated according to their viral genera (α, β, or γ), but the S proteins of alphacoronaviruses lacked conservation of phylogeny. Interestingly, the unique polybasic furin cleavage motif found in severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) but not in severe acute respiratory syndrome coronavirus (SARS-CoV) or Middle East respiratory syndrome coronavirus (MERS-CoV) exists in several β-coronaviruses and a few α- or γ-coronaviruses. Receptors and enzymes retained host species-dependent relationships with one another. Among the hosts, critical ACE2 residues essential for SARS-CoV-2 spike protein binding were most conserved in white-tailed deer and cattle.


The polybasic furin cleavage motif found in several β- and other coronaviruses of animals points to the existence of an intermediate host for SARS-CoV-2, and it also offers a counternarrative to the theory of a laboratory-engineered virus. Generally, the S proteins of coronaviruses show crossovers of phylogenies indicative of recombination events. Additionally, the consistency in the segregation of viral proteins of the MERS-like coronavirus (NC_034440.1) from pipistrelle bat supports its classification as a β-coronavirus. Finally, similarities in host enzymes and receptors did not always explain natural cross-infections. More studies are therefore needed to identify factors that determine the cross-species infectivity of coronaviruses.

Peer Review reports


The ongoing pandemic caused by severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) has led to unprecedented interest in the study of coronaviruses, although the first coronavirus was identified in the 1930s [1]. Given the most recent outbreaks by SARS-CoV-2, severe acute respiratory syndrome coronavirus (SARS-CoV) and Middle East respiratory syndrome coronavirus (MERS-CoV), the potential for zoonoses and reverse zoonoses, in conjunction with viral, host and environmental factors, have elevated the need to identify the origin, transmission, pathogenesis and control of and novel therapeutic and preventive strategies against these viruses.

The first two-thirds of the coronavirus genome encodes proteins needed for replication, and the remaining one-third encodes accessory and structural proteins, which include hemagglutinin esterase (HE) (present in only Group 2 coronaviruses), spike (S), envelope (E), membrane (M) and nucleocapsid (N) [2, 3]. Previously grouped based on serology, coronaviruses are now categorized into α (alpha), β (beta), γ (gamma) and δ (delta) groups using genetics. Coronaviruses affect many host species ranging from mammals to birds [3, 4].

Cross-species coronavirus infections have been reported among humans and animals. The SARS-CoV epidemic in 2002, for example, appears to have jumped from bats to infect palm civets, then to raccoon dogs, Chinese ferret badgers and finally into the human population [5]. In domestic animals, the most likely recent interspecies transmission leading to an outbreak may be that of canine respiratory coronavirus discovered in 2003 [6]. In Africa, a recent study by Burimuah et al. also showed that the close association of ruminants can lead to a spillover of coronaviruses from cattle to other small ruminants [7]. A high degree of sequence identity among some canine, bovine and human β-coronaviruses [2, 5, 8] strongly suggests that coronaviruses in close-contact environments could recombine or jump the species barrier to initiate emerging infections with sustained transmissions in new hosts. In the last two decades, SARS-CoV-2 has been the third zoonotic coronavirus to originate from animals and cause a pandemic in humans. Although the intermediate host for SARS-CoV-2 has not yet been identified [9, 10], recent comparative molecular analysis indicates that the pangolin and bats harbor the closest relative viruses [11,12,13,14].

With the current public availability of whole-genome sequences and additional tools for the analysis of both proteome and genome data, it is now possible to examine viruses and other microbes in great detail even before experimental validations. In this study, we analyzed the phylogenetic relationships among selected coronaviruses that infect domestic and close-contact animals to sketch the subgenomic relationships among the viruses. The molecular phylogeny of coronavirus proteins (spike, envelope, membrane and nucleocapsid), putative host-cell receptors and key functional domains of host enzymes were compared.


Viruses of the same genera may form variable clades at subgenomic levels

Overall, viral S, E, M, and N proteins clustered together according to the viral genera groups (Fig. 1 A-D), although intragroup discordances were evident. For alpaca respiratory α-coronavirus, the S protein shared the same origin as that of the porcine epidemic diarrhea (PED) virus, but distinct from its clade, the E protein shared a distant origin with β-coronaviruses (Fig. 1B). Among β-coronaviruses, proteins from canine respiratory, bovine, rabbit, and white-tailed deer coronaviruses were closely related. The E, M, and N proteins from transmissible gastroenteritis (TGE) virus, canine coronavirus (also referred to as canine enteric coronavirus) and PED virus showed close phylogeny with each other, while the S protein showed discordance. Feline coronavirus (strain UU11) and feline infectious peritonitis (FIP) virus E, M, and N proteins were closely related (Fig. 1 B-D). However, the phylogeny of the S protein from FIP virus was distant from that of feline coronavirus (strain UU11) (Fig. 1 A). To test for possible past recombination events that may have led to the crossover phylogeny between viruses of unrelated host species, we performed a SimPlot analysis of the entire genomes of canine coronavirus, PED virus, TGE virus and alpaca respiratory coronavirus, holding the canine coronavirus sequence as a query against the other three. The results showed that both TGE- and PED-coronaviruses maintained 70-98% similarity with the canine coronavirus along the genome region (approximately 20,000 bases) proximal to the segment coding for the S protein. However, in the S gene segment, the PED virus showed no similarity to either canine coronavirus or TGE virus. TGE virus regained 90-96% similarity to canine coronavirus after a drop of the curve in the proximal region of the S gene (Fig. 1E). The alpaca respiratory coronavirus shared very limited similarity with the canine coronavirus between the proximal 10,000 and 20,000 bases but was similar to the PED virus between the 23,000 and 25,000 base marks corresponding to the spike gene segment. This suggests that the S gene segments of some coronaviruses may originate from other viruses via recombination, leading to divergent subgenome phylogeny among proteins of the same virus.

Fig. 1
figure 1

Clustering patterns for representative α, β, or γ coronaviruses in domestic and close-contact animals. Phylogenetic trees built for the A spike (S), B envelope (E), C membrane (M) and D nucleocapsid (N) proteins. Viruses are clustered according to their α, β, or γ groupings. *MERS-like coronavirus (PREDICT/PDF-2180, Acc #: NC_034440.1) currently not belonging to a classified coronavirus is seen clustering among β-coronaviruses. E SimPlot analysis showing the similarity score between the whole genomes of canine coronavirus, PED virus, TGE virus and alpaca respiratory coronavirus with canine coronavirus (also referred to as canine enteric coronavirus) set as the query sequence

The polybasic furin cleavage motif in SARS-CoV-2 is present in several β-coronaviruses

In SARS-CoV-2, the S protein contains a polybasic furin cleavage site with the motif RRAR (where R and A are arginine and alanine residues, respectively). This motif is present in the murine β-coronavirus but absent in SARS-CoV, while MERS-CoV has an RSVR sequence at approximately the same location. This cleavage site, located at the junction of the S1/S2 subdomains, specifically has an arginine residue at the second position (RRAR), which is essential for efficient cleavage of the S protein [15]. We show that this furin cleavage motif RRXR (where X is an alanine residue or another amino acid) is present in several β-coronaviruses and the avian infectious bronchitis virus (a γ-coronavirus) as shown in Fig. 2. Notably, a variation in the sequence is also evident in other S proteins, such as RVGR in a β-coronavirus of bat, RSRR in an α-coronavirus of cat, and RKRR in a γ-coronavirus of turkey. Therefore, this motif or its mutant variants exist in natural coronaviruses belonging to different groups.

Fig. 2
figure 2

Polybasic furin cleavage motif of SARS-CoV-2 is present in other coronaviruses. The protein sequence of the furin cleavage site within the S protein, with polybasic amino acid residues conserved among some coronaviruses, is shown. The residues constituting the RRXR motif (where R is an arginine residue and X is another amino acid) are marked with rectangles. The exact RRAR configuration of this motif in SARS-CoV-2 is present in the murine coronavirus but absent in others, including the bat virus and MERS-CoV. The RRXR motif is reversed in others such as the rabbit HKU14 coronavirus and turkey coronavirus. Amino acids are listed in single letter code

The TMPRSS2 cleavage site on the viral S protein and the catalytic residues of the host TMPRSS2 enzyme are conserved

The TMPRSS2 cleavage site of the SARS-CoV-2 S protein is well conserved among all viruses studied, although the residues flanking the cleavage site vary widely (Fig. 3A). TMPRSS2 is expressed in tissues of the aerodigestive tract of humans [16], and it is important for initiating SARS-CoV-2 infection by processing the S protein to release fusion peptides for membrane fusion [17]. On the host TMPRSS2 enzyme, we found both the substrate binding site and the triad of catalytic residues (histidine (H) 296, aspartic acid (D) 345 and serine (S) 441) located in the active site to be well conserved among all host species as shown in Fig. 3B and C, respectively.

Fig. 3
figure 3

S protein cleavage site and catalytic residues of the TMPRSS2 enzyme are conserved. A S protein sequence alignment at the site where human TMPRSS2 cleaves the SARS-CoV-2 spike protein. The TMPRSS2 enzyme cleaves between the arginine (R) and serine (S) residues. Only the FIP virus has a glycine (G) in place of the arginine (R) residue. Aligned protein sequences of the substrate binding site (B) and active site (C) of the TMPRSS2 enzyme of various host species are shown. Arrows in panel C point to the triad of catalytic residues histidine (H) 296, aspartic acid (D) 345 and serine (S) 441 that are essential for binding to the SARS-CoV-2 S protein. All conserved residues are shaded blue, and sequence consensus conservation is shown as colored bars (red, tall bars mean more conserved). The threshold for showing a consensus is set at > 70 for A and > 50% for B and C. The letter X denotes no consensus, and amino acids are listed in single letter codes

Comparison of host ACE2, TMPRSS2, APN SAS, DPP4, cathepsin L and furin proteins

Phylogenetic analysis of host enzymes and receptors was performed. TMPRSS2, SAS, APN, DPP4, cathepsin L (and its analogs) and furin showed high similarity among mammals, with distinct separation from those of birds. An overall species-based segregation pattern was observed for the various host enzymes and receptors, except for the ACE2 enzyme, where bat, rabbit and beluga whale ACE2 proteins were distantly related to proteins of other mammals and birds (Fig. 4A-G). Unlike the phylogenies of host species cathepsin and furin, which generally followed species relationship patterns, the phylogeny of DPP4 was rather peculiar in that proteins of cattle and white-tailed deer or beluga whale and bottlenose dolphin were very distantly related. On the other hand, cathepsins of dog and cattle were in the same clade (Fig. 4 E-G). Upon analyzing both the required and critical residues of ACE2 needed for SARS-CoV-2 S protein binding, the cattle and white-tailed deer proteins shared the most similarity scores with humans, followed closely by the dolphin and pig proteins and then by the cat, alpaca and dog proteins (Fig. 4H).

Fig. 4
figure 4

Comparisons of phylogenetic relationships among host receptors and enzymes. Phylogenetic comparison of A ACE2, B APN, C TMPRSS2, D SAS, E DPP4, F cathepsin L/procathepsin L and G furin among various host species. Generally, the phylogenetic segregation of host enzymes and receptors followed the species-related pattern except for the ACE2 enzyme, which also showed the highest genetic variation scale of 0.10. H Comparison of the key amino acids in human ACE2 projected to interact with the receptor binding domain of the S protein of SARS-CoV-2. Shaded in dark gray and arrowed in red are the positions and the single letter codes of the 20 amino acids and 5 critical residues, respectively, required for successful S protein attachment. The proportion of amino acids that overlap with the twenty required (/20) and the 5 critical (/5) residues are shown on the right in blue and red, respectively. The fractional scores for cattle and white-tailed deer are colored in aqua


We compared the protein-level phylogenetic relationships among common coronaviruses of domestic and close-contact animals and key infection-associated proteins in the hosts. In this report, we highlight that the polybasic furin cleavage site found in SARS-CoV-2, but not in SARS-CoV or MERS-CoV [15], exists in several β-coronaviruses included in this report, although the configuration varied in some of those. For example, in the murine coronavirus (accession no. ACN89705.1), the exact RRAR motif is observed, but in others, it is either reversed or palindromic (Fig. 2). The polybasic furin cleavage site in the SARS-CoV-2 S protein is considered unique to the novel virus that causes COVID-19 [18]. This has led to speculations about the possibility of a laboratory-engineered virus. However, the presence of the same and similar motifs in other β-coronaviruses and even other coronaviruses eliminates the theory of an artificially engineered virus. Although the current search for the intermediate host of SARS-CoV-2 suggests several potential candidates [19], possible recombination events of viruses in the same or different host species, which can give rise to a hybrid or a variant species, should not be ruled out. RpYN09 bat coronavirus, the most recently found closest relative of SARS-CoV-2, lacks the RRAR motif [14], suggesting that the parent virus containing this motif has yet to be discovered.

Although the identities of all host receptors for coronaviruses are not fully known, ACE2 is documented as a receptor mediating both infection and transmission of SARS-CoV-2 [20, 21]. ACE2 is a key regulator of the angiotensin system and is well expressed in the vascular endothelium, smooth muscle cells of the intestines, kidneys and heart muscle cells [22,23,24]. During infection, twenty key amino acid residues in the ACE2 enzyme interact with the S protein of SARS-CoV-2 [25]. Five of these residues, lysine (K), glutamic acid (E), aspartic acid (D), methionine (M) and lysine (K), at positions 31, 35, 38, 82 and 353, respectively, are critical for S protein binding [26]. Sequence alignment analysis showed that cattle and white-tailed deer share the most similarity scores to humans for both required and critical amino acid residues. Interestingly, a recent USDA/APHIS study showed that approximately 40% of the wild white-tailed deer population in four states in the USA tested positive for anti-SARS-CoV-2 immunoglobulins ( This finding indicates the possibility that SARS-CoV-2 may establish and spread in hosts with high affinity receptors. However, the similarity of a single receptor may not explain or predict cross-species infections. For example, while the bovine-canine cross-species jump of a β-coronavirus is obvious from the phylogeny, the comparison of their ACE2 does not indicate a unique relationship of this receptor between cow and dog, although their cathepsins belong to the same clade. Therefore, ACE2 may not necessarily be a receptor for bovine-canine interspecies viral infection. Similarly, ACE2 of bottlenose dolphin and pig shared more common residues with those of human ACE2 than other hosts. Specifically, the Rhinolophus bat reference sequence has one of the least phylogenetic and key amino acid commonalities with human ACE2 (Fig. 4H). This brings into question the relevance of the ACE2 receptor in this bat species.

The TMPRSS2 cleavage site in SARS-CoV-2 was conserved among all viruses we included (Fig. 3A). In the host species, both the substrate binding and active sites of the enzyme were well conserved. Notably, the triad of catalytic residues histidine (H) 296, aspartic acid (D) 345 and serine (S) 441 [27], which interact with the SARS-CoV-2 S protein, are conserved among all species included in this study (Fig. 3B). The overall phylogenetic segregation pattern of host enzymes and receptors was species-related except for the ACE2 enzyme, where bat, rabbit and whale proteins were distantly related to those of other mammals.

Among the viruses we analyzed, consistent patterns with little variation were observed among γ-coronaviruses. The canine respiratory, rabbit, bovine respiratory, and white-tailed deer coronaviruses also consistently clustered together. However, phylogenetic crossovers were evident among canine coronavirus, TGE virus, PED virus, feline coronavirus, and FIP virus. The S protein of the feline coronavirus included in this report was distant from that of the FIP virus, displaying discordant relationships that indicate past recombination events [28]. An interesting phylogenetic common origin was also noted for the S proteins of PED virus and the distantly related alpaca respiratory coronavirus. These crossovers are most likely from past recombination and mutation events. Coronaviruses are noted for their high frequency of recombination [29], and the patterns of recombination among coronaviruses of domestic animals deserve a closer look for us to recognize the origins of cross-species infections and to design effective disease control strategies.

Finally, we also noted that the MERS-like bat coronavirus (accession number YP_009361857.1; is annotated as unclassified in the National Center for Biotechnology Information (NCBI) database. Considering the phylogenetic clustering of the S, M, E, and N proteins, they most likely belong to the β-coronavirus group.

Limitations of our study include the unknown nature of the complete receptor repertoire for coronaviruses among domestic and close-contact animals. Although we based our comparative phylogeny primarily on the reported or putative human receptor or coronavirus infection-associated proteins, coronaviruses may not use the same receptors in different species. It is possible that a virus crossing to another species will adapt to using different receptors or associated proteins.


Recombination or a few critical mutations in viral receptor binding domains or receptor protein key residues may contribute to cross-species infections [30]. We highlighted key molecular relationships that exist among common coronaviruses and their host receptors. Notably, the existence of similar polybasic residues within the S proteins of several coronaviruses suggests that the RRXR sequence motif is not unique to SARS-CoV-2. Unlike genome-level phylogenies, subgenomic-level comparisons provide stronger insights into functional ontogeny and the identification of consequential recombination events. However, not all factors that determine species-specific or cross-species infections are currently known. Therefore, additional studies of host determinants for virus attachment and productive infectivity are urgently needed to understand the role different hosts play in coronavirus infections, maintenance, and host adaptations. Such studies will be critical to identify molecules that can be targeted for the prevention and control of diseases caused by these viruses, especially in hosts of economic or public health importance.


All protein sequences included in our analysis were collected from the National Center for Biotechnology Information (NCBI) database (, stored in FASTA formats and analyzed. In total, 32 coronaviruses, comprising eight α-coronaviruses, 19 β-coronaviruses and five γ-coronaviruses, were included in our comparative study as shown in Table 1. For each of the viruses, we assembled protein sequence data on the spike (S), membrane (M), envelope (E) and nucleocapsid (N) structural proteins. Furthermore, protein sequence data on key receptors and enzymes reported to be involved in viral infection, namely, angiotensin converting enzyme 2 (ACE2), transmembrane serine protease 2 (TMPRSS2), aminopeptidase N (APN), sialic acid synthase (SAS), dipeptidyl peptidase 4 (DPP4), cathepsin L and furin, of seventeen hosts were also assembled, and their details are shown in Table 2. The FASTA format of all sequences was organized in Text Editor for analyses. Using MEGA X software version 11, sequence alignments and phylogenetic trees were developed. Alignments were generated using the Muscle Tool with neighbor-joining as the cluster method, and the degree of consensus among conserved residues was visualized using Snap Gene software. The SimPlot program [31] was used to graphically depict similarities among selected sequences. Phylogenetic trees were constructed using the neighbor-joining method with a Poisson model of substitution at 1000 bootstrap replications, and all the results were validated with the maximum likelihood method. All other default settings of the bioinformatics programs used were applied.

Table 1 Virus sequences included in this study and their respective hosts
Table 2 Host receptor and enzyme sequences included in this study

Availability of data and materials

All data used in this study are publicly available in the National Center for Biotechnology Information (NCBI,, and the accession numbers for both virus and host sequences are provided in Tables 1 and 2.



















Angiotensin Converting Enzyme 2


Sialic Acid Synthase


Transmembrane Serine Protease 2


Acute Respiratory Syndrome Coronavirus-2


Acute Respiratory Syndrome Coronavirus


Middle East Respiratory Syndrome Coronavirus


Transmissible gastroenteritis


Porcine Epidemic Diarrhea PED


Feline Infectious Peritonitis


National Center for Biotechnology Information


  1. Hudson CB, Beaudette FR. Infection of the cloaca with the virus of infectious bronchitis. Science. 1932;76:34.

    CAS  Article  PubMed  Google Scholar 

  2. Vijgen L, Keyaerts E, Moës E, Thoelen I, Wollants E, Lemey P, et al. Complete genomic sequence of human coronavirus OC43: molecular clock analysis suggests a relatively recent zoonotic coronavirus transmission event. J Virol. 2005;79:1595–604.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  3. Belouzard S, Millet JK, Licitra BN, Whittaker GR. Mechanisms of coronavirus cell entry mediated by the viral spike protein. Viruses. 2012;4:1011–33.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  4. Wang L, Byrum B, Zhang Y. Detection and genetic characterization of deltacoronavirus in pigs, Ohio, USA, 2014. Emerg Infect Dis. 2014;20:1227–30.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  5. Perlman S, Netland J. Coronaviruses post-SARS: update on replication and pathogenesis. Nat Rev Microbiol. 2009;7:439–50.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  6. Erles K, Toomey C, Brooks HW, Brownlie J. Detection of a group 2 coronavirus in dogs with canine infectious respiratory disease. Virology. 2003;310:216–23.

    CAS  Article  Google Scholar 

  7. Burimuah V, Sylverken A, Owusu M, El-Duah P, Yeboah R, Lamptey J, et al. Molecular-based cross-species evaluation of bovine coronavirus infection in cattle, sheep and goats in Ghana. BMC Vet Res. 2020;16:405.

    CAS  Article  Google Scholar 

  8. Lu S, Wang Y, Chen Y, Wu B, Qin K, Zhao J, et al. Discovery of a novel canine respiratory coronavirus support genetic recombination among betacoronavirus1. Virus Res. 2017;237:7–13.

    CAS  Article  Google Scholar 

  9. Goldstein SA, Brown J, Pedersen BS, Quinlan AR, Elde NC. Extensive recombination-driven coronavirus diversification expands the pool of potential pandemic pathogens. bioRxiv. 2021:2021.02.03.429646.

  10. Gabutti G, d’Anchera E, Sandri F, Savio M, Stefanati A. Coronavirus: update related to the current outbreak of COVID-19. Infect Dis Ther. 2020;9:241–53.

    Article  PubMed  PubMed Central  Google Scholar 

  11. Boni MF, Lemey P, Jiang X, Lam TT-Y, Perry BW, Castoe TA, et al. Evolutionary origins of the SARS-CoV-2 sarbecovirus lineage responsible for the COVID-19 pandemic. Nat Microbiol. 2020;511. 2020;5:1408–17.

  12. Wrobel AG, Benton DJ, Xu P, Calder LJ, Borg A, Roustan C, et al. Structure and binding properties of pangolin-CoV spike glycoprotein inform the evolution of SARS-CoV-2. Nat Commun. 2021;12:837.

    CAS  Article  Google Scholar 

  13. Zhang T, Wu Q, Zhang Z. Probable pangolin origin of SARS-CoV-2 associated with the COVID-19 outbreak. Curr Biol. 2020;30:1346–1351.e2.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  14. Zhou H, Ji J, Chen X, Bi Y, Li J, Wang Q, et al. Identification of novel bat coronaviruses sheds light on the evolutionary origins of SARS-CoV-2 and related viruses. Cell. 2021;184:4380–4391.e14.

    CAS  Article  Google Scholar 

  15. Örd M, Faustova I, Loog M. The sequence at spike S1/S2 site enables cleavage by furin and phospho-regulation in SARS-CoV2 but not in SARS-CoV1 or MERS-CoV. Sci Rep. 2020;101. 2020;10:1–10.

  16. Russo R, Andolfo I, Lasorsa VA, Iolascon A, Capasso M. Genetic analysis of the coronavirus SARS-CoV-2 host protease TMPRSS2 in different populations. Front Genet. 2020;11:872.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  17. Papa G, Mallery DL, Albecka A, Welch LG, Cattin-Ortolá J, Luptak J, et al. Furin cleavage of SARS-CoV-2 spike promotes but is not essential for infection and cell-cell fusion. PLoS Pathog. 2021;17:e1009246.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  18. Li W. Delving deep into the structural aspects of a furin cleavage site inserted into the spike protein of SARS-CoV-2: a structural biophysical perspective. Biophys Chem. 2020;264:106420.

    CAS  Article  Google Scholar 

  19. Zhao J, Cui W, Tian B. The potential intermediate hosts for SARS-CoV-2. Front Microbiol. 2020;0:2400.

    Google Scholar 

  20. Ni W, Yang X, Yang D, Bao J, Li R, Xiao Y, et al. Role of angiotensin-converting enzyme 2 (ACE2) in COVID-19. Crit Care. 2020;24:422.

    Article  PubMed  PubMed Central  Google Scholar 

  21. Millet JK, Jaimes JA, Whittaker GR. Molecular diversity of coronavirus host cell entry receptors. FEMS Microbiol Rev. 2021;45:1–16.

    Article  Google Scholar 

  22. Bornstein SR, Dalan R, Hopkins D, Mingrone G, Boehm BO. Endocrine and metabolic link to coronavirus infection. Nat Rev Endocrinol. 2020;16:297–8.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  23. Bavishi C, Maddox TM, Messerli FH. Coronavirus disease 2019 (COVID-19) infection and renin angiotensin system blockers. JAMA Cardiol. 2020;5:745–7.

    Article  PubMed  Google Scholar 

  24. Bai F, Pang XF, Zhang LH, Wang NP, McKallip RJ, Garner RE, et al. Angiotensin II AT1 receptor alters ACE2 activity, eNOS expression and CD44-hyaluronan interaction in rats with hypertension and myocardial fibrosis. Life Sci. 2016;153:141–52.

    CAS  Article  Google Scholar 

  25. Wu L, Chen Q, Liu K, Wang J, Han P, Zhang Y, et al. Broad host range of SARS-CoV-2 and the molecular basis for SARS-CoV-2 binding to cat ACE2. Cell Discov. 2020;61. 2020;6:1–12.

  26. Luan J, Lu Y, Jin X, Zhang L. Spike protein recognition of mammalian ACE2 predicts the host range and an optimized ACE2 for SARS-CoV-2 infection. Biochem Biophys Res Commun. 2020;526:165–9.

    CAS  Article  Google Scholar 

  27. Hussain M, Jabeen N, Amanullah A, Baig AA, Aziz B, Shabbir S, et al. Molecular docking between human tmprss2 and sars-cov-2 spike protein: conformation and intermolecular interactions. AIMS Microbiol. 2020;6:350–60.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  28. Brown MA. Genetic determinants of pathogenesis by feline infectious peritonitis virus. Vet Immunol Immunopathol. 2011;143:265–8.

    CAS  Article  Google Scholar 

  29. Lai MM. Recombination in large RNA viruses: coronaviruses. Semin Virol. 1996;7:381–8.

    CAS  Article  Google Scholar 

  30. Li F. Structure, function, and evolution of coronavirus spike proteins. Annu Rev Virol. 2016;3:237–61.

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  31. Lole KS, Bollinger RC, Paranjape RS, Gadkari D, Kulkarni SS, Novak NG, et al. Full-length human immunodeficiency virus type 1 genomes from subtype C-infected Seroconverters in India, with evidence of Intersubtype recombination. J Virol. 1999;73:152–60 Accessed 22 Sep 2021.

    CAS  Article  Google Scholar 

Download references


We thank Dr. Ruby Perry, Dean, Tuskegee University College of Veterinary Medicine, for support to veterinary students’ summer research, through which this project was initiated.

Authors’ information (optional)

None Declared.


This study was supported through grants from Health and Human Services HHS/HRSA #D34HP00001, and National Institutes of Health grants (#U54CA118623, SRE #T35OD010432, TU-CBR/RCMI #U54MD007585 shared resources core facility).

Author information

Authors and Affiliations



TS and PM: conceptualization, investigation, methodology, review and editing, data curation, validation, supervision. KB: data curation, validation, comparative analysis, methodology, writing manuscript draft, review and editing. SS and CW: data curation and organization, methodology, comparative analysis. GR, WA and RF: scientific and conceptual input, review and editing. The author(s) read and approved the final manuscript.

Corresponding author

Correspondence to Temesgen Samuel.

Ethics declarations

Ethics approval and consent to participate

Not Applicable.

Consent for publication

Not Applicable.

Competing interests

Authors declare no competing personal relationships or financial interests that could have appeared to influence the findings reported in this paper.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Bentum, K., Shaddox, S., Ware, C. et al. Molecular phylogeny of coronaviruses and host receptors among domestic and close-contact animals reveals subgenome-level conservation, crossover, and divergence. BMC Vet Res 18, 124 (2022).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI:


  • Coronavirus
  • Host receptors
  • Comparative phylogeny
  • Furin cleavage
  • Spike protein