Human coronavirus 229E encodes a single ORF4 protein between the spike and the envelope genes
© Dijkman et al; licensee BioMed Central Ltd. 2006
Received: 08 December 2006
Accepted: 28 December 2006
Published: 28 December 2006
The genome of coronaviruses contains structural and non-structural genes, including several so-called accessory genes. All group 1b coronaviruses encode a single accessory protein between the spike and envelope genes, except for human coronavirus (HCoV) 229E. The prototype virus has a split gene, encoding the putative ORF4a and ORF4b proteins. To determine whether primary HCoV-229E isolates exhibit this unusual genome organization, we analyzed the ORF4a/b region of five current clinical isolates from The Netherlands and three early isolates collected at the Common Cold Unit (CCU) in Salisbury, UK.
All Dutch isolates were identical in the ORF4a/b region at the amino acid level. The CCU isolates were only 98% identical to the Dutch isolates at the nucleotide level, but more closely related to the prototype HCoV-229E (>98%). Remarkably, our analyses revealed that the laboratory-adapted prototype HCoV-229E has a 2-nucleotide deletion in the ORF4a/b region, whereas all clinical isolates carry a single ORF, 660 nt in size, encoding a single protein of 219 amino acids, which is a homologue of the ORF3 proteins encoded by HCoV-NL63 and PEDV.
Thus, the genome organization of the group 1b coronaviruses HCoV-NL63, PEDV and HCoV-229E is identical. It is possible that extensive culturing of the HCoV-229E laboratory strain resulted in truncation of ORF4. This may indicate that the protein is not essential in cell culture, but the highly conserved amino acid sequence of the ORF4 protein among clinical isolates suggests that the protein plays an important role in vivo.
Coronaviruses (CoVs) are enveloped, plus-strand RNA viruses belonging to the family Coronaviridae. The genomic RNA is 27 – 32 kb in size, capped and polyadenylated. The virions are 80 – 150 nm in diameter and have a unique morphology, with extended, petal-shaped spikes that give the virus a crown-like appearance (Latin: corona) under the electron microscope. CoVs are classified into three groups based on phylogenetic and serological relationships. Groups 1 and 2 consist of different mammalian coronaviruses, whereas bird viruses dominate group 3. All coronaviruses employ a common genome organization in which the replicase gene encompasses the 5' two-thirds of the genome and comprises two overlapping open reading frames (ORFs), ORF1a and ORF1b. The structural gene region, which covers the 3' third of the genome, encodes the canonical set of structural protein genes in the order 5'-spike (S) – envelope (E) – membrane (M) – nucleocapsid (N)-3'. Expression of the replicase gene is mediated by translation of the genomic RNA, which gives rise to two large polyproteins, pp1a (encoded by ORF1a) and pp1ab (encoded by ORF1a and ORF1b via a ribosomal frameshift at the ORF1a/1b junction). Expression of the structural gene region is mediated via discontinuous transcription of subgenomic (sg) mRNAs, a hallmark of coronavirus gene expression. The number of sg mRNAs produced by a particular coronavirus usually exceeds the number of encoded structural proteins and, consequently, coronaviruses are able to express additional, so-called accessory genes (formerly called group-specific genes). These genes are interspersed between the structural genes, and their number and location vary among coronavirus genomes. The functions of coronavirus accessory proteins are largely unknown; however, reverse genetic analyses of Mouse Hepatitis Virus (MHV) and Feline Infectious Peritonitis Virus (FIPV) suggest that they are not required for virus replication [2–4].
Moreover, deletion of MHV and FIPV accessory genes results in attenuation in their respective hosts, indicating that accessory genes represent pathogenicity factors [2–4].
HCoV-229E contains two ORFs, ORF4a and ORF4b, between the S and E genes (Figure 1). Since both genes share the same sg mRNA (i.e. sg mRNA4), the expression of gene 4b would require alternative mechanisms of translation, such as internal ribosome entry, leaky scanning, or translational reinitiation. However, comparison of the hydrophobic domains of both ORF4 parts with the single ORF3 homologs indicates that they encode a similar protein, suggesting a scenario in which HCoV-229E acquired an out-of-frame insertion or deletion. It should be noted that the full-length genomic sequences of the group 1b coronaviruses PEDV and HCoV-NL63 are derived from clinical isolates: CV777 for PEDV, and Amsterdam-1, Amsterdam-057 and Amsterdam-496 for HCoV-NL63. In contrast, the HCoV-229E ORF4a/b sequence and the HCoV-229E full-length genomic sequence were determined from a cell culture-adapted virus more than 30 years after the initial isolation of HCoV-229E by Hamre and Procknow. We therefore hypothesized that HCoV-229E ORFs 4a/b might originally have been a single ORF that was truncated upon adaptation of HCoV-229E to cell culture.
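The out-of-frame deletion scenario can be illustrated with a small sketch. The sequence below is a made-up toy ORF, not the HCoV-229E ORF4 sequence, and the helper names `first_orf_length` and `delete` are hypothetical. The sketch only shows the general principle: removing a number of nucleotides that is not a multiple of three shifts the reading frame and can expose a premature stop codon, whereas an in-frame deletion merely shortens the product.

```python
# Toy illustration only: the sequence is invented and is NOT HCoV-229E ORF4.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def first_orf_length(seq: str) -> int:
    """Count codons read from position 0 up to (not including) the first
    in-frame stop codon, or up to the end of the sequence if none is found."""
    codons = 0
    for i in range(0, len(seq) - 2, 3):
        if seq[i:i + 3] in STOP_CODONS:
            return codons
        codons += 1
    return codons


def delete(seq: str, start: int, length: int) -> str:
    """Return seq with `length` nucleotides removed, beginning at 0-based `start`."""
    return seq[:start] + seq[start + length:]


# Toy ORF: ATG GCA TTG ACT GAA CGT TAC TAA (7 sense codons, then a stop)
intact = "ATGGCATTGACTGAACGTTACTAA"

out_of_frame = delete(intact, 4, 2)  # 2-nt deletion shifts the reading frame
in_frame = delete(intact, 3, 3)      # 3-nt deletion removes exactly one codon

print(first_orf_length(intact))        # full-length toy ORF
print(first_orf_length(out_of_frame))  # truncated by a premature stop codon
print(first_orf_length(in_frame))      # one codon shorter, frame preserved
```

In this toy case the 2-nt deletion brings a TGA into frame after three codons, while the downstream sequence remains present but is no longer read in its original frame.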
Analysis of HCoV-229E cell culture adapted virus
Analysis of current clinical HCoV-229E isolates
Table 1: Clinical symptoms of HCoV-229E infected patients. Columns include Year of Sampling; recorded symptoms include Common Cold, Acute tonsillitis, and Exacerbation of COPD.
Analysis of early HCoV-229E isolates
Hypothetically, an ORF4b protein could be translated via alternative translation mechanisms, as described for some other coronavirus proteins [15, 16], but those mechanisms have not been described for this region, nor has any evidence for the expression of an HCoV-229E ORF4b protein ever been reported. In addition, our results show that a large fragment, corresponding to the ORF4b region of the prototype virus, is deleted in the cell culture-adapted HC-LP virus. As mentioned previously, it has been shown for MHV and FIPV that accessory genes are dispensable for virus growth in cell culture. Moreover, in these cases the deletion of accessory genes resulted in viruses that are attenuated in vivo. Similarly, attenuation of in vivo viral infectivity and pathogenicity has been linked to ORF3 truncation upon in vitro culturing of other group 1b coronaviruses. For a virulent PEDV strain this occurred after 40 passages, and more severe truncation and attenuation were observed after 60 or more passages. Similar results have been reported for TGEV after at least 35 passages. Unfortunately, no detailed information is available about the in vitro passaging of the cell culture-adapted HCoV-229E strains. It is tempting to speculate that ORF4 of HCoV-229E, like ORF3 of PEDV, is vital for efficient in vivo replication. The fact that VR-740™ contains a truncated ORF4 may explain why this virus replicates in vitro in murine cells expressing the HCoV-229E receptor (human CD13), but not in vivo in human CD13 transgenic animals [17, 18]. It would be of interest to investigate whether an HCoV-229E strain with a more severely truncated or a non-truncated ORF4 gene can replicate in these mice.
Along with the deletions, we also observed several non-silent nucleotide differences between the cell culture-adapted viruses and the clinical isolates. In our Dutch isolates, ORF4 is highly conserved at the protein level. The CCU T935 isolate, collected in 1986 at the CCU, Salisbury, is a clinical isolate with high ORF4 similarity to the cell culture-adapted viruses. Since we cannot reconstruct the experimental setting of clinical trial T935 at the CCU in 1986, we cannot exclude the possibility that the CCU T935 sample was obtained from a volunteer inoculated with an HCoV-229E laboratory strain. This strain might even have the same origin as the cell culture-adapted viruses. In any case, the CCU T935 sample is derived from an "in vivo" infection, be it experimental or natural, and this further supports a relevant in vivo function of the full-length HCoV-229E ORF4 protein. We believe that the divergence between the current Dutch isolates and the early CCU T935 strain most likely represents genetic drift over 20 – 30 years of evolution. Molecular clock analysis with the average mutation rate of coronaviruses [20, 21] supports this idea (data not shown). Given the long period of evolution, the differences between the CCU T935 and Dutch isolates are remarkably small. For HCoV-NL63 we also observed a highly conserved ORF3 among different clinical isolates, and although limited sequence data are available for PEDV, Song et al. found only one nucleotide difference in ORF3 between two PEDV field isolates.
Recently, Tang et al. reported on novel bat coronaviruses (Bt-CoVs), several of which cluster with group 1b coronaviruses. They determined the full-length genomic sequence of one of these wild-type Bt-CoVs (Bt-CoV/512/2005). The genome organization of this Bt-CoV strain is similar to that of the other group 1b members, with the exception of one putative gene at the 3' end of the genome. However, only one accessory protein, encoded by ORF3, is identified between the structural genes S and E. The ORF3 protein of Bt-CoV/512/2005 is homologous to the ORF3 proteins of PEDV and HCoV-NL63 and to the ORF4 protein from our clinical HCoV-229E isolates. These data show that all currently sequenced group 1b coronaviruses contain one homologous accessory gene between the S and E genes.
We report the first sequences of the ORF4a/b region of clinical HCoV-229E isolates. The experimental data strongly support the hypothesis that a formerly single ORF4 was split upon adaptation of HCoV-229E to cell culture. We observed two different types of deletion, of 2 or 118 nucleotides, in the ORF4 gene only in cell culture-adapted viruses, whereas all clinical isolates, including CCU T935, carried a single ORF4 gene. Both types of nucleotide deletion within the ORF4a/b region of cell culture-adapted HCoV-229E viruses create a frameshift that introduces an early termination codon, which either splits ORF4 into ORF4a and ORF4b or results in a truncated ORF4(a) fragment (HC-LP). Most likely, the two types of deletion occurred independently and are not site specific. Thus, the genome organization of the group 1b coronaviruses (HCoV-NL63, PEDV, Bt-CoV and HCoV-229E) is identical. The amino acid sequence of the HCoV-229E ORF4 protein is highly conserved among clinical isolates, suggesting that the protein plays an important role during in vivo infection.
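The numbers quoted in this report can be checked with simple codon arithmetic. The following minimal sketch assumes that the 660 nt reported for the intact ORF includes the stop codon, which is consistent with the stated 219 amino acids; the function names are hypothetical.

```python
# Values taken from the text; everything else is an illustrative assumption.
ORF_LENGTH_NT = 660       # single ORF4 in clinical isolates
DELETIONS_NT = (2, 118)   # deletion sizes seen in cell culture-adapted viruses


def encoded_residues(orf_length_nt: int) -> int:
    """Amino acids encoded by an ORF whose length includes the stop codon."""
    if orf_length_nt % 3 != 0:
        raise ValueError("ORF length must be a multiple of three")
    return orf_length_nt // 3 - 1  # subtract the stop codon


def shifts_frame(deletion_nt: int) -> bool:
    """A deletion shifts the reading frame unless its size is a multiple of three."""
    return deletion_nt % 3 != 0


print(encoded_residues(ORF_LENGTH_NT))
for size in DELETIONS_NT:
    print(size, size % 3, shifts_frame(size))
```

Neither 2 nor 118 is divisible by three, so both deletions are expected to shift the reading frame, in line with the premature termination codons described above.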
Collection of patient material
Patient materials were collected at the Department of Medical Microbiology, Academic Medical Center (AMC), The Netherlands (VS03-099) and at the Laboratory for Infectious Diseases and Screening, National Institute of Public Health and the Environment (RIVM), Bilthoven, The Netherlands (RIVM02-034, RIVM02-041, RIVM03-224 and RIVM04-037) (Table 1). One sample was collected in 1986 at the Common Cold Unit (CCU), Salisbury, Great Britain, during Trial no. T935.
Viral RNA isolation
Total viral RNA was isolated from either 200 μl cell culture supernatant, 100 – 200 μl nose/throat swab (RIVM) or nasopharyngeal aspirate (AMC), as previously described.
Reverse transcription and PCR reactions were performed as described [22, 25]. Amplification of the ORF4a/b region was performed with the primer combination 5'-229E-ORF4ab (5'-AAC TTC CTT ATT ACG ACG TT-3') and 3'-229E-ORF4ab (5'-ATC CAC TAG CTT AAG GAA CA-3'). If required, a semi-nested PCR was performed with the primers 5'-229E-ORF4abNested (5'-CAT ACA GTA ATG GCT CTA GG-3') and 3'-229E-ORF4ab, and the cycle profile of the first PCR was modified to 30 cycles.
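As a quick sanity check on the amplification primers, the short sketch below reports the length, GC count and reverse complement of each oligonucleotide. The primer sequences are copied from the text with spaces removed; the helper functions are hypothetical and the script is not part of the published protocol. The reverse complement of the 3' primer is the sequence that would be read on the plus strand of the template.

```python
# Primer sequences as given in the text (5' to 3', spaces removed).
PRIMERS = {
    "5'-229E-ORF4ab": "AACTTCCTTATTACGACGTT",
    "3'-229E-ORF4ab": "ATCCACTAGCTTAAGGAACA",
    "5'-229E-ORF4abNested": "CATACAGTAATGGCTCTAGG",
}

# Translation table mapping each base to its complement.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an unambiguous DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def gc_count(seq: str) -> int:
    """Number of G or C bases in the sequence."""
    return sum(base in "GC" for base in seq)


for name, seq in PRIMERS.items():
    print(f"{name}: {len(seq)} nt, GC {gc_count(seq)}/{len(seq)}, "
          f"reverse complement {reverse_complement(seq)}")
```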
Sequence analysis of ORF4a/b region
RT-PCR fragments were directly sequenced with primers 3'-229E-ORF4ab, 5'-229E-ORF4ab, 5'-229E-ORF4abNested, 5'229E4int (5'-GCA ACT TTG ATT GCT G-3') and 3'229E4int (5'-GTC CTC TAA GAG CAA C-3'). The sequencing reaction was performed without purification steps, according to the BigDye® terminator V1.1 cycle sequencing manufacturer's protocol, on a GeneAmp® PCR System 9700 thermal cycler (Perkin Elmer). Electrophoresis and data collection were performed on a 3100 Genetic Analyzer (Applied Biosystems). Raw data were processed and analyzed with CodonCode Aligner v1.52 software (CodonCode Corporation).
Deduced protein sequences
Deduced protein sequences encoded by ORF4a and ORF4b from the published HCoV-229E sequence (ORF4a; NP_073552, ORF4b; NP_073553), the sequence of VR-740™, determined in this study, and the sequences of the 5 current and the 3 early isolates were aligned with ClustalX v1.8, and manually adjusted with Bioedit v7.0.1.
Phylogenetic analysis of the ORF4a/b region
The sequences of the ORF4a/b regions were aligned with ClustalX v1.8, and phylogenetic analysis was conducted with the neighbor-joining method, Kimura distances and 1000 bootstrap replicates, using MEGA version 3.1.
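For readers unfamiliar with the distance measure, the sketch below computes a Kimura two-parameter distance between two aligned sequences, d = -1/2 ln[(1 - 2P - Q) sqrt(1 - 2Q)], where P and Q are the proportions of transitions and transversions. It assumes that "Kimura distances" refers to the two-parameter model and that alignment columns containing gaps or ambiguous bases are skipped; MEGA may treat such columns differently. The sequences are toy examples, not the isolates analyzed here.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def kimura_2p(seq_a: str, seq_b: str) -> float:
    """Kimura two-parameter distance between two aligned DNA sequences.

    Columns in which either sequence has a gap or ambiguous base are ignored
    (an assumption of this sketch).
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    transitions = transversions = compared = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a not in "ACGT" or b not in "ACGT":
            continue  # skip gaps and ambiguity codes
        compared += 1
        if a == b:
            continue
        if {a, b} <= PURINES or {a, b} <= PYRIMIDINES:
            transitions += 1    # purine<->purine or pyrimidine<->pyrimidine
        else:
            transversions += 1  # purine<->pyrimidine
    if compared == 0:
        raise ValueError("no comparable positions")
    p = transitions / compared
    q = transversions / compared
    return -0.5 * math.log((1 - 2 * p - q) * math.sqrt(1 - 2 * q))


# Toy alignment with a single C<->T transition in ten positions.
toy_a = "ACGTACGTAC"
toy_b = "ACGTACGTAT"
print(kimura_2p(toy_a, toy_b))
```

The corrected distance is slightly larger than the raw proportion of differing sites because the model accounts for multiple substitutions at the same position.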
The sequences reported in this paper have been deposited in the GenBank database under accession numbers EF198671–EF198679.
We thank Stuart G. Siddell from the University of Bristol, United Kingdom, for the kind gift of the HC-LP and HC-Killick samples. The RIVM isolates were collected by the general practitioners from the Continuous Morbidity Registration of the Netherlands Institute for Health Services Research (NIVEL) within the framework of national surveillance of acute respiratory tract infections. R.D. and L.v.d.H. are supported by VIDI grant 016.066.318 from the Netherlands Organization for Scientific Research (NWO). V.T. received support from the European Commission (SARS-DTV SP22-CT-2004-511064).
- Holmes KV, Lai MMC: Coronaviridae: The viruses and their replication. In Fields Virology. Third edition. Edited by: Fields BN, Knipe DM, Howley PM, et al. Philadelphia: Lippincott-Raven Publishers; 1996:1075-1093.
- de Haan CA, Masters PS, Shen X, Weiss S, Rottier PJ: The group-specific murine coronavirus genes are not essential, but their deletion, by reverse genetics, is attenuating in the natural host. Virol 2002, 296: 177-189. 10.1006/viro.2002.1412
- Herrewegh AA, Vennema H, Horzinek MC, Rottier PJ, de Groot RJ: The molecular genetics of feline coronaviruses: comparative sequence analysis of the ORF7a/7b transcription unit of different biotypes. Virol 1995, 212: 622-631. 10.1006/viro.1995.1520
- Haijema BJ, Volders H, Rottier PJ: Live, attenuated coronavirus vaccines through the directed deletion of group-specific genes provide protection against feline infectious peritonitis. J Virol 2004, 78: 3863-3871. 10.1128/JVI.78.8.3863-3871.2004
- Gorbalenya AE, Snijder EJ, Spaan WJ: Severe acute respiratory syndrome coronavirus phylogeny: toward consensus. J Virol 2004, 78: 7863-7866. 10.1128/JVI.78.15.7863-7866.2004
- Thiel V, Herold J, Schelle B, Siddell SG: Infectious RNA transcribed in vitro from a cDNA copy of the human coronavirus genome cloned in vaccinia virus. J Gen Virol 2001, 82: 1273-1281.
- Song DS, Yang JS, Oh JS, Han JH, Park BK: Differentiation of a Vero cell adapted porcine epidemic diarrhea virus from Korean field strains by restriction fragment length polymorphism analysis of ORF 3. Vaccine 2003, 21: 1833-1842. 10.1016/S0264-410X(03)00027-6
- Woods RD: Efficacy of a transmissible gastroenteritis coronavirus with an altered ORF-3 gene. Can J Vet Res 2001, 65: 28-32.
- Duarte M, Tobler K, Bridgen A, Rasschaert D, Ackermann M, Laude H: Sequence analysis of the porcine epidemic diarrhea virus genome between the nucleocapsid and spike protein genes reveals a polymorphic ORF. Virol 1994, 198: 466-476. 10.1006/viro.1994.1058
- Raabe T, Siddell S: Nucleotide sequence of the human coronavirus HCV 229E mRNA 4 and mRNA 5 unique regions. Nucleic Acids Res 1989, 17: 6387. 10.1093/nar/17.15.6387
- Hamre D, Procknow JJ: A new virus isolated from the human respiratory tract. Proc Soc Exp Biol Med 1966, 121: 190-193.
- Bradburne AF, Bynoe ML, Tyrrell DA: Effects of a "new" human respiratory virus in volunteers. Br Med J 1967, 3: 767-769.
- Tyrrell DA, Bynoe ML, Hoorn B: Cultivation of "difficult" viruses from patients with common colds. Br Med J 1968, 1: 606-610.
- Myint S, Harmsen D, Raabe T, Siddell SG: Characterization of a nucleic acid probe for the diagnosis of human coronavirus 229E infections. J Med Virol 1990, 31: 165-172.
- Thiel V, Siddell SG: Internal ribosome entry in the coding region of murine hepatitis virus mRNA 5. J Gen Virol 1994, 75(Pt 11): 3041-3046.
- Liu DX, Inglis SC: Internal entry of ribosomes on a tricistronic mRNA encoded by infectious bronchitis virus. J Virol 1992, 66: 6143-6154.
- Wentworth DE, Tresnan DB, Turner BC, Lerman IR, Bullis B, Hemmila EM, Levis R, Shapiro LH, Holmes KV: Cells of human aminopeptidase N (CD13) transgenic mice are infected by human coronavirus-229E in vitro, but not in vivo. Virol 2005, 335: 185-197. 10.1016/j.virol.2005.02.023
- Lassnig C, Sanchez CM, Egerbacher M, Walter I, Majer S, Kolbe T, Pallares P, Enjuanes L, Muller M: Development of a transgenic mouse model susceptible to human coronavirus 229E. Proc Natl Acad Sci USA 2005, 102: 8275-8280. 10.1073/pnas.0408589102
- Chibo D, Birch C: Analysis of human coronavirus 229E spike and nucleoprotein genes demonstrates genetic drift between chronologically distinct strains. J Gen Virol 2006, 87: 1203-1208. 10.1099/vir.0.81662-0
- Vijgen L, Keyaerts E, Moes E, Thoelen I, Wollants E, Lemey P, Vandamme AM, Van Ranst M: Complete genomic sequence of human coronavirus OC43: molecular clock analysis suggests a relatively recent zoonotic coronavirus transmission event. J Virol 2005, 79: 1595-1604. 10.1128/JVI.79.3.1595-1604.2005
- Sanchez CM, Gebauer F, Sune C, Mendez A, Dopazo J, Enjuanes L: Genetic evolution and tropism of transmissible gastroenteritis coronaviruses. Virol 1992, 190: 92-105. 10.1016/0042-6822(92)91195-Z
- Pyrc K, Dijkman R, Deng L, Jebbink MF, Berkhout B, van der Hoek L: Mosaic structure of human coronavirus NL63, one thousand years of evolution. J Mol Biol 2006, 364: 964-73. 10.1016/j.jmb.2006.09.074
- Tang XC, Zhang JX, Zhang SY, Wang P, Fan XH, Li LF, Li G, Dong BQ, Liu W, Cheung CL, et al.: Prevalence and genetic diversity of coronaviruses in bats from China. J Virol 2006, 80: 7481-7490. 10.1128/JVI.00697-06
- Boom R, Sol CJ, Salimans MM, Jansen CL, Wertheim-van Dillen PM, van der Noordaa J: Rapid and simple method for purification of nucleic acids. J Clin Microbiol 1990, 28: 495-503.
- Pyrc K, Bosch BJ, Berkhout B, Jebbink MF, Dijkman R, Rottier P, van der Hoek L: Inhibition of HCoV-NL63 infection at early stages of the replication cycle. Antim Ag Chemoth 2006, 50: 2000-2008. 10.1128/AAC.01598-05
- Kumar S, Tamura K, Nei M: MEGA: Molecular Evolutionary Genetics Analysis software for microcomputers. Comput Appl Biosci 1994, 10: 189-191.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.