Complete genome sequence of human astrovirus genotype 6
© Guo et al; licensee BioMed Central Ltd. 2010
Received: 5 November 2009
Accepted: 8 February 2010
Published: 8 February 2010
Human astroviruses (HAstVs) are one of the important causes of acute gastroenteritis in children. Currently, eight HAstV genotypes have been identified and all but two (HAstV-6 and HAstV-7) have been fully sequenced. We here sequenced and analyzed the complete genome of a HAstV-6 strain (192-BJ07), which was identified in Beijing, China.
The genome of 192-BJ07 consists of 6745 nucleotides. The 192-BJ07 strain displays a 77.2-78.0% nucleotide sequence identity with other HAstV genotypes and exhibits amino acid sequence identities of 86.5-87.4%, 94.2-95.1%, and 65.5-74.8% in the ORF1a, ORF1b, and ORF2 regions, respectively. Homological analysis of ORF2 shows that 192-BJ07 is 96.3% identical to the documented HAstV-6 strain. Further, phylogenetic analysis indicates that different genomic regions are likely undergoing different evolutionary and selective pressures. No recombination event was observed in HAstV-6 in this study.
The completely sequenced and characterized genome of HAstV-6 (192-BJ07) provides further insight into the genetics of astroviruses and aids in the surveillance and control of HAstV gastroenteritis.
Human astroviruses (HAstVs) are one of the most common causes of acute gastroenteritis in children worldwide [1–3]. HAstV was first identified during an outbreak of gastroenteritis among hospitalized infants in 1975 . Its name is derived from its distinctive star-shaped appearance under the electron microscopy (EM). Molecular analyses indicate that HAstVs are non-enveloped viruses with a 6-8 kb single-stranded, positive-sense RNA genome consisting of three overlapping open reading frames (ORFs)--ORF1a, ORF1b and ORF2--as well as the 5'- and 3' nontranslated regions (NTRs) . ORF 1a encodes a serine protease; ORF 1b encodes an RNA dependent polymerase; and ORF 2 encodes a capsid precursor protein .
HAstVs have been grouped into eight known serotypes (HAstV-1 through HAstV-8) based on their reactivity to polyclonal antibodies and on analysis by immunofluorescence assays, neutralization assays, and immunoelectron microscopy (IEM) [5–7]. Phylogenetic analyses of the HAstV nucleotide sequence have defined eight genotypes, and further studies have indicated a strong correlation between the genotypes and serotypes . As such, genotypes are frequently applied to type HAstVs.
Genomic characterization studies are important to the understanding of the origin, molecular evolution, and phylogenetic relationships among HAstV genotypes. The full-length genome sequence for a HAstV (HAstV-2) was first determined in 1993 . Subsequently, the complete genomic sequences of five more genotypes (HAstV-1, HAstV-3, HAstV-4, HAstV-5, and HAstV-8) were reported [9–12]. Because the dominant, disease-causing HAstV type and strain often fluctuate with time and geographic location, it is critical that we characterize the complete genomic sequences of all known genotypes in order to better control and prevent future epidemics . Limited sequence information for HAstV genotype 6 is available. Only a partial genome sequence has been reported [14, 15], even though this genotype has been identified as one cause of sporadic or large scale outbreaks of acute gastroenteritis worldwide [16, 17].
In 2007, we identified a case of HAstV-6 infection in Beijing, China, suggesting that this strain might be more epidemiologically relevant than previously recognized . Here we sequenced and analyzed the complete genomic sequence of this HAstV-6 192-BJ07 strain, and describe its genetic characteristics by comparing its sequence with other known HAstV genotypes. The characterization of HAstV-6 by whole genome sequencing provides critical insight into the genetics of this virus as well as valuable information for the control and prevention of HAstV-induced gastroenteritis.
The sequence of the HAstV-6 192-BJ07 strain displayed similarity to those of other known HAstV genotypes. ORF1a of the HAstV-6 192-BJ07 strain shared 79.0%-79.9% nucleotide identity and 86.5-87.4% amino acid identity with those of genotypes 1 through 5 and with genotype 8. Two mutation sites were found at amino acids 757 and 758 in 192-BJ07 ORF1a, which result in the insertion of Arg and Lys.
Sequence identity between HAstV-6 (192-BJ07) and other HAstV genotypes
(Genbank accession numbers)
Pairwise comparisons of the nucleotide and amino acid sequences of the ORF2 region showed that 192-BJ07 shares relatively low identity with other known HAstV genotypes in this region (62.4-72.6% nucleotide identity, and 65.5-74.8% amino acid identity; Table 1). However, 192-BJ07 exhibited high identity--96.3% nucleotide identity and 95.9% amino acid identity--with the documented sequence of the HAstV-6 strain (GenBank accession number Z46658 and Table 1). Structural predictions of ORF2 indicated that there are three highly conserved amino acid residues that can be cleaved to yield proteins with different sizes: Lys 71 for a 79-kDa protein, Arg 361 for VP29 and Arg 395 for VP26 .
Non-coding region analysis
In contrast, we found that the nucleotide identities of the 3'-NTR sequences are as high as 92.6-98.8% compared with other known HAstV genotypes. However, the sequence variability within this region also results in secondary structure disparities of the 3'-NTR between HAstV genotypes (data not shown).
It has been recognized that HAstV RNA has a cis-acting element [ribosomal frameshifting heptamer sequence (AAAAAAC)] followed by a stem-loop structure in the ORF1a/1b junction region . The 192-BJ07 strain also has such a shifty heptamer sequence and a similar stem-loop structure based on analysis with RNAstructure 4.5 software. This conservation may reflect the importance of such structures for translational regulation .
The ORF1b/ORF2 junction has been regarded as a regulatory element of the sub-genomic RNA (sgRNA) . The alignment analysis of 52 nt at the ORF1b/ORF2 junction revealed that 192-BJ07 has a very high identity (98.4-100%) with other HAstV genotypes, consistent with a previous report .
In this study, we report the whole genome sequence of HAstV-6 based on a strain (192-BJ07) identified in an etiological investigation of viral gastroenteritis in Beijing . The sequence analysis shows that the 192-BJ07 strain has a typical astrovirus genome organization with three ORFs (ORF1a, ORF1b, and ORF2), an 80-85 nt 5'-NTR, and an 80-85 nt 3'-NTR. Phylogenetic and homological analyses of the ORF2 regions indicate that the 192-BJ07 strain genome possesses a 95.9% amino acid identity to the documented HAstV-6 strain (GenBank accession number Z46658), but a <75% amino acid identity to other HAstV genotypes.
Consistent with previous reports of other HAstV genotypes, our results also show the existence of three potential cleavage sites at Lys 71, Arg 361, and Arg 395 in HAstV6 ORF2 [3, 19, 20]. It is thought that the cleavage at Lys 71 leads to the generation of the 79-kDa capsid protein . The 79-kDa capsid protein can be converted into three smaller peptides--VP34, VP29, and VP26--and leads to an enhancement of HAstV infectivity . Our observations support the critical role of these three amino acid residues in HAstV replication and pathogenesis.
In our study, we found two insertional mutations, Arg 757 and Lys 758, in ORF1a. How these hydrophilic amino acids contribute to the characteristic/function of the virus is unknown at present and needs to be addressed in further functional studies.
Our phylogenetic analysis suggests that HAstV-6 may be an ancestor of other HAstV genotypes as shown by the phylogenetic analysis of the whole genome sequence (Fig. 4A). This observation was further supported by the phylogenetic analysis of the ORF1a protein region (Fig. 4B). Moreover, detailed analysis of all genotype ORF1b amino acid sequences indicates that HAstV-6 and HAstV-3 may have functioned as the common ancestor of other HAstV genotypes (Fig. 4C). However, the analysis of HAstVs ORF2 suggests that HAstV-8 and HAstV-4 may have been the common ancestor of other HAstV genotypes (Fig. 4D). Different evolutionary and selective pressures in different HAstV genomic regions may be responsible for this discrepancy of the evolutionary relationships .
The secondary structure predictions indicate that stem-loop structures are not conserved in the 5'- and 3'-NTRs of known HAstV genotype genomes. This difference may be responsible for the possible discrepancy at the replication and/or transcription level among HAstV genotypes. The fact that the 5'-end of the 5'-NTR and the 3'-NTR and the 52 nt region at the ORF1b/ORF2 junction are highly conserved points to their critical role in the interaction with the viral replicative or transcriptive machinery. The variation in the 3'-end of 5'-NTR may influence the efficiency of viral genome replication or transcription, resulting in a difference in replication ability or virulence among different genotypes or strains .
The -1 ribosomal frameshifting is critical for the translation of the astrovirus genome . The -1 ribosomal frameshifting requires two cis-acting signals: a shifty heptamer sequence (AAAAAAC) and a potential stem-loop structure [10, 26]. This study showed that the HAstV-6 192-BJ07 strain also has such cis-acting elements, and further demonstrates the conservation of such elements among HAstV genotypes .
At present, the mechanism of HAstVs' variations is unclear. One study has indicated that recombination may be responsible for HAstVs' variation . However, current studies have not broadly established the role of recombination in HAstV variation [25, 27]. In agreement with most reports, we found no clear evidence of recombination between the 192-BJ07 strain and other HAstV genotypes based on similarity plot analysis. Diversification of the HAstV amino sequences may be attributed to accumulated single nucleotide mutations. This mechanism is similar to the antigen drift in other viruses, such as in influenza viruses [28, 29], which could lead to HAstVs escaping from existing host immunities and could result in the emergence of a new epidemic HAstV strain . Additional studies, such as large scale whole genome sequencing, are needed to address the evolutionary patterns of HAstVs.
We have sequenced and characterized the complete genome of HAstV-6 (192-BJ07). This sequence will provide insight into the genetics of astroviruses, broaden our understanding of their properties, and inform surveillance and control of HAstV gastroenteritis around the world.
A stool sample (termed 192-BJ07) that tested positive for HAstV-6 by RT-PCR was collected from a 2-year old boy who visited the Beijing Children's Hospital with acute diarrhea in 2007 . Viral RNA was extracted from the stool supernatant using Trizol reagent (Invitrogen, Carlsbad, CA) according to the manufacturer's instructions.
The primers ORF2-F (5'-atggctagcaagtctgacaagcagg-3') and ORF2-R (5'-gaagctgtaccctcgatcctactc-3') targeting ORF2 of 192-BJ07 were designed based on the only available HAstV-6 sequence in GenBank (GenBank accession number Z46658). For reverse transcription (RT) reactions, cDNA was generated with the SuperScript™ III RT kit (Invitrogen, Carlsbad, CA) using a random primer (Takara, Dalian, China) as described in the manufacturer's protocol. The PCR reaction was performed as follows: 94°C for 3 minutes, 35 cycles of amplification (94°C for 30 seconds; 50°C for 30 seconds; and 72°C for 3 minutes), and a final 10 minutes extension at 72°C. The PCR products were analyzed by 1.0% agarose gel electrophoresis and stained with ethidium bromide.
Genome amplification and sequencing
Rapid amplification of cDNA end (RACE) reactions were performed to obtain the entire sequence of the viral genome by using the 5'- and 3'-RACE System for Rapid Amplification of cDNA Ends kit (Invitrogen, Carlsbad, CA) according to the manufacturer's protocol. The ORF2 sequence obtained above was used as the starting point for the amplification. PCR-amplified products were cloned into the pMD18-T vector (TaKaRa, Dalian, China) and were introduced into chemically competent E. coli DH5α cells. The plasmid DNA was sequenced using an ABI3730 DNA Analyzer (Applied Biosystems). The complete genome sequence of HAstV-6 has been deposited in GenBank (GenBank Accession number GQ495608).
ORF prediction and RNA structure analysis
ORF1a and ORF2 were predicted for HAstV-6 192-BJ07 using the DNAStar ORF search program. ORF1b was predicted based on the "shifty"' heptanucleotide (AAAAAAC) that occurs in other HAstVs . RNA secondary structures were evaluated using RNAstructure 4.5 software.
The MegAlign programs in the DNAStar software package were used to perform multiple sequence alignments. HAstV phylogenies with 1000 bootstrap replicates were created using the neighbor-joining method and the Kimura two-parameter model with the MEGA software version 4.0 .
SimPlot software version 3.5.1  was used to analyze the relationships among the aligned HAstV genome sequences. The complete genome sequences of 192-BJ07, HAstV-1 (GenBank accession numbers L23513), HAstV-2 (GenBank accession number L13745), HAstV-3 (GenBank accession number AF141381), HAstV-4 (GenBank accession numbers AY720891), HAstV-5 (GenBank accession number DQ028633), and HAstV-8 (GenBank accession number AF260508) were first aligned by using Clustal W of the MEGA 4 program, and then 192-BJ07 was chosen as the query sequence for the similarity analysis. Similarity was calculated in each window of 200 bp using the Kimura two-parameter method.
This work is supported in part by the National Major Science and Technology Research Project for the Control and Prevention of Major Infectious Diseases in China (2009ZX10004-206).
- Chen SY, Chang YC, Lee YS, Chao HC, Tsao KC, Lin TY, Ko TY, Tsai CN, Chiu CH: Molecular epidemiology and clinical manifestations of viral gastroenteritis in hospitalized pediatric patients in Northern Taiwan. J Clin Microbiol 2007, 45: 2054-2057. 10.1128/JCM.01519-06PubMedPubMed CentralView ArticleGoogle Scholar
- Guix S, Caballero S, Villena C, Bartolomé R, Latorre C, Rabella N, Simó M, Bosch A, Pintó RM: Molecular epidemiology of astrovirus infection in Barcelona, Spain. J Clin Microbiol 2002, 40: 133-139. 10.1128/JCM.40.1.133-139.2002PubMedPubMed CentralView ArticleGoogle Scholar
- Liu MQ, Yang BF, Peng JS, Zhou DJ, Tang L, Wang B, Liu Y, Sun SH, Ho WZ: Molecular epidemiology of astrovirus infection in infants in Wuhan, China. J Clin Microbiol 2007, 45: 1308-1309. 10.1128/JCM.00010-07PubMedPubMed CentralView ArticleGoogle Scholar
- Madeley CR, Cosgrove BP: Letter: Viruses in infantile gastroenteritis. Lancet 1975, 2: 124. 10.1016/S0140-6736(75)90020-3PubMedView ArticleGoogle Scholar
- Mendez E, Arias CF: Astroviruses. In Fields Virology. Volume 1. 5th edition. Edited by: Knipe DM, Howley PM. Philadelphia: Lippincott Williams & Wilkins; 2007:981-1000.Google Scholar
- Kurtz JB, Lee TW: Human astrovirus serotypes. Lancet 1984, 2: 1405. 10.1016/S0140-6736(84)92101-9PubMedView ArticleGoogle Scholar
- Koopmans MP, Bijen MH, Monroe SS, Vinjé J: Age-stratified seroprevalence of neutralizing antibodies to astrovirus types 1 to 7 in humans in The Netherlands. Clin Diagn Lab Immunol 1998, 5: 33-37.PubMedPubMed CentralGoogle Scholar
- Noel JS, Lee TW, Kurtz JB, Glass RI, Monroe SS: Typing of human astroviruses from clinical isolates by enzyme immunoassay and nucleotide sequencing. J Clin Microbiol 1995, 33: 797-801.PubMedPubMed CentralGoogle Scholar
- Jiang B, Monroe SS, Koonin EV, Stine SE, Glass RI: RNA sequence of astrovirus: distinctive genomic organization and a putative retrovirus-like ribosomal frameshifting signal that directs the viral replicase synthesis. Proc Natl Acad Sci USA 1993, 90: 10539-10543. 10.1073/pnas.90.22.10539PubMedPubMed CentralView ArticleGoogle Scholar
- Lewis TL, Greenberg HB, Herrmann JE, Smith LS, Matsui SM: Analysis of astrovirus serotype 1 RNA, identification of the viral RNA-dependent RNA polymerase motif, and expression of a viral structural protein. J Virol 1994, 68: 77-83.PubMedPubMed CentralGoogle Scholar
- Oh D, Schreier E: Molecular characterization of human astroviruses in Germany. Arch Virol 2001, 146: 443-455. 10.1007/s007050170154PubMedView ArticleGoogle Scholar
- Silva PA, Cardoso DD, Schreier E: Molecular characterization of human astroviruses isolated in Brazil, including the complete sequences of astrovirus genotypes 4 and 5. Arch Virol 2006, 151: 1405-1417. 10.1007/s00705-005-0704-9PubMedView ArticleGoogle Scholar
- Glass RI, Noel J, Mitchell D, Herrmann JE, Blacklow NR, Pickering LK, Dennehy P, Ruiz-Palacios G, de Guerrero ML, Monroe SS: The changing epidemiology of astrovirus-associated gastroenteritis: a review. Arch Virol Suppl 1996, 12: 287-300.PubMedView ArticleGoogle Scholar
- Lee TW, Kurtz JB: Prevalence of human astrovirus serotypes in the Oxford region 1976-92, with evidence for two new serotypes. Epidemiol Infect 1994, 112: 187-193. 10.1017/S0950268800057551PubMedPubMed CentralView ArticleGoogle Scholar
- Sakon N, Yamazaki K, Utagawa E, Okuno Y, Oishi I: Genomic characterization of human astrovirus type 6 Katano virus and the establishment of a rapid and effective reverse transcription-polymerase chain reaction to detect all serotypes of human astrovirus. J Med Virol 2000, 61: 125-131. 10.1002/(SICI)1096-9071(200005)61:1<125::AID-JMV20>3.0.CO;2-BPubMedView ArticleGoogle Scholar
- Gabbay YB, Linhares AC, Cavalcante-Pepino EL, Nakamura LS, Oliveira DS, da Silva LD, Mascarenhas JD, Oliveira CS, Monteiro TA, Leite JP: Prevalence of human astrovirus genotypes associated with acute gastroenteritis among children in Belém, Brazil. J Med Virol 2007, 79: 530-538. 10.1002/jmv.20813PubMedView ArticleGoogle Scholar
- Oishi I, Yamazaki K, Kimoto T, Minekawa Y, Utagawa E, Yamazaki S, Inouye S, Grohmann GS, Monroe SS, Stine SE: A large outbreak of acute gastroenteritis associated with astrovirus among students and teachers in Osaka, Japan. J Infect Dis 1994, 170: 439-443.PubMedView ArticleGoogle Scholar
- Guo L, Xu X, Song J, Wang W, Wang J, Hung T: Molecular characterization of astrovirus infection in children with diarrhea in Beijing, 2005-2007. J Med Virol 2010, 82: 415-423. 10.1002/jmv.21729PubMedView ArticleGoogle Scholar
- Bass DM, Qiu S: Proteolytic processing of the astrovirus capsid. J Virol 2000, 74: 1810-1814. 10.1128/JVI.74.4.1810-1814.2000PubMedPubMed CentralView ArticleGoogle Scholar
- Méndez-Toss M, Romero-Guido P, Munguía ME, Méndez E, Arias CF: Molecular analysis of a serotype 8 human astrovirus genome. J Gen Virol 2000, 81: 2891-2897.PubMedView ArticleGoogle Scholar
- Willcocks MM, Kurtz JB, Lee TW, Carter MJ: Prevalence of human astrovirus serotype 4: capsid protein sequence and comparison with other strains. Epidemiol Infect 1995, 114: 385-391. 10.1017/S0950268800058015PubMedPubMed CentralView ArticleGoogle Scholar
- Lewis TL, Matsui SM: Astrovirus ribosomal frameshifting in an infection-transfection transient expression system. J Virol 1996, 70: 2869-2875.PubMedPubMed CentralGoogle Scholar
- Finkbeiner SR, Kirkwood CD, Wang D: Complete genome sequence of a highly divergent astrovirus isolated from a child with acutediarrhea. Virol J 2008, 5: 117.PubMedPubMed CentralView ArticleGoogle Scholar
- Walter JE, Briggs J, Guerrero ML, Matson DO, Pickering LK, Ruiz-Palacios G, Berke T, Mitchell DK: Molecular characterization of a novel recombinant strain of human astrovirus associated with gastroenteritis in children. Arch Virol 2001, 146: 2357-2367. 10.1007/s007050170008PubMedView ArticleGoogle Scholar
- Lukashov VV, Goudsmit J: Evolutionary relationships among Astroviridae. J Gen Virol 2002, 83: 1397-1405.PubMedView ArticleGoogle Scholar
- Willcocks MM, Brown TD, Madeley CR, Carter MJ: The complete sequence of a human astrovirus. J Gen Virol 1994, 75: 1785-1788. 10.1099/0022-1317-75-7-1785PubMedView ArticleGoogle Scholar
- Ulloa JC, Matiz A, Lareo L, Gutierrez MF: Molecular analysis of a 348 base-pair segment of open reading frame 2 of human astrovirus. A characterization of Colombian isolates. In Silico Biol 2005, 5: 537-546.PubMedGoogle Scholar
- Blackburne BP, Hay AJ, Goldstein RA: Changing selective pressure during antigenic changes in human influenza H3. PLoS Pathog 2008, 4: e1000058. 10.1371/journal.ppat.1000058PubMedPubMed CentralView ArticleGoogle Scholar
- Shen J, Ma J, Wang Q: Evolutionary trends of A(H1N1) influenza virus hemagglutinin since 1918. PLoS One 2009, 4: e7789. 10.1371/journal.pone.0007789PubMedPubMed CentralView ArticleGoogle Scholar
- Tu ET, Bull RA, Greening GE, Hewitt J, Lyon MJ, Marshall JA, McIver CJ, Rawlinson WD, White PA: Epidemics of gastroenteritis during 2006 were associated with the spread of norovirus GII.4 variants 2006a and 2006b. Clin Infect Dis 2008, 46: 413-420. 10.1086/525259PubMedView ArticleGoogle Scholar
- Tamura K, Dudley J, Nei M, Kumar S: MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0. Mol Biol Evol 2007, 24: 1596-1599. 10.1093/molbev/msm092PubMedView ArticleGoogle Scholar
- Lole KS, Bollinger RC, Paranjape RS, Gadkari D, Kulkarni SS, Novak NG, Ingersoll R, Sheppard HW, Ray SC: Full-length human immunodeficiency virus type 1 genomes from subtype C-infected seroconverters in India, with evidence of intersubtype recombination. J Virol 1999, 73: 152-160.PubMedPubMed CentralGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.