A conserved predicted pseudoknot in the NS2A-encoding sequence of West Nile and Japanese encephalitis flaviviruses suggests NS1' may derive from ribosomal frameshifting
© Firth and Atkins; licensee BioMed Central Ltd. 2009
Received: 21 December 2008
Accepted: 05 February 2009
Published: 05 February 2009
Japanese encephalitis, West Nile, Usutu and Murray Valley encephalitis viruses form a tight subgroup within the larger Flavivirus genus. These viruses utilize a single-polyprotein expression strategy, resulting in ~10 mature proteins. Plotting the conservation at synonymous sites along the polyprotein coding sequence reveals strong conservation peaks at the very 5' end of the coding sequence, and also at the 5' end of the sequence encoding the NS2A protein. Such peaks are generally indicative of functionally important non-coding sequence elements. The second peak corresponds to a predicted stable pseudoknot structure whose biological importance is supported by compensatory mutations that preserve the structure. The pseudoknot is preceded by a conserved slippery heptanucleotide (Y CCU UUU), thus forming a classical stimulatory motif for -1 ribosomal frameshifting. We hypothesize, therefore, that the functional importance of the pseudoknot is to stimulate a portion of ribosomes to shift -1 nt into a short (45 codon), conserved, overlapping open reading frame, termed foo. Since cleavage at the NS1-NS2A boundary is known to require synthesis of NS2A in cis, the resulting transframe fusion protein is predicted to be NS1-NS2AN-term-FOO. We hypothesize that this may explain the origin of the previously identified NS1 'extension' protein in JEV-group flaviviruses, known as NS1'.
The genus Flavivirus (see [1–3] for reviews) includes species such as Dengue virus, Japanese encephalitis virus (JEV), West Nile virus (WNV), Tick-borne encephalitis virus and Yellow fever virus. Also within the family Flaviviridae are Hepatitis C, Hepatitis G and the Pestivirus genus. The Japanese encephalitis group includes JEV, WNV, Murray Valley encephalitis virus (MVEV), Usutu virus and St Louis encephalitis virus (SLEV). These important human pathogens are transmitted by mosquitoes and can cause potentially fatal encephalitis. The single-stranded positive sense genomic RNA is ~11 kb in length and contains a single long open reading frame, translated as a polyprotein that is cleaved by virus-encoded and host proteases to produce ~10 mature proteins.
Inspired by the 'suppression of synonymous site variation' (SSSV) statistic of ref. , we decided to investigate conservation at synonymous sites in the Flaviviridae family. For a given species or group within the family, the polyprotein coding sequences (CDSs) were extracted, translated, aligned with CLUSTALW , and back-translated to nucleotide sequence alignments. Beginning with pairwise sequence comparisons, conservation at synonymous sites (only) was evaluated by comparing the observed number of base substitutions with the number expected under a neutral evolution model. The procedure takes into account whether synonymous site codons are 1-, 2-, 3-, 4- or 6-fold degenerate and the differing probabilities of transitions and transversions (full details are available on request from the authors). Statistics were then summed over a phylogenetic tree as described in , and averaged over a sliding window.
The -1 frame ORF (termed foo, for "Flavivirus Overlapping ORF") comprises 45 codons. Such short out-of-frame ORFs are not well-represented amongst known cases of programmed ribosomal frameshifting, but this may be more a consequence of the difficulty in finding such cases rather than any inherent rarity. Indeed we recently demonstrated the occurrence of -1 frameshifting, at a level of 10–18%, into a short ORF overlapping the 6K CDS in the Alphavirus genus .
The combination of slippery heptanucleotide, 3' pseudoknot, and 45-codon -1 frame ORF is conserved in all five RefSeqs (listed in the caption to Figure 2), and essentially all their GenBank 'genome neighbours'  as of December 2008 (223 sequences). The only exceptions are seven sequences – four with single mispairings in stem 1 of the pseudoknot, one with a shortened stem 2, and two with a truncated -1 frame ORF. At least three of the seven are annotated as attenuated.
The putative shift site is located at codons 8–9 of the NS2A CDS. Thus frameshifting would result in a 52 amino acid NS2AN-FOO fusion peptide (where NS2AN represents the N-terminal nine amino acids of NS2A). Previous work has demonstrated that, at least in Dengue virus, cleavage at the non-standard NS1-NS2A cleavage site requires translation of substantial (≫ 9 amino acids) parts of NS2A [10–12]. Thus NS2AN-FOO is likely not cleaved from NS1, i.e. the predicted mature transframe protein is NS1-NS2AN-FOO.
On protein gels, NS1 typically migrates as a cluster of bands, partly due to differing degrees of glycoslyation which can add ~6 kDa to the NS1 mass [13–15]. NS1 also forms multiple disulfide bonds and migrates with a substantially lower apparent molecular mass under non-reducing conditions . In JEV, MVEV and WNV, however, the existance of an elongated form of NS1, termed NS1', has been demonstrated [13–15, 17–19]. NS1' is N-terminally coincident with NS1  but extends into NS2A, as demonstrated by the presence of an epitope not present in NS1 but present in polyprotein sequence overlapping the carboxy terminus of NS1 [15, 17, 18], and by the necessity of NS2A coding sequence for NS1' expression . Thus, NS1' has been proposed to result from cleavage at an alternative site within NS2A. However, sites proposed by ref.  are too far downstream to account for NS1' , and attempts to localize the cleavage site by determining the carboxy terminal sequence of NS1' have been unsuccessful . Pulse-chase experiments have demonstrated that NS1' is not simply a precursor of NS1 but is, instead, a stable end product [13, 15].
For example, working with JEV, ref.  estimated masses for glycosylated NS1 and NS1' of 48 kDa and 55 kDa, respectively, while for unglycosylated NS1 and NS1' the masses were 42 kDa and 49 kDa. Thus, for JEV, the mass of the C-terminal extension in NS1' is ~7 kDa. Similarly, working with MVEV, ref.  estimated masses of 45 kDa and 53 kDa for glycosylated NS1 and NS1', and 39 kDa and 47 kDa for unglycosylated NS1 and NS1'. Thus, for MVEV, the mass of the C-terminal extension in NS1' is ~8 kDa. Similar results were obtained by ref. .
Consistent with our hypothesis, when ref.  expressed what they supposed to be an approximatation of JEV NS1' from a plasmid containing NS1 and the first 60 amino acids of NS2A (hereafter NS1-NS2A1..60), they appeared to obtain an NS1' doublet – consistent with a mixture of the zero-frame (NS1-NS2A1..60) and the predicted transframe (NS1-NS2AN-FOO) products. No such doublet was observed in controls comprising lysates from JEV-infected cells. Assuming the product that comigrates with wild-type NS1' is NS1-NS2AN-FOO, then the other product corresponds to a fainter, more rapidly migrating band. This is plausible if the high proline content of FOO causes it to migrate more slowly and if the artificial product NS1-NS2A1..60 is more rapidly degraded. The data do not fit a cleavage hypothesis (as then the uncleaved NS1-NS2A1..60 would be expected to migrate more slowly than wild-type NS1') and an impaired glycosylation explanation seems unlikely (since the doublet appears to be present even in a sample treated with endoglycosidase F). No NS1 was produced from the NS1-NS2A1..60 plasmid (consistent with NS1-NS2A cleavage requiring synthesis of NS2A; see above) and no NS1' was produced from a plasmid just expressing NS1 (consistent with NS1' requiring the 5' end of the NS2A CDS).
A corresponding frameshift stimulatory motif was not found in SLEV – the most divergent of the six JEV-group RefSeqs but, interestingly, there was a long (89–165 codons) -1 frame ORF overlapping the boundary between NS1 and NS2A, so it is possible that frameshifting also occurs in SLEV at a non-canonical site, possibly further 5' such as within the NS1 CDS. No evidence for frameshifting was found in Dengue or Kokobera viruses – consistent with the apparent absence of NS1' in these species . In contrast to JEV-group NS1', the elongated NS1 product (NS1-2A*; ) seen in Yellow fever virus apparently results from cleavage much closer to the carboxy terminus of NS2A [22, 23] and, in any case, NS1-2A* appears to be simply a precursor of mature NS1 rather than a stable end product in itself, as demonstrated by pulse-chase analyses .
This work was supported by an award from Science Foundation Ireland and by NIH Grant R01 GM079523, both to JFA.
- Lindenbach BD, Thiel HJ, Rice CM: Flaviviridae: the viruses and their replication. In Fields Virology. 5th edition. Edited by: Knipe DM, Howley PM. Philadelphia: Lippincott-Raven Publishers; 2007:1101-1152.
- Brinton MA: The molecular biology of West Nile Virus: a new invader of the western hemisphere. Annu Rev Microbiol 2002, 56: 371-402. 10.1146/annurev.micro.56.012302.160654View ArticlePubMed
- Mackenzie JS, Gubler DJ, Petersen LR: Emerging flaviviruses: the spread and resurgence of Japanese encephalitis, West Nile and dengue viruses. Nat Med 2004, 10: S98-109. 10.1038/nm1144View ArticlePubMed
- Simmonds P, Karakasiliotis I, Bailey D, Chaudhry Y, Evans DJ, Goodfellow IG: Bioinformatic and functional analysis of RNA secondary structure elements among different genera of human and animal caliciviruses. Nucleic Acids Res 2008, 36: 2530-2546. 10.1093/nar/gkn096PubMed CentralView ArticlePubMed
- Thompson JD, Higgins DG, Gibson TJ: CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res 1994, 22: 4673-4680. 10.1093/nar/22.22.4673PubMed CentralView ArticlePubMed
- Firth AE, Brown CM: Detecting overlapping coding sequences in virus genomes. BMC Bioinformatics 2006, 7: 75. 10.1186/1471-2105-7-75PubMed CentralView ArticlePubMed
- Brierley I, Pennell S: Structure and function of the stimulatory RNAs involved in programmed eukaryotic -1 ribosomal frameshifting. Cold Spring Harb Symp Quant Biol 2001, 66: 233-248. 10.1101/sqb.2001.66.233View ArticlePubMed
- Firth AE, Chung BY, Fleeton MN, Atkins JF: Discovery of frameshifting in Alphavirus 6K resolves a 20-year enigma. Virol J 2008, 5: 108. 10.1186/1743-422X-5-108PubMed CentralView ArticlePubMed
- Bao Y, Federhen S, Leipe D, Pham V, Resenchuk S, Rozanov M, Tatusov R, Tatusova T: National center for biotechnology information viral genomes project. J Virol 2004, 78: 7291-7298. 10.1128/JVI.78.14.7291-7298.2004PubMed CentralView ArticlePubMed
- Falgout B, Chanock R, Lai CJ: Proper processing of dengue virus nonstructural glycoprotein NS1 requires the N-terminal hydrophobic signal sequence and the downstream nonstructural protein NS2a. J Virol 1989, 63: 1852-1860.PubMed CentralPubMed
- Falgout B, Markoff L: Evidence that flavivirus NS1-NS2A cleavage is mediated by a membrane-bound host protease in the endoplasmic reticulum. J Virol 1995, 69: 7232-7243.PubMed CentralPubMed
- Leblois H, Young PR: Maturation of the dengue-2 virus NS1 protein in insect cells: effects of downstream NS2A sequences on baculovirus-expressed gene constructs. J Gen Virol 1995, 76: 979-984. 10.1099/0022-1317-76-4-979View ArticlePubMed
- Mason PW: Maturation of Japanese encephalitis virus glycoproteins produced by infected mammalian and mosquito cells. Virology 1989, 169: 354-364. 10.1016/0042-6822(89)90161-XView ArticlePubMed
- Chen LK, Liao CL, Lin CG, Lai SC, Liu CI, Ma SH, Huang YY, Lin YL: Persistence of Japanese encephalitis virus is associated with abnormal expression of the nonstructural protein NS1 in host cells. Virology 1996, 217: 220-229. 10.1006/viro.1996.0109View ArticlePubMed
- Blitvich BJ, Scanlon D, Shiell BJ, Mackenzie JS, Hall RA: Identification and analysis of truncated and elongated species of the flavivirus NS1 protein. Virus Res 1999, 60: 67-79. 10.1016/S0168-1702(99)00003-9View ArticlePubMed
- Flamand M, Chevalier M, Henchal E, Girard M, Deubel V: Purification and renaturation of Japanese encephalitis virus nonstructural glycoprotein NS1 overproduced by insect cells. Protein Expr Purif 1995, 6: 519-527. 10.1006/prep.1995.1069View ArticlePubMed
- Mason PW, McAda PC, Dalrymple JM, Fournier MJ, Mason TL: Expression of Japanese encephalitis virus antigens in Escherichia coli. Virology 1987, 158: 361-372. 10.1016/0042-6822(87)90208-XView ArticlePubMed
- Hall RA, Kay BH, Burgess GW, Clancy P, Fanning ID: Epitope analysis of the envelope and non-structural glycoproteins of Murray Valley encephalitis virus. J Gen Virol 1990, 71: 2923-2930. 10.1099/0022-1317-71-12-2923View ArticlePubMed
- Blitvich BJ, Mackenzie JS, Coelen RJ, Howard MJ, Hall RA: A novel complex formed between the flavivirus E and NS1 proteins: analysis of its structure and function. Arch Virol 1995, 140: 145-156. 10.1007/BF01309729View ArticlePubMed
- Jan LR, Yang CS, Trent DW, Falgout B, Lai CJ: Processing of Japanese encephalitis virus non-structural proteins: NS2B-NS3 complex and heterologous proteases. J Gen Virol 1995, 76: 573-580. 10.1099/0022-1317-76-3-573View ArticlePubMed
- Lin YL, Chen LK, Liao CL, Yeh CT, Ma SH, Chen JL, Huang YL, Chen SS, Chiang HY: DNA immunization with Japanese encephalitis virus nonstructural protein NS1 elicits protective immunity in mice. J Virol 1998, 72: 191-200.PubMed CentralPubMed
- Chambers TJ, McCourt DW, Rice CM: Production of yellow fever virus proteins in infected cells: identification of discrete polyprotein species and analysis of cleavage kinetics using region-specific polyclonal antisera. Virology 1990, 177: 159-174. 10.1016/0042-6822(90)90470-CView ArticlePubMed
- Nestorowicz A, Chambers TJ, Rice CM: Mutagenesis of the yellow fever virus NS2A/2B cleavage site: effects on proteolytic processing, viral replication, and evidence for alternative processing of the NS2A protein. Virology 1994, 199: 114-123. 10.1006/viro.1994.1103View ArticlePubMed
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.