Recombination in West Nile Virus: minimal contribution to genomic diversity
© Pickett and Lefkowitz. 2009
Received: 25 August 2009
Accepted: 12 October 2009
Published: 12 October 2009
Recombination is known to play a role in the ability of various viruses to acquire sequence diversity. We consequently examined all available West Nile virus (WNV) whole genome sequences both phylogenetically and with a variety of computational recombination detection algorithms. We found that the number of distinct lineages present on a phylogenetic tree reconstruction to be identical to the 6 previously reported. Statistically-significant evidence for recombination was only observed in one whole genome sequence. This recombination event was within the NS5 polymerase coding region. All three viruses contributing to the recombination event were originally isolated in Africa at various times, with the major parent (SPU116_89_B), minor parent (KN3829), and recombinant sequence (AnMg798) belonging to WNV taxonomic lineages 2, 1a, and 2 respectively. This one isolated recombinant genome was out of a total of 154 sequences analyzed. It therefore does not seem likely that recombination contributes in any significant manner to the overall sequence variation within the WNV genome.
The species West Nile virus (WNV) is a member of the family Flaviviridae, genus Flavivirus. West Nile virus is a positive-sense, single-stranded RNA virus that has 6 separate phylogenetically-distinct lineages which correlate well with the geographical point of isolation . Sequence variation in positive-sense RNA viruses such as flaviviruses, can occur via single base changes and small insertions and deletions within the linear evolutionary pathway of the virus lineage [2–4]. In addition, larger scale sequence changes can occur via exchange of genetic information with other related viruses via the process of recombination [5, 6]. Recombination has been detected in several members of the Flaviviridae family including: hepatitis C virus  and dengue virus [8, 9]; and it has been hypothesized that West Nile virus would follow suit as more sequence data becomes available .
Homologous recombination in single-stranded RNA molecules occurs via a template-switch , also called copy-choice , mechanism. More specifically, when two positive-polarity, single-stranded RNA viruses belonging to the same species co-infect a single cell, a replicating viral RNA-dependent RNA polymerase (RdRp) can dissociate from the first genome and continue replication by binding to, and using a second distinct genome as the replication template. This dissociation process is thought to be initiated by the RdRp pausing or stalling at specific sequences or RNA structural elements [11, 13, 14]. The act of moving the RdRp complex from one "parental" genome to another yields a chimera "daughter" viral genome containing one fraction of the first "parental" genome and the other fraction of the second "parent" genome.
Such recombination events in natural sequences are difficult to detect in the wet-lab due to the sequence similarity that exists between parental and daughter sequences at any putative recombination breakpoint . As a consequence of this fact, in silico techniques have been developed to assist in this endeavor. These algorithms function by comparing all possible combinations of three sequences at a time from a multiple sequence alignment to determine whether or not a nucleotide pattern signifying the presence of a recombination breakpoint exists within between any 3 sequences (two parental, and one recombinant).
To manually detect phylogenetic incongruencies between different regions of the aligned genomes, we analyzed portions of the MSA containing: the complete NS5 coding region, the NS5 coding region lacking the recombinant region, or only the region within the NS5 coding sequence that showed evidence of recombination. MrBayes was then used to reconstruct separate consensus phylogenetic trees using the parameters described below. The topologies of these three trees were compared to confirm recombination within the region.
Phylogenetic Tree Reconstructions
Detection of Recombination in Whole Genome Sequences
4.936 × 10-2
2.033 × 10-6
8.269 × 10-5
3.600 × 10-1
7.235 × 10-8
3.986 × 10-5
Confirmation of Recombination Event
The purpose of the present study was to examine a dataset consisting of multiple whole genome WNV sequences in order to determine the extent to which recombination contributed to the overall sequence variation within the this viral species and compare the contribution of recombination in WNV to that in other members of the Flaviviridae family.
We confirm the fact that WNV isolates can be grouped into 6 distinct phylogenetic clades or lineages [1, 18]. Whether this implies that only 6 such lineages exist can only be confirmed with the acquisition of more sequence data. While the genetic differences producing these separate clades have apparently been produced as a result of geographic isolation, it is possible that temporal, host genetic, immune, and/or additional factors may also play some role in the generation of WNV diversity in these, or other replicating lineages.
Previous studies attempting to detect recombination in West Nile virus used only the envelope coding region . For our current study, we hoped to increase the sensitivity of the analysis by utilizing the entire genome sequence for recombination detection. In spite of this, we were only able to detect one recombination event among all of the 154 WNV isolates that are available as complete genomic sequences. The NS5 region containing this recombination event is known to contain the WNV-specific loop/alpha-helix as well as the back subdomain of the RNA template tunnel .
Although recombination within certain species of the Flavivirus genus has been reported as fairly frequent--an observation which may likely be attributed to the vector-vertebrate host life cycle that is exploited by these arboviruses , it is not common across all species within the genus. Recombination is rare in Japanese encephalitis virus and St. Louis encephalitis virus, while recombination appears to be relatively frequent among the four serotypes of dengue virus with at least one known intergenotypic recombination event in serotype 1 [5, 6, 10]. Recombination also seems to be a relevant cause of genetic diversity within the Hepatitis C virus species (Hepacivirus genus). Such events have mostly been reported between genomes belonging to different genotypes or subtypes [7, 20]; however, very few intra-subtype recombination events have been reported perhaps due to the difficulty of detecting recombination between very closely related viral genomes . Since WNV is more closely related to Japanese encephalitis virus and St. Louis encephalitis virus than to either hepatitis C virus or dengue virus , its ability to utilize recombination as a mechanism for generating sequence variation may also be more limited.
We believe that this recombination event was identified because of the sequence variation existing between the two original parental lineages, and subsequently passed down through the progeny of the recombinant virus. Whether intra-lineage recombination is detectable is still unknown due to the high sequence similarity existing between such sequences. This idea is further supported by the previous observations that purifying selection pressure is present in arthropod-borne viruses , and that the sequence diversity present within the distinct lineages, and by extension, throughout the WNV species as a whole is remarkably low . These arguments support our finding that the occurrence, and consequently the detection, of recombination within WNV is an especially rare event.
It is also important to realize that even though recombination was detected to have occurred between the SPU116_89_B and KN3829 sequences to yield the AnMg798 sequence, these are not likely the actual sequences that participated in the original recombination event. This statement is based on the knowledge that these sequences differ both in time and place of isolation, it is therefore probable that they are progeny of the original parental (and daughter) sequences. These extant sequences were likely flagged as having undergone a statistically significant recombination event due to the conservation of the original ancestral recombinant signal in the descendents.
Unfortunately, the sequence and metadata associated with these isolates is insufficient to determine the temporal or geographical point of origin for either the ancestral parental or daughter sequences. Therefore, while we know that the strains were isolated from eastern Africa, it is impossible to determine whether the ancestral parental strains were originally located adjacent to each other geographically or whether a bird, mosquito, human or other host infected with one of the parental strains migrated to an area where the second parental strain was either present or endemic. Either of these possibilities would result in the introduction of one of the parental strains into the same territory as the other and would allow for co-circulation of both viruses within the local environment until they eventually infected the same host and the recombination event occurred. It is also impossible with the present amount of information to determine which organism was co-infected and produced the recombinant virus.
There are several possible biological reasons why recombination may be so rare in WNV and therefore why we were only able to detect recombination in only 1 of the 154 WNV whole genome sequences. First, it has been shown that the concentration of WNV in the blood throughout the human portion of the replication cycle is low , which markedly decreases the probability that a single cell would become infected with the two distinct viral isolates required for recombination to occur. This is in contrast to infection in birds, the natural reservoir of WNV, which in some avian species can result in high levels of viremia . So the possibility exists for a single avian cell to become infected by multiple strains of virus. Therefore the possibility remains for recombination to occur in birds (though if present, our analysis would have detected recombination within the available sequenced isolates irrespective of where recombination may have occurred). Secondly, it has also been shown in vitro that the WNV RNA polymerase is more likely to abort RNA replication after falling off of a template molecule than it is to reinitiate on a homologous RNA template . This will decrease the likelihood of recombination in either the human or avian host.
Using bioinformatics analysis, we were able to detect only a single incidence of recombination in available sequenced isolates of WNV. And in addition, reports indicate that the capability of the RdRp to template switch-and by extension to cause recombination-in WNV is severely diminished. For these reasons recombination appears not to be a likely mechanism for the generation of sequence diversity in West Nile virus.
Multiple Sequence Alignments and Phylogenetic Trees
To look for recombination in WNV isolates, we used 154 whole genome Kunjin virus and West Nile virus sequences (See additional file 3 for the original data used) obtained from the Viral Bioinformatics Resource Center http://www.vbrc.org. A multiple sequence alignment (MSA) of these genomes was constructed using MUSCLE . Phylogenetic reconstruction of all available genomic sequences was performed using Bayesian analysis as implemented by the program MrBayes . We used the default parameters in MrBayes (General Time Reversible evolutionary model, gamma-distributed rate variation and proportion of invariable sites) and sampled every 100 generations for 1 million generations using 4 chains. The first 2,500 trees were discarded as "burn-in".
For detection of recombination events, we used the automated suite of algorithms contained within the Recombination Detection Program 3 (RDP3) [8, 30–38] to analyze the complete genomic sequences present in our MSA. In general, we used the default settings for each program in the RDP3 suite except for the following: for RDP we used a window size of 30; Bootscan used a window size of 200, step size of 50, and 50 bootstrap replicates; Siscan used a window size of 200 and step size of 20; and RDP3 was set to report all hits detected by 2 or more algorithms. In order to confirm the results from the automated tests, additional algorithms which are not part of the automated process were also run. SplitsTree4  was used with default settings to assess the presence of a reticulated phylogenetic network as a representation of recombination (unpublished data).
We would like to thank the members of the Lefkowitz laboratory as well as the staff of the Viral Bioinformatics Resource Center for their help, support, and provision of the sequence data for download. This work was supported by NIH/NIAID Contract No. HHSN266200400036C to EJL.
- Botha EM, Markotter W, Wolfaardt M, Paweska JT, Swanepoel R, Palacios G, Nel LH, Venter M: Genetic determinants of virulence in pathogenic lineage 2 West Nile virus strains. Emerg Infect Dis 2008, 14:222–230.View ArticlePubMed
- Deas TS, Bennett CJ, Jones SA, Tilgner M, Ren P, Behr MJ, Stein DA, Iversen PL, Kramer LD, Bernard KA, Shi PY: In vitro resistance selection and in vivo efficacy of morpholino oligomers against West Nile virus. Antimicrob Agents Chemother 2007, 51:2470–2482.View ArticlePubMed
- Vasilakis N, Deardorff ER, Kenney JL, Rossi SL, Hanley KA, Weaver SC: Mosquitoes put the brake on arbovirus evolution: experimental evolution reveals slower mutation accumulation in mosquito than vertebrate cells. PLoS Pathog 2009, 5:e1000467.View ArticlePubMed
- Iyer AV, Boudreaux MJ, Wakamatsu N, Roy AF, Baghian A, Chouljenko VN, Kousoulas KG: Complete genome analysis and virulence characteristics of the Louisiana West Nile virus strain LSU-AR01. Virus Genes 2009, 38:204–214.View ArticlePubMed
- Aaskov J, Buzacott K, Field E, Lowry K, Berlioz-Arthaud A, Holmes EC: Multiple recombinant dengue type 1 viruses in an isolate from a dengue patient. J Gen Virol 2007, 88:3334–3340.View ArticlePubMed
- Worobey M, Rambaut A, Holmes EC: Widespread intra-serotype recombination in natural populations of dengue virus. Proc Natl Acad Sci USA 1999, 96:7352–7357.View ArticlePubMed
- Sentandreu V, Jimenez-Hernandez N, Torres-Puente M, Bracho MA, Valero A, Gosalbes MJ, Ortega E, Moya A, Gonzalez-Candelas F: Evidence of recombination in intrapatient populations of hepatitis C virus. PLoS ONE 2008, 3:e3239.View ArticlePubMed
- Holmes EC, Worobey M, Rambaut A: Phylogenetic evidence for recombination in dengue virus. Mol Biol Evol 1999, 16:405–409.PubMed
- Chen SP, Yu M, Jiang T, Deng YQ, Qin CF, Han JF, Qin ED: Identification of a recombinant dengue virus type 1 with 3 recombination regions in natural populations in Guangdong province, China. Arch Virol 2008, 153:1175–1179.View ArticlePubMed
- Twiddy SS, Holmes EC: The extent of homologous recombination in members of the genus Flavivirus. J Gen Virol 2003, 84:429–440.View ArticlePubMed
- Worobey M, Holmes EC: Evolutionary aspects of recombination in RNA viruses. J Gen Virol 1999,80(Pt 10):2535–2543.PubMed
- Cooper PD, Steiner-Pryor A, Scotti PD, Delong D: On the nature of poliovirus genetic recombinants. J Gen Virol 1974, 23:41–49.View Article
- Neufeld KL, Richards OC, Ehrenfeld E: Purification, characterization, and comparison of poliovirus RNA polymerase from native and recombinant sources. J Biol Chem 1991, 266:24212–24219.PubMed
- Mindich L: Packaging, replication and recombination of the segmented genome of bacteriophage Phi6 and its relatives. Virus Res 2004, 101:83–92.View ArticlePubMed
- Weaver SC, Hagenbaugh A, Bellew LA, Gousset L, Mallampalli V, Holland JJ, Scott TW: Evolution of alphaviruses in the eastern equine encephalomyelitis complex. J Virol 1994, 68:158–169.PubMed
- Beasley DW, Li L, Suderman MT, Barrett AD: Mouse neuroinvasive phenotype of West Nile virus strains varies depending upon virus genotype. Virology 2002, 296:17–23.View ArticlePubMed
- Venter M, Myers TG, Wilson MA, Kindt TJ, Paweska JT, Burt FJ, Leman PA, Swanepoel R: Gene expression in mice infected with West Nile virus strains of different neurovirulence. Virology 2005, 342:119–140.View ArticlePubMed
- Bakonyi T, Ivanics E, Erdelyi K, Ursu K, Ferenczi E, Weissenbock H, Nowotny N: Lineage 1 and 2 strains of encephalitic West Nile virus, central Europe. Emerg Infect Dis 2006, 12:618–623.PubMed
- Malet H, Egloff MP, Selisko B, Butcher RE, Wright PJ, Roberts M, Gruez A, Sulzenbacher G, Vonrhein C, Bricogne G, et al.: Crystal structure of the RNA polymerase domain of the West Nile virus non-structural protein 5. J Biol Chem 2007, 282:10678–10689.View ArticlePubMed
- Yun Z, Lara C, Johansson B, Lorenzana de Rivera I, Sonnerborg A: Discrepancy of hepatitis C virus genotypes as determined by phylogenetic analysis of partial NS5 and core sequences. J Med Virol 1996, 49:155–160.View ArticlePubMed
- Moreno MP, Casane D, Lopez L, Cristina J: Evidence of recombination in quasispecies populations of a Hepatitis C Virus patient undergoing anti-viral therapy. Virol J 2006, 3:87.View ArticlePubMed
- Kuno G, Chang GJ, Tsuchiya KR, Karabatsos N, Cropp CB: Phylogeny of the genus Flavivirus. J Virol 1998, 72:73–83.PubMed
- Weaver SC: Evolutionary influences in arboviral disease. Curr Top Microbiol Immunol 2006, 299:285–314.View ArticlePubMed
- Charrel RN, Brault AC, Gallian P, Lemasson JJ, Murgue B, Murri S, Pastorino B, Zeller H, de Chesse R, de Micco P, de Lamballerie X: Evolutionary relationship between Old World West Nile virus strains. Virology 2003, 315:381–388.View ArticlePubMed
- Davis LE, DeBiasi R, Goade DE, Haaland KY, Harrington JA, Harnar JB, Pergam SA, King MK, DeMasters BK, Tyler KL: West Nile virus neuroinvasive disease. Ann Neurol 2006, 60:286–300.View ArticlePubMed
- Meulen KM, Pensaert MB, Nauwynck HJ: West Nile virus in the vertebrate world. Arch Virol 2005, 150:637–657.View ArticlePubMed
- Selisko B, Dutartre H, Guillemot JC, Debarnot C, Benarroch D, Khromykh A, Despres P, Egloff MP, Canard B: Comparative mechanistic studies of de novo RNA synthesis by flavivirus RNA-dependent RNA polymerases. Virology 2006, 351:145–158.View ArticlePubMed
- Edgar RC: MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res 2004, 32:1792–1797.View ArticlePubMed
- Huelsenbeck JP, Ronquist F: MRBAYES: Bayesian inference of phylogenetic trees. Bioinformatics 2001, 17:754–755.View ArticlePubMed
- Martin DP, Williamson C, Posada D: RDP 2: recombination detection and analysis from sequence alignments. Bioinformatics 2005, 21:260–262.View ArticlePubMed
- Posada D, Crandall KA: Evaluation of methods for detecting recombination from DNA sequences: computer simulations. Proc Natl Acad Sci USA 2001, 98:13757–13762.View ArticlePubMed
- Martin D, Rybicki E: RDP: detection of recombination amongst aligned sequences. Bioinformatics 2000, 16:562–563.View ArticlePubMed
- Smith JM: Analyzing the mosaic structure of genes. J Mol Evol 1992, 34:126–129.PubMed
- Gibbs MJ, Armstrong JS, Gibbs AJ: Sister-scanning: a Monte Carlo procedure for assessing signals in recombinant sequences. Bioinformatics 2000, 16:573–582.View ArticlePubMed
- Boni MF, Posada D, Feldman MW: An exact nonparametric method for inferring mosaic structure in sequence triplets. Genetics 2007, 176:1035–1047.View ArticlePubMed
- Weiller GF: Phylogenetic profiles: a graphical method for detecting genetic recombinations in homologous sequences. Mol Biol Evol 1998, 15:326–335.PubMed
- Padidam M, Sawyer S, Fauquet CM: Possible emergence of new geminiviruses by frequent recombination. Virology 1999, 265:218–225.View ArticlePubMed
- Martin DP, Posada D, Crandall KA, Williamson C: A modified bootscan algorithm for automated identification of recombinant sequences and recombination breakpoints. AIDS Res Hum Retroviruses 2005, 21:98–102.View ArticlePubMed
- Huson DH, Bryant D: Application of phylogenetic networks in evolutionary studies. Mol Biol Evol 2006, 23:254–267.View ArticlePubMed
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.