Recombination in West Nile Virus: minimal contribution to genomic diversity
© Pickett and Lefkowitz. 2009
Received: 25 August 2009
Accepted: 12 October 2009
Published: 12 October 2009
Skip to main content
© Pickett and Lefkowitz. 2009
Received: 25 August 2009
Accepted: 12 October 2009
Published: 12 October 2009
Recombination is known to play a role in the ability of various viruses to acquire sequence diversity. We consequently examined all available West Nile virus (WNV) whole genome sequences both phylogenetically and with a variety of computational recombination detection algorithms. We found that the number of distinct lineages present on a phylogenetic tree reconstruction to be identical to the 6 previously reported. Statistically-significant evidence for recombination was only observed in one whole genome sequence. This recombination event was within the NS5 polymerase coding region. All three viruses contributing to the recombination event were originally isolated in Africa at various times, with the major parent (SPU116_89_B), minor parent (KN3829), and recombinant sequence (AnMg798) belonging to WNV taxonomic lineages 2, 1a, and 2 respectively. This one isolated recombinant genome was out of a total of 154 sequences analyzed. It therefore does not seem likely that recombination contributes in any significant manner to the overall sequence variation within the WNV genome.
The species West Nile virus (WNV) is a member of the family Flaviviridae, genus Flavivirus. West Nile virus is a positive-sense, single-stranded RNA virus that has 6 separate phylogenetically-distinct lineages which correlate well with the geographical point of isolation . Sequence variation in positive-sense RNA viruses such as flaviviruses, can occur via single base changes and small insertions and deletions within the linear evolutionary pathway of the virus lineage [2–4]. In addition, larger scale sequence changes can occur via exchange of genetic information with other related viruses via the process of recombination [5, 6]. Recombination has been detected in several members of the Flaviviridae family including: hepatitis C virus  and dengue virus [8, 9]; and it has been hypothesized that West Nile virus would follow suit as more sequence data becomes available .
Homologous recombination in single-stranded RNA molecules occurs via a template-switch , also called copy-choice , mechanism. More specifically, when two positive-polarity, single-stranded RNA viruses belonging to the same species co-infect a single cell, a replicating viral RNA-dependent RNA polymerase (RdRp) can dissociate from the first genome and continue replication by binding to, and using a second distinct genome as the replication template. This dissociation process is thought to be initiated by the RdRp pausing or stalling at specific sequences or RNA structural elements [11, 13, 14]. The act of moving the RdRp complex from one "parental" genome to another yields a chimera "daughter" viral genome containing one fraction of the first "parental" genome and the other fraction of the second "parent" genome.
Such recombination events in natural sequences are difficult to detect in the wet-lab due to the sequence similarity that exists between parental and daughter sequences at any putative recombination breakpoint . As a consequence of this fact, in silico techniques have been developed to assist in this endeavor. These algorithms function by comparing all possible combinations of three sequences at a time from a multiple sequence alignment to determine whether or not a nucleotide pattern signifying the presence of a recombination breakpoint exists within between any 3 sequences (two parental, and one recombinant).
To manually detect phylogenetic incongruencies between different regions of the aligned genomes, we analyzed portions of the MSA containing: the complete NS5 coding region, the NS5 coding region lacking the recombinant region, or only the region within the NS5 coding sequence that showed evidence of recombination. MrBayes was then used to reconstruct separate consensus phylogenetic trees using the parameters described below. The topologies of these three trees were compared to confirm recombination within the region.
4.936 × 10-2
2.033 × 10-6
8.269 × 10-5
3.600 × 10-1
7.235 × 10-8
3.986 × 10-5
The purpose of the present study was to examine a dataset consisting of multiple whole genome WNV sequences in order to determine the extent to which recombination contributed to the overall sequence variation within the this viral species and compare the contribution of recombination in WNV to that in other members of the Flaviviridae family.
We confirm the fact that WNV isolates can be grouped into 6 distinct phylogenetic clades or lineages [1, 18]. Whether this implies that only 6 such lineages exist can only be confirmed with the acquisition of more sequence data. While the genetic differences producing these separate clades have apparently been produced as a result of geographic isolation, it is possible that temporal, host genetic, immune, and/or additional factors may also play some role in the generation of WNV diversity in these, or other replicating lineages.
Previous studies attempting to detect recombination in West Nile virus used only the envelope coding region . For our current study, we hoped to increase the sensitivity of the analysis by utilizing the entire genome sequence for recombination detection. In spite of this, we were only able to detect one recombination event among all of the 154 WNV isolates that are available as complete genomic sequences. The NS5 region containing this recombination event is known to contain the WNV-specific loop/alpha-helix as well as the back subdomain of the RNA template tunnel .
Although recombination within certain species of the Flavivirus genus has been reported as fairly frequent--an observation which may likely be attributed to the vector-vertebrate host life cycle that is exploited by these arboviruses , it is not common across all species within the genus. Recombination is rare in Japanese encephalitis virus and St. Louis encephalitis virus, while recombination appears to be relatively frequent among the four serotypes of dengue virus with at least one known intergenotypic recombination event in serotype 1 [5, 6, 10]. Recombination also seems to be a relevant cause of genetic diversity within the Hepatitis C virus species (Hepacivirus genus). Such events have mostly been reported between genomes belonging to different genotypes or subtypes [7, 20]; however, very few intra-subtype recombination events have been reported perhaps due to the difficulty of detecting recombination between very closely related viral genomes . Since WNV is more closely related to Japanese encephalitis virus and St. Louis encephalitis virus than to either hepatitis C virus or dengue virus , its ability to utilize recombination as a mechanism for generating sequence variation may also be more limited.
We believe that this recombination event was identified because of the sequence variation existing between the two original parental lineages, and subsequently passed down through the progeny of the recombinant virus. Whether intra-lineage recombination is detectable is still unknown due to the high sequence similarity existing between such sequences. This idea is further supported by the previous observations that purifying selection pressure is present in arthropod-borne viruses , and that the sequence diversity present within the distinct lineages, and by extension, throughout the WNV species as a whole is remarkably low . These arguments support our finding that the occurrence, and consequently the detection, of recombination within WNV is an especially rare event.
It is also important to realize that even though recombination was detected to have occurred between the SPU116_89_B and KN3829 sequences to yield the AnMg798 sequence, these are not likely the actual sequences that participated in the original recombination event. This statement is based on the knowledge that these sequences differ both in time and place of isolation, it is therefore probable that they are progeny of the original parental (and daughter) sequences. These extant sequences were likely flagged as having undergone a statistically significant recombination event due to the conservation of the original ancestral recombinant signal in the descendents.
Unfortunately, the sequence and metadata associated with these isolates is insufficient to determine the temporal or geographical point of origin for either the ancestral parental or daughter sequences. Therefore, while we know that the strains were isolated from eastern Africa, it is impossible to determine whether the ancestral parental strains were originally located adjacent to each other geographically or whether a bird, mosquito, human or other host infected with one of the parental strains migrated to an area where the second parental strain was either present or endemic. Either of these possibilities would result in the introduction of one of the parental strains into the same territory as the other and would allow for co-circulation of both viruses within the local environment until they eventually infected the same host and the recombination event occurred. It is also impossible with the present amount of information to determine which organism was co-infected and produced the recombinant virus.
There are several possible biological reasons why recombination may be so rare in WNV and therefore why we were only able to detect recombination in only 1 of the 154 WNV whole genome sequences. First, it has been shown that the concentration of WNV in the blood throughout the human portion of the replication cycle is low , which markedly decreases the probability that a single cell would become infected with the two distinct viral isolates required for recombination to occur. This is in contrast to infection in birds, the natural reservoir of WNV, which in some avian species can result in high levels of viremia . So the possibility exists for a single avian cell to become infected by multiple strains of virus. Therefore the possibility remains for recombination to occur in birds (though if present, our analysis would have detected recombination within the available sequenced isolates irrespective of where recombination may have occurred). Secondly, it has also been shown in vitro that the WNV RNA polymerase is more likely to abort RNA replication after falling off of a template molecule than it is to reinitiate on a homologous RNA template . This will decrease the likelihood of recombination in either the human or avian host.
Using bioinformatics analysis, we were able to detect only a single incidence of recombination in available sequenced isolates of WNV. And in addition, reports indicate that the capability of the RdRp to template switch-and by extension to cause recombination-in WNV is severely diminished. For these reasons recombination appears not to be a likely mechanism for the generation of sequence diversity in West Nile virus.
To look for recombination in WNV isolates, we used 154 whole genome Kunjin virus and West Nile virus sequences (See additional file 3 for the original data used) obtained from the Viral Bioinformatics Resource Center http://www.vbrc.org. A multiple sequence alignment (MSA) of these genomes was constructed using MUSCLE . Phylogenetic reconstruction of all available genomic sequences was performed using Bayesian analysis as implemented by the program MrBayes . We used the default parameters in MrBayes (General Time Reversible evolutionary model, gamma-distributed rate variation and proportion of invariable sites) and sampled every 100 generations for 1 million generations using 4 chains. The first 2,500 trees were discarded as "burn-in".
For detection of recombination events, we used the automated suite of algorithms contained within the Recombination Detection Program 3 (RDP3) [8, 30–38] to analyze the complete genomic sequences present in our MSA. In general, we used the default settings for each program in the RDP3 suite except for the following: for RDP we used a window size of 30; Bootscan used a window size of 200, step size of 50, and 50 bootstrap replicates; Siscan used a window size of 200 and step size of 20; and RDP3 was set to report all hits detected by 2 or more algorithms. In order to confirm the results from the automated tests, additional algorithms which are not part of the automated process were also run. SplitsTree4  was used with default settings to assess the presence of a reticulated phylogenetic network as a representation of recombination (unpublished data).
We would like to thank the members of the Lefkowitz laboratory as well as the staff of the Viral Bioinformatics Resource Center for their help, support, and provision of the sequence data for download. This work was supported by NIH/NIAID Contract No. HHSN266200400036C to EJL.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.