- Open Access
Evidence that Lokern virus (family Peribunyaviridae) is a reassortant that acquired its small and large genome segments from Main Drain virus and its medium genome segment from an undiscovered virus
Virology Journalvolume 15, Article number: 122 (2018)
Lokern virus (LOKV) is a poorly characterized arthropod-borne virus belonging to the genus Orthobunyavirus (family Peribunyaviridae). All viruses in this genus have tripartite, single-stranded, negative-sense RNA genomes, and the three RNA segments are designated as small, (S), medium (M) and large (L). A 559 nt. region of the M RNA segment of LOKV has been sequenced and there are no sequence data available for its S or L RNA segments. The purpose of this study was to sequence the genome of LOKV.
The genome of LOKV was fully sequenced by unbiased high-throughput sequencing, 5′ and 3′ rapid amplification of cDNA ends, reverse transcription-polymerase chain reaction and Sanger sequencing.
The S and L RNA segments of LOKV consist of 952 and 6864 nt. respectively and both have 99.0% nucleotide identity with the corresponding regions of Main Drain virus (MDV). In contrast, the 4450-nt. M RNA segment has only 59.0% nucleotide identity with the corresponding region of MDV and no more than 72.7% nucleotide identity with all other M RNA segment sequences in the Genbank database. Phylogenetic data support these findings.
This study provides evidence that LOKV is a natural reassortant that acquired its S and L RNA segments from MDV and its M RNA segment from an undiscovered, and possibly extinct, virus. The availability of complete genome sequence data facilitates the accurate detection, identification and diagnosis of viruses and viral infections, and this is especially true for viruses with segmented genomes because it can be difficult or even impossible to differentiate between reassortants and their precursors when incomplete sequence data are available.
Lokern virus (LOKV) is a poorly characterized virus in the genus Orthobunyavirus (family Peribunyaviridae). The genus has previously been separated into serogroups, with LOKV classified in the Bunyamwera (BUN) serogroup . Other viruses in this serogroup include Batai virus (BATV), Bunyamwera virus (BUNV), Cache Valley virus (CVV), Maguari virus (MAGV), Ngari virus (NRIV), Northway virus (NORV), and Tensaw virus (TENV). LOKV was originally isolated from Culex tarsalis mosquitoes in California in 1962 . Additional isolations have been made from arthropods of various species as well as rabbits and hares in the western half of the United States, with most isolates recovered from Culicoides spp. midges [2,3,4,5]. Serologic data indicate that the geographic range of LOKV also includes the eastern half of the United States . LOKV is not a recognized pathogen of humans or vertebrate animals but its ability to cause disease has not been widely investigated.
All viruses in the genus Orthobunyavirus possess a tripartite, single-stranded, negative-sense RNA genome . The three RNA segments are designated as small, (S), medium (M) and large (L) and are approximately 1.0, 4.5 and 6.9 kb in length, respectively. Complementary and highly conserved nucleotide sequences are located at the 5′ and 3′ termini of each RNA segment. Base pairing of these sequences results in the formation of panhandle structures assumed to be required for the initiation of genome transcription and replication [8, 9]. The S RNA segment codes the nucleocapsid (N) protein and a nonstructural protein NSS in overlapping reading frames . The M RNA segment codes for a polyprotein that is post-translationally processed to generate the N- and C-terminal glycoproteins, Gn and Gc, and a nonstructural protein NSM . The L RNA segment codes for the L protein which functions as the RNA-dependent RNA polymerase .
The exchange of RNA segments (genome reassortment) between closely-related orthobunyaviruses can occur when a vector or host is co-infected with two or more viruses [13, 14]. Reassortment is considered to be a major means of evolution for these viruses because the resulting progeny can possess new and advantageous genetic and phenotypic traits. Reassortment sometimes generates viruses that are more pathogenic than their donor viruses. One example is NRIV which acquired its S and L RNA segments from BUNV and its M RNA segment from BATV [15, 16]. NRIV has been associated with large outbreaks of hemorrhagic fever in humans in East Africa while the most serious disease manifestations associated with its precursor viruses are low-grade fever and myalgia.
The genome of LOKV has not been fully sequenced. A 559 nt. region of the M RNA segment that spans part of the 5′ untranslated region (UTR) and Gn coding region has been sequenced (Genbank Accession no. AY593736.1) and no sequence data are available for the S or L RNA segments. The objective of this study is to fully sequence the genome of LOKV and to determine its genetic and phylogenetic relationships with other BUN serogroup viruses.
LOKV (isolate FMS 4332) was obtained from the World Reference Center for Emerging Viruses and Arboviruses at the University of Texas Medical Branch in Galveston, TX. The virus had been passaged 13 times in suckling mouse brains prior to receipt and underwent an additional passage in African Green Monkey kidney (Vero) cells at Iowa State University.
Total RNA was extracted from subconfluent monolayers of LOKV-infected Vero cells using Trizol Reagent (ThermoFisher, Carlsbad, CA), fragmented using RNase III (New England BioLabs, Ipswich, MA) and assessed for quality using an Agilent 2100 Bioanalyzer (Agilent, Santa Clara, CA). Libraries were constructed using the Ion Total RNA-Seq Kit v2 (ThermoFisher) and assessed for quality and analyzed at the Genomic Technologies Facility at Iowa State University using an Ion Proton Sequencer (ThermoFisher). Ion-Torrent reads were mapped to the genome of Chlorocebus sabaeus using Bowtie 2 in order to identify host cell sequences . Unmapped reads were analyzed using the sortMeRNA program to remove rRNA-related reads . All remaining reads with Phred values ≥ 33 were subjected to de novo SPAdes (ver 3.5.0) assembly . Contigs were mapped to a reference orthobunyavirus genome (MDV for S and L RNA segment sequences; CVV for M RNA segment sequences) using LASTZ . Alignment files were manually verified on Tablet .
Reverse transcription-polymerase chain reaction and sanger sequencing
Reverse transcription-polymerase chain reaction (RT-PCR) and Sanger sequencing were performed to verify LOKV sequences generated by high-throughput sequencing and to close gaps between contigs. Complementary DNAs were generated using Superscript III reverse transcriptase (ThermoFisher) and PCRs were performed using high-fidelity Taq polymerase (ThermoFisher) in accordance to the manufacturer’s instructions. Primers were designed from the sequences generated by high-throughput sequencing and are available upon request. RT-PCR products were purified using the purelink gel extraction kit (ThermoFisher) and sequenced using a 3730 × 1 DNA sequencer (Applied Biosystems, Foster City, CA).
5′ and 3′ rapid amplification of cDNA ends
The 5′ and 3′ ends of each RNA segment were identified using 5′ and 3′ rapid amplification of cDNA ends, respectively. Briefly, 20 pmol of pre-adenylated DNA adaptor (5′-rApp/TGGAATTCTCGGGTGCCAAGGT/ddC-3′) was ligated to 1 μg of viral genomic and anti-genomic RNAs using 200 U of T4 RNA ligase 2 (New England BioLabs) and incubated for 1 h at 25 °C. Ligated products were recovered by ethanol precipitation and complementary cDNAs were created using Superscript III reverse transcriptase and an adapter-specific primer. PCRs were performed using adapter- and gene-specific primers, and RT-PCR products were purified and subjected to Sanger sequencing.
Nucleotide and amino acid sequence alignments
Nucleotide and predicted amino acid sequences of LOKV were aligned with other sequences in the Genbank database by application of BLASTn and BLASTp, respectively. Alignments were also performed using Clustal Omega (available at http://www.ebi.ac.uk/Tools/msa/clustalo/) in order to calculate percent nucleotide and amino acid identities between select sequences.
Nucleotide sequences were alignment using the program Multiple Alignment Using Fast Fourier Transform. A Markov Chain Monte Carlo-based Bayesian analysis was performed by MrBayes using the general time reversible plus invariant sites plus gamma distribution 4 model . To reach congruence, 10 million iterations were run and sampled every 1000th iteration. The maximum clade credibility tree was created using TreeAnnotator (version 1.8.2) with 10% burn-in  and visualized with FigTree (version 1.4.0).
The complete nucleotide sequences of the S, M and L RNA segments of LOKV were determined (Genbank Accession nos. MG696865, MG820264 and MG828823, respectively). The S RNA segment consists of 952 nt., and is most closely related to the corresponding region of MDV with 99.0% nt. identity, followed by NRIV and NORV with 85.0 and 84.6% nt. identity, respectively (Table 1). The S RNA segment of LOKV contains two open reading frames (ORFs). The most 5’ AUG initiates the longest ORF which encompasses nucleotide positions 77 to 778, and encodes for the N protein. The predicted translation product consists of 233 amino acids, and is most closely related to the N protein of MDV with 99.6% identity. The second ORF encompasses nucleotide positions 96 to 401 and encodes for the NSS protein. The predicted translation product consists of 101 amino acids and is identical to the NSS protein of MDV.
The M RNA segment of LOKV consists of 4450 nt., and is most closely related to the corresponding regions of CVV and its variants, Fort Sherman, Playas and Tlacotalpan viruses, with 72.4 to 72.7% nt. identity, followed by MAGV and NORV with 72.2 and 71.4% nt. identity, respectively (Table 1). The partial M RNA segment sequence of LOKV previously available in the Genbank database has 99.6% nt. identity with the corresponding region of our sequence. The sequences differ in two nucleotide positions; there is a nucleotide substitution in the 5’ UTR and a synonymous substitution in the Gn coding region. Pairwise nucleotide sequence alignments were also performed using the partial M RNA segment sequences of Birao, Bozo, Mboke, Shokwe, and Xingu viruses, revealing 60.2 to 68.8% nt. identity with the corresponding region of LOKV. The M RNA segment of LOKV contains a single ORF that encodes for a polyprotein of 1434 amino acids that is predicted to yield Gn, Gc and NSM proteins of 286, 866 and 173 amino acids, respectively. The deduced amino acid sequence of the polyprotein has greatest identity (77.9 to 78.4%) with the corresponding regions of CVV and its variants (Table 1). Alignments were also performed using the deduced amino acid sequences of the individual proteins, revealing that the Gn, Gc and NSM proteins of LOKV have at least 87.0, 79.1 and 67.0% identity, respectively to the corresponding proteins of CVV and its variants.
The L RNA segment of LOKV consists of 6864 nt., and is most closely related to the corresponding region of MDV with 99.0% nt. identity, followed by NORV and TENV with 86.7 and 86.1% nt. identity, respectively (Table 1). The L RNA segment of LOKV contains a single ORF that encompasses nucleotide positions 49 to 6765 and encodes for the L protein. The predicted translation product consists of 1434 residues and has 99.0% identity with the L protein of MDV.
Complementary sequences were identified at the 5′ and 3′ ends of all three RNA segments of LOKV (Fig. 1). Seventeen of the 21 terminal nucleotides at the 5′ and 3′ ends of the S RNA segment are complementary, and an inspection of the M and L RNA segment sequences revealed that 19 of 20 and 24 of 27 nt., respectively are complementary. The first 11 nt. at the 5′ ends of all three segments are characterized by the sequence 5’-AGUAGUGUACU-3′. The same sequence is present at the 3′ ends of all three segments, when in reverse complementary orientation, aside from a mismatch at positions 9 and − 9.
Phylogenetic trees were constructed with Bayesian methods using the nucleotide sequences of the S, M and L RNA segments of LOKV and selected other orthobunyaviruses (Figs. 2, 3, 4). The S RNA segment sequence of LOKV is most closely related phylogenetically to the corresponding segment of MDV. The Bayesian posterior support for this topological arrangement is 100%. Likewise, in the Bayesian tree constructed using L RNA segment sequences, LOKV is most closely related to MDV with strong posterior support for this grouping (100%). In contrast, the M RNA segment sequence of LOKV is most closely related phylogenetically to the corresponding regions of CVV and its variants. The posterior support for this topological arrangement is 99.9%. These viruses, along with NORV, MAGV, and TENV, comprise a distinct clade for which there is strong posterior support (99.9%).
This study provides evidence that LOKV is a natural reassortant that acquired its S and L RNA segments from MDV and its M RNA segment from an undiscovered, and possibly extinct, virus. MDV was assumed to be one of the donor viruses because its S and L RNA segments are almost identical to those of LOKV. The two viruses have similar geographic distributions, and vector and host associations, thus providing the opportunity for reassortment to occur. LOKV and MDV both occur in the western half of the United States, and most arthropod- and vertebrate-derived isolations of each virus have been from Culicoides spp. midges and rabbits, respectively . Another, albeit less likely, explanation is that LOKV and MDV are both reassortants that acquired their S and L RNA segments from the same ancestor(s). It has been suggested that MDV is a reassortant that acquired its S RNA segment from MAGV or NORV, its M RNA segment from Potosi or Kairi viruses, and its L RNA segment from CVV or MAGV .
The M RNA segment of LOKV was assumed to be acquired from an undiscovered virus because it has no more than 72.7% nt. identity to the M RNA segments of all other known orthobunyaviruses. Another, less likely, explanation is that the M segment donor is a known virus but the reassortant event was not recent, thus providing sufficient time for a considerable amount of sequence divergence to occur by genetic drift. Of all viruses positioned basally to LOKV in Fig. 3, MAGV and NORV are its closest relatives. It is unlikely that MAGV is the M RNA segment donor because its known geographic distribution does not overlap with LOKV. MAGV occurs in Latin American and the Caribbean, but has never been detected in the United States (Centers for Disease Control and Prevention Arbovirus Catalog, https://wwwn.cdc.gov/arbocat/VirusDetails.aspx?ID=276&SID=12, accessed April 2, 2018). However, NORV and LOKV have overlapping geographic distributions and vector and host ranges. NORV occurs in the western half of the United States and Canada, and has been isolated from rabbits and other rodents as well as mosquitoes [3, 25].
Our data support earlier studies which have shown that the most common reassortment event among BUN serogroup viruses is the M RNA segment exchange. In a classic study, heterologous crosses were performed in the laboratory using BUNV/BATV and BUNV/MAGV, with all resulting recombinants acquiring their S and L RNA segments from the same parental virus . Furthermore, MDV, NRIV and Potosi virus are all natural reassortants that arose from M RNA segment exchanges [15, 16, 24, 27]. Cholul virus is an exception; it is a natural reassortant that arose from an S RNA segment exchange . The M RNA segment exchange is not the preferred reassortment pattern for California serogroup viruses which usually have homologous M-L and S-M associations and Group C viruses which appear to favor homologous M-L associations [29, 30].
A characteristic feature of the genomes of all viruses in the genus Orthobunyavirus is the presence of complementary nucleotide sequences at the 5′ and 3′ termini of each RNA segment [7, 31,32,33]. The sequences are highly conserved between each RNA segment of a virus and between each virus in the genus. The consensus sequence consists of 11 nt. and is defined as 5’-AGUAGUGUACU-3′, with a mismatch at positions 9 and − 9 at each end, as we observed with LOKV. The terminal 11 nt. are followed by 3 to 4 more nucleotides that are complementary and highly conserved on a segment-specific basis, followed by several more nucleotides that are complementary and usually unique to the individual virus and segment [29, 30]. An inspection of the genome sequence of LOKV revealed the presence of terminal complementary sequences of 21 to 27 nt., with one to four mismatches, at the terminal ends of each segment, consistent with earlier reports that the region of complementary extends beyond the highly conserved 11 nt. consensus sequence.
To conclude, the sequence and phylogenetic data presented in this study provide evidence that LOKV is a reassortant that arose from an M RNA segment exchange. We consider it most likely that LOKV acquired its S and L RNA segments from MDV and its M RNA segment from an unrecognized virus, but additional explanations are also provided. Complete genome sequence data is needed for the accurate detection and identification of viruses, particularly when the virus in question has a segmented genome because reassortants and their precursors could otherwise be mistaken for one another.
Cache Valley virus
Open reading frame
Reverse transcription-polymerase chain reaction
Blitvich BJ, Beaty BJ, Blair CD, Brault AC, Dobler G, Drebot MA, Haddow AD, Kramer LD, LaBeaud AD, Monath TP, Mossel EC, Plante K, Powers AM, Tesh RB, Turell MJ, Vasilakis N, Weaver SC. Bunyavirus taxonomy: limitations and misconceptions associated with the current ICTV criteria used for species demarcation. Am J Trop Med Hyg. 2018;99(1):11–6. in press
Scrivani RP. Lokern (LOK) strain. FMS 4332. Am J Trop Med Hyg. 1970;19(6):1111–2.
Calisher CH, Francy DB, Smith GC, Muth DJ, Lazuick JS, Karabatsos N, Jakob WL, McLean RG. Distribution of Bunyamwera serogroup viruses in North America, 1956-1984. Am J Trop Med Hyg. 1986;35:429–43.
Crane GT, Elbel RE, Francy DB, Calisher CH. Arboviruses from western Utah, USA, 1967-1976. J Med Entomol. 1983;20:294–300.
Kramer WL, Jones RH, Holbrook FR, Walton TE, Calisher CH. Isolation of arboviruses from Culicoides midges (Diptera: Ceratopogonidae) in Colorado during an epizootic of vesicular stomatitis New Jersey. J Med Entomol. 1990;27:487–93.
Meyers MT, Bahnson CS, Hanlon M, Kopral C, Srisinlapaudom S, Cochrane ZN, Sabas CE, Saiyasombat R, Burrough ER, Plummer PJ, O’Connor AM, Marshall KL, Blitvich BJ. Management factors associated with operation-level prevalence of antibodies to Cache Valley virus and other Bunyamwera serogroup viruses in sheep in the United States. Vector Borne Zoonotic Dis. 2015;15:683–93.
Schmaljohn CS, Nichol ST. Bunyaviridae. In: Knipe DM, editor. Fields Virology. 5th ed. Philadelphia: Lippincott Williams and Wilkins; 2007. p. 1741–89.
Barr JN, Wertz GW. Bunyamwera bunyavirus RNA synthesis requires cooperation of 3’- and 5’-terminal sequences. J Virol. 2004;78:1129–38.
Pardigon N, Vialat P, Girard M, Bouloy M. Panhandles and hairpin structures at the termini of Germiston virus RNAs (Bunyavirus). Virology. 1982;122:191–7.
Fuller F, Bhown AS, Bishop DH. Bunyavirus nucleoprotein, N, and a non-structural protein, NSs, are coded by overlapping reading frames in the S RNA. J Gen Virol. 1983;64:1705–14.
Fazakerley JK, Gonzalez-Scarano F, Strickler J, Dietzschold B, Karush F, Nathanson N. Organization of the middle RNA segment of snowshoe hare Bunyavirus. Virology. 1988;167:422–32.
Endres MJ, Jacoby DR, Janssen RS, Gonzalez-Scarano F, Nathanson N. The large viral RNA segment of California serogroup bunyaviruses encodes the large viral protein. J Gen Virol. 1989;70:223–8.
Briese T, Calisher CH, Higgs S. Viruses of the family Bunyaviridae: are all available isolates reassortants? Virology. 2013;446:207–16.
Beaty BJ, Sundin DR, Chandler LJ, Bishop DH. Evolution of bunyaviruses by genome reassortment in dually infected mosquitoes (Aedes triseriatus). Science. 1985;230:548–50.
Briese T, Bird B, Kapoor V, Nichol ST, Lipkin WI. Batai and Ngari viruses: M segment reassortment and association with severe febrile disease outbreaks in East Africa. J Virol. 2006;80:5627–30.
Gerrard SR, Li L, Barrett AD, Nichol ST. Ngari virus is a Bunyamwera virus reassortant that can be associated with large outbreaks of hemorrhagic fever in Africa. J Virol. 2004;78:8922–6.
Langmead B, Salzberg SL. Fast gapped-read alignment with bowtie 2. Nat Methods. 2012;9:357–9.
Kopylova E, Noe L, Touzet H. SortMeRNA: fast and accurate filtering of ribosomal RNAs in metatranscriptomic data. Bioinformatics. 2012;28:3211–7.
Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, Lesin VM, Nikolenko SI, Pham S, Prjibelski AD, Pyshkin AV, Sirotkin AV, Vyahhi N, Tesler G, Alekseyev MA, Pevzner PA. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol. 2012;19:455–77.
Harris R. Improved pairwise alignment of genomic DNA: The Pennsylvania State University; 2007. Available at: http://www.bx.psu.edu/~rsharris/rsharris_phd_thesis_2007.pdf
Milne I, Stephen G, Bayer M, Cock PJ, Pritchard L, Cardle L, Shaw PD, Marshall D. Using tablet for visual exploration of second-generation sequencing data. Brief Bioinform. 2013;14:193–202.
Ronquist F, Huelsenbeck JP. MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics. 2003;19:1572–4.
Drummond AJ, Rambaut A. BEAST: Bayesian evolutionary analysis by sampling trees. BMC Evol Biol. 2007;7:214–21.
Briese T, Kapoor V, Lipkin WI. Natural M-segment reassortment in Potosi and main drain viruses: implications for the evolution of orthobunyaviruses. Arch Virol. 2007;152:2237–47.
Campbell GL, Hardy JL, Eldridge BF, Reeves WC. Isolation of Northway serotype and other Bunyamwera serogroup bunyaviruses from California and Oregon mosquitoes, 1969-1985. Am J Trop Med Hyg. 1991;44:581–8.
Pringle CR, Lees JF, Clark W, Elliott RM. Genome subunit reassortment among Bunyaviruses analysed by dot hybridization using molecularly cloned complementary DNA probes. Virology. 1984;135:244–56.
Yanase T, Kato T, Yamakawa M, Takayoshi K, Nakamura K, Kokuba T, Tsuda T. Genetic characterization of Batai virus indicates a genomic reassortment between orthobunyaviruses in nature. Arch Virol. 2006;151:2253–60.
Blitvich BJ, Saiyasombat R, Dorman KS, Garcia-Rejon JE, Farfan-Ale JA, Lorono-Pino MA. Sequence and phylogenetic data indicate that an orthobunyavirus recently detected in the Yucatan peninsula of Mexico is a novel reassortant of Potosi and Cache Valley viruses. Arch Virol. 2012;157:1199–204.
Urquidi V, Bishop DH. Non-random reassortment between the tripartite RNA genomes of La Crosse and snowshoe hare viruses. J Gen Virol. 1992;73:2255–65.
Nunes MR, da Rosa AP, Weaver SC, Tesh RB, Vasconcelos PF. Molecular epidemiology of group C viruses (Bunyaviridae, Orthobunyavirus) isolated in the Americas. J Virol. 2005;79:10561–70.
Elliott RM. Molecular biology of the Bunyaviridae. J Gen Virol. 1990;71:501–22.
Elliott RM, Schmaljohn CS, Collett MS. Bunyaviridae genome structure and gene expression. Curr Top Microbiol Immunol. 1991;169:91–141.
Gerlach P, Malet H, Cusack S, Reguera J. Structural insights into bunyavirus replication and its regulation by the vRNA promoter. Cell. 2015;161:1267–79.
The authors thank Robert Tesh for providing the isolate of LOKV.
The study was supported by an intramural grant from Iowa State University.
Availability of data and materials
All data from the current study are available from the corresponding author on request.
Ethics approval and consent to participate
Consent for publication
All authors consent for publication.
The authors declare no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.