The complete genome sequence of a Crimean-Congo Hemorrhagic Fever virus isolated from an endemic region in Kosovo
- Darja Duh1,
- Stuart T Nichol2,
- Marina L Khristova2,
- Ana Saksida1,
- Iva Hafner-Bratkovič3,
- Miroslav Petrovec1,
- Iusuf Dedushaj4,
- Salih Ahmeti5 and
- Tatjana Avšič-Županc1Email author
© Duh et al. 2008
Received: 11 December 2007
Accepted: 15 January 2008
Published: 15 January 2008
The Balkan region and Kosovo in particular, is a well-known Crimean-Congo hemorrhagic fever (CCHF) endemic region, with frequent epidemic outbreaks and sporadic cases occurring with a hospitalized case fatality of approximately 30%. Recent analysis of complete genome sequences of diverse CCHF virus strains showed that the genome plasticity of the virus is surprisingly high for an arthropod-borne virus. High levels of nucleotide and amino acid differences, frequent RNA segment reassortment and even RNA recombination have been recently described. This diversity illustrates the need to determine the complete genome sequence of CCHF virus representatives of all geographically distinct endemic areas, particularly in light of the high pathogenicity of the virus and its listing as a potential bioterrorism threat. Here we describe the first complete CCHF virus genome sequence of a virus (strain Kosova Hoti) isolated from a hemorrhagic fever case in the Balkans. This virus strain was isolated from a fatal CCHF case, and passaged only twice on Vero E6 cells prior to sequence analysis. The virus total genome was found to be 19.2 kb in length, consisting of a 1672 nucleotide (nt) S segment, a 5364 nt M segment and a 12150 nt L segment. Phylogenetic analysis of CCHF virus complete genomes placed the Kosova Hoti strain in the Europe/Turkey group, with highest similarity seen with Russian isolates. The virus M segments are the most diverse with up to 31 and 27% differences seen at the nt and amino acid levels, and even 1.9% amino acid difference found between the Kosova Hoti and another strain from Kosovo (9553-01). This suggests that distinct virus strains can coexist in highly endemic areas.
Bioinformatics analysis of complete microbial genomes has led to advances in the development of novel diagnostic techniques, in the research of microbial pathogenesis, and in the control and prevention of infectious diseases. Until the year 2006, only 2 complete genomes of Crimean-Congo hemorrhagic fever virus (CCHFV) had been sequenced . CCHFV, is a tick-borne virus with tripartite RNA genome (S, M and L segment), and is the causative agent of a lethal zoonosis named Crimean-Congo hemorrhagic fever (CCHF). The virus is distributed over much of Asia, extending from China to the Middle East and Southern Russia and to the focal endemic areas in Africa and southern Europe, including Kosovo and Turkey . Yearly epidemics, as well as sporadic cases of CCHF are seen in some of these areas, often with high case fatality (approx. 30%) . CCHFV can be transmitted to humans by bites of Ixodid ticks and by the contact with blood or tissue from viremic livestock and human patients . Development of diagnostic approaches and potential vaccines is dependent on knowledge of the broad geographic distribution of diverse virus variants and on understanding of the extent of virus genetic reassortment and recombination [3, 4]. The analysis of the 16 existing complete CCHFV genomes up to date indicated considerable evolution and high diversity of CCHFV [1, 5]. Presumably this reflects the typical high polymerase error rates seen with negative stranded RNA viruses. In addition, previous reports have found evidence of RNA segment reassortment events between CCHFV M segments, and the recombination in CCHFV S segments [1, 3, 4]. The genetic diversity of CCHFV, its virulence, and its potential as a bioterrorism agent, make it important to obtain the complete genome of CCHFV from all geographically distinct endemic areas.
The Balkan peninsula, and Kosovo in particular, is a well-known endemic region for CCHF, and epidemic outbreaks and sporadic cases have been frequently been recorded [6–8]. Five nucleotide sequences of CCHFV from Kosovo have been published [9–12]. Three of them are partial sequences of S segment, the remaining 2 represent complete sequences of S and M segment of different CCHFV strains, Kosova Hoti and Kosovo 9553-01, respectively. We describe the first complete CCHFV genome sequence of a virus (strain Kosova Hoti) isolated from a hemorrhagic fever case in the Balkans.
The CCHFV Kosova Hoti strain was isolated from a blood of a female fatal case during the epidemic in Kosovo in 2001 . The blood was taken on the 5th day after onset of symptoms. Results of the laboratory analysis showed the presence of IgM antibodies (titer 1:400) and the presence of viral RNA in the concentration of 1.08 × 1010 copies per mL of serum. Virus was grown on Vero E6 cells in BSL-3 laboratory. Viral RNA was extracted with the Trizol reagent from the second passage of the CCHFV in Vero E6 cells, and used for the direct sequencing of the complete genome of the virus. Amplicons of S, M and L full length segments were obtained by following the protocols described previously [1, 9, 13]. Briefly, a total of 16 S, 40 M and 84 L sequencing primers were used to generate the complete sequence of the S, M and L segments and these are deposited in the GenBank under the accession numbers DQ133507, EU037902 and EU044832, respectively. Sequence alignment of CCHFV Kosova Hoti strain complete genome with preexisting CCHFV genomes was performed using the CLUSTAL W algorithm of MegAlign module (Lasergene 1999, DNASTAR, USA). Phylogenetic relationships of different CCHFV strains were established with a software package TREECON . The phylogenetic tree was constructed by the neighbor-joining method. The topology of the tree was obtained with the Kimura 80 model and support for the tree nodes was calculated with 500 bootstrap replicates. SignalIP was used to predict the signal sequence cleavage site and TMHMM 2.0 was used to predict transmembrane helices of M segment [15, 16]. The amino acid (aa) sequence of L segment was subjected to the PSI-BLAST and PredictProtein server for search of conserved aa motifs [17, 18].
The genome size of CCHFV, strain Kosova Hoti, was found to be approximately 19.2 kb in length, consisting of a 1672 nucleotide (nt) S segment, a 5364 nt M segment and a 12150 nt L segment. The open reading frame (ORF) of S segment is 1449 nt in length, encoding a 482 aa (nt position 56 – 1504) nucleocapsid protein. The ORF lengths of M and L segments are 5067 nt/1688 aa (nt position 78–5144) and 11838 nt/3945 aa (nt position 78–11915), respectively.
The difference between complete S segment of CCHFV strain Kosova Hoti and other strains in the V. group (Europe/Turkey) calculated by the MegAlign module.
S segment, difference (%)
nt sequence (complete)
nt sequence (ORF)
non-synonymous mutations (%)
The difference between complete M segment of CCHFV strain Kosova Hoti and other strains in the V. group (Europe/Turkey) calculated by the MegAlign module. Table includes a separate column for the Mucin-like variable region present in M segment.
M segment, difference (%)
nt sequence (complete)
nt sequence (ORF)
non- synonymous mutations (%)
Mucin-like VR, aa 28–251 (% difference)
The difference between complete L segment of CCHFV strain Kosova Hoti and other strains in the V. group (Europe/Turkey) calculated by the MegAlign module.
L segment, difference (%)
nt sequence (complete)
nt sequence (ORF)
non-synonymous mutations (%)
The analysis of the Kosova Hoti strain M segment encoded polyprotein predicted the cleavage of the signal peptide to occur between aa 27 and 28 (AHG-QS). This site is identical to those described for Kosovo 9553-01 and Kashmanov but differs from other strains in group V. (Fig. 2). The mucin-like variable region of Kosova Hoti strain polyprotein stretches from aa 28 to 251 and differs by up to 20.5% from Turkish 200310849 strain (Table 2). Tetrapeptides RSKR251, RKLL523 and RKPL1043 were identified in Kosova Hoti and are identical among all strains in V. group. They represent the cleavage sites for GP38, Gn and Gc proteins, respectively [21, 22]. The RKLL523 tetrapeptid of Kosova Hoti is typical for all strains in group V (Europe/Turkey) but it differs from RRLL tetrapeptid in all other CCHFV strains sequenced. However, both tetrapeptides constitute a cleavage recognition site for subtilase SKI-1 [12, 22, 23]. Five transmembrane helices were predicted for polyprotein of Kosova Hoti as shown on Figure 2.
Analysis of L protein encoded by the L segment of the Kosova Hoti strain revealed the conserved OTU-like protease domain from aa 35 to 152 (Fig. 2). The identified sequence G37 DGN40 CFYHSIAE.....151 HFD with the catalytic triad (indicated in bold) was identical among all CCHFV strains used in the L segment alignment (Fig. 1, panel C). Amino-acids 2043–2714 corresponded to the RNA-dependent RNA polymerase catalytic domain, similarly to the Nigerian IbAr10200 strain . In addition, a zinc finger C2H2-type domain (aa 609–632) was found in the L protein of Kosova Hoti, but a previously identified leucine zipper could not be predicted. A leucine zipper motif (composed of three heptads) previously identified at aa 1386–1407 in the L sequence of a Nigerian strain [24, 25], was not identified in the Kosova Hoti L sequence. However, the L sequence of Kosova Hoti (and other strains from group V) in this region differs from the Nigerian strain only in the substitution of the leucine for isoleucine at the position 1386.
Frequently it is observed that arthropod-borne viruses of vertebrates exhibit low genetic diversity which is thought to be due to essentially a double filter in operation, whereby evolution of these viruses is tightly constrained by the need to maintain high fitness in both vertebrate and arthropod host environments . The very high genetic diversity seen in CCHFV is a strikingly exception. Presumably less constraint or greater positive selection is molding the evolutionary pattern of this virus. The complete genome of this representative CCHFV isolate (Kosova Hoti) from a highly endemic region of the Balkans is clearly divergent from strains present in other endemic regions of the world, and considerable sequence difference is even observed among virus strains found within Kosovo. These findings have importance for design of molecular diagnostic tools and vaccine development efforts, as they clearly illustrate the need to consider the high viral diversity and complexity of CCHF viral variant geographic distribution in these efforts.
We thank Mateja Jelovšek for the serological testing and Miša Korva for the alignments. This work was supported by RiViGene (Contract No. SSPE-CT-2005-022639).
- Deyde VM, Khristova ML, Rollin PE, Ksiazek TG, Nichol ST: Crimean-Congo hemorrhagic fever virus genomics and global diversity. J Virol 2006, 80:8834–8842.View ArticlePubMed
- Ergonul O: Crimean-Congo haemorrhagic fever. Lancet Infect Dis 2006, 6:203–214.View ArticlePubMed
- Hewson R, Gmyl A, Gmyl L, Smirnova SE, Karganova G, Jamil B, Hasan R, Chamberlain J, Clegg C: Evidence of segment reassortment in Crimean-Congo haemorrhagic fever virus. J Gen Virol 2004, 85:3059–3070.View ArticlePubMed
- Lukashev AN: Evidence for recombination in Crimean-Congo hemorrhagic fever virus. J Gen Virol 2005, 86:2333–2338.View ArticlePubMed
- Meissner JD, Seregin SS, Seregin SV, Vyshemirskii OI, Yakimenko NV, Netesov SV, Petrov VS: The complete genomic sequence of strain ROS/HUVLV-100, a representative Russian Crimean Congo hemorrhagic fever virus strain. Virus Genes 2006, 33:87–93.View ArticlePubMed
- Avšič-Županc T: Epidemiology of Crimean-Congo Hemorrhagic Fever in the Balkans. Crimean-Congo Hemorrhagic Fever: A Global Perspective (Edited by: Ergonul OWCA). Dordrecht, Springer 2007, 75–88.
- Ahmeti S, Raka L: Crimean-Congo haemorrhagic fever in Kosova : a fatal case report. Virol J 2006, 3:85.View ArticlePubMed
- Vesenjak-Hirjan J, Punda-Polic V, Dobe M: Geographical distribution of arboviruses in Yugoslavia. J Hyg Epidemiol Microbiol Immunol 1991, 35:129–140.PubMed
- Duh D, Saksida A, Petrovec M, Dedushaj I, Avsic-Zupanc T: Novel one-step real-time RT-PCR assay for rapid and specific diagnosis of Crimean-Congo hemorrhagic fever encountered in the Balkans. J Virol Methods 2006, 133:175–179.View ArticlePubMed
- Drosten C, Minnak D, Emmerich P, Schmitz H, Reinicke T: Crimean-Congo hemorrhagic fever in Kosovo. J Clin Microbiol 2002, 40:1122–1123.View ArticlePubMed
- Papa A, Bozovi B, Pavlidou V, Papadimitriou E, Pelemis M, Antoniadis A: Genetic detection and isolation of crimean-congo hemorrhagic fever virus, Kosovo, Yugoslavia. Emerg Infect Dis 2002, 8:852–854.View ArticlePubMed
- Papa A, Papadimitriou E, Bozovic B, Antoniadis A: Genetic characterization of the M RNA segment of a Balkan Crimean-Congo hemorrhagic fever virus strain. J Med Virol 2005, 75:466–469.View ArticlePubMed
- Hewson R, Chamberlain J, Mioulet V, Lloyd G, Jamil B, Hasan R, Gmyl A, Gmyl L, Smirnova SE, Lukashev A, Karganova G, Clegg C: Crimean-Congo haemorrhagic fever virus: sequence analysis of the small RNA segments from a collection of viruses world wide. Virus Res 2004, 102:185–189.View ArticlePubMed
- Van de Peer Y, De Wachter R: TREECON for Windows: a software package for the construction and drawing of evolutionary trees for the Microsoft Windows environment. Comput Appl Biosci 1994, 10:569–570.PubMed
- Sonnhammer EL, von Heijne G, Krogh A: A hidden Markov model for predicting transmembrane helices in protein sequences. Proc Int Conf Intell Syst Mol Biol 1998, 6:175–182.PubMed
- Emanuelsson O, Brunak S, von Heijne G, Nielsen H: Locating proteins in the cell using TargetP, SignalP and related tools. Nat Protoc 2007, 2:953–971.View ArticlePubMed
- Rost B, Yachdav G, Liu J: The PredictProtein server. Nucleic Acids Res 2004, 32:W321–6.View ArticlePubMed
- Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 1997, 25:3389–3402.View ArticlePubMed
- Bergeron E, Vincent MJ, Nichol ST: Crimean Congo hemorrhagic fever virus glycoprotein processing by the endoprotease SKI-1/S1P is critical for virus infectivity. J Virol 2007.
- Erickson BR, Deyde V, Sanchez AJ, Vincent MJ, Nichol ST: N-linked glycosylation of Gn (but not Gc) is important for Crimean Congo hemorrhagic fever virus glycoprotein localization and transport. Virology 2007, 361:348–355.View ArticlePubMed
- Sanchez AJ, Vincent MJ, Erickson BR, Nichol ST: Crimean-congo hemorrhagic fever virus glycoprotein precursor is cleaved by Furin-like and SKI-1 proteases to generate a novel 38-kilodalton glycoprotein. J Virol 2006, 80:514–525.View ArticlePubMed
- Sanchez AJ, Vincent MJ, Nichol ST: Characterization of the glycoproteins of Crimean-Congo hemorrhagic fever virus. J Virol 2002, 76:7263–7275.View ArticlePubMed
- Vincent MJ, Sanchez AJ, Erickson BR, Basak A, Chretien M, Seidah NG, Nichol ST: Crimean-Congo hemorrhagic fever virus glycoprotein proteolytic processing by subtilase SKI-1. J Virol 2003, 77:8640–8649.View ArticlePubMed
- Honig JE, Osborne JC, Nichol ST: Crimean-Congo hemorrhagic fever virus genome L RNA segment and encoded protein. Virology 2004, 321:29–35.View ArticlePubMed
- Kinsella E, Martin SG, Grolla A, Czub M, Feldmann H, Flick R: Sequence determination of the Crimean-Congo hemorrhagic fever virus L segment. Virology 2004, 321:23–28.View ArticlePubMed
- Weaver SC: Evolutionary influences in arboviral disease. Curr Top Microbiol Immunol 2006, 299:285–314.View ArticlePubMed
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.