Comprehensive analysis of the overall codon usage patterns in equine infectious anemia virus
© Yin et al.; licensee BioMed Central Ltd. 2013
Received: 18 October 2013
Accepted: 11 December 2013
Published: 20 December 2013
Equine infectious anemia virus (EIAV) is an important animal model for understanding the relationship between viral persistence and the host immune response during lentiviral infections. Comparison and analysis of the codon usage model between EIAV and its hosts is important for the comprehension of viral evolution. In our study, the codon usage pattern of EIAV was analyzed from the available 29 full-length EIAV genomes through multivariate statistical methods.
Effective number of codons (ENC) suggests that the codon usage among EIAV strains is slightly biased. The ENC-plot analysis demonstrates that mutation pressure plays a substantial role in the codon usage pattern of EIAV, whereas other factors such as geographic distribution and host translation selection also take part in the process of EIAV evolution. Comparative analysis of codon adaptation index (CAI) values among EIAV and its hosts suggests that EIAV utilize the translational resources of horse more efficiently than that of donkey.
The codon usage bias in EIAV is slight and mutation pressure is the main factor that affects codon usage variation in EIAV. These results suggest that EIAV genomic biases are the result of the co-evolution of genome composition and the ability to evade the host’s immune response.
KeywordsEquine infectious anemia virus (EIAV) Codon usage bias Evolution
Equine infectious anemia virus (EIAV) is an important nonprimate enveloped virus, of the retrovirus family, lentivirus genus, along with the human immunodeficiency virus (HIV), simian immunodeficiency virus (SIV) . Among the lentiviruses, EIAV is the least complex lentivirus including only 6 genes. In addition to the gag, pol and env genes coding for the structural and enzymatic proteins coded by gag, pol and env, EIAV also contains three accessory genes: tat, rev and S2. The host range of EIAV is reported to include all members of the Equidae, while susceptible to infection, donkeys do not develop clinical EIA and lower amounts of plasma associated virus .
It is well known that the redundancy of the genetic code allows for multiple codons to encode for a single amino acid, resulting in codon usage biases in genes . The non-random usage of synonymous codons is crucial for the efficient protein translation and correct folding. Indeed, mutation pressure and natural selection are thought to be two major forces that drive the codon usage bias away from an equal usage among genes in different organisms . Understanding the extent and causes of biases in codon usage is important for the comprehension of the pathogen evolution and the relationship between pathogens and the immune response .
Recent efforts to understand codon usage biases in viruses have primarily focused on the hepatitis A virus [7, 8], West Nile virus , foot-and-mouth disease virus , influenza virus , and HIV [12–14]. To date, although the remarkable adenine (A)-richness of the EIAV genome was already discovered several decades ago , few codon usage analyses have been performed on EIAV genome. To gain insight into the characteristics of the viral genome, the synonymous codon usage pattern and the correlation between the codon usage pattern of EIAV and its hosts were investigated in our study.
The complete genome sequences of 29 EIAV strains were obtained from the National Center for Biotechnology Information (NCBI) (http://www.ncbi.nlm.nih.gov/Genbank/). The detailed information about the viruses is listed in Additional file 1: Table S1.
Codon usage analysis
Each nucleotide content and each nucleotide content at the third site of the codon in the EIAV coding sequence were calculated using MEGA4 software. The dinucleotides of the EIAV genome were analyzed by DAMBE software. The relative synonymous codon usage (RSCU) values for EIAV were calculated as previously described . The effective number of codons (ENC), was used to quantify deviations from the expected random codon usage of EIAV ORFs . The ENC values range from 20 to 61, and a low ENC value indicates a strong codon usage bias.
The codon adaptation index (CAI) was used to estimate the adaptation of EIAV to host codons. When the CAI value is much closer to 1, the gene expression level is much higher. The CAI was calculated to compare a given codon usage to a predefined reference set, using the CAIcal approach (available at: http://genomes.urv.es/CAIcal). The synonymous codon usage data for the viral hosts were obtained from the codon usage database (http://www.kazusa.or.jp/codon/).
Principal component analysis
Principal component analyses (PCA) were performed to analyze the major trend in the codon usage model among the different EIAV strains. Each ORF is represented as a 59-dimensional vector and each dimension corresponds to the RSCU value of one sense codon, excluding the codons of AUG, UGG and terminal codons. The major trend within a dataset can be determined using measure of relative inertia and genes ordered according to their position along the axis of major inertia .
Results and discussion
Synonymous codon usage in EIAV
The overall nucleotide contents and nucleotide contents at the synonymous third position of sense codons in EIAV genome
No. of genomes analyzed
T (%) ± std
C (%) ± std
A (%) ± std
G (%) ± std
T 3(%) ± std
C3 (%) ± std
A3 (%) ± std
G3 (%) ± std
25.39 ± 0.32
15.57 ± 0.33
37.09 ± 0.32
21.96 ± 0.24
29.62 ± 0.78
11.52 ± 0.58
38.01 ± 0.83
20.85 ± 0.63
Codon usage in EIAV genomes and its hosts
The effect of mutation pressure on the codon usage of EIAV
Genetic relationship based on synonymous codon usage in EIAV
It has been reported that a strong pattern of geographic clustering is observed for EIAV, with a significant correlation between phylogroups of isolates and major geographic regions . Based on the potential for the geographical factors in influencing EIAV evolution, a plot of f'1 and f'2 was performed according to the geographic distribution. The plots for EIAV isolated from China, Japan, and America were generally divided into three groups, implying that the EIAV isolated from the three countries evolved independently after diverging from a common ancestor (Figure 2B). In addition, we cannot ignore that the plots for EIAV strains V70 and V26 were clustered together with the strains isolated from America. The origin of these strains still remains controversial [23, 24]. Our data demonstrated that these EIAV strains have an American ancestry. Notably, the EIAV Miyazaki2011-A plot was far from the plots of the other strains. Recent reports showed that this EIAV strain was unlikely derived as a result of genomic recombination events and constituted a separate monophyletic group . It is interesting to identify the potential origin of this novel EIAV isolate.
Comparative analysis of the codon usage between EIAV and host cells
The synonymous codon usage pattern of EIAV tended to differ from that of horse and donkey (Table 2 and Additional file 2: Figure S1). To further investigate whether the frequency of codon usage between EIAV and its hosts might have a close relationship with the viral proteins’ expression levels, the CAI were calculated using the horse and donkey codon usage as reference sets . A mean CAI of 0.655 ± 0.020 was obtained for the EIAV ORFs in relation to horse codon usage reference set. A mean CAI of 0.593 ± 0.021 was obtained for the EIAV ORFs in relation to the donkey codon usage reference set. There was a trend for a lower CAI for EIAV in relation to donkey, with the consequent lower efficiency of protein synthesis in donkey. This phenomenon reflected that the interplay of codon usage between EIAV and its hosts may influence viral fitness, survival and evolution.
In conclusion, our comprehensive analysis of the codon usage patterns in EIAV has provided a basic understanding about some of the evolutionary information of EIAV. However, there were some limitations to this study. The sample size was relatively small and may not be fully representative of EIAV. More studies should be carried out to confirm the conjecture.
This work was supported by the Natural Science Foundation of China (Grant No. 31072113), the National Science Foundation for Outstanding Young Scholars of China (Grant No. 31222054), State Key Laboratory of Veterinary Biotechnology (Grant No.SKLVBP201205), and Central Public-interest Scientific Institution Basal Research Fund (Grant No. 2013ZL034).
- Leroux C, Cadore JL, Montelaro RC: Equine Infectious Anemia Virus (EIAV): what has HIV's country cousin got to tell us? Vet Res 2004, 35: 485-512. 10.1051/vetres:2004020PubMedView ArticleGoogle Scholar
- Craigo JK, Montelaro RC: EIAV envelope diversity: shaping viral persistence and encumbering vaccine efficacy. Curr HIV Res 2010, 8: 81-86. 10.2174/157016210790416398PubMedView ArticleGoogle Scholar
- Cook SJ, Cook RF, Montelaro RC, Issel CJ: Differential responses of Equus caballus and Equus asinus to infection with two pathogenic strains of equine infectious anemia virus. Vet Microbiol 2001, 79: 93-109. 10.1016/S0378-1135(00)00348-5PubMedView ArticleGoogle Scholar
- Hershberg R, Petrov DA: Selection on codon bias. Annu Rev Genet 2008, 42: 287-299. 10.1146/annurev.genet.42.110807.091442PubMedView ArticleGoogle Scholar
- Karlin S, Mrazek J: What drives codon choices in human genes? J Mol Biol 1996, 262: 459-472. 10.1006/jmbi.1996.0528PubMedView ArticleGoogle Scholar
- Shackelton LA, Holmes EC: The evolution of large DNA viruses: combining genomic information of viruses and their hosts. Trends Microbiol 2004, 12: 458-465. 10.1016/j.tim.2004.08.005PubMedView ArticleGoogle Scholar
- Ma XX, Feng YP, Chen L, Zhao YQ, Liu JL, Guo JZ, Guo PH, Yang JT, Lu JX, Chen SE, Ma ZR: Mapping codon usage in sequence regions flanking cleavage positions in the hepatitis A virus polyprotein. Genet Mol Res 2013, 12: 2306-2319. 10.4238/2013.July.8.11PubMedView ArticleGoogle Scholar
- Zhang Y, Liu Y, Liu W, Zhou J, Chen H, Wang Y, Ma L, Ding Y, Zhang J: Analysis of synonymous codon usage in hepatitis A virus. Virol J 2011, 8: 174. 10.1186/1743-422X-8-174PubMedPubMed CentralView ArticleGoogle Scholar
- Moratorio G, Iriarte A, Moreno P, Musto H, Cristina J: A detailed comparative analysis on the overall codon usage patterns in West Nile virus. Infect Genet Evol 2013, 14: 396-400.PubMedView ArticleGoogle Scholar
- Zhou JH, You YN, Chen HT, Zhang J, Ma LN, Ding YZ, Pejsak Z, Liu YS: The effects of the synonymous codon usage and tRNA abundance on protein folding of the 3C protease of foot-and-mouth disease virus. Infect Genet Evol 2013, 16: 270-274.PubMedView ArticleGoogle Scholar
- Goni N, Iriarte A, Comas V, Sonora M, Moreno P, Moratorio G, Musto H, Cristina J: Pandemic influenza A virus codon usage revisited: biases, adaptation and implications for vaccine strain development. Virol J 2012, 9: 263. 10.1186/1743-422X-9-263PubMedPubMed CentralView ArticleGoogle Scholar
- van der Kuyl AC, Berkhout B: The biased nucleotide composition of the HIV genome: a constant factor in a highly variable virus. Retrovirology 2012, 9: 92. 10.1186/1742-4690-9-92PubMedPubMed CentralView ArticleGoogle Scholar
- Pandit A, Sinha S: Differential trends in the codon usage patterns in HIV-1 genes. PLoS One 2011, 6: e28889. 10.1371/journal.pone.0028889PubMedPubMed CentralView ArticleGoogle Scholar
- Kypr J, Mrazek J: Unusual codon usage of HIV. Nature 1987, 327: 20.PubMedView ArticleGoogle Scholar
- van Hemert FJ, Berkhout B: The tendency of lentiviral open reading frames to become A-rich: constraints imposed by viral genome organization and cellular tRNA availability. J Mol Evol 1995, 41: 132-140.PubMedView ArticleGoogle Scholar
- Sharp PM, Li WH: An evolutionary perspective on synonymous codon usage in unicellular organisms. J Mol Evol 1986, 24: 28-38. 10.1007/BF02099948PubMedView ArticleGoogle Scholar
- Comeron JM, Aguade M: An evaluation of measures of synonymous codon usage bias. J Mol Evol 1998, 47: 268-274. 10.1007/PL00006384PubMedView ArticleGoogle Scholar
- Puigbo P, Bravo IG, Garcia-Vallve S: E-CAI: a novel server to estimate an expected value of Codon Adaptation Index (eCAI). BMC Bioinformatics 2008, 9: 65. 10.1186/1471-2105-9-65PubMedPubMed CentralView ArticleGoogle Scholar
- Tao P, Dai L, Luo M, Tang F, Tien P, Pan Z: Analysis of synonymous codon usage in classical swine fever virus. Virus Genes 2009, 38: 104-112. 10.1007/s11262-008-0296-zPubMedView ArticleGoogle Scholar
- Zielonka J, Bravo IG, Marino D, Conrad E, Perkovic M, Battenberg M, Cichutek K, Munk C: Restriction of equine infectious anemia virus by equine APOBEC3 cytidine deaminases. J Virol 2009, 83: 7547-7559. 10.1128/JVI.00015-09PubMedPubMed CentralView ArticleGoogle Scholar
- Bogerd HP, Tallmadge RL, Oaks JL, Carpenter S, Cullen BR: Equine infectious anemia virus resists the antiretroviral activity of equine APOBEC3 proteins through a packaging-independent mechanism. J Virol 2008, 82: 11889-11901. 10.1128/JVI.01537-08PubMedPubMed CentralView ArticleGoogle Scholar
- Capomaccio S, Cappelli K, Cook RF, Nardi F, Gifford R, Marenzoni ML, Passamonti F: Geographic structuring of global EIAV isolates: a single origin for New World strains? Virus Res 2012, 163: 656-659. 10.1016/j.virusres.2011.11.011PubMedView ArticleGoogle Scholar
- Zheng YH, Sentsui H, Kono Y, Ikuta K: Mutations occurring during serial passage of Japanese equine infectious anemia virus in primary horse macrophages. Virus Res 2000, 68: 93-98. 10.1016/S0168-1702(00)00147-7PubMedView ArticleGoogle Scholar
- Dong JB, Zhu W, Cook FR, Goto Y, Horii Y, Haga T: Identification of a novel equine infectious anemia virus field strain isolated from feral horses in southern Japan. J Gen Virol 2013, 94: 360-365. 10.1099/vir.0.047498-0PubMedView ArticleGoogle Scholar
- Puigbo P, Bravo IG, Garcia-Vallve S: CAIcal: a combined set of tools to assess codon usage adaptation. Biol Direct 2008, 3: 38. 10.1186/1745-6150-3-38PubMedPubMed CentralView ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.