Occurrence of a novel mastrevirus in sugarcane germplasm collections in Florida, Guadeloupe and Réunion
Virology Journalvolume 14, Article number: 146 (2017)
In Africa and Asia, sugarcane is the host of at least seven different virus species in the genus Mastrevirus of the family Geminiviridae. However, with the exception of Sugarcane white streak virus in Barbados, no other sugarcane-infecting mastrevirus has been reported in the New World. Conservation and exchange of sugarcane germplasm using stalk cuttings facilitates the spread of sugarcane-infecting viruses.
A virion-associated nucleic acids (VANA)-based metagenomics approach was used to detect mastrevirus sequences in 717 sugarcane samples from Florida (USA), Guadeloupe (French West Indies), and Réunion (Mascarene Islands). Contig assembly was performed using CAP3 and sequence searches using BLASTn and BLASTx. Mastrevirus full genomes were enriched from total DNA by rolling circle amplification, cloned and sequenced. Nucleotide and amino acid sequence identities were determined using SDT v1.2. Phylogenetic analyses were conducted using MEGA6 and PHYML3.
We identified a new sugarcane-infecting mastrevirus in six plants sampled from germplasm collections in Florida and Guadeloupe. Full genome sequences were determined and analyzed for three virus isolates from Florida, and three from Guadeloupe. These six genomes share >88% genome-wide pairwise identity with one another and between 89 and 97% identity with a recently identified mastrevirus (KR150789) from a sugarcane plant sampled in China. Sequences similar to these were also identified in sugarcane plants in Réunion.
As these virus isolates share <64% genome-wide identity with all other known mastreviruses, we propose classifying them within a new mastrevirus species named Sugarcane striate virus. This is the first report of sugarcane striate virus (SCStV) in the Western Hemisphere, a virus that most likely originated in Asia. The distribution, vector, and impact of SCStV on sugarcane production remains to be determined.
Sugarcane (interspecific hybrids of Saccharum spp.) is an economically important perennial crop grown mainly for sugar and ethanol production in the tropical and sub-tropical regions of the world. Sugarcane is not native to the vast majority of countries where it is commercially grown. Introduction of sugarcane clones is therefore essential to establish sugarcane industries or for breeding purposes. In a production area, sugarcane clones are vegetatively propagated via stalk pieces (cuttings) and this planting material is also used for movement of germplasm between geographical locations. However, this material can be infected by various pathogens . Nowadays, quarantine stations that employ robust diagnostic techniques are used for exchange of sugarcane germplasm . This was not the case several decades ago or during the nineteenth century when S. officinarum clones were collected, especially in New Guinea (Melanesia) which is generally accepted as a center of diversity for the genus Saccharum . Additionally, symptomless plants infected with unknown viral pathogens may still escape quarantine procedures when adequate diagnostic methods are not available.
Sugarcane can be infected by viruses belonging to at least seven different virus species in the genus Mastrevirus of the family Geminiviridae: Sugarcane streak virus , Sugarcane streak Egypt virus , Sugarcane streak Reunion virus , Sugarcane white streak virus , Maize streak virus , Saccharum streak virus , and Sugarcane chlorotic streak virus . Strikingly, with the exception of sugarcane white streak virus (SCWSV), six of these viruses have natural geographical ranges that are apparently restricted to Africa and the Indian Ocean islands off the African coast. Although most sugarcane-infecting mastreviruses are primarily found in Africa, they may appear less prevalent elsewhere in the world simply because less effort has been expended to detect them outside Africa.
In this regard, the use of high throughput sequencing based approaches and/or rolling circle amplification (RCA) could enable a more balanced global search for sugarcane-infecting mastreviruses. These techniques coupled with full-genome cloning and sequencing have recently proved useful in identifying unknown mastreviruses within both sugarcane field and quarantine contexts [7, 10], and have revealed the presence of mastreviruses in parts of the world where they were previously unknown [7, 11, 12].
Leaf samples from different species of Saccharum and related genera were collected from the Biological Resource Centre for Tropical Plants in Guadeloupe in 2013 (n = 300), from commercial fields in the Everglades Agricultural Area (Belle Glade and Clewiston, FL) in 2013/2014 (n = 95), the germplasm collection of the USDA-ARS (Miami, FL) in 2013/2014 (n = 113), and from CIRAD’s germplasm collection in Réunion Island in 2014 (n = 209) (Table 1). This sampling was part of a project whose goal was to catalogue sugarcane-infecting virus species and characterize viral communities within sugarcane plants. These leaf samples represent plants that originated (source of first collection or source of sugarcane hybrid creation) from at least 36 different geographical locations (source of >50 clones unknown; Table 2). Freshly collected leaf pieces taken from the top visible dewlap leaf were sealed in plastic bags or dried on CaCl2 and then shipped to CIRAD in Montpellier, France where all samples were further processed. A virion-associated nucleic acids (VANA)-based metagenomics approach was used to analyze the viromes of each of the 717 plant samples from these four sites, as described by Palanga et al. . Briefly, total nucleic acids were extracted from homogenized leaf tissue and used to produce single strand cDNA libraries. Double strand cDNA libraries were produced by klenow polymerization and amplified by PCR. Tagged DNA amplicons from 96 samples were pooled and sequenced using a 454 sequencing plate (Beckman Coulters Genomics, USA).
Following de novo contig assembly of cleaned reads performed using CAP3  and searches using BLASTn (Basic Local Alignment Search Tool) and BLASTx , 34 contigs with detectable homology to mastreviruses were identified from nine of these 717 plant samples (four from Florida, three from Guadeloupe, and two from Réunion). BLASTx searches revealed that these contigs potentially encode proteins from four different mastrevirus-like genes (V1, V2, C1 and C2) that were between 86 and 100% identical to those encoded by the genome of a sugarcane-infecting mastrevirus from China that was deposited in GenBank in September 2015 (isolate WZG, GenBank accession number KR150789). It is noteworthy that this virus genome from China was identified by its depositor (Wen and collaborators, Guangxi University, China) as a sugarcane streak virus isolate, despite the fact that at the time it shared less than 64% pairwise genome sequence identity with any other known mastrevirus and should therefore have been identified as a novel species.
Only one of the nine plants within which mastrevirus sequences were detected (a S. officinarum from the Miami germplasm collection, NG28-020) displayed any discernable chlorotic streak- or striation-like symptoms such as those caused by many of the known monocotyledonous plant-infecting mastreviruses (Fig. 1). Total DNA was extracted from the samples from Florida/Miami (three plants of Saccharum spontaneum Iranspon, one of S. barberi Ketari, one of S. officinarum NG28-020, and one from S. officinarum Pundia), Guadeloupe (one plant of noble cane, S. officinarum EK2, and one plant each of commercial sugarcane cultivars TC3 and TC9), and Réunion (one plant of S. barberi Sararoo 1492 and one plant from S. sinense UBA Aust). Total DNA was enriched for mastrevirus full genomes by rolling circle amplification (RCA) using Phi29 DNA polymerase (TempliPhi™, GE Healthcare, USA) as previously described by Shepherd et al. . The RCA products were either used as templates for polymerase chain reaction (PCR)-based amplification using a set of primers designed based on the VANA contigs (Table 3), or were restricted using either BamHI or PstI. The amplified products were ligated to pJET1.2 (Thermo Fisher USA), whereas the restricted ~2.8Kb mastrevirus genome-length fragments were ligated to pBlueScript (Agilent, USA). The resulting recombinant plasmids were Sanger sequenced by primer walking at Beckman Coulter Genomics (plasmids from the Florida/Miami site) and Macrogen Inc. (plasmids from the Guadeloupe site).
Seventeen genome sequences (2738-2749 nt) were obtained using either RCA followed by enzymatic restriction, or PCR using back-to-back primers, followed by cloning of full length restricted fragments or full length amplicons and Sanger sequencing (Table 3). Three full genome sequences were obtained from a single S. barberi plant (FL_579-1, −2 and −3), five from two different S. officinarum plants (FL_362-1, GP_EK2-1, −2, −3 and −4), four from three different S. spontaneum plants (FL_30-1, −2, FL_434-2 and FL_897-1) and five from two different sugarcane hybrids (GP_TC3-1, −2, GP_TC9-1, −2 and −3; Table 3). RCA amplification products were obtained for three additional infected host plants: S. officinarum Pundia from the USA collection, and S. barberi Sararoo 1492 and S. sinense UBA Aust from the collection in Réunion. However, cloning of these amplicons was unsuccessful after several attempts and for unknown reasons. BLASTn comparisons between the 17 genome sequences and those in GenBank indicated that they were all most similar (89 to 97% identity) to the novel sugarcane-infecting mastrevirus genome sequence from China deposited under GenBank accession number KR150789.
One representative virus isolate was selected for each of the six host plants that yielded entire genome sequences: FL_434-2 (S. spontaneum Iranspon), FL_579-1 (S. barberi Ketari), FL_362-1 (S. officinarum NG28-020), GP_EK2-1 (S. officinarum EK2), GP_TC3-1 (Saccharum hybrid TC3), and GP_TC9-1 (Saccharum hybrid TC9). These six new sugarcane-infecting mastrevirus sequences were aligned together with the sugarcane mastrevirus sequence from China and a representative set of mastrevirus genome sequences that were available in GenBank in January 2017. Whole-genome pairwise nucleotide sequence identities, and replication associated protein (Rep) and capsid protein (CP) pairwise amino acid sequence identities were determined using SDT v1.2 . The six new mastreviruses isolates and their close relative from China share <64% genome-wide identity with other known mastreviruses and thus, based on the International Committee for Virus Taxonomy endorsed mastrevirus species demarcation threshold of 78% genome-wide identity, these virus isolates should all be classified as belonging to a new species: one which we propose be named Sugarcane striate virus based on the symptoms observed on one of the infected S. officinarum plants from which one of the new genomes was isolated (Fig. 1). It is noteworthy that no other RNA or DNA virus sequence was found in this plant using the metagenomics approach described above, thus providing strong evidence for the association between the striation-like symptoms and the new mastrevirus.
Based on the accepted mastrevirus strain demarcation threshold of 94% genome-wide nucleotide sequence identity , the six sugarcane striate virus (SCStV) isolates were further classified into four different strains named A, B, C and D (Fig. 1). The three isolates from Guadeloupe and the isolate from China (isolated from S. officinarum and Saccharum interspecific hybrids) all belong to strain A, whereas the other three strains are comprised of the three isolates from Florida. These latter three strains were each associated with a specific species of Saccharum: the strain B isolate (FL_362-1) was found in a S. officinarum plant, the strain C isolate (FL_579-1) was found in a S. barberi plant and the strain D isolate (FL_434-2) was found in a plant of S. spontaneum variety Iranspon. A maximum-likelihood phylogenetic tree (constructed with the best fitting model, GTR + G + I using jModelTest ) based on the full genome alignment also supports the proposed strain classification (Fig. 1).
Furthermore, four contigs from each of two sugarcane plants from Réunion Island (S. barberi Sararoo 1492 and S. sinense UBA_Aust, both originating from India) ranging in size from between 171 and 1214 nucleotides were obtained after de novo assembly from the VANA reads. These contigs corresponded to mastrevirus V1, V2 and C1 open reading frames that were between 93 and 100% identical to the homologous regions of the SCStV isolate from China, WZG, and 99% identical to the homologous regions of the three isolates from Guadeloupe. The presence of SCStV in the two samples from Réunion was validated by PCR with primers developed using the high throughput sequencing data (data not shown). We can therefore conclude that isolates of SCStV are also present within the sugarcane collection held by CIRAD on Réunion Island.
A neighbor-joining phylogenetic tree was inferred from the aligned sequences of one of each of the four SCStV strains together with 69 other representative mastrevirus genomes using MEGA6  (Fig. 2). Additionally, maximum-likelihood phylogenetic trees were constructed from the inferred CP and Rep amino acid sequences encoded by these mastrevirus genomes with PHYML3  (Fig. 2). In all three trees, the SCStV sequences cluster with other monocotyledonous plant-infecting mastreviruses. The SCStV genomes are most closely related to maize streak Reunion virus (MSRV) and wheat dwarf India virus (WDIV) with which they share between 63 and 64% pairwise genome sequence identity (Additional file 1). The SCStV CP and Rep amino acid sequences respectively share <50% and <56% identity with those of other mastreviruses (Additional files 2 and 3).
Movement of sugarcane germplasm using stalk cuttings greatly facilitates the spread of sugarcane viruses, especially unknown viruses for which no detection methods are available or which can escape quarantine facilities in asymptomatic plants. SCStV was found in four different species of Saccharum (S. officinarum, S. barberi, S. spontaneum, and S. sinense) and only in two commercial sugarcane hybrids (TC3 and TC9) although over 400 hybrid clones (Saccharum spp.) were tested (Table 1). This suggests that the virus occurs mainly in botanical species that were collected since the late 1800s . However, it cannot be excluded that the virus occurs and is actively spread in locations that were not sampled herein as sugarcane is grown in more than 100 locations or countries around the world .
The two commercial sugarcane varieties which were found infected by SCStV herein have been created in Malaysia. With the exception of sugarcane bacilliform virus (a virus that can integrate the sugarcane genome), no sugarcane virus has been reported as seed (also called fuzz) transmitted . Assuming that SCStV is not seed transmitted either, cultivars TC3 and TC9 have been infected by this virus in Malaysia, and the insect vector(s) (which is presumably a leafhopper) should be present at least in this geographical location. The vector(s) might also be present in China where isolate WZG was also collected from sugarcane (identity and source unknown).
It is clear from our study that SCStV has a broader geographical distribution than any known mastrevirus species. Although its nearest relatives have been found in India (WDIV), Africa and Réunion Island (MSRV), both of these species are distantly related enough to SCStV that they provide little resolution with respect to where SCStV may have originated (other than probably somewhere in the eastern hemisphere). However, plants infected by SCStV in this study were all originally sourced from Asian countries (India, Indonesia, Iran, Malaysia, New Guinea), thus suggesting an Asian origin for SCStV. In this regard, it is of some concern that this virus is presently resident within at least two locations in the western hemisphere (USA and Guadeloupe). It is noteworthy that the SCStV isolates from the USA were identified in plants from three varieties that were introduced to the world germplasm collection in Miami, Florida more than six decades ago. Up until now, SCStV has not been identified in commercially grown sugarcane in Florida, possibly because its insect vector(s) does not occur naturally in Florida.
SCStV is a newly reported sugarcane virus whose biology is unknown. It would be of great interest therefore to determine (1) the natural vector(s) of SCStV; (2) the distribution of this virus and its vector across the USA, Central America and the Caribbean, (3) the prevalence of the different SCStV strains that have so far been identified in the USA; and (4) the impact of SCStV on sugarcane growth and yields. All of this information will be crucial to assess the economic risks that are posed by SCStV.
Maize streak Reunion virus
Polymerase chain reaction
Rolling circle amplification
Replication associated protein
Sugarcane striate virus
Sugarcane white streak virus
Virion-associated nucleic acids
Wheat dwarf India virus
Rott P, Bailey RA, Comstock JC, Croft BJ, Saumtally AS. A guide to sugarcane diseases. La Librairie du Cirad. 2000. 339 p.
Guinet-Brial I, Girard JC, Roumagnac P, Daugrois JH, Fernandez E, Rott P. Visacane, an innovative quarantine tool for the exchange of pest and disease-free sugarcane germplasm. Proc Intern Soc Sugar Cane Technol. 2013;28:BP13.
Croft BJ, Piggin CM, Wallis ES, Hogarth DM. Sugarcane germplasm conservation and exchange. Aciar Proceedings No. 67. 1995. 134 p.
Hughes FL, Rybicki EP, Kirby R. Complete nucleotide sequence of sugarcane streak Monogeminivirus. Arch Virol. 1993;132:171–82.
Bigarre L, Salah M, Granier M, Frutos R, Thouvenel J-C, Peterschmitt M. Nucleotide sequence evidence for three distinct sugarcane streak mastreviruses. Arch Virol. 1999;144:2331–44.
Peterschmitt M, Reynaud B, Sommermeyer G, Baudin P. Characterization of maize streak virus isolates using monoclonal and polyclonal antibodies and by transmission to a few hosts. Plant Dis. 1991;75(1):27–32. http://www.apsnet.org/publications/plantdisease/backissues/Documents/1991Articles/PlantDisease75n01_27.PDF.
Candresse T, Filloux D, Muhire B, Julian C, Galzi S, Fort G, Bernardo P, Daugrois JH, Fernandez E, Martin DP, Varsani A, Roumagnac P. Appearances can be deceptive: revealing a hidden viral infection with deep sequencing in a plant quarantine context. PLoS One. 2014;9(7):e102945. doi:10.1371/journal.pone.0102945.
van Antwerpen T, McFarlane SA, Buchanan GF, Shepherd DN, Martin DP, Rybicki EP, Varsani A. First report of Maize streak virus field infection of sugarcane in South Africa. Plant Dis. 2008;92(6):982.
Lawry R, Martin DP, Shepherd DN, van Antwerpen T, Varsani A. A novel sugarcane-infecting mastrevirus from South Africa. Arch Virol. 2009;154:1699–703.
Yahaya A, Dangora DB, Alegbejo MD, Kumar PL, Alabi OJ. Identification and molecular characterization of a novel sugarcane streak mastrevirus and an isolate of the A-strain of maize streak virus from sugarcane in Nigeria. Arch Virol. 2016; doi:10.1007/s00705-016-3148-5.
Agindotan BO, Domier LL, Bradley CA. Detection and characterization of the first north American mastrevirus in switchgrass. Arch Virol. 2015;160(5):1313–7.
Rosario K, Padilla-Rodriguez M, Kraberger S, Stainton D, Martin DP, Breitbart M, Varsani A. Discovery of a novel mastrevirus and alphasatellite-like circular DNA in dragonflies (Epiprocta) from Puerto Rico. Virus Res. 2013;171:231–7.
Palanga E, Filloux D, Martin DP, Fernandes E, Gargani D, Ferdinand R, Zabre J, Bouda Z, Neya JB, Sawadogo M, Traore O, Peterschmitt M, Roumagnac P. Metagenomic-based screening and molecular characterization of cowpea-infecting viruses in Burkina Faso. PLoS One. 2016;11:e0165188.
Huang XQ, Madan A. CAP3: a DNA sequence assembly program. Genome Res. 1999;9:868–77.
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215(3):403–10.
Shepherd DN, Martin DP, Lefeuvre P, Monjane AL, Owor BE, Rybicki EP, Varsani A. A protocol for the rapid isolation of full geminivirus genomes from dried plant tissue. J Virol Methods. 2008;149:97–102.
Muhire BM, Varsani A, Martin DP. SDT: a virus classification tool based on pairwise sequence alignment and identity calculation. PLoS One. 2014;9(9):e108277.
Muhire BM, Martin DP, Brown JK, Navas-Castillo J, Moriones E, Zerbini FM, Rivera-Bustamante R, Malathi VG, Briddon RW, Varsani A. A genomewide pairwise-identity-based proposal for the classification of viruses in the genus Mastrevirus (family Geminiviridae). Arch Virol. 2013;158(6):1411–24.
Darriba D, Taboada GL, Doallo R, Posada D. jModelTest 2: more models, new heuristics and parallel computing. Nat Methods. 2012;9(8):772.
Tamura K, Stecher G, Peterson D, Filipski A, Kumar S. MEGA6: molecular evolutionary genetics analysis version 6.0. Mol Biol Evol. 2013;30(12):2725–9.
Guindon S, Dufayard JF, Lefort V, Anisimova M, Hordijk W, Gascuel O. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst Biol. 2010;59:307–21.
FAOSTAT. http://www.fao.org/faostat/en/#data/QC. 2017. Accessed 1 April 2017.
Abascal F, Zardoya R, Posada D. ProtTest: selection of best-fit models of protein evolution. Bioinformatics. 2005;21:2104–5.
This research would not have been possible without funding provided by the Florida Sugar Cane League (Project # 00107475 and Fund # F000057). The work is supported by the USDA National Institute of Food and Agriculture [project Hatch/Rott FLA-BGL-005404], the Région Réunion and the European Union (FEDER; Action #1.03). DPM and AV are supported by the National Research Foundation of South Africa. PR is supported by EU grant FP7-PEOPLE-2013-IOF (N° PIOF-GA-2013-622571).
See details in acknowledgment section above. The funding bodies were not involved in the study design, collection of samples, and interpretation of data or preparation of the manuscript.
Availability of data and materials
The datasets generated and/or analyzed during the current study are publicly available in GenBank (KX352040 - KX352056) [https://www.ncbi.nlm.nih.gov/genbank/].
Plant sampling permission
Plant samples used in this study to identify plant virus sequences were collected from germplasm collections and commercial fields with the approval of the owners of these plants (BRCTP in Guadeloupe, CIRAD in Réunion, USDA-ARS, US Sugar, Florida Crystals Corporation, and SCGC in Florida).
Ethics approval and consent to participate
Consent for publication
Author JML has received research grants from the Région Réunion and the European Union (FEDER; Action #1.03). Authors DPM and AV have received research grants from the National Research Foundation of South Africa. Author PR has received an EU grant FP7-PEOPLE-2013-IOF (N° PIOF-GA-2013-622,571). Author PCR has received research grants from the Florida Sugar Cane League (Project # 00107475 and Fund # F000057). Authors JML, DPM, AV, PR, and PCR declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.