- Research
- Open access
- Published:
Detection, characterization, and phylogenetic analysis of novel astroviruses from endemic Malagasy fruit bats
Virology Journal volume 21, Article number: 195 (2024)
Abstract
Bats (order: Chiroptera) are known to host a diverse range of viruses, some of which present a human public health risk. Thorough viral surveillance is therefore essential to predict and potentially mitigate zoonotic spillover. Astroviruses (family: Astroviridae) are an understudied group of viruses with a growing amount of indirect evidence for zoonotic transfer. Astroviruses have been detected in bats with significant prevalence and diversity, suggesting that bats may act as important astrovirus hosts. Most astrovirus surveillance in wild bat hosts has, to date, been restricted to single-gene PCR detection and concomitant Sanger sequencing; additionally, many bat species and many geographic regions have not yet been surveyed for astroviruses at all. Here, we use metagenomic Next Generation Sequencing (mNGS) to detect astroviruses in three species of Madagascar fruit bats, Eidolon dupreanum, Pteropus rufus, and Rousettus madagascariensis. We detect numerous partial sequences from all three species and one near-full length astrovirus sequence from Rousettus madagascariensis, which we use to characterize the evolutionary history of astroviruses both within bats and the broader mammalian clade, Mamastrovirus. Taken together, applications of mNGS implicate bats as important astrovirus hosts and demonstrate novel patterns of bat astrovirus evolutionary history, particularly in the Southwest Indian Ocean region.
Introduction
Bats (order: Chiroptera) make up an extremely diverse mammalian order that has been identified as a reservoir of many of the world’s most virulent zoonotic viruses [1,2,3]. Bats’ capacity to host virulent zoonotic viruses without experiencing disease is posited to be a byproduct of their evolution of flight, which necessitated metabolic adaptations leading to both elongated lifespans and immune system modifications promoting the evolution of viruses that are virulent to non-bat hosts [1, 4,5,6]. Furthermore, the behaviors of many bat species, such as social grooming and roosting in dense aggregations, advance the transmission of infection within populations [7]. Despite the established importance of bats as viral reservoirs, existing surveillance of the viruses hosted by bats is uneven and shows marked geographic and taxonomic biases [8, 9].
A prime example is the island of Madagascar. Madagascar is home to 46 bat species with nearly 80% endemism, the result of a unique biogeographical history characterized by long isolation from surrounding landmasses [10, 11]. This isolation has also led to the coevolution of extraordinarily diverse viruses within Malagasy bats [12,13,14]. Moreover, high human-bat contact rates due to hunting, particularly of large fruit bats, across the island increase the potential for zoonotic spillover [15]. Altogether, these factors make Madagascar an especially important location for surveillance of viruses harbored by bats.
A key understudied family of viruses are astroviruses (family: Astroviridae). Astroviruses (AstVs) are non-enveloped, positive-sense, single-stranded RNA viruses. Their genome contains a 5’-untranslated region (UTR), three open reading frames (ORFs)—ORF1a, ORF1b, and ORF2—and a 3’UTR with a poly A tail [16]. AstVs infect a remarkable diversity of hosts, grouping broadly into two genera: Avastroviruses, which infect avian hosts, and Mamastroviruses, which infect mammalian hosts, including humans. Approximately 2–9% of all acute non-bacterial gastrointestinal infections in children are thought to be from astrovirus infection [16, 17]. This lack of host specificity demonstrates AstVs’ efficiency in cross-species transmission, thought to be facilitated by high viral diversity and frequent recombination events [16, 18,19,20,21]. Although there is currently no direct evidence for zoonotic astrovirus transmission, a number of studies present indirect evidence suggesting a historical spillover of AstVs from animals to humans [20,21,22,23,24,25]. Despite their clear importance in both wildlife and public health, we have only recently begun to understand the true diversity of astroviruses and their evolutionary history in non-human hosts.
Astroviruses have been detected at high incidence and with remarkable diversity in both suborders of Chiroptera, Yinpterochiroptera and Yangochiroptera, though these values range widely depending on the study design, species, and location [24,25,26,27,28]. Differing phylogenetic analyses reveal both strong and weak host- and geographic- clustering, with some bat AstVs grouping more closely with members of the avian Avastrovirus genus. High astrovirus diversity detected within members of the same populations of bat hosts suggests the simultaneous circulation of multiple strains, perhaps driven by the co-roosting of multiple bat species in dense aggregations [1, 7, 29]. Altogether this evidence suggests that bats may be an important reservoir source for cross-species transmission or zoonotic spillover of astroviruses. These insights, however, have been largely based on PCR detections and single gene sequences (typically the RNA-dependent RNA polymerase gene, or RdRp) derived from a few species of bats, limiting the potential for more robust evolutionary analysis. Very recently, metagenomic next-generation (mNGS) sequencing has resulted in the detection of five full-genome bat astroviruses globally [30,31,32]. However, many outstanding questions remain regarding astrovirus presence, infection dynamics, and zoonotic risk.
Here, we present paired mNGS/PCR detection and characterization of astroviruses sampled from three species of endemic fruit bat from Madagascar: Pteropus rufus, Rousettus madagascariensis, and Eidolon dupreanum. We identified a novel near-full length astrovirus genome detected in a Rousettus madagascariensis fecal sample and we characterized its evolutionary history among the broader clade of Mamastroviruses. Using PCR-derived RdRp sequences from Rousettus madagascariensis and Eidolon dupreanum, we additionally explored the biogeographical history of bat astroviruses from Madagascar and its surrounding landmasses in the Southwest Indian Ocean Region. Our work sheds light on the evolutionary history of astroviruses between bat suborders, Yinpterochiroptera and Yangochiroptera.
Methods
Sample collection and processing
Astrovirus infections were identified from a dataset of viruses detected in samples from a longitudinal study of fruit bats across Madagascar. Methodological details on bat field sampling and subsequent RNA extraction, library preparation, and Illumina sequencing have been reported in previous work [13, 14]; here, we give only a brief overview.
Between 2018 and 2019, monthly bat captures were carried out at three species-specific locations: Ambakoana roost (-18.513 S, 48.167 E, Pteropus rufus); adjacent Angavobe/Angavokely caves (-18.944 S, 47.949 E, and − 18.933 S, 47.758 E, Eidolon dupreanum); and Maromizaha cave (-18.9623 S, 48.4525 E, Rousettus madagascariensis). Bats were identified to species, sex, and age (adult vs. juvenile), and throat, fecal, and urine samples were collected.
Following field collection, throat, fecal, and urine samples underwent RNA extraction in the Virology Unit at the Institut Pasteur de Madagascar (IPM) using the Zymo Quick DNA/RNA Microprep Plus Kit (Zymo Research, Irvine, CA). In total, RNA from 285 fecal, 143 throat, and 196 urine swab samples was extracted, then stored in -80 °C freezers at IPM, prior to final transport on dry ice to the Chan Zuckerberg Biohub San Francisco (CZB-SF) for eventual library preparation and subsequent mNGS.
Aliquots of each sample were arrayed into a 384 well plate for mNGS library preparation. Samples were evaporated using a GeneVac EV-2 (SP Industries, Warminster, PA, USA) to enable miniaturized library preparation with the NEBNext Ultra II RNA Library Prep Kit (New England Biolabs, Beverly, MA, USA). Library preparation was performed per the manufacturer’s instructions, with the following modifications: 25 pg of External RNA Controls Consortium Spike-in mix (ERCCS, Thermo-Fisher) were added to each sample prior to RNA fragmentation; the input RNA mixture was fragmented for 8 min at 940C prior to reverse transcription; and a total of 14 cycles of PCR with dual-indexed TruSeq adaptors was applied to amplify the resulting individual libraries. Samples were assessed for quality and quantity, then submitted to an Illumina NextSeq 2000 (Illumina, San Diego, CA, USA) for paired-end sequencing (2 × 146 bp). The pipeline used to separate the sequencing output of the individual libraries into FASTQ files of 146 bp paired-end reads is available on GitHub at https://github.com/czbiohub/utilities.
Detection
Raw reads from Illumina were host-filtered, quality-filtered, and assembled on the Chan Zuckerberg Infectious Diseases (CZID) bioinformatics platform (v3.10 NR/NT 2019-12-01) [33], using a host background model of “bat” compiled from all publicly available full-length bat genomes in GenBank at the time of sequencing (July 2019). Samples were marked positive for astrovirus infection if at least two contigs with an average read depth > 2 reads/nucleotide were assembled that showed significant nucleotide or protein BLAST alignment(s) (alignment length > 100nt/aa and E-value < 0.00001 for nucleotide BLAST/bit score > 100 for protein BLAST) to astroviruses present in the NCBI NR/NT database (v12-01-2019). Additionally, all non-host contigs assembled in CZID were manually BLASTed against all full-length and protein reference sequences for astroviruses available in NCBI Virus.
To test for differences in astrovirus prevalence, we performed four Pearson’s Chi squared tests between differing subsets of the data: between total prevalence across the three species, and between adults and juveniles within each species.
Molecular confirmation
Samples identified as positive via mNGS described above were subjected to conventional PCR. Among the 4 fecal and 7 urine samples identified as AstV positive by mNGS, only one urine sample did not have enough leftover nucleic acid material and was not included in the analysis. Original fecal and urine samples were syringe filtered through a 0.45\(\:\mu\:\)m filter, and 80\(\:\mu\:\)L of supernatant was re-extracted using the Zymo Quick-RNA Miniprep Kit (Zymo Research, Orange, CA, USA), then eluted in 15\(\:\mu\:\)L of nuclease free water according to manufacturer’s instructions. Reverse transcription was carried out on the eluted RNA using SuperScript IV Reverse Transcriptase (Thermo Fisher Scientific, Waltham, MA, USA) under the following thermal conditions: random priming at 65 °C for 5 min, followed by first strand synthesis at 25 °C for 10 min, 42 °C for 50 min, and 80 °C for 10 min. The resulting cDNA was then tested for presence of astrovirus using a heminested assay targeting the gene ORF1b, specifically the RdRp gene, using primers and cycling conditions described in Chu et al. 2008 [34]. We included water as negative control, and a synthetic gBlock (IDT, Newark, NJ, USA) based on a bat-derived Mamastrovirus (NCBI taxID: NC_043102) as a positive control. PCR products were visualized using gel electrophoresis, and any bands appearing at the target size of 422 bp were cut out and purified using a Zymoclean Gel DNA recovery kit (Zymo Research, Orange, CA, USA). Purified PCR amplicons were then subject to Sanger sequencing through the University of Chicago DNA Sequencing Core.
Genome annotation and % identity analysis
To annotate coding sequences, we downloaded all available bat astrovirus full genomes from NCBI Virus at the time of analysis (August 2022). We aligned positive astrovirus contigs from our dataset to these background sequences using the MAFFT [35] algorithm (v7.450) with default parameters in Geneious Prime (v08-18-2022). We then annotated open reading frames and genes in our novel sequence by identifying stop and start codons in regions adjacent to those identified in the homologs.
To investigate similarity to other published sequences, we performed BLASTn (nucleotide-nucleotide) and BLASTx (translated nucleotide-protein) searches within the NCBI database (Table S1). Additionally, we created amino acid and nucleotide identity plots using the program pySimplot [36] with input alignment generated using MAFFT [35] (v7.450) with default parameters in Geneious Prime (v08-18-2022).
From our initial mNGS screen, we identified one near-full length astrovirus genome which we characterized and used in a full-genome phylogeny; all other astrovirus hits were short fragments which aligned with AstV regions with limited phylogenetic relevance given the lack of corresponding PCR targets in GenBank. From our confirmatory PCR analysis, we successfully generated three RdRp sequences which were used in a Southwest Indian Ocean Region bat RdRp phylogeny.
Phylogenetic analysis
To perform phylogenetic analysis, we combined our novel sequences with those publicly available on NCBI. We carried out three major phylogenetic analyses, building (a) a full-genome Mamastrovirus maximum likelihood (ML) phylogeny, (b) a time-resolved Bayesian phylogeny corresponding to a selection of full genome Mamastrovirus sequences available on NCBI Virus, and (c) a Mamastrovirus ML phylogeny corresponding to a conserved 410 bp fragment of the RdRp gene encapsulated in the AstV ORF1b with a focus in the South West Indian Ocean region. Detailed methods for the construction of each phylogeny are available on GitHub (see Data Availability).
Sequence compilation
Our full genome ML phylogeny consisted of one novel full length Mamastrovirus sequence from our study, combined with 41 unique Mamastrovirus sequences from NCBI, and one full length Avastrovirus sequence as an outgroup, for a total of 43 sequences. For comparison, we compiled Mamastrovirus sequences from NCBI through three queries, selecting: [A] all complete RefSeq Genomes under Virus: Mamastrovirus (taxid:249588) and Virus: unclassified Mamastrovirus (taxid:526119) greater than 6,000 bp (N = 36), [B] Mamastrovirus nucleotide genomes under Virus: Astroviridae (taxid:39733) and Virus: unclassified Astroviridae (taxid:352926) with Host: Chiroptera (bats) (taxid:9397) over 6,000 bp (N = 2), and [C] manual searching of Mamastrovirus nucleotide genomes > 6,000 bp identified in the literature (N = 3).
Our Bayesian timetree consisted of the same set of full length Mamastrovirus sequences, removing the Avastrovirus outgroup, for a total of 42 sequences.
Our Mamastrovirus RdRp ML phylogeny consisted of our three PCR-detected RdRp sequences, one from Rousettus madagascariensis and two from Eidolon dupreanum, combined with 122 unique bat Mamastrovirus sequences from NCBI, and one Avastrovirus RdRp fragment as an outgroup, for a total of 126 sequences. NCBI sequences were restricted to those from bat hosts sampled in the Southwest Indian Ocean (SWIO) region. They were compiled through one query in NCBI Virus: Virus: Astroviridae (taxid:39733), Host: Chiroptera (taxid:9397), and Geographic Region: Madagascar (64), Mozambique (31), and Reunion (27). No sequences were available from the other SWIO landmasses: the Comoros, Mayotte, Mauritius, and the Seychelles. Sequences were confirmed to be RdRp fragments via alignment, and metadata such as host taxa and sampling location were verified in the source literature.
Alignment and substitution model
Following dataset compilation for each phylogenetic analysis, sequences were aligned via the MAFFT [35] (v7.450) algorithm in Geneious Prime (v 2022-08-18) using default parameters. Alignments were visually examined and trimmed to match the shortest sequence in the dataset. We then used Modeltest-NG [37] (v0.1.7) to determine the best fit nucleotide substitution for each alignment. All sequences, subsets, and alignments are available in our open-source GitHub repository (see Data Availability).
Phylogenetic tree assembly
Both the full genome and RdRp ML trees were constructed in RAxML-NG [38] (v1.1.0), using the best nucleotide substitution model from Modeltest-NG (TVM + I + G4) [37]. Following best practice recommendations in RAxML-NG [38], twenty ML inferences were made, followed by bootstrap replicate trees inferred using Felsenstein’s method [39]. The MRE-based bootstrapping test was performed every 50 replicates, and bootstrapping was terminated when the diagnostic result was below the threshold value. Support values were compiled onto the best-scoring tree.
The Bayesian timetree was built in BEAST2 [40] (v2.6.7), using the best nucleotide substitution model from Modeltest-NG (TVM + I + G4)[37]. We used a Bayesian Skyline Coalescent Model with a strict lognormal clock rate with prior mean 0.001 substitutions/site/year [41], and a constant population size. Sampling date for each sequence was inferred from NCBI ‘Collection Date’ or through reading source literature; if day was not available, the sampling date was set to the 15th of the month listed; if day and month were not available, the sampling date was set to July 15th of the collection year. Markov Chain Monte Carlo (MCMC) chains were run for > 700,000,000 iterations and terminated when we identified convergence at ESS values > 200 using TRACER (v1.7), with 10% burn-in. We used TreeAnnotator (v2.6.3) to examine mean posterior densities at each node.
All phylogenies were visualized in RStudio (v2022.07.01), using the package ‘ggtree’ [42].
Nucleotide sequence accession numbers
One annotated near full-length mNGS genome sequence from a Rousettus madagascariensis was submitted to NCBI and is available under accession number OQ606244. In addition, three PCR-detected RdRp sequences were submitted to NCBI, two from different Eidolon dupreanum individuals and one from a Rousettus madagascariensis (the same individual as OQ606244) and available under accession numbers PQ038332, PQ038333 and PQ038344.
Results
Detection
In total, RNA samples from 285 fecal, 196 urine, and 143 throat swabs were sequenced. Based on our detection criteria, evidence of astrovirus infection was found in all 3 species of Malagasy bat, across all sampled locations (Fig. 1). Astrovirus positives were detected among 4/285 (1.4%) fecal samples and 7/196 urine samples (3.6%), and zero of the 143 throat swab samples. These 11 positive samples were from 11 different individual bats.
Of the 11 positive samples, 10 had enough remaining material for conventional PCR confirmation of the presence of astrovirus nucleic acid. Of those 10, confirmatory PCR resulted in identification of three positive samples, including our original full genome sequence from mNGS. Subequent Sanger sequencing recovered three RdRp sequences: one sample from Rousettus madagascariensis (our full genome from mNGS) and two samples from Eidolon dupreanum.
Prevalence varied slightly between species, with 1/44 (2.3%) Pteropus rufus, 8/145 (5.5%) Eidolon dupreanum, and 2/96 (2.1%) Rousettus madagascariensis individuals identified for positive astrovirus infection in either fecal or urine samples (Fig. 1). These between-species differences were not significant (χ2 = 2.104, P = 0.3492). Juvenile vs. adult prevalence also did not vary significantly: 1/15 (6.7%) vs. 0/29 (0%) for P. rufus (χ2 = 1.941, P = 0.1635), 1/13 (7.7%) vs. 7/132 (5.3%) for E. dupreanum (χ2 = 0.122, P = 0.7264), and 1/13 (7.7%) vs. 1/83 (1.2%) for R. madagascariensis (χ2 = 2.271, P = 0.1318).
Sequence analysis of Malagasy AstVs
In total, 21 astrovirus contigs were identified via mNGS, ranging from 209–6,593 base pairs in length. The longest contig, AstV OQ606244, recovered from a fecal sample from a juvenile R. madagascariensis, had substantial supporting read depth (Fig. 2) and encoded a near-complete genome sequence. Analysis of the open reading frames of this sequence, in addition to sequence alignment to Mamastroviruses genomes from NCBI, revealed the entirety of ORF1b and ORF2. ORF1a is nearly fully represented in the contig; however, a fragment of the 5’ region including the start codon was not captured.
All other astrovirus detections from mNGS produced contigs representing fragments that aligned within the ORF2 region. Because this region is not typically targeted in PCR, there were not enough NCBI submissions available for a thorough evolutionary analysis including these fragments. As a result, these positive detections were not analyzed further.
However, secondary PCR detection of our positive samples yielded three additional RdRp sequences which were subsequently used in evolutionary analyses. One was from the same Rousettus madagascariensis sample as the near-full length AstV (OQ606244), aligning with 99% coverage and 99% identity, while the other two were from different Eidolon dupreanum individuals.
BLAST analysis of the near-full AstV genome contig (NCBI Accession # OQ606244) demonstrated that it is highly divergent from previously described Mamastrovirus sequences. BLASTn query of AstV OQ606244 recovered identity with only one other sequence in NCBI: a small portion (5%) of AstV OQ606244 could be aligned to a RdRp from an unknown Chiropteran species sampled in Tanzania (KY054020) at 80.9% identity. BLASTx analysis of the highest scoring matches in the database for the amino acid sequence of AstV OQ606244 revealed that the best amino acid alignment shared only 43.1% amino acid identity across 40% of an AstV ORF1a protein sequence recovered from Eidolon helvum in Cameroon (MG693176). All other top results were to non-chiropteran species including pigs and raccoon dogs (Table S2).
To further investigate the sequence divergence of AstV OQ606244, we performed scanning pairwise nucleotide and amino acid sequence analysis using AstV OQ606244 paired, in turn, with each of five full genome bat Astrovirus sequences available in NCBI (Fig. 2). We observed low nucleotide sequence identity: from 28.18 − 40.28% across the whole genome, and average amino acid identity from 23.87% to 44.29% in ORF 1a, 35.14 − 55.61% in ORF1b, and 24.84 − 40.73% in ORF2 (Table 1).
We observed slight discordance in the best matches with AstV OQ606244 at the nucleotide and amino acid level in this pairwise sequence analysis with complete AstV genomes identified in other chiropteran species. One astrovirus (MT734809) from one of the four Myotis daubentonii hosts had the highest pairwise average nucleotide identity to AstV OQ606244. In contrast, a different AstV genome identified in Eidolon helvum (MG693176)—which, like R. madagascariensis, is a Pteropodid fruit bat—had the highest pairwise amino acid identity to AstV OQ606244 across all open reading frames.
Phylogenetic analysis
Mamastrovirus full-genome evolutionary history
To investigate the evolutionary history of our novel bat astrovirus within the broader genus of mammalian astroviruses, Mamastrovirus, we built a maximum-likelihood phylogenetic tree including RefSeq genomes from a range of hosts with an avian astrovirus from the genus Avastrovirus as an outgroup. The best fit nucleotide substitution model as generated by Modeltest-NG [37] was TVM + I + G4. The full-genome maximum likelihood tree resolved two distinct Mamastrovirus clades, with a single Porcine astrovirus [43] falling out more closely related to the Avastrovirus outgroup, likely because it falls within a clade not otherwise represented in this phylogeny (Fig. 3A). No astroviruses derived from any single host order demonstrated monophyly, though sequences nonetheless clustered based on host taxonomy, with high support in most cases. Overall, these findings lend additional support to a growing body of evidence suggesting that astroviruses are likely to engage in cross-species transmission [16, 18,19,20,21].
The bat astroviruses formed a paraphyletic group, with two sequences basal to a broad clade containing a collection of murine, bovine, porcine, feline, and human astroviruses. AstV OQ606244, sampled from R. madagascariensis, grouped most closely with a previously described sequence recovered from a Cameroonian Eidolon helvum fruit bat, a fellow Yinpterochiropteran host [30]. In our phylogeny, both R. madagascariensis and E. helvum astroviruses resolved as more recently derived than four previously described full genome astroviruses recovered from Yangochiropteran (Myotis spp.) hosts (Fig. 3A).
We recovered largely the same topology in our time-resolved Bayesian phylogeny as in the ML tree: all sequences grouped into same two main Mamastrovirus clades, with a few slight deviations in the sequences that resolved as basal within each subclade (Fig. 3B). All bat astroviruses resolved the exact same position in both the Bayesian and ML tree, reinforcing resolution of their evolutionary positions. Our Bayesian timetree recovered a relatively shallow evolutionary history for the Mamastrovirus clade, dating the most recent common ancestor (MRCA) of all Mamastroviruses to just over 300 years ago, and the MRCA of our R. madagascariensis astrovirus and its closest E. helvum AstV relative ~ 150 years ago.
Bat astrovirus RdRp evolutionary history
To investigate if bat astrovirus evolutionary history is driven by biogeographical patterns or host relationships, we built a maximum likelihood phylogeny of all known bat astrovirus RdRp sequences from the SWIO region. Focusing on the RdRp region greatly expanded the range of background sequences available for inclusion in phylogenetic analysis, as this region of the AstV genome is targeted by single-gene PCR detection methods. In this phylogeny, the best fit nucleotide substitution model as generated by Modeltest-NG [37] was TVM + I + G4. Astrovirus sequences in the RdRp phylogeny grouped largely according to host taxonomy rather than geography of sampling site, a pattern which held across both suborder and family across all sampling locations. Almost all Yinpterochiropteran Mamastrovirus sequences resolved into a monophyletic group, while Yangochiropteran AstVs grouped into multiple clades, all basal or sister to the single Yinpterochiropteran clade.
Within Yangochiropteran hosts, AstV sequences derived from Molossids and Miniopterids clustered together regardless of location. Molossid-derived AstVs resolved into numerous scattered clades basal to all other known sequences. Paraphyly has been detected in Molossid species genomes [44], and thus may promote paraphyly in their viruses through coevolution. This host restriction, however, may be confounded with sampling location, since AstVs from Reunion Island Molossids were the only Molossidae sequences available for analysis. Likewise, available AstV sequences from Miniopterids were all restricted to the geographic locality of Madagascar. Notably, previous sampling from Madagascar Molossids has, to date, detected no astroviruses [12] (Fig. 4); however this previous sampling did not include all Molossid populations or species across the island.
There were a few notable exceptions to these broad patterns. AstVs recovered from Nycterid (family: Nycteridae) bats resolved into two disparate locations across the phylogeny: one clustered within the Molossids and another as basal to the Yinpterochiropteran clade. Vespertilionid AstVs (family: Vespertilionidae), with a sample size of only two, were placed within the Yinpterochiropteran clade, most closely related to Rhinonycterid AstVs (family: Rhinocyteridae).
Within the Yinpterochiropteran AstVs, Rhinonycterid astroviruses mixed with those recovered from Hipposiderids, both with representatives from Madagascar and Mozambique. Numerous prior publications have established these bat families as sister to one another [44, 45]; the genetic similarity of these host clades makes cross-transmission of viruses between them more likely.
AstVs from the family Pteropodidae did not all group together. Three representatives from the same species, Rousettus madagascariensis, resolved to very different places on the phylogeny—one within the Yinpterochiropteran clade and most closely related to a Rhinonycterid AstV, one within the Yangochiropteran Miniopterid clade, and our novel RdRp within the Yangochiropteran Molossid clade. Our two novel AstVs from Pteropodid Eidolon dupreanum similarly grouped within the Yangochiropteran Molossid clade instead of with other members of Yinpterochiroptera. Alignment of all five Pteropodid RdRp sequences yielded a pairwise identity of 64.1%, alignment of the three representatives from Rousettus madagascariensis yielded 61.5%, and alignment of the two Eidolon dupreanum yielded 99.5%. Alignment of our three novel sequences, one from R. madagascariensis and two from Eidolon dupreanum yielded a pairwise identity of 75.9%.
Altogether, these patterns suggest that the Yinpterochiropteran AstV sequences that resolve within the Yangochiropteran AstV clade could represent cross-species spillovers. The fact that there are more of these putative spillover AstV sequences from Yangochiropteran to Yinpterochiropteran hosts than the reverse in our phylogeny—as well as the basal position of the Yangochiropteran relative to Yinpterochiropteran AstV clade—suggests that SWIO bat AstVs may have originated Yangochiropteran hosts, then spilled over and radiated within Yinpterochiropteran hosts. Nonetheless, the weak node support and lack of representation of many bat species in this tree preclude certainty in this conclusion.
Discussion
Detection, divergence, and zoonotic potential
Here we describe how mNGS and conventional PCR were applied to detect and characterize astroviruses from three endemic species of Malagasy fruit bat, Pteropus rufus, Rousettus madagascariensis, and Eidolon dupreanum. We characterize the first near-full genome astrovirus detected from Rousettus madagascariensis and undertake phylogenetic analysis of the surrounding clade. In addition, this work presents the first detection of astroviruses from Eidolon dupreanum and Pteropus rufus. Astrovirus RNA has now been identified in 14/18 species of Malagasy bats that have been queried for this virus family (Table S3). While we did detect astrovirus infection in representatives from each of our three sampled species, our detected prevalence was < 8%. It is possible that this low rate of astrovirus detection could reflect a reduced sensitivity of mNGS for virus detection; however, similarly low AstV prevalence levels have also been described in other bats, particularly Pteropodids [46,47,48] when more sensitive directed detection methods were used. Previous PCR-based sampling of astroviruses in Rousettus madagascariensis found positive infection in 2/41 (4.8%) [12] and 2/40 (5.0%) [49], comparable to our 2.1%. Detection of astroviruses in other species of Malagasy bats, however, has reached as high as 88.9% [49] prevalence, but these high values appear to be more represented in Yangochiropteran bats.
There are multiple lines of evidence supporting the high diversity of the Malagasy Pteropodid astroviruses presented in this work. First, both open-ended BLAST and our directed, pairwise sequence comparison of one near full-length R. madagascariensis AstV genome with all other bat astrovirus genomes available indicate that this newly detected AstV is significantly diverged from all previously described viruses in the family (Table S1, Fig. 2). Second, our lack of PCR detection in 8 of 11 AstV positive samples identified by mNGS raises the possibility that many R. madagascariensis AstVs may be too divergent for detection by primers from the literature [34]; however, we cannot rule out the possibility that our limited success with confirmatory PCR could be a result of repeated thaw-refreeze cycles and sample degradation. Further careful sampling and molecular detection of astroviruses from these populations will help resolve these distinct possibilities. Lastly, previous genomic analyses using this same mNGS dataset have also demonstrated the extraordinary divergence of multiple other Madagascar bat viruses [13, 14]. Kettenburg et al. [13] (2022) identified a novel Nobecovirus (family: Coronaviridae) in a Pteropus rufus fecal specimen which is highly divergent from all others known and appears to be basal to the entire Nobecovirus subgenus of the Betacoronavirus subclade [13]. Madera et al. [14] (2022) described a novel Henipavirus (family: Paramyxoviridae) from Eidolon dupreanum urine samples, which is highly diverged from all other viruses in its genus and also basal to henipaviruses known to be hosted by bats. It is likely that future sampling of the Malagasy bat virome will continue to reveal more highly divergent viruses, reflecting Madagascar’s long geographic isolation from other land masses.
Intriguingly, this newly recovered near-full length genome, AstV OQ606244, demonstrates divergence from previously characterized bat astroviruses in its ORF2 region (Fig. 2). Because ORF2 encodes the AstV spike protein, which mediates virus entry into host cells, this region is vital to understanding potential cross-species transmission [16]. While the precise mechanisms of astrovirus cell entry are unknown, the divergence of AstV from other astroviruses in this region highlights the need for in vitro studies to quantify zoonotic potential in this virus family. Altogether, the highly divergent sequence of AstV OQ606244, paired with high observed human-bat contact rates in Madagascar, underline the need for heightened astrovirus surveillance in this region.
The substantial genetic divergence between AstV OQ606244 and other existing bat astroviruses indicate that this may represent a new species of astrovirus. Currently, species classification of Mamastroviruses is largely based on the host species in which they were detected, although the entire taxonomic group is now undergoing a reclassification due to continued detection of astroviruses across a broad range of hosts [50]. As astroviruses continue to receive more attention, we anticipated that the taxonomic classification of AstV OQ606244 will be clarified.
Evolutionary history
Our full-genome ML and Bayesian phylogenies shed light on the placement of bat astroviruses with respect to other mammalian astroviruses. In our phylogenetic analyses, two previously described bat astrovirus genome sequences resolved as basal sequences within one of the two distinct Mamastrovirus clades. This basal position indicates that bats may have been important ancestral hosts from which other mammalian astroviruses descended. We estimate a relatively shallow evolutionary history for the entire Mamastrovirus clade, with MRCA dating to only ~ 300 years ago. Given the substantial host diversity of astroviruses, this indicates that astroviruses circulate widely within mammalian populations. Our phylogenetic trees also give us insight into astrovirus evolutionary history within bats. Our tree topologies suggest that astroviruses may have originated in insectivorous Yangochiropteran bats and later been transmitted to and further radiated in Yinpterochiropterans.
We date the MRCA of AstV OQ606244 and its most closely related known relative, an Eidolon helvum AstV, to ~ 150 years ago. Considering Madagascar’s isolation from the African continent for ~ 160 million years and the divergence of the Rousettus spp. genus ~ 20 million years ago [51], this would indicate very recent viral genetic exchange between bats from Madagascar and the African continent. Thirty-eight of Madagascar’s 46 bat species are endemic, while two species exhibit ranges that include the Comoros and Reunion Islands (Scotophilus borbonicus and Miniopterus aellini, respectively). Nine species of Madagascar bat can be found more broadly distributed across Africa, and, in some cases, parts of Asia and Europe (e.g. Pipistrellus kuhlii and Mops midas) [10]. Investigating which astroviruses are detectable in these cosmopolitan species presents a key opportunity to elucidate the role of these more broadly distributed potential host species in the diversification and distribution of astroviruses.
In contrast to many other astrovirus studies, which demonstrate minimal to no host restriction on AstV evolution [12, 34, 47, 52,53,54], our RdRp phylogeny, which contained representative AstV sequences from SWIO region bats, indicated that astroviruses group by host phylogeny rather than by sampling location [44]. Perhaps the unique biogeographic features of the SWIO region, whereby island status limits bat dispersal, makes this pattern more likely. The most notable exception to this pattern is the appearance of eight Yinpterochiropteran sequences interspersed within disparate Yangochiropteran clades, in contrast with only three Yangochiropteran sequences clustered within the single Yinpterochiropteran clade. For bats, cross-species virus transmission can be mediated by interspecies co-roosting [1, 4, 7, 55]. Both Rousettus madagascariensis and Eidolon dupreanum, cave-roosting species, have been noted to co-roost with several species of insectivorous bats, which may facilitate cross-species transmission [10]. Indeed, Molossid species bats (Mormopterus jugularis) have been described co-roosting with R. madagascariensis in the exact same roost from which our AstV sequence was recovered [55], and our R. madagascariensis astrovirus clusters within a clade otherwise characterized by Molossid-derived AstVs. Additionally, the placement of our novel Eidolon dupreanum AstVs within this same clade suggests cross-species transmission within Pteropodids, likely facilitated by the proximity of our sampling locations, therefore acting as evidence of geographically driven astrovirus circulation.
It should be noted, however, that the relatively short fragment length of the RdRp region in our phylogenetic analysis (approximately 401 bases) yields weak support across many nodes, particularly those that are basal, limiting our confidence in these patterns. Furthermore, sampling has been historically biased to favor insectivorous bats in the SWIO region; as a result, the lack of representation of many bat host species in AstV surveillance to date limits our understanding. Given the high divergence of our recovered AstV sequence, however, as well as the propensity for human-bat interactions in the SWIO region, further surveillance of previously unsampled species to fill these astrovirus research gaps represents a major public health priority.
Conclusion
We characterize astroviruses recovered from urine and fecal samples derived from three species of Malagasy fruit bats, Pteropus rufus, Eidolon dupreanum, and Rousettus madagascariensis, identifying one near-full length astrovirus genome from Rousettus madagascariensis. This virus is highly divergent from all previously described bat AstV sequences, reflecting Madagascar’s unique biogeographical history as an island nation. This work supports the growing body of literature demonstrating that bats are likely to host considerable astrovirus diversity. Given the high rates of bat-human contact in this region this work demonstrates the urgency in the surveillance of astroviruses and other viruses of zoonotic potential in Malagasy bats, particularly. We advocate particularly for the role of longitudinal studies in addressing these aims, given the potential to develop a nuanced understanding of temporal and spatial dynamics in viral presence and shedding. We additionally advocate for heightened characterization of more whole genome bat astrovirus sequences to strengthen downstream phylogenetic analyses.
Data availability
The near-full length genome and three RdRp sequences presented in the study are deposited in NCBI, accession numbers: OQ606244, PQ038332, PQ038333 and PQ038344. Detailed methods are available at https://github.com/brooklabteam/Mada-Bat-AstV.
Change history
07 October 2024
A Correction to this paper has been published: https://doi.org/10.1186/s12985-024-02527-3
References
Calisher CH, Childs JE, Field HE, Holmes KV, Schountz T. Bats: important Reservoir hosts of emerging viruses. Clin Microbiol Rev. 2006;19:531–45.
Han H-J, et al. Bats as reservoirs of severe emerging infectious diseases. Virus Res. 2015;205:1–6.
Guth S et al. Bats host the most virulent—but not the most dangerous—zoonotic viruses. Proc. Natl. Acad. Sci. 119, e2113628119 (2022).
Brook CE, Dobson AP. Bats as ‘special’ reservoirs for emerging zoonotic pathogens. Trends Microbiol. 2015;23:172–80.
Brook CE, et al. Accelerated viral dynamics in bat cell lines, with implications for zoonotic emergence. eLife. 2020;9:e48401.
Brook CE, Rozins C, Guth S, Boots M. Reservoir host immunology and life history shape virulence evolution in zoonotic viruses. PLOS Biol. 2023;21:e3002268.
Luis AD, et al. A comparison of bats and rodents as reservoirs of zoonotic viruses: are bats special? Proc Biol Sci. 2013;280:20122753.
Cohen LE, Fagre AC, Chen B, Carlson CJ, Becker DJ. Coronavirus sampling and surveillance in bats from 1996–2019: a systematic review and meta-analysis. Nat Microbiol. 2023;8:1176–86.
Phelps KL, et al. Bat Research Networks and viral surveillance: gaps and opportunities in Western Asia. Viruses. 2019;11:240.
Goodman S. Les Chauves-Souris De Madagascar. Madagascar: Association Vahatra, Antananarivo; 2011.
Goodman SM, Benstead JP. The natural history of Madagascar. University of Chicago Press; 2003.
Lebarbenchon C, et al. Astroviruses in bats, Madagascar. Emerg Microbes Infect. 2017;6:1–3.
Kettenburg G, et al. Full genome nobecovirus sequences from Malagasy Fruit bats define a unique evolutionary history for this coronavirus clade. Front Public Health. 2022;10:786060.
Madera S, et al. Discovery and genomic characterization of a Novel Henipavirus, Angavokely Virus, from Fruit bats in Madagascar. J Virol. 2022;96:e00921–22.
Jenkins RKB, Racey PA. Bats as bushmeat in Madagascar. Madag Conserv Dev 3, (2008).
Bosch A, Pintó RM, Guix S. Human astroviruses. Clin Microbiol Rev. 2014;27:1048–74.
De Benedictis P, Schultz-Cherry S, Burnham A, Cattoli G. Astrovirus infections in humans and animals – molecular biology, genetic diversity, and interspecies transmissions. Infect Genet Evol. 2011;11:1529–44.
Finkbeiner SR, Le B-M, Holtz LR, Storch GA, Wang D. Detection of newly described Astrovirus MLB1 in Stool samples from children. Emerg Infect Dis. 2009;15:441–4.
Finkbeiner SR, Kirkwood CD, Wang D. Complete genome sequence of a highly divergent astrovirus isolated from a child with acute diarrhea. Virol J. 2008;5:117.
Chu DKW, et al. Detection of novel astroviruses in urban brown rats and previously known astroviruses in humans. J Gen Virol. 2010;91:2457–62.
Kapoor A, et al. Multiple novel astrovirus species in human stool. J Gen Virol. 2009;90:2965–72.
Lukashov VV, Goudsmit J. Evolutionary relationships among Astroviridae. J Gen Virol. 2002;83:1397–405.
Meliopoulos VA, et al. Detection of antibodies against Turkey Astrovirus in humans. PLoS ONE. 2014;9:e96934.
Karlsson EA, et al. Non-human Primates Harbor Diverse mammalian and avian astroviruses including those Associated with Human infections. PLOS Pathog. 2015;11:e1005225.
Ulloa JC, Gutiérrez MF. Genomic analysis of two ORF2 segments of new porcine astrovirus isolates and their close relationship with human astroviruses. Can J Microbiol. 2010;56:569–77.
Fischer K, dos Reis P, V., Balkema-Buschmann A. Bat astroviruses: towards understanding the Transmission dynamics of a neglected Virus Family. Viruses. 2017;9:34.
Nagai M, et al. Full genome analysis of bovine astrovirus from fecal samples of cattle in Japan: identification of possible interspecies transmission of bovine astrovirus. Arch Virol. 2015;160:2491–501.
Wolfaardt M, Kiulia NM, Mwenda JM, Taylor MB. Evidence of a recombinant wild-type human astrovirus strain from a Kenyan child with gastroenteritis. J Clin Microbiol. 2011;49:728–31.
Prendergast BJ, Freeman DA, Zucker I, Nelson RJ. Periodic arousal from hibernation is necessary for initiation of immune responses in ground squirrels. Am J Physiol Regul Integr Comp Physiol. 2002;282:R1054–1062.
Yinda CK, et al. Cameroonian fruit bats harbor divergent viruses, including rotavirus H, bastroviruses, and picobirnaviruses using an alternative genetic code. Virus Evol. 2018;4:vey008.
Lazov CM, Belsham GJ, Bøtner A, Rasmussen TB. Full-genome sequences of Alphacoronaviruses and astroviruses from Myotis and Pipistrelle Bats in Denmark. Viruses. 2021;13:1073.
Li Y, et al. Virome of Bat Guano from Nine Northern California Roosts. J Virol. 2021;95:e01713–20.
Kalantar KL et al. IDseq—An open source cloud-based pipeline and analysis service for metagenomic pathogen detection and monitoring. GigaScience 9, giaa111 (2020).
Chu DKW, Poon LLM, Guan Y, Peiris JS. M. Novel astroviruses in Insectivorous bats. J Virol. 2008;82:9107–14.
Katoh K, Rozewicki J, Yamada K. D. MAFFT online service: multiple sequence alignment, interactive sequence choice and visualization. Brief Bioinform. 2019;20:1160–6.
Davies J. PySimPlot. (2023).
Darriba D, et al. ModelTest-NG: a New and Scalable Tool for the selection of DNA and protein evolutionary models. Mol Biol Evol. 2020;37:291–4.
RAxML-NG. a fast, scalable and user-friendly tool for maximum likelihood phylogenetic inference | Bioinformatics | Oxford Academic. https://academic.oup.com/bioinformatics/article/35/21/4453/5487384
Posada D, Crandall KA. Felsenstein Phylogenetic Likelihood. J Mol Evol. 2021;89:134–45.
Bouckaert R, et al. BEAST 2.5: an advanced software platform for bayesian evolutionary analysis. PLOS Comput Biol. 2019;15:e1006650.
Jenkins GM, Rambaut A, Pybus OG, Holmes EC. Rates of Molecular Evolution in RNA viruses: a quantitative phylogenetic analysis. J Mol Evol. 2002;54:156–65.
ggtree: an. r package for visualization and annotation of phylogenetic trees with their covariates and other associated data - Yu – 2017 - Methods in Ecology and Evolution - Wiley Online Library. https://besjournals.onlinelibrary.wiley.com/doi/https://doi.org/10.1111/2041-210X.12628
Xiao C-T, et al. Identification and characterization of novel porcine astroviruses (PAstVs) with high prevalence and frequent co-infection of individual pigs with multiple PAstV types. J Gen Virol. 2013;94:570–82.
Lei M, Dong D. Phylogenomic analyses of bat subordinal relationships based on transcriptome data. Sci Rep. 2016;6:27726.
Teeling EC, et al. A molecular phylogeny for bats illuminates Biogeography and the Fossil Record. Science. 2005;307:580–4.
Joffrin L, et al. Astrovirus in Reunion Free-tailed Bat (Mormopterus francoismoutoui). Viruses. 2021;13:1524.
Rougeron V, et al. Characterization and phylogenetic analysis of new bat astroviruses detected in Gabon, Central Africa. Acta Virol. 2016;60:386–92.
Chidoti V et al. Longitudinal survey of Astrovirus infection in different bat species in Zimbabwe: evidence of high genetic astrovirus diversity. Peer Community J. 3, (2023).
Hoarau AOG, et al. Investigation of astrovirus, coronavirus and paramyxovirus co-infections in bats in the western Indian Ocean. Virol J. 2021;18:205.
Astroviridae | ICTV. https://ictv.global/report_9th/RNApos/Astroviridae
Agnarsson I, Zambrana-Torrelio CM, Flores-Saldana NP. May-Collado, L. J. A time-calibrated species-level phylogeny of bats (Chiroptera, Mammalia). PLoS Curr. 2011;3:RRN1212.
Hoarau F, et al. Bat Astrovirus in Mozambique. Virol J. 2018;15:104.
Kemenesi G, et al. Molecular Survey of RNA viruses in Hungarian bats: discovering Novel Astroviruses, coronaviruses, and Caliciviruses. Vector-Borne Zoonotic Dis. 2014;14:846–55.
Lacroix A, et al. Diversity of bat astroviruses in Lao PDR and Cambodia. Infect Genet Evol. 2017;47:41–50.
Andrianaivoarivelo AR, Ramilijaona O, Racey P, Razafindrakoto N, Jenkins R. Feeding ecology, habitat use and reproduction of Rousettus madagascariensis Grandidier, 1928 (Chiroptera: Pteropodidae) in eastern Madagascar. mammalia 75, (2011).
Acknowledgements
The authors acknowledge Anecia Gentles, Kimberly Rivera, Fifi Ravelomanantsoa, and Sarah Guth for help in the field and lab. We acknowledge the Virology Unit at the Institut Pasteur de Madagascar and the Department of Zoologie et Biodiversité Animale at the University of Antananarivo. We thank Joseph L. DeRisi, Maira Phelps, and Vida Ahyong of the Chan Zuckerberg Biohub (CZB) for logistical support. We additionally thank Angela Detweiler, Michelle Tan, and Norma Neff of the CZB genomics platform for mNGS support and thank the Brook lab at the University of Chicago for helpful contributions to the manuscript. This work was completed in part with resources provided by the University of Chicago’s Research Computing Center.
Funding
Research was funded by the National Institutes of Health (1R01AI129822-01 grant to J-MH, PD, and CEB and 5DP2AI171120-02 grant to CEB), DARPA (PREEMPT Program Cooperative Agreement no. D18AC00031 to CEB), the Bill and Melinda Gates Foundation (GCE/ID OPP1211841 to CEB and J-MH), the Adolph C. and Mary Sprague Miller Institute for Basic Research in Science (postdoctoral fellowship to CEB), the Branco Weiss Society in Science (fellowship to CEB), and the Chan Zuckerberg Biohub.
Author information
Authors and Affiliations
Contributions
CEB concieved of the project and acquired the funding, in collaboration with J-MH, VL, and PD. Field samples were collected, and RNA extracted, by CEB, HCR, SA, AA, and VR. AK led the mNGS, with support from HCR, TR, CEB, and CMT. SH, GK, and CEB analyzed the resulting data. SH and CEB co-wrote the original draft of the manuscript, and SH prepared all figures. All authors reviewed the manuscript.
Corresponding author
Ethics declarations
Ethical approval
The animal study was reviewed and approved by UC Berkeley Animal Care and Use Committee and Madagascar Ministry of Forest and the Environment under guidelines posted by the American Veterinary Medical Association.
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Horigan, S., Kettenburg, G., Kistler, A. et al. Detection, characterization, and phylogenetic analysis of novel astroviruses from endemic Malagasy fruit bats. Virol J 21, 195 (2024). https://doi.org/10.1186/s12985-024-02471-2
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s12985-024-02471-2