Geographical distribution and relative risk of Anjozorobe virus (Thailand orthohantavirus) infection in black rats (Rattus rattus) in Madagascar

Background Hantavirus infection is a zoonotic disease that is associated with hemorrhagic fever with renal syndrome and cardiopulmonary syndrome in human. Anjozorobe virus, a representative virus of Thailand orthohantavirus (THAIV), was recently discovered from rodents in Anjozorobe-Angavo forest in Madagascar. To assess the circulation of hantavirus at the national level, we carried out a survey of small terrestrial mammals from representative regions of the island and identified environmental factors associated with hantavirus infection. As we were ultimately interested in the potential for human exposure, we focused our research in the peridomestic area. Methods Sampling was achieved in twenty districts of Madagascar, with a rural and urban zone in each district. Animals were trapped from a range of habitats and examined for hantavirus RNA by nested RT-PCR. We also investigated the relationship between hantavirus infection probability in rats and possible risk factors by using Generalized Linear Mixed Models. Results Overall, 1242 specimens from seven species were collected (Rattus rattus, Rattus norvegicus, Mus musculus, Suncus murinus, Setifer setosus, Tenrec ecaudatus, Hemicentetes semispinosus). Overall, 12.4% (111/897) of Rattus rattus and 1.6% (2/125) of Mus musculus were tested positive for THAIV. Rats captured within houses were less likely to be infected than rats captured in other habitats, whilst rats from sites characterized by high precipitation and relatively low seasonality were more likely to be infected than those from other areas. Older animals were more likely to be infected, with infection probability showing a strong increase with weight. Conclusions We report widespread distribution of THAIV in the peridomestic rats of Madagascar, with highest prevalence for those living in humid areas. Although the potential risk of infection to human may also be widespread, our results provide a first indication of specific zone with high transmission. Gathered data will be helpful to implement policies for control and prevention of human risk infection.


Background
Hantaviruses are zoonotic viruses whose main reservoir hosts are rodents, although these viruses have recently been detected in insectivores and bats [1,2]. Human infections appear in several areas of the globe. Clinical symptoms and severity of disease vary between regions according to the species of hantavirus involved. Two diseases associated with hantavirus infections are described, the Hantavirus Cardiopulmonary Syndrome (HCPS) and the Hemorrhagic Fever with Renal Syndrome (HFSR). Nephropathia Epidemica (NE) is a mild form of HFSR.
While NE disease appears to be mild with 0.1% of mortality rates, HCPS and HFRS, are more severe with a mortality that can reach up to 50 and 15% respectively [3]. HCPS are mainly found in the Americas, and HFRS and NE appears in the Old World [4,5].
Hantaviruses are generally thought to be relatively host-specific [6], for example, Seoul orthohantavirus (SEOV) is associated with the rat, Rattus norvegicus, [7] and Puumala orthohantavirus (PUUV) is preferentially hosted by the bank vole, Clethrionomys glareolus [8]. However, studies have shown that spillover infection can occur between sympatric host species [9,10]. For example, the common vole, Microtus arvalis, is the preferential host of Tula orthohantavirus (TULV), but the virus is also detected in other vole species such as Microtus agrestis and Arvicola spec. [11]. Hantaviruses can be found in saliva, urine and feces of infected animals and transmission between reservoirs is thought to be through bites, scratches or inhalation of contaminated aerosols [12,13]. Humans usually contract hantavirus infection by direct contact with rodents or by inhalation of contaminated aerosols from excretion or secretion from infected animals [14]. Transmission inter-human is very uncommon, although it has been reported for some lineages of Andes orthohantavirus (ANDV) [15,16].
Studies have also documented exposure of human populations in Africa to hantavirus [28][29][30], and recently Heinemann et al. have reported the presence of Bowé orthohantavirus associated with human disease in an Ivorian patient [31]. Moreover, as Thailand orthohantavirus is associated with disease [32], it is probable that hantavirus infection in Madagascar is also symptomatic. Nevertheless, the lack of studies in Madagascar prevents us from highlighting the potential threat that this virus presents for public health and from raising awareness amongst clinicians of clinical symptoms associated with hantavirus infection.
In areas of the globe where hantavirus infections in humans are better monitored, there is strong spatial and temporal variation in disease incidence and studies have demonstrated effects of environmental variables, including climate, habitat and land-use [6]. Climatic conditions may influence persistence of hantavirus in the environment and, therefore, transmission rates to humans [33]. However, these environmental effects on incidence are thought to be primarily mediated by the impacts on the abundance of infected reservoirs [34,35]. For individual hantavirus, spatial variation in risk is likely to largely depend on the bioclimatic and habitat preferences of the principle host, whilst temporal variation in risk may be driven by inter-annual climatic driven changes in food availability and breeding rates [36]. Despite the tendency for close associations between individual mammal species and individual hantaviruses, several studies have also suggested that increased mammal diversity may reduce the prevalence of hantavirus infection in the principal host [37][38][39][40], consistent with a "dilution effect".
For areas where data on infection in humans may be under-recorded, examining the spatial distribution of infected reservoirs provides an important first-step in understanding the potential risk to humans, as well as allowing analyses to explore epidemiological processes in the reservoir populations by identifying environmental risk factors. The previous study in Madagascar that detected Anjozorobe virus was limited to one forest location, Anjozorobe-Angavo forest [25]. Here we sample terrestrial peridomestic small mammals trapped in urban and rural sites of representative regions of Madagascar to (i) establish the distribution of Anjozorobe virus and (ii) evaluate to what extent spatial variation in infection rates is related to environmental and host-related variables. Our results will be helpful to implement policies for future awareness and surveillance programs for human communities, as well as increasing our understanding of the epidemiology of hantavirus in a peridomestic African context.

Specimen collection
Sampling was carried out in twenty districts of Madagascar, with a rural and urban zone in each district except for one district where only the rural zone was sampled. The urban zone was centered on a health center, whilst the rural zone was located within a randomly selected commune within 15 km of the health center. Within the urban zone, three sub-areas for trapping were randomly selected within an 800 m*800 m square, whilst the rural zone centered on a single village. Terrestrial small mammals were captured within a range of habitats. At each zone 15-20 houses were sampled, using one wire-mesh trap (BTS) and one Sherman trap placed inside each house. Where possible a further BTS trap was set in the immediate vicinity of the house. Additional BTS traps were set in trapping lines close to vegetation in (i) areas around houses (either adjacent to paths or areas for disposing of waste) and (ii) around rice fields or other low ground primarily used for agriculture. In urban zones, where possible, additional BTS traps were set in markets and around the abattoir. Traps were baited with onion, dry fish and carrot and checked each morning for 3 days. All trapping was conducted February 2012-April 2012 or October 2012-May 2013. Species, gender and weight of each captured animal were recorded. Animals were euthanized by cervical dislocation and organs were collected and stored in liquid nitrogen before laboratory storage at − 80°C.

RNA extraction and nested RT-PCR
Approximately 100 mg of liver and spleen tissues from each individual rodent were ground using Tissuelyser II (Qiagen) with a 5 mm stainless steel beads. Organ supernatants were recovered at 1:10 dilution of culture medium containing 40% of fetal bovine serum and antibiotics. Total RNA was extracted using the TRIzol LS reagents (Invitrogen, Carlsbad, CA) according to the manufacturer's instructions. cDNA was prepared using PRO-MEGA Kit One Step PCR and amplified by a previously published nested-PCR protocol that targets conserved region of the L gene [18]. All PCR products were sequenced (Cogenics, Essex, UK).

Statistical analysis
We investigated the relationship between hantavirus infection probability in rats and possible risk factors by using binomial Generalized Linear Mixed Models (GLMM) with a logit link. Individual infection status was the response. Explanatory covariates were selected for consideration based on their potential to influence transmission of hantavirus within small mammal populations and included individual-level variables and sitelevel variables. Given that rodents trapped at the same site were non-independent, site was included as a random effect. Continuous covariates were centred and standardized. Non-linear relationships were considered by including the square of the variable. To limit the number of models being considered we performed statistical analysis in two stages: first examining individuallevel variables and then site level variables. Model selection was based on likelihood ratio tests (LRT) to compare nested models, and Akaike Information Criteria (AIC) to compare non-nested models. Models with an AIC within 2 of the model with the lowest AIC (ΔAIC< 2) are equally likely to be the best model [41].
The individual-level stage of the analysis considered weight (as a proxy for age), sex and habitat. Two different habitat related variables were considered. The first was a six-level factor including inside house, outside house, abattoir, market, exterior trap lines and lowground trap lines. The second was a two-level factor that compared inside house rats to rats from all other habitats mentioned above. Explanatory variables were first considered in univariate models. However, initial analyses indicated that weight had a very strong effect and models without weight sometimes had problems converging, weight was therefore included in all models.
The site-level stage of the analysis included all variables significant from the first stage of the analysis and added site-level variables individually. A range of variables related to climate were considered. Bioclimate was classified into four zones: dry, subarid, subhumid and humid. The following climatic variables were also obtained from WorldClim (http://worldclim.org/version2): Annual Mean Temperature BIO1, Mean Diurnal Range BIO2, Temperature Seasonality BIO4, Annual Precipitation BIO12, Precipitation Seasonality BIO15 and Precipitation of Driest Quarter BIO17. WorldClim variables were extracted from a 10 km radius from a point midway between the urban and rural sites. As these climatic variables were correlated, it would not be possible to consider them together in subsequent multivariate GLMMs. We therefore also used principle components analysis (PCA) to summarize climatic variation between sites and considered the first two principal components as covariates in the GLMM. In addition to climatic variables, we also examined site type (rural vs urban); season which was defined as dry season October to December and rainy season January to April; inside house R. rattus abundance (number rats trapped in houses/number of trap nights in houses), outside house R. rattus abundance (number rats trapped outside/number of trap nights outside); host species diversity based on the Shannon diversity index and host species diversity based on Evenness [42].
In each stage, following individual assessment of variables, non-correlated variables with a p-value for the LRT of < 0.20 were considered for inclusion in a multivariate model. Statistical analyses were conducted in R software version 3.3.1 using the following packages: vegan, ade4 and lme4 [43][44][45][46].
Overall 113 (9%) small mammals were PCR positive for hantaviruses, of which 111 were R. rattus and two M. musculus. It is to be noted that all PCR-positive samples were confirmed by sequencing. Hantavirus RNA was not detected in samples from R. norvegicus or insectivores. Genetic analysis of the partial the coding region of the L segment (301 nt) of rodent-borne hantaviruses detected in Madagascar show that these viruses cluster with previous hantavirus detected in Madagascar; Anjozorobe virus ( Fig. 1).

Risk factors of hantavirus infection in R. rattus
For these analyses, only R. rattus were considered due to the small number of hantavirus positive from the other species.
In the individual-level stage, univariate logistic analyses indicated that weight was an important factor with a highly significant positive linear effect on hantavirus infection probability (Table 2). There was no difference between males and females. However, significant differences in infection probabilities were observed between habitats. Based on Akaike information criterion (AIC), the model with habitat as a two-level factor appears to sufficiently describe this variation, with houses having a lower probability of infection than other habitats   Table 2). Prevalence of infected rodents living within houses or outside were 7.5% (34/456) and 17.5% (77/441) respectively. Principal Components Analysis (PCA) results showed that the two principal components F1 and F2 represented 88% of the total climatic data information (Fig. 3). F1 was positively correlated with annual precipitation and precipitation of driest quarter and negatively correlated with mean diurnal range in temperature and precipitation seasonality. Thus, humid sites were characterized by high values of F1, whilst subarid and some subhumid sites that showed more seasonal variation had low values of F1. F2 exhibited positive correlation with mean annual temperature (Fig. 3), with dry sites having the highest values for F2.
The GLMMs in the site-level stage showed strong evidence of climatic effects. In models that included the best model from the individual-level stage and single site-level covariates, the mean diurnal ranges in temperature, annual precipitation and F1 from the PCA analysis were all associated with variation in infection probability based on LRT (Table 3). There was evidence of a non-linear relationship with F1 and some suggestion of a non-linear effect for mean diurnal range in temperature and annual precipitation (Table 3). Infection probabilities increased with increasing precipitation (OR = 2) and increasing values of F1 (at least to some kind of plateau) and decreased with an increasing diurnal range in temperature (OR = 0.56). There was no evidence of any difference between rural and urban sites. There was also no effect of season, rodent abundance measures or host diversity measures. Consequently, as the climate measures were highly correlated no further multi-variate models were considered.
Comparison of AIC values from the univariate models indicated that models including precipitation performed best (Table 3). Although, the model with the lowest AIC included a non-linear effect of precipitation, a simple linear effect was similar in its ability to describe the data. Fig. 1 Phylogenetic tree of hantavirus based on the partial sequences of the L segment coding region. Phylogenetic tree generated by the Neighbourjoining methods and Kimura-2 parameter, based on the alignment of the coding region of the L partial segment 301 nucleotides long of rodents-borne hantaviruses detected in Madagascar. •Reference Sequences ■ Outgroup sequences obtained from Genbank (http://www.ncbi.nlm.nih.gov/nuccore/. Our strains are indicated by Z and the initial of the district followed by the name of the isolation site. In bold are the sequences of hantavirus detected in Mus musculus. Only bootstrap percentages ≥70% (from 1000 resampling) is indicated. Scale bar indicates nucleotide substitution per site The model with a non-linear effect of F1 was marginally worse at explaining the data (ΔAIC = 2.32). Thus, high precipitation appears to be the principle climatic driver of high hantavirus infection probability.

Discussion
Our national-scale study demonstrates that THAIV is widespread in the peridomestic small mammal communities of Madagascar, primarily infecting R. rattus, the most common species in our study, in both rural and urban sites. Rats living outside houses are more likely to be infected than those inhabiting houses and hantavirus prevalence is higher in humid areas of the island. Weight also influenced infection probability, as several studies have shown before for this chronic infection [47,48]. Our results are important for understanding the potential risk to humans from hantavirus infection in Madagascar.
We confirm R. rattus as the main reservoir of hantavirus in Madagascar. Sequencing results from a subset of samples form a distinct cluster with Anjozorobe virus. The detection of two infected M. musculus may suggest spillover infections. Spillover infections in sympatric hosts of closely related species or genera have been reported before for Dobrava orthohantavirus (DOBV) [49] and Tula orthohantavirus (TULV) [11]. Our results are consistent with the idea of that whilst hantaviruses have preferred host species, some spillover infections can occur [11]. However, the importance of such secondary spillover hosts for the persistence and dynamics of infection is unclear. For TULV, the spatial distribution of infected individuals from secondary host species and their apparent clearance of the virus suggest they play a minor role, if any [11].
We did not detect any hantavirus in two other introduced mammals, R. norvegicus and S. murinus. R. norvegicus is the main reservoir of Seoul orthohantavirus, a virus that is found predominantly in the Old World  [50][51][52], but thought to have a more widespread global distribution [2] and recently confirmed using molecular techniques as causing human infections in Europe [53,54]. S. murinus has been found infected with Thottapalayam orthohantavirus in India, Thailand and Indonesia. As our sample size for R. norvegicus and S. murinus were relatively small (n = 124 and n = 76 respectively), we cannot exclude the possibility that they are infected with hantavirus in Madagascar.
The global prevalence of infection amongst R. rattus was 12.4% (111/897), which is similar to the prevalence of Mayotte virus observed in R. rattus populations on Mayotte Island (18.1%: 29/160) [26]. However, we detected variation in prevalence at two spatial scales. Large-scale spatial variation was not driven by any ruralurban split but was related to climate, whilst there was also within-site variation related to habitat type. Spatial prevalence patterns at both scales may be associated with variation in transmission rates due to changing contact rates between potential hosts and infective particles. Serological infection may have found more rats that have been exposed to hantaviruses, but using a more conservative test that detects current presence of DNA is arguably more relevant for the potential infection risk for humans as these individuals are actively infected, moreover uncertainty about specificity of serological testing and the potential for cross-reaction with antibodies against other viruses could have resulted in noise within the dataset (false positives). Furthermore, RT-PCR could allow the detection of new variant of ANJOV or a new orthohantavirus which was of importance for us since to date, hantavirus described in Madagascar, were detected from R. rattus collected in a small region of Madagascar [25]. At the national scale, sites in humid environments presented the highest rates of hantavirus infection. Climatic drivers, including increased precipitation, have been linked to inter-annual variation of hantavirus cases in humans in China and the Four Corners Region of the USA [33,34,55]. Various mechanisms for these relationships have been proposed, including increased vegetation growth leading to more abundant rodent populations and increased contact and viral transmission between rodents and between rodents and humans [34,55]. A positive relationship between host abundance and infection prevalence has been found for Puumala orthohantavirus in a spatiotemporal study of bank vole populations [39]. However, some other studies have found only weak relationships between principal host abundance and prevalence [56], and in our study we did not observe any significant association between rat abundance and hantavirus prevalence. The absence of strong effects of host abundance may reflect difficulties in measuring abundance in a relevant way, for example current prevalence may depend on previous host abundance rather than current host abundance, or because the relationship between abundance and contact rates is not linear [57]. Alternatively, other factors may drive the association between high precipitation and high prevalence through impacts on contact rates.
Rather than absolute rat abundance, the positive association may instead be linked to variation in the seasonality of reproduction. As indicated by the PCA analysis, sites with high precipitation are also characterized by less seasonality. Rats inhabiting such environments, where there are sufficient food resources throughout the year to allow reproduction, may exhibit behaviors that are linked to hantavirus transmission throughout the year (e.g. aggressive behaviors or behaviors associated with maintenance of territories). As humidity can maintain the infectivity and the stability of virus in the ex-vivo environment [58], an alternative explanation could be that increased contact between rodents and infective particles increases hantavirus infection rates in humid areas.
Within sites, rats trapped outside were more likely to be infected than rats trapped inside houses. This variation could also be due to differences in rat contact structures and virus persistence. Contact structures between rats living in houses are likely to be very different from rats inhabiting outdoors. If rats typically inhabit a single house, each house may represent a single small and relatively isolated population with a much-reduced contact structure. A non-exclusive alternative hypothesis could be that persistence of hantavirus in the environment is reduced within houses.
Several studies of hantavirus in reservoir populations have found an apparent dilution effect, with lower prevalence in sites with increased mammal diversity [37][38][39]. Such an effect can occur through a variety of mechanisms such as decreases in host density due to competition or reductions in encounter rates between infected and susceptible hosts. In studies of Sin Nombre virus there has been some evidence that increased mammal diversity leads to reduced intraspecific contact rates in the principal host species [37]. In our study we found no evidence of an effect of diversity, measured at the sitelevel, on hantavirus prevalence and it seems unlikely that community diversity explains the difference in infection probability between inside and outside rats, as all the species trapped were caught in both types of locations (apart from M. musculus, which could only be captured by Sherman traps which were restricted to houses, and the rarely trapped H. semispinosus). The lack of a dilution effect in our study may suggest that whilst an effect of community composition appears to be common for hantaviruses in reservoir populations inhabiting natural and semi-natural habitats located in several parts of the world [59], it is not a feature of hantaviruses in a peridomestic context. Alternatively, as for abundance, our measures of community diversity may be inappropriate to capture the underlying mechanism. For example, the effects of community composition on Puumala orthohantavirus were driven by abundance of a specific species, rather than overall community diversity [39,60]. Thus, more detailed analyses, as well as data from more diverse sites, would further improve our understanding of the potential mechanisms behind the spatial variation in prevalence.

Conclusions
To conclude, we report the widespread distribution of THAIV in the peridomestic rat, R. rattus, captured in both urban and rural sites of Madagascar, with highest prevalence in humid areas of the island. Thus, although the potential risk of transmission to humans may also be widespread, our results provide a first indication of areas where the risk may be higher. The reduced infection probabilities in rats living in houses may decrease the risk of transmission to humans, but data on human exposure is necessary to properly evaluate the risk.

Funding
This work was supported by the Institut Pasteur de Madagascar (Internal Project through ZORA: Zoonoses, Rodent and Arboviruses) and Wellcome Trust Fellowships to ST (#081705, #095171). VR was also supported though Girard's fellowship undergraduate program from the Institut Pasteur de Madagascar. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Availability of data and materials
The datasets used and/or analysed during the current study are available from the corresponding author upon reasonable request.
Authors' contributions VR performed the viral diagnosis and drafted the manuscript. VR and ST analyzed the statistical data. MMO, FMA, SAF, JPR and SA coordinated the fieldwork and participated to the sampling collection. MMO, SFA, ST and JMH designed the field study. CF helped in the phylogenetic analyses and revised the paper. JMH, ST and DADR coordinated the study and revised the paper. All authors reviewed the paper. All authors read and approved the final manuscript.

Ethics approval
This study was conducted according to the institutional ethical guidelines. The Institut Pasteur de Madagascar is guided by the International Guiding Principles for Biomedical Research Involving Animals. This Institution comply with the following laws, regulations, and policies governing the care and use of laboratory animals (Directive 2010/63/EU revising Directive 86/609/EEC on the protection of animals used for scientific purposes was adopted on 22 September 2010; National charter related to Ethics on animal experimentation, Ministry of Education and Research, Ministry of agriculture, livestock and fisheries; 3 April 2008; Charter of the Institut Pasteur related to Ethics on animal experimentation). Moreover, this institution is approved by the US Office of Laboratory Animal Welfare (Animal Welfare Assurance number F17-00356).