Phylodynamics of avian influenza clade 2.2.1 H5N1 viruses in Egypt
Virology Journal volume 13, Article number: 49 (2016)
Highly pathogenic avian influenza (HPAI) viruses of the H5N1 subtype are widely distributed within poultry populations in Egypt and have caused multiple human infections. Linking the epidemiological and sequence data is important to understand the transmission, persistence and evolution of the virus. This work describes the phylogenetic dynamics of H5N1 based on molecular characterization of the hemagglutinin (HA) gene of isolates collected from February 2006 to May 2014.
Full-length HA sequences of 368 H5N1 viruses were generated and were genetically analysed to study their genetic evolution. They were collected from different poultry species, production sectors, and geographic locations in Egypt. The Bayesian Markov Chain Monte Carlo (BMCMC) method was applied to estimate the evolutionary rates among different virus clusters; additionally, an analysis of selection pressures in the HA gene was performed using the Single Likelihood Ancestor Counting (SLAC) method.
The phylogenetic analysis of the H5 gene from 2006–14 indicated the presence of one virus introduction of the classic clade (2.2.1) from which two main subgroups were originated, the variant subgroup which was further subdivided into 2 sub-divisions (188.8.131.52 and 184.108.40.206a) and the endemic subgroup (220.127.116.11). The clade 18.104.22.168 showed a high evolution rate over a period of 6 years (6.9 × 10−3 sub/site/year) in comparison to the 22.214.171.124a variant cluster (7.2 × 10−3 over a period of 4 years). Those two clusters are under positive selection as they possess 5 distinct positively selected sites in the HA gene. The mutations at 120, 154, and 162 HA antigenic sites and the other two mutations (129∆, I151T) that occurred from 2009–14 were found to be stable in the 126.96.36.199 clade. Additionally, 13 groups of H5N1 HPAI viruses were identified based on their amino acid sequences at the cleavage site and “EKRRKKR” became the dominant pattern beginning in 2013.
Continuous evolution of H5N1 HPAI viruses in Egypt has been observed in all poultry farming and production systems in almost all regions of the country. The wide circulation of the 188.8.131.52 clade carrying triple mutations (120, 129∆, I151T) associated with increased binding affinity to human receptors is an alarming finding of public health importance.
Influenza-A viruses contain eight segments of single-strand RNA (ssRNA) and they are continuously evolving overtime. Point mutations can introduce small changes known as genetic drift which mainly occurs because the virus polymerase lacks the proofreading property. These changes are thought to be selected by pressures that force the virus to mutate. Highly pathogenic avian influenza viruses of the H5N1 subtype caused severe outbreaks in 1996/97 in southern China and Hong Kong . In recent years, the H5N1 viruses spread from Asia to Europe and then to Africa, becoming endemic in poultry in parts of Asia and Egypt with frequent transmission to humans. In Egypt, the H5N1 HPAI virus (clade 2.2) was first reported in poultry in February 2006. Since then, the virus has spread rapidly among commercial and backyard flocks in most of the governorates . Human infection rates were still rising as of June 2, 2014, reaching 175 infections with 63 deaths. By May 2015, infections reached 342 with 114 deaths due to the emergence of a new cluster originated from 184.108.40.206 . The H5N1 viruses were isolated from ducks, chickens, and humans in Egyptian households and clustered into a distinct genetic group designated as 2.2.1. The majority of viruses derived from vaccinated poultry in commercial farms belonged to the 220.127.116.11 clade of variant viruses [4–6].
Genotypic characterization of avian influenza H5N1 viruses with the study of the evolutionary dynamics of circulating viruses will promote understanding of the virus evolution in a particular place. In a situation like Egypt, the genetic diversity of viruses leads to the production of heterogeneous genotypes . However, the mechanisms associated with the genotype diversity of H5N1 viruses have still not been investigated . Genetic phylogeny is currently considered the gold standard in characterizing viral genomics, transmission, and molecular evolution. Attempts to analytically trace the migration of viruses through evolutionary history have been done to infer migratory events . In order to enhance the understanding of H5N1 HPAI virus epidemiology and the disease dynamics, particularly in endemic countries, regular linking of epidemiological data from individual outbreaks with the respective sequence information is of paramount importance to decide the activity of efficient disease control.
The aim of this work was to study the evolution of Egyptian H5N1 viruses from 2006 to 2014 using longitudinal epidemiological and virological data. The phylogenetic analysis of the HA gene linked with spatial data analysis can help us to understand the geographic spread of those viruses. This work aimed to describe the cluster dynamics of circulating viruses and evaluate the persistence of H5N1 strains in annual epidemics. Through that analysis, it was possible to determine the evolution rates of the HA gene and to characterise different H5N1genotypes in poultry in Egypt.
Results and discussion
The phylogenetic analysis of the H5 gene of Egyptian viruses from 2006 to 2014 indicated the presence of two main subgroups—namely the classic 2.2.1 and the variant 18.104.22.168—according to the updated nomenclature of the WHO/OIE/FAO H5N1 Evolution Working group . The classic group of clade 2.2.1 that was introduced into Egypt in 2006 remained stable through 2009 and represented the original viruses known at that time. The variant clade 22.214.171.124, which emerged in late 2007 from vaccinated commercial poultry, was subdivided into 2 clusters from 2008 to 2011 (126.96.36.199 and 188.8.131.52a). The first cluster emerged in late 2007 (184.108.40.206) and remained until 2009, while the second cluster (220.127.116.11a) emerged in 2008 and remained until 2011. Since then, these variant clusters have not been detected (Figs. 1 and 2). In 2008, the classic viruses evolved into a new clade 18.104.22.168 due to gradual accumulation of genetic mutations in the HA protein, and was the dominant cluster between 2009 and 2014 in both the household and commercial poultry sectors irrespective of their vaccination status (Figs. 1 and 2). Among the 368 H5 sequences analyzed in this study, there were 299 viruses of classic clade (75 belonged to 2.2.1, 224 to 22.214.171.124) and 69 viruses of variant clade (21 of 126.96.36.199 and 48 of 188.8.131.52a) (Fig. 2).
In general, the original viruses of clade 2.2.1 from 2006 to 2009 were distributed along populated areas of the Nile basin. They were mostly detected in the northern Delta region (53/75) with fewer cases (22/75) in Upper Egypt. The results from passive surveillance and notifications through the veterinary authority supported the same findings that Lower Egypt represented the highest record in comparison to Upper Egypt . However, this finding requires further investigation because it may be due to a sampling bias, as most surveillance activity was directed to the Delta governorates at that time.
The endemic 184.108.40.206 cluster was identified first in 2008 and continues to circulate. It was widely distributed in most governorates in Egypt. Unlike 2.2.1, the 220.127.116.11 cluster was equally distributed between Lower and Upper Egypt (each with 112/224). The 18.104.22.168 viruses were mostly detected in Fayoum (37 virus) and Giza (24 virus) in Upper Egypt, as well as in Menofia (32 virus) in Lower Egypt, (Fig. 3).
From 2007 to 2011, most (51/69) viruses from the variant clade 22.214.171.124 were detected in the Delta area, with lower detection rates (18/54) in Upper Egypt (Giza, Beni suef, Menia, Luxor, and Qena governorates). The variant clade was highly prevalent (46/69) in commercial poultry farms, especially in Qalubiya (20/46), then Giza (6/46), Sharkia (5/46), and Dakahlya (5/46). Both 126.96.36.199 and 188.8.131.52a were first detected in Sharkia governorate with further expansion of both variant clusters to Qalubiya, Beheria, and Dakahlya in Lower Egypt and to Giza then to Upper Egypt (Additional file 1: Figure S1).
Analysis of virus population dynamics of the entire data set of the Egyptian H5N1 viruses showed a rise in genetic diversity in the 184.108.40.206 cluster from early 2008, shortly after the first introduction of the H5N1 viruses in the country in 2006. From 2009 to 2014, the 220.127.116.11 cluster exhibited a constant progressive adaptation to poultry and was considered to be an endemic cluster . Genetically and antigenically distinct viruses emerged in Egypt in late 2007 after vaccination began in poultry (referred to as subgroups E and F  or subclade B  or clade 18.104.22.168 in this study) and estimated to have the highest divergence and rapid evolution rate.
The classic clade (2.2.1) and endemic clade (22.214.171.124) was widely distributed in the household poultry production sector involving chickens (118/134) and ducks (72/75). From 2009–2014, there was an increased detection of the 126.96.36.199 clade in live bird markets (LBM) (25/33), (Fig. 4). In this study, the variant clades (188.8.131.52 and 184.108.40.206a) were mainly reported in commercial chickens (44/93). Although most cases (38/69) were presented with unknown vaccination history, the variant clades were mainly reported in vaccinated flocks (27/69).
There is an apparent lack of disease notification and reporting in the commercial poultry sector in Egypt . Thus HA gene sequences of H5N1 viruses since 2012 from this sector is lacking in most governorates. Phylogeography can highlight the drivers of H5N1 emergence and spread. Qalubiya appears to represent a popular location for virus transmission as also has been explored in previous study . In addition, Sharkia and Dakahlia in Delta and Giza were of the same character as they have all virus clusters recorded in different time periods (Fig. 3). However, there remains uncertainty about virus spread to and from those locations and thus more research needs to be conducted in order to investigate this phenomenon.
Analysis of selection pressures and evolutionary rates
The Nonsynonymous/Synonymous nucleotide substitution ratio (dN/dS) per site was greater than one for nine individual sites in the HA1 domain of the HA gene (Table 1), indicating the presence of positive selection driving the evolution of Egyptian H5N1 viruses in these sites. In particular, the 220.127.116.11 clade showed five distinct positively selected sites (120, 129, 154, 155 and 162), 18.104.22.168 has three prominent sites (140, 141 and 162), while 22.214.171.124a has five characteristic sites (140, 141, 154, 162 and 185) (Table 1). The results indicated that the 126.96.36.199 clade is under positive selection pressure that leads to more adaptation of the viruses to the environment and the maintenance of its endemic state.
The population dynamics analysis revealed a rapid increase in the genetic diversity of A/goose/Guangdong/1/96 lineage viruses from mid-1999 to early 2000 . In this study, it was shown that the Egyptian H5N1 viruses exhibited high evolution dynamics in almost all governorates of the country. The viruses from clades 188.8.131.52 and 184.108.40.206a had the highest record of positive selection sites (Table 1), which may be attributed to vaccination pressure due to long-standing application of vaccines with high virus load in the endemic environment. This reflects the continuous adaptation of Egyptian viruses to the poultry and to their environment with persistent changes every season . The genetic variation among the Egyptian viruses was previously reported and the presence of positive selection was recorded. In this regard, Cattoli et al.  indicated that evolutionary dynamics and positive selection significantly increased in virus populations in countries applying the avian influenza vaccination for H5N1, compared to viruses in countries that had never used vaccination. They also indicated that the rapid evolution of H5N1 viruses in Egypt was possibly linked to vaccination pressure due to sub-optimal use of vaccines.
The Egyptian viruses showed a high rate of evolution since 2006, as the original clade 2.2.1 was 4 × 10−3 substitution/site/year and lasted for 4 years. The variant clade, conversely, had 6.1 × 10−3 substitution/site/year, distributed as 3.8 × 10−3 for 220.127.116.11 over 2 years and 7.2 × 10−3 for 18.104.22.168a over 4 years. The 22.214.171.124 clade showed higher and slower evolution rate in comparison to variant viruses, it was 6.9 × 10−3 over a period of 6 years (Table 2). In addition, the Bayesian skyride analysis of the 126.96.36.199 viruses from 2009 to 2014 showed that the genetic diversity is directly proportional to the annual prevalence peaks. The genetic diversity of the variant clusters from 2007 to 2010 showed a higher pattern of increase followed by a sharp decline in 2011(Fig. 5a, b).
The evolutionary analysis of Egyptian viruses revealed that these viruses have progressive rates of evolution. The factors related to this increase were mainly attributed to the sub-optimal use of vaccines and long-lasting virus persistence in the environment leading to the endemic prevalence of 188.8.131.52 viruses over six successive years. Cattoli et al.  revealed that the two main Egyptian clades (designated as A and B) have co-circulated in domestic poultry since late 2007 and exhibited different profiles of positively selected codons and rates of nucleotide substitution. The mean evolutionary rate of clade 2.2.1 H5N1 viruses was estimated in their study as 4.07 × 10−3 nucleotide substitutions per site, per year whereas clade 184.108.40.206 viruses possessed a markedly higher substitution rate (8.87 × 10−3) and that reflected the high genetic diversity among Egyptian viruses.
Molecular characterization and genetic analysis of HA gene
The analysis of the 368 HA genes enabled us to examine changes in the receptor binding site (RBS), antigenic sites (AS) and the cleavage site.
Changes in the receptor binding site
The most characteristic change in the receptor binding site of the Egyptian viruses included in this study was the observation of one amino acid deletion at site 129 (129∆) that was not recorded in the ancestral strain (A/goose/Guangdong/1/96). This change was linked to the emergence of 220.127.116.11 clade in 2008. The viruses with 129∆ were found in the majority of human infections in Egypt in 2009 and have been found in all H5N1 human infections afterwards . The presence of the 129∆ mutation may affect the binding of the virus to human receptors. Another important change in the receptor binding site was linked to the variant clades (18.104.22.168 and 22.214.171.124a), which showed S129L substitution. The substitution was more pronounced in the 126.96.36.199a cluster and was linked with another substitution (P74S) in most cases (28/35) (Table 3).
The most recently isolated viruses (n = 56) that were collected between 2012 and 2014 were examined for HA gene mutations and belonged to the 188.8.131.52 clade in which three mutations (129∆, I151T, and S120(D,N)) were shown to be constant (Table 3).
The loss of HA154–156 glycosylation site was shown to enhance H5N1 virus binding to terminally α-2,6 sialic acid receptors and so increased the transmissibility to mammals [17–19]. The majority of the Egyptian 2.2.1 viruses lacked this site. There were few (6/75) 2.2.1 viruses that had the HA154–156 glycosylation site. Conversely, the majority of the Egyptian 184.108.40.206 viruses (154/224) had this site (Table 3).
The H5N1 viruses from Egypt displayed four characteristic mutations (D43N, S120(D,N), (S,L)129∆, and I151T). The results showed that 57 % of the HA sequenced genes showed a triple mutation (129∆, S120(D,N), and I151T) (Table 3). These triple mutations are characteristic in 220.127.116.11 clusters from different bird species such as chicken, duck, turkey, geese, ostrich, and quail; however, few of the 18.104.22.168 viruses did not carry them. The percentage of those viruses with the triple mutation reached 100 % from 2012 to 2014. Two mutations of those (129∆, I151T) had increased attachment and infectivity to the human lower respiratory tract, but not in the larynx ; that indicates an increasing possibility of human infections in Egypt.
The majority of variant viruses of clade 22.214.171.124 had no changes in the receptor binding site at position 129 (129∆) (Table 3), and they were not responsible for any human infection in Egypt, except for one case in early 2008 during the beginning of widespread prevalence of this cluster . In addition, Perovic et al.  indicated the extensive evolution of Egyptian H5N1 HPAI virus towards human. They reported that all G2 viruses (referred to as 126.96.36.199 clade in our study) displayed four characteristic mutations (D43N, S120(D,N), (S,L)129∆ and I151T). The other mutations that are linked to increased affinity to human receptors like “S223N, D183G, E186G, Q192R, Q222L and G224S” were not found in the Egyptian H5N1 viruses.
Changes in the antigenic sites
Amino acid substitutions at the antigenic sites can cause antigenic drift, possibly leading to vaccine escape as observed in the field. In comparison to the virus introduced in 2006, the characteristic changes that occurred in the antigenic sites of HA gene are very specific to each cluster (Table 3). There were 35/75 of 2.2.1 viruses that showed no antigenic changes in comparison to the parent Egyptian virus of 2006 and the precursor virus A/Bar-headed Goose/Qinghai/5/05. In addition, 26/75 of the viruses showed one antigenic change at either P74S or D154(E,N) or R162(I,K) or S141P sites. The remaining (14/75) of 2.2.1 viruses showed two to three changes. The most predominant antigenic change for those viruses was R162I which has been observed in 17 viruses. Approximately 60 % (132/224) of the viruses that belong to 188.8.131.52 cluster have four mutations that occurred at the same sites, i.e., S120(D,N), I151(T,L), D154(A,N) and R162K (Table 3). The Egyptian variant viruses (184.108.40.206 and 220.127.116.11a) carry characteristic mutations in the antigenic sites (at positions 74, 129, 140, 141, 154 and 162), while 18.104.22.168 cluster carries characteristic mutations at positions 120, 129, 151, 154, and 162; all these mutations can differentiate each cluster of Egyptian viruses from the others. Four characteristic mutations P74S, R162K, 140G and 141P were frequently detected together in the variant cluster 22.214.171.124 (17/31), whereas the variant cluster 126.96.36.199a had another four characteristic mutation that occurred together in sites P74S, R162(K,E), 141P and D154N (Table 3).
There have been 21 potential antigenic sites identified in the HA of H5N1 HPAI. It was shown that a single amino acid change in the HA of H5N1 HPAI virus can affect immune response and protection . The recent Egyptian viruses from 2013 and 2014 carry four prominent mutations in the antigenic sites (at positions 120, 151, 154 and 162), all of which belong to the endemic clade 188.8.131.52 viruses; two of these mutations (154 and 162) were distinguishing the new viruses from the earlier 2008–09 viruses, which indicate limited changes in these sites in comparison to the earlier viruses of this cluster. However the recent viruses circulating during the last few years in Egypt from 2011 showed a clear separation from the ancestral viruses indicating a gradual evolution of those viruses.
The antigenic analysis of the earlier H5N1 variant strains in Egypt demonstrated antigenic variation [12, 23], which was driven by multiple mutations primarily occurring in the major antigenic sites at the globular head of HA . Other studies showed that the classic clade 2.2.1 strains are antigenically related and cross-reactive to the ancestral Asian H5N1 strains, but demonstrated weak cross-reactivity with the Egyptian variant 184.108.40.206 strains [23, 25]. The majority of these mutations, alongside the other 19 amino acid mutations, were located within or adjacent to the receptor binding domain (RBD) in the HA1 that may affect the virus replication and transmission. There were six conserved mutations in previously reported antigenic sites (D43N, S120(N,D), S129∆, I151T, D154N and R162K) between the early 2006 strain and the endemic 220.127.116.11 cluster strains. It was shown that the D43N mutation resulted in antigenic drift between classic 2.2.1 and 18.104.22.168 clusters .
The results of the present study showed clear differences among the virus clusters in terms of the absence or presence of certain changes in the antigenic sites. For instance, the majority of 22.214.171.124 cluster had four changes occurring at the same time in the antigenic sites (S120(D,N), R162K, I151(T,L) and D145(A,N)), while the latter two sites were lacking in the 126.96.36.199 cluster. Ibrahim et al., 2013  observed an antigenic variation between different H5N1 clusters, especially between the variant 188.8.131.52a and the endemic 184.108.40.206 cluster strains showing a significant antigenic drift. Some residues were located in different antigenic sites like 133S, 154D and 156A, 190L and 192Q and 71L. They also confirmed that the 220.127.116.11 cluster showed a broader reactivity to all strains that represent different H5N1 clusters circulating in Egypt, as it shared residues with all the strains in the major antigenic sites.
Changes in the cleavage site
There were 13 groups of viruses identified based on the amino acid sequences at the cleavage site. Most of the H5N1 HPAI virus isolates belonging to 2.2.1 (42/75), 18.104.22.168 (105/224) and 22.214.171.124a (7/38) as well as all the viruses that belong to 126.96.36.199 cluster (31/31) have a common cleavage site of “ERRRKKR”, described as the consensus cleavage site for clade 2.2 viruses . However, the pattern “EKRRKKR” became dominant from 2013 and replaced the previous pattern which disappeared after 2012 (Table 4). The pattern “EGRRKKR” of amino acid sequence was exclusively present in 188.8.131.52a cluster and represented the highest proportion (23/38) among the other six patterns in the same cluster, while the pattern “ERRRKR” was observed only in 30.6 % (23/75) of the viruses that belong to the 2.2.1 cluster (Table 4).
The currently dominant amino acid cleavage site pattern “PQGEKRRKKR/GLF” is closely associated with the mutation 129∆ at the receptor binding site, and mutations at the antigenic sites (S120D, I151T, D145N and R162K) which are known to increase the binding ability to human receptors. All these mutations characterize the dominant cluster (184.108.40.206) from 2011 onwards . The substitution R325G was found at the cleavage site in Egyptian 220.127.116.11a viruses, while R325K characterized recent 18.104.22.168 viruses from 2011–2014. The R325G substitution was shown to significantly reduce pathogenicity without altering the transmission efficiency of H5N1 HPAI virus  and shows that non-adaptive mutations can play a role in virus evolution as the 22.214.171.124a cluster disappeared in Egypt since 2011.
The high diversity of the HA gene in relation to some governorates indicates active virus circulation in different locations. In this study, the hypervariability of the HA gene was noticed in relation to the geographical location (data not shown). Viruses from Qalubiya, Giza, Menufia and Dakahlya governorates had the highest number of heterogeneous amino acid sequences of the HA gene. In addition, unequal virus distribution was noticed among governorates and that favour virus persistence in an endemic area (Fig. 3). All the above mentioned results support the existence of genetic diversity of HA gene in Egypt with progressive virus evolution in a model of intermittent re-emerging H5N1 viruses to a clean areas located inside an endemic environment.
Evolution of H5N1 HPAI viruses in Egypt continues to occur in all poultry farming and production systems and in almost all regions of the country. From 2006 to 2014, two clades have been detected, each subdivided into two genetic clusters (2.2.1, 126.96.36.199, 188.8.131.52a and 184.108.40.206). The 220.127.116.11 has been the dominant cluster circulating since 2011. It is possible that viruses within the variant clusters were less fit than the viruses of the classic clade 2.2.1, ultimately giving rise to a group of endemic clade 18.104.22.168 viruses. The wide circulation of the 22.214.171.124 cluster carrying mutations associated with increased binding affinity to human receptors is an alarming finding of public health importance. Continuous monitoring of the circulating viruses and sequencing of HA and other genes, in particular the NA gene, is important to better select viruses for vaccine studies and to understand the evolution of viruses over time. Regular data sharing among professionals in the animal and public health sectors will allow linking of epidemiological and sequence information and will provide a clear picture on the virus evolution.
Nucleotide sequencing of HA gene
The H5N1 HPAI virus isolates and field samples from 368 cases of H5N1 were collected in Egypt during the period from 2006 to 2014. They were collected from different localities in Lower and Upper Egypt, different bird species (chicken, duck, turkey, geese, quail and ostrich) and different poultry value chain nodes like households, commercial poultry farms and live bird markets.
The full length HA gene sequencing has been conducted, where the ribonucleic acids (RNAs) of virus isolates or samples were extracted using QiaAmp viral RNA extraction kit (Qiagen, Germany) according to the manufacturer’s instructions. A one-step RT-PCR was conducted on the extracted RNAs using specific primers for Matrix (M) and H5 genes [29, 30]. The PCR products were purified using a QiaAmp purification kit (Qiagen, Germany). The HA gene sequencing was done using a Bigdye Terminator Kit (version 3.1; Applied Biosystems, Foster City, CA) on a 3130 Genetic Analyzer (Applied Biosystems, Foster City, CA). The sequencing of the HA gene was conducted at NLQP and the data were regularly submitted to the GenBank and are available at the National Center for Biotechnology Information (NCBI) Influenza Virus Resource. Recently, new sequence data from 2012–2014 were added in the GenBank under accession numbers of KJ522707-KJ522745 and KP209286-KP209303.
Phylodynamics of HA gene
After excluding the sequences from duplicate strains, 365 out of 368 full-length HA genes of Egyptian H5N1 viruses were used for this analysis. For the estimation of the rates of nucleotide substitution among H5N1 viruses from Egypt, the Bayesian Markov Chain Monte Carlo (BMCMC) method (as implemented in BEAST v1.4.7) was applied . The Bayesian GMRF skyride coalescent tree model was used . The uncorrelated lognormal relaxed (UCLD) clock  that allows evolutionary rates to vary along branches within lognormal distributions was used and Hasegawa-Kishino-Yano (HKY) substitution model with empirical base frequencies and gamma site heterogeneity model at 4 categories. Mean evolutionary rates and divergence times were calculated using Tracer V.1.5 . The phylogenetic trees were visualized with FigTree v.1.1.2 .
Analysis of selection pressures
The site-specific selection pressures for the HA gene of Egyptian H5N1 viruses was measured as the ratio of nonsynonymous (dN) to synonymous (dS) nucleotide substitutions per site (dN/dS) or omega (ω). Normalized dN-dS was estimated as the raw dN-dS divided by the total length of the tree measured in the number of expected substitutions per nucleotide per site. The estimates were made using the Single Likelihood Ancestor Counting (SLAC) method available at the Datamonkey online version of the Hy-Phy package . This analysis used input Neighbor Joining (NJ) phylogenetic trees estimated according to the HKY model of nucleotide substitution. A site with ω > 1 is indicating positive selection. Statistical distributions were used to model the variation in ω among sites, allowing a subset of sites to have ω >1 while the rest of the sequence may be under purifying selection with ω < 1 with p-value of less than 0.05 .
Genetic characterization of HA gene
In the present study, HA genes of 368 Egyptian H5N1 viruses were genetically characterized and studied for evidence of genetic mutations in different parts of the gene, including receptor binding, antigenic and cleavage sites. Multiple and pairwise sequence alignments were constructed using the Clustal-W algorithm of Bio-edit® software V.7.1.11 . The mutations in the receptor binding and antigenic sites were tabulated against the virus clusters to explore their proportion among the sequenced HA genes and in order to identify the common antigenic differences between the virus clusters. The acquisition or loss of glycosylation sites of known importance in the HA gene was recorded. The amino acid sequences at the receptor binding, antigenic and cleavage sites were grouped and tabulated per cluster and year.
Availability of supporting data
The sequences of HA gene were submitted to the public GenBank database under accession numbers from KJ522707 to KJ522745 and from KP209286 to KP209303.
synonymous nucleotide substitution
Emergency Center of Transboundary Animal Diseases
Food and Agriculture Organization of the United Nations
highly pathogenic avian influenza
highest posterior density
live bird market, BMCMC, Bayesian Markov Chain Monte Carlo
National Center for Biotechnology Information
National Laboratory for Veterinary Quality Control on Poultry Production
proteolytic cleavage site
receptor binding domain
receptor binding site
reverse transcription polymerase chain reaction
Single Likelihood Ancestor Counting
uncorrelated lognormal relaxed clock
Chen R, Holmes EC. Avian Influenza exhibits rapid evolutionary dynamics. Mol Biol Evol. 2006;23:2336–41.
Aly MM, Arafa A, Hassan MK. Epidemiological findings of outbreaks of disease caused by highly pathogenic H5N1 avian influenza virus in poultry in Egypt during 2006. Avian Dis. 2008;52:269–77.
Arafa AS, Naguib MM, Luttermann C, Selim AA, Kilany WH, Hagag N, et al. Emergence of a novel cluster of influenza A(H5N1) virus clade 126.96.36.199 with putative human health impact in Egypt, 2014/15. Euro Surveill. 2015;20(13):2–8.
Abdel-Moneim AS, Shany SA, Fereidouni SR, Eid BT, El-Kady MF, Starick E, et al. Sequence diversity of the haemagglutinin open reading frame of recent highly pathogenic avian influenza H5N1 isolates from Egypt. Arch Virol. 2009;154:1559–62.
Arafa A, Suarez DL, Hassan MK, Aly MM. Phylogenetic analysis of HA and NA genes of HPAI-H5N1 Egyptian strains isolated from 2006 to 2008 indicates heterogeneity with multiple distinct sublineages. Avian Dis. 2010;54:345–9.
Donis RO, Smith GJ. Nomenclature updates resulting from the evolution of avian influenza A(H5) virus clades 188.8.131.52a, 2.2.1, and 2.3.4 during 2013–2014.World Health Organization/World Organisation for Animal Health/Food and Agriculture Organization (WHO/OIE/FAO) H5 Evolution Working Group. Influenza Other Respir Viruses. 2015 May 12. doi: 10.1111/irv.12324. [Epub ahead of print].
Vijaykrishna D, Bahl J, Riley S, Duan L, Zhang JX, Chen H, et al. Evolutionary dynamics and emergence of panzootic H5N1 influenza viruses. PLoS Pathog. 2008;4(9):e1000161.
Wallace RG, Hodac H, Lathrop RH, Fitch WM. A statistical phylogeography of influenza A H5N1. Proc Natl Acad Sci U S A. 2007;104(11):4473–8.
WHO/ OIE/ FAO H5N1 Evolution Working Group. Revised and updated nomenclature for highly pathogenic avian influenza A (H5N1) viruses. Influenza Other Respir Viruses. 2014;8(3):384–8.
El-Zoghby EF, Aly MM, Nasef SA, Hassan MK, Arafa AS, Selim AA, et al. Surveillance on A/H5N1 virus in domestic poultry and wild birds in Egypt. Virol J. 2013;10:203.
El-Shesheny R, Kandeil A, Bagato O, Maatouq AM, Moatasim Y, Rubrum A, et al. Molecular characterization of avian influenza H5N1 virus in Egypt and the emergence of a novel endemic subclade. J Gen Virol. 2014;95:1444–63.
Balish AL, Davis CT, Saad MD, El-Sayed N, Esmat H, Tjaden JA, et al. Antigenic and genetic diversity of highly pathogenic avian influenza A (H5N1) viruses isolated in Egypt. Avian Dis. 2010;54:329–34.
Cattoli G, Fusaro A, Monne I, Coven F, Joannis T, El-Hamid HS. Evidence for differing evolutionary dynamics of A/H5N1 viruses among countries applying or not applying avian influenza vaccination in poultry. Vaccine. 2011;29:9368–75.
FAO. Approaches to controlling, preventing and eliminating H5N1 Highly Pathogenic Avian Influenza in endemic countries. 2011. FAO., http://www.fao.org/docrep/014/i2150e/i2150e.pdf.
Scotch M, Mei C, Makonnen YJ, Pinto J, Ali A, Vegso S, et al. Phylogeography of influenza A H5N1 clade 184.108.40.206 in Egypt. BMC Genomics. 2013;14:871.
Arafa A, Suarez D, Kholosy SG, Hassan MK, Nasef S, Selim A, et al. Evolution of highly pathogenic avian influenza H5N1 viruses in Egypt indicating progressive adaptation. Arch Virol. 2012;157:1931–47.
Watanabe Y, Ibrahim MS, Ellakany HF, Kawashita N, Mizuike R, Hiramatsu H, et al. Acquisition of Human-Type Receptor Binding Specificity by New H5N1 Influenza Virus Sublineages during Their Emergence in Birds in Egypt. PLoS Pathog. 2011;7:19.
Xu X, Subbarao K, Cox NJ, Guo Y. Genetic characterization of the pathogenic influenza A/Goose/Guangdong/1/96 (H5N1) virus: similarity of its hemagglutinin gene to those of H5N1 viruses from the 1997 outbreaks in Hong Kong. Virology. 1999;261:15–9.
Neumann G, Macken CA, Karasin AI, Fouchier RAM, Kawaoka Y. Egyptian H5N1 Influenza Viruses—Cause for Concern? PLoS Pathog. 2012;8(11):e1002932. doi:10.1371/journal.ppat.1002932.
Earhart KC, Elsayed NM, Saad MD, Gubareva LV, Nayel A, Deyde VM, et al. Oseltamivir resistance mutation N294S in human influenza A(H5N1) virus in Egypt. J Infect Public Health. 2009;2:74–80.
Perovic VR, Muller CP, Niman HL, Veljkovic N, Dietrich U, Tosic DD, et al. Novel Phylogenetic Algorithm to Monitor Human Tropism in Egyptian H5N1-HPAIV Reveals Evolution toward Efficient Human-to-Human Transmission. PLoS One. 2013;8(4):e61572.
Hoffmann E, Lipatov AS, Webby RJ, Govorkova EA, Webster RG. Role of specific hemagglutinin amino acids in the immunogenicity and protection of H5N1 influenza virus vaccines. Proc Natl Acad Sci U S A. 2005;102:12915–20.
Beato MS, Mancin M, Yang J, Buratin A, Ruffa M, Maniero S, et al. Antigenic characterization of recent H5N1 highly pathogenic avian influenza viruses circulating in Egyptian poultry. Virology. 2013;435:350–6.
Cattoli G, Milani A, Temperton N, Zecchin B, Buratin A, Molesti E, et al. Antigenic drift in H5N1 avian influenza in poultry is driven by mutations in major antigenic sites of the hemagglutinin molecule analogous to human influenza. J Virol. 2011;85(17):8718–24.
Watanabe Y, Ibrahim MS, Ellakany HF, Kawashita N, Daidoji T, Takagi T, et al. Antigenic analysis of highly pathogenic avian influenza virus H5N1 sublineages cocirculating in Egypt. J Gen Virol. 2012;93:2215–26.
Ibrahim M, Eladl AF, Sultan HA, Arafa AS, Abdel Razik AG, Abd El Rahman S, et al. Antigenic analysis of H5N1 highly pathogenic avian influenza viruses circulating in Egypt (2006–2012). Vet Microbiol. 2013;167:651–61.
OFFLU. Influenza A Cleavage Sites version 4 April 2014. 2014. www.offlu.net/fileadmin/home/en/resource-centre/pdf/Influenza_A_Cleavage_Sites.pdf.
Yoon S-W, Kayali G, Ali MA, Webster RG, Webby RJ, Ducatez MF. A single amino acid at the hemagglutinin cleavage site contributes to the pathogenicity but not the transmission of Egyptian highly pathogenic H5N1 influenza virus in chickens. J Virol. 2013;87(8):4786–8.
Spackman E, Senne DA, Myers TJ, Bulaga LL, Garber LP, Perdue ML, et al. Development of a realtime reverse transcriptase PCR assay for type A influenza virus and the avian H5 and H7 hemagglutinin subtypes. J Clin Microbiol. 2002;40:3256–60.
Slomka MJ, Pavlidis T, Banks J, Shell W, McNally A, Essen S, et al. Validated H5 Eurasian real-time reverse transcriptase– polymerase chain reaction and its application in H5N1 outbreaks in 2005–2006. Avian Dis. 2007;51:373–7.
Drummond AJ, Rambaut A. BEAST: Bayesian evolutionary analysis by sampling trees. BMC Evol Biol. 2007;7:214.
Minin VN, Bloomquist EW, Suchard MA. Smooth skyride through a rough skyline: Bayesian coalescent-based inference of population dynamics. Mol Biol Evol. 2008;25:1459–71.
Drummond AJ, Ho SY, Phillips MJ, Rambaut A. Relaxed phylogenetics and dating with confidence. PLoS Biol. 2006;4:699–710.
Rambaut A, Drummond AJ. 2007. Tracer v1.4: MCMC trace analyses tool. http://beast.bio.ed.ac.uk/Tracer. Accessed 20 June 2008.
Rambaut A. 2008. FigTree v1.1.1: Tree figure drawing tool. Available: http://tree.bio.ed.ac.uk/software/figtree/. Accessed 20 June 2008.
Kosakovsky Pond SL, Frost DWS. Not so different after all: a comparison of methods for detecting amino acid sites under selection. Mol Biol Evol. 2005;22:1208–22.
Yang Z, Wong WSW, Nielsen R. Bayes empirical Bayes interference of amino acid sites under positive selection. Mol Biol Evol. 2005;22:1107–18.
Hall TA. BioEdit: a user-friendly biological sequence alignment editor and analysis. program for Windows 95/98/NT. Nucl Acids Symp Ser. 1999;41:95–8.
This work was supported by the United States Agency for International Development (USAID) [grant number AID-263-IO-11-00001, Mod.#3] in the framework of OSRO/EGY/101/USA project jointly implemented by the FAO, General Organization for Veterinary Services (GoVS) and National Laboratory for Veterinary Quality Control of Poultry Production (NLQP).
The views expressed in this information product are those of the author(s) and do not necessarily reflect the views or policies of FAO.
The authors declare that they have no competing interests.
AA is the main corresponding author; he designed, followed up, reviewed all the technical work, and drafted the manuscript. IE carried out data collection and illustrations, reviewed the manuscript. SK was responsible for the epidemiological data collection and reviewing. MH, GD, and JL approved, reviewed, and followed up the work. YM assisted with manuscript design and preparation, following up and reviewing the manuscript, he is the second corresponding author. All authors read and approved the final manuscript.
About this article
Cite this article
Arafa, A., El-Masry, I., Kholosy, S. et al. Phylodynamics of avian influenza clade 2.2.1 H5N1 viruses in Egypt. Virol J 13, 49 (2016). https://doi.org/10.1186/s12985-016-0477-7