Skip to main content

Tracking the molecular evolution and transmission patterns of SARS-CoV-2 lineage B.1.466.2 in Indonesia based on genomic surveillance data



As a new epi-center of COVID-19 in Asia and a densely populated developing country, Indonesia is facing unprecedented challenges in public health. SARS-CoV-2 lineage B.1.466.2 was reported to be an indigenous dominant strain in Indonesia (once second only to the Delta variant). However, it remains unclear how this variant evolved and spread within such an archipelagic nation.


For statistical description, the spatiotemporal distributions of the B.1.466.2 variant were plotted using the publicly accessible metadata in GISAID. A total of 1302 complete genome sequences of Indonesian B.1.466.2 strains with high coverage were downloaded from the GISAID’s EpiCoV database on 28 August 2021. To determine the molecular evolutionary characteristics, we performed a time-scaled phylogenetic analysis using the maximum likelihood algorithm and called the single nucleotide variants taking the Wuhan-Hu-1 sequence as reference. To investigate the spatiotemporal transmission patterns, we estimated two dynamic parameters (effective population size and effective reproduction number) and reconstructed the phylogeography among different islands.


As of the end of August 2021, nearly 85% of the global SARS-CoV-2 lineage B.1.466.2 sequences (including the first one) were obtained from Indonesia. This variant was estimated to account for over 50% of Indonesia’s daily infections during the period of March–May 2021. The time-scaled phylogeny suggested that SARS-CoV-2 lineage B.1.466.2 circulating in Indonesia might have originated from Java Island in mid-June 2020 and had evolved into two disproportional and distinct sub-lineages. High-frequency non-synonymous mutations were mostly found in the spike and NSP3; the S-D614G/N439K/P681R co-mutations were identified in its larger sub-lineage. The demographic history was inferred to have experienced four phases, with an exponential growth from October 2020 to February 2021. The effective reproduction number was estimated to have reached its peak (11.18) in late December 2020 and dropped to be less than one after early May 2021. The relevant phylogeography showed that Java and Sumatra might successively act as epi-centers and form a stable transmission loop. Additionally, several long-distance transmission links across seas were revealed.


SARS-CoV-2 variants circulating in the tropical archipelago may follow unique patterns of evolution and transmission. Continuous, extensive and targeted genomic surveillance is essential.


Coronavirus disease 2019 (COVID-19) is an infectious disease caused by a novel β-coronavirus, namely, severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) [1, 2]. It has evolved into an ongoing pandemic since 11 March 2020 [3, 4]. According to the World Health Organization (WHO), over 215 million confirmed cases with nearly 4.5 million related deaths have been reported globally as of the end of August 2021 [5]. Straddling Southeast Asia and Oceania, Indonesia is the world's largest archipelago nation and the world’s fourth most populous nation. Despite some degree of underestimation, Indonesia has been bearing the highest cumulative burden of COVID-19 in the region of the Association of Southeast Asian Nations (ASEAN) since mid-October 2020 [6]. As of 6 June 2021, this populous tropical country had only 5.51% of its citizens fully vaccinated and was once considered to be Asia’s new epi-center of the pandemic after India [7, 8].

With the worldwide spread of the virus, its continuous evolution yielded the emergence of multiple variants, which has posed an increased risk to global public health [9]. In this context, sequencing-based genomic surveillance, whether at the national, regional or global level, has become the key to formulating effective intervention strategies [10]. The Pango nomenclature offers a dynamic proposal for the classification and subtyping of SARS-CoV-2 strains based on genetic relatedness [11]. Under this system, a multitude of SARS-CoV-2 lineages and sub-lineages are identified and named accurately, including four variants of concern (VOCs: B.1.1.7, B.1.351, P.1 and B.1.617.2 corresponding to Alpha, Beta, Gamma and Delta labelled by WHO) and several variants of interest (VOIs) [12]. Since 2020, several teams and institutions (e. g., the Yogyakarta-Central Java COVID-19 Study Group and the West Java Health Laboratory) have conducted whole-genome sequencing (WGS) of SARS-CoV-2 strains circulating in various parts of Indonesia, and have shared genetic data and related metadata via the Global Initiative on Sharing Avian Influenza Data (GISAID) platform [13,14,15,16,17]. These efforts make it possible to gain insights into Indonesia's COVID-19 epidemic from a micro perspective.

One of the WHO-designated variants under monitoring (VUMs), SARS-CoV-2 lineage B.1.466.2, was first documented in Indonesia in November 2020 and was reported to contribute to 31.7% of all cases nationwide as of the third week of August 2021 (second only to the Delta variant) [12, 18]. During the pre-Delta outbreak period, the B.1.466.2 lineage was the most dominant variant in Indonesia whereas the Alpha variant was more dominating in other parts of Asia [19]. Although the B.1.466.2 variant has been introduced into neighbouring countries (e. g., Malaysia) [20], it still mainly circulates in Indonesia and might have accumulated unique mutations in indigenous transmission. Due to its overly dispersed territory and relatively limited health resources, extensive and continuous SARS-CoV-2 genomic surveillance remains a daunting task for Indonesia [21]. Two bioinformatic analyses of SARS-CoV-2 full-length sequences from Indonesia confirmed the dominance of GH clade with marker mutations of D614G and Q57H [22, 23]. A more representative recent study found that in Indonesia, the B.1.466.2 variant shared some missense mutations with both Alpha and Delta VOCs, indicating a significant impact on transmission and clinical severity; the N439K mutation showing antibody resistance was prevalent in the Indonesian B.1.466.2 variant [19]. Other related studies mainly focused on a certain province or part of Indonesia (especially the Java Island) [13,14,15,16,17]. In West Java, the B.1.466.2 variant dominated the local SARS-CoV-2 infections from January through April 2021, which was generally consistent with the frequency trend of Q57H mutation [17]. In Central Java and Yogyakarta, around 13% of SARS-CoV-2 samples (collected from May 2020 to June 2021) were clustered into the B.1.466.2 lineage, ranking second among non-Delta variants [16]. As an important local variant, there is a lack of nationwide genomic surveillance targeting the B.1.466.2 lineage. In particular, the trajectory of its population size over time and the picture of how it transmitted between different islands in Indonesia remain unclear.

In the present study, we retrospectively analyzed the whole-genome sequencing data of SARS-CoV-2 lineage B.1.466.2 from Indonesia to systematically reveal its evolutionary features and transmission patterns in such a tropical archipelago setting.


Study area

Indonesia is divided into 34 first-level administrative regions which are located on five main islands (Java, Sumatra, Kalimantan, Sulawesi and Papua) and two main archipelagos (Nusa Tenggara and Maluku). Geographically, Bali is the westernmost part of Nusa Tenggara but it is traditionally listed as a separate unit in the statistics. Following the Wallace Line in evolutionary ecology, Java, Bali, Sumatra and Kalimantan belong to Western Indonesia, while Sulawesi, Nusa Tenggara, Maluku and Papua belong to Eastern Indonesia. Western Indonesia has a larger population and a higher level of socio-economic development than Eastern Indonesia [24].

Data collection

All SARS-CoV-2 genomic data in the study were downloaded from the GISAID’s EpiCoV database ( on 28 August 2021. The relevant retrieval fields were set as follows: (1) “hCoV-19” as virus name; (2) “Asia/Indonesia” as location; (3) “Human” as host. For quality control, sequences that meet any of the following filtering criteria should be excluded: (1) length < 29000nt; (2) proportion of undetermined nucleotide bases > 5%; (3) Vero cell passage history; (4) incomplete collection date; (5) unknown Pango lineage. Finally, a total of 4210 complete and high-coverage SARS-CoV-2 genomes collected from Indonesia between 12 March 2020 and 17 August 2021 were included and compiled into Dataset 1. Based on Dataset 1, 1302 genomes belonging to the B.1.466.2 lineage (with the collection date ranging from 6 August 2020 to 2 August 2021) were extracted to constitute Dataset 2 for further analysis. The accession IDs for relevant sequences are available in Additional file 1.

Statistics on lineage prevalence

The global epidemic burden of the SARS-CoV-2 lineage B.1.466.2 was characterized by the national distribution of genomic samples, which can be directly retrieved in GISAID. Using the metadata in Dataset 2, we calculated the absolute counts and proportions of the B.1.466.2 samples according to the eight regions of Indonesia mentioned above. In time series, the daily sampling counts of the Indonesian B.1.466.2 variant were smoothed by the seven-day rolling average method. The dynamic proportions of the B.1.466.2 and Delta (lineage B.1.617.2 and its sub-lineages) variants among SARS-CoV-2 infections in Indonesia were estimated with the genomic data in Dataset 1 using the R “ratesci” package v0.3-0 (Peter J. Laud, Statistical Services Unit, University of Sheffield, Sheffield, United Kingdom); the corresponding 95% confidence intervals were calculated based on Bayesian Jeffrey’s prior [25].

Genome-wide phylogenetic and mutation analyses

The 1302 full‐length genome sequences in Dataset 2 were aligned using MAFFT v7.487 (Osaka University, Suita, Osaka, Japan). We performed the phylogenetic analysis of Indonesian SARS-CoV-2 lineage B.1.466.2 strains with maximum likelihood (ML) method based on the general time reversible model [26]. The ML phylogenetic tree was built using FastTree v2.1.10 (Morgan N. Price, Physical Biosciences Division, Lawrence Berkeley National Laboratory, CA, USA) [27], and the branches with low support (bootstrap value < 0.7) were removed. A root-to-tip regression was conducted to test the temporal signal in Dataset 2 using TreeTime v0.8.4 (Pavel Sagulenko, Emma Hodcroft & Richard Neher, Max Planck Institute for Developmental Biology, Tubingen, Germany) [28], and the effect was evaluated with R square calculated using TempEst v1.5.3 [29]. Based on the ML phylogeny and temporal structure, the corresponding time-scaled phylogenetic tree was built with the "least square" re-rooting scheme. The “Mugration” model in TreeTime v0.8.4 was applied to estimate the discrete state (region) and its probability for each most recent common ancestor (MRCA) of the time-scaled phylogenetic tree [30]. Taking Wuhan-Hu-1 (NC_045512.2) as the reference genome, single nucleotide variants (SNVs) were called and annotated using the Nextclade tool ( [31]. The time-scaled phylogenetic tree and the SNV matrix corresponding to the leaf nodes were visualized using the iTOL tool ( [32].

Phylodynamic and phylogeographic reconstructions

Using TreeTime v0.8.4, the maximum likelihood approach was applied to estimate the effective population size (Ne) of SARS-CoV-2 lineage B.1.466.2 circulating in Indonesia over time [28]. According to the dynamics of Ne, four phases were determined for better understanding the demographic history of this local variant. As an indicator of real-time virus transmissibility, the effective reproduction number (Rt) was estimated using the R (R Foundation for Statistical Computing, Vienna, Austria) “EpiEstim” package v2.2-4 based on the time series of daily sampling counts [33].

To reconstruct the phylogeography of the B.1.466.2 variant within Indonesia, the coalescent events from the time-scaled phylogenetic tree were used to represent the transmission events. After filtering out the clades with MRCA discrete state probability less than 0.7, the transmission events were counted by region in each phase [34]. For a given region in Indonesia, the level of local transmission was evaluated with the ratio of inferred intra-regional transmission events to inter-regional transmission events (including importations and exportations) [35]. A series of maps were made to visualize the inferred transmission patterns of SARS-CoV-2 lineage B.1.466.2 in Indonesia during different phases using the R “Leaflet” package v2.0.4.1 [36].


Spatiotemporal distribution characteristics

As of 28 August 2021, the GISAID database recorded a total of 1953 complete genomes of SARS-CoV-2 lineage B.1.466.2 from 24 countries. Of these, 84.9% were from Indonesia, followed by Malaysia (4.6%), Singapore (4.3%), Australia (1.4%), Japan (1.4%) and Papua New Guinea (0.9%) (Fig. 1A). The first sample (accession ID: EPI_ISL_2868921) was isolated from Banten Province (a part of Java), Indonesia on 6 August 2020. The geographic distribution of 1302 Indonesian SARS-CoV-2 lineage B.1.466.2 samples included in Dataset 2 is shown in Fig. 1B. There were 493 samples from Sumatra and 406 samples from Java, together accounting for 69.0%. There were no samples from Maluku.

Fig. 1
figure 1

A Table displaying the global distribution of SARS-CoV-2 lineage B.1.466.2. B Pie chart displaying the geographical distribution of SARS-CoV-2 lineage B.1.466.2 in Indonesia; numbers of genomic samples from different regions and corresponding percentages are labeled (no samples from Maluku). Both statistics are based on genome samples recorded in GISAID as of the end of August 2021

The daily count of Indonesian SARS-CoV-2 lineage B.1.466.2 samples began to increase significantly in early January 2021 and reached a peak in late May 2021 (36 sequences collected on 19 May 2021), and rapidly decreased thereafter (Fig. 2A). Among 4210 samples of various SARS-CoV-2 variants from Indonesia(Dataset 1), the B.1.466.2 variant was the dominant one from early March 2021 to late May 2021 and reached its peak proportion (83.7%) with a 95% confidence interval (95% CI) of 65.5% ~ 94.8% on 16 May 2021 (Fig. 2B). Contrary to the B.1.466.2 variant, the Delta variant experienced a sharp increase in its proportion during May 2021, making it the latest dominant variant in Indonesia.

Fig. 2
figure 2

A Time series of daily sampling counts of the Indonesian SARS-CoV-2 B.1.466.2 variant expressed with raw data (grey line) and seven-day rolling averages (red line). B Estimated proportion dynamics of the B.1.466.2 and Delta (lineage B.1.617.2 and its sub-lineages) variants in Indonesia; the colored bands represent corresponding 95% confidence intervals

Time-scaled phylogeny and mutation profiles

As shown in Additional file 2, a positive temporal signal in Dataset 2 was detected using root-to-tip regression analysis (R2 = 0.362; P < 0.01). The evolutionary rate of this lineage in Indonesia was roughly estimated at 7.446 × 10−4 substitutions per site per year. Furthermore, the time-scaled phylogenetic relationships of SARS-CoV-2 B.1.466.2 variants from Indonesia were reconstructed using maximum-likelihood phylodynamic analysis (Fig. 3A). The root of the evolutionary tree was located in Java (bootstrap support: 99.3%), and the time to the MRCA was dated to 13 June 2020.

Fig. 3
figure 3

A Time-scaled maximum-likelihood phylogenetic tree of SARS-CoV-2 lineage B.1.466.2 circulating in Indonesia; colors indicate different sampling locations. B Genome-wide single nucleotide variation (SNV) matrix of the Indonesian SARS-CoV-2 B.1.466.2 variant (against the reference genome); major non-synonymous mutations are marked in red, while major synonymous mutations are marked in blue

According to the tree topology, two sub-lineages (labeled A and B) were identified, with 161 (12.4%) and 1141 (87.6%) samples respectively. The samples from Java were relatively evenly distributed in clades, while over 95.0% of those from Sumatra and Kalimantan were clustered in the B sub-lineage. The common amino acid changes (with mutation frequencies > 99%) of the B.1.466.2 variant circulating in Indonesia included: ORF1a-T1168I, ORF1a-P1640L, ORF1b-P314L, S-N439K, S-D614G, ORF3a-Q57H and N-T205I (Fig. 3B). Compared with sub-lineage A, sub-lineage B was additionally characterized by amino acid changes such as ORF1a-S944L, ORF1a-L3644F, ORF1b-S1182L and S-P681R.

Reconstructed demographic history and epidemic dynamics

The historical trajectory of effective population size of SARS-CoV-2 lineage B.1.466.2 circulating in Indonesia could be divided into four phases (Fig. 4A). From mid-June 2020 to late September 2020, the Ne rose in volatility based on an initial low level. During the five months after the end of September 2020, this lineage underwent a steady exponential expansion of the viral population in Indonesia; the Ne was estimated to multiply 50-fold and reach its peak (9.7 × 103) at the beginning of March 2021. In the third phase (early March to early May, 2021), a slight population contraction of the variant was inferred. Finally, the viral population seemed to remain relatively steady with a mean Ne of 4.8 × 103.

Fig. 4
figure 4

A Maximum likelihood skyline displaying the effective population size (Ne) trajectory of SARS-CoV-2 lineage B.1.466.2 circulating in Indonesia; the blue line represents the point estimates for Ne, while the red dots represent the cut-off points of phase division. B Estimated effective reproduction number (Rt) dynamics of the Indonesian SARS-CoV-2 B.1.466.2 variant; the median Rt estimates for the four phases (divided according to the Ne trajectory) are indicated. The grey bands represent corresponding 95% confidence intervals

The effective reproduction number of SARS-CoV-2 lineage B.1.466.2 in Indonesia between early August 2020 and early August 2021 was estimated using information from 1302 complete genomes (Fig. 4B). During the investigation period of one year, Rt showed an overall downward trend, but showed different characteristics in four phases. The median estimate of Rt for the first phase was the highest (5.00), but its confidence interval was wide due to the limited sample size. Three Rt peaks were observed in the second phase, and the highest one reached 11.18 (95% CI: 5.36 ~ 19.10) on 23 December 2020. During the third phase, the estimated Rt remained relatively stable around the cut-off value of one. The median Rt for the last phase was estimated to be 0.89 (< 1.00).

Spatiotemporal transmission patterns

The inferred transmission of the B.1.466.2 variant between different regions of Indonesia in each phase was mapped. By the end of September 2020, the scope of transmission was limited, presenting the epi-center in Java and spilling to neighboring Sumatra and Bali (Fig. 5A). The second phase (October 2020 to February 2021) was inferred to be a critical period for the B.1.466.2 prevalence in Indonesia, both in terms of spread coverage and transmission intensity (Fig. 5B). Java exported this variant at a higher intensity to all other regions and constituted bidirectional transmission loops with Sumatra, Bali and Sulawesi, respectively. There was a long-distance transmission link from Sumatra to Papua. Intra-regional transmission occurred in all regions except Papua, especially in Java, Sumatra and Bali. During the following two months, inter-regional transmission became sparse and weak (Fig. 5C). The transmission loop between Java and Sumatra still existed, and Sumatra acted as an emerging epi-center with upgraded intra-regional transmission. After early May 2021, inter-regional transmission remained relatively low, characterized by viral introductions from Sumatra, Java and Nusa Tenggara to Sulawesi, as well as bidirectional viral exchanges between Sumatra and Java (Fig. 5D). The intra-regional transmission in Kalimantan increased while those in previous hotspots declined. In general, the prevalence of SARS-CoV-2 lineage B.1.466.2 in Indonesia followed spatiotemporal patterns of strong in the western part and weak in the eastern part, rising in the early stage and declining in the later stage. The inferred frequencies for both intra- and inter-regional transmission events are given in Additional file 3.

Fig. 5
figure 5

A series of maps showing the inferred transmission of the SARS-CoV-2 B.1.466.2 variant in Indonesia’s different regions in each phase. Panels A, B, C and D correspond to Phases I, II, III and IV respectively. Line colors indicate the inter-regional transmission intensity, while the circle size indicates the relative strength of spread within the region


As the COVID-19 pandemic continues to progress globally, the evolution and transmission dynamics of various SARS-CoV-2 variants have aroused great interest among virologists, clinicians, epidemiologists, and even health decision-makers. This study revealed the prevalence, evolution and spatiotemporal transmission of an endemic variant (the lineage B.1.466.2) in Indonesia using publicly available complete genomes and associated metadata. Such findings provide valuable reference for responding to future outbreaks in archipelagic countries.

More than 80% of the SARS-CoV-2 lineage B.1.466.2 genomes are from Indonesia, and other countries with a contribution rate of more than 1% (except Japan) are adjacent to Indonesia. The geographic distribution supported the local prevalence of this variant with the Indonesian Archipelago as its global epi-center [17]. Furthermore, the number of samples contributed by each region in Indonesia was generally consistent with its population size and laboratory diagnostic capacity [21]. It should be pointed out that sampling may have been biased by the availability of sequencing centers and resources, which was relatively low in Eastern Indonesia (especially before the establishment of the nationwide network for SARS-CoV-2 genomic surveillance) [37]. The B.1.466.2 variant was first documented in Indonesia in November 2020 [12]. However, the collection time of one Indonesian B.1.466.2 sample submitted to GISAID can be traced back to 6 August 2020. Hence, we inferred that this variant was likely to have been circulating in Indonesia before August 2020. In the study, we observed a noticeable shift in the dominant strain of Indonesian SARS-CoV-2 infections, which was from the B.1.466.2 variant to the B.1.617.2 or Delta variant and took place in late May 2021. Interestingly, the prevalence of these two variants in Indonesia was almost complementary in time series. This phenomenon reflected the fierce competition between different variant populations of the same virus, and in this case, the highly contagious Delta variant eventually gained a competitive advantage [38]. As the virus continues to evolve, similar population succession is likely to be repeated, which calls for real-time genomic surveillance [10].

The time-scaled phylogeny revealed the most recent common ancestor and heterogeneous evolution of this lineage. According to the root state in the evolutionary tree, the B.1.466.2 variant circulating in Indonesia was estimated to have originated in Java in mid-June 2020. The estimate of its MRCA was logically consistent with the sampling location and date of the earliest B.1.466.2 sequence. To comprehend the emergence and evolution of pathogenic viruses, the complex "host–pathogen-environment" interaction provides an analytical framework [39]. As the world's most populous island, Java is home to 147.8 million people (over 50% of the Indonesia’s total population) [40]. Java's very high population density (~ 1200/km2) means a high SARS-CoV-2 transmission rate [41], superimposed on its large number of potential hosts, which can lead to large viral population size and intense selection pressure—two major evolutionary drivers [42]. Therefore, it seemed unsurprising that this local variant of Indonesia originated from Java. In addition to the root, the trunks to which side branches belong were also assigned to Java, indicating that the SARS-CoV-2 lineage B.1.466.2 strains in other parts of Indonesia were all introduced from Java. Like other forms of life on earth, the evolution of viruses is subjected to environmental selection. Among the side branches of the evolutionary tree, we observed the clustering preference of sequences derived from Sumatra and Kalimantan in the B clade (sub-lineage). The equator traverses Sumatra and Kalimantan, and both large islands are known for their dense jungles and diverse species. At least 88 and 97 species of tropical bats have been identified in Sumatra and Kalimantan respectively, including those that potentially serve as natural hosts for SARS-CoV-2 [43,44,45]. Interestingly, a viral ecology study in the islands of the Western Indian Ocean found a strong signal of co-evolution between coronaviruses and their bat host species and further revealed the effect of the endemism resulted from geographical isolation on viral diversification within bat families [46]. Therefore, we speculate that the unique jungle ecology of Sumatra and Kalimantan may contribute to the local circulation of the B.1.466.2 variant through certain species of bats (e. g. Rhinolophus spp.) [47, 48].

The evolutionary rate of Southeast Asian SARS-CoV-2 strains (isolated before September 2020) was estimated to be 1.446 × 10−3 substitutions per site per year [49], while the corresponding estimate for the Indonesian B.1.466.2 variant was almost halved. The decline in the rate of evolution probably occurred under its RNA proofreading mechanism and partly reflected that this variant was more adaptable to the local environment than the original strain [50, 51]. Several high-frequency non-synonymous mutations were identified in the SARS-CoV-2 lineage B.1.466.2 from Indonesia, most of which occurred in the non-structural protein 3 (NSP3) and the spike protein. NSP3 is the largest viral protein containing multiple domains and participates in the formation of the replication/transcription complex [52]. Notably, the S944L mutation (only observed in the B sub-lineage) is located in the papain-like protease domain of NSP3. However, whether this mutation can improve viral fitness remains to be verified in further experiments [53]. Co-mutations of S-D614G and ORF3a-Q57H are the genetic markers for the GH clade, to which the Pango lineage B.1.466.2 belongs according to GISAID [54]. The D614G mutation in the spike protein confers a selective advantage on SARS-CoV-2 and eventuates in all VOIs and VOCs [55]. Previous studies have proven that the D614G strain has higher infectivity, primarily by altering the conformation of the RBD (receptor binding domain) and enhancing the affinity with the ACE2 (angiotensin-converting enzyme 2) receptor [56,57,58]. The Q57H mutation causes major truncation of ORF3b and structural instability of ORF3a [59]. SARS-CoV-2 strains with this mutation are reported to evade innate immune activation and inflammatory responses [60, 61]. As the second most common RBD mutation worldwide, the S-N439K mutation has been shown not only to enhance the infectivity of the virus (probably via creating a new salt bridge with ACE2), but also to increase the resistance to the neutralizing antibody [62, 63]. Located in the furin cleavage site of the spike protein, the P681R mutation is one of the defining mutations for the current dominant strain globally—the Delta variant [64]. It was reported that this highly conserved mutation could facilitate furin-mediated S1–S2 cleavage, accelerate membrane fusion and internalization, enhance replication efficiency and promote resistance to neutralizing antibodies [65,66,67]. In this study, we found that the sub-lineage B of the Indonesian B.1.466.2 variant has evolved the S-P681R mutation, and the numerical advantage of the B sub-lineage over the A lineage (approximately 7:1) supported the potential higher infectivity of the strain with this mutation. Although the effect of a single amino acid mutation on viral infectivity, virulence and immunological properties is generally limited, the accumulation of some significant mutations in a background of multiple mutations may profoundly alter the pandemic patterns and the clinical outcomes, as we have seen with the Delta variant [15, 16, 68]. Given the mutation profiles observed (especially the D614G/N439K/P681R co-mutations in the spike protein), continuous monitoring of the B.1.466.2 variant remains essential.

In epidemiological contexts, Ne (a measure of genetic diversity) is generally considered to be proportional to the number of infected individuals [69]. For the COVID-19 pandemic, Ne may reflect the incidence of SARS-CoV-2 infection among the population due to the short transmission window of the virus [70]. Considering the decline in susceptible individuals and the implementation of intervention strategies, Rt is commonly used to assess the real-time dynamics of infectious disease transmission [71]. In the initial phase after the emergence of the B.1.466.2 variant, its effective population expanded the fastest and corresponded to a very high Rt, suggesting that the local hosts were highly susceptible to the new variant. An exponential growth in Ne with a median Rt > 1 was observed during October 2020–February 2021, which documented the widespread of SARS-CoV-2 lineage B.1.466.2 in communities (especially in the form of infection clusters). Moreover, the peak of estimated Rt overlapped with the peak of total reported cases nationwide in January 2021 [72], indicating that the B.1.466.2 variant could be responsible for the surge of total SARS-CoV-2 infections at the period. From early March 2021 to early May 2021, the Ne of the B.1.466.2 variant showed a decreasing trend, which may have resulted from the competitive pressure brought by the Delta variant, the enforcement of community-level public activity restrictions (locally known as PPKM) and the launching of Indonesian nationwide vaccination program against COVID-19 [73]. Similar reasons could be used to explain the lower Rt. In the final phase of the study, the median Rt estimate < 1 and the plateaued Ne suggested that SARS-CoV-2 lineage B.1.466.2 seemed to be gradually extinct in Indonesia. However, in this case we could not exclude the possibility that the B.1.466.2 variant was still circulating in certain regions due to the lag in SARS-CoV-2 genome submissions to GISAID [74]. Similar to some of its relatives (e. g., OC43 and HKU1), the circulation pattern of SARS-CoV-2 also exhibits a certain seasonality [75, 76]. During the period of December 2021 to March 2022 (just in another rainy season), the B.1.466.2 variant was detected again in several regions of Indonesia [77], which may in part reflect seasonal outbreaks.

We assessed the inter-regional transmission and the relative advantage of intra-regional transmission over inter-regional transmission of SARS-CoV-2 lineage B.1.466.2 in Indonesia in the four phases (divided based on the inferred Ne trajectory). Java was estimated to be the initial epi-center for this SARS-CoV-2 variant, which was consistent with the estimated origin located in Java. In addition, a certain degree of intra-regional transmission was likely to have already existed in Java before the first infection with this variant was officially documented. This finding highlighted the importance of improved genomic surveillance for the timely identification of new SARS-CoV-2 variants. Corresponding to the exponential growth of Ne, the transmission of the B.1.466.2 variant in Indonesia reached its peak during the second phase, which coincided with the country’s rainy season (October to March) and several important festivals. The rainy season usually means more water vapour in air and less ultraviolet radiation from sunlight. In flow physics, higher ambient relative humidity contributes to longer lifetime of respiratory droplets [78]. Previous studies demonstrated that ultraviolet light has the strongest correlation with lower COVID-19 growth [76, 79, 80], probably due to the inactivation of the virus by ultraviolet light and its promotion of the vitamin D (as an immune regulator) synthesis. In addition, Indonesia is a country prone to flooding, and serious flood disasters mostly occur in the rainy season every year and mainly affect populated areas like Java and Sumatra [81]. Flooding can cause both difficulties in practicing social distancing during evacuation and interruptions in the supply of medical protective materials, facilitating the local transmission of the virus [82, 83]. Thus, strengthening the prevention, preparedness and readiness for COVID-19 in the rainy season is needed. A large number of studies have confirmed the contribution of population mobility to the spread of COVID-19 [84,85,86]. To transition to a healthy, safe and productive society, the Indonesian government changed the strictest intervention policy of large-scale social restrictions (PSBB) into the “PSBB transisi” policy [87]. This allowed some citizens (especially those living in the capital, Jakarta) to return home to visit their relatives during Christmas 2020 and the following New Year holiday, partly contributing to the spread across regions [88]. The sea barrier makes inter-regional passenger traffic in Indonesia largely dependent on aviation, and air travel is sometimes the only way to reach certain parts of the archipelago. Based on previous research evidence, we speculated that air travel provided a path for the relocation diffusion of the B.1.466.2 variant from Java and Sumatra to the remote Papua [89, 91]. In the socio-economic dimension, Java is the center of gravity of Indonesia, followed by Sumatra. These two regions in Western Indonesia contribute about 80% of the national economy and interact frequently with convenient transportation conditions [90]. The phylogeographic reconstruction suggested that Java and Sumatra successively acted as epi-centers and formed stable transmission loops between them, reflecting the significant influence of the spatial structure of socio-economic factors on the diffusion pattern of the pandemic [91, 92].

Targeting an indigenous SARS-CoV-2 variant (B.1.466.2) circulating in Indonesia, we identified its evolutionary characteristics and transmission patterns by retrospective genomic surveillance. Although an epidemiological paradigm that combines molecular and native perspectives was provided, unevenly distributed and relatively inadequate genomic samples may bias the analytic results to some extent. In fact, sampling bias is inevitable in genomic surveillance and related research [93]. Therefore, in the process of deriving conclusions, we should try to combine the real-world pandemic situations and be cautious. As Indonesia improves the use of WGS for SARS-CoV-2 and strengthens international health cooperation, more accurate and real-time genomic surveillance for this variant and even comparative genomic analysis along with other variants are expected in the future.


The SARS-CoV-2 lineage B.1.466.2 is considered to be an endemic variant and preceded the Delta variant as the dominant strain in Indonesia. This variant circulating in Indonesia was estimated to have emerged in Java Island since mid-June 2020 and to have evolved into two disproportional and distinct clades as of the end of August 2021. Most of the identified high-frequency non-synonymous mutations were found in the NSP3 and the spike protein, and the D614G/N439K/P681R co-mutations in the spike protein were observed. A four-phased trajectory of Ne was inferred, with an exponential growth from October 2020 to February 2021. The Rt was estimated to reach its peak in late December 2020 and to be less than one after early May 2021. Java and Sumatra successively acted as epi-centers of the Indonesian SARS-CoV-2 B.1.466.2 variant and formed stable transmission loops between them. A long-distance transmission link between the westernmost Sumatra and the easternmost Papua was inferred.

To our knowledge, this is the first comprehensive analysis of the genomic surveillance data of SARS-CoV-2 lineage B.1.466.2 in Indonesia, the world's largest archipelagic country. Our findings suggest that SARS-CoV-2 variants circulating in the tropical archipelago may follow unique patterns of evolution and transmission. Thus, continuous genomic surveillance with wider coverage and higher resolution is essential for the development of practicable and effective pandemic intervention strategies.

Availability of data and materials

The datasets supporting the conclusions of this article are included within the article and its additional files.



Coronavirus disease 2019


Severe acute respiratory syndrome coronavirus 2


World Health Organization


Association of Southeast Asian Nations


Variants of concern


Variants of interest


Whole-genome sequencing


Global initiative on sharing avian influenza data


Variants under monitoring


Maximum likelihood


Most recent common ancestor


Single nucleotide variants


Effective population size


Effective reproduction number

95% CI:

95% Confidence interval


Non-structural protein 3


Receptor binding domain


Angiotensin-converting enzyme 2

PPKM (Indonesian):

Community-level public activity restrictions

PSBB (Indonesian):

Large-scale social restrictions


  1. Lu R, Zhao X, Li J, Niu P, Yang B, Wu H, Wang W, Song H, Huang B, Zhu N, et al. Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding. Lancet. 2020;395(10224):565–74.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  2. WHO. Naming the coronavirus disease (COVID-19) and the virus that causes it. 2020.

  3. Zhu N, Zhang D, Wang W, Li X, Yang B, Song J, Zhao X, Huang B, Shi W, Lu R, et al. A novel coronavirus from patients with pneumonia in China, 2019. N Engl J Med. 2020;382(8):727–33.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  4. Cucinotta D, Vanelli M. WHO declares COVID-19 a pandemic. Acta Biomed. 2020;91(1):157–60.

    PubMed  PubMed Central  Google Scholar 

  5. WHO. Weekly epidemiological update on COVID-19 - 31 August 2021. 2021.

  6. ABVC. COVID-19 situational report in the ASEAN region (as of July 21,2021). 2021.

  7. Indonesia-MoH. Vaksinasi COVID-19 nasional. 2022.

  8. Dyer O. Covid-19: Indonesia becomes Asia’s new pandemic epicentre as delta variant spreads. BMJ. 2021;374:n1815.

    Article  PubMed  Google Scholar 

  9. Kumar A, Parashar R, Kumar S, Faiq MA, Kumari C, Kulandhasamy M, Narayan RK, Jha RK, Singh HN, Prasoon P, et al. Emerging SARS-CoV-2 variants can potentially break set epidemiological barriers in COVID-19. J Med Virol. 2021;94(4):1300–14.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  10. Oude MB, Worp N, Nieuwenhuijse DF, Sikkema RS, Haagmans B, Fouchier R, Koopmans M. The next phase of SARS-CoV-2 surveillance: real-time molecular epidemiology. Nat Med. 2021;27(9):1518–24.

    Article  CAS  Google Scholar 

  11. Rambaut A, Holmes EC, O’Toole A, Hill V, McCrone JT, Ruis C, du Plessis L, Pybus OG. A dynamic nomenclature proposal for SARS-CoV-2 lineages to assist genomic epidemiology. Nat Microbiol. 2020;5(11):1403–7.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  12. WHO. Tracking SARS-CoV-2 variants. 2021.

  13. Gunadi, Wibawa H, Marcellus, Hakim MS, Daniwijaya EW, Rizki LP, Supriyati E, Nugrahaningsih D, Afiahayati, Siswanto et al. Full-length genome characterization and phylogenetic analysis of SARS-CoV-2 virus strains from Yogyakarta and Central Java, Indonesia. PeerJ. 2020;8:e10575.

  14. Gunadi, Wibawa H, Hakim MS, Marcellus, Trisnawati I, Khair RE, Triasih R, Irene, Afiahayati, Iskandar K et al. Molecular epidemiology of SARS-CoV-2 isolated from COVID-19 family clusters. BMC Med Genomics. 2021; 14(1):144.

  15. Gunadi, Hakim MS, Wibawa H, Marcellus, Trisnawati I, Supriyati E, Afiahayati, Khair RE, Iskandar K, Siswanto et al. Association between prognostic factors and the outcomes of patients infected with SARS-CoV-2 harboring multiple spike protein mutations. Sci Rep 2021; 11(1):21352.

  16. Gunadi, Hakim MS, Wibawa H, Marcellus, Setiawaty V, Slamet, Trisnawati I, Supriyati E, El KR, Iskandar K et al. Is the infection of the SARS-CoV-2 delta variant associated with the outcomes of COVID-19 patients? Front Med (Lausanne) 2021;8:780611.

  17. Fibriani A, Stephanie R, Alfiantie AA, Siregar A, Pradani G, Yamahoki N, Purba WS, Alamanda C, Rahmawati E, Rachman RW, et al. Analysis of SARS-CoV-2 genomes from West Java, Indonesia. Viruses. 2021;13(10):2097.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  18. IMoH. IKHTISAR MINGGUAN COVID-19 (Indonesia, 14–20 Agustus 2021). 2021.

  19. Cahyani I, Putro EW, Ridwanuloh AM, Wibowo S, Hariyatun H, Syahputra G, Akbariani G, Utomo AR, Ilyas M, Loose M, et al. Genome Profiling of SARS-CoV-2 in Indonesia, ASEAN and the Neighbouring East Asian Countries: Features, Challenges and Achievements. Viruses. 2022;14(4):778.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  20. Sam IC, Chong YM, Abdullah A, Fu J, Hasan MS, Jamaluddin FH, Kamarulzaman A, Lim KK, Mohd NM, Pang YK, et al. Changing predominant SARS-CoV-2 lineages drives successive COVID-19 waves in Malaysia, February 2020 to March 2021. J Med Virol. 2021;94:1146–53.

    Article  PubMed  CAS  Google Scholar 

  21. Aisyah DN, Mayadewi CA, Igusti G, Manikam L, Adisasmito W, Kozlakidis Z. Laboratory readiness and response for SARS-Cov-2 in Indonesia. Front Public Health. 2021;9:705031.

    Article  PubMed  PubMed Central  Google Scholar 

  22. Ulfah M, Helianti I. Bioinformatic analysis of the whole genome sequences of SARS-CoV-2 from Indonesia. Iran J Microbiol. 2021;13(2):145–55.

    PubMed  PubMed Central  Google Scholar 

  23. Nidom RV, Indrasari S, Normalina I, Nidom AN, Afifah B, Dewi L, Putra AK, Ansori A, Kusala M, Alamudi MY, et al. Phylogenetic and full-length genome mutation analysis of SARS-CoV-2 in Indonesia prior to COVID-19 vaccination program in 2021. Bull Natl Res Cent. 2021;45(1):200.

    Article  PubMed  PubMed Central  Google Scholar 

  24. Lee J, Ryu JS. current status of parasite infections in indonesia: a literature review. Korean J Parasitol. 2019;57(4):329–39.

    Article  PubMed  PubMed Central  Google Scholar 

  25. Laud P. Confidence intervals for comparisons of binomial or poisson rates. 2018.

  26. Franceschi VB, Caldana GD, Mayer ADM, Cybis GB, Thompson CE. Genomic epidemiology of SARS-CoV-2 in Esteio, Rio Grande do Sul, Brazil. BMC Genomics. 2021.

    Article  PubMed  PubMed Central  Google Scholar 

  27. Price MN, Dehal PS, Arkin AP. FastTree 2–approximately maximum-likelihood trees for large alignments. PLoS ONE. 2010;5(3):e9490.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  28. Sagulenko P, Puller V, Neher RA. TreeTime: maximum-likelihood phylodynamic analysis. Virus Evol. 2018;4(1):x42.

    Article  Google Scholar 

  29. Rambaut A, Lam TT, Max CL, Pybus OG. Exploring the temporal structure of heterochronous sequences using TempEst (formerly Path-O-Gen). Virus Evol. 2016;2(1):w7.

    Article  Google Scholar 

  30. Jacob MD, Scott R, Guirales S, Janies DA. Fundamental evolution of all Orthocoronavirinae including three deadly lineages descendent from Chiroptera-hosted coronaviruses: SARS-CoV, MERS-CoV and SARS-CoV-2. Cladistics. 2021;37(5):461–88.

    Article  Google Scholar 

  31. Hadfield J, Megill C, Bell SM, Huddleston J, Potter B, Callender C, Sagulenko P, Bedford T, Neher RA. Nextstrain: real-time tracking of pathogen evolution. Bioinformatics. 2018;34(23):4121–3.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  32. Letunic I, Bork P. Interactive tree of life (iTOL) v4: recent updates and new developments. Nucleic Acids Res. 2019;47(W1):W256–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  33. Anne Cori SCNM. Estimate time varying reproduction numbers from epidemic curves. 2021.

  34. Alpert T, Brito AF, Lasek-Nesselquist E, Rothman J, Valesano AL, MacKay MJ, Petrone ME, Breban MI, Watkins AE, Vogels C, et al. Early introductions and transmission of SARS-CoV-2 variant B.1.1.7 in the United States. Cell. 2021;184(10):2595–604.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  35. Kraemer M, Hill V, Ruis C, Dellicour S, Bajaj S, McCrone JT, Baele G, Parag KV, Battle AL, Gutierrez B, et al. Spatiotemporal invasion dynamics of SARS-CoV-2 lineage B.1.1.7 emergence. Science. 2021;373(6557):889–95.

    Article  CAS  PubMed  Google Scholar 

  36. Cheng J. An R interface to leaflet maps. 2021.

  37. Indonesia-MoH. Pemerintah Tingkatkan Kapasitas Deteksi Genom Virus SARS-CoV-2. 2021.

  38. Fan XA, Xy B, Rac C, Yx A. Quantifying competitive advantages of mutant strains in a population involving importation and mass vaccination rollout. Infect Dis Model. 2021;6:988–96.

    Google Scholar 

  39. Parvez MK, Parveen S. Evolution and emergence of pathogenic viruses: past, present, and future. Intervirology. 2017;60(1–2):1–7.

    Article  PubMed  Google Scholar 

  40. Wikipedia. List of islands by population. 2021.

  41. Sy K, White LF, Nichols BE. Population density and basic reproductive number of COVID-19 across United States counties. PLoS ONE. 2021;16(4):e249271.

    Article  CAS  Google Scholar 

  42. Simmonds P. Reconstructing the origins of human hepatitis viruses. Philos Trans R Soc Lond B Biol Sci. 2001;356(1411):1013–26.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  43. Huang JC. Bats & coffee in sumatras rainforest. 2014.

  44. Struebig M. Building a future for borneos bats. 2009.

  45. Zhou P, Yang XL, Wang XG, Hu B, Zhang L, Zhang W, Si HR, Zhu Y, Li B, Huang CL, et al. A pneumonia outbreak associated with a new coronavirus of probable bat origin. Nature. 2020;579(7798):270–3.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  46. Joffrin L, Goodman SM, Wilkinson DA, Ramasindrazana B, Lagadec E, Gomard Y, Le Minter G, Dos SA, Corrie SM, Sookhareea R, et al. Bat coronavirus phylogeography in the Western Indian Ocean. Sci Rep. 2020;10(1):6873.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  47. Abdullahi IN, Emeribe AU, Mustapha JO, Fasogbon SA, Nwofe J. Exploring the genetics, ecology of SARS-COV-2 and climatic factors as possible control strategies against COVID-19. Le infezioni in medicina: rivista periodica di eziologia, epidemiologia, diagnostica, clinica e terapia delle patologie infettive. 2020;28(2):167–73.

    Google Scholar 

  48. Hassanin A, Tu VT, Curaudeau M, Csorba G. Inferring the ecological niche of bat viruses closely related to SARS-CoV-2 using phylogeographic analyses of Rhinolophus species. Sci Rep. 2021;11(1):14276.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  49. Zhu M, Shen J, Zeng Q, Tan JW, Kleepbua J, Chew I, Law JX, Chew SP, Tangathajinda A, Latthitham N, et al. Molecular phylogenesis and spatiotemporal spread of SARS-CoV-2 in Southeast Asia. Front Public Health. 2021;9:685315.

    Article  PubMed  PubMed Central  Google Scholar 

  50. Wang S, Xu X, Wei C, Li S, Zhao J, Zheng Y, Liu X, Zeng X, Yuan W, Peng S. Molecular evolutionary characteristics of SARS-CoV-2 emerging in the United States. J Med Virol. 2022;94(1):310–7.

    Article  CAS  PubMed  Google Scholar 

  51. Denison MR, Graham RL, Donaldson EF, Eckerle LD, Baric RS. Coronaviruses: an RNA proofreading machine regulates replication fidelity and diversity. RNA Biol. 2011;8(2):270–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  52. Lei J, Kusov Y, Hilgenfeld R. Nsp3 of coronaviruses: structures and functions of a large multi-domain protein. Antivir Res. 2018;149:58–74.

    Article  CAS  PubMed  Google Scholar 

  53. Kashima S, Slavov SN, Giovanetti M, Rodrigues ES, Patané JSL, Viala VL, Santos EV, Evaristo M, Lima LPOD, Martins AJ, et al. Introduction of SARS-COV-2 C.37 (WHO VOI lambda) in the Sao Paulo State, Southeast Brazil. J Med Virol. 2021;94:1206–11.

    Article  PubMed  CAS  Google Scholar 

  54. GISAID. Clade and lineage nomenclature aids in genomic epidemiology studies of active hCoV-19 viruses. 2021.

  55. Chakraborty C, Saha A, Sharma AR, Bhattacharya M, Lee SS, Agoramoorthy G. D614G mutation eventuates in all VOI and VOC in SARS-CoV-2: is it part of the positive selection pioneered by Darwin? Mol Ther Nucleic Acids. 2021;26:237–41.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  56. Yurkovetskiy L, Wang X, Pascal KE, Tomkins-Tinch C, Nyalile TP, Wang Y, Baum A, Diehl WE, Dauphin A, Carbone C, et al. Structural and functional analysis of the D614G SARS-CoV-2 spike protein variant. Cell. 2020;183(3):739–51.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  57. Ozono S, Zhang Y, Ode H, Sano K, Tan TS, Imai K, Miyoshi K, Kishigami S, Ueno T, Iwatani Y, et al. SARS-CoV-2 D614G spike mutation increases entry efficiency with enhanced ACE2-binding affinity. Nat Commun. 2021;12(1):848.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  58. Korber B, Fischer WM, Gnanakaran S, Yoon H, Theiler J, Abfalterer W, Hengartner N, Giorgi EE, Bhattacharya T, Foley B, et al. Tracking changes in SARS-CoV-2 spike: evidence that D614G increases infectivity of the COVID-19 virus. Cell. 2020;182(4):812–27.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  59. Lam JY, Yuen CK, Ip JD, Wong WM, To KK, Yuen KY, Kok KH. Loss of orf3b in the circulating SARS-CoV-2 strains. Emerg Microbes Infect. 2020;9(1):2685–96.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  60. Chu D, Hui K, Gu H, Ko R, Krishnan P, Ng D, Liu G, Wan C, Cheung MC, Ng KC, et al. Introduction of ORF3a-Q57H SARS-CoV-2 variant causing fourth epidemic wave of COVID-19, Hong Kong, China. Emerg Infect Dis. 2021;27(5):1492–5.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  61. Wang R, Chen J, Gao K, Hozumi Y, Yin C, Wei GW. Analysis of SARS-CoV-2 mutations in the United States suggests presence of four substrains and novel variants. Commun Biol. 2021;4(1):228.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  62. Thomson EC, Rosen LE, Shepherd JG, Spreafico R, Da SFA, Wojcechowskyj JA, Davis C, Piccoli L, Pascall DJ, Dillen J, et al. Circulating SARS-CoV-2 spike N439K variants maintain fitness while evading antibody-mediated immunity. Cell. 2021;184(5):1171–87.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  63. Zhou W, Xu C, Wang P, Luo M, Xu Z, Cheng R, Jin X, Guo Y, Xue G, Juan L, et al. N439K variant in spike protein alter the infection efficiency and antigenicity of SARS-CoV-2 based on molecular dynamics simulation. Front Cell Dev Biol. 2021;9:697035.

    Article  PubMed  PubMed Central  Google Scholar 

  64. Ramesh S, Govindarajulu M, Parise RS, Neel L, Shankar T, Patel S, Lowery P, Smith F, Dhanasekaran M, Moore T. Emerging SARS-CoV-2 variants: a review of its mutations, its implications and vaccine efficacy. Vaccines (Basel). 2021;9(10):1195.

    Article  CAS  Google Scholar 

  65. Liu Y, Liu J, Johnson BA, Xia H, Ku Z, Schindewolf C, Widen SG, An Z, Weaver SC, Menachery VD, et al. Delta spike P681R mutation enhances SARS-CoV-2 fitness over Alpha variant. Cell Rep. 2022;39(7):

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  66. Saito A, Irie T, Suzuki R, Maemura T, Nasser H, Uriu K, Kosugi Y, Shirakawa K, Sadamasu K, Kimura I, et al. Enhanced fusogenicity and pathogenicity of SARS-CoV-2 Delta P681R mutation. Nature. 2021;602:300–6.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  67. Wall EC, Wu M, Harvey R, Kelly G, Warchal S, Sawyer C, Daniels R, Hobson P, Hatipoglu E, Ngai Y, et al. Neutralising antibody activity against SARS-CoV-2 VOCs B16172 and B1351 by BNT162b2 vaccination. Lancet. 2021;397(10292):2331–3.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  68. Roy C, Mandal SM, Mondal SK, Mukherjee S, Mapder T, Ghosh W, Chakraborty R. Trends of mutation accumulation across global SARS-CoV-2 genomes: implications for the evolution of the novel coronavirus. Genomics. 2020;112(6):5331–42.

    Article  CAS  PubMed  Google Scholar 

  69. Karcher MD, Carvalho LM, Suchard MA, Dudas G, Minin VN. Estimating effective population size changes from preferentially sampled genetic sequences. Plos Comput Biol. 2020;16(10):e1007774.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  70. Smith MR, Trofimova M, Weber A, Duport Y, Kuhnert D, von Kleist M. Rapid incidence estimation from SARS-CoV-2 genomes reveals decreased case detection in Europe during summer 2020. Nat Commun. 2021;12(1):6009.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  71. Nishiura H, Chowell G. The effective reproduction number as a prelude to statistical estimation of time-dependent epidemic trends. In: Chowell G, Hyman JM, Bettencourt LMA, Castillo-Chavez C, editors. Mathematical and statistical estimation approaches in epidemiology. Springer: Dordrecht; 2009. p. 103–21.

    Chapter  Google Scholar 

  72. WHO. WHO Indonesia situation report - 69. 2021.

  73. OCHA. Situation update: response to COVID-19 in Indonesia (As of 1 March 2021). 2021.

  74. Kalia K, Saberwal G, Sharma G. The lag in SARS-CoV-2 genome submissions to GISAID. Nat Biotechnol. 2021;39(9):1058–60.

    Article  CAS  PubMed  Google Scholar 

  75. Li Y, Wang X, Nair H. global seasonality of human seasonal Coronaviruses: a clue for postpandemic circulating season of severe acute respiratory syndrome Coronavirus 2? J Infect Dis. 2020;222(7):1090–7.

    Article  CAS  PubMed  Google Scholar 

  76. Karapiperis C, Kouklis P, Papastratos S, Chasapi A, Ouzounis CA. A strong seasonality pattern for Covid-19 incidence rates modulated by UV radiation levels. Viruses. 2021;13(4):574.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  77. GitHub. Recent circulation of multiple sublineages of B.1.466.2 in Jakarta, Indonesia #405. 2022.

  78. Kai LC, Chong SN, Hori N, Rui Y, Lohse D. Extended lifetime of respiratory droplets in a turbulent vapor puff and its implications on airborne disease transmission. Phys Rev Lett. 2021.

    Article  Google Scholar 

  79. Merow C, Urban MC. Seasonality and uncertainty in global COVID-19 growth rates. Proc Natl Acad Sci. 2020;117(44):27456–64.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  80. Carleton T, Cornetet J, Huybers P, Meng KC, Proctor J. Global evidence for ultraviolet radiation decreasing COVID-19 growth rates. Proc Natl Acad Sci. 2021;118(1):e2012370118.

    Article  CAS  PubMed  Google Scholar 

  81. Supatmi S, Hou R, Sumitra ID. Study of hybrid neurofuzzy inference system for forecasting flood event vulnerability in Indonesia. Comput Intell Neurosci. 2019;2019:6203510.

    Article  PubMed  PubMed Central  Google Scholar 

  82. Shen X, Cai C, Yang Q, Anagnostou EN, Li H. The US COVID-19 pandemic in the flood season. Sci Total Environ. 2021;755(Pt 2):142634.

    Article  CAS  PubMed  Google Scholar 

  83. Simonovic SP, Kundzewicz ZW, Wright N. Floods and the COVID-19 pandemic-A new double hazard problem. WIREs Water. 2021;8(2):e1509.

    Article  PubMed  PubMed Central  Google Scholar 

  84. Lai S, Ruktanonchai NW, Zhou L, Prosper O, Luo W, Floyd JR, Wesolowski A, Santillana M, Zhang C, Du X, et al. Effect of non-pharmaceutical interventions to contain COVID-19 in China. Nature. 2020;585(7825):410–3.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  85. Liu Y, Wang Z, Rader B, Li B, Wu CH, Whittington JD, Zheng P, Stenseth NC, Bjornstad ON, Brownstein JS, Tian H. Associations between changes in population mobility in response to the COVID-19 pandemic and socioeconomic factors at the city level in China and country level worldwide: a retrospective, observational study. Lancet Digit Health. 2021;3(6):e349.

    Article  PubMed  PubMed Central  Google Scholar 

  86. Tuladhar R, Grigolini P, Santamaria F. The allometric propagation of COVID-19 is explained by human travel. Infect Dis Model. 2022;7(1):122–33.

    PubMed  Google Scholar 

  87. SuaraJakarta. 4 Aturan Baru PSBB Transisi Jakarta Dimulai Senin Hari Ini. 2020.

  88. KOMPAS: Ramainya Arus Mudik Natal dan Tahun Baru di Tengah Pandemi Covid-19. Halaman all. 2020.

  89. Mesle M, Hall IM, Christley RM, Leach S, Read JM. The use and reporting of airline passenger data for infectious disease modelling: a systematic review. Euro Surveill. 2019;24:31.

    Article  Google Scholar 

  90. Fu Y, Supriyadi A, Wang T. China’s outward FDI in Indonesia: spatial patterns and determinants. Sustainability-Basel. 2018;10(12):4632.

    Article  Google Scholar 

  91. Zhu M, Kleepbua J, Guan Z, Chew SP, Tan JW, Shen J, Latthitham N, Hu J, Law JX, Li L. Early Spatiotemporal Patterns and Population Characteristics of the COVID-19 Pandemic in Southeast Asia. Healthcare (Basel). 2021;9(9):1220.

    Article  Google Scholar 

  92. Sigler T, Mahmuda S, Kimpton A, Loginova J, Wohland P, Charles-Edwards E, Corcoran J. The socio-spatial determinants of COVID-19 diffusion: the impact of globalisation, settlement characteristics and population. Glob Health. 2021;17(1):56.

    Article  Google Scholar 

  93. Lam A, Duchene S. The impacts of low diversity sequence data on phylodynamic inference during an emerging epidemic. Viruses. 2021;13(1):79.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references


We gratefully acknowledge all the authors from the originating laboratories where genetic sequence data were generated and shared via the GISAID initiative, which has made this work possible. The Acknowledgment Table for GISAID is reported as Additional file 1.


The author(s) received no financial support for the research, authorship and/or publication of this article.

Author information

Authors and Affiliations



MZ, QZ and LL designed the study. MZ, QZ, BILS and IC participated in the data collection and curation. QZ, MZ, BILS, SPC and HF performed the analyses. MZ, QZ, BILS, SPC, IC and JWT drafted the initial manuscript. LL administered and supervised the study. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Lanjuan Li.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file1

Acknowledgment Table for GISAID listing GISAID IDs, Originating Labs, Submitting Labs and Authors.

Additional file2

Root-to-tip regression plot based on the maximum-likelihood phylogeny of SARS-CoV-2 lineage B.1.466.2 in Indonesia.

Additional file3

Inferred transmission frequencies of SARS-CoV-2 lineage B.1.466.2 among different Indonesian regions in each phase.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhu, M., Zeng, Q., Saputro, B.I.L. et al. Tracking the molecular evolution and transmission patterns of SARS-CoV-2 lineage B.1.466.2 in Indonesia based on genomic surveillance data. Virol J 19, 103 (2022).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: