Open Access

Reconstruction of epidemic curves for pandemic influenza A (H1N1) 2009 at city and sub-city levels

Virology Journal20107:321

Received: 11 September 2010

Accepted: 16 November 2010

Published: 16 November 2010


To better describe the epidemiology of influenza at local level, the time course of pandemic influenza A (H1N1) 2009 in the city of Hong Kong was reconstructed from notification data after decomposition procedure and time series analysis. GIS (geographic information system) methodology was incorporated for assessing spatial variation. Between May and September 2009, a total of 24415 cases were successfully geocoded, out of 25473 (95.8%) reports in the original dataset. The reconstructed epidemic curve was characterized by a small initial peak, a nadir followed by rapid rise to the ultimate plateau. The full course of the epidemic had lasted for about 6 months. Despite the small geographic area of only 1000 Km2, distinctive spatial variation was observed in the configuration of the curves across 6 geographic regions. With the relatively uniform physical and climatic environment within Hong Kong, the temporo-spatial variability of influenza spread could only be explained by the heterogeneous population structure and mobility patterns. Our study illustrated how an epidemic curve could be reconstructed using regularly collected surveillance data, which would be useful in informing intervention at local levels.


The time course of an infectious disease epidemic is one important piece of information for understanding the dynamics of pathogen transmission. For a localized outbreak, for example, food poisoning, an epidemic curve is often conveniently drawn during case investigation. Describing the time course of a country-wide epidemic is more complex, which is not uncommonly complicated by reporting delay, discrepant access to diagnostics, varied public perception and the influence of accompanying health-seeking behaviours. In time of an emerging pandemic, these obstacles pose a great challenge to our society, when a timely construction of an epidemic curve is desirable. The spread of pandemic (H1N1) 2009 was a case in point. When the pandemic first hit the population, most people were non-immune to the novel virus, albeit the presence of partial immunity in some older people[1]. The relative lack of airborne transmission implies that the dissemination of the virus could be shaped largely by population structures, their networking pattern and human mobility[2]. An epidemic curve, if constructed, should reflect these characteristics for supporting the design of effective public health control programs.

Because of the spatial variability of the population, it is hypothesized that the epidemic curves could vary significantly from place to place. In this study we set out to describe the time course of the H1N1 epidemic with a spatial context in Hong Kong, a South-Eastern Chinese territory of about 1000 Km2 in area. Since the diagnosis of the first case on 1 May 2009, all laboratory confirmed cases of pandemic (H1N1) 2009 were reported to the Government. Through the Centre for Health Protection, an anonymised dataset was obtained for the study, which included the age, gender and residential building location of each confirmed case. The residential address was transformed to x and y coordinates in Hong Kong Grid 1980 projection system. Geographically, Hong Kong can be divided into 18 districts and 400 District Council Constituency Areas (DCCA), each of the latter having an average of 17000 population for electoral purpose. ArcGIS version 9.2 was used for spatial exploration while time series analysis was performed to track the time course of the epidemic. A filtering procedure was applied to decompose the series into trend, seasonal and residual components (STL - seasonal trend decomposition procedure based on Loess), implemented on R[3]. Institutional approval for access to the data was obtained from HKSAR Department of Health, in compliance with the Personal Data (Privacy) Ordinance. Individual consent was deemed unnecessary in the analysis of collected surveillance data which did not involve primary data collection.

Overall, a total of 24415 pandemic (H1N1) 2009 cases were successfully geocoded, out of 25473 (95.8%) reported between May and September 2009. The male-to-female ratio was 1.07:1. There was marked heterogeneity in the geographic spread of the reported cases, ranging from 6 cases to 272 cases per DCCA (figure 1). Evaluating at district level, the number of reported cases ranged from below 30 to > 50 per 100,000 populations. In the absence of physical boundaries between geographic units, people are free to move within and across districts and DCCA in their daily activity. We redefined six geographic regions representing places separated by natural borders like mountains and water bodies, after exclusion of uninhabitable areas. The region boundaries, population size and demographic characteristics of pandemic influenza (H1N1) 2009 cases are given in figure 1 and Table 1. The attack rate, expressed as the reported number per 100 resident population was similar across all regions, despite the difference in case density. The proportion of students (defined as people of age 5-19) in the reported case was higher than adults (age 20-64), a pattern opposite to that in the general population(Table 2).
Figure 1

Spatial distribution of reported Pandemic influenza (H1N1) 2009 cases in Hong Kong from May to September 2009, by district council constituency area (DCCA). Uninhabitable areas, including land elevated over 200 meters and water bodies, are hashed. Table 1 displays the population characteristics and statistics on reported cases for the 6 geographic regions - Hong Kong Island, Kowloon, North West, Sai Kung, Shatin/Tai Po, South West, the boundaries of which are given in thick lines.

Table 1

Population characteristics by geographic region (2006 by-census data) in Hong Kong

Geographic regions

Area (km2)


Population density per km2

Proportional distribution


Infant (%) age 0-4

Student (%) age 5-19

Adult (%) age 20-64

Elderly (%) age > 64

Hong Kong Island








Kowloon Peninsula








Shatin & Taipo








Sai Kung








North West








South West








Table 2

Characteristics of Pandemic influenza A (H1N1) cases by geographic region (May-September 2009) in Hong Kong

Geographic regions

Attack rate (%)

No. of cases

Case density per km2

Proportional distribution


Infant (%) aged 0-4

Student (%) aged 5-19

Adult (%) aged 20-64

Elder (%) aged > 64

Hong Kong Island








Kowloon Peninsula








Shatin & Taipo








Sai Kung








North West








South West








An epidemic curve was drawn from the reported numbers in the original dataset (Figure 2 - upper panel). In this study, the parameters and components of STL function (Yt = Tt+St+Rt ) were: t as the time unit, from 1 to 153; Yt as the daily count of H1N1 cases on day t; Tt as the trend component; St as the seasonal component, using 7-day as the smoothing window to account for the weekly cycles adopted by laboratories in the testing and reporting of results (tests on 3-day and 14-day windows were performed yielding less satisfactory results); and Rt as the residual component. The final epidemic curve was reconstructed from the trend component after seasonal decomposition and the exclusion of residuals, through a sequence of operations employing Loess smoother. The Loess regression ĝ(x) smoothed y given x along the scale of the independent variable. The trend smoothing was computed in R by Loess[3]. The regression was locally weighted by V i ( x ) = W ( | X i X | λ q ( x ) ) where W was the weighted function, λq(x) was the q th farthest distance of xi from x, for i = 1 to n, with q as positive integer. The resultant trend (figure 2 lower panel) is characterized by a small peak at around Day 55-60, followed by a nadir and then rapid rise to the ultimate peak on Day 135. By defining students as those between the age of 5 and 19, the trend was plotted again using data on students alone and non-students, with the former bearing remarkable similarity to the all case series (results not shown). There were also marked variations across the 6 geographic regions in amplitude and configuration (figure 3). The early peak could only be seen in Kowloon, and less remarkably on Hong Kong Island region. The peak was reached in all 6 regions at around the same time, though the magnitude and the interval between onset and peak varied. The temporal profiles of residuals, constituted by the remains of the original dataset after seasonal and trend decomposition,[3] demonstrate that there were more spikes over time on Hong Kong Island and Kowloon Peninsula(figure 4).
Figure 2

Construction of epidemic curves using the original data (upper panel) and then by time series analysis after seasonal decomposition of time series by Loess (STL) and smoothing (lower panel). The procedure involved five steps of, firstly, detrending; secondly, cycle-subseries smoothing by loess; thirdly, low-pass filtering by moving average; fourthly, detrending by subtracting seasonal component; and finally, deseasonalizing.

Figure 3

Epidemic curves for Pandemic influenza (H1N1) 2009 by geographic regions - Hong Kong Island, Kowloon, North West, Sai Kung, Shatin/Tai Po, South West.

Figure 4

Pattern of the residuals after seasonal decomposition of time series by Loess (STL) for the 6 geographic regions - Hong Kong Island, Kowloon, North West, Sai Kung, Shatin/Tai Po, South West.

Our study illustrated how an epidemic curve of an influenza epidemic could be reconstructed using regularly collected surveillance data, after decomposition of the time series. STL was applied on the assumption that the seasonal components had been contributed by weekly cycles of laboratory workloads, and that residuals reflected local outbreaks especially in schools. The smoothed trend therefore depicts the time course of the pandemic over a five-month period. The full time course of the epidemic could not be drawn as reporting of individual cases of pandemic (H1N1) 2009 ceased to be mandated at the end of September 2009. Interestingly, though, the peak appeared to have been reached before the reporting mechanism ended. A review of the epidemic curve constructed from influenza-like illness (ILI) surveillance suggested that this first wave came to the trough as of the end of October[4]. It is therefore quite likely that the full course of the epidemic has lasted for about 6 months.

Despite the small area of the territory of Hong Kong, spatial variation was observed in its initial diffusion[5]. This phenomenon was further characterized in this study by the demonstration of variability in: (a) the time point at which influenza cases were first introduced, (b) the time for a critical mass of cases to become cumulated, before the epidemic kicked off, and (c) the amplitude of the regional epidemics. Of note were the small peak and a subsequent nadir in the initial part of the time course, which have been attributed to mitigation introduced by the Government through school closure[6]. The initial small peak was however not seen in all geographic regions. This may be explained by the differential pattern of virus transmissions in each region in view of the variation of population structures, or that there were higher uptake of reported cases in some locations against the background of considerable public attention, when the news of an impending epidemic first broke out[7]. On the other hand, the landscape of the epidemic curve thus constructed was contributed largely by infections in students, an observation made in other studies on pandemic (H1N1) as well as seasonal influenza[8, 9]. Against the background of a relatively uniform physical and climatic environment within Hong Kong, the temporo-spatial variability of influenza spread could only be explained by the heterogeneous population structure and mobility patterns.

Our study carried some limitations. Firstly we assumed that case reporting had been consistently executed over time. In the first five months, all clinically suspicious cases presenting to government clinical services and designated clinics were tested for the virus, alongside referrals from the private sector. While reporting can never be complete, the large number of cases reported (over 20,000) and the single public health agency supervising influenza surveillance in Hong Kong should have offset any inconsistency, thereby enhancing the robustness of the analysis. Secondly, STL decomposition was a filtering procedure based on an algorithm which may not have incorporated all determinants of the influenza transmission dynamics. The validity of the methods for field study would need to be further evaluated. Todate, STL has been used in syndromic surveillance, but with a different public health objective of detecting of outbreaks in the community[10]. In the development of an effective public health response, timeliness of the analysis and the use of regularly collected data are often crucial. In this connection, the algorithm described in this study has allowed the time course of a new epidemic to be drawn without resorting to sophisticated modeling techniques or simulations. The spatial variation in the time course of the pandemic (H1N1) 2009 was an important observation which may be further explored in context of strategies of public health interventions.



This study was partly funded by the Direct Grant of the Medical Faculty, The Chinese University of Hong Kong (project code: 2041533).

Dr SK Chuang and Dr Thomas Tsang of Centre for Health Protection, Department of Health of the Hong Kong SAR Government, are thanked for their support and assistance in enabling the georeferenced swine flu dataset to be available for the studies described in this report. Miss Mandy Li is thanked for her assistance in geocoding and data management.

Part of the contents in this work has been presented at (a) the Multinational Influenza Seasonal Mortality Study (MISMS) Oceania Regional Meeting and Workshop hosted by Fogarty International Center of the US NIH, 15-19 March 2010, Melbourne, Australia, and (b) Hong Kong Society for Infectious Diseases XIV Annual Scientific Meeting, 6 March 2010, Hong Kong.

Authors’ Affiliations

Stanley Ho Centre for Emerging Infectious Diseases, The Chinese University of Hong Kong


  1. Hancock K, Veguilla V, Lu X, Zhong W, Butler EN, Sun H, Liu F, Dong L, DeVos JR, Gargiullo PM, Brammer TL, Cox NJ, Tumpey TM, Katz JM: Cross-reactive antibody responses to the 2009 pandemic H1N1 influenza virus. New Engl J Med 2009, 361: 1945-1952. 10.1056/NEJMoa0906453PubMedView ArticleGoogle Scholar
  2. Han K, Zhu X, He F, Liu L, Zhang L, Ma H, Tang X, Huang T, Zeng G, Zhu BP: Lack of airborne transmission during outbreak of pandemic (H1N1) 2009 among tour group members, China, June 2009. Emerg Infect Dis 2009, 15: 1578-1581.PubMedPubMed CentralView ArticleGoogle Scholar
  3. Cleveland RB, Cleveland WS, Mcrae JE, Terpenning I: STL: a seasonal-trend decomposition procedure based on Loess. J Official Statistics 1990, 6: 3-73.Google Scholar
  4. Centre for Health Protection: Local situation on human swine influenza (Pandemic Influenza H1N1(2009)) as of March 24 2010. Swine and Seasonal Flu Monitor 2010, 2: 12.Google Scholar
  5. Lee SS, Wong NS: Characterizing the initial diffusion pattern of pandemic (H1N1) 2009 using surveillance data. PLoS Curr Influenza 2010, 10: RRN1151. 10.1371/currents.RRN1151Google Scholar
  6. Cowling BJ, Lau EH, M Ip DK, Ho LM, Tsang T, Chuang SK, Leung PY, Lo SV, Liu SH, Riley S: School closure and mitigation of pandemic (H1N1) 2009, Hong Kong. Emerg Infect Dis 2010, 16: 538-541.PubMedPubMed CentralView ArticleGoogle Scholar
  7. Baxter R: Surveillance lessons from first-wave pandemic (H1N1) 2009, Northern California, USA. Emerg Infect Dis 2010, 16: 504-506. 10.3201/eid1603.091285PubMedPubMed CentralView ArticleGoogle Scholar
  8. Kar-Purkayastha I, Ingram C, Maguire H, Roche A: The importance of school and social activities in the transmission of influenza A(H1N1): England, April - June 2009. Euro surveill 2009, 14: pii: 19311.Google Scholar
  9. Foy HM, Hall C, Cooney MK, Allan I, Fox JP: Influenza surveillance by age and target group. Am J Epidemiol 1979, 109: 582-587.PubMedGoogle Scholar
  10. Hafen RP, Anderson DE, Cleveland WS, Maciejewski R, Ebert DS, Abusalah A, Yakout M, Ouzzani M, Grannis SJ: Syndromic surveillance: STL for modeling, visualizing, and monitoring disease counts. BMC Med Inform Decis Mak 2009, 9: 21. 10.1186/1472-6947-9-21PubMedPubMed CentralView ArticleGoogle Scholar


© Lee and Wong; licensee BioMed Central Ltd. 2010

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.