- Short report
- Open Access
Genotyping of high-risk anal human papillomavirus (HPV): ion torrent-next generation sequencing vs. linear array
Virology Journalvolume 14, Article number: 112 (2017)
Our next generation sequencing (NGS)-based human papillomavirus (HPV) genotyping assay showed a high degree of concordance with the Roche Linear Array (LA) with as little as 1.25 ng formalin-fixed paraffin-embedded-derived genomic DNA in head and neck and cervical cancer samples. This sensitive genotyping assay uses barcoded HPV PCR broad-spectrum general primers 5+/6+ (BSGP)5+/6+ applicable to population studies, but it’s diagnostic performance has not been tested in cases with multiple concurrent HPV infections.
We conducted a cross-sectional study to compare the positive and negative predictive value (PPV and NPV), sensitivity and specificity of the NGS assay to detect HPV genotype infections as compared to the LA. DNA was previously extracted from ten anal swab samples from men who have sex with men in Nigeria enrolled on the TRUST/RV368 cohort study. Two-sample tests of proportions were used to examine differences in the diagnostic performance of the NGS assay to detect high vs. low-risk HPV type-specific infections.
In total there were 94 type-specific infections detected in 10 samples with a median of 9.5, range (9 to 10) per sample. Using the LA as the gold standard, 84.4% (95% CI: 75.2–91.2) of the same anal type-specific infections detected on the NGS assay had been detected by LA. The PPV and sensitivity differed significantly for high risk (PPV: 90%, 95% CI: 79.5–96.2; sensitivity: 93.1%, 95% CI: 83.3–98.1) as compared to low risk HPV (PPV: 73%, 95% CI: 54.1–87.7; sensitivity: 61.1, 95% CI: 43.5–76.9) (all p < 0.05). The NPV for all types was 92.5% (95% CI: 88.4–95.4). The NPV and specificity were similar for high and low risk HPVs (all p > 0.05). The NGS assay detected 10 HPV genotypes that were not among the 37 genotypes found on LA (30, 32, 43, 44, 74, 86, 87, 90, 91, 114).
The NGS assay accurately detects multiple HPV infections in individual clinical specimens with limited sample volume and has extended coverage compared to LA.
Identifying type-specific infections of human papillomavirus (HPV) is fundamental for understanding carcinogenic risk in epidemiologic studies, particularly in high burden areas such as sub-Saharan Africa. We recently developed a next-generation sequencing (NGS)-based HPV genotyping assay that shows 92% concordance on type-specific infections with the Roche Linear Array (LA) in cases with head and neck or cervical cancer . The assay uses barcoded HPV PCR broad-spectrum general primers 5+/6+ (BSGP5+/6+)  that have a high sensitivity for HPV type specific infections and for multiple HPV infections as compared to earlier versions of the primer set (GP5+/6+) . Prior studies evaluating the benefits of next generation sequencing for the detection of HPV have incorporated different primer sets [3,4,5,6,7] or used whole genome sequencing [8,9,10] for viral discovery and have not evaluated the effectiveness of the BSGP5+/6+ primer set in a next generation sequencing assay as compared to the PGMY09/11 primer set used in LA. Some studies have shown that these broad-spectrum primer-based PCR methods may be less sensitive at detecting multiple infections [11, 12]. Our prior study could not test the performance of the NGS assay in samples with multiple infections because the head and neck squamous cell carcinomas and cervical cancer specimens tested were primarily associated with a single HPV type-specific infection, with the exception of one sample that had two type-specific infections. Therefore, the objective of this study was to estimate the diagnostic performance of a NGS assay that employs BSGP5+/6+ primers as compared to the LA to detect a high number of concurrent HPV type-specific infections in anal swab samples collected from an epidemiologic study of men who have sex with men.
Genomic DNA was previously extracted from anal swab samples and genotyped on LA to determine the prevalence of HPV type-specific infections between HIV-positive and HIV-negative men who have sex with men in Abuja, Nigeria . In brief, DNA was extracted from 250 μL of Aptima Specimen Transport medium (Hologic, San Diego, CA) using the QIAmp MinElute Media Kit (Qiagen, Valencia, CA). DNA was resuspended in 100 μL of Buffer AVE, quantified by NanoDrop (ng/μL) and stored at -200 C. A 10-μL aliquot of the purified DNA was amplified using the PGMY 09/11 L1 consensus primer system which co-amplifies 37 HPV genotypes and a human β-globin internal control target. Both high- and low-risk HPV genotypes were detected using the LA (Roche Diagnostics, Indianapolis IN). High-risk HPV included 13 type specific infections: 16, 18, 31, 35, 39, 45, 51, 52, 56, 58, 59, and 68 . For any HPV (both low and high-risk), the median number of infections was 4, (interquartile range [IQR]: 2–8, range: 0–15); 38% (59/154) of the specimens had 2–5 infections and 38% (58/154) had 6–15 infections. Approximately 49% (76/154) of the specimens had two or more high-risk HPV infections (median: 1, IQR:0–3, range 0–7).
For the present study, 10 samples of genomic DNA positive for multiple high-risk HPV infections were selected. Quantified genomic DNA (20 ng) were included in HPV library amplification as previously described . In brief, all samples were included in the sequencing pool at a standardized concentration of ~500pM, as determined by the BioAnalyzer. Samples without library product detection were included in the pool at equal volumes. Pooled samples were purified and then quantified for emulsion template preparation on the Qubit 2.0 Fluorometer and prepared using Ion personal genome machine (PGM) 200 kits on the OneTouch 2. Sequencing was performed on the Ion Torrent PGM using the 200 v2 sequencing chemistry and 316v2 chips.
Data processing was performed by the ion torrent server, using Torrent Suite v4.4.3, and mapped to the full genomic sequences of HPV downloaded from the Papillomavirus Episteme (PaVE) database with a minimum score of AQ17. Further filtering of only reads >100 bp was performed using NGSUtils. A sample contained >5000 reads to be included in the analysis and the reads for each type specific infection accounted for more than 0.05% of the total number of reads to be called positive.
The median and interquartile range of the number of genotypes identified per sample were estimated for any HPV infection, high-risk HPV, and low-risk HPV infection. To compare the NGS-HPV genotyping assay with LA, the presence or absence of 33 type-specific infections that were detectable by both assays were included in the analysis (6, 11, 16, 18, 26, 31, 33, 35, 39, 40, 42, 45, 51, 53, 54, 56, 58, 59, 61, 62, 66, 67, 68, 69, 70, 71, 72, 73, 81, 82, 83, 84, 89. Bolding indicates high-risk). There were 4 type-specific infections (52, 55, 64, and IS39 (82 subtype) that were not included in the comparison. In the LA assay, HPV 52 cross reacts with 35 and 58. If either 35 or 58 are present, then 52 may be underdiagnosed in the LA assay. For the other three genotypes, genomic sequences were not present in the PaVE database and were not available for mapping and therefore could possibly be underdiagnosed in the NGS assay. In total there were 330 type-specific infections that could be detected by both assays.
To estimate the ability of the NGS assay to detect the same types of infections detected on LA, we calculated the positive predictive value [(true positive/(true positive + false positive))*100] and the negative predictive value [(true negative/(true negative + false negative))*100] for any HPV, and separately for high-risk HPV and low-risk HPV with the associated 95% confidence intervals. To estimate the ability of the NGS assay to correctly identify those with and without HPV genotypes, we calculated the sensitivity [(true positive/(true positive + false negative))*100] and the specificity [(true negative/(true negative + false positive))*100] for any HPV, and separately for high-risk HPV and low-risk HPV with the associated 95% confidence intervals. Two-sample tests of proportions were used to compare the performance of the NGS-HPV assay for high and low risk HPV infections relative to the LA. Analyses were performed using Stata Statistical Software: Release 13 (College Station, TX: Stata Corp LP).
The median number of HPV type-specific infections in the 10 samples as detected by LA was 9.5 (IQR: 9–10), the median number of high risk HPV was 5 (IQR: 5–7), and the median number of low risk HPV was 4 (IQR: 2–4) (Fig. 1). In total, the NGS assay detected 76 of the 94 type-specific infections detected by LA. Using the LA as the gold standard, 84.4% (95% CI: 75.2–91.2) of the same HPV type-specific infections detected on the NGS assay had been detected on the LA (Table 1). The NGS HPV assay was more likely to detect the same anal high risk HPV (PPV: 90.0%, 95% CI: 79.5–96.2) as compared to the low risk HPV (PPV:73.3%, 95% CI: 54.1–87.7) (p = 0.04). The sensitivity of the NGS assay was higher for high risk HPV as compared to low risk infections (p = 0.0001). Interestingly, the NGS HPV assay never detected any of the low risk HPV types 84 and 89. If we exclude these two types, the sensitivity of the NGS assay becomes 78.6%, 95% CI: 59.0–91.7. The NPV for all types was 92.5% (95% CI: 88.4–95.4) and the NPV and specificity were similar for low and high risk HPVs (p = 0.80, p = 0.16, respectively). Overall, the NGS HPV assay detected an additional 10 HPV genotypes that were not among the 37 genotypes found on the LA (30, 32, 43, 44, 74, 86, 87, 90, 91, 114) (Fig. 1).
Our study confirms that the NGS assay was able to detect multiple HPV infections, particularly the high risk type-specific infections. The NGS assay detected 90% of the same high risk HPV as the LA and its sensitivity or true positive rate for high-risk HPV was 93%. More specifically, the NGS assay can detect HPV-52, a high risk HPV that is underdiagnosed on the LA because it cross reacts with other type-specific infections. For the true negative infections, the NGS assay was able to detect 93% of the same types not detected on the LA. The NPV and the specificity were similar regardless if they were low or high risk infections. These findings are consistent with an earlier study that found that the additional eight forward broad-spectrum primers and two backward primers significantly improved the ability of this primer set to detect multiple infections .
The Ion Torrent NGS assay may not be as sensitive for detecting all HPV type-specific infections as the new multiplex type-specific HPV E7 PCR system [11, 12], but its purpose is to detect the known oncogenic type-specific infections from small amounts of DNA for multiple samples in longitudinal studies. The slightly lower positive predictive value for all types of HPV was in part driven by the low PPV for low risk infections. For epidemiologic studies focused on high risk infections, the NGS assay is a suitable alternative to the LA. For studies that have valuable archived samples, it is particularly advantageous as a genotyping assay because it uses as little as 20 ng of genomic DNA. For men, the prevalence of high risk HPV remains high and does not significantly decline with age as seen in women [15, 16]. A better understanding of gender differences in the natural history of HPV could be facilitated by diagnostics that require small sample volumes.
There is a possible limitation to our study. We oversampled specimens that had a large number of high-risk HPV infections which may have reduced the number of low-risk types included in the total sample. This may in turn have affected our estimates of the positive predictive value and sensitivity of the assay to detect low-risk HPV infections. Still, in larger studies, the capability to detect multiple oncogenic type-specific infections is more relevant to understanding risk factors for progression to cancer.
Despite this limitation, our study demonstrates that the NGS assay is comparable to the LA in detecting multiple high-risk HPV infections. The NGS assay could provide a diagnostic tool for large epidemiologic studies evaluating the natural history of HPV and identifying those most at risk of developing HPV-associated malignancies. The reagent and material cost per sample is significantly lower with the NGS assay, although that will have to be balanced against the capital investment in the Ion Torrent platform.
Broad-spectrum general primers
Roche Linear Array
Next generation sequencing
Negative predictive value
Personal genome machine
Positive predictive value
Ambulos Jr NP, Schumaker LM, Mathias TJ, White R, Troyer J, Wells D, et al. Next-Generation Sequencing-Based HPV Genotyping Assay Validated in Formalin-Fixed, Paraffin-Embedded Oropharyngeal and Cervical Cancer Specimens. J Biomol Tech. 2016;27:46–52.
Schmitt M, Dondog B, Waterboer T, Pawlita M. Homogeneous amplification of genital human alpha papillomaviruses by PCR using novel broad-spectrum GP5+ and GP6+ primers. J Clin Microbiol. 2008;46:1050–9.
Ekstrom J, Bzhalava D, Svenback D, Forslund O, Dillner J. High throughput sequencing reveals diversity of Human Papillomaviruses in cutaneous lesions. Int J Cancer. 2011;129:2643–50.
Yi X, Zou J, Xu J, Liu T, Liu T, Hua S, et al. Development and validation of a new HPV genotyping assay based on next-generation sequencing. Am J Clin Pathol. 2014;141:796–804.
Arroyo LS, Smelov V, Bzhalava D, Eklund C, Hultin E, Dillner J. Next generation sequencing for human papillomavirus genotyping. J Clin Virol. 2013;58:437–42.
Flores-Miramontes MG, Torres-Reyes LA, Alvarado-Ruiz L, Romero-Martinez SA, Ramirez-Rodriguez V, Balderas-Pena LM, et al. Human papillomavirus genotyping by Linear Array and Next-Generation Sequencing in cervical samples from Western Mexico. Virol J. 2015;12:161-015-0391-4.
Barzon L, Militello V, Lavezzo E, Franchin E, Peta E, Squarzon L, et al. Human papillomavirus genotyping by 454 next generation sequencing technology. J Clin Virol. 2011;52:93–7.
Bzhalava D, Johansson H, Ekstrom J, Faust H, Moller B, Eklund C, et al. Unbiased approach for virus detection in skin lesions. PLoS One. 2013;8:e65953.
Cullen M, Boland JF, Schiffman M, Zhang X, Wentzensen N, Yang Q, et al. Deep sequencing of HPV16 genomes: A new high-throughput tool for exploring the carcinogenicity and natural history of HPV16 infection. Papillomavirus Res. 2015;1:3–11.
Meiring TL, Salimo AT, Coetzee B, Maree HJ, Moodley J, Hitzeroth II, et al. Next-generation sequencing of cervical DNA detects human papillomavirus types not detected by commercial kits. Virol J. 2012;9:164-422X-9-164.
Schmitt M, Dondog B, Waterboer T, Pawlita M, Tommasino M, Gheit T. Abundance of multiple high-risk human papillomavirus (HPV) infections found in cervical cells analyzed by use of an ultrasensitive HPV genotyping assay. J Clin Microbiol. 2010;48:143–9.
Clifford GM, Vaccarella S, Franceschi S, Tenet V, Umulisa MC, Tshomo U, et al. Comparison of Two Widely Used Human Papillomavirus Detection and Genotyping Methods, GP5+/6 + −Based PCR Followed by Reverse Line Blot Hybridization and Multiplex Type-Specific E7-Based PCR. J Clin Microbiol. 2016;54:2031–8.
Nowak RG, Gravitt PE, He X, Ketende S, Dauda W, Omuh H, et al. Prevalence of Anal High-Risk Human Papillomavirus Infections Among HIV-Positive and HIV-Negative Men Who Have Sex With Men in Nigeria. Sex Transm Dis. 2016;43:243–8.
Munoz N, Bosch FX, de Sanjose S, Herrero R, Castellsague X, Shah KV, et al. Epidemiologic classification of human papillomavirus types associated with cervical cancer. N Engl J Med. 2003;348:518–27.
Chin-Hong PV, Vittinghoff E, Cranston RD, Buchbinder S, Cohen D, Colfax G, et al. Age-Specific prevalence of anal human papillomavirus infection in HIV-negative sexually active men who have sex with men: the EXPLORE study. J Infect Dis. 2004;190:2070–6.
Bruni L, Diaz M, Castellsague X, Ferrer E, Bosch FX, de Sanjose S. Cervical human papillomavirus prevalence in 5 continents: meta-analysis of 1 million women with normal cytological findings. J Infect Dis. 2010;202:1789–99.
We would like to thank the men who participated as well as the dedicated staff in the TRUST/RV368 cohort study.
The research reported in this publication was supported by UMGCC P30 grant under award number (P30 CA 134274–04). The samples used in this study originated from the TRUST/RV368 cohort study supported by the US National Institutes of Health under award number R01MH099001-01 and R01AI120913-01, the US Military HIV Research Program (Grant No. W81XWH-07-2-0067), Fogarty AITRP (D43TW01041), and the President’s Emergency Plan for AIDS Relief through cooperative agreement U2G IPS000651 from the HHS/Centers for Disease Control and Prevention, Global AIDS program with Institute of Human Virology Nigeria. The content is solely the responsibility of the authors and does not necessarily reflect the official views of the Department of Defense, National Institutes of Health, or other funders.
Availability of data and material
All data generated and analyzed during this study are included in this published article as part of its Additional file 1.
RN, NA, and KC designed the study. TM generated the HPV genotyping data. RN and SB conceived and analyzed the data. RN, NP, LS, SB, KC drafted the manuscript and TM, RW, JT, DW, and MC provided critical review and editing. All authors have seen and approved the paper.
The authors declare that they have no competing interests.
Consent for publication
All of the data included in this paper was removed of any identifiable characteristics to preserve the confidentiality of the individual participants.
Ethics approval and consent to participate
The research reported was approved by the University of Maryland Baltimore Institutional Review Board. In addition, for the cohort study, approval was obtained from the Federal Capital Territory Health Research Ethics Committee in Nigeria and the Walter Reed Army Institute of Research Institutional Review Board.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The tabular data used in the analyses for this manuscript is included as an excel file. This data includes both the LA and NGS results for each of the types detected on the 10 samples. (XLS 57 kb)