Whole genome sequencing and comparative genomic analyses of two Vibrio cholerae O139 Bengal-specific Podoviruses to other N4-like phages reveal extensive genetic diversity
- Derrick E Fouts9,
- Jochen Klumpp6Email author,
- Kimberly A Bishop-Lilly7, 8,
- Mathumathi Rajavel9,
- Kristin M Willner7, 8,
- Amy Butani7, 8,
- Matthew Henry7, 8,
- Biswajit Biswas7, 8,
- Manrong Li6,
- M John Albert7,
- Martin J Loessner6,
- Richard Calendar8 and
- Shanmuga Sozhamannan7, 8, 9, 5
© Fouts et al.; licensee BioMed Central Ltd. 2013
Received: 4 February 2013
Accepted: 21 May 2013
Published: 28 May 2013
Vibrio cholerae O139 Bengal is the only serogroup other than O1 implicated in cholera epidemics. We describe the isolation and characterization of an O139 serogroup-specific phage, vB_VchP_VchO139-I (ϕVchO139-I) that has similar host range and virion morphology as phage vB_VchP_JA1 (ϕJA1) described previously. We aimed at a complete molecular characterization of both phages and elucidation of their genetic and structural differences and assessment of their genetic relatedness to the N4-like phage group.
Host-range analysis and plaque morphology screening were done for both ϕJA1 and ϕVchO139-I. Both phage genomes were sequenced by a 454 and Sanger hybrid approach. Genomes were annotated and protein homologies were determined by Blast and HHPred. Restriction profiles, PFGE patterns and data on the physical genome structure were acquired and phylogenetic analyses were performed.
The host specificity of ϕJA1 has been attributed to the unique capsular O-antigen produced by O139 strains. Plaque morphologies of the two phages were different; ϕVchO139-I produced a larger halo around the plaques than ϕJA1. Restriction profiles of ϕJA1 and ϕVchO139-I genomes were also different. The genomes of ϕJA1 and ϕVchO139-I consisted of linear double-stranded DNA of 71,252 and 70,938 base pairs. The presence of direct terminal repeats of around 1974 base pairs was demonstrated. Whole genome comparison revealed single nucleotide polymorphisms, small insertions/deletions and differences in gene content. Both genomes had 79 predicted protein encoding sequences, of which only 59 were identical between the two closely related phages. They also encoded one tRNA-Arg gene, an intein within the large terminase gene, and four homing endonuclease genes. Whole genome phylogenetic analyses of ϕJA1 and ϕVchO139-I against other sequenced N4-like phages delineate three novel subgroups or clades within this phage family.
The closely related phages feature significant genetic differences, in spite of being morphologically identical. The phage morphology, genetic organization, genomic content and large terminase protein based phylogeny support the placement of these two phages in the Podoviridae family, more specifically within the N4-like phage group. The physical genome structure of ϕJA1 could be demonstrated experimentally. Our data pave the way for potential use of ϕJA1 and ϕVchO139-I in Vibrio cholerae typing and control.
KeywordsVibrio cholerae O139 N4-like virus Genome comparison Terminal repeats Intein Phylogenetic relationship
Cholera is an acute diarrheal disease caused by ingestion of food or water contaminated with the Gram-negative bacterium Vibrio cholerae. These organisms reside and survive within environmental reservoirs during inter-epidemic periods and cause seasonal outbreaks in cholera endemic countries. Cholera is a major concern in parts of the world lacking adequate sanitary infrastructure, especially during mass migration of populations due to human driven or natural disasters, as was evident during the 2010 cholera epidemics in Haiti [1–3]. V. cholerae has a short incubation period and produces an enterotoxin that causes a copious, painless, watery diarrhea that can quickly lead to severe dehydration and death, if treatment is not promptly administered. Up to 80% of cases can be successfully treated with oral rehydration salts [4, 5]. Since 2000, the incidence of cholera has increased steadily, culminating in 589,854 reported cases from 58 countries worldwide, including 7816 deaths in 2011, with a case-fatality rate (CFR) of 1.3% [4, 5]. However, according to the World Health Organization, an estimated 3–5 million cholera cases and 100,000–120,000 deaths occur every year, underscoring the severity of this disease [4, 5].
More than 200 serogroups of V. cholerae are known . However, all seven pandemics in the recorded history of cholera have been caused by V. cholerae strains of the O1 serogroup. The 6th and 7th pandemic O1 strains belong to the Classical and El Tor biotypes, respectively . V. cholerae O139 Bengal, a novel pathogen that emerged in late 1992, has remained endemic in South Asia . There is overwhelming genetic evidence supporting the hypothesis that the O139 serogroup emerged from an O1 El Tor strain by horizontal transfer of the O-antigen gene cluster [9, 10]. The evolution of V. cholerae, as a pathogen, is fairly well understood. Two major virulence factors; CTX (encoding the cholera toxin) and TcpA (encoding the toxin coregulated pilus) and the master regulatory factor, ToxT have been extensively characterized [11, 12]. In a landmark study, the genes encoding cholera toxin, ctxAB, were shown to be carried on, and horizontally transferred by, a filamentous phage, ϕCTX .
Successful epidemiological surveillance and public health intervention of disease outbreaks (such as containment of the disease spread and administration of medical countermeasures) depend on rapid, accurate and inexpensive identification of the etiological agent. Besides the potential therapeutic use of phages, the specificity of phage-bacterial interaction has been exploited for accurate identification of bacteria. Phage-typing schemes have been in use for identification of many bacterial pathogens with high specificity, the most notable being Salmonella, Mycobacterium and Listeria[14–16]. Phage-typing schemes have also been in use for sub typing Classical and El Tor V. cholerae strains. These schemes have been periodically updated with new phages to address the changing genetic and epidemiological profiles of V. cholerae strains during yearly epidemics [17, 18]. During the V. cholerae O139 outbreaks of 1992 in the Indian subcontinent, a phage designated as ϕJA1 belonging to the family Podoviridae, which specifically infects O139 strains, was isolated in Dhaka . Several additional phages specific for this new pathogen were subsequently isolated, and a new phage typing scheme for V. cholerae O139 has been put forth .
The V. cholerae O139-specific ϕJA1 has been further characterized. It was hypothesized that the O139 O-antigen is the phage receptor since mutants lacking the O-antigen were found to be phage resistant [9, 19]. Furthermore, a phage-encoded lyase enzyme was shown to degrade the O139 polysaccharide [21, 22]. However, neither has the genome sequence of this phage been determined nor has the gene encoding the lyase enzyme been identified yet.
Here we present the genetic characterization of another V. cholerae O139 Bengal-specific phage, ϕVchO139-I, isolated in 1999 and compare it to ϕJA1 isolated in 1992. ϕVchO139-I is a lytic, double-stranded DNA phage, similar but not identical to ϕJA1 . We also present the genomic sequences of phages ϕJA1 and ϕVchO139-I and compare their genetic organization and genomic content to seventeen other currently available phages of the N4-like group.
Results and discussion
Isolation and characterization of V. cholerae phage ϕVchO139-I
ϕJA1 was isolated from the same location seven years prior to the isolation of ϕVchO139-I. Also, ϕVchO139-I has the same host range as ϕJA1. Like ϕJA1, ϕVchO139-I infected all tested strains of V. cholerae serogroup O139, but failed to infect any other serogroup strain (serogroup O1-O206) (Additional file 1: Figure S1) or O139 Bengal acapsular mutants (data not shown). ϕJA1 is known to produce a characteristic halo around the plaque postulated to be due to the degradation of O139 capsular polysaccharide by a polysaccharide lyase secreted from phage-infected and lysed bacteria [21, 22].
Based on electron micrographic DNA length measurements, the genome size of ϕVchO139-I was estimated to be 69.9 kb (data not shown). PFGE estimate indicated a genome size of approximately 72 kb for both phages ϕJA1 and ϕVchO139-I (Additional file 1: Figure S2A). Restriction fragment length polymorphism (RFLP) patterns of the genomes of ϕJA1 and ϕVchO139-I with several enzymes were different (HpaII, ClaI, XmnI) whereas with some other enzymes (EcoRI and HindIII) they were identical (Additional file 1: Figure S2B).
Identification of virion structural proteins
Despite the fact that phages ϕJA1 and ϕVchO139-I exhibit differences in halo size of their plaques, we did not observe any difference in the relative concentration of any protein in the gel. Based on all these structural characteristics, ϕJA1 and ϕVchO139-I appeared to be very similar phages with minor differences.
Genome sequencing of ϕJA1 and ϕVchO139I
In order to decipher the genetic differences between the two phages, we carried out whole genome sequencing of the two phages. An initial draft genome of ϕJA1 and ϕVchO139-I was obtained by Sanger-based whole genome shotgun sequencing with finishing reactions performed for ϕJA1. The Sanger assembled single contigs of ϕJA1 and ϕVchO139-I genomes were 69,755 and 68,268 bps in size respectively. We reasoned that the genome was incomplete at this stage because of the discrepancy we observed between the genome sizes of the assembled genomes and the PFGE size estimate (~72 kb) as well as restriction fragments not accounted for by the assembly. Since it is possible that some of the clones were toxic to E. coli and hence were excluded, a cloning-independent NextGen sequencing approach was taken to complete the DNA sequencing of these genomes. To accomplish this, an additional 502,102 (ϕJA1) and 507,297 (ϕVchO139-I) 454 pyrosequencing reads were generated. De novo assembly of a subset of 15,000 Roche/454 reads each resulted in single contigs of 69,751 bp and 69,354 bp for ϕJA1 and ϕVchO139-I respectively. All 454 and for ϕJA1 additional Sanger reads were mapped back to these contigs. Regions of low coverage were confirmed by Sanger sequencing.
Determination of the structure and linearity of ϕJA1 and ϕVchO139-I genomic ends
Because the ϕJA1 genome sequence had a better quality assembly, having had more successful finishing reactions and fewer sequence ambiguities, it was used to determine the sequence of the genome ends. The resultant mapping assembly revealed a pile-up of 454 reads spanning the terminal 1,974 base pairs at the 3′ end of the assembly with twice the sequence depth of the remainder of the genome. This suggested that this specific sequence is present twice in the phage genome (Additional file 1: Figure S3), which could explain the size discrepancy. It is well known that N4-like phages have terminal repeats, but they are usually not identical.
Analysis of terminal restriction enzyme fragments and primer walks were used to partially determine the structure of the ϕJA1 genomic ends. We were unable to clone the terminal restriction fragments of the genome in order to obtain sequences corresponding to the putative genomic ends that could be mapped onto the assembly. Bal31 nuclease pre-treatment was used to help identify the true physical ends of the genome as described previously ; however, no progressive shortening of any restriction fragment was observed, indicating that the ϕJA1 and ϕVchO139-I genomic ends are inaccessible to Bal31 enzyme. A pre-digestion with proteinase K also did not improve Bal31 accessibility to the ends, indicating the absence of a covalently bound terminal protein (data not shown). It is possible that inter or intra molecular circularization of the genomes prevent Bal31 entry. Hence, at present, the reason for Bal31 resistance of the phage genomic termini is unclear.
To determine whether the ϕJA1 genome is linear or circularly permuted and whether the ends are fixed or variable, restriction enzyme analyses were performed. Restriction digestion of ϕJA1 DNA produced two distinct fragments when cut with enzymes that have single recognition sites (SalI, PauI and NheI) and three distinct fragments when cut with enzymes that have two sites such as ClaI (data not shown). ϕVchO139-I DNA was cut into two distinct fragments when single cutters ClaI and PauI were used (data not shown). Both ϕJA1 and ϕVchO139-I DNAs produced three fragments upon double digestion with enzymes that have a unique restriction site (e.g., XhoI-AhdI, XhoI-SalI and XhoI-SphI) (Additional file 1: Figure S2A). These findings clearly indicate a fixed linear genome structure without circular permutation . Possible cohesive genome ends (cos) were examined by heating one half of ClaI, Van91I and OliI restriction digests for 10 min at 75°C prior to electrophoresis. This heating step did not change the fragment pattern, compared to the unheated control, thus excluding the presence of cohesive genomic ends (data not shown) . This finding is consistent with the placement of these phages in N4-like phage group (see below) in that N4- like phages are not known to have cos sites .
ϕJA1 and ϕVchO139-I genome properties and polymorphisms
Despite having been isolated seven years apart, ϕJA1 and ϕVchO139-I have very similar genetic organization and genome sequences. The non-redundant genomes of ϕJA1 and ϕVchO139-I were found to be 69,278 bps and 68,964 bps in length, respectively with an identical G + C mol% of 34.6%, which differs considerably from the G + C mol% of the genome of the host V. cholerae, which averages ~47.6% per strain. Direct pairwise comparison of their genomes at the nucleotide level revealed that they are 99% identical across 99% of the length, using NCBI Megablast. The differences were due to a total of 298 single nucleotide polymorphisms (SNPs) and two small insertions/deletions (JA1_0002 and VCO139_0077 have no ortholog in the other genome, Figure 4). The absence of these CDSs in the other genome was confirmed by BLASTN analysis. Of these SNPs, 96 were intergenic, 139 SNPs were synonymous, and 63 resulted in non-synonymous amino acid coding changes in 19 proteins (Figure 4, Additional file 1: Table S1). Additionally, there were 79 predicted protein-coding sequences identified in the genomes of ϕJA1 and ϕVchO139-I, of which 59 are identical between the two phages. Of these, 53 were shown to be 100% identical at the nucleotide level (Additional file 1: Table S1). Both ϕJA1 and ϕVchO139-I encode a single tRNA-Arg gene between JA1_0023 and JA1_0025 in ϕJA1 and VCO139_0022 and VCO139_0024 in ϕVchO139-I and an intein within the large terminase gene.
Comparison of ϕJA1 and ϕVchO139-I with N4
Top BLASTP matches in the NCBI nr database revealed relatedness to E. coli phage N4-like phages. Comparison with E. coli phage N4 revealed 22 N4 proteins with BLASTP matches to ϕJA1 and considerable synteny (Figure 4). These matches include functions involved in head morphogenesis (i.e., large terminase, portal, and major coat proteins), transcription (i.e., both N4 RNA polymerase proteins), and replication (i.e., DNA helicase, ssDNA-binding protein (DBP) and DNA polymerase proteins). The presence of the very large (around 335 KDa) virion-encapsulated RNA polymerase (vRNAP) is a hallmark of the N4-like phages as well as the heterodimeric RNA polymerase used in middle mode transcription . Like bacteriophage T4, N4 has homologs of the ‘rapid-lysis’ genes, rIIA and rIIB [32, 33]; however, both ϕJA1 and ϕVchO139-I lack homologs of these genes. Likewise, homologs of the N4 major tail protein (gp65), neck appendage (gp66), and virion decorating (gp17) proteins appear to be missing from ϕJA1 and ϕVchO139-I as well as an N-acetylmuramidase (gp61). ϕJA1 and ϕVchO139-I also lack tRNAs-Asn, Ser, Thr, and Pro, found between gp47 and gp48 of N4, but instead both encode a single tRNA-Arg gene as noted above. The location of the tRNA genes is different between these Vibrio phages and N4.
ϕJA1 and ϕVchO139-I genomes are enriched with selfish DNA compared to N4. Specifically, ϕJA1 and ϕVchO139-I each have an intein within the large terminase protein (JA1_0067) as well as four copies of an HNH family of homing endonuclease (JA1_0032, 0034, 0041, and 0047 of ϕJA1 and VCO139_0032, 0034, 0041 and 0047) (Figure 4 and Additional file 1: Table S1), in contrast to N4, which has none of these selfish DNA elements. Homing endonucleases are site-specific DNA endonucleases that initiate mobility by introducing double-strand breaks at defined positions in genomes lacking the endonuclease gene, stimulating repair and recombination pathways that mobilize the endonuclease coding region . These homing endonucleases typically reside within self-splicing group I introns . Indeed, three of the four HNH endonuclease genes split genes into two parts (N4 homologs gp24, DNAP, and gp43, Figure 4), suggesting that these HNH homing endonucleases may be carried on an intron.
A virion-associated/secreted lyase enzyme encoded by ϕJA1 has been described [21, 22]. We identified a candidate gene, JA1_0065 in ϕJA1 (87.3 kDa) and VCO139_0066 in ϕVchO139-I (87.4 kDa), annotated as a putative minor structural protein containing a domain with very weak BLASTP homology to a phage-related tail fiber protein and a Hidden Markov Model match to a peptidase domain (pfam13884) prototypically found in the endosialidase of bacteriophage K1F . The K1F polysialic acid capsule-degrading endosialidase is a key component of the tail spike of phage K1F, and has been observed in other phages . There are six nucleotide differences between the ϕJA1 and ϕVchO139-I minor structural proteins of which five are non-synonymous changes leading to amino acid differences in the proteins. We speculate that this minor structural protein is the polysaccharide lyase enzyme although verification of this prediction awaits further functional genomics experiments.
N4-like phage family comparisons
To better understand the diversity and relatedness of the N4 family of phages, genomes with top BLAST matches to N4 and falling under the N4-like phage taxonomic classification in the NCBI database were compared. Using N4 as the reference genome, a total of 18 N4-related phages were compared to N4 with an average percent identity (API) at the amino acid level, as high as 65% for Enterobacter phage KBNP21 to as low as 43% for Vibrio phage ϕJA1 (Additional file 1: Table S2). Of the phages compared, the two Vibrio parahaemolyticus phages VBP32 and VBP47 were the closest relatives of ϕJA1 and ϕVchO139-I with an API to N4 of 44% (Additional file 1: Table S2). All these genomes showed considerable synteny, but variable gene conservation (Additional file 1: Figure S4). Every N4-like phage encoded two RNA polymerases, including the vRNAP, making them bona fide members of the N4 family. The presence of rIIA and rIIB varied widely across the family, as did the gene content between the portal (JA1_0062) and large terminase (JA1_0067) genes. E. coli phage vB_EcoP_G7C had the greatest number of HNH homing endonuclease genes (5), but none of them appear to have disrupted any gene.
Initial assemblies using 454 sequence reads revealed the presence of apparent frame-shifts caused by variation in homopolymeric tracts that were subsequently corrected by high-quality Sanger data. We wanted to investigate whether this finding (frame-shifts due to homopolymeric tracts) was unique to these V. cholerae phages or prevalent in other phages as well. Using the EMBOSS program fuzznuc, the number of homopolymeric tracts of greater than 5 nucleotides in length were tabulated for the N4 family of phages and for the E. coli tailed phages λ, Mu, P1, T1, T4, T5 and T7 (Additional file 1: Table S3). This data showed that although homopolymeric tracts of up to 8 nucleotides in length are common in these phages, N4-like phages had the longest homopolymeric tracts among the phages searched (up to 12 nucleotides for ϕVchO139-I). The significance of this finding is not clear at this time, although it may be related to the replication mechanism of the N4-like phages or to the regulation of gene expression via a polymerase slippage mechanism.
ϕJA1 and ϕVchO139-I employ an N4-like DNA packaging mechanism
Although phages φJA1 and φVchO139-I are 99% identical there are considerable differences between them in terms of single nucleotide polymorphisms and some insertions/deletions. Phages can be considered different when there are discernible phenotypic differences (such as plaque morphologies, growth characteristics, host range etc.) that can be linked to genetic differences. In this case, there is a phenotypic difference in the plaque morphology (different sized halos) of ϕJA1 and ϕVchO139-I and hence they would be considered variants although a genetic link has not been established yet. The genetic characterization and genomic data of ϕJA1 and ϕVchO139-I presented in this study lay the foundation for the development of rapid phage-based diagnostics and therapeutics against this cholera-causing V. cholerae strain by providing sequence data and gene targets conferring binding specificity and lysis. The genome annotation indicated the absence of any toxin, virulence or lysogeny associated genes that may preclude the phages’ therapeutic utility. The gene encoding the secreted lyase enzyme  responsible for the different sized halos around the plaques could not definitively be identified from nucleotide differences between the two phages or by strong similarity to other known lyases. The lack of extensive similarity further underscores the remarkable genetic diversity seen in the lyase genes in nature . Further work on identifying the lyase gene using a functional genomic approach is underway. From a phylogenetic perspective, comparison of the genomes of all the currently available N4-like phages has further delineated sub clades within this group of the Podoviridae family.
Bacterial strains and culture conditions
V. cholerae O139 Bengal strain AI1837 or 63–93 (MO45) were used for all phage propagation and large scale preparation of ϕJA1 and ϕVchO139-I lysates. The Shimada serotype collection strains representing serogroups O1 through O206 were used for host range analyses . All strains were grown in LB or BHI; phage assays were done in LB or BHI or 2xYT agar plates with the respective soft agar. All the strains were grown from single colonies in the indicated medium (LB or BHI) in flasks or tubes at 37°C with shaking at ~200 rpm.
Phage isolation and propagation
Untreated raw sewage for phage isolation was collected from the International Centre for Diarrheal Disease Research, Bangladesh (ICDDR,B) hospital sewer located in Dhaka, and filtered through a Whatman filter to remove large particles. Ninety ml of the filtrate was mixed with 10 ml of 10 × LB and 100 μl of an overnight culture of V. cholerae O139 strain was added and grown overnight in a 37°C shaker. The cultures were spun and filtered through a 0.45 μm filter. One hundred μls of ten-fold serial dilutions of the filtrate were mixed with 100 μl of a logarithmic phase culture of V. cholerae O139 strain, adsorbed for 20 minutes at 37°C and plated with 4 ml of top agar . The plates were incubated overnight at 37°C.
Single plaques purified from overnight plates were purified twice and then propagated by the confluent lysis method [40, 41]. The resulting lysate was further purified by CsCl2 gradient centrifugation according to an established protocol . Single step growth experiments were performed as previously described [43, 44]. Briefly, host strains were grown to mid-log phase, infected with a multiplicity of infection (MOI) of 10 for 10 min at 37°C in the presence of 1 mM CaCl2. Infected cells were centrifuged and washed three times to remove unbound phage. Infected cells were serially diluted and plated using the soft-agar overlay method. Data presented are the results from two independent experiments.
In accordance with the recent recommendation for bacterial virus nomenclature, we have named the Vibrio phages as vB_VchP_JA1 and vB_VchP_VchO139-I . In this scheme, the first part of the designation, vB means ‘virus of bacteria’, the second part, VchP indicates the bacterial host Vibrio cholerae, and the family of the phage ‘P’ for Podoviridae, and the last part JA1 or VchO139-I indicates the laboratory names. These designations have been used in the title, abstract and GenBank submissions. However, for simplicity, only the laboratory designations ϕJA1 and ϕVchO139-I are used throughout this manuscript.
Purified phage particles were negatively stained with 2% uranyl acetate . Samples were observed in a Tecnai G2 Spirit electron microscope at 120 kV equipped with an EAGLE CCD camera (FEI Company, Hillsboro, USA). DNA molecules were partially denatured by heating in formamide and prepared for electron microscopy using a modified cytochrome c spreading technique . Electron micrographs of partially denatured molecules along with a size standard (pBluescript plasmid DNA) were measured and processed as described previously .
Host range analysis was done by spot tests . The 206 V. cholerae serogroup type strains were grown overnight at 37°C and 10 μl of the overnight cultures were spotted on LB or BHI plates, the spots were allowed to dry, followed by spotting 5 μl containing ~108 PFU total amount of ϕJA1 or ϕVchO139-I phage lysates. Spot tests were also done on bacterial lawns. In some experiments, either overnight cultures or dilutions thereof, were mixed with phages at different MOI and spotted. The plates were incubated at 37°C overnight and the results were scored based on clearing or absence of clearing of bacterial spots. The efficiency of plating (e.o.p) was calculated as the ratio of the titer on a test strain compared to the titer on the strain in which the lysate was prepared. Phages JA1 and VchO139-I were prepared either on AI1837 or 63–93 (MO45). The e.o.p on O139 serogroup strains were 1 whereas in all other serogroup strains the e.o.p was 0.
Phage DNA extraction and analysis
Bacteriophage DNA was extracted as described previously [26, 28, 42]. Briefly, phage lysate was treated with DNAse and RNAse and the phage particles were pelleted by high-speed centrifugation. The pellet was resuspended in PBS buffer and phage capsids were lysed by proteinase K and heat treatment. Phage DNA was extracted by organic solvents, as previously described .
Whole genome sequencing of ϕJA1 and ϕVchO139-I
Shotgun sequencing by the Sanger method was done as follows: Five to ten μg of phage DNA was sheared to ~2 kb fragments using a nebulizer (Invitrogen corp). The sheared DNA was end polished and cloned into a pCR Blunt 4 Topo Vector. Plasmid DNA was purified from the clones using the Qiagen Plasmid Miniprep kit (Qiagen, Hilden, Germany) and sequenced using an ABI 3700 sequencer with M13 universal and reverse primers. The resulting sequences were assembled using Sequencher software. Gaps in contigs and low coverage regions were filled by primer walking using intact phage DNA as template.
Whole genome pyrosequencing of phages ϕJA1 and ϕVchO139-I was also done in order to boost the coverage of both phage assemblies and to help fill in unclonable regions that may have been toxic to E. coli. The Roche/454 sequencing was done according to the manufacturer’s recommended protocol (FLX Titanium) using a Roche/454 sequencer FLX instrument. Subsets of sequences (15,000) were assembled using the Newbler GS De Novo Assembler version 2.6 . The resulting single assembly of each respective genome was used to map the remaining 454 reads and available Sanger reads using the high throughput ‘Map Reads to Reference’ module in CLC Genomics Workbench version 5.5 (http://www.clcbio.com) using default settings. Sanger reads were used to manually edit the sequence in homopolymeric regions.
Bioinformatic analyses of sequence data
The final assembled sequence was annotated using the JCVI annotation pipeline as described previously . Protein molecular weights were determined from predicted coding sequences (CDSs) using the EMBOSS program pepstats. SNPs between the two phage genomes were identified by mapping the ϕVchO139-I 454-reads onto the non-redundant ϕJA1 genome by first using the high throughput ‘Map Reads to Reference’ module followed by the ‘SNP Detection’ module both in CLC Genomics Workbench version 5.5 (http://www.clcbio.com), using default settings. Homopolymeric tracts were identified by analyzing the underlying sequences for each nucleotide within a tract of six or more A, T, G, or C nucleotides as described previously . SNPs within these homopolymeric tracks were not considered for further analysis. Linear illustrations of multiple phage comparisons were constructed using an in-house (JCVI) PERL script LinearDisplay.pl, showing only pairwise BLASTP results with E-value ≤ 1 × 10-5 and percent identity ≥ 35%. Phage phylogenetic trees were constructed using the mean of the BLASTP score ratio as described previously . Comparison of the large terminase protein amino acid sequences was performed as described previously  using a dataset modified from the original published set . ClustalW alignments were used to generate a phylogenetic tree with the Neighbor Joining method and 1000 bootstrap replicates.
SDS-PAGE and peptide mass fingerprinting
Peptide mass fingerprinting was performed as described previously [54, 55]. Briefly, purified phage particles were collected by centrifugation and mixed with an equal volume of sample buffer prior to heating at 95°C for 5 min. Proteins were separated on precast, horizontal SDS-PAGE gels (Excel Gel gradient 8-18% PAGE gels, GE Healthcare, Solingen, Germany), followed by staining with Coomassie blue R-350 (Phastblue R; GE Healthcare). Major bands were excised from the gel; the gel pieces were washed twice in 100 mM NH4HCO3/50% acetonitrile and washed once with 50 ul acetonitrile. Supernatants were discarded and proteins in gel fragments were digested with trypsin at 37°C overnight. Supernatants were removed and the gel pieces were extracted twice in 0.1% TFA/50% acetonitrile. Supernatants were dried and dissolved in 0.1% formic acid and analyzed by LC-ESI-MS/MS (Functional Genomics Center, Zurich, Switzerland) and compared against a database of all predicted phage proteins using Scaffold software version 3 (Proteome Software Inc., Portland, OR, USA). Protein domains were predicted with InterProScan (http://www.ebi.ac.uk/InterProScan).
Determination of the structure of genomic ends
The physical genome structure (linear vs circular) of ϕJA1 and ϕVchO139-I was assessed by restriction digestion profiles under conditions recommended by the manufacturers (New England Biolabs, Ipswich, MA and Fermentas-Thermo Scientific, Waltham, MA). Bal31-nuclease digestion was performed as previously described . The presence of putative cohesive genomic ends was evaluated by digesting genomic DNA with ClaI, Van91I and OliI, heating half of the digest to 75°C for 10 min and comparing it to the non-heated restriction digests after electrophoresis in an agarose gel. Pulsed-field gel electrophoresis was done using a CHEF-DR-III apparatus (BioRad, Hercules, CA) under the following conditions: Gels (1%) were cast using Megabase agarose (BioRad) and run at 14°C buffer temperature with 1-25 s switch time, 6 V/cm and 120° angle for 17.5 h. PFGE MidRange Marker I and II (New England Biolabs, Ipswich, MA) were used for estimating fragment sizes. Primers for genome-end runoff sequencing were designed based on the consensus genome sequence and Sanger sequencing reactions were carried out using either intact ϕJA1 DNA or isolated restriction fragments as templates. The resulting sequence reads were manually verified by Sequiserve GmbH, Vaterstetten, Germany. Sequencing reactions were performed at the highest possible annealing temperatures in order to prevent potential re-circularization of the template DNA molecule. Phage ϕJA1 genome-end fragments resulting from restriction digestion with SalI or NruI restriction enzyme were cut out of agarose gels and extracted using QiaQuick Gel Extraction Kit (Qiagen, Hilden, Germany). Isolated fragments were checked for purity by electrophoresis and used as templates for Sanger sequencing.
Nucleotide sequence accession numbers
Complete nucleotide sequences of ϕJA1 and ϕVchO139-I were deposited to GenBank under accession numbers KC438282 and KC438283 respectively. The following seventeen N4-like phage genomes were obtained from NCBI for use in whole genome comparisons: 1) E. coli phage KBNP21, JX415535 ; 2) E. coli phage N4, EF056009 (unpublished); 3) E. coli phage vB_EcoP_G7, HQ259105 ; 4) Enterobacter phage EcP1, HQ641380 (unpublished); 5) E. coli phage IME11, JX880034 ; 6) Erwinia phage vB_EamP-S6, HQ728266  7) Pseudomonas phage LIT1, FN422399; 8) Pseudomonas phage LUZ7, FN422398 ; 9) Roseovarius sp. 217 phage 1, FR682616 (unpublished); 10) Silicibacter phage DSS3phi2, FJ591093 and 11) Sulfitobacter phage EE36phi1, FJ591094 ; 12) Roseovarius Plymouth Podovirus 1, FR719956; 13) Pseudomonas phage PA26, JX194238 ; 14) Vibrio phage VBP32, HQ634196 (unpublished); 15) Vibrio phage VBP47, HQ634194 (unpublished); 16) Sulfitobacter phage pCB2047-B, HQ317387 (unpublished) and 17) E. coli phage EC1, KC206276 (unpublished).
MR was supported by Morgan State University Office of Faculty Professional Development under Title III Grant from the US Department of Education and MSU NIH-EARDA Grant from NICHD. The work done at NMRC was supported by the Transformational Medical Technologies Program under contract TMTI-IB06RSQ002 through the Defense Threat Reduction Agency to SS. The authors would like to thank Dr. Rudi Lurz (Max-Planck-Institute for Molecular Genetics; Berlin, Germany) for help with electron microscopy work and Ms. Kristin Kittle for editorial assistance The views expressed in this article are those of the authors and do not necessarily reflect the official policy or position of the Department of the Navy or the Critical Reagents Program of the Chemical Biological Medical Systems, Joint Program Executive office, Department of Defense, nor the U.S. Government. The BDRD, NMRC authors are employees of the U.S. Government. This work was prepared as part of their official duties. Title 17 U.S.C. §105 provides that “Copyright protection under this title is not available for any work of the United States Government.” Title 17 U.S.C. §101 defines a U.S. Government work as a work prepared by a military service member or employee of the U.S. Government as part of that person’s official duties.
- Centers for Disease Control and Prevention: Update: outbreak of cholera ---Haiti, 2010. MMWR Morb Mortal Wkly Rep 2010, 59:1586–1590.
- Farmer P, Almazor CP, Bahnsen ET, Barry D, Bazile J, Bloom BR, Bose N, Brewer T, Calderwood SB, Clemens JD: Meeting cholera’s challenge to Haiti and the world: a joint statement on cholera prevention and care. PLoS Negl Trop Dis 2011, 5:e1145.PubMedView Article
- Centers for Disease Control and Prevention: Update on cholera --- Haiti, Dominican Republic, and Florida. MMWR Morb Mortal Wkly Rep 2010, 59:1637–1641.
- WHO: Cholera fact sheet. 2011. http://www.who.int/mediacentre/factsheets/fs107/en/
- WHO: Cholera. Weekly Epidemiological Record 2011, 87:289–304.
- Shimada T, Arakawa E, Itoh K, Okitsu T, Matsushima A, Asai Y, Yamai S, Nakazato T, Nair GB, Albert MJ, Takeda Y: Extended serotyping scheme for Vibrio cholerae . Curr Microbiol 1994, 28:175–178.View Article
- Mukhopadhyay AK, Ramamurthy T: Asiatic cholera: mole hills and mountains. In Epidemiological and molecular aspects of cholera. Edited by: Ramamurthy T, Bahattacharya SK. New York: Springer; 2011:5–29.View Article
- Ramamurthy T, Garg S, Sharma R, Bhattacharya SK, Nair GB, Shimada T, Takeda T, Karasawa T, Kurazano H, Pal A: Emergence of novel strain of Vibrio cholerae with epidemic potential in southern and eastern India. Lancet 1993, 341:703–704.PubMedView Article
- Stroeher UH, Manning PA: Vibrio cholerae serotype O139: swapping genes for surface polysaccharide biosynthesis. Trends Microbiol 1997, 5:178–180.PubMedView Article
- Sozhamannan S, Yildiz FH: Diversity and genetic basis of polysaccharide biosynthesis in Vibrio cholerae . In Epidemiological and molecular aspects of cholera. Edited by: Ramamurthy T, Bahattacharya SK. New York: Springer; 2011:120–160.
- Faruque SM, Albert MJ, Mekalanos JJ: Epidemiology, genetics, and ecology of toxigenic Vibrio cholerae . Microbiol Mol Biol Rev 1998, 62:1301–1314.PubMed
- Mekalanos JJ: The evolution of Vibrio cholerae as a pathogen. In Epidemiology and molecular aspects of Cholera. Edited by: Ramamurthy T, Ramamurthy T, Bahattacharya SK. New York: Springer; 2011:97–115.View Article
- Waldor MK, Mekalanos JJ: Lysogenic conversion by a filamentous phage encoding cholera toxin. Science 1996, 272:1910–1914.PubMedView Article
- Engel HW, Berwald LG, Grange JM, Kubin M: Phage typing of mycobacterium kansasii. Tubercle 1980, 61:11–19.PubMedView Article
- Loessner MJ: Improved procedure for bacteriophage typing of Listeria strains and evaluation of new phages. Appl Environ Microbiol 1991, 57:882–884.PubMed
- Ward LR, de Sa JD, Rowe B: A phage-typing scheme for Salmonella enteritidis . Epidemiol Infect 1987, 99:291–294.PubMedView Article
- Chattopadhyay DJ, Sarkar BL, Ansari MQ, Chakrabarti BK, Roy MK, Ghosh AN, Pal SC: New phage typing scheme for Vibrio cholerae O1 biotype El Tor strains. J Clin Microbiol 1993, 31:1579–1585.PubMed
- Mukerjee S, Roy UK, Rudra BC: Studies on typing of cholera vibrios by bacteriophage. V. Geographical distribution of phage-types of Vibrio cholerae . Ann Biochem Exp Med 1963, 23:523–530.PubMed
- Albert MJ, Bhuiyan NA, Rahman A, Ghosh AN, Hultenby K, Weintraub A, Nahar S, Kibriya AK, Ansaruzzaman M, Shimada T: Phage specific for Vibrio cholerae O139 Bengal. J Clin Microbiol 1996, 34:1843–1845.PubMed
- Chakrabarti AK, Ghosh AN, Nair GB, Niyogi SK, Bhattacharya SK, Sarkar BL: Development and evaluation of a phage typing scheme for Vibrio cholerae O139. J Clin Microbiol 2000, 38:44–49.PubMed
- Linnerborg M, Weintraub A, Albert MJ, Widmalm G: Depolymerization of the capsular polysaccharide from Vibrio cholerae O139 by a lyase associated with the bacteriophage JA1. Carbohydr Res 2001, 333:263–269.PubMedView Article
- Adeyeye J, Azurmendi HF, Stroop CJ, Sozhamannan S, Williams AL, Adetumbi AM, Johnson JA, Bush CA: Conformation of the hexasaccharide repeating subunit from the Vibrio cholerae O139 capsular polysaccharide. Biochemistry 2003, 42:3979–3988.PubMedView Article
- Ackermann HW: Bacteriophage observations and evolution. Res Microbiol 2003, 154:245–251.PubMedView Article
- Ackermann HW: 5500 Phages examined in the electron microscope. Arch Virol 2007, 152:227–243.PubMedView Article
- Falco SC, Zehring W, Rothman-Denes LB: DNA-dependent RNA polymerase from bacteriophage N4 virions. Purification and characterization. J Biol Chem 1980, 255:4339–4347.PubMed
- Klumpp J, Dorscht J, Lurz R, Bielmann R, Wieland M, Zimmer M, Calendar R, Loessner MJ: The terminally redundant, nonpermuted genome of Listeria bacteriophage A511: a model for the SPO1-like myoviruses of Gram-positive bacteria. J Bacteriol 2008, 190:5753–5765.PubMedView Article
- Born Y, Fieseler L, Marazzi J, Lurz R, Duffy B, Loessner MJ: Novel virulent and broad-host-range Erwinia amylovora bacteriophages reveal a high degree of mosaicism and a relationship to Enterobacteriaceae phages. Appl Environ Microbiol 2011, 77:5945–5954.PubMedView Article
- Klumpp J, Fouts DE, Sozhamannan S: Next-generation sequencing technologies and the changing landscape of phage genomics. Bacteriophage 2012, 2:190–199.PubMedView Article
- Kilcher S, Loessner MJ, Klumpp J: Brochothrix thermosphacta bacteriophages feature heterogeneous and highly mosaic genomes and utilize unique prophage insertion sites. J Bacteriol 2010, 192:5441–5453.PubMedView Article
- Ohmori H, Haynes LL, Rothman-Denes LB: Structure of the ends of the coliphage N4 genome. J Mol Biol 1988, 202:1–10.PubMedView Article
- Willis SH, Kazmierczak KM, Carter RH, Rothman-Denes LB: N4 RNA polymerase II, a heterodimeric RNA polymerase with homology to the single-subunit family of RNA polymerases. J Bacteriol 2002, 184:4952–4961.PubMedView Article
- Paddison P, Abedon ST, Dressman HK, Gailbreath K, Tracy J, Mosser E, Neitzel J, Guttman B, Kutter E: The roles of the bacteriophage T4 r genes in lysis inhibition and fine-structure genetics: a new perspective. Genetics 1998, 148:1539–1550.PubMed
- Abedon ST: Bacteriophage T4 resistance to lysis-inhibition collapse. Genetical research 1999, 74:1–11.PubMedView Article
- Edgell DR, Gibb EA, Belfort M: Mobile DNA elements in T4 and related phages. Virol J 2010, 7:290.PubMedView Article
- Stoddard BL: Homing endonuclease structure and function. Quart rev biophys 2005, 38:49–95.View Article
- Stummeyer K, Dickmanns A, Muhlenhoff M, Gerardy-Schahn R, Ficner R: Crystal structure of the polysialic acid-degrading endosialidase of bacteriophage K1F. Nat struct mol biol 2005, 12:90–96.PubMedView Article
- Stummeyer K, Schwarzer D, Claus H, Vogel U, Gerardy-Schahn R, Muhlenhoff M: Evolution of bacteriophages infecting encapsulated bacteria: lessons from Escherichia coli K1-specific phages. Mol Microbiol 2006, 60:1123–1135.PubMedView Article
- Casjens SR, Gilcrease EB, Winn-Stapley DA, Schicklmaier P, Schmieger H, Pedulla ML, Ford ME, Houtz JM, Hatfull GF, Hendrix RW: The generalized transducing Salmonella bacteriophage ES18: complete genome sequence and DNA packaging strategy. J Bacteriol 2005, 187:1091–1104.PubMedView Article
- Lombard V, Bernard T, Rancurel C, Brumer H, Coutinho PM, Henrissat B: A hierarchical classification of polysaccharide lyases for glycogenomics. Biochem J 2010, 432:437–444.PubMedView Article
- Kropinski AM, Mazzocco A, Waddell TE, Lingohr E, Johnson RP: Enumeration of bacteriophages by double agar overlay plaque assay. Methods Mol Biol 2009, 501:69–76.PubMedView Article
- Adams MH: Methods of study of bacterial viruses. In Bacteriophages. New York: Interscience publishers, Inc; 1959:443–457.
- Sambrook J, Russell DW: Molecular cloning - a laboratory manual. 3rd edition. New York: Cold Spring Harbor Laboratory Press; 2001.
- Hyman P, Abedon ST: Practical methods for determining phage growth parameters. Methods Mol Biol 2009, 501:175–202.PubMedView Article
- Mathews CK, Kutter EM, Mosig G, Berget PB: Bacteriophage T4. Washington, DC: American Society for Microbiology; 1983.
- Steven AC, Trus BL, Maizel JV, Unser M, Parry DA, Wall JS, Hainfeld JF, Studier FW: Molecular substructure of a viral receptor-recognition protein. The gp17 tail-fiber of bacteriophage T7. J Mol Biol 1988, 200:351–365.PubMedView Article
- Inman RB, Schnos M: Partial denaturation of thymine- and 5-bromouracil-containing lambda DNA in alkali. J Mol Biol 1970, 49:93–98.PubMedView Article
- Littlewood RK, Inman RB: Computer-assisted DNA length measurements from electron micrographs with special reference to partial denaturation mapping. Nucleic Acids Res 1982, 10:1691–1706.PubMedView Article
- Mazzocco A, Waddell TE, Lingohr E, Johnson RP: Enumeration of bacteriophages using the small drop plaque assay system. Methods Mol Biol 2009, 501:81–85.PubMedView Article
- Margulies M, Egholm M, Altman WE, Attiya S, Bader JS, Bemben LA, Berka J, Braverman MS, Chen YJ, Chen Z: Genome sequencing in microfabricated high-density picolitre reactors. Nature 2005, 437:376–380.PubMed
- Stevens RH, Ektefaie MR, Fouts DE: The annotated complete DNA sequence of Enterococcus faecalis bacteriophage phiEf11 and its comparison with all available phage and predicted prophage genomes. FEMS Microbiol Lett 2011, 317:9–26.PubMedView Article
- Fouts DE, Mongodin EF, Mandrell RE, Miller WG, Rasko DA, Ravel J, Brinkac LM, DeBoy RT, Parker CT, Daugherty SC: Major structural differences and novel potential virulence mechanisms from the genomes of multiple Campylobacter species. PLoS Biol 2005, 3:e15.PubMedView Article
- Fouts DE: Phage_Finder: automated identification and classification of prophage regions in complete bacterial genome sequences. Nucleic Acids Res 2006, 34:5839–5851.PubMedView Article
- Dorscht J, Klumpp J, Bielmann R, Schmelcher M, Born Y, Zimmer M, Calendar R, Loessner MJ: Comparative genome analysis of Listeria bacteriophages reveals extensive mosaicism, programmed translational frameshifting, and a novel prophage insertion site. J Bacteriol 2009, 191:7206–7215.PubMedView Article
- Zimmer M, Sattelberger E, Inman RB, Calendar R, Loessner MJ: Genome and proteome of Listeria monocytogenes phage PSA: an unusual case for programmed + 1 translational frameshifting in structural protein synthesis. Mol Microbiol 2003, 50:303–317.PubMedView Article
- Zimmer M, Scherer S, Loessner MJ: Genomic analysis of Clostridium perfringens bacteriophage phi3626, which integrates into guaA and possibly affects sporulation. J Bacteriol 2002, 184:4359–4368.PubMedView Article
- Nho SW, Ha MA, Kim KS, Kim TH, Jang HB, Cha IS, Park SB, Kim YK, Jung TS: Complete genome sequence of the bacteriophages ECBP1 and ECBP2 isolated from Two different Escherichia coli strains. J Virol 2012, 86:12439–12440.PubMedView Article
- Kulikov E, Kropinski AM, Golomidova A, Lingohr E, Govorun V, Serebryakova M, Prokhorov N, Letarova M, Manykin A, Strotskaya A, Letarov A: Isolation and characterization of a novel indigenous intestinal N4-related coliphage vB_EcoP_G7C. Virology 2012, 426:93–99.PubMedView Article
- Fan H, An X, Huang Y, Zhang Z, Mi Z, Tong Y: Complete genome sequence of IME11, a New N4-like bacteriophage. J Virol 2012, 86:13861.PubMedView Article
- Ceyssens PJ, Brabban A, Rogge L, Lewis MS, Pickard D, Goulding D, Dougan G, Noben JP, Kropinski A, Kutter E, Lavigne R: Molecular and physiological analysis of three Pseudomonas aeruginosa phages belonging to the “N4-like viruses”. Virology 2010, 405:26–30.PubMedView Article
- Zhao Y, Wang K, Jiao N, Chen F: Genome sequences of two novel phages infecting marine Roseobacters . Environ Microbiol 2009, 11:2055–2064.PubMedView Article
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://www.creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.