Portal protein diversity and phage ecology.

TitlePortal protein diversity and phage ecology.
Publication TypeJournal Article
Year of Publication2008
AuthorsSullivan, MB, Coleman, ML, Quinlivan, V, Rosenkrantz, JE, DeFrancesco, AS, Tan, G, Fu, R, Lee, JA, Waterbury, JB, Bielawski, JP, Chisholm, SW
JournalEnviron Microbiol
Date Published2008 Oct
KeywordsBacteriophages, Cluster Analysis, DNA, Viral, Ecosystem, Genetic Variation, Molecular Sequence Data, Phylogeny, Prochlorococcus, Sequence Analysis, DNA, Sequence Homology, Synechococcus, Viral Proteins, Virus Assembly

Oceanic phages are critical components of the global ecosystem, where they play a role in microbial mortality and evolution. Our understanding of phage diversity is greatly limited by the lack of useful genetic diversity measures. Previous studies, focusing on myophages that infect the marine cyanobacterium Synechococcus, have used the coliphage T4 portal-protein-encoding homologue, gene 20 (g20), as a diversity marker. These studies revealed 10 sequence clusters, 9 oceanic and 1 freshwater, where only 3 contained cultured representatives. We sequenced g20 from 38 marine myophages isolated using a diversity of Synechococcus and Prochlorococcus hosts to see if any would fall into the clusters that lacked cultured representatives. On the contrary, all fell into the three clusters that already contained sequences from cultured phages. Further, there was no obvious relationship between host of isolation, or host range, and g20 sequence similarity. We next expanded our analyses to all available g20 sequences (769 sequences), which include PCR amplicons from wild uncultured phages, non-PCR amplified sequences identified in the Global Ocean Survey (GOS) metagenomic database, as well as sequences from cultured phages, to evaluate the relationship between g20 sequence clusters and habitat features from which the phage sequences were isolated. Even in this meta-data set, very few sequences fell into the sequence clusters without cultured representatives, suggesting that the latter are very rare, or sequencing artefacts. In contrast, sequences most similar to the culture-containing clusters, the freshwater cluster and two novel clusters, were more highly represented, with one particular culture-containing cluster representing the dominant g20 genotype in the unamplified GOS sequence data. Finally, while some g20 sequences were non-randomly distributed with respect to habitat, there were always numerous exceptions to general patterns, indicating that phage portal proteins are not good predictors of a phage's host or the habitat in which a particular phage may thrive.

Alternate JournalEnviron. Microbiol.
PubMed ID18673386
PubMed Central IDPMC2657995