Marine Science Faculty Publications
Document Type
Article
Publication Date
2015
Keywords
virus, bacteriophage, genomics, metagenomics, ecology
Digital Object Identifier (DOI)
https://doi.org/10.3389/fmicb.2015.00381
Rights Information
This work is licensed under a Creative Commons Attribution 4.0 License.
Was this content written or created while at USF?
Yes
Citation / Publisher Attribution
Frontiers in Microbiology, v. 6, art. 381
Scholar Commons Citation
Aziz, Ramy; Dwivedi, Bhakti; Akhter, Sajia; Breitbart, Mya; and Edwards, Robert, "Multidimensional Metrics for Estimating Phage Abundance, Distribution, Gene Density, and Sequence Coverage in Metagenomes" (2015). Marine Science Faculty Publications. 713.
https://digitalcommons.usf.edu/msc_facpub/713
Comments
Phages are the most abundant biological entities on Earth and play major ecological roles, yet the current sequenced phage genomes do not adequately represent their diversity, and little is known about the abundance and distribution of these sequenced genomes in nature. Although the study of phage ecology has benefited tremendously from the emergence of metagenomic sequencing, a systematic survey of phage genes and genomes in various ecosystems is still lacking, and fundamental questions about phage biology, lifestyle, and ecology remain unanswered. To address these questions and improve comparative analysis of phages in different metagenomes, we screened a core set of publicly available metagenomic samples for sequences related to completely sequenced phages using the web tool, Phage Eco-Locator. We then adopted and deployed an array of mathematical and statistical metrics for a multidimensional estimation of the abundance and distribution of phage genes and genomes in various ecosystems. Experiments using those metrics individually showed their usefulness in emphasizing the pervasive, yet uneven, distribution of known phage sequences in environmental metagenomes. Using these metrics in combination allowed us to resolve phage genomes into clusters that correlated with their genotypes and taxonomic classes as well as their ecological properties. We propose adding this set of metrics to current metaviromic analysis pipelines, where they can provide insight regarding phage mosaicism, habitat specificity, and evolution.