JSM 2012 Home

JSM 2012 Online Program

The views expressed here are those of the individual authors and not necessarily those of the JSM sponsors, their officers, or their staff.

Online Program Home

Abstract Details

Activity Number: 224
Type: Topic Contributed
Date/Time: Monday, July 30, 2012 : 2:00 PM to 3:50 PM
Sponsor: Section on Statistical Computing
Abstract - #305294
Title: Quantifying Taxonomic and Functional Diversity of Metagenomes from Next-Generation Sequencing Data
Author(s): Katherine Pollard*+ and Thomas J. Sharpton and Rebecca Truty and Joshua Ladau and Samantha Riesenfeld and Guillaume Jospin and Steven Kembel and Morgan Langille and James O'Dwyer and Dongying Wu and Jessica Green and Jonathan Eisen
Companies: Gladstone Institutes, UCSF and Gladstone Institutes, UCSF and Gladstone Institutes, UCSF and Gladstone Institutes, UCSF and Gladstone Institutes, UCSF and University of California at Davis and University of Oregon and University of California at Davis and Santa Fe Institute and University of California at Davis and University of Oregon and University of California at Davis
Address: 1650 Owens Street, San Francisco, CA, 94158, United States
Keywords: phylogenetics ; Markov models ; metagenomics ; next-generation sequencing ; bioinformatics ; diversity
Abstract:

Analysis of shotgun sequenced environmental DNA, known as metagenomics, promises insight into the taxonomic and functional diversity of microbial communities. To overcome challenges associated with the fragmentary, non-overlapping nature of metagenomic sequence data, we developed novel statistical phylogenetic methods for identifying operational taxonomic units (OTUs) and operational protein families (OPFs), as well as read-based error detection methods and a simulation pipeline for testing the performance of these and other metagenomics analysis tools. A key feature of our approach is the use of full-length genes from sequenced microbial genomes to guide the placement of metagenomic sequences into phylogenetic trees and probabilistic model based alignments (e.g., via profile hidden Markov models or stochastic context free grammars). We generated a database of 340,000 OPFs that encapsulates significantly more protein functional diversity than existing databases (e.g., PFAM). Using niche modeling methods, we leveraged our database and analysis tools to predict and compare the distributions of OTUs and OPFs across the world's oceans, North American soils, and human gut communities.


The address information is for the authors that have a + after their name.
Authors who are presenting talks have a * after their name.

Back to the full JSM 2012 program




2012 JSM Online Program Home

For information, contact jsm@amstat.org or phone (888) 231-3473.

If you have questions about the Continuing Education program, please contact the Education Department.