Abstract:
|
Many protocols for the analysis of shotgun proteomics mass spectra attempt to mitigate the multiple testing problem by controlling the hypothesis space in various ways: only considering peptides resulting from enzymatic cleavages, ignoring possible post-translational modifications, etc. However, these strategies cannot identify spectra generated by rarer types of peptides. We describe a statistical testing framework, cascaded search, to solve this problem. The method requires that the user specify a statistical confidence threshold as well as a series of peptide databases. For instance, such a cascade of databases could include fully tryptic, semi-tryptic and non-enzymatic peptides, or peptides with increasing numbers of modifications. Cascaded search then gradually expands the list of candidate peptides from more likely peptides toward rare peptides, sequestering at each stage any spectrum that is identified with a specified statistical confidence. We demonstrate that the proposed approach yields improved statistical power relative to commonly used protocols.
|
ASA Meetings Department
732 North Washington Street, Alexandria, VA 22314
(703) 684-1221 • meetings@amstat.org
Copyright © American Statistical Association.