Scripps VIVO scripps research logo

  • Index
  • Log in
  • Home
  • People
  • Organizations
  • Research
  • Events
Search form

Central limit theorem as an approximation for intensity-based scoring function

Academic Article
uri icon
  • Overview
  • Identity
  • Additional Document Info
  • View All
scroll to property group menus

Overview

authors

  • Wohlschlegel, J.
  • Park, S. K.
  • Xu, T.
  • Yates III, John

publication date

  • January 2006

journal

  • Analytical Chemistry  Journal

abstract

  • In this paper, we present an intensity-based probability function to identify peptides from tandem mass spectra and amino acid sequence databases. The function is an approximation to the central limiting theorem, and it explicitly depends on the cumulative product ion intensities, number of product ions of a peptide, and expectation value of the cumulative intensity. We compare the results of database searches using the new scoring function and scoring functions from earlier algorithms, which implement hypergeometric probability, Poisson's model, and cross-correlation scores. For a standard protein mixture (tandem mass spectra generated from the mixture of five known proteins), we generate receiver operating curves with all scoring schemes. The receiver operating curves show that the shared peaks count-based probability methods (like Poisson and hypergeometric models) are the most specific for matching high-quality tandem mass spectra. The intensity-based (central limit model) and intensity-modeled (cross-correlation) methods are more sensitive when matching low-quality tandem mass spectra, where the number of shared peaks is insufficient to correctly identify a peptide. Cross-correlation methods show a small advantage over the intensity-based probability method.

subject areas

  • Algorithms
  • Amino Acids
  • Databases, Protein
  • Humans
  • Mass Spectrometry
  • Models, Statistical
  • Peptide Fragments
  • Peptide Mapping
  • Probability
  • Proteins
  • Proteomics
  • Software
  • Statistical Distributions
scroll to property group menus

Identity

International Standard Serial Number (ISSN)

  • 0003-2700

Digital Object Identifier (DOI)

  • 10.1021/ac051206r

PubMed ID

  • 16383314
scroll to property group menus

Additional Document Info

start page

  • 89

end page

  • 95

volume

  • 78

issue

  • 1

©2021 The Scripps Research Institute | Terms of Use | Powered by VIVO

  • About
  • Contact Us
  • Support