Analyzing yeast protein-protein interaction data obtained from different sources

Nat Biotechnol. 2002 Oct;20(10):991-7. doi: 10.1038/nbt1002-991.

Abstract

High-throughput methods for detecting protein interactions, such as mass spectrometry and yeast two-hybrid assays, continue to produce vast amounts of data that may be exploited to infer protein function and regulation. As this article went to press, the pool of all published interaction information on Saccharomyces cerevisiae was 15,143 interactions among 4,825 proteins, and power-law scaling supports an estimate of 20,000 specific protein interactions. To investigate the biases, overlaps, and complementarities among these data, we have carried out an analysis of two high-throughput mass spectrometry (HMS)-based protein interaction data sets from budding yeast, comparing them to each other and to other interaction data sets. Our analysis reveals 198 interactions among 222 proteins common to both data sets, many of which reflect large multiprotein complexes. It also indicates that a "spoke" model that directly pairs bait proteins with associated proteins is roughly threefold more accurate than a "matrix" model that connects all proteins. In addition, we identify a large, previously unsuspected nucleolar complex of 148 proteins, including 39 proteins of unknown function. Our results indicate that existing large-scale protein interaction data sets are nonsaturating and that integrating many different experimental data sets yields a clearer biological view than any single method alone.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Chromatography, Liquid / methods
  • Database Management Systems
  • Databases, Protein*
  • Genome, Fungal
  • Macromolecular Substances
  • Mass Spectrometry / methods
  • Multiprotein Complexes
  • Protein Interaction Mapping / methods*
  • Proteome
  • Reproducibility of Results
  • Saccharomyces cerevisiae / metabolism*
  • Saccharomyces cerevisiae Proteins / chemistry*
  • Saccharomyces cerevisiae Proteins / metabolism*
  • Sensitivity and Specificity
  • Sequence Alignment / methods*
  • Sequence Analysis, Protein
  • Species Specificity

Substances

  • Macromolecular Substances
  • Multiprotein Complexes
  • Proteome
  • Saccharomyces cerevisiae Proteins