
3.3 Generating MS Data for Protein Identification 237
3.3.2
Peptide Mass Fingerprint Combined With Composition Information
Historically, it has been possible to combine PMF information with
compositional information. This was a common technique prior to
the introduction of delayed extraction and reflectron technology. The
composition information provided orthogonal information which
provided additional specificity for a database search by enabling
further constraints on that search (Fenyç et al. 1998). For instance,
this publication demonstrated that with the knowledge of the pre-
sence of a cysteine residue in a tryptic peptide of mass 2000 Da (mea-
sured to 0.5 Da mass accuracy) the number of matching proteins in S
cerevisiae was reduced by a factor of five. A number of approaches
have been employed to gain compositional information. In one exam-
ple reported by Pappin, a PMF is acquired and the peptide masses
noted. A subsequent methyl esterification reaction is performed on a
small aliquot of the sample esterifying the carboxyl side-chain of the
acidic residues, glutamic and aspartic acid, and the carboxylic group
of the C-terminal residue present in each peptide of the digest. A
PMF spectrum is re-acquired. The subsequent mass changes between
corresponding peptides are indicative of the number of acidic groups
each peptide contains. This information was combined with the origi-
nal PMF using a MOWSE composition search potentially increasing
search discrimination by orders of magnitude (Pappin et al. 1995).
The number of exchangeable hydrogens within a peptide offers
composition information. Again, this procedure requires the acquisi-
tion of a PMF, subsequent labeling of the mixture with a deuterium
solution (D
2
O) and finally re-acquisition of a second spectrum. The
mass increase of each peptide by the number of exchangeable hydro-
gens is indicative of amino acid composition (Sepetov et al. 1993;
James et al. 1994).
Moreover Goodlett et al. (2000) demonstrated composition informa-
tion and high mass accuracy can be very specific information for
unambiguously identifying proteins. By labeling cysteine residues
with a specific isotopic label, IDEnT (the tag contained chlorine,
which has a specific isotopic profile owing to the relative abundance
of
35
Cl and
37
Cl) the distinctive isotopic pattern of the labeled peptide
could be simply recognized. With this specific composition informa-
tion and mass accuracy measured to within 1 ppm using an FT-ICR
MS, the mass of a single peptide was sufficient to unambiguously
identify a protein from the whole yeast database (1 peptide from a
possible in ~345,000 peptides).
Fenyç D, Qin J, Chait BT. Elec-
trophoresis 19 (1998) 998–
1005.
Pappin D, Rahman D, Hansen
HF, Bartlet-Jones M, Jeffrey W,
Bleasby AJ. Methods in Biolo-
gical Sciences (1996) 135–150.
Sepetov NF, Issakova OL, Lebi
M, Swiderek K, Stahl DC, Lee
TD. Rapid Commun Mass
Spectrom. 7 (1993) 58–62.
James P, Quadroni M, Carafoli
E, Gonnet G. Protein Sci 3
(1994) 1347–1350.
Goodlett DR, Bruce JE,
Anderson GA, Rist B, Pasa-
Tolic L, Fiehn O, Smith RD,
Aebersold R. Anal Chem 72
(2000) 1112–1118.
Cysteine was chosen as the
labeled residue because it is one
of the rarest amino acid resi-
dues, allowing constrained
database searching.