Amino acid residue masses are fundamental to peptide identification and MS/MS spectrum interpretation in proteomics. In LC-MS/MS analysis, mass differences between fragment ions correspond directly to amino acid residues, enabling peptide sequencing through b/y ion interpretation, sequence tag generation, and de novo sequencing.
This reference guide summarizes 32 commonly used amino acid residues, including standard amino acids, uncommon residues, and modified amino acids frequently encountered in proteomics workflows.
Proteomics LC-MS/MS Interpretation Workflow
Peptide Ionization
MS/MS Fragmentation
b/y Ion Assignment
Residue Mass Interpretation (this article)
PTM Analysis
Peptide Identification / De Novo Sequencing
Why Amino Acid Residue Mass Matters in Proteomics
In proteomics LC-MS/MS, peptide sequences are reconstructed by analyzing mass differences between fragment ions.
For example:
Δm/z = 71.037 → Alanine (A)
Δm/z = 99.068 → Valine (V)
Δm/z = 147.068 → Phenylalanine (F)
Mass differences between adjacent b ions or y ions correspond directly to amino acid residue masses.
This principle is essential for:
- peptide identification
- database search
- de novo sequencing
- sequence tag generation
- PTM interpretation
- fragment ion annotation
Without accurate residue masses, reliable MS/MS interpretation is impossible.
Residue Mass vs Free Amino Acid Mass
In proteomics, residue mass is different from free amino acid mass.
During peptide bond formation, amino acids lose H₂O molecules.
Therefore:
Residue mass = amino acid mass − H2O
The masses used in peptide MS/MS interpretation are residue masses, not free amino acid masses.
Monoisotopic Mass vs Average Mass
Proteomics MS/MS interpretation uses monoisotopic masses rather than average masses.
Monoisotopic mass:
- uses the lightest stable isotope
- provides highest mass accuracy
- is required for high-resolution MS/MS interpretation
Modern LC-MS/MS instruments such as:
- Orbitrap
- QTOF
- FT-ICR
depend on monoisotopic residue masses for peptide identification.
Standard Amino Acid Residue Mass Table (20 Amino Acids)
| Code | Amino Acid | Formula | Monoisotopic Residue Mass (Da) |
|---|---|---|---|
| A | Alanine | C3H5NO | 71.03711 |
| R | Arginine | C6H12N4O | 156.10111 |
| N | Asparagine | C4H6N2O2 | 114.04293 |
| D | Aspartic acid | C4H5NO3 | 115.02694 |
| C | Cysteine | C3H5NOS | 103.00919 |
| Q | Glutamine | C5H8N2O2 | 128.05858 |
| E | Glutamic acid | C5H7NO3 | 129.04259 |
| G | Glycine | C2H3NO | 57.02146 |
| H | Histidine | C6H7N3O | 137.05891 |
| I | Isoleucine | C6H11NO | 113.08406 |
| L | Leucine | C6H11NO | 113.08406 |
| K | Lysine | C6H12N2O | 128.09496 |
| M | Methionine | C5H9NOS | 131.04049 |
| F | Phenylalanine | C9H9NO | 147.06841 |
| P | Proline | C5H7NO | 97.05276 |
| S | Serine | C3H5NO2 | 87.03203 |
| T | Threonine | C4H7NO2 | 101.04768 |
| W | Tryptophan | C11H10N2O | 186.07931 |
| Y | Tyrosine | C9H9NO2 | 163.06333 |
| V | Valine | C5H9NO | 99.06841 |
Extended and Modified Amino Acids in Proteomics
In practical proteomics workflows, non-standard residues and modified amino acids are frequently encountered.
These may originate from:
- post-translational modifications (PTMs)
- metabolic intermediates
- non-canonical translation
- sample preparation artifacts
- specialized proteins
| Code | Amino Acid / Residue | Description | Monoisotopic Residue Mass (Da) |
|---|---|---|---|
| U | Selenocysteine | Selenium-containing cysteine | 150.95363 |
| O | Pyrrolysine | Rare archaeal amino acid | 255.15829 |
| pE | Pyroglutamate | Cyclized glutamine/glutamate | 111.03203 |
| Hyp | Hydroxyproline | Hydroxylated proline | 113.04768 |
| Hyl | Hydroxylysine | Hydroxylated lysine | 144.08988 |
| Cit | Citrulline | Arginine modification | 175.11168 |
| Orn | Ornithine | Arginine metabolism intermediate | 114.07931 |
| Hse | Homoserine | Metabolic intermediate | 101.04768 |
| Hcy | Homocysteine | Methionine-related residue | 135.03540 |
| βAla | Beta-alanine | Non-standard amino acid | 71.03711 |
| Gla | γ-Carboxyglutamate | Vitamin K-dependent PTM | 173.03203 |
| Nle | Norleucine | Leucine analog | 113.08406 |
These residues are particularly important in advanced proteomics and PTM analysis.
Amino Acid Mass Differences in MS/MS Spectra
In MS/MS spectra, fragment ion mass differences directly correspond to amino acid residues.
Example:
b5 → b6 = +71.037 Da → Alanine
y7 → y8 = +99.068 Da → Valine
This principle forms the foundation of:
- peptide sequencing
- fragment ion interpretation
- sequence ladder reconstruction
- de novo sequencing algorithms
Modern peptide search engines use these mass relationships extensively.
Leucine vs Isoleucine Problem
Leucine (L) and isoleucine (I) have identical monoisotopic masses:
113.08406 Da
Because they are isomers, standard MS/MS fragmentation usually cannot distinguish them.
Therefore:
- most database searches treat I/L equivalently
- de novo sequencing often reports ambiguous residues
- additional fragmentation methods may be required for differentiation
This is one of the most fundamental limitations in peptide MS/MS interpretation.
Cysteine Modification in Proteomics
Cysteine residues are commonly modified during sample preparation.
Most common example:
Carbamidomethylation
+57.021464 Da
This modification is introduced by iodoacetamide (IAA) alkylation and is typically treated as a fixed modification in database searches.
Modified cysteine mass:
Cysteine + CAM = 160.03065 Da
Carbamidomethylation is one of the most important modifications in bottom-up proteomics.
Methionine Oxidation
Methionine residues are highly susceptible to oxidation.
Most common modification:
Oxidation
+15.994915 Da
Oxidized methionine is frequently observed in:
- biological oxidation
- sample handling
- LC-MS sample preparation
Because methionine oxidation occurs easily, it is typically included as a variable modification in peptide database searches.
Residue Masses and Sequence Tag Generation
Sequence tags are generated by matching fragment ion mass differences to amino acid residue masses.
Example:
71.037 → 99.068 → 147.068
A → V → F
These partial sequence patterns can:
- reduce database search space
- improve peptide identification confidence
- support de novo sequencing
Sequence tagging remains one of the core principles of peptide MS/MS interpretation.
Practical Quick Reference
The following residue masses are among the most frequently encountered in peptide MS/MS spectra.
| Amino Acid | Monoisotopic Residue Mass |
|---|---|
| Glycine (G) | 57.021 |
| Alanine (A) | 71.037 |
| Serine (S) | 87.032 |
| Proline (P) | 97.053 |
| Valine (V) | 99.068 |
| Threonine (T) | 101.048 |
| Leucine/Isoleucine (L/I) | 113.084 |
These mass differences appear repeatedly during peptide fragmentation analysis.
Practical Proteomics Databases
Why High Mass Accuracy Matters
Modern high-resolution instruments can distinguish extremely small mass differences.
For example:
113.08406 ≠ 113.04768
This allows:
- accurate residue assignment
- PTM identification
- false positive reduction
- high-confidence sequence reconstruction
Mass accuracy is especially critical in de novo sequencing workflows.
Residue Mass Interpretation in De Novo Sequencing
In de novo sequencing, peptide sequences are reconstructed directly from fragment ion mass differences without using a protein database.
This process relies heavily on:
- accurate residue masses
- fragment ion ladders
- isotope interpretation
- neutral loss analysis
Even small mass errors can lead to incorrect sequence reconstruction.
Therefore, accurate residue mass interpretation is one of the most important foundations of de novo sequencing.
Limitations and Interpretation Challenges
Residue mass interpretation alone cannot completely identify peptide structure.
Major challenges include:
- isobaric residues (Leu/Ile)
- overlapping isotope peaks
- internal fragments
- neutral loss peaks
- co-fragmentation
- PTM ambiguity
Therefore, amino acid mass interpretation should always be combined with:
- fragmentation patterns
- isotope evidence
- retention time
- database search results
- PTM analysis
FAQ
What is amino acid residue mass?
Residue mass is the mass of an amino acid after loss of H₂O during peptide bond formation.
Why are monoisotopic masses used in proteomics?
Because high-resolution MS/MS instruments require exact masses for accurate peptide identification.
Why are leucine and isoleucine difficult to distinguish?
They are structural isomers with identical monoisotopic masses.
Why is cysteine often modified?
Cysteine is highly reactive and commonly alkylated during sample preparation.
Why is methionine oxidation common?
Methionine is chemically susceptible to oxidation during biological processes and sample handling.
How are residue masses used in MS/MS?
Mass differences between fragment ions correspond directly to amino acid residues.
Why are residue masses important for de novo sequencing?
De novo sequencing reconstructs peptide sequences directly from fragment ion mass differences.
Key Takeaways
- Amino acid residue masses are fundamental to peptide MS/MS interpretation
- Monoisotopic masses are essential for high-resolution proteomics
- Fragment ion mass differences correspond directly to amino acid residues
- Extended and modified residues are important in advanced proteomics workflows
- Leucine and isoleucine remain difficult to distinguish by standard MS/MS
- Accurate residue masses are critical for de novo sequencing and PTM analysis
Related Articles
- How b and y Ions Reconstruct Peptide Sequences
- Neutral Loss in Proteomics MS/MS
- The Complete LC-MS/MS Peptide Identification Workflow
- What Is De Novo Sequencing in Proteomics?
- CID vs HCD vs ETD Fragmentation Explained
- What Is an Immonium Ion in Proteomics MS/MS?
- 43 Major PTM Reference Table for Proteomics LC-MS/MS
