Prediction models
Pre-trained MS²PIP models
MS²PIP includes multiple specialized prediction models, each suited to peptide spectra with particular properties. These properties include fragmentation method, instrument, labeling technique and modifications. Because all of these properties can influence fragmentation patterns, it is important to match the MS²PIP model to the properties of your experimental dataset.
All models are downloaded automatically upon first use. Model files can also be downloaded manually from genesis.ugent.be/uvpublicdata/ms2pip.
MS2 acquisition information and peptide properties of the models’ training datasets:
| Model | Fragmentation method | MS2 mass analyzer | Peptide properties |
|---|---|---|---|
| HCD2019 | HCD | Orbitrap | Tryptic digest |
| HCD2021 | HCD | Orbitrap | Tryptic / Chymotrypsin digest |
| CID | CID | Linear ion trap | Tryptic digest |
| iTRAQ | HCD | Orbitrap | Tryptic digest, iTRAQ-labeled |
| iTRAQphospho | HCD | Orbitrap | Tryptic digest, iTRAQ-labeled, enriched for phosphorylation |
| TMT | HCD | Orbitrap | Tryptic digest, TMT-labeled |
| TTOF5600 | CID | Quadrupole time-of-flight | Tryptic digest |
| HCDch2 | HCD | Orbitrap | Tryptic digest |
| CIDch2 | CID | Linear ion trap | Tryptic digest |
| Immuno-HCD | HCD | Orbitrap | Immunopeptides |
| CID-TMT | CID | Linear ion trap | Tryptic digest, TMT-labeled |
| timsTOF2023 | CID | Ion mobility quadrupole time-of-flight | Tryptic and elastase digests, class I immunopeptides |
| timsTOF2024 | CID | Ion mobility quadrupole time-of-flight | Tryptic and elastase digests, class I and class II immunopeptides |
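Matching a model to a dataset amounts to a lookup over the table above. The following is a minimal sketch of such a lookup, not part of MS²PIP itself: the `ModelInfo` class, the `MODELS` dictionary and the `candidate_models` helper are hypothetical names introduced for illustration, and the entries are simply copied from the table.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelInfo:
    """Acquisition settings and peptide properties of a model's training data."""
    fragmentation: str
    analyzer: str
    peptide_properties: str


# Hypothetical lookup table, transcribed from the model overview table.
MODELS = {
    "HCD2019": ModelInfo("HCD", "Orbitrap", "Tryptic digest"),
    "HCD2021": ModelInfo("HCD", "Orbitrap", "Tryptic / Chymotrypsin digest"),
    "CID": ModelInfo("CID", "Linear ion trap", "Tryptic digest"),
    "iTRAQ": ModelInfo("HCD", "Orbitrap", "Tryptic digest, iTRAQ-labeled"),
    "iTRAQphospho": ModelInfo(
        "HCD", "Orbitrap",
        "Tryptic digest, iTRAQ-labeled, enriched for phosphorylation",
    ),
    "TMT": ModelInfo("HCD", "Orbitrap", "Tryptic digest, TMT-labeled"),
    "TTOF5600": ModelInfo("CID", "Quadrupole time-of-flight", "Tryptic digest"),
    "HCDch2": ModelInfo("HCD", "Orbitrap", "Tryptic digest"),
    "CIDch2": ModelInfo("CID", "Linear ion trap", "Tryptic digest"),
    "Immuno-HCD": ModelInfo("HCD", "Orbitrap", "Immunopeptides"),
    "CID-TMT": ModelInfo("CID", "Linear ion trap", "Tryptic digest, TMT-labeled"),
    "timsTOF2023": ModelInfo(
        "CID", "Ion mobility quadrupole time-of-flight",
        "Tryptic and elastase digests, class I immunopeptides",
    ),
    "timsTOF2024": ModelInfo(
        "CID", "Ion mobility quadrupole time-of-flight",
        "Tryptic and elastase digests, class I and class II immunopeptides",
    ),
}


def candidate_models(fragmentation, analyzer, keyword=None):
    """Return names of models whose training data match the given settings.

    `keyword` optionally filters on the peptide properties (case-insensitive).
    """
    matches = []
    for name, info in MODELS.items():
        if info.fragmentation != fragmentation or info.analyzer != analyzer:
            continue
        if keyword is not None and keyword.lower() not in info.peptide_properties.lower():
            continue
        matches.append(name)
    return matches


print(candidate_models("CID", "Linear ion trap"))
print(candidate_models("HCD", "Orbitrap", keyword="TMT"))
```

A lookup like this only narrows the choice. When several models match, the evaluation results and training datasets of each model can help decide between them.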
Models, version numbers, and the train and test datasets used to create each model:
| Model | Current version | Train-test dataset (unique peptides) | Evaluation dataset (unique peptides) | Median Pearson correlation on evaluation dataset |
|---|---|---|---|---|
| HCD2019 | v20190107 | MassIVE-KB (1 623 712) | PXD008034 (35 269) | 0.903786 |
| CID | v20190107 | NIST CID Human (340 356) | NIST CID Yeast (92 609) | 0.904947 |
| iTRAQ | v20190107 | NIST iTRAQ (704 041) | PXD001189 (41 502) | 0.905870 |
| iTRAQphospho | v20190107 | NIST iTRAQ phospho (183 383) | PXD001189 (9 088) | 0.843898 |
| TMT | v20190107 | Peng Lab TMT Spectral Library (1 185 547) | PXD009495 (36 137) | 0.950460 |
| TTOF5600 | v20190107 | PXD000954 (215 713) | PXD001587 (15 111) | 0.746823 |
| HCDch2 | v20190107 | MassIVE-KB (1 623 712) | PXD008034 (35 269) | 0.903786 (+), 0.644162 (++) |
| CIDch2 | v20190107 | NIST CID Human (340 356) | NIST CID Yeast (92 609) | 0.904947 (+), 0.813342 (++) |
| HCD2021 | v20210416 | Combined dataset (520 579) | PXD008034 (35 269) | 0.932361 |
| Immuno-HCD | v20210316 | Combined dataset (460 191) | PXD005231 (HLA-I) (46 753), PXD020011 (HLA-II) (23 941) | 0.963736 |
| CID-TMT | v20220104 | PXD041002 (72 138) | PXD005890 (69 768) | 0.851085 |
| timsTOF2023 | v20230912 | Combined dataset (234 973) | PXD043026, PXD046535, PXD046543 | 0.892540 (tryptic), 0.871258 (elastase), 0.899834 (class I), 0.635548 (class II) |
| timsTOF2024 | v20240105 | Combined dataset (480 024) | PXD043026, PXD046535, PXD046543, PXD038782 | 0.883270 (tryptic), 0.814374 (elastase), 0.887192 (class I), 0.847951 (class II) |
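To make the last column concrete, the sketch below computes a median Pearson correlation over a set of spectra. It assumes that one correlation is calculated per spectrum between observed and predicted fragment intensities and that the median is then taken over all spectra; the exact evaluation procedure behind the numbers above is not described here. The function names `pearson` and `median_pearson` are hypothetical.

```python
from math import sqrt
from statistics import median


def pearson(x, y):
    """Pearson correlation of two equal-length sequences.

    Returns None when either sequence is constant (correlation undefined).
    """
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return None
    return cov / sqrt(var_x * var_y)


def median_pearson(observed_spectra, predicted_spectra):
    """Median of the per-spectrum correlations, skipping undefined ones."""
    correlations = []
    for observed, predicted in zip(observed_spectra, predicted_spectra):
        r = pearson(observed, predicted)
        if r is not None:
            correlations.append(r)
    return median(correlations)


# Toy intensities for three spectra (made-up values, not real data).
observed = [[1.0, 2.0, 3.0], [1.0, 2.0, 3.0], [1.0, 2.0, 3.0]]
predicted = [[2.0, 4.0, 6.0], [3.0, 2.0, 1.0], [1.0, 2.0, 3.0]]
print(median_pearson(observed, predicted))
```

Using the median rather than the mean keeps a handful of poorly predicted spectra from dominating the summary value.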
Training new MS²PIP models
[todo]
Prediction features
The table below lists and describes all features generated and used by MS²PIP. These are mostly based on four amino acid properties (basicity, hydrophobicity, helicity and isoelectric point) for the full precursor and for the N- and C-terminal ions.
| Feature | Description |
|---|---|
| | Precursor length |
| | Precursor charge |
| | Precursor charge is 1 (one-hot encoding) |
| | Precursor charge is 2 (one-hot encoding) |
| | Precursor charge is 3 (one-hot encoding) |
| | Precursor charge is 4 (one-hot encoding) |
| | Precursor charge is 5 (one-hot encoding) |
| | Minimum basicity of the precursor |
| | First quartile of basicity of the precursor |
| | Second quartile of basicity of the precursor |
| | Third quartile of basicity of the precursor |
| | Maximum basicity of the precursor |
| | Minimum helicity of the precursor |
| | First quartile of helicity of the precursor |
| | Second quartile of helicity of the precursor |
| | Third quartile of helicity of the precursor |
| | Maximum helicity of the precursor |
| | Minimum hydrophobicity of the precursor |
| | First quartile of hydrophobicity of the precursor |
| | Second quartile of hydrophobicity of the precursor |
| | Third quartile of hydrophobicity of the precursor |
| | Maximum hydrophobicity of the precursor |
| | Minimum isoelectric point of the precursor |
| | First quartile of isoelectric point of the precursor |
| | Second quartile of isoelectric point of the precursor |
| | Third quartile of isoelectric point of the precursor |
| | Maximum isoelectric point of the precursor |
| | Length of the N-terminal ion |
| | Length of the C-terminal ion |
| | Count of amino acid ‘A’ in the N-terminal ion |
| | Count of amino acid ‘A’ in the C-terminal ion |
| | Count of amino acid ‘C’ in the N-terminal ion |
| | Count of amino acid ‘C’ in the C-terminal ion |
| | Count of amino acid ‘D’ in the N-terminal ion |
| | Count of amino acid ‘D’ in the C-terminal ion |
| | Count of amino acid ‘E’ in the N-terminal ion |
| | Count of amino acid ‘E’ in the C-terminal ion |
| | Count of amino acid ‘F’ in the N-terminal ion |
| | Count of amino acid ‘F’ in the C-terminal ion |
| | Count of amino acid ‘G’ in the N-terminal ion |
| | Count of amino acid ‘G’ in the C-terminal ion |
| | Count of amino acid ‘H’ in the N-terminal ion |
| | Count of amino acid ‘H’ in the C-terminal ion |
| | Count of amino acid ‘I’ in the N-terminal ion |
| | Count of amino acid ‘I’ in the C-terminal ion |
| | Count of amino acid ‘K’ in the N-terminal ion |
| | Count of amino acid ‘K’ in the C-terminal ion |
| | Count of amino acid ‘M’ in the N-terminal ion |
| | Count of amino acid ‘M’ in the C-terminal ion |
| | Count of amino acid ‘N’ in the N-terminal ion |
| | Count of amino acid ‘N’ in the C-terminal ion |
| | Count of amino acid ‘P’ in the N-terminal ion |
| | Count of amino acid ‘P’ in the C-terminal ion |
| | Count of amino acid ‘Q’ in the N-terminal ion |
| | Count of amino acid ‘Q’ in the C-terminal ion |
| | Count of amino acid ‘R’ in the N-terminal ion |
| | Count of amino acid ‘R’ in the C-terminal ion |
| | Count of amino acid ‘S’ in the N-terminal ion |
| | Count of amino acid ‘S’ in the C-terminal ion |
| | Count of amino acid ‘T’ in the N-terminal ion |
| | Count of amino acid ‘T’ in the C-terminal ion |
| | Count of amino acid ‘V’ in the N-terminal ion |
| | Count of amino acid ‘V’ in the C-terminal ion |
| | Count of amino acid ‘W’ in the N-terminal ion |
| | Count of amino acid ‘W’ in the C-terminal ion |
| | Count of amino acid ‘Y’ in the N-terminal ion |
| | Count of amino acid ‘Y’ in the C-terminal ion |
| | Basicity of the first amino acid of the peptide |
| | Basicity of the last amino acid of the peptide |
| | Basicity of the amino acid before the fragmentation site |
| | Basicity of the amino acid at the fragmentation site |
| | Basicity of the 1st amino acid after the fragmentation site |
| | Basicity of the 2nd amino acid after the fragmentation site |
| | Sum of basicity of the N-terminal ion |
| | Minimum basicity of the N-terminal ion |
| | First quartile of basicity of the N-terminal ion |
| | Second quartile of basicity of the N-terminal ion |
| | Third quartile of basicity of the N-terminal ion |
| | Maximum basicity of the N-terminal ion |
| | Sum of basicity of the C-terminal ion |
| | Minimum basicity of the C-terminal ion |
| | First quartile of basicity of the C-terminal ion |
| | Second quartile of basicity of the C-terminal ion |
| | Third quartile of basicity of the C-terminal ion |
| | Maximum basicity of the C-terminal ion |
| | Helicity of the first amino acid of the peptide |
| | Helicity of the last amino acid of the peptide |
| | Helicity of the amino acid before the fragmentation site |
| | Helicity of the amino acid at the fragmentation site |
| | Helicity of the 1st amino acid after the fragmentation site |
| | Helicity of the 2nd amino acid after the fragmentation site |
| | Sum of helicity of the N-terminal ion |
| | Minimum helicity of the N-terminal ion |
| | First quartile of helicity of the N-terminal ion |
| | Second quartile of helicity of the N-terminal ion |
| | Third quartile of helicity of the N-terminal ion |
| | Maximum helicity of the N-terminal ion |
| | Sum of helicity of the C-terminal ion |
| | Minimum helicity of the C-terminal ion |
| | First quartile of helicity of the C-terminal ion |
| | Second quartile of helicity of the C-terminal ion |
| | Third quartile of helicity of the C-terminal ion |
| | Maximum helicity of the C-terminal ion |
| | Hydrophobicity of the first amino acid of the peptide |
| | Hydrophobicity of the last amino acid of the peptide |
| | Hydrophobicity of the amino acid before the fragmentation site |
| | Hydrophobicity of the amino acid at the fragmentation site |
| | Hydrophobicity of the 1st amino acid after the fragmentation site |
| | Hydrophobicity of the 2nd amino acid after the fragmentation site |
| | Sum of hydrophobicity of the N-terminal ion |
| | Minimum hydrophobicity of the N-terminal ion |
| | First quartile of hydrophobicity of the N-terminal ion |
| | Second quartile of hydrophobicity of the N-terminal ion |
| | Third quartile of hydrophobicity of the N-terminal ion |
| | Maximum hydrophobicity of the N-terminal ion |
| | Sum of hydrophobicity of the C-terminal ion |
| | Minimum hydrophobicity of the C-terminal ion |
| | First quartile of hydrophobicity of the C-terminal ion |
| | Second quartile of hydrophobicity of the C-terminal ion |
| | Third quartile of hydrophobicity of the C-terminal ion |
| | Maximum hydrophobicity of the C-terminal ion |
| | Isoelectric point of the first amino acid of the peptide |
| | Isoelectric point of the last amino acid of the peptide |
| | Isoelectric point of the amino acid before the fragmentation site |
| | Isoelectric point of the amino acid at the fragmentation site |
| | Isoelectric point of the 1st amino acid after the fragmentation site |
| | Isoelectric point of the 2nd amino acid after the fragmentation site |
| | Sum of isoelectric points of the N-terminal ion |
| | Minimum isoelectric point of the N-terminal ion |
| | First quartile of isoelectric points of the N-terminal ion |
| | Second quartile of isoelectric points of the N-terminal ion |
| | Third quartile of isoelectric points of the N-terminal ion |
| | Maximum isoelectric point of the N-terminal ion |
| | Sum of isoelectric points of the C-terminal ion |
| | Minimum isoelectric point of the C-terminal ion |
| | First quartile of isoelectric points of the C-terminal ion |
| | Second quartile of isoelectric points of the C-terminal ion |
| | Third quartile of isoelectric points of the C-terminal ion |
| | Maximum isoelectric point of the C-terminal ion |
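To illustrate how features of the kinds listed above can be derived for a single fragmentation site, here is a minimal sketch. It is not the MS²PIP implementation: the feature names, the `fragment_features` and `five_number_summary` helpers, the quartile method (linear interpolation) and the `TOY_PROPERTY` values are all assumptions made for illustration. In particular, the property values are placeholders and not the real basicity, helicity, hydrophobicity or isoelectric point scales.

```python
# The 19 amino acid letters that appear in the count features above.
AMINO_ACIDS = "ACDEFGHIKMNPQRSTVWY"

# Placeholder property scale (hypothetical values, for illustration only).
TOY_PROPERTY = {aa: float(i) for i, aa in enumerate(AMINO_ACIDS)}

SUMMARY_LABELS = ("min", "q1", "q2", "q3", "max")


def five_number_summary(values):
    """Minimum, three quartiles and maximum, using linear interpolation."""
    ordered = sorted(values)

    def at(q):
        pos = q * (len(ordered) - 1)
        low = int(pos)
        high = min(low + 1, len(ordered) - 1)
        return ordered[low] + (ordered[high] - ordered[low]) * (pos - low)

    return [at(q) for q in (0.0, 0.25, 0.5, 0.75, 1.0)]


def fragment_features(peptide, charge, site, prop=TOY_PROPERTY):
    """Features for one fragmentation site of an unmodified peptide.

    The N-terminal ion is peptide[:site] and the C-terminal ion is
    peptide[site:], so `site` must lie between 1 and len(peptide) - 1.
    """
    if not 1 <= site < len(peptide):
        raise ValueError("site must split the peptide into two non-empty ions")

    features = {"precursor_length": len(peptide), "precursor_charge": charge}

    # One-hot encoding of precursor charges 1 to 5.
    for z in range(1, 6):
        features[f"charge_is_{z}"] = int(charge == z)

    # Summary statistics of the property over the full precursor.
    precursor_values = [prop[aa] for aa in peptide]
    for label, value in zip(SUMMARY_LABELS, five_number_summary(precursor_values)):
        features[f"precursor_{label}"] = value

    # Length, composition and property statistics for each ion.
    for ion_name, ion in (("n_ion", peptide[:site]), ("c_ion", peptide[site:])):
        features[f"{ion_name}_length"] = len(ion)
        for aa in AMINO_ACIDS:
            features[f"{ion_name}_count_{aa}"] = ion.count(aa)
        ion_values = [prop[aa] for aa in ion]
        features[f"{ion_name}_sum"] = sum(ion_values)
        for label, value in zip(SUMMARY_LABELS, five_number_summary(ion_values)):
            features[f"{ion_name}_{label}"] = value

    return features


example = fragment_features("ACDEK", charge=2, site=2)
print(example["n_ion_length"], example["c_ion_length"], example["c_ion_count_K"])
```

In this sketch the property statistics are computed for a single scale. Repeating the same computation for each of the four amino acid properties would give one block of features per property, mirroring the layout of the table.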