Command line interface

ms2pip

ms2pip [OPTIONS] COMMAND [ARGS]...

Options

-l, --logging-level <logging_level>
Options:

DEBUG | INFO | WARNING | ERROR | CRITICAL

--version

Show the version and exit.

annotate-spectra

Annotate observed spectra.

Parameters

psms

PSMList or path to PSM file that is supported by psm_utils.

spectrum_file

Path to spectrum file with target intensities.

psm_filetype

Filetype of the PSM file. By default, None. Should be one of the supported psm_utils filetypes. See https://psm-utils.readthedocs.io/en/stable/#supported-file-formats.

spectrum_id_pattern

Regular expression pattern to apply to spectrum titles before matching to peptide file spec_id entries.

model

Model to use as reference for the ion types that are extracted from the observed spectra. Default: “HCD”, which results in the extraction of singly charged b- and y-ions.

ms2_tolerance

MS2 tolerance in Da for observed spectrum peak annotation. By default, 0.02 Da.

processes

Number of parallel processes for multiprocessing steps. By default, all available.

Returns

results: List[ProcessingResult]

List of ProcessingResult objects with theoretical m/z and observed intensity values.

ms2pip annotate-spectra [OPTIONS] PSMS SPECTRUM_FILE

Options

-t, --psm-filetype <psm_filetype>
Options:

idxml | msms | mzid | peprec | pepxml | percolator | proteome_discoverer | timscore | tsv | xtandem | msamanda | sage | ionbot

-o, --output-name <output_name>
-p, --spectrum-id-pattern <spectrum_id_pattern>
--model <model>
Options:

CID | HCD2019 | TTOF5600 | TMT | iTRAQ | iTRAQphospho | HCDch2 | CIDch2 | HCD2021 | Immuno-HCD | CID-TMT | timsTOF2023 | timsTOF2024 | HCD | timsTOF

--ms2-tolerance <ms2_tolerance>
-n, --processes <processes>

Arguments

PSMS

Required argument

SPECTRUM_FILE

Required argument

correlate

Compare predicted and observed intensities and optionally compute correlations.

Parameters

psms

PSMList or path to PSM file that is supported by psm_utils.

spectrum_file

Path to spectrum file with target intensities.

psm_filetype

Filetype of the PSM file. By default, None. Should be one of the supported psm_utils filetypes. See https://psm-utils.readthedocs.io/en/stable/#supported-file-formats.

spectrum_id_pattern

Regular expression pattern to apply to spectrum titles before matching to peptide file spec_id entries.

compute_correlations

Compute correlations between predictions and targets.

add_retention_time

Add retention time predictions with DeepLC (Requires optional DeepLC dependency).

model

Model to use for prediction. Default: “HCD”.

model_dir

Directory where XGBoost model files are stored. Default: ~/.ms2pip.

ms2_tolerance

MS2 tolerance in Da for observed spectrum peak annotation. By default, 0.02 Da.

processes

Number of parallel processes for multiprocessing steps. By default, all available.

Returns

results: List[ProcessingResult]

Predicted spectra with theoretical m/z and predicted intensity values, and optionally, correlations.

ms2pip correlate [OPTIONS] PSMS SPECTRUM_FILE

Options

-t, --psm-filetype <psm_filetype>
Options:

idxml | msms | mzid | peprec | pepxml | percolator | proteome_discoverer | timscore | tsv | xtandem | msamanda | sage | ionbot

-o, --output-name <output_name>
-p, --spectrum-id-pattern <spectrum_id_pattern>
-x, --compute-correlations
-r, --add-retention-time
--model <model>
Options:

CID | HCD2019 | TTOF5600 | TMT | iTRAQ | iTRAQphospho | HCDch2 | CIDch2 | HCD2021 | Immuno-HCD | CID-TMT | timsTOF2023 | timsTOF2024 | HCD | timsTOF

--model-dir <model_dir>
--ms2-tolerance <ms2_tolerance>
-n, --processes <processes>

Arguments

PSMS

Required argument

SPECTRUM_FILE

Required argument

get-training-data

Extract feature vectors and target intensities from observed spectra for training.

Parameters

psms

PSMList or path to PSM file that is supported by psm_utils.

spectrum_file

Path to spectrum file with target intensities.

psm_filetype

Filetype of the PSM file. By default, None. Should be one of the supported psm_utils filetypes. See https://psm-utils.readthedocs.io/en/stable/#supported-file-formats.

spectrum_id_pattern

Regular expression pattern to apply to spectrum titles before matching to peptide file spec_id entries.

model

Model to use as reference for the ion types that are extracted from the observed spectra. Default: “HCD”, which results in the extraction of singly charged b- and y-ions.

ms2_tolerance

MS2 tolerance in Da for observed spectrum peak annotation. By default, 0.02 Da.

processes

Number of parallel processes for multiprocessing steps. By default, all available.

Returns

features

pandas.DataFrame with feature vectors and targets.

ms2pip get-training-data [OPTIONS] PSMS SPECTRUM_FILE

Options

-t, --psm-filetype <psm_filetype>
Options:

idxml | msms | mzid | peprec | pepxml | percolator | proteome_discoverer | timscore | tsv | xtandem | msamanda | sage | ionbot

-o, --output-name <output_name>
-p, --spectrum-id-pattern <spectrum_id_pattern>
--model <model>
Options:

CID | HCD2019 | TTOF5600 | TMT | iTRAQ | iTRAQphospho | HCDch2 | CIDch2 | HCD2021 | Immuno-HCD | CID-TMT | timsTOF2023 | timsTOF2024 | HCD | timsTOF

--ms2-tolerance <ms2_tolerance>
-n, --processes <processes>

Arguments

PSMS

Required argument

SPECTRUM_FILE

Required argument

predict-batch

Predict fragmentation spectra for a batch of peptides.

Parameters

psms

PSMList or path to PSM file that is supported by psm_utils.

psm_filetype

Filetype of the PSM file. By default, None. Should be one of the supported psm_utils filetypes. See https://psm-utils.readthedocs.io/en/stable/#supported-file-formats.

add_retention_time

Add retention time predictions with DeepLC (Requires optional DeepLC dependency).

model

Model to use for prediction. Default: “HCD”.

model_dir

Directory where XGBoost model files are stored. Default: ~/.ms2pip.

processes

Number of parallel processes for multiprocessing steps. By default, all available.

Returns

predictions: List[ProcessingResult]

Predicted spectra with theoretical m/z and predicted intensity values.

ms2pip predict-batch [OPTIONS] PSMS

Options

-o, --output-name <output_name>
-f, --output-format <output_format>
Options:

csv | mgf | msp | bibliospec | spectronaut | dlib

-r, --add-retention-time
--model <model>
Options:

CID | HCD2019 | TTOF5600 | TMT | iTRAQ | iTRAQphospho | HCDch2 | CIDch2 | HCD2021 | Immuno-HCD | CID-TMT | timsTOF2023 | timsTOF2024 | HCD | timsTOF

--model-dir <model_dir>
-n, --processes <processes>

Arguments

PSMS

Required argument

predict-library

Predict spectral library from protein FASTA file.

ms2pip predict-library [OPTIONS]

predict-single

Predict fragmentation spectrum for a single peptide.

ms2pip predict-single [OPTIONS] PEPTIDOFORM

Options

-o, --output-name <output_name>
--model <model>
Options:

CID | HCD2019 | TTOF5600 | TMT | iTRAQ | iTRAQphospho | HCDch2 | CIDch2 | HCD2021 | Immuno-HCD | CID-TMT | timsTOF2023 | timsTOF2024 | HCD | timsTOF

--model-dir <model_dir>
-p, --plot

Arguments

PEPTIDOFORM

Required argument