Abstract
The Bio Tool Kit is a micro application for use within PeakView Software and Explorer in SCIEX OS Software which provides a suite of tools for biomolecule analysis (molecular weight analysis, sequence analysis, as well as the detection and mapping of any post-translational modifications, mutations, or truncations).Bio Tool Kit can be used with both data from SCIEX QTRAP® Systems, TripleTOF® Systems or QTOF Systems.
Introduction
In order to more fully comprehend the function and behavior of biomolecules it is essential to better understand the details of their molecular structure and sequence. The Bio Tool Kit provides a suite of tools for complete characterization of biomolecules such as proteins, peptides, and oligonucleotides using data generated from SCIEX mass spectrometers. The tools within the Bio Tool Kit enable intact molecular weight analysis, sequence analysis, as well as the detection, mapping, and quantitation of any post-translational modifications, mutations, or truncations. Together, these tools can provide a much more comprehensive view of the biomolecule under study.
Key benefits of the Bio Tool Kit
- Reconstruction workflows
- Determine the intact molecular weights of proteins, peptides, oligonucleotides, mAbs, and other biomolecules using the mass reconstruction tools and either a charge series or isotope series in positive ion or negative ion mode
- Manual sequence workflows
- Find the sequence of unknown peptides using the de novo sequencing tools and then match the proposed sequence to fragment ions within the MS/MS data
- Peptide mapping workflows
- Perform an in silico digest of a protein sequence and then map that sequence to LC-MS data. Confirm peptide matches using MS/MS data
- Extensive data dictionary
- All of the features within the Bio Tool Kit are backed by a database containing over 1300 modifications to enable a comprehensive analysis of any biological or chemical processing that may have occurred to the biomolecule
Reconstruction workflows
Determining the intact mass of a biomolecule is often an important first step in biomolecule characterization. Users can choose to reconstruct raw data automatically or in a more manual fashion. With the protein reconstruction tool, the maximum entropy algorithm automatically detects all charge series envelopes in the raw data and produces a reconstructed spectrum annotated with the intact molecular weights of all biomolecules detected in the raw data (Figure 1).
With the manual reconstruction workflow, users have the choice to deconvolute based upon a charge series or isotope series. For a manual reconstruction based upon an isotope series, the user simply selects multiple isotopes of the ion of interest and the tool will calculate the intact molecular weight and annotate the spectrum (Figure 2, top). Peptide masses can also be automatically determined from LC-MS (Figure 2 bottom) by using the LC-MS Reconstruct algorithm. Reconstruction is performed across the chromatographic run, peptide masses are computed and a table of masses is created for use in downstream workflows such as peptide mapping (Figure 5).
Manual sequence workflows
Using the tools within the Bio Tool Kit, users can “walk” up or down an MS/MS spectrum to determine the sequence of a peptide (Figure 3). This process is straightforward and is accomplished by following these steps:
- Highlight a fragment ion as the starting point for manual sequencing.
- Double-click on the caption for the next peak to consider in the sequence.
- Continue selecting peaks until finished. The final result shows the peptide sequence ladder.
Peptide sequences can also be matched to MS/MS data with the peptide fragment pane. Here, a table of theoretical fragment ions from the proposed sequence is matched to ions found in the data. All matches are highlighted within the table and labeled in the MS/MS spectrum. The tool considers multiple different fragment ion types and will also find and highlight modifications (Figure 4).
Peptide mapping workflow
Mapping a protein sequence to mass spectrometry data of a digested protein is one of the key ways to gain detailed insight into the structure and modifications on a protein. With the Bio Tool Kit, the first step to peptide mapping is to identify spectral peaks in the LC-MS data and then reconstruct them to neutral masses from the m/z data (Figure 2). The LC-MS Peptide Reconstruct feature uses a peak finding algorithm to identify groups of peaks that form isotope series and charge series.
Once the peaks have been found, it’s time to map the data. Here a protein sequence is digested in silico to create a list of peptides. These peptides are then matched to the peaks found in the LC-MS data. Any sample preparation artifacts, missed cleavages, and biological modifications can also be selected and considered for the peptide matching. Matched peptides are highlighted in the reconstructed peak list and the protein sequence is highlighted with all found peptides (Figure 5).
If an LC-MS/MS workflow was used, the MS/MS data for any matched peptides can be compared with their sequence using the Peptide Fragment pane. As with the Sequencing workflow, all fragment ion matches are highlighted in the table and annotated in the spectrum (Figure 4).
Modification catalog
Behind all of the features within the Bio Tool Kit lies the Data Dictionary. This dictionary contains detailed information about the building blocks for biomolecules such as amino acids and nucleotides. Additionally, the dictionary contains over 1300 biological, sample preparation, and substitution modifications including modifications from the UniMod database (Figure 6). Users may even edit the dictionary to add custom modifications if desired. This modification catalog is used in all relevant SCIEX software packages to ensure consistency across experiments and data processing:
- PeakView® Software and Bio Tool Kit for peptide mapping, de novo sequencing, and reconstruction workflows
- ProteinPilot™ Software for discovery proteomics
- BioPharmaView™ Software for biologics characterization
Conclusions
The Bio Tool Kit is a micro application for use within PeakView Software and Explorer in SCIEX OS Software and provides a suite of tools for biomolecule analysis. The kit provides the most commonly-needed functions for the characterization of biomolecules by mass spectrometry, including:
-
Manual Sequence Workflow
- Peptide De novo sequencing
- Peptide MS/MS fragment matching
-
Peptide Mapping Workflow
- Protein sequence in silico digest
- MS mapping with MS/MS confirmation
-
Reconstruction Workflows
- Intact protein
- Peptide mixtures
- Manual determination
- Oligonucleotide in negative mode
-
Extensive Data Dictionary
- Over 500 sample preparation and biological modifications
- Add custom modifications