- Home
- Life Science Research
- Metabolomics
- MS-DIAL data processing for untargeted metabolomics
With SWATH® Acquisition on the TripleTOF® 6600 System
Cyrus Papan, SCIEX, Germany
Fast acquisition speed and wide dynamic range of TripleTOF 6600 System combined with SWATH Acquisition generates quantitative information of all detectable compounds across a wide concentration range from biological samples. The MS-DIAL extracts the product ion spectra for MS/MS fragment matching, identification and quantitation.
The major bottle neck in untargeted metabolomics analysis since its conception has been the accurate identification of the metabolites in a complex biological sample. A confident approach for unknown metabolite identification is to match the product ion fragments to a reference MS/MS spectrum. However, data dependent techniques often do not collect MS/MS on all precursors to allow for the identification of metabolites.
The data independent approach using SWATH® Acquisition ensures that product ion spectra is acquired on all detectable compounds in a sample, effectively generating a digitized record of all detectable metabolites in a sample.1 The fast MS/MS acquisition speed of the TripleTOF® 6600 System (up to 100 MS/MS per second) is key for acquiring high quality SWATH Acquisition data on metabolomic samples. In addition, the use of variable sized Q1 windows2 improves the specificity in the fragment assignment. Furthermore, the SWATH Acquisition MS/MS data can be used for quantitation, allowing an MRM-style quantification approach at the MS/MS level.
MS-DIAL (Mass Spectrometry – Data Independent AnaLysis) is an open-source software for the identification and quantification of small molecules and lipids from DIA and DDA-based untargeted LC-MS/MS analysis.3 It leverages the power of the SWATH Acquisition method for untargeted metabolomics analysis using a two-step process: data is deconvoluted by MS2Dec algorithm, then the precursor and fragment ions are re-associated to obtain purified specific product ion spectra of each precursor ion (Figure 1). The ‘purified MS/MS spectra’ provides high accuracy for metabolite identification and better identification coverage of low abundant metabolites. Here, the use of MS-DIAL for processing SWATH Acquisition data from the TripleTOF 6600 System is demonstrated. A comparison of metabolites from two different strains of Arabidopsis was performed.
Sample preparation: The upper aqueous phase of a chloroform: methanol extraction of plant material from two different strains of the widely used model organism Arabidopsis thaliana (mouse ear cress) were prepared.
LC-MS/MS analysis: The samples were analyzed using a TripleTOF® 6600 System with a Shimadzu Nexera HPLC system using variable window SWATH acquisition method in positive ion mode. Q1 mass range from 80 to 600 Da was covered and MS/MS was acquired in high resolution mode (30000 resolution).
Data processing: Data was converted using ABF converter interface, Reifycs Analysis Base File Convertor to convert the SCIEX data file (*.wiff) to ABF format3. The ABF converter interface with experimental data files loaded for conversion is shown in Figure 2. ABF files were then processed in MS-DIAL software pipeline, with multiple files loaded in one work session. Chromatographic peaks are integrated and aligned for quantitative comparison between sample groups. For the library matching, MS-DIAL utilizes the open source NIST MSP text file format library for the fragment spectral library matching to MSP-libraries from public compound data bases such as MassBank3 or LipidBlast4 for compound identification.
Three data files of each cell line group (A1-A3 and B1-B3) were loaded into the MS-DIAL software version 2.94 and alignment was done. The main results window is shown in Figure 3.
Figure 1 shows the raw (upper left panel) and purified (upper right panel) spectra at the retention time indicated by the red arrow. It is evident that some of the co-eluting fragment ions are low abundant compared to contaminating but not co-eluting fragments. Many peaks present in the un-purified product ion spectrum have been eliminated by the purification process. This results in a much cleaner product ion spectrum, which is then better suitable for matching with a spectral library.
MS-DIAL uses spectral libraries in the open NIST msp text format, converted from MassBank databases, a public mass spectral database system. Currently, MassBank contains around 220 000 experimental and in silico spectral records of over 73,000 unique compounds. The library is loaded into MS-DIAL, then the deconvolved SWATH acquisition or DDA product ion spectra can be matched to the database reference spectra.
In Figure 4, three examples of identifications from the SWATH acquisition data by MS/MS spectral matching are shown. The dot-plot of m/z vs. retention time is filtered to show only identified peaks. Note that the software also assigns possible adducts, as shown in the middle and bottom left dot plots.
Table 1 shows a list of identified compounds exported from the MS-DIAL software. Thirty-nine metabolites were identified with confidence from both of the plant strains. The average RT and average m/z are reported for the measured precursor ion.
MS-DIAL data output can be exported in several file formats for post processing data analysis as shown in Figure 5. Various metrics are selectable in the data export window. Data exported in individual text file formats can be used for further data processing in other tools, such as Excel, or other statistical software packages.
Data can also be exported as a generic text file for import into MarkerView™ Software. Here, the aligned chromatographic peak areas were exported for features with significant changes for further analysis.
Principal component analysis (PCA) can then be performed on the identified metabolites within MS-DIAL. A clear separation between the two plant groups was observed in the scores plot (Figure 6). The loadings plot highlights the individual components responsible for the separation. The labels in the loadings plot are filtered to show only the identified components. The contribution plot below shows that over 70% of the variability in the detected features are correlated with the type of sample.
The fast acquisition speed and wide dynamic range of the TripleTOF 6600 System combined with SWATH Acquisition generates structural and quantitative information of all detectable compounds across a wide concentration range from biological samples. The MS-DIAL software leverages the power of the SWATH Acquisition data for untargeted metabolomics acquisition by extracting the product ion spectra for MS/MS fragment matching. It then utilizes the accurate mass, isotope ratios information, and retention time prediction for identification which exceeds the two orthogonal parameters guideline by the Metabolomics Standards Initiative.7
The MS-DIAL generated product ion spectra are purified from co-eluting chemical noise and are often cleaner compared to DDA-derived spectra, resulting in better matching with spectral libraries. MS-DIAL also supports normalization methods for MS/MS quantitation analysis. The complete workflow has been demonstrated here for the quantitative comparison of two strains of A. Thaliana.