Using the TripleTOF 6600 system and SWATH acquisition
Robert Proos
SCIEX, USA
SWATH acquisition (a data independent acquisition workflow) coupled with quality spectral libraries is of increasing interest when analyzing complex biological samples due to the comprehensive and quantitative nature of the results. Here, the detection rates of metabolites in plasma from both the DDA and DIA datasets on the TripleTOF 6600 system was studied. Also a series of LC-MS/MS libraries were also evaluated in this study include NIST 17, MoNA (MassBank of North America), METLIN, and SCIEX libraries. It was found that SWATH Acquisition identifies over 70% more compounds than an optimized data dependent acquisition (DDA) method.
Comprehensive metabolite identification with MS/MS library spectral matching can be problematic for data-dependent acquisition (DDA) workflows as it often requires multiple injections for each sample to obtain all the MS/MS needed for identification and the sampling for MS/MS is stochastic. Data Independent Acquisition (DIA) workflows are of increasing interest when analyzing complex biological samples due to the comprehensive nature of the acquisition approach.1 In addition to capturing MS/MS spectra for all detectable analytes, SWATH acquisition also provides quantitative data at both the MS or MS/MS levels, enabling a comprehensive qualitative and quantitative analysis of metabolites in complex biological samples like plasma in a single injection. Here, the detection rates of metabolites in plasma will be explored using both the DDA and DIA workflows on the TripleTOF 6600 System.
Accurate compound identification remains a bottleneck in the field of metabolomics in the translation of experimental features to the biologic interpretation. Using library searching approaches, where MS and MS/MS information is compared to spectral libraries, is a common approach when exploring complex samples. In this study, an evaluation of several spectral libraries was performed on data from both the DDA and DIA workflows, to compare the number of metabolites accurately identified in an extracted plasma sample in a single injection workflow.2 The LC-MS/MS libraries evaluated in this study include NIST 17, MoNA (MassBank of North America), METLIN, and SCIEX libraries.
Sample preparation: 100 μL pooled plasma (AA 45/32 Phys Control Plasma, SCIEX) was transferred into a 2 mL microtube, 800 μL of methanol was added to extract metabolites and precipitate proteins. Samples were vortexed and then centrifuged at 14,000 RPM for 10 min. An 800 μL aliquot of the supernatant, which contains the metabolites, was transferred to a 2 mL microtube. The sample was dried to a pellet using a CentriVap Concentrator using no heat. The pellet was then reconstituted in 100 μL de-ionized water, centrifuged at 14,000 RPM for 10 min, and the supernatant transferred to an HPLC vial insert.
Chromatography: An ExionLC AD HPLC system with a Phenomenex Kinetex F5 column (150×2.1 mm, 2.6 μm, 100 Å) at 30 ºC was used with a flow rate of 200 μL/min. Mobile phase A = water with 0.1% formic acid. Mobile phase B = acetonitrile with 0.1% formic acid. The reversed-phase gradient method is listed in Table 1. The injection volume was 3 μL.
Mass spectrometry: A TripleTOF 6600 system with a DuoSpray source and Electrospray Ionization (ESI) probe with positive mode ionization was used. For both DDA and SWATH acquisition methods, TOF MS data was collected between 50 and 1000 m/z with an accumulation time of 100 msec. DDA acquisition was performed using top 20 MS/MS with a threshold of 500 cps, dynamic background subtraction, dynamic accumulation, with a minimum accumulation time of 25 msec. No inclusion or exclusion lists were used for the DDA analysis. SWATH acquisition was performed with 20 variable Q1 isolation windows, optimized based on the TOF MS data (Figure 2), using the SWATH acquisition variable window calcluator.3 Each MS/MS spectrum used a 25 msec accumulation time. The cycle time was 651 msec for both the DDA and DIA methods.
Data processing: Data were processed using SCIEX OS software for non-targeted peak finding (Table 2) with library matching to the NIST 17, METLIN, MoNA, and the SCIEX All-in-One accurate mass MS/MS spectral libraries. Only experimentally obtained LC-MS/MS spectra were used in the analysis. Compound identification required both MS level precursor mass and MS/MS level spectral matching.
In the DDA analysis, out of the 14,573 features found in MS, a total of 4864 precursor ions selected for MS/MS analysis. Of those, 2456 were replicate spectra from the same precursor in subsequent scans, resulting in 2408 unique candidate ions fragmented. A summary of the detected features is provided in Table 3.
The dynamic background subtraction filter used in the selection process is used to improve the quality of the MS/MS spectra by only triggering on candidate ions near the apex of the peak. It does so by creating an XIC of the candidate ion over the last 3 data points, taking the first derivative, and determining whether the candidate ion is approaching or at the apex of the peak. This filter reduces the total number of candidate ions selected for MS/MS, eliminating background ions and not triggering on peak fronts or tails even when above the intensity threshold.
As the cycle time in the method is fixed, in cycles when less than the maximum number of candidate ions are selected, the 500 msec of MS/MS time is distributed among the selected precursors. The longer accumulation times result in decreased noise, which improves the data quality. The dynamic accumulation option also used in this method further improves data quality by allocating more time to precursor ions of lower intensity. The overall result is high-quality MS/MS data which allows for improved spectral library matching.
In the single injection DDA analysis with 2408 unique candidate ions on which MS/MS data were obtained, 362 of these features were identified by both precursor mass and MS/MS library matching using SCIEX OS software for analysis. An example of a metabolite identification from DDA acquisition is provided in Figure 3.
Contrary to DDA acquisition where a precursor ion is individually selected and isolated for fragmentation, in DIA, wider isolation windows are used such that MS/MS is acquired on all detectable compounds in the sample. Variable window SWATH acquisition ensures an optimal balance between sensitivity and specificity is achieved for each sample type.1 In this method the SWATH acquisition variable window calculator3 was used to determine optimum window sizes by adjusting the window size to normalize the total number of ions per window. A summary of these windows is presented in Figure 2.
From the SWATH acquisition data, all detectable features can be searched against spectral libraries for identification. The assignment of product ions to specific precursors is determined by the software using background subtraction or alignment of product ions to precursor ions by PCVG. In this example, all 16,799 features found in the SWATH acquisition data were searched against the spectral libraries, compared to only 4,864 features with spectra from the DDA analysis.
The additional 12 thousand features not selected by DDA are likely low-intensity analytes as they were not triggered in the top 20 DDA method. The SWATH acquisition data yielded an additional 266 compounds that were identified, increasing the total number of compounds from 362 to 628 identified by precursor match and MS/MS spectral match from a single injection of extracted human plasma. An example of a compound identified by SWATH acquisition that was not identified by the DDA method is shown in Figure 4.
With SCIEX OS software, a variety of libraries can be used to search the DDA or SWATH acquisition data. Here, a number of different libraries varying in size and source were used to investigate the numbers of metabolites observed from DDA and SWATH acquisition data. Table 4 and Figure 1 summarize the results of all the library searches on both datasets. Not surprisingly, the large libraries provide more identifications. Also the SWATH acquisition data provides consistently higher identifications over DDA, also as expected due to the comprehensive nature of the data acquisition.
Next, to understand the differences provided by using different libraries, the compounds found by each approach were compared and summarized in Figure 5. As would be expected, the larger the spectral library, the more library matches that could be obtained as seen in Table 4. As plasma often contains more than just endogenous metabolites, using libraries with broader coverage can provide more feature identifications. However if focusing on human biology is a key component of the study, using library focused on endogenous metabolites (like the AMMSL) can simplify the biological study.
Also, it was found that libraries varied quite widely in the compound identifications found, even between metabolite focused libraries. In the Venn diagram (Figure 5), there were 249 features identified using the METLIN library that were not identified using only the MoNA library. Similarly, there were 124 features identified using the MoNA library that were not identified using only the Metlin library. There were 508 compounds identified using these two libraries; however, only 135 of those compounds were identified using both libraries.
It is also shown in Figure 5 that there were an additional 120 features that were identified using the NIST 17 and SCIEX All-in-One libraries that did not have MS/MS spectral matches in either the METLIN or MoNA libraries. The NIST 17 and SCIEX all-in-One libraries include compounds from additional classes including medications, extractables, leachables, pesticides, illicit drugs, foods, natural products, PFAS, and more, which could be present in biological matrices. An example of one of these library matches is presented in Figure 6, stachydrine is found in citrus fruit and therefore not an endogenous metabolite in human plasma, but a possible biomarker for citrus fruit consumption.
A simple 20-minute RP-LC method was used to acquire DDA and SWATH acquisition data on an extracted human plasma sample using a TripleTOF 6600 system. The top 20 DDA method was able to identify 362 of the features on the basis of precursor mass and MS/MS spectral matching. The 20 variable window SWATH acquisition method resulted in an additional 266 features being identified, for a total of 628 compounds identified in a single injection. This study has demonstrated that variable window SWATH acquisition can be useful for improving compound identification when using untargeted acquisition approaches for metabolomics.
In global metabolomics analysis of biofluids, and especially of human plasma or urine, numerous non-metabolite analytes will be present from sources such as the environment, food, and medications. These analytes have the potential to complicate data analysis appearing as unknowns, possibly showing similar profiles to biomarkers or up or down regulation differences. This study demonstrated that the inclusion of non-metabolomics libraries can increase identification of other features in the dataset. Additionally, it was shown that even across metabolomics focused libraries, significant differences in metabolite coverage still exist.