Untargeted lipidomics workflow on TripleTOF® 6600 System
Hiroshi Tsugawa1,2, Atsushi Hanada3, Kazuki Ikeda1,4, Yosuke Isobe1, Yuya Senoo1, Makoto Arita1,4,5
1RIKEN Center for Sustainable Resource Science, Japan, 2RIKEN Center for Integrative Medical Sciences, Japan, 3SCIEX, Japan, 4Graduate School of Medical Life Science, Yokohama City University, Japan, 5Graduate School of Pharmaceutical Sciences, Keio University, Japan
Data independent approaches to MS data collection is being increasingly explored in metabolomics and lipidomics research, due to the advantage of collecting comprehensive data on all detectable species. MS-DIAL is an open-source software for the identification and quantification of small molecules and lipids from DIA and DDA-based untargeted LC-MS/MS analysis. A workflow combining SWATH Scquisition and MS-DIAL for lipid identification is demonstrated here using different dietary oil supplements.
Lipids in nature are diverse biomolecules, as the majority of lipids contain different combinations of side chains and head groups. The major bottleneck in untargeted lipidomics analysis is the accurate identification of the lipids in complex biological samples.There is a new interest on accurate identification of the lipids in complex biological samples by matching the MS/MS product ion fragments to a reference MS/MS spectrum or using an in-silico MS/MS library1. Data dependent acquisition (DDA) techniques often do not collect MS/MS on all precursors but the data independent approach using SWATH Acquisition, a data independent acquisition (DIA) method, ensures that product ion spectra is acquired on all detectable compounds in a sample, effectively generating a digitized record of all detectable metabolites in a sample.
Here an untargeted lipidomics workflow is demonstrated which includes: a simple lipid extraction method from 20 μL of mouse plasma, a 15-minute SWATH Acquisition method, and a 1 hour data analysis step using MS-DIAL (Mass Spectrometry – Data Independent AnaLysis). MS-DIAL is an open-source software for the identification and quantification of small molecules and lipids from DIA and DDA-based untargeted LC-MS/MS analysis2, providing lipids identification by MS/MS confirmation. Using this workflow, 322 lipid molecular species were identified out of 420 spectra matched by MS/MS confirmation from a mouse plasma (3 groups, N = 4) provided by RIKEN Center for Integrated Biomedical Sciences Metabolome Research Team. Quantitative analysis of lipids clearly differentiated the mouse plasma samples from the 3 different dietary groups (Figure 1).
Mouse plasma preparation: In this study, 8 weeks old C57BL/ 6J mice (4 males) were fed for one week with Oriental Yeast diet containing either 4% soybean oil, palm oil or linseed oil. Blood was collected, heparin was added and plasma was extracted for each sample (4 samples collected from 3 dietary groups).
Lipid extraction: Extraction of lipids was performed by mixing chloroform: methanol: plasma at 100 μL: 200 μL: 20 μL. Here, a pooled quality control (QC) sample was created by mixing 50 μL of the final volume of 320 μL lipid extract from each sample in a separate container. In addition, two blanks (blanks) were prepared that followed the same extraction process. Finally, 150 μL of each sample solution was transferred to a vial for LC-MS analysis, as well as the QC and blank samples. The QC sample was analyzed after each 3 sample injections to monitor the sensitivity of mass spectrometry throughout the analysis. Note: the QC analysis was injected from the same vial for every QC injection. However, when considering sample vaporization etc., it is recommended to have separate aliquoted vials of QCs for large scale analysis.
LC-MS/MS analysis: The Shimadzu Nexera UHPLC system with Waters ACQUITY CSH C18 2.1 x 100 mm, 1.7 µm column was used for liquid chromatography with a SCIEX TripleTOF 6600 System. For large-scale analysis, a Waters ACQUITY VanGuard CSH C18 1.7 µm pre-column is recommended to protect the column, replacing it after 300 injections. It is recommended to replace the analytical column after 1000 to 1500 sample injections. The conditions of liquid chromatography are shown in Table 1, the liquid chromatography (LC) method gradient is shown in Table 2. Data was acquired on a TripleTOF 6600 System in positive and negative ion mode using the settings outlined in Tables 3.
The two prepared blank samples, 12 total samples, 2 vials of QC samples as well as one vial of ultra-pure water as blank buffer were analyzed in both positive and negative polarity mode. Sample injection was performed in order shown in Table 4 with biological samples analyzed in randomized order.
For 200 injections, 1 liter of each mobile phase is required. For mobile phase A, the glass bottle is first rinsed with acetonitrile, then 600 mL of acetonitrile, 399 mL HPLC grade water, and 1 mL of formic acid is added. This is mixed well and then 0.63 g of ammonium formate salt is and sonication at room temperature for 10 minutes.
For mobile phase B, place 1 mL of ultrapure water in a small beaker, add 1 mL of formic acid. Add in 0.63 g of ammonium formate and mix slowly so that the above three are mixed. Rinse a 1 liter glass bottle with acetonitrile, add in the solution prepared in the beaker, then add 900 mL of isopropanol and 98 mL of acetonitrile to 1 L glass bottle. Sonicate at room temperature for 10 minutes.
Do not use mobile phases until ammonium formate is dissolved and bubbles have disappeared.
SWATH Acquisition data were analyzed by MS-DIAL version 1.98. As a result, a total of 322 lipids were identified by positive and negative mode (Table 5). Lipid identification was achieved using MS/MS confirmation after performing automatic peak detection, waveform processing, and compound identification by MS-DIAL which takes approximately 1 hour. The automatic identification of MS-DIAL is carried out using the highly reliable integration index considering retention time and isotopic ratio in addition to the MS/MS similarity calculation and by dot/product method which is the most frequently used method2. MS-DIAL graphical user interface also assists the manual curation for identification results (Figure 2).
Standardization of quantitative values was also carried out by the LOESS (LOcally WEighted Scatter-plot Smoother) Cubic spline method using quantitative values of pooled quality controls (QCs).4 The quantitative drift of MS signal for each compound is computed using the assumption that the quantitative value of QC measured at a rate of 1 per every 3-5 sample, a common method used for large-scale cohort studies and implemented in MS-DIAL.3 Each quantitative value is expressed as a relative value when the quantitative value of QC is set to 1. The merit of this method is that it compensates for drift in the MS quantitation value for each analysis, but also it can integrate positive ion mode data and negative ion mode data (as it is already standardized based on QC).
Furthermore, availability of a QC sample of the same quality can enable integration of data across months and years. The mean or median value of the original QC ion abundances can be multiplied to the normalized values so that the ion abundance information for each metabolite is considered for data interpretation.
Results from principal component analysis (PCA) on the identified 420 lipid spectra clearly distinguishes the differences in phenotype reflecting the dietary condition (Figure 1). PCA Analysis highlighted that 55% of the data was explainable by the biological effect of the mice fed by different diets.
List of identified lipids shown in Table 6 indicates 18: 3 (α-linolenic acid), 20: 5 (eicosapentaenoic acid, EPA), 22: 6 (docosahexaenoic acid, DHA) derived from ω3 series fatty acids, are constituent fatty acids in mice with the linseed oil containing diet. The phospholipid and triacylglycerols also increased in mouse plasma samples fed with linseed lambda oils rich in ω3 fatty acids.
On the other hand, 80% of the constituent fatty acids of palm oil are composed of 16: 0 (palmitic acid) and 18: 2 (linoleic acid), and when looking at the lipid profile of mouse plasma with palm oil as the diet, phospholipid and triacylglycerol with these two fatty acids as constituents are increased. In addition, the main constituent fatty acid of soybean oil was 18: 2 (linoleic acid), which also increased in the plasma of mice fed with soybean oil.
Because the position information of the double bond position and the acyl group cannot be determined in this method, A and B were annotated by the elution order (Table 6). The "P-" is to indicate that it is a plasmalogen type.
The SWATH Acquisition with MS-DIAL lipid identification workflow introduced here provides identification of structural isomers and binding information of organic elements. Out of 420 matched lipid spectra, 322 unique lipids were identified excluding all duplicates such as those identified in both in positive and negative modes, and those detected with different adduct ions. Quantitation of identified lipid species clearly differentiates the three sample groups with the different dietary oil supplements.