Using the ZenoTOF 7600 system
Ihor Batruch1, Yves Le Blanc1, Jose-Castro Perez2, and Christie Hunter2
1SCIEX, Canada; 2SCIEX, USA
Analyzing large sample cohorts is critical when performing translational or biomarker research. To meet the sample introduction demands of high-throughput proteomics, the Evosep One system (Evosep) has been developed a standardized platform with preset methods. But faster chromatography quickly overwhelms traditional data dependent acquisition (DDA). Here using Zeno SWATH DIA – a data independent acquisition strategy, the combination of the Evosep One system and the ZenoTOF 7600 system was investigated for fast quantitative proteomics.
Identifying and quantifying large numbers of proteins and peptides are important in translational research to understand biological functions. One example of this is the analysis of large sample cohorts to identify robust biomarkers of disease. The complexity of biological fluids, tissues and cell lines often overwhelms the capabilities of data-dependent acquisition (DDA) strategies using mass spectrometry due to the stochastic nature of this methodology. This limitation has driven the rapid adoption of data-independent acquisition (DIA or SWATH DIA) approaches, such as Zeno SWATH DIA. This method achieves 5-6x more MS/MS sensitivity,1 resulting in this DIA approach surpassing DDA for protein identifications and quantification in complex matrices. This advantage is further observed at short acquisition times that are required to meet the throughput demands of current proteomics studies.2,3
To meet the demands of high-throughput proteomics, the Evosep One system (Evosep) provides standardized liquid chromatography with preset methods and disposable trap columns. The system is optimized such that the user selects the number of samples to run per day (SPD) to determine the column, flow rates and run times necessary for analysis. The system is highly reproducible, robust and offers high MS utilization. These system features are ideal for applications in high-throughput laboratories.
Here, the combination of the Evosep One system and the ZenoTOF 7600 system was investigated for fast quantitative proteomics. Zeno SWATH DIA was used to investigate the numbers of proteins that can be detected and robustly quantified at varying SPD throughputs (Figure 1).
Sample preparation: Commercial human cell line tryptic digest (HeLa, Thermo Fisher Scientific) was reconstituted with 95% buffer A (water with 0.1% formic acid) and 5% buffer B (acetonitrile with 0.1% formic acid) to a working concentration of 0.1 µg/µL. Working stock solution was further diluted to 1.25 ng/µL for the 25 ng load, 2.5 ng/µL for the 50 ng load, 10 ng/µL for the 200 ng load and 25 ng/µL for the 500 ng load. A 20 µL sample of each concentration was loaded into Evotips for sample injection.
Chromatography: The Evosep One system (Evosep, Denmark) was operated using the preset SPD configurations, as described in Table 1. Flow rates and columns were adjusted based on vendor specifications. Examples of chromatography for the 50 ng load are shown in Figure 2. Control of the Evosep One system was fully integrated with SCIEX OS software, such that a single batch could be used to control both LC and MS.
Mass spectrometry: The ZenoTOF 7600 system was operated in Zeno SWATH DIA mode using the OptiFlow Turbo V ion source. The 200 SPD, 100 SPD and 60 SPD throughput methods were run in the microflow configuration, and the 30 SPD method was run in the nanoflow configuration, as described in Table 2. Analysis was performed in triplicate for each tested condition.
Data processing: Zeno SWATH DIA data were processed using DIA-NN software beta version 1.8.1.3,4 Data were analyzed using an experimental library generated from the pH fractionation of 2 human cell lines,6 a library-free approach using a library generated in silico from a human FASTA file7 or the Pan Human Library.8 Pg.matrix.tsv and pr.matrix.tsv reports were used to report the number of protein groups and peptide precursors identified at <1% FDR and the number of proteins and peptide precursors quantified at <20% CV.
Digested HeLa cell lysate was used to characterize the various SPD workflows using Zeno SWATH DIA on the ZenoTOF 7600 system. For these experiments, 200 and 50 ng sample loads were used (Figures 1 and 3, respectively). This characterization permits users to select the workflows and required sample amounts that best match the throughput and depth of proteome coverage for their experimental needs.
For both loading amounts, as the SPD was reduced and the sample analysis time increased, the number of proteins identified and quantified increased (Figure 3). When the throughput was reduced from 200 to 30 SPD, 102% and 82% more proteins were identified at <1% FDR at 50 and 200 ng loads, respectively. More importantly, the number of proteins quantified at <20% CV also greatly increased, with 77% and 80% more quantified at 50 and 200 ng loads, respectively.
For a 50 ng load of HeLa digest (Figure 3), the 30 SPD workflow provided maximal proteome coverage. With Zeno SWATH DIA, 5727 protein groups were detected, 4232 of which were quantified at <20% CV (Figure 3). With this load and SPD workflow, 74% of detected proteins were reliably quantified.
With a higher load, run times were easily accelerated. Using the 100 SPD workflow and a sample load of 200 ng of HeLa digest (Figure 1), 5499 protein groups were detected and 4870 (88%) were reproducibly quantified with <20% CV. This method therefore provided both proteome coverage and high sample throughput. When the throughput was reduced to 30 SPD and 200 ng of HeLa digest was loaded, 7014 protein groups were detected, 6189 (88%) of which were quantified with <20% CV.
An emerging workflow powered by improvements in software algorithms is the library-free approach for processing DIA data. Here, DIA-NN software was used to convert a FASTA file in silico to a spectral library to use for data processing. The Zeno SWATH DIA data from these experiments were processed with both the in silico library and a spectral library generated from experimental ZenoTOF 7600 system data.6 These results were also compared with those generated by processing the data using the Pan Human library (Figure 4,top).7
Analysis of the data generated with a sample load of 200 ng HeLa digest on column revealed good performance with the library-free approach. Analysis using the in silico generated library provided protein identification results similar to those yielded by analysis with the 2 experimentally generated libraries (Figure 4). At 30 SPD, 6417 proteins were identified at <1% FDR using the library-free approach, whereas 7014 proteins were identified with the ZenoTOF 7600 library.
A key advantage of the Zeno SWATH DIA approach for protein identification is the ability to also obtain quantification information from the Zeno MS/MS data, as shown in Figure 4. Typically, 90% of the proteins identified were also quantified with <20% CV when 200 ng of sample was loaded for all throughput workflows.
The library-free approach is a very effective search strategy, as it enables the identification and quantification of proteins without the prerequisite of generating sample and cohort-specific libraries.
Finally, the impact of SPD and load on the numbers of peptide precursors quantified was examined (Figure 5). For a 200 ng load, ~15,000 precursors were quantified at 200 SPD and ~42,000 at 30 SPD (Figure 5). The most significant improvement occurred when reducing throughput from 100 to 60 SPD, as identifications increased 43%.
In terms of SPD throughput and different proteomic loads, lower loads favored higher SPDs. A throughput of 60 SPD offered balance between performance in detection and quantification. Lower SPD throughputs and higher loads led to increases in protein detection and quantification, as expected. The protein and peptide data presented here can be used as guidance to select appropriate sample loads and required throughputs to meet future study needs.
The ZenoTOF 7600 system using Zeno SWATH DIA provides excellent depth of proteome coverage with high quantitative reproducibility, due to the increase in MS/MS sensitivity with the Zeno trap enabled. Here, Zeno SWATH DIA was coupled with the Evosep One system to demonstrate compatibility with this system and to highlight various workflows for high-throughput, quantitative proteomics applications.