Abstract
This technical note demonstrates the suitability of ZT Scan DIA acquisition on the SCIEX ZenoTOF 8600 system, coupled with MS-DIAL processing, for untargeted metabolomics. With ZT Scan DIA, comprehensive MS2 coverage of MS1 features is achieved, and the resulting deconvoluted spectra are less chimeric than those from either DDA or SWATH, leading to higher putative MS/MS identification rates.
Data-independent acquisition (DIA) has seen limited adoption in metabolomics due to persistent concerns about highly chimeric MS/MS spectra, which impair compound identification. Even with narrow isolation windows, conventional data-dependent acquisition (DDA) workflows often yield co-isolated fragment ions, reducing spectral purity. Established deconvolution approaches, including retention-time-based (RTDec) and database-assisted (DBDec) methods, offer partial mitigation but remain constrained by co-elution complexity, chromatographic variability, and incomplete spectral library coverage, particularly for structurally diverse small molecules.
Here, quadrupole-axis deconvolution (Q1Dec), implemented via ZT Scan DIA acquisition on the ZenoTOF 8600 system, is evaluated as an alternative strategy for untargeted metabolomics. ZT Scan DIA employs a rapidly scanning, sliding isolation window combined with Zeno trapping to reconstruct high-quality, DDA-like MS/MS spectra independent of isolation width [1-3]. Using a Leptospirosis infection model in human vascular endothelial cells, ZT Scan DIA was benchmarked against Top N DDA and variable-window SWATH. Results demonstrate improved spectral clarity, increased identification rates, and enhanced MS/MS coverage, highlighting ZT Scan DIA as a robust solution for reducing chimeric interference and advancing DIA-based metabolomics workflows (Figure 1).
Key features of ZT Scan DIA data Q1 deconvolution
- Quadrupole-axis deconvolution (Q1Dec) with ZT Scan DIA enables precursor-specific fragment assignment by using a sliding isolation window across the Q1 dimension, resulting in cleaner, less chimeric MS/MS spectra.
- Comprehensive MS/MS coverage (>99% of features) is achieved through DIA acquisition that fragments all detectable precursors, which ensures consistent detection of both high- and low-abundance metabolites.
- Multiplexed deconvolution (Q1Dec + RTDec) in MS-DIAL combines orthogonal strategies to resolve both co-isolated and co-eluting species.
Introduction
Similarly to the early days of proteomics, DIA workflows (which often employ wide isolation windows to comprehensively fragment all precursors) have not yet gained widespread traction in metabolomics due to concerns of highly chimeric spectra, and data-dependent analysis (DDA, also known as IDA) workflows with narrow isolation windows (≤1 Da) are still preferred. This is because chimeric spectra, arising from the co-isolation and fragmentation of additional precursor ions alongside the targeted precursor of interest, are particularly concerning in metabolomics, given the vast chemical space of small molecules compared with peptides, which exhibit predictable fragmentation patterns. The contamination of a query MS/MS spectrum with additional fragment ions from unwanted precursors hinders identification via spectral similarity scoring against experimentally obtained reference spectra of pure standards.
Even narrow, unit-resolved DDA isolation windows often produce chimeric spectra. Moreover, the common practice of selecting the top N precursors per cycle with dynamic exclusion in untargeted DDA experiments precludes conventional retention-time deconvolution (RTDec), because no MS2 elution profile is acquired (4-6). RTDec resolves co-eluting interferences by correlating the chromatographic elution profiles (apex, width, and peak shape) of precursor and product ions, as routinely performed in GC-MS, SWATH, AIF, or targeted DDA analyses. However, RTDec has its shortcomings and often fails to correctly correlate common class-specific diagnostic fragment ions produced by many similar analytes to a single precursor ion (7). Furthermore, wide and asymmetric chromatographic peaks, those common in HILIC-based metabolomics experiments, pose additional challenges for precursor and product ion chromatogram correlation.
Database-assisted deconvolution (DBDec) offers an acquisition-independent alternative by reconstructing chimeric query spectra through linear combinations of library reference spectra (8-12). Yet, especially for metabolomics, DBDec remains limited by incomplete library coverage and has not been optimized with wide isolation windows, reducing its effectiveness for novel compounds and wide-window DIA schemes such as SWATH.
First conceptualized over a decade ago (13), quadrupole-axis deconvolution (Q1Dec) has only recently become practically viable for DIA metabolomics (14). Building on early hardware implementations (15), the new ZenoTOF 8600 system’s ZT Scan DIA mode acquires spectra by rapidly scanning a sliding isolation window across the mass range in fine steps while leveraging Zeno trapping to enhance duty cycle and sensitivity. These data can be deconvoluted to yield a Q1 precursor ion selection window as narrow as 0.9 Da. Q1Dec overcomes the hardware limitation of isolating a narrow precursor region because its ability to eliminate unwanted precursors stems from the sliding window's step size, rather than its width. It is also compatible with all the previously mentioned deconvolution strategies, enabling the multiplexed application of RTDec, Q1Dec, and DBDec within modern software such as MS-DIAL and DecoID, yielding cleaner spectra and higher library identification rates than either conventional DDA or SWATH.
In this technical note, we evaluate the suitability of three acquisition modes available on the ZenoTOF 8600 (DDA, SWATH, and ZT Scan DIA) for untargeted metabolomics in human vascular endothelial cells infected with Leptospirosis from three different bacterial strains. Using MS-DIAL for data processing and a variety of spectral databases for identification, we show that ZT Scan DIA outperforms Top 85 DDA and 80 variable-width SWATH methods in terms of putative MS/MS identification rate, MS/MS coverage (in the case of DDA), and MS/MS spectral quality after deconvolution. Through simulations using the exported MS-DIAL data, we also show that approximately 50% of the DDA features with MS/MS spectra are chimeric, and that, after deconvolution, ZT Scan DIA data produces cleaner MS/MS spectra than either method.
Materials and methods
Sample preparation: HMEC-1 cells were plated in 60 mm dishes at a density of 800,000 cells/dish in MCDB 131 medium supplemented with 10% fetal bovine serum, 10 ng/mL epidermal growth factor, 1 µg/mL hydrocortisone, and 10 mM glutamine as per recommendations from the American Tissue Culture Collection (ATCC) and allowed to attach overnight. The following day, the medium was changed and inoculated with either PBS (UI), Leptospira biflexa serovar Patoc strain Patoc-1 (LbP ), Leptospira interrogans serovar Copenhageni strain Fiocruz L1-130 (LiC), or Leptospira interrogans serovar Manilae strain L495 (LiM) at a multiplicity of infection of 20. After 24 h, the medium was removed, the cell layer was washed twice with warm 0.85% saline, and a 2:2:1 acetonitrile: methanol: water extraction solvent was applied. The dish was incubated for 5 minutes on dry ice, then the cell layer was scraped off into 1.5 mL microcentrifuge tubes, vortexed for 30 sec, flash-frozen in liquid nitrogen for 1 min, and stored on dry ice until sample isolations were complete. Afterward, all samples were sonicated for 5 min and subjected to two rounds of vortexing for 30s, flash-freezing in liquid nitrogen for 1 min, and sonication for 5 min. Samples were stored at -20˚C overnight to precipitate proteins. The following day, samples were centrifuged at 18,000 RCF at 4˚C for 10 min, and the supernatant was removed and dried with a SpeedVac Concentrator. Meanwhile, the protein pellet was resuspended in 400 μL 100 mM NaOH and dissolved with four cycles of vortexing for 10 s and heating at 95˚C for 5 min. Protein concentration was measured with a BCA assay, and the dried metabolite pellets were resuspended in a 2:1 acetonitrile: water solution at 1 μL/2.5 μg protein in each sample. The suspension was then subjected to two cycles of 5 min sonication and 1 min vortexing and then incubated at 4˚C overnight. The following day, samples were centrifuged at 18,000 RCF at 4˚C for 10 min, and the supernatant was transferred to glass LC-MS vials for analysis on the ZenoTOF 8600 system.
Chromatography: Prepared samples were resolved by HPLC using a HILICON iHILIC-(P) Classic column (100 x 2.1 mm, 5 µm, 200 Å; P/N 161.122.0520) equipped with a guard column (HILICON iHILIC-(P) Classic Guard, 5 µm, 200 Å, P/N 161.122.0520). The column was maintained at 45 °C. Mobile phase A consisted of 5.0% acetonitrile in water containing 20 mM ammonium bicarbonate, 1% ammonium hydroxide, and 5 µM medronic acid. Mobile phase B was 95% acetonitrile in water with no additives. The gradient conditions are shown in Table 1.
Data processing. All raw data (*.wiff2) were processed in MS-DIAL v5.6 (build 5.6.250820-alpha; hereafter “MS-DIAL 5.6”) by using the parameter set summarized in Table 3. The full processing workflow comprised centroiding, peak detection, adduct annotation, compound identification, and cross-sample alignment. Identification was performed against multiple MS/MS spectral libraries in *.msp format, with parameter settings tuned per library to account for differences in curation level, instrument provenance, and spectral density (Table 3).
For the NIST23 spectral library, an in-house curated *.msp derivative was generated to ensure compatibility with the MS-DIAL scoring framework and to standardize metadata. Specifically, additional compound-level annotations (for example, InChIKey, formula, exact mass) were programmatically populated via the PubChem API, and the library was filtered to retain only entries acquired on high-resolution platforms by using ESI or APCI ionization [16,17]. This step was intended to constrain the search space to spectra most consistent with the acquisition conditions used in this study, while minimizing ambiguity introduced by low-resolution or orthogonal ionization data.
For each processing run, MS-DIAL results were exported at two levels. At the per-sample level, PeakID tables and associated experimental MS/MS spectra (*.msp) were retained. At the study level, the AlignmentID table, which represents globally aligned features across all samples, was exported for downstream filtering and summary analysis.
To systematically evaluate the influence of acquisition strategy and deconvolution, data were processed both independently and in combination. Specifically, three single-mode projects were created (DDA-only, SWATH-only, and ZT Scan DIA-only), followed by an “All-Modes” project in which all acquisitions were jointly aligned to generate a consensus feature list. Within the DIA-based projects (SWATH and ZT Scan DIA), additional processing branches were implemented to isolate the effects of spectral deconvolution. Initial processing was performed with deconvolution disabled (“NoDec”) by unchecking both “Run RT deconvolution” and “Run Q1Dec” in the Spectrum Deconvolution settings (Figure 2). The same datasets were then reprocessed with one or more deconvolution strategies enabled.
For SWATH data, two configurations were considered: NoDec and RT-based deconvolution (RTDec). For ZT Scan DIA 2.0, four configurations were evaluated: NoDec, RTDec, quadrupole-axis deconvolution (Q1Dec), and the combined RTQ1Dec condition. PeakID tables and MS/MS spectra were exported after each processing pass, enabling direct comparison of feature-level outcomes across deconvolution strategies. Alignment tables were exported only for the highest deconvolution level applied to each dataset to avoid redundancy.
Before export, alignment results were filtered to retain only confidently characterized features, defined here as those matched to a reference database and exhibiting an MS-CleanR blank ratio <0.05. Beyond this automated filtering, minimal manual curation was performed.
Consequently, the dataset retains the expected artifacts of large-scale untargeted processing, including duplicated features due to imperfect alignment, partial peak integration, and occasional suboptimal library matches. While these would typically be addressed in a fully optimized metabolomics workflow, they were considered acceptable within the scope of the present benchmarking and simulation-focused analysis.
Data analysis and in silico simulations. All downstream analysis was conducted in Python, with MS-DIAL PeakID tables imported as Pandas DataFrames to enable flexible, feature-level manipulation. Each entry in the PeakID table encodes a discrete chromatographic feature with associated metadata, including retention-time boundaries (left, apex, right), precursor m/z, experimental MS/MS spectrum, peak shape descriptors (Gaussian similarity, ideal slope, symmetry), and annotation details.
Feature counts were derived by summing the row counts across all PeakID tables. From this population, subsets were defined based on data completeness and annotation status. Features with recorded MS/MS spectra were first isolated, and within this subset, library-matched features were identified by using MS-DIAL (annotation tag codes 330 and 430).
To further stratify spectral quality, an entropy-based classification scheme [18,19] was applied following conventions established by the MassBank of North America (MoNA). MS/MS spectra were designated as “clean” when they satisfied dual criteria of raw spectral entropy <3.0 and normalized spectral entropy <0.8, as calculated by using the SpectralEntropy Python package. This step provides an orthogonal quality metric independent of library matching or simulated classification as chimeric or non-chimeric based on PeakID tables and is particularly useful in the context of not only DIA data but also DDA data; spectral chimerism is prevalent in both types of data.
To simulate spectral chimerism and assess the effects of deconvolution, the analysis was restricted to features with observed MS/MS spectra. For each feature A, pairwise comparisons were performed against all other features B within the same sample to evaluate potential co-isolation and resolvability.
Chimerism was defined operationally based on joint m/z and chromatographic overlap criteria. Feature A was classified as chimeric if any feature B exhibited (i) a precursor m/z falling within the isolation window of A and (ii) overlapping chromatographic boundaries (i.e., B_left < A_right and B_right > A_left). This definition captures both direct co-elution and partial temporal overlap scenarios relevant to DIA acquisitions (and potentially DDA acquisitions).
Deconvolution performance was modeled by using three complementary frameworks:
- Database-assisted deconvolution (DBDec), applicable across all acquisition modes, evaluates whether co-isolated features can be separated based on MS/MS dissimilarity. Here, A is considered unresolved from B if the spectral similarity between their experimental MS/MS spectra exceeds 0.8, calculated by using the entropy similarity metric with a 20 ppm tolerance. Notably, the reference “database” in this context is sample-specific and constructed directly from the ensemble of observed spectra in the PeakID table. This captures the challenge of reconstructing a convolved query spectrum as a linear combination of database spectra when the database contains too many candidates with shared peaks.
- Retention-time deconvolution (RTDec), applied to SWATH and ZT Scan DIA 2.0 data, leverages chromatographic orthogonality. Features are considered resolvable when they exhibit sufficient divergence in peak apex (ΔRT ≥5 s), relative peak width (>20% difference), or peak shape (any descriptor differing by >0.2). This reflects the practical heuristics commonly embedded in deconvolution algorithms.
- Quadrupole-axis deconvolution (Q1Dec), unique to ZT Scan DIA 2.0, exploits differences in precursor isolation. Resolution is achieved when the absolute m/z separation between features exceeds the Q1 step size (0.8 Da), which is consistent with the acquisition stepping scheme.
All possible combinations of these deconvolution strategies (for example, RTDec alone, DBDec alone, RTDec+DBDec, Q1Dec alone, etc.) were evaluated. For each combination, the IDs of features that met the resolution criteria were tracked and stored, enabling detailed post hoc inspection and visualization. A summarized results table, capturing counts and classifications across all scenarios, was exported as a CSV for reporting and downstream interpretation.
Results and discussion
Untargeted metabolomics workflows are fundamentally constrained by the realities of gas- and liquid-phase chemistry in the mass spectrometer, where the idealized assumption that a single precursor yields a single clean product-ion spectrum is rarely met in practice. Instead, particularly under high-throughput acquisition modes and in complex biological matrices, multiple co-eluting or near-isobaric precursors are frequently co-isolated and fragmented simultaneously. The resulting product-ion spectra, commonly referred to as chimeric spectra, represent a convolution of fragment ions derived from multiple molecular species.
From a practical standpoint, chimeric spectra are not the exception; they are an intrinsic feature of data-dependent and, even more so, data-independent acquisition strategies. In dense chromatographic space, precursor isolation windows inevitably capture co-eluting species with overlapping m/z values. This co-isolation leads to multiplexed fragmentation events, producing spectra that no longer map cleanly to a single chemical entity. While such multiplexing can increase acquisition efficiency, it introduces ambiguity in spectral interpretation, which directly affects downstream identification confidence.
The consequences of spectral convolution are multifaceted. First, the presence of fragment ions originating from multiple precursors can obscure or distort the characteristic fragmentation patterns required for confident library matching. Diagnostic ions may be suppressed, shifted in relative intensity, or masked entirely by overlapping fragments from co-isolated species. Second, the inclusion of unrelated fragment ions can artificially inflate spectral similarity scores against incorrect library entries, increasing the likelihood of false-positive identifications. This effect is particularly pronounced in large-scale spectral library searches, where even modest overlap in fragment ion space can yield misleading matches when the underlying spectra are not pure.
Addressing the impact of chimeric spectra is therefore a critical step toward improving confidence in untargeted workflows. Recent advances in acquisition strategies, such as ZT Scan DIA, and in computational deconvolution approaches have begun to mitigate these challenges by either reducing the frequency of precursor co-isolation or disentangling mixed spectra after acquisition. In the following sections, we evaluate how effectively these strategies improve spectral purity and, consequently, the reliability of compound identification in complex metabolomic samples.
DIA-based workflows generate comprehensive MS/MS coverage of the metabolome
The ultra-fast acquisition rate of the ZenoTOF 8600 system, combined with the sensitivity gains of the Zeno trap, enables a 5 ms dwell time for MS/MS spectrum acquisition, allowing 85 precursors per cycle to be acquired in a 1s duty cycle in DDA, which results in ~50% of features in MS-DIAL having an associated MS/MS spectrum. Despite leveraging the improved scan speed, DDA still cannot achieve the MS/MS acquisition rates possible with SWATH or ZT Scan DIA methods – both of which obtained MS/MS spectra for >99% of features (Table 4). As expected, the SWATH DIA and ZT Scan DIA data generated raw MS/MS spectra that are almost entirely chimeric. However, the DDA data also showed a high level of chimeric interference.
MS/MS spectra acquired with DDA have significant chimeric interference
Simulation using DDA PeakID tables shows that >50% of detected features (A) have at least one co-eluting feature (B) with an m/z value within the precursor isolation window (Table 4), demonstrating that DDA spectra are frequently chimeric, consistent with previous observations [5]. Although DDA is considered the benchmark for generating high-quality MS/MS spectra for database matching, substantial contamination persists due to the finite quadrupole isolation width (~1.0 Da on SCIEX systems). Narrowing this window is the only hardware-based approach to reduce chimericity, but it does not eliminate it and comes at the cost of reduced precursor transmission efficiency.
Compared to DIA, DDA exhibits lower levels of chimeric interference; however, it results in reduced MS/MS coverage and the inability to perform robust MS/MS-level quantitation. This is particularly true for low-abundance features. These limitations arise from the stochastic nature of precursor selection and dynamic exclusion, which also constrain data-driven deconvolution approaches.
Given that MS/MS contamination is largely unavoidable across acquisition strategies, computational deconvolution is required. ZT Scan DIA enables a complementary approach by fragmenting each precursor multiple times as the quadrupole isolation window steps across the m/z space. This produces a characteristic intensity distribution across sequential MS/MS scans, allowing fragment ions to be associated with their precursors along the Q1 dimension. In MS-DIAL, this strategy (Q1Dec) can be combined with retention time–based deconvolution, thereby improving spectral purity and coverage for downstream identification.
Q1Dec: A novel deconvolution approach based on a dynamic quadrupole with ZT Scan DIA acquisition
In ZT Scan DIA acquisition, the quadrupole mass filter moves a precursor ion transmission window across the full precursor m/z range in fine, overlapping steps that are ~1/5 the width of the isolation window. As the window slides, a given precursor ion gradually enters the transmission window, reaches maximum transmission when centered, and then exits. Consequently, the intensities of its fragment ions are recorded across multiple consecutive MS/MS scans, each corresponding to a slightly different Q1 position [1-2].
When the intensity of a specific fragment ion is plotted against the center position of the isolation window (i.e., along the Q1 axis), it produces a characteristic triangular envelope whose apex corresponds directly to the true precursor m/z (Figure 3). Because of this relationship, fragment ions from co-isolated precursors with different m/z values can be assigned to their respective precursors by using a process called quadrupole-axis deconvolution (Q1Dec). Q1Dec exploits these intensity profiles to trace each fragment ion back to its originating precursor, even when multiple precursors are co-isolated in the same window or co-elute chromatographically, thereby deconvoluting chimeric signals and yielding cleaner, more specific MS/MS spectra for confident identification and quantitation.
Deconvolved ZT Scan DIA spectra show significantly less chimeric interference
The value of Q1Dec ZT Scan DIA spectral interpretation is significant. Chimeric spectra complicate the use of orthogonal annotation strategies, such as retention-time alignment. Algorithms that assume a one-to-one correspondence between precursor and product ions can misattribute fragments, leading to incorrect structural hypotheses. In lipidomics and metabolomics contexts—where structural isomers are prevalent, and differentiation relies on subtle fragmentation signatures—this ambiguity can propagate through annotation pipelines, ultimately reducing the accuracy and reproducibility of reported identifications. However, Q1Dec does not preclude additional deconvolution approaches, such as those performed along the retention-time axis, and multiplexing these approaches is complementary. To illustrate this point, we computed the spectral entropy and normalized spectral entropy for every spectrum in the “NoDec”, “RTDec”, “Q1Dec”, and “RTQ1Dec” sets of PeakID tables exported from the “ZTScan-only” MS-DIAL project and plotted the distribution of entropies for each set of spectra (Figure 4).
The raw entropy distributions (panel a) plot the density of each feature’s spectral entropy from the ZT Scan DIA PeakID spectra obtained from the NoDec, Q1Dec, RTDec, and RTQ1Dec data sets. Panel b shows the density of the normalized spectral entropy, computed using the same features and normalized by the number of peaks in the spectrum, so it ranges from 0 to 1. To classify a feature as clean or noisy, consider spectral entropy, raw entropy, and normalized entropy: a “clean” spectrum is one with minimal noise and is characterized by spectral entropy < 3 for raw entropy and < 0.8 for normalized entropy, as suggested by the software. As shown in both panels, ZT Scan MS/MS data without deconvolution have ~45% of spectra classified as clean. Q1Dec and RTDec, on their own, increased the percentage of clean spectra to ~58%. The combination of Q1Dec and RTDec increases this percentage to ~66% clean.
To highlight the combined power of comprehensive MS/MS coverage and deconvolution, we summarize and report the results of our entropy calculations and in silico deconvolution simulation as a percentage of the total detected features in the sample’s PeakID table from the “All-Modes” project in Table 5. Deconvoluted ZT Scan DIA spectra are cleaner than those derived from DDA (~71% vs. ~ 32%) and are ~20% cleaner than SWATH data. In aggregate, these results demonstrate the utility of multiplexing deconvolution strategies to improve data quality and, consequently, reference-matching of data to metabolites. Both RTDec and Q1Dec alone improve the percentage of features with clean MS/MS spectra, but combining the approaches yields the highest percentage of clean spectra.
More IDs with higher confidence by using ZT Scan DIA
Quantifying spectral purity from a theoretical perspective, as demonstrated by spectral entropy calculations, is a useful way to understand how deconvolution affects spectral clarity. However, the impact of deconvolution on improving identification rates and boosting confidence in a metabolomics experiment is paramount. Chimeric spectra are problematic only when they interfere with spectral matching and, ultimately, high-quality identification. In these experiments, the raw MS/MS spectra acquired with ZT Scan DIA are nearly 100% chimeric. However, the results demonstrate that the highest number of library matches is obtained when ZT Scan DIA spectra are deconvolved using Q1Dec and RTDec. Not only is the number of matches higher, but the quality of the match scores is also higher (Table 6).
To illustrate the MS-DIAL deconvolution process and the resulting spectral decontamination, screen captures are shown in Figures 5 and 6. Figure 5 shows two fundamental observations. First, the upper panels present the spectrum for Glu-Asp acquired by DDA (upper left) and ZT Scan DIA (upper right) for comparison. The purpose of the figure is not to detail the individual fragment masses but rather to present an overall assessment of the complexity of the MS/MS spectra. Interestingly, the DDA MS/MS spectrum is the most complex and exhibits the highest noise, likely due to non-optimal precursor selection for fragmentation. Comparatively, the ZT Scan DIA spectrum, while still chimeric, is less complex. In the lower panels, the effects of MS-DIAL-facilitated Q1 deconvolution (lower left) and multiplexed Q1 and RT deconvolution (lower right) on the spectral complexity are shown. The combination of Q1Dec and RTDec clearly demonstrates MS-DIAL's ability to “clean” the MS/MS spectrum and enable a high-quality reference match to Glu-Asp. Interestingly, the initial analysis of DDA data using MS-DIAL did not identify Glu-Asp, but because it was the same feature from the same sample, we manually adjusted the library match to ensure consistency.
Figure 6 shows the composite XIC for Glu-Asp, with the legend for the raw XIC on the left and the Q1 and RT deconvoluted data on the right. Although the data are overlaid, it is readily apparent visually which XIC fragments are associated with Glu-Asp at 8.87 min, and which fragments are from matrix interference and/or chimeric fragments.
A final example of how deconvolution of ZT Scan DIA data improves spectral quality compared to other scan modes is shown in Figure 7, which presents the processed 3-phosphoglycerate spectra obtained in DDA, SWATH, and ZT Scan from the same sample and compares them to the reference library spectrum from the NIST23 database. The similarity of the acquired data (panels a-c) was compared to the reference spectrum from the NIST23 library (d). Cosine similarity (dot product) is a widely applied metric for evaluating the alignment between two vectors, such as spectral profiles or molecular feature embeddings, by measuring the cosine of the angle between them independent of magnitude. Values approaching 1 indicate high similarity, while values near 0 or negative denote weak or dissimilar relationships. This metric, part of the overall quality score in MS-DIAL, provides a robust means of assessing similarity in high-dimensional analytical datasets.
Conclusions
ZT Scan DIA on the ZenoTOF 8600 system, combined with advanced deconvolution strategies in MS-DIAL, represents a substantive advance for untargeted metabolomics workflows. The data presented here clearly demonstrate that conventional assumptions regarding DDA superiority for spectral quality are no longer valid under modern acquisition and processing paradigms.
Although DDA remains widely used due to its perceived spectral selectivity, this study confirms that a significant proportion of DDA MS/MS spectra (>50%) are inherently chimeric, even when narrow isolation windows and high-Top N methods are employed. This intrinsic limitation reduces identification rates, reduces reproducibility, and limits MS/MS coverage to approximately half of detected features, even at ultra-fast acquisition rates.
In contrast, ZT Scan DIA enables near-complete MS/MS coverage (>99% of features) while leveraging quadrupole-axis deconvolution to resolve co-isolated precursors at sub-Da resolution, independent of physical isolation width. When combined with retention-time–based deconvolution, this approach delivers a marked improvement in spectral purity, producing approximately twice as many “clean” spectra as DDA and outperforming conventional SWATH DIA workflows.
Critically, these improvements in spectral clarity translate directly into higher identification rates and more confident library matches (Table 6). ZT Scan DIA datasets processed with combined Q1Dec and RTDec not only yield the highest number of annotations but also exhibit superior spectral similarity scores, underscoring the practical impact of reducing chimeric spectra on downstream metabolite identification.
These findings highlight several key conclusions:
- Chimeric interference is pervasive across all acquisition modes and represents a primary barrier to confident metabolite identification.
- Even with high-coverage (top 85 precursors) DDA experiments at unit resolution, ID rate is reduced due to a large fraction of chimeric spectra that cannot be cleaned.
- Modern, open-source software tools with built-in deconvolution (such as MS-DIAL) facilitate DIA data.
- Dynamic isolation windows schemes outperform narrow, static windows.
- Hardware-enabled innovations such as ZT Scan DIA, when paired with orthogonal deconvolution strategies, effectively address the challenge of chimeric interference.
- Comprehensive MS/MS coverage combined with post-acquisition deconvolution is more advantageous than selective acquisition alone.
Overall, ZT Scan DIA establishes a new performance benchmark for DIA-based metabolomics by unifying high coverage, improved spectral purity, and enhanced identification confidence. This workflow provides a scalable and robust solution for complex biological studies, enabling deeper and more reliable interrogation of the metabolome.
References
- Zeno trap: Defining new levels of sensitivity. (2021) SCIEX white paper, RUO-MKT-19-13373-B. https://sciex.com/content/dam/SCIEX/pdf/brochures/zenotrap-whitepaper.pdf
- Sayers, R, Tran, K, and Walsh, N. Continuing the data independent acquisition (r)evolution: Introducing ZT Scan DIA for quantitative proteomics. SCIEX white paper MKT-31731-A. https://sciex.com/content/dam/SCIEX/pdf/tech notes/technology/ztscan-dia-introduction.pdf
- PRS, Proos, R, Ozbalci, C, Hecht, M, Chiapparino, A, Rupasinga, T, Sayers, R, and Causon, J. [2026] Untargeted, quantitative metabolomics analysis on the ZenoTOF 8600 system using the novel ZT Scan DIA 2.0 workflow. SCIEX white paper MKT-37675-B. https://sciex.com/technotes/life-science-research/metabolomics/untargeted-quantitative-metabolomics-analysis-on-zenotof8600
- Stein, S. E. An Integrated Method for Spectrum Extraction and Compound Identification from Gas Chromatography/Mass Spectrometry Data. J. Am. Soc. Mass Spectrom. 1999, 10 (8), 770–781. https://doi.org/10.1016/S1044-0305(99)00047-1
- Tsugawa, H.; Cajka, T.; Kind, T.; Ma, Y.; Higgins, B.; Ikeda, K.; Kanazawa, M.; Vandergheynst, J.; Fiehn, O.; Arita, M. MS-DIAL: Data-Independent MS/MS Deconvolution for Comprehensive Metabolome Analysis. Nature Methods 2015 12:6 2015, 12 (6), 523–526. https://doi.org/10.1038/nmeth.3393
- Yin, Y.; Wang, R.; Cai, Y.; Wang, Z.; Zhu, Z. J. DecoMetDIA: Deconvolution of Multiplexed MS/MS Spectra for Metabolite Identification in SWATH-MS-Based Untargeted Metabolomics. Anal. Chem. 2019, 91 (18), 11897–11904. https://doi.org/10.1021/ACS.ANALCHEM.9B02655
- Tada, I.; Chaleckis, R.; Tsugawa, H.; Meister, I.; Zhang, P.; Lazarinis, N.; Dahlén, B.; Wheelock, C. E.; Arita, M. Correlation-Based Deconvolution (CorrDec) To Generate High-Quality MS2 Spectra from Data-Independent Acquisition in Multisample Studies. Anal. Chem. 2020, 92 (16), 11310–11317. https://doi.org/10.1021/ACS.ANALCHEM.0C01980
- Stancliffe, E.; Schwaiger-Haber, M.; Sindelar, M.; Patti, G. J. DecoID Improves Identification Rates in Metabolomics through Database-Assisted MS/MS Deconvolution. Nature Methods 2021 18:7 2021, 18 (7), 779–787. https://doi.org/10.1038/s41592-021-01195-3
- Stancliffe, E.; Gandhi, M.; Guzior, D. V; Mehta, A.; Acharya, S.; Richardson, A.; Cho, K.; Cohen, T.; Patti, G. J. MassID Provides near Complete Annotation of Metabolomics Data with Identification Probabilities. bioRxiv 2026, 2026.02.11.704864. https://doi.org/10.64898/2026.02.11.704864
- Röst, H. L.; Rosenberger, G.; Navarro, P.; Gillet, L.; Miladinoviä, S. M.; Schubert, O. T.; Wolski, W.; Collins, B. C.; Malmström, J.; Malmström, L.; Aebersold, R. OpenSWATH Enables Automated, Targeted Analysis of Data-Independent Acquisition MS Data. Nature Biotechnology 2014 32:3 2014, 32 (3), 219–223. https://doi.org/10.1038/nbt.2841
- Peckner, R.; Myers, S. A.; Jacome, A. S. V.; Egertson, J. D.; Abelin, J. G.; MacCoss, M. J.; Carr, S. A.; Jaffe, J. D. Specter: Linear Deconvolution for Targeted Analysis of Data-Independent Acquisition Mass Spectrometry Proteomics. Nature Methods 2018 15:5 2018, 15 (5), 371–378. https://doi.org/10.1038/nmeth.4643
- Li, H.; Cai, Y.; Guo, Y.; Chen, F.; Zhu, Z. J. MetDIA: Targeted Metabolite Extraction of Multiplexed MS/MS Spectra Generated by Data-Independent Acquisition. Anal. Chem. 2016, 88 (17), 8757–8764. https://doi.org/10.1021/ACS.ANALCHEM.6B02122
- Nikolskiy, I.; Mahieu, N. G.; Chen, Y. J.; Tautenhahn, R.; Patti, G. J. An Untargeted Metabolomic Workflow to Improve Structural Characterization of Metabolites. Anal. Chem. 2013, 85 (16), 7713–7719. https://doi.org/10.1021/ac400751j
- Matsuzawa, Y.; Tokiyoshi, K.; Buyantogtokh, B.; Oka, T.; Causon, J.; Yamamoto, R.; Takeuchi, M.; Takeda, U.; Takahashi, M.; Hasegawa, M.; Ivosev, G.; Cox, D.; Baker, P. R.; Chelur, A.; Bloomfield, N.; Miyamoto, J.; Harayama, T.; Deng, L.; Tsugawa, H. Retrospective Metabolomics via Dual-Dimensional Deconvolution Using ZT Scan DIA 2.0. bioRxiv 2025, 2025.08.20.671307. https://doi.org/10.1101/2025.08.20.671307
- Messner, C. B.; Demichev, V.; Bloomfield, N.; Yu, J. S. L.; White, M.; Kreidl, M.; Egger, A. S.; Freiwald, A.; Ivosev, G.; Wasim, F.; Zelezniak, A.; Jürgens, L.; Suttorp, N.; Sander, L. E.; Kurth, F.; Lilley, K. S.; Mülleder, M.; Tate, S.; Ralser, M. Ultra-Fast Proteomics with Scanning SWATH. Nat. Biotechnol. 2021, 39 (7), 846–854. https://doi.org/10.1038/S41587-021-00860-4;SUBJMETA=45,475,61,631;KWRD=PROTEOMICS
- Kim, S.; Thiessen, P. A.; Bolton, E. E. Programmatic Retrieval of Small Molecule Information from PubChem Using PUGREST. Methods in Pharmacology and Toxicology 2018, 1–24. https://doi.org/10.1007/7653_2018_30
- Kim, S.; Thiessen, P. A.; Bolton, E. E.; Bryant, S. H. PUG-SOAP and PUG-REST: Web Services for Programmatic Access to Chemical Information in PubChem. Nucleic Acids Res. 2015, 43 (W1), W605–W611. https://doi.org/10.1093/nar/gkv396
- Li, Y.; Kind, T.; Folz, J.; Vaniya, A.; Mehta, S. S.; Fiehn, O. Spectral Entropy Outperforms MS/MS Dot Product Similarity for Small-Molecule Compound Identification. Nature Methods 2021 18:12 2021, 18 (12), 1524–1531. https://doi.org/10.1038/s41592-021-01331-z
- Spectral Entropy and Normalized Entropy in MoNA. MoNA - Mass Bank of North America. https://mona.fiehnlab.ucdavis.edu/documentation/entropy