Compound Discoverer User Guide LC Studies XCALI98478 en
Compound Discoverer User Guide LC Studies XCALI98478 en
Compound Discoverer
User Guide for LC Studies
Software Version 3.3 SP2
Compound Discoverer, Exactive, Foundation, FreeStyle, Mass Frontier, Metabolika, mzCloud, mzVault,
Normalized Collision Energy, Orbitrap ID-X, Q Exactive, ToxID, and TraceFinder are trademarks; AcquireX,
Orbitrap, Orbitrap Fusion, Orbitrap Lumos, Thermo Scientific, and Xcalibur are registered trademarks of
Thermo Fisher Scientific Inc. in the United States.
The following are registered trademarks in the United States and other countries: Adobe, Flash, and Reader are
registered trademarks of Adobe Systems Incorporated. Microsoft, Internet Explorer, Visual C++, Windows,
and Excel are registered trademarks of Microsoft Corporation. Xeon is a registered trademark of Intel
Corporation. ActiveReports is a registered trademark of GrapeCity, Inc.
All other trademarks are the property of Thermo Fisher Scientific Inc. and its subsidiaries.
Thermo Fisher Scientific Inc. provides this document to its customers with a product purchase to use in the
product operation. This document is copyright protected and any reproduction of the whole or any part of this
document is strictly prohibited, except with the written authorization of Thermo Fisher Scientific Inc.
The contents of this document are subject to change without notice. All technical information in this
document is for reference purposes only. System configurations and specifications in this document supersede
all previous information received by the purchaser.
This document is not part of any sales contract between Thermo Fisher Scientific Inc. and a purchaser. This
document shall in no way govern or modify any Terms and Conditions of Sale, which Terms and Conditions of
Sale shall govern all conflicting information between the two documents.
Contents
Preface . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .20
Access the documentation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
View the user guides and tutorials . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
Download the user documentation for any Thermo Scientific product . . . . . 22
System requirements . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
Check the computer specifications. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
Check the format setting for region and language . . . . . . . . . . . . . . . . . . . . . 25
Installation instructions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
Install the Compound Discoverer application . . . . . . . . . . . . . . . . . . . . . . . . 26
Install the mzCloud offline libraries. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
Install the mzVault 2.3 SP1 application. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28
NIST libraries for GC EI Orbitrap data . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28
NIST 2020 MSMS library for LC/MS/MS data . . . . . . . . . . . . . . . . . . . . . . 29
Special notices. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
Create and submit a bug report. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
Contact us . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
Chapter 3 Create a new study and an analysis by using the wizard . . . . . . . . . . . . . . . . . . .97
Available sample types . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 98
Directory structure for Compound Discoverer studies. . . . . . . . . . . . . . . . . . . . 98
Create a study template file that contains all the study information . . . . . . . . . 100
Manually create a template that includes all the study information . . . . . . . 100
Edit a spreadsheet for use as a study template . . . . . . . . . . . . . . . . . . . . . . . 101
Start the New Study and Analysis Wizard . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102
Define the study type, name the study, and optionally select a study
template and a processing workflow. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102
Select the study type . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 104
Name the new study, select the top-level studies folder, and optionally
select a study template . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 104
Select a processing workflow for the analysis . . . . . . . . . . . . . . . . . . . . . . . . 106
Add input files to the new study . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 107
Characterize the new input files. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 109
Select the delimiters for parsing the file names. . . . . . . . . . . . . . . . . . . . . . . 111
Add or edit the study factors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 111
Add categorical study factors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 112
Add numeric study factors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 113
Add biological replicate study factors . . . . . . . . . . . . . . . . . . . . . . . . . . . . 114
Delete study factors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 115
Duplicate study factors. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 115
Edit study factors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 116
Automatically assign the study factor values. . . . . . . . . . . . . . . . . . . . . . . . . 116
Manually select the study factor values . . . . . . . . . . . . . . . . . . . . . . . . . . . . 118
Reset the sample assignments. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 119
Select the sample types. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 119
Extract study factor values from the file names of the input files . . . . . . . . . . . 120
Set up the sample groups and ratios for a new analysis. . . . . . . . . . . . . . . . . . . 127
Set up the sample groups for an analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . 128
Select the study variables for the sample groups . . . . . . . . . . . . . . . . . . . . 128
Review the generated sample groups and fix any assignment errors. . . . . . 129
Change the hierarchy of the study variables . . . . . . . . . . . . . . . . . . . . . . . 130
Change the sort order of the study variables . . . . . . . . . . . . . . . . . . . . . . . 131
Set up the group ratios. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 132
Set up the group ratios individually . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 133
Set up the group ratios by using the bulk ratio generator . . . . . . . . . . . . . 133
Prepare to submit the analysis to the job queue . . . . . . . . . . . . . . . . . . . . . . . . 135
Chapter 16 Explore compound relations with the molecular networks viewer . . . . . . . . .754
Overview of using the molecular networks feature . . . . . . . . . . . . . . . . . . . . . . 754
How the Generate Molecular Networks node works . . . . . . . . . . . . . . . . . . . . 755
Information displayed in the Similar Compounds table. . . . . . . . . . . . . . . . . . 757
Send compounds to the molecular networks viewer. . . . . . . . . . . . . . . . . . . . . 758
Mark selected compounds in the main compounds table. . . . . . . . . . . . . . . . . 760
Modify the simulation in the molecular networks viewer. . . . . . . . . . . . . . . . . 762
Move, Seek, and Selection modes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 763
Move mode . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 763
Seek mode . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 763
Selection mode. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 766
Use the Isolation mode to display specific clusters . . . . . . . . . . . . . . . . . . . . 767
Use the Toggle Backbone tool to display the backbone of a cluster . . . . . . . 767
Color-coded nodes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 769
Change the node style to display a pie chart or a structure . . . . . . . . . . . . . . 771
Size the nodes by peak area or MW of a compound. . . . . . . . . . . . . . . . . . . 772
Colorize a link by its score, coverage, or number of fragments . . . . . . . . . . . 772
Add directional arrows to the links . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 773
Interactive functions performed by using the mouse pointer . . . . . . . . . . . . 774
Molecular networks viewer toolbar . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 774
Panes at the left of the molecular networks viewer . . . . . . . . . . . . . . . . . . . . . . 775
Search pane . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 776
Graph Info pane . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 776
Isolation pane . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 777
Filters pane . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 777
Thresholds pane . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 778
Clusters pane . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 779
Node Style pane . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 780
Link Style pane . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 781
Pane at the right of the molecular networks viewer . . . . . . . . . . . . . . . . . . . . . 781
Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .792
Preface
This guide describes how to use the Compound Discoverer™ application to qualitatively
process RAW data files with a targeted or untargeted workflow. A targeted workflow evaluates
the mass spectral data for the presence of specific compounds. An untargeted workflow
evaluates the mass spectral data, predicts the elemental composition of unknown compounds,
and searches mass spectral databases to identify these compounds.
To familiarize yourself with the Compound Discoverer application, follow the tutorials that
are available from the application Help menu.
Contents
• Access the documentation
• System requirements
• Installation instructions
• Special notices
• Create and submit a bug report
• Contact us
The Compound Discoverer application also includes a context-sensitive Help system, which
means that you can access content-specific Help for each page or dialog box in the user
interface by pressing the F1 key (or equivalent keys) on your computer keyboard. You can also
open the Help system to the Welcome page by choosing Help > Compound Discoverer Help
from the application menu bar.
1. Go to thermofisher.com.
2. Point to Support, and then click Manuals under Product Documentation on the left.
3. On the Documents & Support page, type the product name in the search box, and then
click the Search icon.
Search box
4. In the results list, click the title to open the document in your web browser, save it, or
print it.
System requirements
The Compound Discoverer 3.3 application can process data files produced by high-resolution
accurate-mass (HRAM) Thermo Scientific™ mass spectrometers, such as the Orbitrap
Fusion™, Q Exactive™, and Exactive™.
Table 1 lists the hardware and software requirements for the processing computer.
Table 1. Hardware and software requirements for the processing computer
System Minimum requirements
Hardware • 3.4 GHz dual-core processor
• 16 GB RAM
• 500 GB hard drive
• USB port
• Display monitor resolution of 1920 × 1080
Software • Microsoft Windows 10 64-bit operating system or Microsoft
Windows 11 64-bit operating system
• Microsoft .NET Framework 4.8
• Microsoft Office 2013
• PDF reader
System settings • To run processing workflows with online mass spectral database
searches, the computer must have unblocked access to the mass
spectral databases on the Internet.
• The computer must have the correct time and date settings and be
synchronized with Internet time.
• The Region and Language setting for the operating system must
be set to English (United States).
Table 2 lists the recommended hardware configurations for enhanced performance using the
Compound Discoverer application.
Table 2. Recommended hardware configurations for enhanced performance
System Recommended configurations
Hardware • Dual 8-core processor (for example, 2x Intel™ Xeon™ Gold 6134
CPU @ 3.20 GHz)
• 64 GB RAM
• 1 TB SSD (solid-state disk) hard drive for OS
• 2nd 3 TB (conventional disk) hard drive for data storage
• USB port
• Two 27 in. UHD monitors: Display monitor resolution of
3840 × 2160 pixels
To check the access to the mass spectral databases, the time and date settings, and the Internet
time, see Chapter 17, “Test communication to the online databases.”
To verify that the system meets the minimum requirements, see these topics:
• Check the computer specifications
• Check the format setting for region and language
1. From the Windows Explorer directory, right-click OSDisk (Drive:) (the directory for the
hard drive where the operating system is installed) and choose Properties.
The OSDisk (Drive:) Properties dialog box opens. This dialog box lists the file system
(NTFS or FAT) and the free disk drive space.
2. In the Windows search box, type System. Then, click System Information.
The System Information page opens. This page lists the operating system; the processor
type, speed, and number of cores; the installed RAM; and the system type (32-bit or
64-bit).
Installation instructions
Thermo Compound Discoverer is a licensed application. After you install the application, you
can use it for up to 60 days without activating the software license.
After you order the Compound Discoverer 3.3 application, you will receive a software media
kit that includes a key-shaped USB flash drive with the installation executable. In addition,
you will receive an email from Thermo MS Licensing providing you with the information
that you need to activate the software license.
When you are upgrading the software from a previous version of the application, you can find
the software installer and a license on the LSMS Software Download and Licensing Portal.
Note The following versions of the Compound Discoverer application can coexist on the
same computer: 1.0, 2.0, 2.1, 3.0, 3.1, 3.2, and 3.3 SP2.
The installation executable includes three installers. The Thermo Compound Discoverer 3.3
installer installs the Compound Discoverer application. The Thermo mzCloud Offline
Library installer installs four mzVault libraries that are July 2021 snapshots of the online
mzCloud mass spectral database, and the Thermo mzVault installer installs the mzVault
application.
Note If you are upgrading from the Compound Discoverer 3.0, 3.1, or 3.2
software, which is a free upgrade, you can also find your software license on the
Product Download web page.
Note If you have an earlier version of the application, follow the instructions to
remove it, and then restart the mzVault 2.3 SP1 installer.
If your computer meets the minimum requirements, the Next button becomes available.
3. Click Next to continue.
4. When the installation is complete, click Finish.
Customers who purchased an LC/MS system from Thermo Fisher Scientific on or after
January 1, 2018 are eligible for a free copy of the NIST 2020 MSMS library. You can obtain a
compatible version (that has been converted to the mzVault format) for use in the Compound
Discoverer application by sending an email message to Licensing at
[email protected]. Include the Sales Order number or the Purchase
Order number for the instrument in the email.
Customers who purchased the NIST 2020 library elsewhere can obtain the converted version
for use in the Compound Discoverer application by sending a proof of purchase for the library
to Licensing at [email protected].
Customers who purchased the NIST 2020 library elsewhere can obtain the converted version
for use in the Compound Discoverer application by sending a proof of purchase for the library
to Licensing at [email protected].
Special notices
Make sure that you follow the precautionary statements presented in this guide. The special
notices appear in boxes.
1. From the Compound Discoverer menu bar (on the processing computer where the error
occurred), choose Help > Create Bug Report.
The Thermo Discoverer Bug Reporter dialog box opens.
Contact us
Contact Email Telephone QR Code
1. Go to thermofisher.com.
2. Click Contact Us, select the country, and then select the type of support
you need.
3. At the prompt, type the product name.
4. Use the phone number or complete the online form.
Go to thermofisher.com/us/en/home/technical-resources.
Go to thermofisher.com/us/en/home/brands/thermo-scientific.
This introduction describes the features and enhancements in this release and how to set up
the application window, choose the size of the toolbar icons, and manage the recently used file
lists on the Start Page. In addition, the overview topic summarizes the application workflow
from starting the application to reporting the results of an analysis.
Some of the workflow nodes in the processing workflows require input from the customizable
lists and libraries for the application. For LC studies, these lists and libraries include the
structures of known compounds, mass lists, adduct ions, chemical transformations, ion
definitions, neutral losses, Metabolika pathways, compound classes (fragment lists), and mass
spectral libraries.
The result file from data processing includes a set of result tables and graphical views that are
based on the processing workflow. The application provides graphical views for displaying
chromatograms and spectra and statistical views for comparing the detected compounds by
sample file or sample group.
The application comes with defined templates for the processing workflows and reports.
These topics describe the new features and enhancements in the Compound Discoverer 3.3
application and the enhancements in the two service packs (3.3 SP1 and 3.3 SP2):
• Enhancements in the Compound Discoverer 3.3 SP2 release
• Enhancements in the Compound Discoverer 3.3 SP1 release
• New features in Compound Discoverer 3.3 for LC studies
• Enhancements in Compound Discoverer 3.3
• New processing workflow template for identifying the per- and polyfluoroalkyl
substances (PFAS) family of compounds
See “Defined processing workflow templates,” and “PFAS identification.”
For more information about using the Compound Discoverer application to identify
compounds in the PFAS family, refer to the following document:
http://assets.thermofisher.com/TFS-Assets/CMD/Application-Notes/an-001826-lsms-pf
as-analysis-workflow-compound-discoverer-an001826-na-en.pdf
• Additional transformation reaction, PFAS Chain Shortening, for the PFAS family of
compounds in the Transformations list
See “Transformations view.”
• Increased limit for the number of compounds that you can export to the molecular
networks viewer
The limit is now 3.000 compounds. See “Send compounds to the molecular networks
viewer.”
Enhancements common to both study types in the Compound Discoverer SP1 release
The Compound Discoverer 3.3 SP1 application includes the following enhancements for
both study types (LC and GC):
• The Mass List Editor can now open compound lists with a file size of up to 600 MB.
• The Natural Product Atlas database in the mass lists available from the Lists & Libraries
menu has been updated to version 2021_08. See “Mass Lists view.”
• The performance of the Map to BioCyc Pathways node has been improved to reduce the
run time of the mapping process.
Note The Detect Compounds (Legacy) node and the Find Expected Compound
(Legacy) node, with some performance improvements, are still available for
comparisons with existing analysis results.
– Find the most common adduct ion for each compound across all samples for relative
quantitation. Provide the option to base the chromatographic peak area for each
compound on the most common adduct ion or the summed area of all the detected
ions.
See the Area Integration parameter in the Group Compounds node and the Group
Expected Compounds node.
Note By default, the Area Integration parameter is set to Most Common Ion.
– Display the m/z value of the most common adduct ion in the m/z column and its ion
definition in the Reference Ion column of the Compounds and Expected
Compounds tables.
• New peak rating filter in the Group Compounds, Group Expected Compounds, and
Differential Analysis nodes provides peak thresholding based on the quality of the
chromatographic peaks.
This peak rating filter lets you filter out low-quality chromatographic peaks without
setting an overly restrictive minimum peak intensity threshold. To pass the filter,
chromatographic peaks must have a peak rating equal to or greater than the user-specified
threshold and be detected or found in at least the user-specified minimum number of input
files submitted for analysis. See Chromatographic peak rating filter.
Peaks that do not pass the peak rating thresholds in the Detect Compounds and Find
Expected Compounds nodes are removed early in the processing workflow. This data
reduction speeds up processing for the downstream nodes (gap, peak area refinement,
identification, mapping, and scoring) and increases overall performance.
The Differential Analysis node recalculates the peak ratings to capture changes in the
peak areas from the Fill Gaps node and the peak area refinement nodes.
The peak rating values of the chromatographic peaks are based on peak quality factors1,
relative peak area, and the coefficient of variance values (CVs).
• The Search mzCloud and Search mzVault nodes can perform MSn searches against the
library spectra in the mzCloud mass spectral database. The mzCloud search can retrieve
KEGG IDs and compound class information from the online mzCloud mass spectral
database and display these results in the Compounds table of the result file (analysis
result).
Tip The Search MSn parameter in the Search mzCloud and Search mzVault nodes is
set to True in the processing workflow templates provided with the Compound
Discoverer application. When you create your own custom processing workflow
templates, you must set this parameter to True to perform MSn searches. See the
Search MSn parameter in the Search mzCloud node and the Search mzVault node.
1
Chetnik et al, MetaClean: a machine learning-based classifier for reduced false positive peak detection in
untargeted LC-MS metabolomics data, http://www.ncbi.nlm.nih.gov/pmc/articles/pmc7895495/
2
ChromAlign: A two-step algorithmic procedure for time alignment of three-dimensional LC-MS
chromatographic surfaces, RG Sadygov et al. Analytical Chemistry, 2006
• A new version of the mzCloud Offline Spectral Library (2021B) and two in-silico spectral
libraries for lipids—LipidBlast4 in the Spectral Libraries list. See Spectral Libraries view.
• The Grouping & Ratios page of an analysis shows information on the statistical test that
the analysis will apply by using the selected groups and ratios.
• New Statistical Methods result table that provides detailed information on
transformations (performed by the Differential Analysis node), statistical tests, scaling
methods, gap filling, and QC methods that the analysis performed.
The Compound Discoverer 3.3 SP1 application includes the following enhancements:
• The Fill Gaps node includes a new parameter that lets the user specify a less restrictive
retention time tolerance for filling the gaps for compounds across the input files for the
analysis. For the Gap Filling node, gaps are missing chromatographic peaks for putative
compounds in one or more of the input files for an analysis. See “Fill Gaps node.”
• The Mass List Editor can now open compound lists with a file size of up to 600 MB.
• The Natural Product Atlas database in the mass lists available from the Lists & Libraries
menu has been updated to version 2021_08.
• Peak ratings are now available for chromatographic peaks that the Fill Gaps node
gap-filled by using a matching ion for the missing peak.
• The performance of the Map to BioCyc Pathways node has been improved, reducing the
run time of the mapping process.
When you reprocess analysis results (result files) for untargeted analyses from previous
versions of the Compound Discoverer application, a not available (n/a) warning appears on
the Detect Compounds node. The current workflow issues table below the Workflow Tree
area on the Workflows page of the Analysis view states that the node is not recognized by the
application.
(n/a) Detect
Compounds
4
Kind et al., LipidBlast in silico tandem mass spectrometry database for lipid identification,
http://dx.doi.org/10.1038/nmeth.2551
You cannot partially reprocess legacy analysis results from untargeted analyses. To reprocess
these results, you must replace the (n/a) Detect Compounds node with the Detect
Compounds node or the Detect Compounds (legacy node) and fully reprocess the analysis.
For details, see “Reprocess a legacy analysis result from an untargeted analysis.”
You can select the processing workflow and the input files and set up the study factors and
sample groups for an analysis by using the New Study and Analysis wizard. Or, you can set
up an analysis from within a study.
Compounds For LC studies, a compound (or component) is a chromatographic peak that the application
detected for a specific molecular weight (within the specified mass tolerance) and retention
time (within the specified tolerance). The Features table for a compound lists all the
chromatographic peaks that make up the peak for the compound.
Expected compounds For LC studies, an expected compound is a compound that the application generates by
using the parent compounds and transformations that you specify. You specify the parent
compounds and transformations in the Generate Expected Compounds node of a
processing workflow. After the application generates a list of expected compounds, it
searches for these compounds in the input files that you submit for analysis.
Features Features are chromatographic peaks with specific m/z × RT dimensions that the Detect
Unknown Compounds node detects. The Compound Discoverer application reports the
chromatographic peak areas in counts × seconds.
Gap In an extracted ion current (XIC) trace (chromatogram), gaps are missing data points—that
is, the scan or scans acquired at specific time points in the chromatographic run do not
include mass spectrum peaks for the compound’s mass within the user-specified mass
tolerance.
For the gap filling nodes, gaps are missing chromatographic peaks for putative compounds
in one or more of the input files for an analysis.
Input files Input files are Xcalibur raw data files that you have added to a study. The study lists the
relative location of the raw data files. If you move the raw data files to a different directory
after you add them to a study, you must resolve their location before you start an analysis.
Studies include a list of input files with their location, the sample information for each file,
and a list of the analyses run within the study. The sample information includes the file
name, study factor values, and sample type for each sample.
Study files also include the name and location of the folder where the Compound
Discoverer application stores the result files that you generate by running analyses within
the study.
For information about creating study template files that include all the study information,
see “Create a study template file that contains all the study information.”
Result files Result files contain the results of a specific processing workflow and input file set. Results
files are also known as analysis results. See Chapter 8, “Review the analysis results.”
Processing workflows Processing workflows are node-based processing methods for processing raw data files. You
can access the predefined processing workflow templates and your custom processing
workflow templates only from within a study. See Chapter 6, “Create and edit processing
workflows.”
Parent compounds For targeted analyses, a parent compound is an expected compound that you define in the
Expected Compounds library and then select for analysis in a processing workflow. The
result file from an expected compounds analysis includes an Expected Compounds table
with a Parent Compound column. You can select multiple compounds for an expected
compounds analysis.
In the molecular network viewer, parent compounds are the starting compounds for the
chemical transformations that you specify in the Generate Molecular Networks workflow
node.
LC studies overview
Figure 2 summarizes the workflow for setting up a study and an analysis, submitting a set of
input files for processing, and reviewing and reporting the results of the analysis.
For LC studies, there are two types of analyses—targeted and untargeted. In a targeted
analysis, the application starts with a list of expected compounds. In an untargeted analysis,
the application relies on matching the fragmentation spectra to compounds in a mass spectral
database. You can create processing workflows (processing methods) that combine these two
types of analyses.
Preliminary tasks
–and–
Customize the lists and libraries as applicable. For a If your processing computer has Internet access,
targeted analysis, add the expected compounds to check the communication to the online databases.
the Expected Compounds library.
See Chapter 17, “Test communication to the online
See Chapter 12, “Manage the lists and libraries.” databases.”
–or–
Select a report template and print a report. Export the data to an Excel spreadsheet.
See Chapter 11, “Create and print reports.” See “Export the tabular data in a result file to an
external file.”
The Compound Discoverer window opens with the Start Page displayed as a tabbed
document. As you create studies and process data, the application creates and populates the
recent file lists to the right of the What Would You Like to Do? links.
Figure 3 shows the initial application window with large toolbar icons:
• The icons in the first row are always available.
• Most of the icons in the second and third rows become available when you open a result
file.
Figure 3. Application window with the initial Start Page and large toolbar icons
For information about the toolbar icons and application menus, see “The Compound
Discoverer window.”
These topics describe the menu bar, the toolbar, and the recently used files lists on the Start
Page:
• Compound Discoverer menu bar
• Compound Discoverer toolbar
• Change the size of the toolbar icons
• Manage the recently used files lists on the Start Page
Note This user guide uses the following terms to describe the user interface:
• View—A dockable window that you can move to a second monitor.
• Page—A tabbed document. You can have many pages open simultaneously; however,
only one of these pages is the active page.
• Dialog box (modal)— A graphical element that accepts user input. Only one modal
dialog box can be open at a time. When it is open, a modal dialog box blocks you
from working in other parts of the application.
• Pane—A defined area of an application view, page, or dialog box.
• Prompt—A pop-up message box that you must dismiss to continue.
The Compound Discoverer study file type has the .cdStudy file
name extension.
Open Result Opens the Open Result File dialog box for selecting an existing
result file or result view file.
The file name extension for result files that contain the data
processing results is .cdResult.
The file name extension for files that contain the display layout for
the results tables, graphical views, and filter settings
is .cdResultView.
To restore the default layout for a result file, delete its associated
result view file.
Note You do not need an active software license to open result
files.
Save Saves recent changes to the current active page (selected tab), for
example, the active study page or result page.
Save All Saves recent changes to all the open pages in the application
window.
Recent Studies Displays a list of recent studies that you can open.
Recent Studies > Clear Clears the Recent Studies list.
Recent Results Displays a list of recent results that you can open.
Recent Results > Clear Clears the Recent Results list.
Exit Closes the application.
Reporting menu
The Start Page and Job Queue commands from this menu are always available. The other
View commands are available only when a result file is active.
Start Page Opens the Start Page, which lists the most recently opened result
files and study files.
Job Queue Opens the Job Queue page for viewing the progress of the current
analysis or the processing events of previous analyses.
Result Summary Opens the Summaries view.
Custom Tags Editor Opens the Custom Tags Editor for selecting the number of
custom tags that you want to use for tagging items in the result
tables and the text that you want to display for each tag.
Available only when the active result file includes mapped BioCyc
pathways.
KEGG Pathways Opens the KEGG Pathways view for viewing the KEGG™
pathways that are mapped to the compounds data.
Available only when the active result file includes mapped KEGG
pathways.
mzLogic Analysis Opens the mzLogic Analysis view.
FISh Scoring Queue Opens the FISh Scoring Queue view.
Window menu
Table 6 describes the icons in the Compound Discoverer toolbar from left to right.
Table 6. Toolbar icons (Sheet 1 of 3)
Icon Description
Study icons
Opens the New Study and Analysis Wizard that takes you through the process
of specifying the studies folder for all your study subfolders, creating a new
study, and starting a new analysis.
Opens the Open Study dialog box for selecting the current version or a
previous version of an existing study.
Opens the Open Result File dialog box for selecting the current version or a
previous version of a result file.
General icons
Saves the currently active item, such as a study or result file.
Saves all the open pages, such as the study pages and the result pages.
Opens the Start Page when it is not already open and makes it the active page.
Opens the Job Queue page when it is not already open and makes it the active
page.
Opens the Lists and Libraries Manager where you can select to open one of the
following editors:
• Expected Compounds—For viewing or modifying a list of compounds
• Transformations—For viewing or modifying a list of transformations
• Neutral Losses—For viewing or modifying a list of neutral losses
• Ion Definitions—For viewing or modifying a list of ion definitions
• Adducts—For viewing or modifying a list of adducts
• Mass Lists—For creating, editing, importing, exporting, or deleting mass
lists
• Spectral Libraries—For creating, editing, importing, exporting, or deleting
mzVault database files
• Metabolika Pathways—For adding, editing, importing, exporting, or
deleting Metabolika pathways.
• Compound Classes—For creating, editing, importing, exporting, or
deleting compound class libraries of fragment structures
Available when a result file is the current page in the application window. When the
respective view is open, brings the view to the forefront or makes the view the active view.
Opens the Summaries view as a docked view.
The Results Chart view includes the following pages: Scatter Chart, Histogram
Chart, Bar Chart, and Pie Chart.
Opens the Descriptive Statistics view as a docked view.
Available when the result file includes more than one input file.
Opens the Compounds Area Correction view as a docked view.
Reporting icons
Available when a result file (not necessarily the current page) is open in the application
window.
Opens the Open Report Design Template dialog box for selecting a report
template (.cdReportTemplate).
Opens the Customize Report dialog box for setting up a custom report
template.
Opens the Open Report Design Template dialog box for selecting a report
template (.cdReportTemplate).
Note The check mark to the left of Show Large Icons indicates that the application is
set to display large toolbar icons.
Auto-Hide the Start Page, the Chromatograms view, and the Mass
Spectrum view
To make more space for other views, you can auto-hide the Start Page and the
Chromatograms and Mass Spectrum views in the Compound Discoverer window. The
auto-hide features collapses each of these items down to a small tab.
Note Excluding the Start Page and Job Queue, views that open from the View menu or
toolbar are not tabbed documents—that is, when they are open, they are not listed in the
open files.
To display, dock, hide, or rearrange the tabbed pages (documents), see these topics:
• Tabbed pages list
• Open a hidden tabbed page
• Tab groups
• Shortcut menu commands that control the layout of the tabbed pages
• Rearrange the tabbed pages and graphical views
Current Tab
icon
Click the Current Tabs icon, , to display a list of open files. Then, select the
appropriate tabbed page from the list.
The selected page becomes active.
Tab groups
In the Compound Discoverer window, when two or more tabbed pages (documents) are open
in the same tab group, you can create more tab groups. Each tab group has its own Current
Tabs icon, . Figure 8 shows tab group examples.
Figure 8. Orientation of tab groups
Single tab group Current Tab icon
Two
horizontal
tab groups
Shortcut menu commands that control the layout of the tabbed pages
Table 8 describes the shortcut menu commands that control the properties of the tabbed
pages (documents) in the Compound Discoverer window.
Table 8. Shortcut menu for tabbed documents (Sheet 1 of 2)
Command Description
Dockable Activates the Auto Hide command.
Available for the Start Page and the result file views. This
command is not available for the Job Queue page.
Tabbed Document Makes the page a tabbed document.
Available for the Start Page, Job Queue page, License Manager
page, library pages, study pages, result file views, and report
template pages.
Auto Hide Hides the page while leaving the tab visible. Clicking the tab
opens the page. Clicking outside the page closes the page if more
than one tabbed document is open. The location of the tab
depends on the position of the tabbed document in the
application window.
Available for the Start Page and the views in the View menu when
these pages are dockable windows. This command is not available
for the Job Queue page.
Hide Closes the tabbed document.
Move to Previous Tab Changes the position of the tabbed document.
Group
Available only when the application window contains two or more
tabbed groups.
Move to Next Tab Changes the position of the tabbed document.
Group
This command is available only when the application window
contains two or more tabbed groups.
New Horizontal Tab Moves the selected tabbed document to a new horizontal tab
Group group.
Available only when there are two or more tabbed documents that
belong to the same tab group in the application window. Each tab
group has its own Current Tab icon.
Y To move a view or tabbed page to another position by using the mouse pointer
1. Drag the view by its title bar or the page by its tab. As you drag a view or a tabbed page by
its title bar, a guide tool appears. The guide tool consists of four directional arrows (inner
arrows) that are arranged in a diamond pattern around a central circle. In addition to the
guide tool, a directional arrow (outer arrows) appears in the middle of each of the four
window edges.
Figure 9. Guide tool
2. After the guide tool appears, align the pointer with the appropriate directional arrow, and
then release the mouse button.
Table 9. Rearranging views and tabbed pages by using the mouse pointer
Task Procedure
Move a graphical view to another Drag the pointer to the second monitor.
monitor.
Move the selected view above the second Drag the pointer to the inner up arrow, .
view.
cdResult Contains the results produced by processing a set of raw data files
and information about the analysis settings used to process the raw
data.
cdResultView Contains the layout settings that the application uses to display
the available tables and graphical views of a result file. These
settings also include the applied result filters.
Deleting this file erases all the custom layout settings and restores
the display to the default layout settings.
You can open EMF files in a raster image editor or a vector image
editor.
CSV You can import the contents of a CSV file into a Mass List file.
CLIB You can import the contents of a CLIB file as a Compound Class
list. A compound class list contains the fragment structures that
are common to the named compound class.
DB You can import mzVault libraries into the Spectral Libraries list.
MSP You can export compounds as MSP files that you can then import
into the NIST MS Search application.
IMPORTANT If the input files are from a FAIMS-MS experiment and any of them
include spectra for more than one FAIMS CV value (also known as CV switching),
you must specify the CV value that you want to process, and all the input files must
include spectra for the user-specified CV value.
If the input files are from an experiment with mass range switching, you must specify
the mass range that you want to process, and all the input files must include spectra
for this mass range.
3. Aligns the input files by their alignment features when the analysis includes multiple
input files and the processing workflow includes the Align Retention Times
(ChromAlign) node or the Alignment Retention Times node.
Note Most of the processing workflow templates in the Common Templates folder
use the Align Retention Times (ChromAlign) node.
The Align Retention Times (ChromAlign) node is new in the Compound Discoverer 3.3
application. This node automatically selects the first sample file in the Files for Analysis
area of an analysis as the reference alignment file. A sample file is an input file assigned
any of the following sample types: Sample, Control, or Standard. After you add input files
to the Files for Analysis area of the analysis, you can select a different sample file as the
reference file.
The legacy Align Retention Times node individually aligns each feature ((m/z × RT)) as
follows:
• When the set of input files includes only one sample group, the node uses the input
file with the most features (landmarks) as the reference file.
• When the set of input files includes multiple sample groups, the node aligns the
features (m/z value × RT) within a group first. Then, aligns the features among the
groups based on which group has the most features.
The new Align Retention Times (ChromAlign) node builds correlation matrices based on
spectral similarities. Then, creates regression curves by using the optimal path in the
correlation matrix.
Tip Use the Retention Time Corrections view and the File Alignments table to review
the corrected retention time of each detected compound in a result file.
4. Detects the XIC traces and chromatographic peaks by doing the following:
• For an untargeted processing workflow that includes the Detect Compounds node,
the analysis detects the chromatographic peaks in the XIC traces.
• For a targeted processing workflow that includes the Expected Compounds node, the
analysis searches for the expected compounds in the XIC traces.
Both the Detect Compounds node and the Find Expected Compounds node calculate
the chromatographic peak areas (for relative quantitation) according to the following
parameter settings:
• Use Most Intense Isotope Only:
– True: Displays the area of the chromatographic peak in the XIC trace for the
most intense isotope in the isotope pattern for the compound.
– False: Displays the combined area for all the detected isotopes in the isotope
pattern for the compound.
The Detect Compounds node automatically detect the isotope pattern for organic
compounds with the following elements: C, H, N, O, and S. You have the option to
add Cl and Br to this list.
• Remove Baselines:
– True: Does not include the extra area caused by integrating the area under the
XIC trace down to the baseline.
– False: Drops a perpendicular tangent line from the start and end points of the
chromatographic peak down to the baseline and includes this additional area in
the reported peak area.
5. Groups the compounds across the input files as follows:
• An untargeted processing workflow groups each compound across the input file set
by its MW × RT dimensions.
• A targeted processing workflow groups each expected compound across the input file
set by its formula × MW × RT dimensions.
When you turn on the peak rating filter, the grouping nodes (Group Compounds and
Group Expected Compounds) store only the compounds that pass the filter in the
analysis result. In addition, they do not send these failing peaks to the downstream nodes
(Fill Gaps, Peak Area Refinement, Identification, Pathway Mapping, Compound Scoring)
for further processing, which speeds up the processing time.
Even when you do not turn on the peak rating filter, you can use the Result Filters view to
filter the compounds in the analysis result by using the Peak Rating column or by any of
the peak quality factor (PQF) columns in the compounds tables. By default, the PQF
columns are hidden.
For details, see “Chromatographic peak rating filter.”
6. For an untargeted workflow, fills the gaps for missing chromatographic peaks when the
processing workflow includes the Fill Gaps node.
Filling the gaps with nonzero values is necessary for the statistical analyses.
7. For an untargeted workflow, predicts the elemental composition of the compounds when
the processing workflow includes the Predict Compositions node.
8. Assigns annotations to the compounds by using the available information from the
identification and pathway mapping nodes when the processing workflow includes the
Assign Annotations node.
The new processing workflow templates for untargeted analyses perform the initial peak
detection at a very low minimum peak intensity threshold of 10 000 (minimum intensity, in
ion counts, at the chromatographic peak apex). This low-intensity threshold filters out very
few chromatographic peaks. To remove low-quality or non-reproducible peaks, you can set up
a peak rating filter in the Group Compounds node. This peak rating filter combines peak
quality and peak reproducibility across the input file set.
Note When you submit 10 or more input files (assigned the Control, Standard, Sample,
or Blank sample type) for processing and the analysis includes one or both of the grouping
nodes, the application prompts you to set up a peak rating filter.
The application considers only the following sample types when it calculates the total
number of files for the analysis: Control, Standard, Sample, and Blank. It excludes the
following sample types from the total number of files: Identification Only and Labeled.
The processing workflow templates for targeted analyses still include a relatively high
minimum peak intensity threshold of 1 000 000. So, these templates automatically remove
low-intensity noise-level chromatographic peaks for most analyses. However, when you are
analyzing large data sets with similar samples, you might find it useful to set up a peak rating
filter to remove spurious compounds that are present in only a small subset of the samples.
Note Removing chromatographic peaks that do not pass the peak quality threshold in a
specified number of input files at the grouping stage of the analysis decreases the overall
processing time by streamlining other processing steps, such as gap filling and
identification.
By default, the peak rating filter in the processing workflow templates is not enabled. See
Figure 10.
To enable the filter, you must specify the minimum number of input files (number greater
than or equal to 1) where the chromatographic peak for a putative compound meets or
exceeds the specified peak rating value (0.1 to 10). You can also modify the various peak
quality factors that contribute to the overall peak rating value. See “Peak quality factors.”
Figure 10. Group Compounds node and Group Expected Compounds node parameters
O
OH Group Compounds Group Expected
C
Compounds
The Group Compounds and Group Expected Compounds nodes calculate the peak rating of
the chromatographic peaks for each compound across the set of input files. The Differential
Analysis node recalculates the peak rating values for each chromatographic peak before it
performs a differential analysis based on the peak areas.
For information about the peak rating values in the result tables, see “Peak Rating columns.”
You can filter the compounds in the compounds tables by the Peak Rating column.
For information about the peak quality factor columns in the result tables, see “Peak quality
factor (PQF) columns in the result tables.” You can filter the compounds tables in the analysis
results by the PQF columns.
Note The Detect Compounds (Legacy) node and the Detect Compounds node use
different approaches for isotope grouping:
• The Detect Compounds node uses the peak quality information to group the XIC
traces for common isotopes of C, H, N, O, and S and optionally for Cl and Br. It
ignores XIC traces with low-quality chromatographic peaks for isotope grouping.
• The Detect Compounds (Legacy) node does not use the peak quality information
during grouping. It uses the peak quality information only for filtering after grouping.
When used in a processing workflow that runs only a targeted analysis, the Find Expected
Compounds node includes hard-coded peak quality thresholds for isotope pattern
detection:
• PQF: J (Jaggedness) ≤ 0.4
• PQF: ZZI (Zig-Zag Index) ≤ 0.25
• PQF: M (Modality) ≤ 0.9
When used in a processing workflow that includes a targeted analysis and an untargeted
analysis, the Find Expected Compounds node uses the user-specified peak quality
thresholds in the Detect Compounds node.
The Find Expected Compounds (Legacy) node does not use peak quality information.
When a processing workflow includes untargeted and untargeted analyses, it can include
the Detect Compounds node and the Find Expected Compounds node or the Detect
Compounds (Legacy) node and the Find Expected Compounds (Legacy) node.
Where:
Wh = the peak width at half the maximum peak height
Wb = the peak width at the base
The calculated value ranges from 0 to 1. As the peak quality increases, the calculated value for
the peak quality factor decreases.
Figure 11. FWHM to base peak quality measurement
Baseline Original Smoothed
120
8.130
100 Peak height
80
Intensity [counts] (10^6)
60
Peak width at half height
40
20
8.056 Peak width
at the base
0
8.1 8.2 8.3 8.4 8.5 8.6 8.7
RT [min]
The calculated value ranges from 0 to 1. As the peak quality increases, the calculated value for
the peak quality factor decreases.
Figure 12. Jaggedness peak quality measurement
Baseline Original Smoothed
60
3.006
50
Intensity [counts] (10^3)
40
30 2.988
20
10
0
2.96 2.97 2.98 2.99 3.00 3.01 3.02 3.03
RT [min]
The calculated value ranges from 0 to 1. As the peak quality increases, the calculated value for
the peak quality factor decreases.
Modality = --a-
b
a
Intensity
Vn
Intensity
V1 b Vn + 1
V1
For information about checking the processing computer’s access to these online databases,
see Chapter 17, “Test communication to the online databases.”
For information about how the application determines and uses the best scans, see these
topics:
• Best MS1 scan for isotope pattern matching
• Best MS2 scan for fragments matching and spectral comparison
The application sends the isotope pattern information from the best MS1 scan to the
following workflow nodes for further processing:
• The Predict Compositions node for isotope pattern matching
• The Apply Spectral Distance node for calculating the spectral distance between the
assigned elemental composition and the experimental isotope pattern
To determine which MSn scans to attach to a compound, the application does the following:
1. Searches for MS2 scans for the precursor ion within the RT range of the peak
apex ± FWHM for a compound.
2. If it finds no MS2 scans for the precursor ion within this range, it searches for scans
within the start and end points of the chromatographic peak for a compound, as
determined by the Parameterless Peak Detection (PPD) algorithm.
3. To determine the best MS2 scan, it does the following:
a. Selects the MS2 scan for the preferred ion.
b. If it finds multiple spectrum trees for the preferred ion, it uses the MS2 scan from the
MS1 scan with the highest precursor intensity.
For each compound, the Group Expected Compounds and the Group Compounds nodes
send all the MS2 scans associated with that compound to the following nodes:
• Search mzCloud node
• Search mzVault node
For each compound, the Group Compounds node sends the best MS2 scans to the Apply
mzLogic node.
For information about setting up and running an analysis, see Chapter 4, “Set up, run, and
reprocess analyses.”
Tip For information about selecting a processing workflow template for an analysis, see
“Start a new analysis from within an existing study.”
When an mzCloud search yields no identity matches for an unknown compound, the
mzLogic algorithm provides a ranking score for the compound hits from the identification
nodes and pathway mapping nodes.
The mzLogic algorithm can provide a ranking score for the various database search results
when an unknown compound has available data-dependent MS2 scans and similarity results
from an mzCloud similarity search.
Note The ranking score is not a probability score. It is only a measure of how similar the
fragmentation spectra for a putative compound are to closely matching spectra in the
mzCloud spectral database.
During data processing, the Apply mzLogic node does the following:
1. Runs a forward search and a reverse search using the mzCloud service.
2. For compounds that have available MS2 scans, scores all the structure candidates (or the
specified maximum number of candidates) from the attached input nodes.
Note The following nodes can supply structures to the Apply mzLogic node: Search
ChemSpider, Search Mass Lists, Map to BioCyc Pathways, and Map to Metabolika
Pathways.
The mass lists that you select for the Search Mass Lists node must include structures.
The Endogenous Metabolites database 4400 Compounds.masslist file does not
include structures.
Because the mzLogic Analysis view includes a link to the ChemSpider database where you can
select putative structures, the result file does not need to include structure annotations.
The scoring model is based on extracting six observables from a query versus library spectral
pair:
• Spectral similarity (expressed as the spectral match value)
• Relative difference in the collision energies for the query and library spectra
• Polarity of the spectra
• Number of peaks in the spectra
• Relative energies of the query spectrum and library spectrum
• Identity of the compounds that produced the library spectrum and the query spectrum
The final score for a query spectrum versus library spectrum pair is based on the likelihood of
two spectra belonging to the same compound given the observed spectral match, the collision
energy of the unknown, the number of peaks in each spectrum, the polarity of each spectrum,
and the difference in the relative collision energies of the two spectra.
The following examples illustrate how to use the confidence score and the match score for a
query versus library spectrum comparison.
Suppose that you are identifying 6-hydroxynicotinic acid in your sample and assume that the
representative query spectrum for your sample exactly matches the mzCloud library spectrum,
producing a match score of 100.
When you search the mzCloud library using the standard ID search, which is restricted only
to the product spectra with a matching precursor ion, you find very similar spectra for related
compounds. Because the spectrum is non-specific, any molecule with the same molecular
weight and adduct is likely to have the same non-fragmented spectrum at the lower end of
activation energies. The Confidence algorithm severely penalizes these library search results
because they are based on a non-specific spectral fingerprint (only 1 precursor peak), making
the probability of a correct ID very low.
Figure 15. MS2 HCD 20 product spectrum of 6-hydroxynicotinic acid
Figure 16 shows the Confidence and Match scores for the three top hits for the low-energy
fragmentation scan of 6-hydroxynicotinic acid. Because the query and library spectra are so
non-specific (with only one fragment peak), the Confidence scores are extremely low
compared to the Match scores.
Figure 16. mzCloud search results for a query spectrum of 6-hydroxynicotinic acid obtained at a
collision energy of 20 HCD
m/z
Figure 18 shows the fragmentation spectrum for 6-Hydroxynicotinic acid at the higher
fragmentation energy (HCD 100).
Figure 18. MS2 HCD 100 product spectrum of 6-hydroxynicotinic acid
The search results against the mzCloud database look very different from the first example at
the lower fragmentation energy. In Figure 19, you see greater separation in the scores between
the true hit (6-hydroxynicotinic acid) and its analogs (false hits). The separation is even more
pronounced in the Confidence score ranking.
Figure 19. mzCloud search results for a query spectrum of 6-hydroxynicotinic obtained at a
collision energy of 100 HCD
When you look at the next best library hit, 3-hydroxypiconilic acid, which is a positional
isomer of the true hit, you see different fragmentation energies for the query spectrum
(HCD 100) and the library record (HCD 30). The assumption is that identical compounds
at similar fragmentation energies produce similar fragmentation spectra. So, if you observe
similar fragmentation spectra (= a high spectral match score) at different collision energies, it
might mean that the original assumption of identical compounds is incorrect, which the
heavily penalized Confidence score (80.4 spectral match score versus 9.2 confidence score)
reflects for this particular result.
Figure 20. Query spectrum (HCD 100) versus library spectrum (HCD 30)
When the Confidence score is significantly lower than the Match score, evaluate the possible
reason for this difference.
When the Match score and the Confidence score are both greater than 80%, the confidence
that you are identifying the compound correctly is relatively high.
The FISh scoring algorithm compares the experimental fragmentation spectra for a
compound to the expected fragmentation spectra based on the structure of the compound.
Note In the 3.1 and later versions of the application, the FISh scoring algorithm uses all
the fragmentation scans (in the spectrum tree for a compound), compared to earlier
versions of the application that only used MS2 scans. Therefore, if you reprocess data sets
that you already processed with an earlier version of the application, and these data sets
include MSn scans (where n > 2), the FISh coverage scores might be lower.
For targeted compounds (expected compounds), you can add FISh scoring to the automated
analysis.
For untargeted compounds, you can apply FISh scoring to your proposed structures or
compounds with structures and MS2 or higher spectra.
The FISh scoring algorithm matches the fragment structures in a list of expected fragments to
the centroids in the fragmentation scans of the precursor ions.
When a precursor ion scan is followed by only one fragmentation scan, the node calculates the
FISh coverage score as follows:
# matched centroids
FISh coverage score = -------------------------------------- × 100
# used centroids
where:
# matched centroids represents the number of matched centroids.
# used (matched + unmatched) centroids represents the number of centroids in the
fragmentation scan that are above the user-specified signal-to-noise threshold. The
algorithm skips centroids below the user-specified signal-to-noise threshold.
When a precursor scan is followed by more than one fragmentation scan, the node calculates a
composite score as follows:
( Σ per all scans # matched centroids )
FISh coverage score = -------------------------------------------------------------------- × 100
( Σ per all scans # used centroids )
The FISh scoring algorithm annotates the centroids in the fragmentation scans with the
matching fragment structures. It also provides a FISh Coverage score for data-dependent scans
in the Mass Spectrum view legend and a FISh Coverage score in the compounds table.
Table 11 lists the formulas that the application uses to calculate the mass defect of an
elemental composition.
where:
exact mass = monoisotopic mass of the elemental
composition
Standard mass defect exact mass – nominal mass
where:
nominal mass = integer mass of the elemental composition
where:
Kendrick mass = a × (b/c)
a = exact mass of the elemental composition
b = nominal mass of the Kendrick formula
c = exact mass of the Kendrick formula
For more information about using the mass defect feature, see these topics: “Filter By Mass
Defect node,” “Calculate Mass Defect node,” and “Mass Defect Plot view.”
For LC studies, follow these steps to report neutral loss fragments and to visualize the spectral
peaks that are due to neutral losses from higher-mass precursor ions in the fragmentation
scans for compounds in the Compounds table:
1. Add the Search Neutral Losses node to the processing workflow and specify the neutral
losses for the search. See “Search Neutral Losses node.”
The Group Compounds node automatically connects to the Search Neutral Losses node.
Note The m/z value of an expected fragment ion equals the m/z value of the best MS2
ion minus the user-specified neutral loss.
Tip Because the Search Neutral Losses node is downstream of the Group Compounds
node, you can quickly run a neutral loss search by reprocessing a previous analysis. See
“Reprocess an analysis.”
2. After you run an analysis with a processing workflow that includes the Search Neutral
Losses node, open the result file.
The Search Neutral Losses node generates the following:
• The Neutral Losses column in the Compounds table
• The main Neutral Losses table (see Neutral Losses table)
• A related Neutral Losses table for each detected compound
3. To check how many compounds the analysis found for each specified neutral loss, click
the Neutral Losses tab in the main result tables pane.
Figure 21. Neutral Losses table for an analysis with two specified neutral losses
4. To make it easier to review the neutral losses for compounds of interest, modify the layout
of the result page as follows:
a. Close the Chromatograms view by clicking the close icon in the upper-right corner of
the view.
b. Click the Compounds tab to open the Compounds table.
c. Right-click the Compounds table and choose Enable Column Fixing. Then, scroll
to the right and click the pin icon to the right of the Neutral Losses column header.
The Neutral Losses column moves to the leftmost column position.
d. Right-click the Compounds table and choose Expand All Column Headers.
The selected neutral losses are visible as subheadings in the Neutral losses column.
e. To sort the neutral losses by one of the neutral losses, select the specific (vertical)
neutral loss heading. Then, click the Neutral Losses column header until the arrow
points down.
f. (Optional) To pin compounds of interest to the top of the table, click the pin icon to
the right of their row numbers.
Figure 22 shows a result table from a processing workflow with the Search Neutral Losses
node with two neutral losses selected: glucuronide and glucuronic acid. The Neutral
Losses column is fixed at the left, and a compound is pinned to the top of the table.
Figure 22. Result file from a processing workflow with the Search Neutral Losses node (with
column fixing enabled and a compound pinned to the top of the table))
5. To view specific neutral losses for a compound in the Mass Spectrum view, do the
following:
a. Select the compound in the main Compounds table.
b. Click Show Related Tables at the bottom left of the result page.
c. In the first related tables pane, click the Neutral Losses tab.
d. In the Neutral Losses table, select the neutral loss that you want to visualize in the
mass spectrum.
e. In the Mass Spectrum view, review the fragmentation spectrum with the selected
neutral loss.
6. To return the Mass Spectrum view to the normal view, click another row in the
Compounds table.
Figure 23. Fragmentation spectrum with annotated neutral loss
To create a pooled quality control sample, combine a small aliquot from each sample in the
processing batch (input file set to be processed to create one result file). When setting up an
acquisition sequence, bracket the unknown samples by injecting this pooled quality control
sample at regular intervals.
For details about processing the raw data files from a sequence with quality control samples,
see these topics:
• Batch normalization for single sequence runs
• Batch normalization for multiple sequence runs (LC studies)
Note For information about setting up a processing workflow for a QC-corrected sample
set, see “Apply QC Correction node.” For information about reviewing the corrected
compound areas for a QC-corrected sample set, see “Compound Area Corrections view.”
An analysis does not create a regression curve for a particular compound and does not correct
the areas in the non-QC samples unless all three of these conditions are met:
• It detects the compound in the user-specified minimum percentage of the QC samples.
• The relative standard deviation of the detected peak areas for the compound in the QC
samples before and after correction does not exceed the user-specified thresholds.
• The number of samples acquired between the QC samples does not exceed the
user-specified number.
You can view the results of the batch normalization process in the main compounds result
table (Compounds table) and the Compound Area Corrections view. The Norm. Area
column in the compounds table displays the corrected compound areas. The Compound Area
Corrections view shows the effect of the signal correction on the absolute compound areas.
5
Dunn, W.B.; Broadhurst, D.; Begley, P.; Zelena, E.; Francis-McIntyre, S.; Anderson, N.; Brown, M.; Knowles,
J.D.; Halsall, A.; Haselden, J.N.; Nicholls, A.W.; Wilson, I.D.; Kell, D.B.; Goodacre, R. Human Serum
Metabolome (HUSERMET) Consortium. Nat Protoc. 2011, 6(7), 1060-83.
When the application excludes the selected compound during the QC signal correction step,
the Compound Area Corrections view is empty. In addition, the Norm. Area column for the
compound is empty across all the input files, and a filter icon appears on the compounds table
tab. Pointing to the tab displays a tooltip with the number of displayed compounds, detected
compounds, and filtered-out compounds. Clicking the filter icon displays all the detected
compounds, including the compounds with uncorrected peak areas.
For additional information, see Chapter 4, “Set up, run, and reprocess analyses,” and
“Compound Area Corrections view.”
IMPORTANT The input files can comes from different batches, but the parameter settings
for the instrument methods used in these different batches must be the same.
The node determines how to divide the input files into batches as follows:
1. The node orders the input files by their acquisition times.
2. After the node groups the input files by time and determines the largest time gaps, it
attempts to divide the input files into the number of user-specified batches, while
maintaining a minimum of five QC samples per batch. The node assumes that the biggest
gaps in the series of acquisition times are due to the time gaps between batches.
3. If a batch (as determined by a time gap) contains fewer than five QC samples, the node
merges it with the smallest neighboring batches. If the node cannot create multiple
batches with five QC samples each, it assigns all the input files to the same batch. The
SERRF algorithm can process one batch without problems.
4. The node numbers the detected batches, starting with 0, and reports the sequential batch
number (0 to the maximum user-specified batch number) in the Batch column of the
Input Files result table.
For information about adding the Apply SERRF QC Correction node to a processing
workflow, see “Peak area refinement node connections.” For information about the node
parameters, see “Apply SERRF QC Correction node.”
Using an unlabeled reference sample, the application detects unknown compounds above a
specified minimum intensity threshold, determines their elemental composition and identity,
and then determines the labeled counterparts (isotopologues) of these compounds in the
samples marked as labeled.
You can use any isotopic label in your experiments; however, when you use a label other than
carbon-13, you must specify the labeled element in the processing workflow.
The application reports the isotopologues and the fractional label incorporation (exchange
rate) for each compound. You can overlay the exchange rate as well as other statistical data
onto pathways using Metabolika™, which is directly integrated into the Compound
Discoverer application.
Note Currently, the application treats Sample, Control, and Standard samples
the same way. This functionality is subject to change in future releases.
The Analyze Labeled Compounds node adds the following columns to the Compounds table
in a result file: Labeling Status, Ave. Exchange, and Rel. Exchange Rate. See “Compounds
table (LC studies).”
To determine the presence of contaminating masses, the Analyze Labeled Compounds Node
evaluates the measured isotope pattern versus the fitted isotope pattern (for the expected
isotopologues). It also evaluates the distribution of the measured exchange rates for the
expected isotopologues. If the distribution is not continuous, for example, if the compound
has three exchangeable atoms and chromatographic peak area for the M+2 isotopologue is
significantly less than the chromatographic peak area for the M+1 and M+3 isotopologues,
the node assigns an Irregular Exchange status to the input file.
• ( ) Green—No Warnings—The measured isotope patterns and the exchange rates are
within acceptable limits.
• ( ) Gray—Compound was not detected in this sample.
PFAS identification
Per- and polyfluoroalkyl substances (PFAS) are a highly stable group of small molecules that
have a wide range of commercial applications. PFAS are highly soluble in water, chemically
stable, persistent in the environment, and can accumulate in the human body over time,
leading to adverse human health effects and potential human health risks. Due to these health
risks and the pervasiveness of PFAS in common household items, some compounds in the
PFAS family have been banned and analytical tests have been developed to check for their
presence.
LC-MS/MS analysis is a common analytical technique used for determining the presence of
per- and polyfluoroalkyl substances in extraction solutions. The Compound Discoverer
application supports the identification of PFAS in raw data files acquired with a Thermo
Scientific high-resolution accurate-mass (HRAM) mass spectrometer. The data files must
include high-quality MS1 full scans and data-dependent MS2 fragmentation scans.
These topics describe how to use the processing workflow template for PFAS analysis that is
provided with the application and how to review the analysis result:
• PFAS processing workflow template
• Review the results of a PFAS analysis
You can find this processing workflow template in the following folder for LC studies:
Common Templates\Workflow Templates\PFAS\
Note For information about selecting a processing workflow for an analysis, see “Select a
processing workflow for the analysis.”
The PFAS processing workflow template processes the raw data files as follows:
• Performs retention time alignment, unknown compound detection, and compound
grouping across all samples.
Table 12. Customized parameter settings for the Detect Compounds node in the PFAS
processing workflow template
Parameter Default setting for the node Custom setting for PFAS
Min. Peak Intensity 10,000 (1e4) 1,000 (1e3)
Ions [M+H]+1; [M+K]+1; [2M+FA–H]–1; [2M–H]–1;
[M+Na]+1; [M-H]–1 [2M–H+HAc]–1; [M+Cl]–1;
[M+FA–H]–1; [M–2H+K]–1;
[M–H]–1; [M–H+HAc]–1;
[M–H–H2O]–1
• Predicts elemental compositions for all compounds, fills gaps across all samples, and hides
background compounds (by using the Blank samples).
In the Predict Compositions node, the maximum number of fluorine atoms in the
elemental composition of the compound is limited to 50.
Minimum Element Counts: C H F
Maximum Element Counts: C90 H190 Br3 Cl4 F50 N10 O18 P3 S5
• Identifies compounds by using the following:
– The mzCloud™ mass spectral database (data-dependent MS2)
– The EPA DSSTox database in the ChemSpider chemical structure database (formula
or exact mass)
– The following mass lists, which are provided with the application:
– PFAS_NEG, PFAS_NIST, and PFASSTRUCT-2022-04-20
• Flags unknown compounds that share common set of fragments by using the Compound
Class Scoring node.
Uses the following compound class files, which are provided with the application: PFAS
General from FluoroMatch Suite.cLib and \PFAS Fine signature fragment_lib.cLib.
• Generates mass defect values in the Compounds table that are based on the selected mass
defect type (Kendrick for identifying homologous series).
Uses the following Kendrick mass: C F2.
• Generates a molecular network to visualize compounds that might be related.
Uses the following transformation: PFAS Chain Shortening (C F2 -> ) under Other.
• Supports the optional scripting node that is available on AnalyteGuru.
https://www.analyteguru.com/t5/Scientific-Library/Compound-Discoverer-PFAS
-Scripting-node/ta-p/20051
Note For more information about reviewing the results from an analysis that used the
PFAS processing workflow template that is provided with the application, refer the
following article:
http://assets.thermofisher.com/TFS-Assets/CMD/Application-Notes/an-001826
-lsms-pfas-analysis-workflow-compound-discoverer-an001826-na-en.pdf
1. Open the analysis result by double-clicking the job on the Job Queue page or by clicking
the analysis result on the Analysis Results page of a study.
2. Apply the following result filters to filter the Compounds table to display only high
confidence candidates with matched fragments as follows:
a. Set the following filters:
AND (All of these conditions must be true.)
• Mass Defect Is Between –0.116 and 0.268 In Type Standard MD
• Peak Rating Is Greater Than 3.00 In Any Sample
• Class Coverage Is Greater Than or Equal To 0.25 In Compound Class PFAS
General from FluoroMatch Suite
• Background is False
OR (One of these conditions must be true in addition to the four AND filters)
– mzCloud Best Match Is Greater Than 75.00
– Class Coverage Is Greater Than 0.00 In Compound Class PFAS Fine
Signature Fragment_Lib
– mzCloud Best Sim Match Is Greater Than 75.00
Figure 24. Result filter to reduce the number of compounds displayed in the Compounds
table to high confidence candidates with matching fragments
b. Click Apply.
Only compounds with a high confidence of belonging to the PFAS family remain
visible in the Compounds table.
3. Visualize the relationship among the remaining compounds in a scatter plot as follows:
a. In the application menu bar, choose View > Result Charts.
The Scatter Chart plot opens.
b. In the Data Source list, select Compounds.
c. Make the following selections for the three axes:
• For X Data, select Calc. MW.
• For Y Data, select Mass Defect: Kendrick MD [CF2].
• For Z Data, select RT [min].
d. Click Refresh to display the plot.
In Figure 25, data points from the filtered Compounds table are plotted in three
dimensions using the Kendrick MD of approximately 50 Da or exactly one CF2 to
elucidate the presence of homologous PFAS series. Homologous PFAS series will share
the same Kendrick MD but differ in molecular weight by 50 Da and have increasing
retention time based on PFAS chain length. The retention time is color coded as a third
dimension to provide a simple verification of this trend.
Two homologous series are identified at Kendrick MD [CF2] of -0.03 and -0.015
(highlighted by the two red rectangles in Figure 25). In the longer series, there are two
overlapping PFAS with a molecular weight of 349.9471 Da, which corresponds to
perfluoropentanesulfonic acid. To resolve this overlap, one data point is plotted as a
triangle instead of a circle. The compound that corresponds to the blue circle does not
follow the retention time trend of increasing retention with increasing chain length. This
retention time deviation is most likely explained by an alternate branching structure.
Figure 25. Scatter chart with a plot of the Kendrick mass defect for C F2 (y axis) against the
Calc. MW of the compound (x axis) and the retention time of the compound (z axis)
C5HF11O3S
C4HF9O3S C6HF13O3S
Overlapping PFAS
4. After you filter the Compounds table to display only potential PFAS, visualize the
molecular network between the compounds as follows:
a. Right-click the Compounds table and choose Molecular Networks > Send to
Viewer.
The Export Molecular Networks dialog box opens.
b. Select the Open Viewer After Export check box, and then click Export.
The viewer opens in a web browser where you can adjust the settings to optimize
clustering. For PFAS, class-based clustering is based on fragmentation and the
detection of homologous series by selecting the PFAS chain shortening
transformation in the Generate Molecular Networks node. This transformation
accounts for the addition or removal of up to ten CF2 moieties that correspond with
the expected differences in chain length for PFAS homologous series.
For LC studies, you can add one of the following two nodes to the processing workflow to
impute missing peak areas: Fill Gaps or Apply Missing Value Imputation. The processing
workflow templates for LC studies use the Fill Gaps node, which fills the gaps for missing
chromatographic peaks by revisiting the original XIC traces and searching below the noise
threshold. Use the Apply Missing Value Imputation node only if the Fill Gaps node does not
give you the expected results. The Apply Missing Value Imputation node is a peak area
refinement node that does not revisit the original data stream.
To avoid distorting the CVs of the sample groups with missing or partially missing peaks, this
imputation method does the following:
• Adds a corresponding variability to the mean for a partially missing group.
• Adds the smallest value detected in the input file divided by 2 for a completely missing
group.
• Calculates the typical (or local) standard deviations by using local estimations (depending
on the median of the areas) of the compound CVs. It uses a maximum number of 50
neighbors to estimate a local CV. The number of neighbors is lower than 50 if there are
fewer than 100 compounds with at least three sample-replicate values in the dataset. After
it estimates the local CVs, it removes the local outliers and estimates the local median
CVs with the remaining data.
• Calculates the mean (or minimum value) plus the added variability as a random value
(positive or negative). The added variability follows a Gaussian distribution that is
centered at the mean (or minimum value) with a standard deviation that corresponds to
the estimated typical standard deviation for similar area values for other study groups in
the analysis.
For details about the parameter settings for the Fill Gaps node, see “Fill Gaps node.”
Note For information about the terminology used in the Compound Discoverer
application, see “Terminology used in the Compound Discoverer application.”
Preliminary information:
• Available sample types
• Directory structure for Compound Discoverer studies
• Create a study template file that contains all the study information
For information about selecting the sample types, see “Select the sample types.”
Table 13. Sample types
Sample type Application use
a
Sample Detects the compounds in the sample.
Controla Detects the compounds in the sample.
Blank Detects the compounds in the sample. When the processing
workflow includes one or both Mark Background Compounds
nodes, marks these components as background compounds.
Identification Only Does not report the chromatographic peak areas for the
(LC studies only) compounds detected in this sample type. Uses the fragmentation
scans in this sample type for component identification when the
processing workflow includes the Group Compounds node.
Quality Control Detects the compounds in the sample. Pools the QC samples to
determine a group area for each detected compound for area
normalization.
Standarda Detects the compounds in the sample.
Labeled Determines the formulas for the compounds in the labeled
(LC studies only) samples.
a
The application attributes the same functionality to the Sample, Control, and Standard sample types. You can use
the Control and Standard sample types to label your control or standard samples; that is, you can use these sample
types as an additional study variable for grouping.
Note You can create more than one top-level folder for your studies. Each time you open
the New Study and Analysis Wizard, it opens to the last opened top-level folder.
Study files include a list of input files with their location, the sample information for each
file, and a list of the analyses run within the study. The sample information includes the
file name, study factor values, and sample type for each sample.
Result files contain the results of an analysis in tabular form. In addition, you can access
multiple views for visualizing the analysis results when you open a result file in the
application window.
This figure shows an example of the directory hierarchy for a study named Omeprazole Study
with one result file. The individual study folder and the study file share the same name.
Top-level folder where the application stores the individual study folders
Individual study folder
Contents of the
Omeprazole Study
folder
Create a study template file that contains all the study information
There are two ways to create a study template file.
For information about selecting a study template file for a new study, see “Define the study
type, name the study, and optionally select a study template and a processing workflow.”
6. In the Export to Excel dialog box, select the folder where you want to store the Excel
spreadsheet, select the All Items option, select the Open File After Export check box,
and then click Export.
The exported spreadsheet opens in the spreadsheet application.
7. Change the File Name column heading to File.
8. Save the spreadsheet.
Define the study type, name the study, and optionally select a study
template and a processing workflow
To start the New Study and Analysis Wizard, see “Start the New Study and Analysis Wizard.”
The wizard opens to the Study Name and Processing Workflow page.
Use the Study Name and Processing Workflow page (step 1 of 5) of the New Study and
Analysis Wizard to do the following:
• Select the study type: LC or GC (chromatography type = liquid chromatography or gas
chromatography).
• Name the current study. This step automatically creates a study folder and a study file
with the same name. See “Directory structure for Compound Discoverer studies.”
• Select a current folder or create a new top-level folder for storing all the study folders or a
subset of the study folders.
• (Optional) Select a template file for the study.
A study template file is either a study file that includes defined study factors or a
tab-delimited text file or spreadsheet file that includes all the study information. The
study information includes the location, sample types, and study factors for all the input
files in a study.
• (Optional) Select a processing workflow for the analysis.
Note You can select a processing workflow after you create the study.
Figure 28. New Study and Analysis Wizard – Study Name and Processing Workflow page
For more information about the Study Name and Processing Workflow page of the wizard, see
these topics:
• Select the study type
• Name the new study, select the top-level studies folder, and optionally select a study
template
• Select a processing workflow for the analysis
The Workflows list in the Processing area includes the processing workflow templates that are
available for the study type that you selected.
Note The wizard retains this selection (LC or GC) after you close it. The next time you
create a new study, the Study Type area displays your last selection.
You can process either GC/MS data or LC/MS data from within a study; you cannot
process both types of data from within the same study. After you create a study, you
cannot modify its study type.
You can perform step 1 through in any order—that is, naming the study, selecting the
studies folder, and selecting a processing workflow are independent of each other.
Name the new study, select the top-level studies folder, and optionally select a
study template
The New Study and Analysis Wizard is open to the Study Name and Processing Workflow
page and you have selected the study type. See Start the New Study and Analysis Wizard and
Select the study type.
Y To name a new study, select the top-level studies folder, and optionally select a study
template
1. In the Study Name and Directory Structure area, type a name for the new study in the
Study Name box.
Note When you create a new study, the application uses the study name that you
specify for two items: the study file (.cdStudy) and the study folder where it stores the
study file. The application stores the result files from the analyses run within the study
in this named study folder. For details, see “Directory structure for Compound
Discoverer studies.”
Note The first time you create a study, the studies folder is undefined and the Studies
Folder box is empty and outlined in red. If you have already created at least one
studies folder, the Studies Folder box is populated with the name and location of the
last studies folder that you created.
2. Use the current folder name and location that is displayed in the Studies Folder box or do
the following to select an existing folder or create a new folder:
a. Click the browse icon to the right of the Studies Folder box.
Figure 29. Undefined studies folder
b. Browse to an existing folder or use the New Folder command to create a new folder.
The name and location of the new folder appear in the Studies Folder box.
Note You can store all your study folders in a single top-level folder or categorize
your studies by setting up multiple top-level folders. See “Directory structure for
Compound Discoverer studies.”
Tip To create two studies with the same name but different data types, store the
two studies in different folders.
3. (Optional) To select a study template file, click the browse icon, , next to the Study
Template File box. Then, do one of the following:
• To import only the study factors for the study, browse to and select a study file
(.cdStudy).
• To import all the study information, browse to and select a text file or a spreadsheet
file that contains the study information in a compatible format.
Note To create a spreadsheet that contains all the study information, see “Create
a study template file that contains all the study information.”
If you are creating a new study and a new analysis in the New Study and Analysis Wizard,
click Next to go to the Input File Selection page of the wizard.
For information about starting the wizard and defining the study, see “Define the study type,
name the study, and optionally select a study template and a processing workflow.”
Note The example Xcalibur RAW files for the tutorials are on the Compound Discoverer
USB key in the software media kit or in the software media that you downloaded from the
Life Sciences and Mass Spectrometry Software Download and Licensing Portal website
page. The software installation process does not install the example files on your
processing computer.
The file names of the selected files appear in the Files box, the number of files that you
selected appears below this box, and the Next button becomes available.
Figure 32. Input File Selection page with selected files (brown O-rings and red O-rings)
Tip To remove any of the added files, select them and click Remove Files.
–or–
• You can use the Input File Characterization dialog box that opens when you add input
files to an existing study.
The sample information includes the sample type and study factor values for each sample.
For information about starting the New Study and Analysis Wizard, see “Start the New Study
and Analysis Wizard.”
Figure 33. Input File Characterization page of the wizard (with 12 O-rings)
Opens the Extract Sample Information from Sample Names dialog box
Automatically assigns the defined study factor values
To characterize the input files for a study, follow these steps as applicable:
1. Select the delimiters for parsing the file names.
2. Add or edit the study factors.
3. To select or assign the study factor values to the samples (in the Samples area), see the
following topics as applicable:
• Automatically assign the study factor values
• Manually select the study factor values
• Reset the sample assignments
4. Select the sample types.
Y To select the delimiters that separate the factors in the file names
1. Select the check box or check boxes of the delimiter or delimiters for the input file names.
For example, the underscore character is the delimiter that separates the study factors
from the other parts of the file name in the following file names:
In this case, the study factors are O-ring color and solvent. The O-ring colors are red and
brown. The solvents are water and ethanol.
Blank_100EtOH_1 Blank_H2O_1
Blank_100_EtOH_2 Blank_H2O_2
Red_100EtOH_1 Red_H2O_2
Red_100EtOH_2 Red_H2O_1
Brown_100EtOH_1 Brown_H2O_1
Brown_100EtOH_2 Brown_H2O_2
2. If the delimiter is not available, select the Other check box and type the delimiter
character in the box.
1. From the menu bar at the top right of the Study Factors pane, choose Add > Categorical
Factor.
The categorical factor editor appears with [new factor] automatically selected.
Figure 34. Categorical factor editor
2. Select the [new factor] text and type the name of the factor.
For example, type Color.
3. For each item that you want to add to the Items list, do the following:
a. In the Items box (to the left of the Add button), begin typing a factor.
For example, type Red for one of the Color study factor items.
If the file name contains a character delimiter (underscore, hyphen, comma, space,
plus, or other defined character) and you selected the check box for this delimiter, the
editor automatically enters the appropriate text in the Items box as you start typing.
Otherwise, you must type all the characters for the item.
Note The file name parsing feature is not available on the Study Definition page
of an existing study, so you must type all the characters for the item.
Add button
b. Click Add.
The current item appears in the Items list.
Tip To continue editing the study factor, click Edit. The editor expands, providing
access to the study factor name and items.
1. From the menu bar at the top right of the Study Factors pane, choose Add > Numeric
Factor.
The numeric factor editor appears with [new factor] automatically selected.
Figure 35. Numeric factor editor
Add button
b. Click Add.
The value appears in the Values list in ascending order.
5. To save the study factor, click Apply.
The factor editor collapses to show only the study factor name and the Values list.
Figure 37. Numeric factor with a list of values
Tip To continue editing the study factor, click Edit. The editor expands, providing
access to the study factor name and items.
You can add only one biological replicate factor to a study. Use the biological replicate factor
for nested statistical models—that is, for studies that include one study factor nested within
another study factor.
1. From the menu bar at the top right of the Study Factors pane, choose Add > Biological
Replicate Factor.
The biological replicate factor editor appears with [new factor] automatically selected.
Figure 38. Biological replicate factor editor
Tip To continue editing the study factor, click Edit. The editor expands, providing
access to the study factor name and items.
1. In the Study Factors pane, click in the title bar of the factor.
Note Because the application cannot recognize whether a study factor is in use, this
prompt appears even when you attempt to delete an undefined study factor.
Y To create a new study factor by using the Copy and Paste commands
1. In the Study Factors pane, select the factor that you want to copy.
The title bar of the selected factor turns blue.
2. Click Copy. Then, click Paste.
A copy of the selected factor appears.
Note You can only manually assign study factor values on the Samples page of an existing
study.
1. If you have not already set up the study factors, set them up. See “Add or edit the study
factors.”
Figure 39. Input file Characterization page with two study factors—Color and Solvent
2. In the command bar of the Input File Characterization page or dialog box, click Assign.
The application assigns the study factors to the samples. See Figure 40.
Figure 40. Input file Characterization page with assigned sample types and study factors
3. Check the study factor columns, verify the sample assignments, and manually assign the
study factor values if necessary.
On the Input File Characterization page (or dialog box) or on the Samples page of an
existing study, do the following:
• To select the factor values for a single sample, select the appropriate value from the
list in each factor column.
• To select the same value for a consecutive sample range, drag the pointer across the
rows of interest. Then right-click and choose Set Factor To > Value.
• To select the same value for nonconsecutive samples, hold down the CTRL key and
click the samples of interest. Then right-click and choose Set Factor To > Value.
Note To edit the values for a study factor, you must first clear the sample assignments
if the values are assigned to samples.
By default, the selected sample type for each input file is Sample. Clicking Assign on the Input
File Characterization page (or dialog box) automatically assigns the Blank sample type to
samples with a file name that includes “blank” as a delimited value (for example,
solvent_blank_1.raw, where the underscore character is the delimiter).
Extract study factor values from the file names of the input files
Use the Extract Sample Information From Sample Names dialog box to automate the setup
and assignment of the study factors. This dialog box is accessible from the Input File
Characterization page of the New Study and Analysis Wizard or from the Input File
Characterization dialog box that opens when you add input files to a study.
Note For the extraction and assignment process to work, you must select the study factor
portions of an example input file name and define these portions appropriately as
categorical, numerical, or biological replicate factors.
When the undefined portions of the file name are not exactly the same for all the samples
that you want to characterize, you can mark the text to be ignored or you can manually
assign or edit the study factor values for these samples after returning to the Input File
Characterization page or dialog box.
1. Open the Input File Characterization page (step 3 of 5) of the wizard or the Input File
Characterization dialog box from within an existing study.
Note When you are working within an existing study, the Input File Characterization
dialog box opens when you add files to the study. You cannot open the Input File
Characterization dialog box to characterize raw data files that are already linked to the
study.
2. Click Advanced.
The Extract Sample Information From Sample Names dialog box opens with the first file
name displayed in the Regular Expression Builder box and the file names of all the
selected input files listed below it. See Figure 41.
Figure 41. Extract Sample Information From Sample Names dialog box
3. If the Regular Expression Builder box does not contain a representative sample name,
right-click a representative sample name in the Sample Name column and choose Use
Name of Selected Sample as Regular Expression Template.
Figure 42. Urine_0-4hr_01 sample name selected
Right-click command
5. If the study factor columns do not populate as expected, modify the regular expression
operators within the square brackets.
Table 14. Examples of regular expression operators
Operator Quantifier Matching pattern
[0-9] none A single integer
[0-9] + One or more integers
[A-Z] none One alphabetic character
[0-9A-Z] none One alphanumeric character
[0-9A-Z] + One or more alphanumeric characters
[0-9A-Z-] + One or more alphanumeric characters, one or more
hyphens, or both (for example, 555-0000)
[0-9A-Z] [0-9A-Z] + One or more alphanumeric characters, a space, and
one or more alphanumeric characters (for example,
ABC 123)
For example, to populate the rows for the other file names with defined time periods, add
a hyphen to the operators in the square brackets. Figure 45 shows the hyphen character
added to the operator set for a categorical factor.
Figure 45. Hyphen building block added to the regular expression for a categorical factor
Original operator set = [0-9A-Z]
Modified operator set = [0-9A-Z-]
6. If the sample names include extra text that differs from sample to sample and does not
define a study variable, exclude this text from the regular expression by selecting it and
clicking .
Note Figure 46 and Figure 47 show a different sample set than the previous figures.
For example, in the sample set shown in Figure 46, exclude all the characters to the right
of the categorical variable by selecting _01_MDF and clicking .
Figure 46. Selecting the text that you want to exclude from the expression
The regular expression builder replaces the selected text with the following expression:
(?:.+)?
Figure 47 shows the result of these actions:
• Selecting the time period (0-3) as a categorical variable
• Adding a hyphen to the categorical variable expression
• Selecting “_01_MDF” as text to ignore
Figure 47. Regular expression that defines the categorical variable and the text to ignore
7. To enter the study factor names, type the study factor names in the column-heading
boxes.
For example, replace CategoricalVariable_1 with Time Period and NumericalVariable_1
with Replicate (Figure 45 and Figure 48).
Figure 48. Study factors renamed
Tip You can edit the study factor names in two ways:
• In the column-heading boxes, replace the default column headings with the study
factor names.
• On the Input File Characterization page, edit the study factors.
8. Click OK to return to the Input File Characterization page where you can modify the
values in the study variable columns if necessary.
Tip The Regular Expression Builder does not assign sample types. To assign Blank as
the sample type, click Assign on the Input File Characterization page after selecting
the appropriate delimiters.
For example, select the Underscore check box on the Input File Characterization page
and click Assign, for the following file name: Solvent_Blank_1.
Table 15 describes the parameters in the Extract Sample Information From Sample Names
dialog box.
Table 15. Extract Sample Information From Sample Names dialog box parameters
Parameter Description
Regular Expression Builder
Initially displays the first file name in the Sample Name list. Use this file name (or a
different file name in the list) to build a regular expression that extracts the study factor
values for the file names in the Sample Name list.
Buttons
[0-9] Identifies a numeric factor.
Note The Sample Groups and Ratios page of the New Study and Analysis Wizard is the
fourth page of the wizard. To open the fourth page of the wizard, you must select one or
more input files on the second page of the wizard.
Note When you start a new analysis in an existing study, you must manually add the
input files to the Files for Analysis area of the Analysis view.
IMPORTANT Do not create ratios with sample groups that have an n/a assignment for a
study variable. To use a sample group in a ratio, you must replace the n/a assignment with
a study factor value.
The Study Variables area contains a File check box, a Sample Type check box, and an
additional check box for each study factor you created on the Input Files Characterization
page of the wizard or the Study Definition page of the current study.
Figure 49 shows the Sample Groups and Ratios page of the wizard. The Generated Sample
Groups area lists the input files for the analysis.
Note To start a new analysis from within an existing study, click New Analysis in the
study command bar. Or, reprocess an existing analysis on the Analysis Results page of
a study.
• Sample Groups and Ratios page of the New Study and Analysis Wizard
Note The Sample Groups and Ratios page of the New Study and Analysis Wizard is
the fourth page of the wizard. To open the fourth page of the wizard, you must select
one or more input files on the second page of the wizard.
In the Study Variables area of the page, select the check boxes for the study variable or
variables for sample grouping.
Selected study variables have a light green background, a dark green handle ( ) on the
left, and a Sorting icon ( )on the right.
Review the generated sample groups and fix any assignment errors
After you select the study variables for sample grouping, the Generated Sample Groups area
displays the generated sample groups.
The hierarchy of the selected study variables in the Study Variables area affects the sample
group names and the denominator list for bulk ratio generation. See “Change the hierarchy of
the study variables.”
The naming scheme for the study groups in the Generated Sample Groups area is as follows:
• Group names (green) consist of the common values for the selected study variables.
• Sample names (blue) consist of a unique ID and the input file name.
Following data processing, the result file displays the chromatographic peak areas for
individual samples and the named sample groups.
When selecting the study variables generates an n/a sample group that includes Sample,
Control, or Standard sample types, the application highlights the selected study variable and
the n/a group in red and displays the following error message:
There are samples with unset study variables selected for grouping.
Y To fix an assignment error on the Sample Groups and Ratios page of the wizard or the
Grouping & Ratios page of an analysis within an existing study
1. Check the sample type and study factor assignments of the input files as follows:
• If you are creating a new study, in the New Study and Analysis Wizard, return to the
Input File Characterization page.
• If you are modifying an existing study, open the Samples page.
2. Do any of the following:
• Assign study factor values to the Sample, Control, or Standard sample types. If
necessary, create new study factor values for these samples.
• For LC studies, change the sample type assignment for samples without a study factor
value to Blank, Identification Only, or Quality Control as appropriate.
3. Do one of the following:
• In the New Study and Analysis Wizard, return to the Sample Groups and Ratios page
and verify the assignments.
• In an existing study where you have started a new analysis, return to the Grouping &
Ratios page.
The hierarchy of the selected study variables in the Study Variables area affects the sample
group names and the denominator list for bulk ratio generation.
Place the pointer over the handle ( ) to the left of a variable name. When the move
cursor ( ) appears, drag the variable up or down in the list.
Figure 51. Changing the hierarchy of the study variables
Move cursor
Click the Sorting icon for the study variable that you want to sort by and choose Sort
Ascending or Sort Descending.
Figure 52. Sort order shortcut menu for the study variables
For details about setting up the group ratios, see the following topics:
• Set up the group ratios individually
• Set up the group ratios by using the bulk ratio generator
The application assumes you are setting up the group ratios for a differential analysis. When
you set up at least one group ratio, information about the statistical test that the Differential
Analysis node will perform appears in the Generated Ratios area.
Tip The Differential Analysis node is a post-processing node in the processing workflow
for an analysis. When you set up group ratios for an analysis that does not include the
Differential Analysis node, a validation prompt appears when you submit the analysis to
the job queue. If you do not want the analysis to perform a differential analysis, you can
click Ignore to dismiss the prompt and start the run.
The workflow templates provided with the application that include Statistics in their file
names include the Differential Analysis node.
In the Manual Ratio Generation area of the page, select a group from the Numerator list,
select a group from the Denominator list, and click Add Ratio. Repeat this step for each
pair of groups that you want to compare.
Note The hierarchy of the selected study variables in the Study Variables area affects the
sample group names and the denominator list for bulk ratio generation and the sort order
of the study variables affects the denominators to be used in the Bulk Ratio Generation
area. For details, see “Change the hierarchy of the study variables,” and
1. Modify the sort order and hierarchy of the study variables as appropriate.
2. In the Bulk Ratio Generation area, select one or more denominators and click Add
Ratios.
When you select more than one variable in the Study Variables area and the selected
variables have multiple values, the Select/Deselect Item in All Groups icon, , appears
when you place the pointer near a denominator in the Denominators to Be Used area.
To automatically select multiple check boxes for the same value in all groups, select one of
the check boxes for the value, and then click the Select/Deselect Item in All Groups
icon, .
Clicking Finish on the Input File Selection page (after selecting input files), on the Input File
Characterization page, or on the final wizard page opens the study page and the Analysis view.
The Analysis view contains the selected input files. If you selected a processing workflow on
the Study Name and Processing page of the wizard, the Workflow box on the Analysis view
displays the name of the processing workflow, and the Workflows page contains the
processing workflow.
Before starting a run, you can edit the study, the processing workflow, the input file list, and
the result file name. Some of the defined processing workflows require customization. For
example, for a targeted analysis, you must select the target compounds for the Generate
Expected Compounds node. For information about customizing a processing workflow, see
“Customize the processing workflow.”
If the processing workflow is valid and the Analysis view contains one or more input files, the
Run command at the top right of the Analysis view is available.
If the Run command is unavailable, you must fix the analysis errors. See “Troubleshoot
common analysis errors.”
These topics describe how to set up additional tables, lists, or files for use in a processing
workflow:
• Set up individual isotope patterns by using the Isotope Ratio Editor
• Create an isotope patterns list by using the Pattern List Editor
• Extract analog and PDA traces
Tip You cannot set up the sample groups and ratios on the Grouping and Ratios page
without first adding the appropriate input files to the Analysis view.
To set up a new analysis from within an existing study, follow these steps:
1. Start a new analysis from within an existing study
2. Select a workflow template
3. Customize the processing workflow
Tip You can also start a new analysis by opening an analysis template.
The following procedure describes how to select a processing workflow template or load an
existing processing workflow from a result file. For information about creating a new
processing workflow from scratch, see Chapter 6, “Create and edit processing workflows.”
Command bar
Workflow box
2. In the Workflow Tree area on the Workflows page, create a new processing workflow or in
the Workflows command bar, do one of the following to select an existing processing
workflow:
• To select a standard processing workflow from the Common Templates folder, click
Open Common, select the folder for the field of interest, select one of the provided
processing workflow templates, and click Open.
The file name appears in the Workflow box, and a description of the processing
workflow appears in the Description box.
• To select a processing workflow from another folder, click Open, locate the
processing workflow file (.cdProcessingWF) of interest, and click Open.
The file name appears in the Workflow box.
• To load the processing workflow that you previously used to process a specific result
file, click Open, locate the result file (.cdResult) of interest, and click Open.
3. To customize the processing workflow, see the next topic, “Customize the processing
workflow.”
Table 17 lists the workflow nodes in processing workflows within LC studies that require
custom settings.
Table 17. Workflow nodes that require customization for an LC study (Sheet 1 of 2)
Node Required customization
Generate Expected From the Compound list, select library compounds.
Compounds
In the defined workflow templates, this selection is empty.
Create FISh Trace From the Compound list, select a library compound.
Table 17. Workflow nodes that require customization for an LC study (Sheet 2 of 2)
Node Required customization
Search Mass Lists From the Input Files list, select mass lists.
For LC studies, Thermo Fisher Scientific has optimized most of the parameter settings in the
defined processing workflow templates by the area of study (vertical market). Table 18 lists the
parameters that usually require a different setting, regardless of whether you use one of the
defined templates or create your own processing workflow.
Table 18. Parameter modifications for workflow nodes
Node Parameter settings to optimize or modify
Create Analog Trace Select the trace of interest.
Create Mass Trace Select the trace of interest.
Generate Expected Set up the dealkylation and transformation steps and select
Compounds the appropriate ions.
Search nodes Select the libraries or lists of interest.
Detect Compounds For the recommended minimum peak intensity settings for a
(Legacy) specific mass spectrometer model, see Table 19.
Group Compounds Set up a peak rating filter to remove low-quality
chromatographic peaks from the analysis.
Group Expected Set up a peak rating filter to remove low-quality
Compounds chromatographic peaks from the analysis.
Table 19 lists the recommended range for the minimum peak intensity setting in the Detect
Compounds (Legacy) node for an LC study. The optimal setting depends on the sensitivity of
the mass spectrometer.
IMPORTANT Unless you are creating a new analysis for comparison with legacy analyses,
Thermo Fisher Scientific recommends that you use the new Detect Compounds node
instead of the Detect Compounds (Legacy) node. With the new Detect Compounds
node, you can use the peak rating filter to eliminate compounds for further processing and
from the analysis result unless their chromatographic peaks pass the peak quality threshold
in a specified number of input files.
1. Open the study of interest and start a new analysis. See “Start a new analysis from within
an existing study.”
2. On the Input Files page or the Samples page of the study, select the files of interest.
3. Right-click the selection and choose Set As Input File, or drag the files of interest to the
Files for Analysis area of the Analysis view.
Go to the next topic, “Edit the file name for the result file to be generated by the analysis.”
Edit the file name for the result file to be generated by the analysis
After you add the input files to be processed to the Analysis view, the application
automatically populates the Result File box with the file name of the first input file. You can
edit the file name.
In the Analysis view, select the default file name in the Result File box and type the new
name.
Go to the next topic, “Select the study variables and set up the group ratios for a comparison
analysis.”
Select the study variables and set up the group ratios for a comparison analysis
To run a differential analysis automatically, you must define the sample groups and ratios for
the new analysis.
Note When the processing workflow includes the Differential Analysis node, the Analysis
Validation confirmation box opens if the analysis does not include defined sample groups
and ratios on the Grouping & Ratios page of the analysis.
1. If you have not already opened an existing study and started a new analysis, do the
following:
a. Open an existing study. See “Open an existing study.”
b. Click New Analysis in the study command bar.
c. On the Workflows page of the analysis, select a workflow template or set up a
processing workflow. See “Select a workflow template.”
d. Select the input files for the analysis. See “Select the input files for the new analysis.”
2. On the Grouping and Ratios page of the analysis, select the study variables and set up
group ratios.
For details about selecting the variables for the analysis and setting up group ratios, see “Set up
the sample groups and ratios for a new analysis,” which describes how to set up the sample
groups and ratios on the Sample Groups and Ratios page of the New Study and Analysis
Wizard.
Tip For LC studies, the application does not know the polarity of the scan data or whether
the data contains data-dependent or data-independent fragmentation (acquisition) scans.
For best results, check the following:
• When the processing workflow contains any of these nodes—Create Mass Trace,
Create FISh Trace, or Create Pattern Trace—verify that the ion polarity setting
matches the data.
• When the processing workflow contains the Create FISh Trace node, verify that the
setting for Fragmentation Mode matches the data.
Y To troubleshoot an analysis
1. In the Analysis view, point to the Caution symbol, , to display the list of missing
analysis items, or check the error information in the Current Workflow Issues pane
(Figure 56) below the Post-Processing pane on the Workflows page.
2. Using the information provided in the list of missing items, fix the analysis errors until
the Caution symbol disappears. See Table 20.
Figure 55. Error messages for a defined processing workflow in an LC study
For example, Figure 55 shows the error message that appears before you customize the
defined processing workflow provided in the Common Templates folder:
Degradants w Statistics Expected w FISh Scoring and Unknown ID w Online and
Local Database Searches.cdProcessing WF
Figure 56 shows the corresponding error messages in the Current Workflow Issues pane.
Figure 56. Current Workflow Issues pane with issues for an LC study
In addition, each workflow node that is missing a value for one of its parameters has an
exclamation mark, , in its upper-right corner.
To fix the analysis errors for this example, you must select a compound for the Create
FISh Trace node, one or more compounds for the Generate Expected Compounds node,
and a mass list for the Search Mass Lists node.
Tip You can update all the libraries and lists when an analysis is open—that is, the
available selections in the workflow nodes that include selections from these libraries
and lists automatically update when you modify the associated libraries and lists.
Go to “Submit an analysis to the job queue and fix any validation issues.”
Note To troubleshoot validation issues, which occur after you submit a run, see
“Common validation issues.”
Submit an analysis to the job queue and fix any validation issues
To submit an analysis to the job queue, understand the validation issues that might occur, and
control the job queue, see these topics:
• Submit an analysis to the job queue
• Common validation issues
1. Decide whether you want to create one result file for the entire set of input files or one
result file for each input file.
• To create a single result file, leave the By File check box clear.
• To create a separate result file for each input file, select the By File check box.
2. (Optional) In the Result File box, type a name for the result file to overwrite the default
name.
–or–
• Do not change the analysis settings and
click Ignore.
The analysis includes sample groups and On the Workflows page, add the
ratios, but the processing workflow does not Differential Analysis node to the workflow.
include the Differential Analysis node.
IMPORTANT If the analysis does not include sample ratios or the Differential Analysis
node, the application does not run a differential analysis, and the Compounds and
Expected Compounds tables in the result file does not include the following columns:
Ratios, Log2 Fold Change, P-value, and Adj. P-value.
The application can process two runs simultaneously. If you submit a second run while the
first run is being processed, the status of the second run changes to Not Queued, Running, or
Execution Failed. If you submit a third run while the application is processing the first two
runs, its status changes to Waiting.
Figure 57. Job Queue list with runs in various processing states
Command
bar
To work with the jobs on the Job Queue page, see the following instructions.
• Open the Job Queue page
• Cancel a run
• Promote a run
• Remove completed or failed jobs from the Job Queue
• Refresh the Job Queue list
• Open a result file from the Job Queue page
• Display verbose messages on the Job Queue page
• View the processing steps for a job
• Filter the Job Queue list
• Job Queue page parameters
From the Compound Discoverer menu bar, choose View > Show Job Queue.
The Job Queue page opens as a tabbed document.
Cancel a run
You can cancel a run before it ends—that is, you can cancel a run when it is in the Waiting or
Running state.
1. On the Job Queue page, select the run that you want to cancel.
2. In the command bar on the Job Queue page, click Abort.
When you cancel a run, its status changes to Aborted, and the application does not create
a result file for the run.
Promote a run
If the application has not started processing a job, you can promote the job one position at a
time. You cannot promote a job to a position in front of a job that is already running—that is,
you cannot pause a run in progress to promote a job in front of it.
Y To promote a run
1. On the Job Queue page, select a job that is waiting in the job queue list.
2. Click Promote.
1. On the Job Queue page, select the jobs that you want to remove.
2. In the command bar on the Job Queue page, click Remove. Then, at the prompt, click
OK.
The selected jobs disappear from the Job Queue list. The application does not remove the
result files from the study.
On the Job Queue page, click the expand icon, , to the left of the job row.
Note Use the filters for the column that you want to sort by. For example, to display
only the runs that you ended before completion, follow this procedure.
1. Click the icon, , to the left of the Execution State filter list and select Equals.
Use the Analysis Results page of a study to review or reprocess a completed analysis or to open
a result file. An analysis consists of a processing workflow, optional sample groups and ratios,
and the selected input files.
Note The Execution State on the Job Queue page updates more quickly than the
Execution State on the Analysis Results page.
Selecting an analysis in the list on this page makes these commands available.
Open Result Opens the selected result file.
Reprocess Opens the Analysis view with the list of input files that were used
for the selected analysis. The Workflows page contains the
processing workflow, and the Grouping & Ratios page contains
the sample groups and ratios used for the selected analysis.
Search box Use this box to type text to search for.
Search For Use this dropdown list to select the column to search by.
Table columns
Execution State Displays the status of the analysis.
Creation Date Displays the date and time when you submitted the run to the job
queue.
File Name Displays the file name of the result file.
Review an analysis
Use the Analysis Sequence Details window to review an analysis.
Y To review an analysis
1. On the Analysis Results page of a study, right-click the analysis and choose Show Details.
The Analysis Sequence Details window opens and displays the processing workflow in the
Workflow Tree pane and the result file name and input files in the Processing Step pane.
2. To view the parameter settings for a workflow node, select it in the Workflow Tree pane.
The parameter settings appear in the Parameters pane at the left.
Note The Processing Step is locked, preventing you from reprocessing the analysis
from this window.
Figure 59. Analysis Sequence Details window showing the parameter settings for the Detect
Compounds node (LC study)
Reprocess an analysis
You can reprocess an entire analysis or only the post-compound detection portions of an
analysis.
Note Because result files (analysis results) for untargeted analyses from previous versions
of the Compound Discoverer application contain an obsolete Detect Compounds node,
you cannot partially reprocess these legacy analysis results. You can fully reprocess these
legacy analyses after you replace the “not available” node with the current Detect
Compounds node or the Detect Compounds (Legacy) node. See “Reprocess a legacy
analysis result from an untargeted analysis.”
Y To reprocess an analysis
1. On the Analysis Results page of a study, right-click the analysis and choose Reprocess.
• If the Analysis view is open and contains information for an analysis that you have
not yet submitted to the job queue, the following prompt appears:
The analysis was not started. Do you really want to discard the analysis?
• If the Analysis view is closed, it reopens with the settings from the selected analysis. In
addition, the Grouping & Ratios tab and Workflows tab appear to the right of the
study tabs.
2. If prompted, do the following:
a. Check the Analysis view and decide whether you want to discard the current settings.
b. Click Yes to replace the current settings in the Analysis view with the settings from
the selected analysis. Otherwise, click No to return to the in-progress analysis.
3. Do one of the following:
• To reprocess the entire analysis, go to step 5.
• To reprocess only part of the processing workflow, go to step 4.
Note For LC studies, you can add or reprocess any of the Peak Area Refinement
nodes, Pathway Mapping nodes, Search nodes, and Compound Scoring nodes
without reprocessing the entire workflow.
You must reprocess the entire workflow to reprocess any of the nodes for Spectrum
Processing, Trace Creation, Compound Detection, or Expected Compounds (except
for the Merge Features node).
4. To reprocess only part of the processing workflow or add workflow nodes downstream of
the core processing workflow, do the following:
a. Click the Workflows tab.
The processing workflow appears in the Workflow Tree (Reprocess) area. All the
nodes are white with a gray tab, which indicates that they are not set for reprocessing.
Tip When you attempt to reprocess an analysis result from a previous version of
the Compound Discoverer application, some of the processing workflow nodes
are obsolete and unavailable for reprocessing. The gear icon and n/a label indicate
obsolete workflow nodes.
You cannot directly reprocess analysis results with obsolete processing workflow
nodes. To reprocess analysis results from previous versions of the Compound
Discoverer application, you must replace the obsolete workflow nodes with
comparable nodes on the Workflow Nodes page.
Note If you are only partially reprocessing the existing input files, the Source File
row appears in the Analysis view. You cannot edit the source file name.
c. If you are partially reprocessing the data, consider changing the result file name.
If you do not change the result file name, the application automatically appends a
number to the file name when you submit the run.
d. Go to step 7.
5. To reprocess the entire processing workflow, right-click the Input Files node and choose
Reprocess. Then, at the prompt, click OK.
6. If necessary, make changes to the sample groups and ratios on the Grouping & Ratios
page and the settings in the Analysis view.
Note When you change any of the settings on the Grouping & Ratios page or the
files for analysis in the Analysis view, the application automatically reprocesses the
entire processing workflow.
1. On the Analysis Results page of the legacy study, right-click the analysis and choose
Reprocess.
• If the Analysis view is open and contains information for an analysis that you have
not yet submitted to the job queue, the following prompt appears:
The analysis was not started. Do you really want to discard the analysis?
• If the Analysis view is closed, it reopens with the settings from the selected analysis. In
addition, the Grouping & Ratios tab and Workflows tab appear to the right of the
study tabs.
2. If prompted, do the following:
a. Check the Analysis view and decide whether you want to discard the current settings.
b. Click Yes to replace the current settings in the Analysis view with the settings from
the selected analysis. Otherwise, click No to return to the in-progress analysis.
3. To open the Workflows page of the analysis, click the Workflows tab or click Edit to the
right of Processing Step (Partially Reprocessing).
c. On the Workflow Nodes page, drag one of the detect compounds nodes to the
Workflow tree area.
The detect compounds node automatically connects to the Merge Features node and
the Group Compounds node, but the Align Retention Times node does not
automatically connect to the detect compounds node.
Note When you replace the (n/a) Detect Compounds node with the Detect
Compounds (Legacy) node and you want the node to yield results that are similar
to the previous version of the Compound Discoverer application, keep the default
setting of False for the Use Peak Quality for Isotope Grouping parameter.
The User Peak Quality for Isotope Grouping parameter is an advanced parameter
under Isotope Pattern Detection.
d. Connect the Align Retention Times node to the detect compounds node.
6. Make other changes to the processing workflow as appropriate.
7. To start the analysis, click Run.
The name of the first input file automatically populates the Result
File box. If the analysis creates only one result file, you can type a
name for the result file in the Result File box.
For LC studies, you can access the Isotope Ratio Editor dialog box from the Create Pattern
Trace node or the Pattern Scoring node.
1. Open the Isotope Ratio Editor dialog box. See Open the Isotope Ratio Editor.
2. In the Isotope Ratio Definition Type area of the Isotope Ratio Editor dialog box, select
the Define from Elemental Composition Formula option.
Figure 60. Isotope Ratio Editor dialog box with an isotope pattern for a defined elemental
composition
• Resolution: 60 000
• Charge: 1
For example, to enter the elemental composition of omeprazole where only one of the
carbon atoms has been replaced with carbon-13, type C16 [13]C H19 N3 O3 S.
4. Refine the settings for the intensity threshold, resolution, and charge, as appropriate and
in any order:
• In the Int. Threshold [%] box, type the relative intensity threshold.
The application removes isotopic peaks below this relative intensity threshold from
the Mass Shift versus Intensity [%] table.
• In the Resolution box, type the resolution for the scans.
Note The scan header in the raw data file lists the resolution of each scan, and the
instrument method that is associated with the raw data file lists the resolution of
each scan event.
• In the Charge box, type or select the charge state of the ions.
The application uses the charge state to display the theoretical mass spectrum in the
graphical display.
5. Select the check boxes of the required peaks.
6. Open the Isotope Ratio Editor dialog box. See Open the Isotope Ratio Editor.
7. Right-click in the Elemental Composition box and choose Paste.
1. Open the Isotope Ratio Editor dialog box. See Open the Isotope Ratio Editor.
2. Select the Custom Isotope Ratio Pattern option.
Below the Isotope Ratio Definition Type area, the available parameters change.
Figure 61. Custom Isotope Ratio Pattern view
These two options define the parameters that appear below the Isotope Ratio Definition
Type area.
Define from Elemental Selecting this option makes the following features visible:
Composition Formula
• Elemental Composition box
• Int. Threshold [%] box
• Resolution box
• Charge box
Selecting the Define from Elemental Composition Formula option makes the following
parameters visible: Elemental Composition, Int. Threshold [%], Resolution, and Charge.
Elemental Specifies the elemental composition of the compound of interest.
Composition
When you type a composition in this box, the application
automatically creates a table of mass shifts and intensities.
Displays a graph of the full isotope distribution in the mass shift and intensity table.
Use this table to specify the required peaks in the isotope pattern.
When you select the Define from Elemental Composition Formula option, the application
automatically populates this table; you cannot edit the entries.
Required Specifies whether the isotope is required.
(for an isotope pattern • When the check box is selected, the isotope is required.
defined from a
• When the check box is clear, the isotope is not required.
user-specified
elemental composition)
Mass Shift Specifies the mass shift from the pattern base peak (A0).
Intensity [%] Specifies the relative intensity [%] of the isotope to the pattern
base peak.
Rows Specify the values for the isotopes.
Custom isotope ratio pattern view
Selecting the Custom Isotope Ratio Pattern option makes the following buttons visible: Get
from Composition Formula and Get from Clipboard.
Get from Composition Creates an isotope pattern for an elemental composition formula.
Formula The application reads the elemental composition that you entered
in the Elemental Composition box before selecting the Custom
Isotope Ratio Pattern option.
Get from Clipboard Imports the isotope pattern from the Clipboard. You can export a
custom pattern to the Clipboard from the spectrum list view in
the FreeStyle application or from a third-party software
application.
Graph of Intensity [%] versus m/z value
Displays a graph of the full isotope distribution in the mass shift and intensity table.
Mass shift and intensity table
When you select the Custom Isotope Ratio Pattern option, you can edit the mass shift and
intensity values for the isotope pattern.
2. In the Workflow Tree area on the Workflows page, select the Pattern Scoring node.
The parameters pane for the Pattern Scoring node opens at the left.
3. In the Isotope Patterns box, type the elemental composition formulas for the defined
isotope patterns that you want to compare. Separate each pattern with a semicolon and a
space—for example, C15S; C17S.
4. Click the more icon, , to the right of the Isotope Patterns box.
The Pattern List Editor opens. The table contains the elemental compositions that you
entered.
Figure 62. Pattern List Editor
5. Select the isotope pattern that you want to edit, and then click Edit Pattern.
The Isotope Ratio Editor dialog box opens. To edit the pattern, see Set up individual
isotope patterns by using the Isotope Ratio Editor.
6. For each pattern that you want to add, click New, and then set up the isotope pattern in
the Isotope Ratio Editor.
After you add a pattern, it appears in the Patterns table of the Pattern List Editor dialog
box. The Name and Elemental Composition columns display the elemental composition
of the pattern. The text in the Name column is editable.
7. Click OK to close the Pattern List Editor dialog box.
The isotope patterns appear in the Isotope Patterns box of the Pattern Scoring node.
Table 26 describes the commands and table columns in the Pattern List Editor dialog box.
Table 26. Pattern List Editor parameters
Parameter Description
Commands
New Opens the Isotope Ratio Editor dialog box for setting up an
isotope pattern.
Delete Deletes the selected pattern.
Edit Opens the Isotope Ratio Editor dialog box for editing the
selected pattern.
Columns
Name By default, displays either the elemental composition for each
isotope pattern that you create with a defined elemental
composition or the text “CustomPattern #” for each isotope
pattern that you create by setting up a custom isotope ratio
pattern.
Note To extract UV traces and PDA traces in the same analysis, you must add two Create
Analog Trace nodes to the processing workflow. Set up one node to extract UV traces and
the other node to extract PDA traces.
Analog traces include data from a UV-Vis detector, analog data from LC devices controlled by
the Xcalibur data system or an equivalent Thermo Scientific application, and analog signals
from devices connected to the analog channels on the communications panel of your mass
spectrometer.
Note For details about this node, see “Create Analog Trace node.”
3. In the Parameters pane, under General Settings, select UV or Analog from the Trace Type
list.
4. In the RT offset [min] box, type the offset time, in minutes, for the UV-Vis or Analog
trace.
If there is a time difference between when the sample enters the mass spectrometer and
the UV-Vis or analog detector’s flow cell, use the offset time to align the chromatographic
traces. A value of 0 indicates that the UV-Vis or analog detector and the mass
spectrometer simultaneously detected the sample.
5. In the Custom Label box, type text to identify the trace in the Specialized Traces table of
the result file window.
Note For details about this node, see “Create Analog Trace node.”
3. In the Parameters pane, under General Settings, do the following in any order:
• In the Trace Type list, select PDA.
• In the RT offset [min] box, type the offset time, in minutes, for the PDA traces.
If there is a time difference between when the sample enters the mass spectrometer
and the PDA detector’s flow cell, use the offset time to align the chromatographic
traces. A value of 0 indicates that the PDA detector and mass spectrometer
simultaneously detect the sample.
• In the Custom Label box, type text to identify the trace in the Specialized Traces table
on the result file page.
4. In the PDA Settings area, do any of the following:
• To extract a plot of the average intensity of the scanned wavelength range versus time,
select True for Total Scan.
This trace is labeled as a Total Scan in the Specialized Traces table.
• To extract a plot of the spectrum maximum of the scanned wavelength range versus
time, select True for Spectrum Maximum.
This trace is labeled as a Spectrum Maximum in the Specialized Traces table.
• To extract a plot for a specified wavelength range, select True for Wavelength Range.
Then, type the wavelength range in the Min. and Max. Wavelength boxes.
For information about adding and editing study factors, see “Add or edit the study factors,” in
Chapter 3.
Study files
When you create a study with the New Study and Analysis wizard, the application creates a
study file (.cdStudy) and a study folder. The study file contains a list of input files (Xcalibur
RAW files) with their associated sample information and a list of analyses with their associated
result files (.cdResult). The sample information includes the sample type of each input file
and the relationship between the input files.
When you open an existing study, it opens as a tabbed document with a command bar and a
set of tabbed pages: Study Definition, Input Files, Samples, and Analysis Results.
• The Study Definition page contains an editable list of the study factors. See “Edit the
study description and the study factors.”
• The Samples page contains editable lists of the sample type and study factor values for
each input file. See “Edit the sample type and study factor values.”
• The Input Files page tracks the location of the input files. These input files typically reside
outside the study folder—for example, they might reside on your processing computer,
on a shared server, or on the data system computer where they were acquired. If you
delete or rename an input file from the specified folder location after you add it to a study,
this warning symbol, , appears to the left of the ID column for the deleted file.
Pointing to the warning symbol displays the expected location of the missing file and
instructions for resolving the issue. See “Resolve the location of the input files in a study.”
• The Analysis Results page tracks the result files generated by analyses run within the
study. Result files reside within the study folder.
The Start Page lists the 20 most recent studies. The study name appears in blue hypertext.
Pointing to the study name underlines it.
Figure 63. Application window and Start Page
Open an Existing Study
toolbar icon Menu bar
The study opens to the Analysis Results page, which is the last page of four pages. The study
tab includes an image of two racked test tubes, , on the left, the study name in the middle,
and a close icon, , on the right.
Tip On the study tab, an asterisk to the right of the study name indicates unsaved
changes.
Figure 64. Analysis Results page of a study
Study tab
The Add Files, New Analysis, and Open Analysis Template commands are independent of
the active page within the study. The Remove Files command is only active when a file is
selected on the Input Files page or the Analysis Results page.
Add Files Opens the Open dialog box where you select the input files
(Xcalibur RAW files) that you want to include in the study.
Remove Files Executes one of two actions:
• Removes the selected files on the Input Files page when it
is the active page.
• Removes the selected analysis results on the Analysis
Results page when it is the active page.
Open Containing Folder Opens the folder that contains the selected file.
For information about adding and editing the study factors, see Add or edit the study factors
in Chapter 3
Figure 65. Study definition page of a study
Note If you selected a study template with study factors when you created the study or
added study factors by using the Input File Characterization page of the New Study and
Analysis wizard, the Study Factors area contains the defined factors.
Use this pane to enter and store a description of the current study.
Study Factors pane
Tip To add input files to an analysis, drag the files of interest from the Input Files page to
the Analysis view. See “Set up a new analysis from within an existing study.”
1. On the Input Files page of a study, select the input file of interest.
2. Below the table, click Show Details.
The Samples page opens below the input files table.
3. Click the Files tab.
Figure 66 shows the hidden Files page that lists the full file name of the selected input file,
including its directory location.
Figure 66. File path of selected input file
Note Removing input files from a study does not delete them from their directory
location.
1. On the Input Files page, select the rows to remove and click Remove Files in the study
command bar.
Depending on whether you have run an analysis with the selected input files, one of the
following confirmation boxes appears:
• If you have not run an analysis, the Remove Input File confirmation box appears.
• If you have run an analysis, the Remove Analysis Result Files confirmation box
appears.
2. Do one of the following:
• In the Remove Input File confirmation box, click Yes to remove the input files from
the study.
–or–
• In the Remove Analysis Result Files message box, click Remove Files to remove the
input files and their associated analyses from the study.
When you remove an input file from the study, the analyses associated with the input
file disappear from the Analysis Results page, but the result files (.cdResult) remain in
the study folder.
If the network or directory path changes between the study and the raw data files, an
exclamation symbol ( ) appears to the left of the file ID.
Figure 67. Exclamation symbol for missing input file, Find Missing Files command, and Files page
of the Details section
Y To resolve the input files list when you move a study or the Xcalibur RAW files
1. On the Input Files page of a study, display the details for the missing files and check their
original location.
2. If you know where the files are currently stored, add the files to the study by doing any of
the following:
• Open the Add Files browser by clicking the Add Files command in the Study page
command bar. Then, select the files and click Open.
• Right-click the row for the missing file and choose Find Missing Files. Then, in the
Browse for Folder dialog box, browse to the holding folder, select the Xcalibur RAW
files, and click OK.
The Adding Files confirmation box opens. The progress remains at 0.0% until you click
OK.
3. Click OK to continue.
The Adding Duplicate Files message box opens. The progress again remains at 0.0% until
you click OK.
4. Click OK to continue.
The application restores the connection between the study and the raw data files.
Tip To open the Samples page of a study, click the Samples tab.
Figure 69. Samples page and its shortcut menu
Filter
row
To select or modify the sample types and study factor values or view the file information on
the Samples page, see the following topics as applicable:
• Edit the sample identifier on the Samples page
• Edit the sample types on the Samples page
• Edit the study factor values on the Samples page
• View the file information on the Samples page
Tip To select a row, click a column that does not include a dropdown list.
Tip To select a row, click a column that does not include a dropdown list.
Click Show Associated File at the bottom left of the Samples page.
The following details appear—the full file name and location, the file size in kilobytes,
and the acquisition time of the Xcalibur RAW file.
For information about the Copy with Headers, Copy, Clear Selection, and Cell Selection
Mode commands, see “Common operations for manipulating data tables.”
Set Sample Type To Assigns the selected sample type to a sample range.
Set Factor To Assigns the selected study factor value to a sample range.
Set As Input File Adds the selected input file to the Files for Analysis area in the
Analysis view.
Save the study file if you turned off the auto-save feature
By default, the application saves the study file after you submit an analysis and when you close
the study file. However, you can change this behavior by turning off the auto-save feature.
IMPORTANT If you turn off the auto-save feature, the application prompts you to save
your changes when you attempt to close a study file with unsaved changes. See “Turn off
the auto-save feature for studies.”
Unsaved changes include, for example, the last completed run on the Analysis Results
page, new input files on the Input Files page, new study factors or study factor values on
the Study Definition page, new sample assignments on the Samples page, and other study
parameters.
Note The parameter settings for the workflow nodes in the defined processing workflow
templates are not necessarily the same as the default settings for the workflow nodes
themselves. The parameter settings in the processing workflow templates provided with
the application are optimized by the field of study (vertical market).
For details about editing and creating processing workflows, see these topics:
• Processing workflow templates
• Workflows page
• Edit an existing processing workflow
• Defined processing workflow templates for LC studies
• Create a completely new processing workflow for LC studies
• Connect the workflow nodes for an LC study
• Save a custom processing workflow as a template
Note For LC studies, the application provides more than 30 defined processing workflow
templates organized by field of study.
In addition to analyzing the MS data, the application can display chromatograms from the
following detectors:
• A photo-diode array (PDA) detector or an ultraviolet-visible (UV-Vis) detector that is
controlled by a Thermo Scientific data system
• An analog detector that is connected to the mass spectrometer’s analog input channels
Workflows page
Within an existing study, you use the Workflows page to select, create, and edit processing
workflows. The Workflows page is a tabbed page that opens to the right of the tabbed study
pages when you start a new analysis or reprocess an existing analysis. You cannot edit a
processing workflow from outside a study.
For details about the Workflows page, see the following topics:
• Open the Workflows page
• Workflows page command bar
• Workflows page shortcut menu
Closing a study erases the analysis settings, and reopening a study opens only the study pages.
Table 32. Workflow Tree pane and workflow node shortcut menu commands (Sheet 2 of 2)
Command Description
Hide Node Numbers Turns the numbering on or off.
Reprocess Sets the selected node for reprocessing or sets the selected node
and its related nodes for reprocessing.
The application automatically connects some of the workflow nodes as you add them to a
processing workflow. But for some of the workflow nodes, you must make the appropriate
connections. When a node is missing a connection, a Caution symbol appears in its
upper-right corner.
Several workflow nodes require custom parameter selections. When a workflow node is
missing a custom parameter selection, an exclamation mark appears in its upper-right corner.
1. To view the validation errors, point to the Caution symbol in the Analysis view.
A missing connection begins with the following text:
Missing connection for
An exclamation mark, , appears in the upper-right corner of a node when the node is
missing a required parameter selection or the selected item is not found in its corresponding
list or library. See “Troubleshoot common analysis errors.”
1. To view the validation errors, point to the Caution symbol in the Analysis view or review
the issues described in the Current Workflow Issues pane below the Post-Processing
Nodes pane.
For a missing parameter selection, the issue description includes the following text:
Missing value for parameter ‘Node’
2. To fix the workflow node error, select the node in the Workflow Tree pane.
Figure 72. Missing parameter value for the Search Mass Lists node
3. In the Parameters of ‘Workflow Node’ pane to the left, make a selection for the missing
parameter value.
1. Select the node in the Workflow Nodes pane and drag it to the Workflow Tree pane.
Note A Caution symbol appears in the upper-right corner of a workflow node that is
missing a connection. Because a missing connection takes priority over a missing
parameter setting, if a workflow node is also missing a parameter setting, a blue circle
with an exclamation mark appears after you fix the missing connection.
2. If necessary, make the appropriate node connections. See Connect the workflow nodes for
an LC study.
3. To display the parameters for a workflow node, click the node in the Workflow Tree pane.
The Parameters pane lists the parameters for the selected node.
4. Edit the parameter settings for the node.
Note For an LC study that includes a targeted analysis and an untargeted analysis, the
processing workflow must include either the Detect Compounds (Legacy) node and the
Find Expected Compounds (Legacy) node or the Detect Compounds node and the Find
Expected Compounds node.
1. Right-click the node in the Workflow Tree pane and choose Cut.
2. Check whether any validation issues appear.
3. (Optional) To save the processing workflow for reuse, click Save. Then, rename the
workflow if necessary, select an appropriate folder, and click Save.
Tip If you use the processing workflow for the current analysis, you can reuse the
processing workflow without saving it. After you successfully run the analysis, you can
rerun it by selecting the completed analysis on the Analysis Results page of a study.
Figure 73 shows the workflow tree in the following processing workflow file:
MetID w Stats Expected w FISh Scoring and Background Removal.cdProcessingWF
This processing workflow uses the Generate Expected Compounds and Find Expected
Compounds nodes to find expected parent1 compounds and their dealkylation, dearylation,
and transformation products. FISh Scoring is applied to explain the fragments in the
fragmentation spectra based on in silico fragment prediction of the parent and
dealkylation/dearylation products.
IMPORTANT Because the Find Expected Compounds node requires full (MS1) scans,
do not filter out all the full (MS1) scans for a targeted analysis.
4. The Generate Expected Compounds node creates a list of expected compound ions by
using the following user-specified parameter settings:
• Parent compound or compounds
• Number of dealkylation and dearylation steps
• Number and type of transformation steps
• List of possible adduct ions
The Generate Expected Compounds node passes the following information to the Find
Expected Compounds node for each expected compound:
• Parent compound or compounds
• Elemental composition of the expected neutral compound
• Molecular weight of the expected neutral compound
• Whether the expected compound is a product of a dealkylation step
• Transformations required to produce the expected compound
• Composition change between the parent and expected compound
It also passes the following information for each expected compound ion:
• Charge
• Theoretical m/z value
Note The Generate Expected Compounds node does not send information to a result
table. It sends information to the Find Expected Compounds node, which requires
input from at least one Generate Expected Compounds node. The Filter By Mass
Defect node can also process the input from one or more Generate Expected
Compounds nodes.
5. The Find Expected Compounds node performs the following steps by using the full
(MS1) scan data from the Align Retention Times node and the information from the
Generate Expected Compounds node (or multiple Generate Expected Compounds
nodes):
a. Creates a set of mass tolerance and intensity tolerance fit parameters for each
expected compound ion (m/z value) by using the theoretical m/z value for the ion, the
user-specified mass tolerance, the theoretical isotope pattern for the ion, and the
user-specified intensity tolerance for the isotopic ions.
b. For each expected compound ion, the Find Expected Compounds (Legacy) node
does the following:
i. Checks each full (MS1) scan that passes through the connected data processing
node for centroids that match the mass and intensity tolerance rectangles. The
pattern search (set of rectangles) looks for the base peak (most intense centroid)
of the pattern first.
In most cases, the base peak is the A0 centroid for the monoisotopic ion. But in
some cases, for example, in compounds that contain two bromine atoms or four
chlorine atoms, the isotopic peaks have a higher intensity than the monoisotopic
peak.
ii. Draws a filtered XIC trace by summing the centroids found for each data point.
If a data point does not contain the user-specified number of matching isotopes
and the theoretical intensity of the missing isotope was above the noise threshold,
the node assigns a zero intensity value to the data point (see Figure 74).
Figure 74. Filtered XIC trace with a zero intensity data point
Integrated chromatographic peak
(shaded area) Filtered XIC trace (blue)
c. For each expected compound found, the Find Expected Compounds (Legacy) node
creates a summed trace by summing the XIC traces of the associated expected
compound ions.
d. Detects and integrates the chromatographic peaks in each XIC trace. Does not report
chromatographic peaks with an apex peak height that is below the user-specified
minimum (chromatographic) peak intensity. If the average peak width of the peaks in
the processed retention time range is greater than the setting for the Average Peak
Width parameter, the Find Expected Compounds node rejects all of the
chromatographic peaks.
The Find Expected Compounds node passes the following information to the Expected
Compounds per File table for each expected compound that it finds in an input file:
• RT (min)—Retention time of the chromatographic peak apex.
Note If the node finds more than one adduct ion for an expected compound, the
chromatogram is a summed trace.
• Best SFit [%]—Best spectral fit score between the measured and expected isotope
patterns for the expected compound ions. When the node finds only one adduct ion,
the best spectral score is equal to the score for the adduct ion that it found.
• Max # MI—Maximum number of matched isotopes for any of the adduct ions.
When the node finds only one adduct ion, the maximum number of matched
isotopes is equal to that of the adduct ion that it found.
• #Adducts—Number of adduct ions that it found.
• Area—Total area of the chromatographic peaks for the found adduct ions of the
expected compound. Peak areas are reported in counts × seconds.
• Parent Area%—Relative area of the chromatographic peak for the expected
compound as compared to the total area of all found peaks for the expected
compound.
Note The parent area is the chromatographic peak area of the expected
compound, rather than the area of the compound listed in the Parent Compound
column. The compounds listed in the Parent Compound column are the library
compounds that you selected in the Generate Expected Compounds node.
6. The Group Expected Compounds Node groups the chromatographic peaks by their
molecular weight × retention time (MW × RT) dimensions across the input file set and
creates the Expected Compounds table.2
When you enable the peak ratings filter, the node filters out expected compounds that do
not pass the peak quality threshold in the specified number of input files.
You can select between two peak integration models:
• Most Common Ion—Reports the chromatographic peak area of the most common
adduct ion detected across the samples.
• All Ions–Reports the summed areas of all the adduct ions detected in each sample.
This node also selects the best hit ions for each compound across the input file set:
• Selects the best ion and related MS1 scan for each compound as the one with the
highest resolution and the highest intensity for the preferred ion. When you open a
result file, the mass spectrum view displays the MS1 scan for the best ion across the
input file set.
• Selects the best fragmentation data by using the user-specified preferred precursor ion
with the highest intensity that has data-dependent MS2 scans.
Note You use the Preferred Ions parameter in the Group Expected Compounds node
to specify the preferred ions.
7. When the input file set includes blank samples (Sample Type: Blank), the Mark
Background Compounds node compares the peak areas of the compounds (same parent
compound, molecular weight, and retention time) that are found in both the blank
samples and the non-blank samples, and labels these compounds as Background
Compounds if their peak areas do not meet the specified threshold.
8. When the spectrum data in the input files includes fragmentation scans (MS2, MS3,
MSn), the FISh Scoring node compares the best fragmentation scan for an expected
2
For result files generated in Compound Discoverer 2.0, the Group Expected Compounds node also added the
Best Compounds column to the Expected Compounds per File table. The Workflow page of the Result
Summary view displays the Best Compound criteria for updated result files.
compound across the input file set to the expected structures. The scoring process can add
considerable processing time.
9. When the processing workflow includes both the Find Expected Compounds node and
the Detect Compounds node, the Merge Features node consolidates the chromatographic
peaks that these nodes find in the main Merged Features table. The consolidation is based
on the m/z × RT dimensions of the features.
Note The Merge Features node also creates the Merged Features table that is related
to the Manual Peaks table. When you manually integrate a chromatographic peak for
a specialized trace such as a UV trace, you can compare the selected peak in the
Manual Peaks table to the peaks in the related Merged Features table. The application
populates the related Merged Features table with the chromatographic peaks that fall
within the retention time window specified in the Merge Features node—that is, the
application populates the Merged Features table that is related to the Manual Peaks
table with chromatographic peaks that have a similar retention time to the selected
manual peak.
10. When the processing workflow includes any of the search nodes, the application searches
the selected databases.
Note You can connect the Group Expected Compounds node to any of the search
nodes and any of the mapping nodes.
11. When the analysis includes group ratios, the Differential Analysis node runs the
differential analysis.
After the analysis finishes, you can open the result file.
Figure 75 shows a schematic of the main and related result tables for the targeted workflow
shown in Figure 73. The Structure Proposals table is empty until you populate it.
Figure 75. Result tables for a basic targeted workflow (with the Create Analog Traces and Merge
Features nodes)
Expected
Expected Expected
Expected Expected Merged
Merged Expected
Expected Input
Input
Compounds
Compounds Compounds Formulas Features
Features Features
Features per
per File
File Files
Files
per File
Structure
Structure Expected
Expected Expected
Expected Expected
Expected Expected Expected
Proposals
Proposals Compounds
Compounds Compounds
Compounds Compounds
Compounds Compounds Compounds
per
perFile
File per
per File
File
Expected
Expected Expected
Expected Expected Expected
Expected
Compounds
Compounds Formulas
Formulas Compounds Compounds
Features Merged
per
perFile
File per
per File
File per
per File
File Features
Expected
Expected Merged Related Related
Related Expected
Formulas
Formulas Features Structures Structures
Structures Formulas
Expected
Merged
Merged Expected
Expected Transformations
Transformations Compounds Chromatogram
Chromatogram Specialized
Specialized
Features
Features Features per File
Features per File Peaks
Peaks Traces
Chromatogram
Chromatogram Input
Input File
Peaks
Peaks Files
Files Alignments
Related
Related
Structures
Structures
Input
Input
Files
Files
Specialized
Specialized Study
Study Statistical
Statistical
Traces
Traces Information
Information Methods
Methods
Input
Input Input
Files
Files Files
You can connect the Group Expected Compounds node to any of the search nodes and
mapping nodes.
Figure 76. Group Expected Compounds node connected to the mapping nodes and the search
nodes
Adding the search nodes to the processing workflow adds the main and related tables shown
in Figure 77 to the result file.
Figure 77. Additional result tables generated by the search nodes
mzVault Results mzCloud Results mzCloud Results mzCloud Results mzCloud Results
Adding the mapping nodes to the processing workflow adds the following result tables to the
result file:
• Main: Metabolika Results, Metabolika Pathways, KEGG Pathways, BioCyc Pathways,
and BioCyc Results
• Related result tables for specific expected compounds: Metabolika Results, Metabolika
Pathways, KEGG Pathways, and BioCyc Pathways, and BioCyc Results
• Related result tables for specific Metabolika results: Expected Compounds and
Metabolika Pathways
• Related result tables for specific Metabolika pathways: Expected Compounds and
Metabolika Results
• Related result tables for specific KEGG pathways
You can connect the Group Expected Compounds node to the following peak refinement
nodes: Mark Background Compounds and Scale Areas. You can add these nodes, and then
reprocess the data set without reprocessing the entire workflow, as these nodes use only the
data provided by the grouping node.
This processing workflow uses the Detect Compounds node to find chromatographic peaks
for unknown compounds (MW × RT) and the Predict Compositions node to determine the
possible elemental compositions of the unknown compounds. It also determines the possible
identity of the unknown compounds as follows:
• The Search ChemSpider node searches selected databases of MS1 scans by using the
molecular weight or predicted formulas when available.
• The Search mzCloud node searches the mzCloud database of fragmentation scans by
using the molecular weight or predicted formulas when available.
• The Map to Metabolika Pathways searches the pathways for the detected compounds.
When the application finds a matching compound, it maps the pathway for ease of
viewing.
The Assign Compound Annotations node assigns the following annotations to the detected
compounds: Name, Formula, and Structure. The Apply mzLogic node combines mzCloud
similarity searching (MS2 and MSn) with structure similarity matching to rank putative
database results.
Figure 78. Processing workflow that finds and identifies unknown compounds
3. The Align Retention Times node chromatographically aligns the input files in a sample
set.
4. The Detect Compounds (legacy) node does following:
• Detects contiguous mass traces in the full (MS1) scans by using the parameter
settings for the mass tolerance and intensity threshold.
• Detects chromatographic peaks in the contiguous mass traces.
• Groups isotopes.
• Applies peak quality filters for isotopic features.
• Groups adducts by using the user-specified ions and base ions lists.
• Reports the unknown compounds (MW × RT) by occurrence in the Compounds per
File table.
IMPORTANT Make sure that the Preferred Ions list for the Group Compounds node
includes the selections in the Base Ions list (advanced parameter) for the Detect
Compounds node.
5. The Group Compounds node uses the specified mass and RT tolerances to group
chromatographic peaks with the same MW × RT values in the Compounds table.
You can select between two peak integration models:
• Most Common Ion—Reports the chromatographic peak area of the most common
adduct ion detected across the samples.
• All Ions–Reports the summed areas of all the adduct ions detected in each sample.
When you enable the peak ratings filter, the node filters out compounds that do not pass
the peak quality threshold in the specified number of input files.
It then sends the best fragmentation data across the input files to the Search mzCloud
node and Predict Compositions node.
This node also selects the best hit ions for each compound across the input file set:
• Selects the best ion and related MS1 scan for each compound as the one with the
highest resolution and the highest intensity for the preferred ion. When you open a
result file, the mass spectrum view displays the MS1 scan for the best ion across the
input file set.
• Selects the best fragmentation data by using the user-specified preferred precursor ion
with the highest intensity that has data-dependent MS2 scans.
The Predict Compositions node and the search nodes use the best hit ions.
Note You use the Preferred Ions parameter in the Group Compounds node to specify
the preferred adducts.
6. The Search mzCloud node searches the mzCloud database for matching and similar
fragmentation spectra.
7. The Predict Compositions node predicts the elemental compositions of the unknown
compounds.
8. The Map to Metabolika Pathways node searches the Metabolika pathways for matching
compounds.
9. The Search ChemSpider node searches the ChemSpider database for matching
compounds.
10. The Fill Gaps node fills in missing peaks or peaks below the detection threshold (specified
in the Detect Compounds node) for subsequent statistical analysis.
IMPORTANT You can substitute the Fill Gaps node with the Apply Missing Value
Imputation node. For details, see “Apply Missing Value Imputation node.”
Do not use the Apply Missing Value Imputation node when the processing workflow
includes either of the QC correction nodes.
11. If the analysis includes QC samples, the Apply SERRF QC Correction node normalizes
the chromatographic peak areas in the input files that you assigned the sample type of
Sample.
12. The Mark Background Compounds node determines the background compounds in the
blank samples (Sample Type: Blank) and labels these compounds as background
compounds.
13. The Differential Analysis node runs a differential analysis on the defined sample ratios
and calculates the p-values.
14. The Assign Compound Annotations node assigns and compares the annotations provided
by the Predict Compositions, Search ChemSpider, Search mzCloud, and Search Mass
Lists nodes.
15. The Apply mzLogic node runs a forward search and a reverse search using the mzCloud
service. For compounds that have available MS2 scans, it scores all the structure
candidates (or the specified maximum number of candidates) from the attached input
nodes. Adds the following columns to the result tables: #Similarity Results (Compounds
table) and mzLogic Score (search result tables).
For information about the result tables that this processing workflow generates, see
Chapter 10, “Descriptive information for the result tables.”
Figure 79 shows a schematic of the main and related tables for a basic untargeted workflow.
Figure 79. Result tables for an untargeted workflow
Metabolika Results
mzCloud Results
ChemSpider Results
Filled Gaps*
Metabolika Pathways
Study
Information
Statistical Methods
* By default, the Filled Gaps table is hidden.
The file names for the processing workflows include the following descriptive text:
• Expected—The workflow runs a targeted analysis with the Generate Expected
Compounds and Find Expected Compounds nodes.
IMPORTANT You must customize the targeted workflows by selecting the targeted
compounds in the Generate Expected Compounds node. Before you can select the
compounds for the node, you must add the compounds to the Expected Compounds
library.
In the defined processing workflows for targeted analyses, the Generate Expected
Compounds node is set up to generate a mass list for the following adduct ions:
[M+H]+1 and [M–H]–1. For best results, make the appropriate selections for your
analysis from the Ions list in the Generate Expected Compounds node.
Tip Mobile phase additives can have a significant effect on the base ions (adduct ions
with the highest intensity) in the full scan data for an LC/MS experiment. To avoid
misinterpreting the isotopic ion clusters, make sure that the Base Ions list in the
Detect Compounds node includes the predominant adduct ions. For example, if the
mobile phase contains a significant amount of ammonium acetate, consider adding
the ammonium adduct, [M+NH4]+1, to the list.
The processing workflows in the E and L folder include [M+NH4]+1 in the Base Ions
list.
• Local Database—The workflow searches your local mzVault and mass list files. For
metabolomics templates, the local databases include the local Metabolika pathways files.
Tip In the Search mzVault and Search Mass Lists nodes, select the appropriate files.
For LC studies, the application includes nine mzVault library files and nine mass lists.
• mzVault libraries:
– EpoxidizedSoybeanOil_Library_AN001586
– mzCloud Offline for mzVault_Endogenous_2021B
– mzCloud Offline for mzVault Autoprocessed_2021B
– mzCloud_Offline for mzVault_Endogenous_2021B
– mzCloud_Offline for mzVault_Endogenous-Autoprocessed_2021B
– Bamba Lab 34 Lipid Mediators Library Stepped NCE 10 30 45
– Bamba Lab 598 Polar Metabolites Stepped NCE 10 30 45
– LipidBlast-VS68-Neg
– LipidBlast-VS68-Pos
• Mass lists
– Arita Lab 6549 Flavonoid Structure Database
– Chemical List PFASSTRUCT-2022-04-20
– EFS HRAM Compound Database
– Endogenous Metabolites Database 4400 Compounds
– Extractables and Leachables HRAM Compound Database
– LipidMaps Structure Database 2021-09-13
– Natural Products Atlas_2020_06
– PFAS NEG
– PFAS NIST
• FISh Scoring—The workflow includes the FISh Scoring node, a structural confidence
scoring and annotation tool for comparing the predicted fragments of expected
compounds to the experimental fragmentation scans. This node adds a significant
amount of processing time.
• Stats or Statistics—The workflow includes the Differential Analysis node. If you submit
an analysis that does not include ratios, a warning message appears. If you do not want to
run a differential analysis, you can ignore the warning and submit the run.
• Compound Class Scoring—The workflow includes the Compound Class Scoring node.
IMPORTANT Before you can select the compounds classes for the node, you must
add the fragment lists to the Compound Classes library.
Note All the untargeted workflows include the Mark Background Compounds node
for filtering compounds detected in the blank samples.
The following processing workflow templates are installed with the application:
• DegradantID folder
– Degradants Related and Unknown ID w Database Searches
– Degradants Unknown ID w Pattern Trace and Pattern Scoring
– Degradants w Stats Related and Unknown ID w Database Searches
• EandL folder
– E and L Expected w FISh Scoring
– E and L Unknown ID with Online and Local Database Searches
– E and L w Stats Unknown ID w Online and Local Database Searches
• Environmental folder
– Environmental Expected w Transformation and FISh Scoring
– Environmental Unknown ID w Online and Local Database Searches
– Environmental w Stats Unknown ID w Online and Local Database Searches
• Food Research folder
– Food Research Expected w FISh Scoring
– Food Research Unknown ID w Online and Local Database Searches
– Food Research w Stats Unknown ID w Online and Local Database Searches
• Forensics folder
– Forensics Expected w FISh Scoring
– Forensics Unknown ID w Compound Class Scoring and Database Searches
– Forensics w Stats Unknown ID w Compound Class Scoring and Database Searches
• ImpurityID folder
– Impurity ID Related and Unknown
– Impurity ID w Stats Related and Unknown
• Lipidomics folder
– Untargeted Lipidomics using Local Databases, LipidBlast in-silico library and
LipidMaps database.cdProcessingWF
– Untargeted Lipidomics using Online Databases, LipidBlast in-silico library and
LipidMaps database.cdProcessingWF
• Metabolomics folder
– Max ID - Detect Unknowns with ID Using Online Database Searches Single Sample
– Untargeted Metabolomics Quick Detection Unknowns No ID
– Untargeted Metabolomics using Online Databases, mzLogic, and Molecular
Networks
– Untargeted Metabolomics with Statistics Detect Unknowns with ID using Local
Databases
– Untargeted Metabolomics with Statistics Detect Unknowns with ID using Online
Databases
– Untargeted Metabolomics with Statistics Detect Unknowns with ID using Online
Databases and mzLogic
• MetID folder
– MetID Generate Inclusion List For Acquisition Pos Mode
– MetID Generate Inclusion List For Acquisition Neg Mode
– MetID Pattern Scoring with Background Removal
– MetID w Stats Expected and Unknown w Background Removal
– MetID w Stats Expected and Unknown w MMDF and Background Removal
– MetID w Stats Expected w Background Removal
– MetID w Stats Expected w FISh Scoring and Background Removal
• NaturalProduct
– Natural Product Unknown ID w Online and Local Database Searches
– Natural Product Unknown ID w Stats Online and Local Database Searches
• PFAS
– PFAS Unknown ID w Database Searches and Molecular Networks.cdProcessingWF
• PolymerID
– Unknown Polymer ID w Stats Online and Local Database Searches
– Unknown Polymer ID with Online and Local Database Searches
• Stable Isotope Labeling folder
– Stable Isotope Labeling w Metabolika Pathways and ID using Local Databases
– Stable Isotope Labeling w Metabolika Pathways and ID using Online Databases
A processing workflow always begins with the Input Files node. All processing workflows that
process MS data require the Select Spectra node. The only processing workflow that does not
require the Select Spectra node is limited to processing the data from an analog detector.
1. Open the Workflows page. If the Workflows Tree area contains a processing workflow
that you do not want to edit, click Clear.
2. Drag the required Input Files node from the Workflow nodes pane to the Workflow Tree
pane.
The Input Files node reads the information in the raw data files.
3. To process the spectral data, drag the Select Spectra node to the Workflow Tree pane.
The Input Files node automatically connects to the Select Spectra node. The Select
Spectra node reads and filters the MS scan data in the raw data files. The default
parameter settings pass all the scan data to the next node.
4. To align multiple input files, drag the Align Retention Times (ChromAlign) node to the
Workflow Tree pane. Then, connect the Select Spectra node to the alignment node.
IMPORTANT The new Align Retention Times (ChromAlign) node is more rugged
than the original alignment node. So for most analyses, use the Align Retention
Times (ChromAlign) node instead of the Align Retention Times node. The original
alignment node is available for comparing the results from previous data sets to new
data sets of similar samples.
The alignment node chromatographically aligns the MS scan data in the input files. The
Align Retention Times (ChromAlign) node
IMPORTANT For analyses with multiple input files, always add one of the alignment
nodes to the processing workflow.
Note The Find Expected Compounds node accepts input from any of the
Spectrum Processing nodes.
c. Drag one or more Generate Expected Compounds nodes to the Workflow Tree
pane.
• To apply different transformation rules to multiple compounds, drag multiple
Generate Expected Compounds nodes to the Workflow Tree pane, one for each
set of rules.
• To apply the same transformation rules to one or more compounds, drag a single
Generate Expected Compounds node to the Workflow Tree pane.
The Compounds parameter is empty until you select the compounds of interest.
If you submit an analysis to the job queue without selecting the compounds of
interest, a Caution symbol appears.
d. For each Generate Expected Compounds node, select the compounds of interest. See
“Generate Expected Compounds node.”
e. Connect the Generate Expected Compounds node or nodes to the Find Expected
Compounds node.
f. Drag the Group Expected Compounds node to the Workflow Tree pane.
The Find Expected Compounds node automatically connects to it.
g. To add FISh scoring to the processing workflow for targeted compounds, drag the
FISh Scoring node to the Workflow Tree pane.
The Group Expected Compounds node automatically connects to it.
Note If the processing workflow does not include the Apply SERRF QC Correction
node, you can use the Apply Missing Value Imputation node instead of the Fill Gaps
node. Because the Group Compounds node does not automatically connect to the
Apply Missing Value Imputation node, you must manually connect the Group
Compounds node to the Apply Missing Value Imputation node.
8. To correct for batch effects, connect the Fill Gaps node to the Apply QC Corrections
node. Or, if the input files for the analysis are from more than one continuous batch,
connect the Fill Gaps node to the Apply SERRF QC Correction node.
If the analysis does not include quality control samples, the application ignores the QC
correction node.
9. To identify unknown compounds, add identification nodes to the processing workflow.
As you drag an identification node into the Workflow Tree pane, the Group Compounds
node automatically connects to it.
a. Drag the Predict Compositions node to the Workflow Tree pane.
IMPORTANT For best results, always include the Predict Compositions node in a
processing workflow for untargeted compounds. Without the Predict
Compositions node, the workflow does not report the elemental compositions of
the unknown compounds without a hit from the online search databases or the
local Metabolika database.
b. To assign a name, formula, and structure to the compounds in the Compounds table,
drag the Assign Compound Annotations node to the Workflow Tree pane.
c. (Optional) To search the online ChemSpider database, drag the Search ChemSpider
node to the Workflow Tree pane.
d. (Optional) To search mass lists, drag the Search Mass Lists node to the Workflow
Tree pane. In the Workflow Tree pane, select the Search Mass Lists node, and then,
in the parameters pane, select the appropriate mass lists.
e. (Optional) To search the online mzCloud database, drag the Search mzCloud node
to the Workflow Tree pane.
f. (Optional) To search your local mzVault database, drag the Search mzVault node to
the Workflow Tree pane. In the Workflow Tree pane, select the Search mzVault
node, and then, in the parameters pane, select libraries from the mzVault Library list.
10. To map detected compounds to a biochemical pathway, add pathway mapping nodes to
the processing workflow. As you drag a pathway mapping node into the Workflow Tree
pane, the Group Compounds node automatically connects to it.
• To search the local Metabolika pathways, drag the Map to Metabolika Pathways
node to the Workflow Tree pane.
• To search the KEGG database, drag the Map to KEGG Pathways node to the
Workflow Tree pane.
• To search the BioCyc database, drag the Map to BioCyc Pathways node to the
Workflow Tree pane.
11. To rank the hits from the Search ChemSpider node, Map to Metabolika Pathways node,
Search Mass Lists node, and Map to BioCyc Pathways node, drag the Apply mzLogic
node to the Workflow Tree pane.
When present, the Search ChemSpider node, Map to Metabolika Pathways node, Search
Mass Lists node, and Map to BioCyc Pathways node automatically connect to the Apply
mzLogic node.
12. To compare the features found by the Find Expected Compounds and Detect
Compounds nodes if applicable, drag the Merge Features node to the Workflow Tree
pane.
The Find Expected Compounds and Detect Compounds nodes automatically connect to
the Merge Features node.
13. To add Trace Creation nodes to the processing workflow, do the following:
• For each UV, PDA, or analog trace that you want to extract, drag a Create Analog
Trace node from the Workflow Nodes pane to the Workflow Tree pane.
The Input Files node automatically connects to the Create Analog Trace node.
• For each pattern trace that you want to extract, drag a Create Pattern Trace node to
the Workflow Tree pane. Then, connect the Align Retention Times (ChromAlign)
node to each Create Pattern Trace node.
• For each Fragment Ion Search (FISh) trace that you want to create, drag a Create
FISh Trace node to the Workflow Tree pane. Then, connect the Align Retention
Times (ChromAlign) node to each Create FISh Trace node.
• For each mass trace that you want to extract, drag a Create Mass Trace node to the
Workflow Tree pane. Then, connect the Align Retention Times (ChromAlign) node
to each Create Mass Trace node.
14. In the Post-Processing Nodes pane, do the following:
• To add a differential analysis (volcano plot) to the workflow for the specified sample
groups and ratios, drag the Differential Analysis node to the Post-Processing Nodes
pane below the Workflow Tree pane.
• To add the descriptive statistics columns to the Compounds and Expected
Compounds tables in the result file, drag the Descriptive Statistics node to the
Post-Processing Nodes pane below the Workflow Tree pane. The descriptive statistics
columns provide information about the chromatographic peak areas (mean area,
median area, minimum area, quartile areas, standard deviation and relative standard
deviation of the areas) for each detected compound.
Note In a result file, the descriptive statistics columns are hidden by default.
15. To export the MS scan data in the raw data files to a common data format, drag an
Export Spectra node to the Workflow Tree pane. Then, connect one of these nodes to
the Export Spectra node:
• Select Spectra
• Align Retention Times
Note The Export Spectra node does not export analog data.
For details about each workflow node, see Chapter 7, “Workflow nodes.”
When you add any of the following nodes to a processing workflow, you must connect them
manually:
• Any of the spectrum processing nodes (except for the Select Spectra node)
• Any of the trace creation nodes (except for the Create Analog Trace node)
• Generate Expected Compounds node (connects to the Find Expected Compounds node)
• Input spectrum processing node to the Detect Compounds node
• Any of the peak area refinement nodes
For LC studies, you can use the following nodes multiple times in a processing workflow: the
Generate Expected Compounds node, all the Trace Creation nodes, the Mark Background
Compounds node, and all the Spectrum Processing nodes, except for the Align Retention
Times node.
2. Click one of the white boxes and hold down the mouse button until a red border and an
arrowhead appear.
3. Continue holding down the mouse button as you drag the arrowhead to the output node
of interest.
4. To automatically format the layout of the workflow nodes, click Auto Layout in the
Workflows command bar.
Note None of the provided processing workflow templates include the Normalize Areas
node or the Scale Areas node. Thermo Fisher Scientific recommends using QC samples
and the appropriate QC correction node to correct for batch errors.
The Fill Gaps node takes input only from the Group Compounds node, and the Group
Compounds node automatically connects to it.
When you add any of the other peak area refinement nodes, you must make the appropriate
connections.
The processing workflow templates provided with the application use the following peak
refinement nodes in this connection order—Fill Gaps > Apply QC Correction or Apply
SERRF QC Correction > Mark Background Compounds.
Figure 81. Suggested connection order for the peak area refinement nodes
If you add the Normalize Areas node to your processing workflow templates, the appropriate
input and output connections depend on whether your blank samples are solvent blanks or
matrix blanks:
• If the blanks are solvent blanks, connect the nodes in this order:
Group Compounds > Fill Gaps > Apply SERRF QC Correction or Apply QC Correction
> Mark Background > Normalize Areas
IMPORTANT Do not use the Apply Missing Value Imputation node with either of the
QC correction nodes. Use the Fill Gaps node instead.
The Apply Missing Value Imputation node only imputes the missing peak area
values—whereas, the Gap Filling node actually re-detects missing peaks by
performing peak detection at a very low threshold.
Because the Gap Filling node detects actual peaks, it provides more accurate results
than the Apply Missing Value Imputation node. More of the gap-filled compounds
will pass the RSD filter in the QC correction nodes and therefore have a QC
correction applied to them.
• If the blanks are matrix blanks, connect the nodes in this order:
Group Compounds > Fill Gaps > Apply SERRF QC Correction or Apply QC
Correction> Normalize Areas > Mark Background.
If you add the Scaling Factor node, add it to the end of the processing workflow template.
Table 33. Input and output nodes for each workflow node in an LC study (Sheet 2 of 4)
Workflow node Input nodes Output nodes
Find Expected Compounds node • Generate Expected Compounds • Group Expected Compounds
• Spectrum Processing • Merge Features
Find Expected Compounds (Legacy) • Generate Expected Compounds • Group Expected Compounds
node • Spectrum Processing • Merge Features
Generate Expected Compounds node None • Find Expected Compounds
• Filter By Mass Defect
Group Expected Compounds node Find Expected Compounds • FISh Scoring
• Mark Background Compounds
Merge Features node • Find Expected Compounds None
• Detect Compounds
5. Compound Detection
Analyze Labeled Compounds node Assign Compound Annotations None
Detect Compounds node Spectrum Processing • Group Compounds
• Merge Features
Detect Compounds (Legacy) node Spectrum Processing • Group Compounds
• Merge Features
Fill Gaps node Group Compounds Any of the peak area refinements
nodes (except the Apply Missing
Value Imputation node)
Group Compounds node Detect Compounds • Compound Identification
• Pathway Mapping
• Fill Gaps
• Mark Background Compounds
6. Peak Area Refinement
IMPORTANT For information about connecting the peak area refinement nodes in the correct input–output order,
see “Peak area refinement node connections.”
Apply Missing Value Imputation node One of these: One of the other peak area
• Group Compounds refinement nodes.
• Normalize Areas
• Scale Areas The most common output nodes
• Apply QC Correction are either one of the QC correction
nodes or the Mark Background
Compounds node.
IMPORTANT Do not use the Apply
Missing Value Imputation node None of the provided processing
with the QC correction nodes. workflow templates for LC studies
include the Apply Missing Value
Imputation node.
Table 33. Input and output nodes for each workflow node in an LC study (Sheet 3 of 4)
Workflow node Input nodes Output nodes
Apply QC Correction node Fill Gaps Normalize Areas or Scale Areas
Table 33. Input and output nodes for each workflow node in an LC study (Sheet 4 of 4)
Workflow node Input nodes Output nodes
8. Pathway Mapping
All • Group Expected Compounds None
• Group Compounds
9. Compound Scoring
Apply Spectral Distance node • Search Mass Lists None
• Search ChemSpider
• Map to KEGG Pathways
• Map to Metabolika Pathways
• Map to BioCyc Pathways
Apply mzLogic node • Search Mass Lists None
• Search ChemSpider
• Map to Metabolika Pathways
• Map to BioCyc Pathways
Calculate Mass Defect node Group Compounds None
Compound Class Scoring node Group Compounds None
Generate Molecular Networks node Assign Compound Annotations None
Pattern Scoring node Group Compounds None
Search Neutral Mass Losses node Group Compounds None
10. Post-Processing (white nodes)
Descriptive Statistics node None None
Differential Analysis node None None
Export Xcalibur Inclusion or Exclusion None None
List node
Result Exporter None None
Scripting Node None None
Note When you run an analysis, the application automatically saves the processing
workflow to the result file. Selecting a result file on the Analysis Results page of a study
and clicking Reprocess opens the processing workflow saved with the result file.
Use the Workflows page (in any study) to create or edit processing workflows. The Workflows
page is a tabbed page to the right of the Grouping & Ratios page and is available only when
the Analysis view is open.
Workflow nodes
A processing workflow consists of a set of connected workflow nodes, and each workflow
node has a set of associated parameters. To create and edit processing workflows, see
Chapter 6, “Create and edit processing workflows.”
Note For information about the workflow nodes for GC studies, refer to the Compound
Discoverer User Guide for GC Studies or the Help system.
In the Compound Discoverer 3.3.x application, Thermo Fisher Scientific has improved
the functionality of the Detect Compounds and Find Expected Compounds nodes, which
now require fewer if any custom parameter settings.
To support legacy processing workflows, the Workflow Nodes pane still includes the
Detect Compounds (Legacy) and the Find Expected Compounds (Legacy) nodes.
The Align Retention Times (ChromAlign) and Result Exporter nodes were added in the
Compound Discoverer 3.3.0 application.
The processing workflow templates use the improved Detect Compounds and Find
Expected Compounds nodes and the new Align Retention Times (ChromAlign) node.
In the Workflow Nodes pane on the Workflows page, the workflow nodes are organized into
functional groups. For information about the workflow nodes, see the following topics:
• Input and Output nodes
• Spectrum Processing nodes
• Trace Creation nodes
• Expected Compounds nodes
• Compound Detection nodes
• Peak Area Refinement nodes
• Compound Identification nodes
• Pathway Mapping nodes
• Compound Scoring nodes
• Post-Processing nodes
Note All processing workflows begin with the Input Files node.
Note You can add multiple Export Spectra nodes to a processing workflow.
The Export Spectra node requires input from one of the data processing nodes. The
processing workflow shown in Figure 82 reads the input files, extracts the MS scans of
interest, and exports the data to an open source format file. Running this processing workflow
does not require an active software license.
Figure 82. Minimum processing workflow for the Export Spectra node
Selections:
• Mascot Generic Format (*.mgf )—Generates an MGF file,
which lists the MS scans by retention time. The scan data for
each time point consists of two columns: mass and intensity.
• mzDATA (*mzData)—Generates an XML-based file that
third-party mass spectrometry software packages can read.
• mzML (*mzML)—Generates an XML-based file that
third-party mass spectrometry software packages can read.
To view information about the input files for a result file, open its Input Files table.
For an LC study, the following workflow nodes are available in the Spectrum Processing area
of the Workflow Nodes pane:
• Align Retention Times node
• Align Retention Times (ChromAlign)
• Filter By Mass Defect node
• Filter By Scan Event node
• Filter Centroids node
• Select Spectra node
IMPORTANT In the spectrum processing node that supplies spectra to the Alignment, do
not apply a scan filter that excessively reduces the retention time window, as doing so
might cause an alignment failure. The Align Retention Times node requires a minimum
amount of representative scan data to chromatographically align the input files in an
analysis.
Note The Align Retention Times node generates the File Alignments result table. When
you open a result file from an analysis that included the Align Retention Times node, the
Retention Time Correction view is available.
Table 35 describes the parameters for the Align Retention Times node.
Table 35. Align Retention Times node parameters (Sheet 1 of 2)
Parameter Description
1. General Settings
Alignment Model Specifies the curve-fitting algorithm for chromatographically
aligning the input files.
Default: True
Selections:
• True: If the data set includes features with negative retention
times, the algorithm shifts all of the features to avoid
cropping.
• False: Removes features with negative retention time values
from the analysis (crops the feature from the feature list passed
to the connected nodes).
Mass Tolerance Specifies the mass tolerance to be used for feature matching.
Default: True
In the result file that the analysis generates, the Statistical Methods
table lists the reference file.
You can add multiple Filter By Mass Defect nodes to a processing workflow.
To specify the elemental compositions for the node, you can enter the elemental compositions
in the Custom Compositions area of the node, or you can use the Generate Expected
Compounds node to provide the compositions.
Tip For more information about mass defects, see “Mass defect types and visualization
techniques.”
Table 37 describes the parameters for the Filter By Mass Defect node.
Table 37. Filter By Mass Defect node parameters (Sheet 1 of 2)
Parameter Description
1. General Settings
Filter Direction Specifies whether the selected mass defect filter keeps or removes mass
spectral peaks (centroids) from further processing.
Selections:
Where:
Selections:
• Floor rounds down.
• Ceiling rounds up.
• Round rounds to the nearest integer value.
Using the calculated mass defect values calculated and the exact mass values calculated from
the elemental composition input, these mass tolerance values define the rectangular mass
defect filters.
Mass Tolerance The input from the data processing nodes is a table of m/z values and
intensities for each full (MS1) scan.
Specifies the mass tolerance for the ions that the filter removes or
passes through to the next node.
Default: 0.025 (Da or unit-less for the Relative Mass Defect selection)
Leave these boxes empty if you want to use one or more Generate
Expected Compounds nodes to generate the list of elemental
compositions.
Ions Specifies the ion definitions to be used with the custom compositions.
For each elemental composition, the node creates one mass defect filter
for each ion definition—that is, if you select five ion definitions, the
node creates five mass defect filters for each elemental composition.
Select the ion definitions from the dropdown list. Use the Ion
Definition Editor to create new ion definitions.
Table 38 describes the parameters for the Filter By Scan Event node.
Table 38. Filter By Scan Event node parameters (Sheet 1 of 3)
Parameter Description
Filter Settings
Mass Analyzer Specifies the mass analyzer that the instrument used to acquire the
scan. A Thermo Scientific hybrid mass spectrometer, such as the
LTQ Orbitrap mass spectrometer, contains two mass analyzers
and can acquire both ITMS (low-resolution) and FTMS
(high-resolution) scans in one data file.
Default: (Not Specified)—The node does not filter the scan events
by polarity mode.
Default: 1.5
Minimum Intensity Specifies the minimum intensity threshold for the mass spectral
Threshold peaks (centroids). The node excludes centroids from the analysis
that are below this intensity value.
The mass spectral scans are numbered 1, 2, 3, and so on in single integer increments from the
beginning to the end of the acquisition run. Use the Select Spectra node to select the scans
that you want the application to process. Limiting processing to the scans of interest decreases
processing time and minimizes false positives. For example, if you know the retention time of
the compounds of interest, exclude scans that fall outside a specific retention time window.
IMPORTANT For LC studies, the default parameter settings for the Select Spectra node
are appropriate for most analyses. When using settings other than the defaults, follow
these guidelines:
• Because the application uses the full (MS1) scans to measure the accurate mass and
isotope patterns of the mass spectral peaks, do not filter out the full (MS1) scans when
the processing workflow includes any of these nodes:
– Find Expected Compounds
– Find Expected Compounds (Legacy)
– Detect Compounds
– Detect Compounds (Legacy)
• The following nodes require a representative amount of data to function properly:
– Align Retention Times
– Align Retention Times (ChromAlign)
– Find Expected Compounds
– Find Expected Compounds (Legacy)
If you excessively reduce the retention time window (for example, by using an RT or
scan number range), the chromatographic alignment algorithm (for the alignment
nodes) and the automatic peak width detection algorithm (for the find expected
compounds nodes) might fail to produce satisfactory results.
The only basic parameters in this group are the lower and upper RT limits.
Note The retention time filter excludes scans outside the specified limits; however, the
application does not check the validity of the retention time settings against the actual
acquisition time for the raw data file. When both the lower and upper RT limits are set to
0 (default), the application does not use retention time to filter scans.
Lower RT Limit Excludes scans acquired before the user-specified retention time.
When this parameter is set to 0, the node processes the first scan
that passes through the other filters.
When this parameter is set to a value that is greater than the last
available scan number, the node filters out all of the scans.
Note The charge state filter excludes higher-order scans of precursor ions with a charge
state that is outside the specified limits. The charge state filter does not affect the MS1
scans.
Lowest Charge State Excludes higher-order scans of precursor ions with a lower charge
state than the specified charge state.
Default: 0; Minimum: 0
Highest Charge State Filters out scans from precursor ions with a higher charge state
than the specified charge state.
Default: (Not Specified)—The node does not filter the scan events
by polarity mode.
The compound detection nodes can process spectra for only one mass range and one CV
value per input file.
MS1 Mass Range This filter applies only to MS1 scans. The format for this
parameter setting is [50.0000-1000.0000], where the dash is a
hyphen.
MS1 scans must match the specified mass range to pass the filter.
Default: 1.5
4. Replacements for Unrecognized Properties
Unrecognized Charge Specifies the charge state or states to process when the charge state
Replacements of the precursor ion is indeterminate.
Default: 1
Default: ITMS
Selections:
• Ion Trap (ITMS)
• Fourier Transform (FTMS)
• Time of Flight (TOFMS)
• Single Quad (SQMS)
• Triple Quad (TSMS)
• Sector Field (SectorMS)
Unrecognized MS Specifies the MS order when the application cannot retrieve this
Order Replacements information from the input file.
Default: True
Selections:
• True—The node considers the isotope pattern in reevaluating
precursors.
• False—The node does not consider the isotope pattern in
reevaluating precursors.
Provide Profile Spectra When set to True, the node stores the profile data for the scans.
When set to Automatic, the node checks whether any other nodes
in the processing workflow require profile data. If the profile data
is not required by any of the workflow nodes in the processing
workflow, the node only stores the centroid data.
Default: Automatic
Store Chromatograms Specifies whether the analysis stores the total ion current (TIC)
and base peak chromatograms in the result file.
Default: False
You can access analog traces from the Specialized Traces table.
Note The Create Analog Trace node can also convert and display a pressure trace from an
LC pump and a temperature trace from a column heater or autosampler with temperature
control, when these instruments are controlled by the Xcalibur data system or equivalent
Thermo Scientific application.
Total scan traces display the average absorbance for each time
point of all the wavelengths in the scan range.
Default: False
Default: False
Note Use the Min. and Max. Wavelength boxes to specify a trace of average absorbance
versus time.
• To display the chromatogram for a specific scan wavelength, type the same
wavelength number in the Min. and Max. Wavelength boxes.
• To display a plot of the average absorbance values for a range of wavelengths, type the
beginning wavelength number in the Min. Wavelength box and the ending
wavelength number in the Max. Wavelength box.
Wavelength Range Specifies whether the node extracts the entire acquired scan or a
wavelength range.
Table 42 describes the parameters for the Create FISh Trace node.
Table 42. Create FISh Trace node parameters (Sheet 1 of 3)
Parameter Description
1. Compound Selection
Compound Specifies the compound that the node uses to generate expected
fragment ions.
Default: True
Individual Traces Specifies whether the node generates individual traces for each
generated fragment ion. You can access the individual traces from
the FISh Trace Fragments result table.
Default: True
Custom Label Type text that you can use to identify the chromatogram in a
report.
Default: True
Max. Depth Specifies the maximum number of steps in the fragmentation
pathway.
Range: 1 to 20
Aromatic Cleavage Specifies whether the node includes a cleavage step in the
fragmentation pathway for highly aromatic structures—that is, for
aromatic structures where n in Huckel’s rule is 2 or higher.
Default: True
Default: 0
Table 43 describes the parameters for the Create Mass Trace node.
Table 43. Create Mass Trace node parameters (Sheet 1 of 2)
Parameter Description
1. General Settings
Trace Type Specifies the chromatogram type to be generated.
Default: BPC
Default: 0
Mass Tolerance Specifies the mass tolerance for the spectral search.
(typed numeric value When you select Da or mmu (0.001 Da) in the units list, the mass
and selected units) tolerance is an absolute ± value for the mass specified in the Mass
box.
When you select ppm (parts per million) in the Units list, the
mass tolerance is a relative range:
Mass ± (Mass × User-specified ppm)/1e6
Default: 3 ppm
Tip For more information about working with isotope patterns, see these topics:
• Set up individual isotope patterns by using the Isotope Ratio Editor
• Specialized Traces table
Table 44 describes the parameters for the Create Pattern Trace node.
Table 44. Create Pattern Trace node parameters (Sheet 1 of 2)
Parameter Description
1. General Settings
Isotope Ratios Displays either the elemental composition of the compound of
interest or the following text: [custom pattern].
You use the Isotope Ratio Editor to set up the isotope pattern for
the compound of interest.
Range: 0.01–100.0
The A0 isotope is always the isotope with the lowest m/z value,
but it is not necessarily the isotope with the highest intensity. For
example, with more than one bromine atom, a bromine and a
chlorine atom, or more than four chlorine atoms, the M + 2 (A2)
isotope is the most intense isotope.
Selections: MS1–MS10
Polarity Specifies the polarity of the mass spectrum.
Note For information about how the node calculates the confirmation score, see “FISh
scoring for proposed structures.”
Default: True
Match Transformations Specifies whether the node matches fragments with
transformation shifts.
Default: True
S/N Threshold Specifies the signal-to-noise threshold for centroids. The node
ignores centroids below this threshold in the fragmentation
(MS/MS, MS3, and so on) spectra.
Default: 3
High Acc. Mass Specifies the mass tolerance for high-resolution mass spectra
Tolerance measured in the Orbitrap mass analyzer of a Thermo Scientific
mass spectrometer.
Default: True
Use Libraries Specifies whether the node uses fragmentation libraries for
fragment prediction.
Default: True
Note Using fragmentation libraries to predict fragments adds
significant time to data processing; however, it also provides
significantly more predicted fragments.
Range: 1 to 20
Aromatic Cleavage Specifies whether the node includes a cleavage step in the
fragmentation pathway for highly aromatic structures—that is, for
aromatic structures where n in Huckel’s rule is 2 or higher.
Default: True
Min. Fragment m/z Specifies the minimum m/z value of a fragment ion to be
generated by the prediction fragmentation pathway.
Default: 50
Using the input from one or more Generate Expected Compounds nodes, the Find Expected
Compounds node looks for expected compounds in the MS1 scans filtered through the data
processing nodes. The expected compounds are the parent compounds that the Generate
Expected Compounds nodes provide to the Find Expected Compounds node, and the
reaction products for these parent compounds. Each Generate Expected Compounds node
predicts the reaction products by using the user-specified Dealkylation step and the
user-specified transformation steps. The Dealkylation step can comprise multiple dealkylation
and dearylation reactions.
The processing results for the Find Expected Compounds node appear in these tables:
Expected Compounds Table, Expected Compounds per File Table, Expected Formulas Table,
and Expected Features Table.
For more information about how the application finds expected compounds, see “Targeted
processing workflows for expected compounds.”
Table 46 describes the parameters for the Find Expected Compounds node.
Table 46. Find Expected Compounds node parameters (Sheet 1 of 3)
Parameter Description
1. General Settings
Mass Tolerance Specifies the mass tolerance that the node uses to create each
extracted ion chromatogram (XIC).
When set to false, the analysis reports the summed areas for all the
isotope peaks in the isotope pattern. The analysis result displays
the TIC for all the detected isotopes.
Default: True
Default: False
Gap Ratio Threshold Does not store (ignores) chromatographic peaks from XIC traces
(advanced) that have a gap ratio exceeding the specified threshold. The gap
ratio for an XIC trace is the number of missing data points (gaps)
to the total number of data points for the XIC trace.
Default: 5; range: 3 to 20
Using the input from one or more Generate Expected Compounds nodes, the Find Expected
Compounds node looks for expected compounds in the MS1 scans filtered through the data
processing nodes. The expected compounds are the parent compounds that the Generate
Expected Compounds nodes provide to the Find Expected Compounds node, and the
reaction products for these parent compounds. Each Generate Expected Compounds node
predicts the reaction products by using the user-specified Dealkylation step and the
user-specified transformation steps. The Dealkylation step can comprise multiple dealkylation
and dearylation reactions.
The processing results for the Find Expected Compounds node appear in these tables:
Expected Compounds Table, Expected Compounds per File Table, Expected Formulas Table,
and Expected Features Table.
For more information about how the application finds expected compounds, see “Targeted
processing workflows for expected compounds.”
Table 46 describes the parameters for the Find Expected Compounds node.
Table 47. Find Expected Compounds (Legacy) node parameters (Sheet 1 of 3)
Parameter Description
1. General Settings
Mass Tolerance Specifies the mass tolerance that the node uses to create each
extracted ion chromatogram (XIC).
When set to false, the analysis reports the summed areas for all the
isotope peaks in the isotope pattern. The analysis result displays
the TIC for all the detected isotopes.
Default: True
Minimum Peak Specifies the minimum apex intensity, in counts, of the detected
Intensity chromatographic peak. The node discards chromatographic peaks
below this intensity threshold.
The default transformations library contains common Phase 1 and Phase 2 metabolic
transformations. If the transformation list does not include the possible transformations for
your compound, add them to the transformations library as described in “Add or edit
transformations with the Transformation Editor.”
When you add transformations to the library, you can assign them to one of the following
groups: Phase I, Phase II, or Others. The Generate Expected Compounds node treats
transformations assigned to the Others group as Phase I transformations; that is, it applies
Phase I and Others transformations before it applies Phase II transformations.
To predict the transformation products for the selected parent compounds, the Generate
Expected Compounds node follows these rules:
1. When the user enables Dealkylation, apply the dealkylation steps first. If a subsequent
transformation reverses the dealkylation step, reject the subsequent transformation. When
the user enables both Dealkylation and Dearylation, apply both of these steps first, and
then determine two separate reaction pathways for the remaining transformation steps.
Consider all steps under Dealkylation together as one step. For example, consider the
selections shown in Figure 83 as one step in the total set of reaction pathways and create
separate reaction pathways. Apply the transformation steps on the parent compound and
the reaction products from the dealkylation pathways.
Figure 83. Dealkylation step example
Parent compounds
2. When more than one reaction pathway produces the same elemental composition, use the
pathway with the lowest number of transformation steps.
3. Reject transformations that remove elements that are not present. For example, do not
apply an oxidative dechlorination step if the compound does not contain chlorine.
4. For Phase I and Others transformations, limit the maximum number of times to apply
the transformation on a single compound to the lower of these two values:
– Max Occurrence setting for the transformation in the Transformations library
For example, for the oxidation transformation, the default value for Max Occurrence
is 3.
– Maximum number of steps specified by the node’s Max. # All Steps parameter minus
any previously applied Dealkylation step.
5. For Phase II transformations, limit the maximum number of times to apply the
transformation on a single compound to the lowest of these three values:
– Max Occurrence setting for the transformation in the Transformations library
– Maximum number of steps for all reactions (setting for node’s Max. # All Steps
parameter) minus any previously applied Dealkylation or Transformation step
– Maximum number of steps for a Phase II transformation (setting for node’s Max. #
Phase II parameter)
You can connect one or more Generate Expected Compounds nodes to the Find Expected
Compounds node and the Filter By Mass Defect node.
Table 48 describes the parameters for the Generate Expected Compounds node. The
application cannot use the processing workflow until you select a compound from the
Compound list.
If you want to generate expected compounds for more than one parent compound, do the
following:
• To target multiple compounds with a different set of transformation rules for each
compound, add multiple Generate Expected Compounds nodes to the processing
workflow.
• To target multiple compounds with the same set of transformation rules, add one
Generate Expected Compounds node to the processing workflow and select multiple
compounds in the node’s Compound list.
Default: True
Apply Dearylation When you select True, the node applies the dearylation
transformations for the specified compound before applying other
transformations.
Default: False
Parent compound
Products of Products of
one dearylation one dearylation
Parent compound
step step
Default: 200
3. Transformations
Phase I Specifies the set of possible Phase 1 transformations.
Table 49 describes the parameters for the Group Expected Compounds node.
Table 49. Group Expected Compounds node parameters (Sheet 1 of 3)
Parameter Description
1. General Settings
RT Tolerance [min] Specifies the retention time tolerance, in minutes, that the node
uses to group mass peaks generated from the same parent
compound through the same reaction pathway.
Default: False
Preferred Ions Select the preferred ions (adducts) from the list. The application
uses the list to select the best fragmentation data for each
compound to submit to an mzCloud or mzVault search.
Selection: The selection list includes all the ion definitions in the
Ion Definitions list under Lists & Libraries.
Area Integration Specifies which ions the node uses to determine the
chromatographic peak areas.
For the Most Common Ion selection, the node uses the
chromatographic peak area of the most common adduct ion
detected across the input file set.
For the All Ions selection, the node sums the areas of all the
adduct ions detected in a particular input file.
These parameter settings determine how the node calculates the peak rating values for
chromatographic peaks. See “Chromatographic peak rating filter.”
(Contribution value for the individual parameter/Divided by the sum of all the contribution
values)×10.
Area Contribution Specifies the chromatographic peak area contribution to the peak
rating.
Default: 3
CV Contribution Specifies the contribution of the coefficient of variation for the
peak areas across replicate samples to the peak rating. If an analysis
includes no replicates, the node sets the CV contribution to 0
during processing.
Default: 10
Default: 5
Jaggedness Specifies the jaggedness contribution to the peak rating.
Contribution
Default: 5
Modality Contribution Specifies the modality contribution to the peak rating.
Default: 5
Zig-Zag Index Specifies the zig-zag index contribution to the peak rating.
Contribution
Default: 5
3. Peak Rating Filter
The node does not store chromatographic peaks that have a peak rating below the threshold
value in the specified number of files.
When either or both of the peak rating parameters are set to 0, this filter is not enabled.
Default: 0; range: 0 to 10
Number of Files Specifies the minimum number of files across the input set where
the chromatographic peak for a compound must meet the peak
rating threshold to be stored in the result file.
The optimum value depends on the number of files in the data set
and the probability of similar compounds in these files.
When there is a very low probability that the samples contain the
same compounds, set this value to 0. For replicate samples that
probably contain the same compounds, set this value to the
number of replicates or one less than the number of replicates.
The node ignores this value unless you specify a nonzero peak
rating threshold.
Default: 0
Default: 25
Source Efficiency [%] Specifies the isotopic purity of the labeled compound introduced
into the biological system.
Default: 100
2. Pattern Analysis
Mass Tolerance [ppm] Specifies the mass tolerance for the isotope search and XIC trace
generation.
Default: 5 ppm
Intensity Tolerance [%] Specifies the relative intensity tolerance for isotope pattern
matching.
Default: 30
Intensity Threshold [%] Specifies the isotope intensity threshold, relative to the pattern’s
base peak, for the theoretical isotope pattern.
Default: 0.1
Default: 3
3. General Settings
Mark Irregular Exchange Adds the Irregular Exchange status to the following result table
columns: Labeling Status column of the Compounds Table and
the Status column of the Compounds per File Table.
Default: True
Exclude Blanks Specifies whether to exclude blank samples (Sample Type:
Blank) from the main Compounds table.
Default: True
Hide Unprocessed Specifies whether to hide compounds with unassigned formulas
in the Compounds table. The node does not process compounds
without formulas.
Default: True
The processing results for the Detect Compounds node appear in the Compounds per File
table and the Features per File table.
Table 52 describes the parameters for the Detect Compounds (Legacy) node.
Table 52. Detect Compounds (Legacy) node parameters (Sheet 1 of 4)
Parameter Description
1. General Settings
Mass Tolerance [ppm] Specifies the mass tolerance for the XIC traces.
Default: 10 000
Ions Specifies the adduct ions that might be in your samples.
The selection of base ions includes the ammonium adduct because extractable and
leachable compounds tend to be nonpolar compounds that require ammonium acetate as
a mobile phase modifier for optimal chromatography.
Min Element Counts Specifies the minimum number of each possible element. The
node uses these values for the isotope pattern search.
Default: C H
Default: 5; range: 3 to 20
Use Most Intense Specifies whether the node reports the chromatographic peak area
Isotope Only for the most intense isotope peak within an isotope pattern or for
all the detected isotope peaks in the isotope pattern.
When set to false, the analysis reports the summed areas for all the
isotope peaks in the isotope pattern. The Chromatograms view in
the analysis result displays the TIC for all the detected isotopes.
Default: True
2. Trace Detection (advanced parameters)
Max. Number of Gaps Specifies the maximum number of contiguous missing values in
to Correct the XIC trace for the compound. You can set the maximum
number of allowed contiguous missing values from 0 to 3.
When the XIC trace includes more than the maximum number of
allowed contiguous missing values (gaps), the node does not store
the trace in the analysis result.
Default: 2; range: 0 to 3
Min. Number of Specifies the minimum number of non-zero values adjacent to a
Adjacent Non-Zeroes gap before the node fixes the gap.
Default: 2; range: 1 to 3
3. Peak Detection (advanced parameters)
Filter Peaks Specify whether you want to turn on the chromatographic peak
filtering parameters.
Default: True
Default: True
4. Isotope Pattern Detection (advanced parameters)
Min. #Isotopes Specifies the minimum number of isotopes in the isotope pattern
are required in the mass spectrum scans that define the
chromatographic peak.
Default: 2
Use Peak Quality for Specifies whether the node filters out low-quality
Isotope Grouping chromatographic peaks before it groups the XIC traces for ions in
an isotope pattern.
When set to True, the node uses a two-stage approach for isotope
grouping. In the first stage, the node groups only high-quality
peaks that pass the peak quality thresholds. In the second stage,
the node groups the remaining chromatographic peaks.
When set to False, the node does not consider the peak quality
thresholds for isotope grouping.
Default: False
Filter out Features with Specifies whether to remove a feature when all the
Bad Peaks Only chromatographic peaks for the feature across the input file set are
low-quality chromatographic peaks that do not pass the peak
quality thresholds.
Default: True
See “Chromatographic peak rating filter.”
Zig-Zag Index Specifies the zig-zag index threshold for the chromatographic
Threshold peaks.
Default: True
5. Skipped
6. AcquireX Settings (advanced)
Detect Persistent When set to True, this parameter generates a related Background
Background Ions Ions table for each input file. This parameter is not relevant for
typical operation of the Compound Discoverer application. Its use
is currently limited to the AcquireX™ data acquisition workflow.
Default: False
Table 53 lists the recommended range for the minimum peak intensity parameter. The
optimal setting depends on the sensitivity of the mass spectrometer.
Table 53. Recommended minimum peak intensity range
Minimum peak intensity
Mass spectrometer
(chromatographic peak height)
Q Exactive, Q Exactive Plus, and Q Exactive HF 500 000 to 1 000 000
Orbitrap Fusion, Orbitrap Lumos, and Orbitrap ID-X 50 000 to 100 000
Default: 10 000
Min. #Scans per Peak Specifies the minimum number of scans (data points) required to
(advanced) define a chromatographic peak.
Default: 5; range: 3 to 20
When set to false, the analysis reports the summed areas for all the
isotope peaks in the isotope pattern. The analysis result displays
the TIC for all the detected isotopes.
Default: True
2. Trace Detection (advanced parameters)
A gap is a missing data point in the XIC trace for a compound. If the XIC trace for a
compound contains more than the specified number of contiguous gaps, the node ignores
the chromatographic peaks for this XIC trace.
Max. Number of Gaps Specifies the maximum number of contiguous missing values in
to Correct the XIC trace for the compound. You can set the maximum
number of allowed contiguous missing values from 0 to 3.
Default: 2; range: 0 to 3
Min. Number of Specifies the minimum number of non-zero values adjacent to a
Adjacent Non-Zeroes gap before the node fixes the gap.
Default: 2; range: 1 to 3
3. Peak Detection
Chromatographic S/N Filters out all chromatographic peaks below the specified
Threshold threshold.
Default: False
The node automatically groups the various isotopic ions for the
elements C, H, N, O, and S.
When set to True, the node uses a two-stage approach for isotope
grouping. In the first stage, the node groups only high-quality
peaks that pass the peak quality thresholds. In the second stage,
the node groups the remaining chromatographic peaks.
When set to False, the node does not consider the peak quality
thresholds for isotope grouping.
Default: True
Default: True
See “Chromatographic peak rating filter.”
Zig-Zag Index Specifies the zig-zag index threshold for the chromatographic
Threshold peaks.
The selection of base ions includes the ammonium adduct because extractable and
leachable compounds tend to be nonpolar compounds that require ammonium acetate as
a mobile phase modifier for optimal chromatography.
Default: True
6. AquireX Settings (advanced)
Detect Persistent When set to True, the node detects and stores persistent
Background Ions background ions.
Default: False
For information about adding the Fill Gaps node to a processing workflow, see “Peak area
refinement node connections.”
The Fill Gaps node adds the (hidden) Gap Status column to the Compounds Table and
creates the related (hidden) Filled Gaps Table.
The Filled Gaps table describes how the node calculated the missing chromatographic peak
areas. Clicking a row in the Filled Gaps table displays the gap-filled trace and the integrated
peak area.
Default: False
Preferred Ions Select the preferred ions (adducts) from the list. The application
uses the list to select the best fragmentation data for each
compound to submit to an mzCloud or mzVault search.
For the Most Common Ion selection, the node uses the
chromatographic peak area of the most common adduct ion
detected across the input file set.
For the All Ions selection, the node sums the areas of all the
adduct ions detected in a particular input file.
These parameter settings determine how the node calculates the peak rating values for
chromatographic peaks. See “Chromatographic peak rating filter.”
(Contribution value for the individual parameter/Divided by the sum of all the contribution
values)×10.
Area Contribution Specifies the chromatographic peak area contribution to the peak
rating.
Default: 3
CV contribution Specifies the contribution of the coefficient of variation for the
peak areas across replicate samples to the peak rating. If an analysis
includes no replicates, the node sets the CV contribution to 0
during processing.
Default: 10
FWHM to Base Specifies the FWHM to base contribution to the peak rating.
Contribution
Default: 5
Jaggedness Specifies the jaggedness contribution to the peak rating.
Contribution
Default: 5
Modality Contribution Specifies the modality contribution to the peak rating.
Zig-Zag Index Specifies the zig-zag index contribution to the peak rating.
Contribution
Default: 5
3. Peak Rating Filter
The node does not store chromatographic peaks that have a peak rating below the threshold
value in the specified number of files.
When either or both of the peak rating parameters are set to 0, this filter is not enabled.
For more information about the peak rating filter, see “Chromatographic peak rating filter.”
Default: 0; range: 0 to 10
Number of Files Specifies the minimum number of files across the input set where
the chromatographic peak for a compound must meet the peak
rating threshold to be stored in the result file.
The optimum value depends on the number of files in the data set
and the probability of similar compounds in these files.
When there is a very low probability that the samples contain the
same compounds, set this value to 0. For replicate samples that
probably contain the same compounds, set this value to the
number of replicates or one less than the number of replicates.
The node ignores this value unless you specify a nonzero peak
rating threshold.
Default: 0
The Apply Missing Value Imputation node provides two methods for imputing missing
chromatographic peak areas for compounds: Random Forest and Median + Small Value.
• The Random Forest imputation method consists of a self-trained machine learning
system.
• The Median + Small Value imputation method is a mixed method:
– (Partially missing values) For study groups where a compound is detected in at least
one of the samples, but not all the samples, the node imputes an area value for each
value by using the mean of the values that are present in the same study group.
– (Full gap) If the compound is not present in any of the samples in a study group, the
node imputes area values for the compound in each sample by dividing the smallest
detected peak area for any compound in a sample by 2.
Table 57. Missing Value Imputation node parameters (Sheet 1 of 2)
Parameter Description
1. General Settings
Imputation Method Specifies the imputation method for missing chromatographic
peaks across a set of input files.
Selections:
• (Default) Automatic Selection—The application
automatically uses the Median + Small Value with Variability
imputation method when the analysis includes three or more
study groups and the deconvolution node detects 50 or more
compounds. If these conditions are not met, the application
uses the Random Forest imputation method. For information
about defining the study groups, see “Set up the sample
groups and ratios for a new analysis.”
• Median + Small Value with Variability
• Random Forest Imputation
Fill Blanks with Min Specifies whether the node fills the chromatographic peak for a
Value compound with a minimum value in blank samples.
Default: False
2. Random Forest Settings
When the processing workflow includes the Apply QC Correction node and the input file set
includes QC samples, an analysis uses the QC samples to create a linear regression or cubic
spline curve of area versus acquisition time for each detected compound. For more
information, see Using quality control samples to compensate for batch effects.
The Apply QC Correction node adds the QC columns and the Norm. Area column to the
compounds table (Compounds table). You can view the correction curves in the Compound
Area Corrections view and the QC corrected areas for the compounds in the Normalized Area
column. For more information, see “Compound Area Corrections view.”
The analysis uses the QC samples to create a regression curve for each detected compound.
Unless all three of the following conditions are met, the analysis does not create a regression
curve for a particular compound or does not correct the areas in the non-QC samples:
• It detects the compound in the specified minimum percentage of QC samples.
• The relative standard deviation of the detected peak areas for the compound in the QC
samples does not exceed the specified threshold.
• The number of samples acquired between the QC samples does not exceed the specified
number.
If the analysis does not create a regression curve for a compound, it does not perform a
batch normalization of the peak areas for the compound in the non-QC samples (Sample
type: Sample, Control, or Standard).
Default: Linear
The Min. QC Coverage Max. [%], QC Area RSD [%], and Max. Corrected QC Area RSD
[%] parameters act as display filters for the compounds table.
When the compounds table includes compounds that the analysis was unable to normalize,
a filter icon appears on the table’s tab. Pointing to the tab displays the number of
compounds displayed, the total number of compounds detected, and the number of
compounds filtered out.
Min. QC Coverage [%] Specifies the minimum percentage of the QC samples where the
analysis must detect a particular compound before it creates a
regression curve for the compound. If the coverage falls below
this value, the analysis does NOT perform a batch normalization
of the peak areas for the compound in the non-QC samples.
The Apply SERRF QC Correction node takes input from the Group Compounds node or the
Fill Gaps node. It creates the QC correction columns in the Compounds table and the Batch
column in the Input Files table.
For information about how the Apply SERRF QC Correction node creates the time-based
normalization curve for the chromatographic peak areas, see “Batch normalization for
multiple sequence runs (LC studies).”
Table 59. Apply SERRF QC Correction node parameter descriptions
Parameter Description
1. General Settings
Min. QC Coverage [%] Specifies the minimum percentage of the QC samples where
the analysis must detect a particular compound before it
creates a regression curve for the compound. If the coverage
falls below this value, the analysis does NOT perform a batch
normalization of the peak areas for the compound in the
non-QC samples.
Default: 2; range 1 to 50
Interpolate Gap-filled QC When set to True, the node discards the gap-filled areas for
Areas the QC samples and instead uses the mean of the
non-gap-filled areas for these compounds.
Default: False
IMPORTANT As you drag the Fill Gaps and the Normalize Areas nodes into the Workflow
Tree pane, the application automatically connects the Group Compounds node to the Fill
Gaps node, and the Fill Gaps node to the Normalize Areas node. The application does not
connect the Mark Background Compounds node to other nodes, so you must manually
make the appropriate connections.
For more information about the node connections, see “Peak area refinement node
connections.”
Table 60 describes the parameters for the Mark Background Compounds node.
Default: 5
Default: 0 (The node does not use this parameter to mark background
compounds.)
Hide When the Hide Background parameter is set to True, the tab for the
Background compounds table includes a filter icon ( ), and the compounds that fall
below the threshold are hidden. Clicking the filter icon displays the
filtered compounds.
Default: True
Thermo Fisher Scientific does not recommend using this node. To compensate for batch
effects, add quality control samples to the batch and a QC correction workflow node to the
processing workflow.
When the Normalize Areas node is part of a processing workflow, the Use Normalized Areas
check box is available in the Descriptive Statistics view and the Principal Component Analysis
view. The Normalize Areas node adds the Norm. Area column to the compounds table.
IMPORTANT When both the Normalize Areas and Mark Background Compounds nodes
are part of a processing workflow, do the following:
• If the analysis includes solvent blanks (Blank sample type), connect the Mark
Background Compounds node to the Normalize Areas node.
• If the analysis includes matrix blanks (Blank sample type), connect the Normalize
Areas node to the Mark Background Compounds node.
Selections:
• Constant Sum
• Constant Mean
• Constant Median
• Median Absolute Deviation (MAD)
Tip For best results, select Constant Mean for input file sets
that include matrix blanks.
Default: True
Tip For best results, follow these guidelines:
• Select True for solvent blanks to exclude them from the
normalization process. If you do not exclude the solvent
blanks from the normalization process, the Fill Gaps node
adds small “noise” peaks to the solvent blanks for each
detected compound in the input file set. The Normalize
Areas node magnifies these small peaks, causing the
sample-to-blank ratio to fall below the user-specified value
in the Mark Background Compounds node. The Mark
Background Compounds node then hides most of the
detected compounds across the input file set (marks them
as background compounds).
• Select False for matrix blanks, such as plasma and urine, as
these blanks typically contain a large number of
compounds that you might want to hide.
Table 62 describes the parameters for the Scale Areas node. If your processing workflow
includes both of the grouping nodes, and you want to scale the output from both of these
nodes, you must connect each grouping node to a separate Scale Areas node.
Figure 84. Connections to Scale Areas node
Multiplies the peak area for a compound by the value for the
selected numerical study factor.
Default: True
Tip For best results, follow these guidelines:
• Select True for solvent blanks to exclude them from the
scaling process.
• Select False for matrix blanks, such as plasma and urine, as
these blanks typically contain a large number of
compounds that you might want to scale.
Table 63 describes the parameters for the Assign Compound Annotations node.
Table 63. Assign Compound Annotation node parameters (Sheet 1 of 2)
Parameter Description
1. General Settings
Mass Tolerance Specifies the mass tolerance for validating the annotations.
Available sources: mzCloud Search, mzVault Search, BioCyc Search, Mass List Match,
Metabolika Search, ChemSpider Search, and Predicted Compositions
Data Source #1 Specifies the primary source for the compound annotations.
Default: Empty
Default: True
Use Spectral Distance When set to True, uses the SFit score from the Apply Spectral
Distance node to select the best candidate.
Uses the SFit Threshold and SFit Range values to limit the
number of valid candidates.
Default: True
SFit Threshold Specifies the minimum SFit score for a candidate.
When set to False, the node does not remove the assigned
annotations when you reprocess the analysis. If you modify the
identification workflow nodes, these nodes can overwrite the
existing annotations. If you remove the identification nodes
responsible for the initial annotations, these annotations remain in
the final analysis result.
Default: False
Default: C H
Max. Element Counts Specifies the maximum count for each element in the
hypothetical compound. If an element is not listed, its maximum
count is the same as its minimum count.
imax
Ni ( Vi – 2 )
i
D = 1 + ------------------------------------------
2
where:
• D is the value for the RDB equivalents
• imax is the total number of different elements in the
composition
• Ni is the number of atoms of element i
• Vi is the valence of atom i
Min. H/C Specifies the minimum hydrogen-to-carbon ratio.
Default: 0.1
Default: 3.5
Im
------------- × 100 = Min. Pattern Cov. [%]
It
where:
Im = the matching isotope peaks
It = the theoretical isotope peaks
Default: True
3. Fragments Matching
Use Fragments Specifies whether the application uses the fragment matching
Matching algorithm, which ranks the identified candidates (chemical
formulas) by the number of matching centroids (with an
m/z value that matches a subset of the elemental composition for
a particular candidate) in the fragmentation scan for the
precursor ion.
Default: True
Mass Tolerance Specifies the mass tolerance for matching the centroids in the
fragmentation scans to the m/z values for the expected fragments.
When the processing workflow includes a ChemSpider search, the processing computer must
have Internet access. To verify whether the processing computer can access the ChemSpider
database, run the Communication test. See Chapter 17, “Test communication to the online
databases.”
Default: KEGG
Search Mode Default: By Formula or Mass
Default: False
Table 66 describes the parameters for the Mass List Search node.
Default: True
RT Tolerance [min] Specifies the retention time tolerance, in minutes, for the search.
The node searches for matching peaks in a retention time window
equal to the expected retention time plus or minus the specified
RT tolerance.
Default: 2; range: 0 to 10
Mass Tolerance Specifies the mass tolerance that the node uses to search for
matching compounds.
Table 67 describes the table columns in the Select Input Files dialog box.
Table 67. Select Input File(s) dialog box parameters
Table column Description
Selected Selecting the check box for a mass list adds the list to the search.
Filename Displays the file name of the imported file.
Description Editable text field.
Clicking the table cell displays a text entry field for typing a name
or a description of the mass list. Use this column to name and sort
your mass lists.
File Size Displays the file size of the imported file.
Uploaded Displays the date (month/day/year) and time (hour/minute) when
you added the file to the library in the following format:
MM/DD/yyyy HH:mm
Updated Displays the date and time when the file was updated.
Context Displays the source of the mass list—for example, Import from
CSV or Import from XML.
State Specifies whether the mass list is available, corrupted, or missing.
If you remove a mass list file from the ServerFiles folder and restart
the computer, the state of the file changes to Missing. If you edit a
mass list file in the ServerFiles Folder and restart the application,
the state of the file changes to Corrupted.
For LC studies, the Search mzCloud node requires input from the Group Compounds node
or the Group Expected Compounds table.
Note In addition to running an automated search with an analysis, you can manually
submit a fragmentation scan to the mzCloud database from the Mass Spectrum view for
an active result file.
When the processing workflow includes an mzCloud search, the processing computer must
have Internet access. To verify whether the processing computer can access the mzCloud
database, run the Communication tests.
Default: All
Precursor Mass Tolerance Specifies the tolerance for the precursor mass.
Selections:
• Filtered—Removes extraneous mass peaks that do
not match the theoretical mass spectrum.
• Recalibrated (Default)—The mass peaks for the
known ion fragments are recalibrated to match the
theoretical mass spectrum by a series of manually
supervised ion calibration steps.
Displays the hits with the highest match score above the
cutoff storage number in the result file.
Default: False
Search MSn Tree Specifies whether the analysis submits the full MSn
spectral tree for the database search.
Default: False
Selections:
• HighChem DP—Alternative score using a modified
dot product algorithm that includes a “hard penalty”
function for spectra with a low number of fragment
ions.
• HighChem HighRes—Use for high-resolution data.
Correlation algorithm that uses the geometric mean
of a modified Spearman’s rank order correlation to
separately determine the intensity and m/z accuracy
of the fragment ions.
• NIST
• Cosine—Dot product algorithm with fragment
intensities weighted by 0.75 and no weighting on
fragment m/z values.
Default: True
Match Activation Energy Specifies whether to search for fragmentation spectra
generated by the same ion activation energy within a
tolerance or by any ion activation energy.
Default: True
Similarity Search Specifies the similarity search algorithm.
Selections:
• None (Default)—Does not run a similarity search.
• Confidence Forward— Uses the dot product of the
forward search match and distribution of peaks, and
the ratio of the most intensive matching peaks as part
of the similarity score. A higher similarity score
indicates the extent to which the unknown
component resembles the library compound.
• Confidence Reverse—Uses the dot product of the
reverse search match and distribution of peaks, and
the ratio of the most intensive matching peaks as part
of the similarity score. A higher similarity score
indicates the extent to which the library compound
resembles the unknown component.
• Similarity Forward—Searches for a match between
the best fragmentation scan for a compound (across
the input file set) and a fragmentation scan in the
mzCloud database. Unlike the Identity Search, this
search ignores the m/z value of the precursor ion.
• Similarity Reverse—Searches for a match between the
fragmentation scans in the mzCloud database and the
best fragmentation scan for a compound (across the
input file set).
Match Factor Threshold Specifies the minimum match factor for reporting a
spectrum match.
Default: False
Note Keep the default setting of False for GC CI data.
Max. Isolation Width [Da] Specifies the maximum MS2 isolation width. The
isolation width for a scan is listed in its scan header.
Default: False
Match Activation Energy See Match Activation Energy under DDA Search.
Default: Any
Activation Energy Tolerance See Activation Energy Tolerance under DDA Search.
Default: 100
Apply Intensity Threshold See Apply Intensity Threshold under DDA Search.
Default: False
Match Factor Threshold See Match Factor Threshold under DDA Search.
Default: 20
For LC studies, this node requires input from the Group Compounds node or the Group
Expected Compounds node. And it can process compounds from both of these nodes in one
processing workflow.
For GC studies, this node requires input from the GC CI Deconvolution node.
The node creates the mzVault Results table. See “mzVault Results table.”
Y To use mzVault libraries created with the mzVault 1.1 or earlier application
Default: All
Match Ion Activation Type Specifies whether to only search for library scans that
match the ion activation type of the query spectrum.
Default: True
Match Ion Activation Energy Specifies whether to search for library scans generated by
the same ion activation energy within a tolerance of the
query spectrum or by any ion activation energy.
Default: True
Default: True
Remove Precursor Ion When set to True, the search ignores (removes from
consideration) mass peaks within 2.2 Da of the precursor
ion’s m/z value in the query spectrum—that is, the match
score is not negatively affected if the library spectrum
does not include a mass peak within 2.2 Da of the
m/z value for the precursor ion.
Default: True
Precursor Mass Tolerance Specifies the tolerance for the precursor mass.
Default: True
Selections:
• HighChem DP—Alternative score using a modified
dot product algorithm that includes a “hard penalty”
function for spectra with a low number of fragment
ions.
• HighChem HighRes—Use for high-resolution data.
Correlation algorithm that uses the geometric mean
of a modified Spearman’s rank order correlation to
separately determine the intensity and m/z accuracy
of the fragment ions.
• NIST
Displays the hits with the highest match score above the
cutoff storage number in the result file.
Default: 10
Default: False
The Map to BioCyc Pathways node adds the following items to the result file:
• The BioCyc Pathways and BioCyc Results main tables
• The #BioCyc Pathways and BioCyc Pathways columns in the main compounds table
• The BioCyc Compound IDs, BioCyc Compound Names, and BioCyc Compound
Formula columns in the related compounds table and the related BioCyc Pathways table.
Table 71 describes the parameters for the Map to BioCyc Pathways node.
Table 70. Map to BioCyc Pathways node parameters (Sheet 1 of 2)
Parameter Description
1. Search Settings
BioCyc Database/Organism Specifies the databases for the search.
to be Searched
Use the Map to KEGG Pathways node to add explanations to the result file about the reaction
pathways for each detected compound.
The node requires the following input— a list of molecular weights or chemical formulas. For
LC studies, the KEGG node requires input from the Group Compounds node, the Group
Expected Compounds node, or both of these nodes.
The Map to KEGG Pathways node adds the following items (output) to the result file:
• The KEGG Pathways table
• The KEGG Compound IDs, KEGG Compound Names, and KEGG Compound
Formula columns in the related compounds table
• The #Pathways and Pathways columns in the main compounds table
When the processing workflow includes the Map to KEGG Pathways node, the processing
computer must have Internet access. To verify whether the processing computer can access the
KEGG Pathways database, run the communication test as described in Chapter 17, “Test
communication to the online databases.”
Table 71 describes the parameters for the Map to KEGG Pathways node.
Table 71. Map to KEGG Pathways node parameters
Parameter Description
1. Search Settings
Search Mode Specifies the search mode.
Note 378 Metabolika pathway files are automatically installed with the application. You
can edit these pathways or create your own pathways by using the Metabolika pathway
editor. For details, see “Metabolika Pathways view.”
Table 72 describes the parameters for the Map to Metabolika Pathways node.
Table 72. Map to Metabolika Pathways node parameters
Parameter Description
1. Search Settings
Metabolika Pathways Specifies the Metabolika pathways to search.
Default: All
Search Mode Specifies whether to search by formula, mass, or both.
The Calculate Mass Defect, Compound Class Scoring, Pattern Scoring, and Search Neutral
Losses nodes require input from the Group Compounds node. The Generate Molecular
Networks node requires input from the Assign Compound Annotations node, and the Apply
mzLogic and Apply Spectral Distance nodes take input from the Search ChemSpider node.
Table 73 describes the parameters for the Apply Spectral Distance node.
Table 73. Apply Spectral Distance node parameters
Parameter Description
1. General Settings
Mass Tolerance Specifies the mass tolerance that the node uses to search for
matching mass peaks.
Default: 0.1
S/N Threshold Specifies the signal-to-noise threshold for the isotope search. The
node does not include isotopes that are expected to be below this
threshold in the SFit score—that is, these isotopes are not required
in the measured isotope pattern.
Default: 3
Use Dynamic Specifies whether to shift the theoretical isotope pattern if the
Recalibration pattern base peak in the query spectrum is shifted.
Default: True
Range: 5 to 100
Match Factor Threshold Specifies the minimum match score returned for a compound by
an mzCloud similarity search. The analysis ignores compounds
with match scores below this threshold.
Default: 30
Advanced parameters
FT Fragment Mass Specifies the mass tolerance for the mass peaks in
Tolerance high-resolution fragmentation spectra when searching the
mzCloud spectral database.
Table 37 describes the parameters for the Filter By Mass Defect node.
Table 75. Filter By Mass Defect node parameters
Parameter Description
1. Mass Defect
Fractional Mass Fractional Mass = exact mass – floor (exact mass)
Standard Mass Standard Mass Defect = exact mass – nominal mass
Defect
Relative Mass Relative Mass Defect = 1e6 × (exact mass – nominal mass)/exact mass
Defect
Kendrick Mass Kendrick Mass Defect = Kendrick mass –nominal Kendrick mass
Defect
Kendrick mass – nominal Kendrick mass
where:
Kendrick mass = a × (b/c)
a = exact mass of the elemental composition
b = nominal mass of the Kendrick formula
c = exact mass of the Kendrick formula
Selections:
• Floor rounds down.
• Ceiling rounds up.
• Round rounds to the nearest integer value.
2. Kendrick Formulas
When you select Kendrick Mass Defect as the Mass Defect Type, this user-specified
elemental composition specifies the Kendrick formula.
Formula 1–5 Use to add Kendrick formulas.
In a processing workflow, connect the Group Compounds node to the Compound Class
Scoring node.
When you add the Compound Class Scoring node to a processing workflow, you must select
the compound class fragment lists.
For information about adding compound class fragment lists to the Compound Classes
library, see “Compound Classes view.”
Table 76 describes the parameters for the Compound Class Scoring node.
Table 76. Compound Class Scoring node parameters (Sheet 1 of 2)
Parameter Description
1. General Settings
Compound Classes Select the compound classes that you want to use for the fragment
search.
Default: 50
Default: True
Note For LC studies, the Assign Compound Annotations node connects to the Generate
Molecular Networks node.
Table 77 describes the parameters for the Generate Molecular Networks node.
Table 77. Generate Molecular Networks node parameters (Sheet 1 of 4)
Parameter Description
1. Spectral Similarity
These parameters define how fragmentation spectra are compared to determine the
similarity score between two compounds.
Use Full MSn Tree Specifies whether the node determines the spectral similarity from
the full MSn tree or only from the MS2 spectra.
Default: True
Match Transformations For fragments that are not a direct match by mass, specifies
whether the shift to the fragment’s mass must match the mass of
the assigned transformation between the two compounds.
Default: True
Variate For fragments that are not a direct match by mass, specifies
Transformations whether the masses of all the variations of the individual steps of
the assigned transformation between the two compounds are used
as the expected fragments shifts. Fragments shifted by these masses
are considered as matching.
Default: False
S/N Threshold Specifies the signal-to-noise threshold for the centroids in the
fragmentation spectra. The node ignores centroids below the
signal-to-noise threshold—that is, the node attempts to match
only centroids with m/z values above the threshold.
Default: 3
Mass Tolerance Specifies the mass tolerance to use for fragments matching.
Default: 50
2. Transformations
Phase I Specifies the set of possible single-step Phase 1 transformations.
Default: 1; range: 1 to 10
Max. # All Steps Specifies the maximum number of all steps to be applied.
Default: 3; range: 1 to 10
3. Applied View Filters
These parameters filter out (hide) connections with lower confidence in the Similar
Compounds table for a compound. This filtering has no effect on the stored data—that is,
you can click the filter icon on the Similar Compounds tab to undo all of these filters.
Require When set to True, similar compounds without assigned
Transformation transformations are hidden.
Default: True
Require MSn When set to True, similar compounds without a spectral similarity
score are hidden.
Min. MSn Score Specifies the minimum MSn Score value for a similar compound.
If a similar compound has no fragmentation data, the application
does not apply this filter.
Min. MSn Coverage Specifies the minimum Forward or Reverse Coverage value for a
connection. The connection is hidden only if both values are
below this threshold. If a similar compound has no fragmentation
data, the application does not apply this filter.
Min. # Fragments Specifies the minimum Forward or Reverse Matches value for a
connection. The connection is hidden only if both values are
below this threshold. If a similar compound has no fragmentation
data, the application does not apply this filter.
Default: 3
These parameters define the filtering rules for storing connections. The application does not
store connection that fall below these thresholds. The main purpose of these filters is to
reduces the amount of stored data by removing low confidence connections.
Require If set to True, removes connections without assigned
Transformation transformations.
Default: False
Require MSn If set to True, removes connections without a spectral similarity
score.
Default: False
Min MSn Score Specifies the minimum MSn Score value for a stored connection.
If the connection has no fragmentation data, the application does
not apply this filter and stores the connection if it passes the other
filters.
Default: 0
The Pattern Scoring node adds the Pattern Matches column to the compounds table and
creates the related Matched Patterns Table.
Range: 0 to 100%
Post-Processing nodes
These nodes provide additional information about the detected or found compounds.
• Descriptive Statistics node
• Differential Analysis node
To store the descriptive statistics in the result file, the processing workflow must include the
Group Expected Compounds node for targeted workflows and the Group Compounds node
for untargeted workflows.
Note The Descriptive Statistics node generates the descriptive statistics for individual
compounds; it has no effect on the Descriptive Statistics view that is available for a result
file.
This node requires input from a compounds node for a sample set with replicate data points
and generated ratios. If the Grouping & Ratios page of an analysis does not contain generated
ratios, the following confirmation message appears:
No quan ratios defined in ‘Grouping & Ratios’ tab
Default: True
2. Peak Rating Contributions
These parameter settings determine how the node calculates the peak rating values for
chromatographic peaks. See “Chromatographic peak rating filter.”
(Contribution value for the individual parameter/Divided by the sum of all the contribution
values)×10.
Area Contribution Specifies the chromatographic peak area contribution to the
peak rating.
Default: 3
CV contribution Specifies the contribution of the coefficient of variation for
the peak areas across replicate samples to the peak rating. If
an analysis includes no replicates, the node sets the CV
contribution to 0 during processing.
Default: 10
FWHM to Base Specifies the FWHM to base contribution to the peak rating.
Contribution
Default: 5
Jaggedness Contribution Specifies the jaggedness contribution to the peak rating.
Default: 5
Modality Contribution Specifies the modality contribution to the peak rating.
Zig-Zag Index Contribution Specifies the zig-zag index contribution to the peak rating.
Default: 5
Table 81 describes the parameters for the Export Xcalibur Inclusion or Exclusion List node.
Table 81. Export to Xcalibur Inclusion/Exclusion List node parameters
Parameter Description
1. General Settings
File Name Specifies the file name for the inclusion/exclusion list.
Selected Instrument Bases the format of the inclusion/exclusion list on the selected
MS.
Default: False
4. LTQ Orbitrap Settings
Maximum Concurrent Specifies the maximum number of entries with overlapping time
Entries windows.
Default: 5
When the file name is not specified, the application uses the name
of the result file for the exported TXT or XLS files.
Default: False
Scripting Node
Use the Scripting Node to perform custom post-processing actions on the data in the result
tables.
Tip For information about using the Scripting node, go to the Resources page of the
following web site: https://mycompounddiscoverer.com/.
Use a colon to separate the table name from the table columns, a
comma to separate the table columns, and a semicolon to separate
tables.
Format:
Default: False
For details about each of the graphical views and result tables, see these chapters:
• Chapter 9, “Graphical views for a result file.”
• Chapter 10, “Descriptive information for the result tables.”
Note Result files are also known as analysis results, as you can open them from the
Analysis Results page of a Compound Discoverer study.
When you open a result file for the first time, you see a tabbed document with the default
layout in the application window. You can modify the layout and save these changes with the
result file. The next time you open the result file, it will open with your custom layout.
To open a result file from a previous version of the application, you must update the file to the
current version.
For instructions on how to open, close, and update result files, see these topics:
• Open result files created in the current version of the application
• Open result files created in previous versions of the application
• Update modes for legacy result files
• Close a result file
Tip You can also drag and drop result files (CDRESULT) from Windows Explorer into
the application window.
You can open as many result files as you want. To view a particular result file, click its tab.
1. From the menu bar, choose File > Open Result. In the Open dialog box, browse to the
appropriate folder, select the result file of interest, and click Open.
The File Update Required dialog box opens.
Figure 85. File Update Required dialog box
The application immediately processes the file without first creating a temporary file.
Errors that occur during the update process can corrupt the file.
IMPORTANT To prevent loss of data when not using the fail-safe mode to update
legacy result files, Thermo Fisher Scientific recommends that you manually store
copies of these files in another directory.
Note You can install multiple versions of the Compound Discoverer application on
the same data system computer.
• Clearing both check boxes turns off the fail-safe mode. This option takes the least amount
of processing time, but it risks the possibility of corrupting the original file and making it
unrecoverable.
The factory default layout for a result page includes the following items:
• A tab with the result file name
• For LC studies, the Chromatograms view on the top left is populated with XIC traces for
the component with the largest chromatographic peak area across the input files—that is,
the view is populated with the XIC traces for the compound listed in the first row of the
Compounds table or the Expected Compounds table. The view is zoomed to the start and
end points of the chromatographic peak.
• For LC studies, the Mass Spectrum view on the top right is populated with the MS1 scan
with the highest resolution and highest intensity related to the preferred ions across the
input files. The spectrum tree to the left includes the MS1 scans and the available
fragmentation scans within the following time range for a compound:
– Peak apex (RT) ± the peak’s full width at half maximum (FWHM)
–or–
– Start and end points of the chromatographic peak, as determined by the peak
detection algorithm
–
• A set of tabbed main tables below the two graphical views
• A collapsed area for the related tables below the main tables
Figure 86 shows a result page for a result file generated with an untargeted processing
workflow.
Figure 86. Default layout for a result page from an untargeted workflow (LC studies), numbered at
the top left from 1 to 3 and from the bottom left from 4 to 7
1 2 3
4 5 6 7
For more information, see “Modify the result page layout for ease of use.”
Tip For information about modifying the layout of the table columns and rows, see
Chapter 15, “Common operations for manipulating data tables.” This chapter describes
how to hide or display specific table columns by using the Field Chooser dialog box,
reorder or stack the table columns by using the mouse, and pin rows to the top of the
tables.
These topics describe some of the ways that you can change the layout of a tabbed result page:
• Float a result page view
• Enlarge a result page view
• Use the collapsible pane options for filtering, grouping, coloring, and discriminating by
• Show or hide result tables
Use the collapsible pane options for filtering, grouping, coloring, and
discriminating by
These four views on a result page include a collapsible pane of filtering, grouping, coloring, or
discriminating by options:
• Chromatograms view. See “Chromatograms view.”
• Trend Chart view. See “Trend Chart view.”
• Principal Component Analysis view. See “Principal Component Analysis view.”
• Partial Least Squares Discriminant Analysis. See “Partial Least Squares Discriminant
Analysis view.”
• Descriptive Statistics view. See “Descriptive Statistics view.”
Note If the analysis does not include samples with different study factor values or sample
types, the application cannot group the samples, and only the Files check box appears in
the Group By list.
Table 84 describes the effect of clearing and selecting the check boxes in the collapsible pane
at the left of a view.
Table 84. Effect of clearing or selecting the check boxes in the collapsible pane
Objective Selection
Remove or display data Use the check boxes under Filter By to remove or display data
points points by sample type, study factor value, or individual file.
By default, the application selects all the check boxes under Filter
By for the Chromatograms view.
By default, the application does not select the Blank sample type
or the Quality Control sample type check boxes for the statistics
views, and the Identification Only sample type does not appear
in the Sample Type list.
Visually distinguish data For the Chromatograms view, use the check boxes under Group
points by grouping them By to colorize the traces by group. The application duplicates the
check boxes in the Study Variables area of the Grouping and
Ratios page of the analysis.
For the Trend Chart view, use the check boxes under Group By
to change how the data points for the selected compound are
grouped across the input files.
Distinguishing data For the Principal Component Analysis view, use the Color By
point by color check boxes to distinguish the principal components by color.
For the Descriptive Statistics view, use the Color By check boxes
to visually group the box plots by color.
Discriminate by specific For the Principal Least Squares–Discriminant Analysis view, use
study variables the Discriminate By check boxes to select the study variables for
the supervised analysis.
Table 85 describes the Group By and Filter By options in the collapsible pane.
Table 85. Options in the collapsible left pane (Sheet 1 of 2)
Feature Description
(X/Y) The left integer is the number of samples that are selected under
Filter By (and that also contain the selected compound). The right
integer is the number of samples that contain the selected
compound.
ON/OFF toggle for ON—The check boxes are available.
the Filter By items
OFF—The check boxes are unavailable and the items are not
filtered out.
Default: Clear
Filter By Study Factor Select these check boxes to display data for one or more study
factors.
Filter By File Select these check boxes to display data from one or more of the
files.
For more information about using the options on the collapsible pane, see these topics:
• Filter the result data by the study variable values
• Filter the result data by input file
• Group the result data by the study variables or by individual files
1. If the collapsible pane is closed, click the icon, , in the upper-left corner of the view.
2. Under Filter By, click the expand icon to the left of the study variable name to open the
values list.
3. Clear the check boxes for the items that you want to hide or values that you want to
remove from the statistical calculations.
1. Under Filter By, click the expand icon to the left of File to open the File list.
2. Clear the check boxes for the files that you want to exclude from the display or the
statistical calculations.
Note By default, for the statistical views, the check box for the Blank sample type under
Filter By is clear.
1. To group the data by the study variables or by the individual files, select a row in the
active result table. Then, select one or more check boxes under Group By.
2. To view a color legend of the sample groups, right-click the Chromatograms view and
choose Display Options > Show Legend.
1. Open a result file. See “Open, close, and update result files.”
2. Click the Select Table Visibility icon, , to the left of the result table tabs.
The Select Visible Tables dialog box opens.
3. Select the check box that corresponds to the table that you want to show, or clear the
check box that corresponds to the table that you want to hide.
4. Click OK to accept the changes.
By default, the Tags column contains five tags. From left to right, the subcolumn headings for
the tags are labeled A, B, C, D, and E, and the tag colors are set to blue, red, yellow, green, and
purple. You can change the number of tags to 0 or any integer value from 1 to 15 by selecting
or clearing the check boxes in the Custom Tags Editor.
Default tags
3. (Optional) If you have created a stored custom tags file (.tags) and you want to edit it, do
the following:
a. Click Import and select the file that you want to edit.
b. In the Custom Tags Editor, select the number of tags that you want to display in the
Tags columns of the result tables in the current result file by selecting the check boxes.
4. To modify the colors and text labels for the tags, follow the instructions in Table 86.
You use the Color dialog box to modify the colors.
Figure 89. Color dialog box
Change the color of a tag 1. In the editor, click the color circle to the left of the
to a custom color. name box.
2. In the Color dialog box, click Define Custom Colors.
3. Define a custom color by using the color gradient or
entering the Red, Green, and Blue (RGB) values or the
Hue, Saturation, and Luminosity (HSL) values, and
then and click Add to Custom Colors.
4. Select the custom color and click OK.
5. To apply the custom settings in the Custom Tags Editor dialog box to the tags in the
current result file, click Apply.
The Custom Tags Editor dialog box remains open until you close it.
6. (Optional) To export the defined tags to an external file that you can use with any result
file, do the following:
a. Click Export.
The Save Custom Tags Editor Settings dialog box opens.
b. Name the file, select its storage location, and click Save.
Note For more information about the shortcut menus for result tables, see “Shortcut
menu commands for the result tables.”
Before you add tags to the result tables, define the names and colors of the tags. See “Define
custom tags by using the Custom Tags Editor.”
Table 87. Adding tags to and removing tags from items (Sheet 1 of 3)
Task Procedure
Add a tag to an item in the Click the circle for the tag in the Tags column for the
current result table. item.
Table 87. Adding tags to and removing tags from items (Sheet 2 of 3)
Task Procedure
Remove specific tags from 1. Select the items (rows) of interest by using the
selected items in the current SHIFT key or the CTRL key.
result table.
2. Right-click a selected table row and choose Set Tags.
The Set Tags dialog box opens.
3. Double-click the tags that you want to remove.
The center of the selected tags turn gray.
4. Select the Selected Items option and the In This
Table option.
The following figure shows the settings for removing
tags 1 and 5 from the selected items (rows) in the
current result table.
5. Click Apply.
Remove specific tags from 1. Select the items (rows) of interest by using the
selected items in the current SHIFT key or the CTRL key.
result table and all the subtables
2. Right-click a selected table row and choose Set Tags.
for the selected items.
The Set Tags dialog box opens.
3. Select the tags that you want to remove.
The color of the selected tags turns to gray.
4. Select the Selected Items option and the In This and
All Subtables option.
5. Click Apply.
Table 87. Adding tags to and removing tags from items (Sheet 3 of 3)
Task Procedure
Remove specific tags from all 1. Right-click a selected table row and choose Set Tags.
items in the current result table.
The Set Tags dialog box opens.
2. Select the tags that you want to remove.
The color of the selected tags turns to gray.
3. Select the All Items option and the In This Table
option.
4. Click Apply.
Remove specific tags from all 1. Right-click a selected table row and choose Set Tags.
items in the current result table
and all items in the subtables for The Set Tags dialog box opens.
the current result table. 2. Select the tags that you want to remove.
The color of the selected tags turns to gray.
3. Select the All Items option and the In This and all
Subtables option.
4. Click Apply.
The Custom Tags Editor is not available until you open a result file.
Table 88. Importing or exporting custom tags
Task Procedure
Export custom tags to a TAGS 1. Open the result file that includes the definitions that
file you want to export. Or, open any result file.
2. From the application menu bar, choose View >
Custom Tags Editor.
The Custom Tags Editor opens.
3. Define the tags if you have not already done so.
4. Click Export.
The Save Custom Tags Editor Settings dialog box
opens.
5. Name the file, select where you want to store the file,
and click Save.
Import custom tags from a 1. Open the result file of interest.
TAGS file
2. From the application menu bar, choose View >
Custom Tags Editor.
The Custom Tags Editor opens.
3. Click Import.
The Load Custom Tags Editor Settings dialog box
opens.
4. Browse to and select the TAGS file of interest. Then,
click Open.
In addition to the factory default layout, the application comes with the following custom
layouts for LC studies:
• Stable Isotope Labeling—Opens the Isotopologues Distribution Chart, the Trend Chart,
and the Metabolika Pathways view in the bottom right of the result page. In the
Compounds table, hides the following columns: #Metabolika Pathways, Avg. Exchange,
FISh Coverage, and Metabolika Pathways. Opens the Labeled Compounds per File table
that is related to the currently selected compound in the main Compounds table.
• Statistics—Opens the Differential Analysis and Trend Chart views on the bottom left and
the Principal Component Analysis, Partial Least Squares Discriminant Analysis, and
Hierarchical Cluster Analysis views on the bottom right. Closes the Chromatograms and
Mass Spectrum views. Closes the search and pathway result tables if they are visible.
For details about working with layouts, see the following topics:
• Save the current layout of a result file
• Reset the layout to the factory defaults
• Create a custom layout
• Apply a layout
• Manage the layouts
With the result file selected as the active page, do one of the following:
• In the toolbar, click the Save the Currently Active Item icon, .
• From the menu bar, choose File > Save.
With the result file selected as the active page, choose Window > Reset Layout from the
menu bar.
The application closes the result file, and then reopens the result file to the factory default
layout.
Apply a layout
A layout determines which of the available result tables and graphical views appear as well as
where they appear in the application window when you open a result file.
Y To apply a layout
1. With the result file selected as the active page, choose Window > Apply Layout from the
menu bar.
2. Select a layout from the list or use the hot keys.
Use the Compound Annotation Editor dialog box to edit the annotations for compounds of
interest in the compounds table (Compounds table or Expected Compounds table) and the
Structure Proposals table.
1. Double-click the row of interest in the compounds table or related Structure Proposals
table.
The Compound Annotation Editor dialog box opens.
The application automatically populates the Molecular Weight/Error in Da and
Molecular Weight to Fit boxes. If the formula, structure, and name are available, the
application also populates these fields.
Figure 92. Compound Annotation Editor dialog box
Open
(structure file)
Save
(structure file)
Drawing area
The application
populates these
boxes as you draw
the structure.
Table 89 describes the parameters in the Compound Annotation Editor dialog box.
Table 89. Compound Annotation Editor parameters
Parameter Description
Description page
Formula Displays the elemental formula of the structure in the drawing
area or the assigned formula.
Formula to Fit Displays the elemental formula of the component found by the
Find Expected Compounds node in a targeted analysis for an LC
study.
Molecular Displays the molecular weight (MW) of the structure in the
Weight/Error in Da drawing area and the difference between the structure’s calculated
MW and the MW for the selected compound—that is, the MW
in the Molecular Weight to Fit box.
Molecular Weight to Displays the molecular weight (based on the formula) of the
Fit compound selected in the Compounds table or the Expected
Compounds table.
Name Displays the name of the compound from an online or local
database search.
For details about adding or deleting proposed structures for a compound, see the following
topics:
• Add structure proposals
• Delete structure proposals
1. Select the compound of interest in the main compounds table (Expected Compounds
table or Compounds table).
2. Click Show Related Tables at the bottom left of the result page.
3. Do one of the following:
• Click the Structure Proposals tab. Then, right-click a row in the Structured Proposals
table and choose Structure Proposals > Use As Compound Annotation.
• Open any of the related results tables, right-click the entry of interest and choose Use
as Compound Annotation.
The FISh scoring algorithm uses the structure in the structure column of the result table or
the drawing area of the Compound Annotation Editor dialog box. The selected entry in the
compounds table must have MS2 data (check the indicator in the MS2 column).
For details about running the FISh scoring algorithm, see the following topics:
• Apply FISh scoring by using a shortcut menu command
• Apply FISh Scoring from the Compound Annotation Editor dialog box
• Specify the FISh scoring parameter settings
3. Set up the parameters. See “Specify the FISh scoring parameter settings.”
4. Click OK.
In the FISh Scoring Queue view to the left of the table, one job appears for each selected
entry. For each entry that includes a structure, the application runs the FISh scoring
algorithm. The run time increases as the complexity of the structure increases. When an
entry does not include a structure, the job ends in failure and is highlighted with a red
border.
Apply FISh Scoring from the Compound Annotation Editor dialog box
Y To apply FISh Scoring from the Compound Annotation Editor dialog box
1. To open the FISh Scoring page of the Compound Annotations dialog box, right-click an
entry in the compounds table and choose Edit Compound Annotation.
2. Make sure that the drawing area on the Description page includes a structure.
3. Click the FISh Scoring tab.
4. Select the Apply FISh Scoring check box.
Figure 94. FISh Scoring page of the Compound Annotation Editor dialog box
5. Specify the parameter settings. See “Specify the FISh scoring parameter settings.”
6. Click Save.
1. In the Specify FISh Scoring Settings dialog box or on the FISh Scoring page of the
Compound Annotation Editor dialog box, make the following selections:
• To annotate the full spectrum tree, select the Annotate Full Spectrum Tree check
box.
• To use the general fragmentation rules, select the Use General Rules check box.
• To use the fragmentation libraries, select the Use Fragmentation Libraries check
box.
Tip If time allows, select the Use Fragmentation Libraries check box. Using the
fragmentation libraries provides significantly more structural information;
however, it can also add a significant amount of processing time.
• To allow aromatic cleavage as one of the fragmentation steps, select the Allow
Aromatic Cleavage check box.
• In the Max. Depth list, select the maximum number of steps allowed in the
fragmentation pathway.
2. Use the default values or type new values in the following boxes:
• For the FTMS scans, type a value in the High Accuracy Mass Tolerance box and
select the appropriate units.
• For the ITMS scans, type a value in the Low Accuracy Mass Tolerance box and select
the appropriate units.
• In the S/N Threshold box, type a value for the FTMS scans.
Table 90 describes the parameter settings for the FISh scoring algorithm.
Table 90. FISh scoring parameters
Parameter Description
Annotate Full Annotates the full spectrum tree (MS/MS, MS3, and so on) in the
Spectrum Tree Mass Spectrum view.
Use General Rules Uses the general fragmentation rules.
Use Fragmentation Uses the fragmentation libraries.
Libraries
Allow Aromatic Allows aromatic cleavage as one of the reaction steps.
Cleavage
Max. Depth Specifies the maximum number of fragmentation reactions to
consider in the fragmentation pathway.
Note The filters for each table are independent of other table filters. For example, a
retention time filter for the Merged Features table does not affect the entries in the other
tables that include a retention time column. In addition, filtering only removes rows from
the display; it does not update or change any of the calculated values.
To set up and apply the data reduction filters, see these topics:
• Set up, apply, and save filter sets
• Create a result filter with an AND logical conjunction
• Create a result filter with an OR logical conjunction
• Create a result filter with both of the logical conjunctions
• Load a saved filter set
• Result Filters view parameters
Value
Table Operator
column
Property
Specifies a column in the selected result table. The dropdown list contains all the columns in
the selected result table.
After you select the result column to filter, you select the operator to filter by. Some filters are
single operators and some filters require multiple operators. The available operators depend
on the type of data that you are filtering: numeric values, alphanumeric text, indicators, or
conditions.
When you set up filters with multiple operators, not all combinations of the operators make
sense. For example, for the Gap Status column of a main compounds table, you can set up the
following filter: Gap status > Has No Value > In Every File Holding a Value. This filter hides
every compound in the table.
Value
Filters for numeric columns with numeric values— for example, the RT column—end with a
text box for entering the numeric value or a selection list of the input files.
Filters for columns with alphanumeric strings—for example, the Name column—end with a
text box for entering the string.
Note To change which tables are visible, use the Select Visible Tables dialog box.
4. In the Result Filters view, select the table of interest in the left pane.
The table name appears in the right pane. The following selection tree appears below the
table name.
Table Name
AND Add group
Condition list
Selected
property
Tip When you select the Is Equal To operator, type a numeric value to a
minimum precision of two decimal places or a minimum precision that is equal
to the number of decimal places that are displayed in the column, whichever is
greater. For example, for any of the Area columns, type a numeric value with two
decimal places, even though the Area column displays a numeric value with no
decimal places.
Operator list
Value box
Selected
property
This figure shows a set of filter conditions that limits the displayed results to
chromatographic peaks from 2 to 7 minutes.
Condition list
Value box
• For a status property, such as Gap Status, select one or more conditions and values as
applicable. For a multiple condition filter, not all combinations make sense. Clicking
Remove at the end of a filter removes the individual filter.
This figure shows a filter for the Gap Status column.
The indicator icon to the left of the table name turns from green to yellow, and the
button displays OFF.
• To clear the filters for a specific table, select the table in the left pane of the Result
Filters view. Then, click Clear.
• To clear all the filters in a filter set, click Clear All.
Note When you use the AND logical conjunction, all of the connected property
conditions must be True.
1. In the Result Filter view, keep the AND logical conjunction as the first item in the filter
tree.
2. For each property that you want to conjoin with the AND conjunction, click Add
Property, select a property from the list, and set the property boundaries.
This figure shows a filter set that uses three properties conjoined with an AND
conjunction. When you apply this filter set to the data in a compounds table, only those
rows that meet all three conditions remain; that is, you see only those detected
chromatographic peaks with a retention time greater than 3.00 minutes, with an
integrated peak area greater than 1 000 000, and where the compound has
data-dependent scans for the preferred ion.
Three properties
connected to an
AND conjunction
Note When you use the OR logical conjunction, only one of the connected property
conditions must be True.
1. Select the OR logical conjunction as the first item in the filter tree.
2. For each property that you want to conjoin with the OR conjunction, click Add
Property, select a property from the list, and set up the property conditions.
This figure shows a filter set that uses two properties conjoined with an OR conjunction.
When you apply this filter set to the data in the result table, those rows that meet at least
one of the conditions remain—that is, you see the rows where the FISh coverage value is
greater than or equal to 30 or where the FISh coverage column has no reported value.
Two properties
connected to an
OR conjunction
Y To create a filter set using both the AND and the OR logical conjunctions
1. Keep the AND logical conjunction as the first item in the filter tree.
2. To conjoin two properties with the OR conjunction, do the following:
a. Click Add Property and select OR from the dropdown list.
b. Set up the properties that you want to conjoin with the OR conjunction.
This example shows the condition where the FISh Coverage must be greater than or
equal to 30 or have no value. (For LC studies, the FISh scoring algorithm does not
calculate scores for DIA scans.)
3. For each property that you want to conjoin with the AND conjunction, click the Add
Property list that connects to the AND conjunction, select a property from the list, and
set up the property boundaries.
Figure 96 shows a result filter that keeps chromatographic peaks that meet the following
conditions:
• A FISh Coverage score that meets one of these conditions:
– A FISh Coverage score that is greater than or equal to 30
–or–
– No FISh Coverage score
–and–
• A Retention Time from 4 to 7 minutes
Figure 96. Filter that uses both AND and OR conjunctions
Table 91 describes the panes, buttons, icons, and check box in the Result Filters view.
Table 91. Result Filters panes and buttons (Sheet 1 of 2)
Feature Description
Left pane Lists the main tables included in the current result file. An On/Off
button and an indicator icon appear to the left of the table name.
ON/OFF button Use to turn on or turn off the conditions for the associated main
table.
Indicator icons ( ) Gray—Indicates that the table is unfiltered.
Displays the filter settings for the selected table. You can modify these settings as described
in “Set up, apply, and save filter sets.”
AND or OR Specifies the logical connection between properties or groups.
Add Group Adds a group.
Add Property Adds a property.
1. Open the result file of interest. See “Open, close, and update result files.”
2. From the menu bar, choose View > Result Summary.
The Summaries view includes these five pages: Workflow, Processing Messages, Filter,
Study, and Grouping & Ratios.
Workflow summary
To view the processing workflow that the analysis used to create the active result file, open the
Workflow page. This page lists the name of the processing workflow, the creation date for the
result file (.cdResult), the raw data files (.raw) that were processed to create the result file, and
the parameter settings for the workflow nodes.
For information about creating a processing workflow, see Chapter 6, “Create and edit
processing workflows.”
Filter summary
To view a summary of the filters used to reduce the data in the results window, open the Filter
page. This page lists the name of the latest filter set (FILTERSET file type) that you applied to
the result file and the filter conditions in the filter set. Use the Result Filters view to create
filter sets. See “Filter the data for data reduction.”
Study summary
To view a summary of the study settings for the input files that make up the result file, open
the Study page. This page lists the following:
• Name and creation date of the study
• Directory location of the study
• Study factors and their values
• Sample names (Xcalibur RAW files) and their directory location
Figure 98 shows an example summary of the sample groups and ratios for an analysis. The
summary lists the selected study variables, sample groups, and ratios in order from top to
bottom.
Note Shortcut menus are also known as context menus or right-click menus.
For information about the shortcut menus for the result tables on a result page, see these
tables:
• Table 92 describes the commands in the shortcut menus for the result tables in either type
of study.
• Table 93 describes the shortcut menu commands that are specific to LC studies.
Table 92. Shortcut menu commands for the result tables (Sheet 1 of 4)
Command Description
All main and related result tables
Copy With Headers Copies the current table row and its associated column
headings to the Clipboard.
Copy Copies the current table row to the Clipboard. Does not
copy the column headings.
You can copy the structures in the following tables to the Clipboard:
• For LC studies only: Compounds and Expected Compounds
• For either study type: BioCyc Results, mzCloud Results, mzVault Results, ChemSpider
Results, Mass List Search Results, Metabolika Results, and Structure Proposals
Copy Structure > As MOL Copies the selected structure in MOL format to the
Clipboard.
Copy Structure > As InCHI Copies the selected structure in InCHI format to the
Clipboard.
Copy Structure > As InCHI Copies the selected structure in InCHI Key format to the
Key Clipboard.
All related tables with a Structure column
Use as Compound Uses the annotations in the selected entry for the related
Annotation compound in the main compounds table (Expected
Compounds table or Compounds table).
Table 92. Shortcut menu commands for the result tables (Sheet 2 of 4)
Command Description
Add to Structure Proposals Add the selected entry to the Structure Proposals table for the
related compound in the main compounds table (Expected
Compounds table or Compounds table).
Add to Structure Proposals Add the selected entry to the Structure Proposals table for the
and Apply FISh Scoring related compound in the main compounds table (Expected
Compounds table or Compounds table) and runs the FISh
scoring algorithm.
All result tables with a Tags column
Table 92. Shortcut menu commands for the result tables (Sheet 3 of 4)
Command Description
Clear Compound Removes the annotation, which includes the structure, name,
Annotation formula, annotation source, and FISh coverage score if
applied.
Apply FISh Scoring Applies the FISh scoring algorithm to the selected entries and
populates the FISh Coverage column with the calculated
score.
Compounds table only
Molecular Networks > Send Opens the molecular networking viewer in a web browser.
to Viewer See “Modify the simulation in the molecular networks
viewer.”
Table 92. Shortcut menu commands for the result tables (Sheet 4 of 4)
Command Description
Structure Proposals > Apply Opens a dialog box where you can specify the settings for the
FISh Scoring to Selected FISh scoring algorithm and submit the information in the
selected row. See “Apply FISh scoring.”
You can export the contents of any of the result tables to a spreadsheet file. When you open
the Export to Excel dialog box from any of the main tables, the Level 1 selection in the dialog
box defaults to the active result table. When you open the Export to Excel dialog box from
any of the related tables, the Level 1 selection defaults to the main compounds table, which
depends on the processing workflow.
For LC studies, the main compounds table is the Compounds table when the processing
workflow includes the Detect Compounds node or the Detect Compounds node and the
Expected Compounds node. When the processing workflow includes only the Expected
Compounds node, the compounds table is the Expected Compounds table.
1. Right-click the result table that you want to export and choose Export > As Excel.
The Export to Excel dialog box opens.
The default storage path is C:\Users\Public\Documents\result file name.
The default selection in the Level 1 box depends on where you opened the dialog box:
• If you opened the dialog box from a main table, the default selection in the Level 1
box is the name of the active result table.
• If you opened the dialog box from a related table, the selection in the Level 1 box is a
main compounds table.
Figure 99. Export to Excel dialog box with the Compounds table selected
2. Make the appropriate selections in the Export to Excel dialog box as follows:
If you open the Export to Excel dialog box from a main table, the Level 1 selection is the
table where you opened the dialog box. If you open the Export to Excel dialog box from a
related table, the Level 1 selection is a main compounds table.
You can modify the export options in any order.
Table 94. Export options (Sheet 1 of 2)
Export option Procedure
Change the directory folder, Click the browse icon, , next to the Path box. Then,
the file name, or both. select a directory folder and rename the file as
appropriate, select the spreadsheet type (XLS or
Microsoft Excel File), and click Save.
Select the result tables to If necessary, select a different result table from the Level
export. 1 list. Then, as appropriate, select an available table from
the Level 2 list, followed by an available table from the
Level 3 list.
1. Right-click the result table that you want to export and choose Export > As Plain Text.
The Export to CSV File dialog box opens. The File Name box displays the name of the
selected result table.
2. Select the folder where you want to store the file, name the file as necessary, and click
Save.
The text file (CSV) appears in the selected folder.
1. Right-click the Compounds table or the Expected Compounds table and choose Export
> As Xcalibur Inclusion/Exclusion List.
The Export Xcalibur Inclusion/Exclusion List dialog box opens.
Figure 100. Export Xcalibur Inclusion/Exclusion List dialog box
Table 95 describes the parameters in the Export to Xcalibur Exclusion List dialog box.
Table 95. Export to Xcalibur Inclusion/Exclusion List dialog box parameters (Sheet 1 of 3)
Parameter Description
Path Specifies the file name and directory path of the text file that
contains the inclusion/exclusion list for your Xcalibur instrument
method.
Table 95. Export to Xcalibur Inclusion/Exclusion List dialog box parameters (Sheet 2 of 3)
Parameter Description
Options
IMPORTANT An Orbitrap Fusion mass spectrometer accepts a retention time range of
0.01 to 999 minutes.
Make sure that the retention time window for each compound falls within the retention
time range of the instrument method.
• Expected RT – Left RT Tolerance > Minimum retention time for the instrument
method
• Expected RT + Right RT Tolerance < Maximum retention time for the instrument
method
Left RT Tolerance Specifies the minimum start time for the chromatographic peak. If
[min] the Expected RT minus the Left RT Tolerance setting is less than
zero, the application exports a value of zero.
Default: 1
Range: 0.001 to 1000
Right RT Tolerance Specifies the maximum stop time the chromatographic peak.
[min]
There is no error checking for the calculated maximum retention
time.
Default: 1
Range: 0.001 to 1000
Checked Items Only Specifies that the application exports only the selected compounds
to the named text file.
Default: Clear
Include Isotopic Peaks Adds an entry for each isotopic spectral peak.
LTQ Orbitrap Options
Mass Precision Specifies the mass precision.
(Decimals)
For the LTQ Orbitrap, the mass precision of the exported data
must match the required mass precision for your Xcalibur
instrument methods. You specify the required mass precision for
Xcalibur instrument methods in the Instrument Configuration
dialog box of the Foundation platform.
Default: 5; range: 0 to 5
Table 95. Export to Xcalibur Inclusion/Exclusion List dialog box parameters (Sheet 3 of 3)
Parameter Description
Max. Concurrent Specifies the maximum number of entries with overlapping time
Entries windows.
Default: Clear
Instrument
LTQ Orbitrap Exports the list in the appropriate format for the LTQ Orbitrap
instrument control software.
Q Exactive Exports the list in the appropriate format for the Q Exactive
instrument control software.
1. Right-click the Compounds table and choose Export > As TraceFinder List.
The Export to TraceFinder dialog box opens. The default folder is either
drive:\Users\Public\Documents or the last folder that you selected. The default file name
is the name of the active result file.
Figure 101. Export to TraceFinder dialog box
Tip The application comes with an empty library named Custom mzVault Library.db. For
information about creating a new spectral library, see “Spectral Libraries view.”
1. Open a result file. See “Open, close, and update result files.”
2. Open one of these tables by clicking its tab—Compounds table or Expected Compounds
table.
3. Right-click a compound in the table and choose Export > Add Compound to Existing
mzVault Library.
The Export to mzVault Library dialog box opens. Its Spectra view displays the available
fragmentation spectra for the selected compound.
Figure 102. Export to mzVault Library dialog box populated with a Compounds table compound
4. Do the following:
• In the spectrum tree, select the check boxes for the spectra that you want to add to
the compound entry.
• In the mzVault Library area, select the existing mzVault library from the Selected
Library list.
1. Open a result file. See “Open, close, and update result files.”
2. Open one of these tables by clicking its tab—Compounds table or Expected Compounds
table.
3. (Optional) Filter the table to display only the compounds of interest or select the check
boxes for the compounds of interest.
4. Right-click the table and choose Export > As mzVault Library.
The New mzVault Library dialog box opens. By default, the application populates the
Library Name box with the name of the opened result file.
5. Do the following:
• Type a new library name or keep the default name.
• In the Options area, select whether to exclude compounds without a name, export
only the compounds with selected check boxes, or both.
6. Click Export.
For details about exporting compounds to a mass list, see the appropriate topic:
• Export compounds from the Compounds table to a new mass list
• Export compounds from the Compounds table to an existing mass list
5. Do the following:
• In the dropdown Mass List Name list, select the mass list where you want to add the
selected compounds.
• (Optional) To add the retention time information to the mass list, select the Export
Retention Time check box.
6. To avoid exporting named compounds that are already in the mass list, do one of the
following:
• To exclude a named compound only if its mass and retention time match the
duplicate compound in the mass list within specified tolerances, make sure that the
Do Not Add Duplicate Names check box is not selected. Then, in the Entries
Matching area, enter the tolerances that you want the application to use to exclude
duplicate named compounds from being exported to the mass list.
Note If you do not want to exclude a named compound unless both its mass and
retention time are an exact match to the named compound in the mass list, set
both tolerance values to 0.
• To exclude any named compound with a name that matches that of a named
compound in the existing mass list, select the Do Not Add Duplicate Names check
box.
For information about the graphical views that are available when a result page is active in the
application window, see “Graphical views for a result file.”
Table 96 lists the shortcut menu commands for copying data to the Clipboard or an external
file.
Table 96. Commands for copying an image of a graphical view (Sheet 1 of 2)
Copy the image Copy the data points to the Copy the data to an Copy the data to a
Graphical view
to the Clipboard Clipboard image file TEXT or CSV file
Chromatograms view Copy > Image Copy > Points Export > Image As Export > Points As
Mass Spectrum view Copy > Image Copy > Points—Copies the Export > Image As Export > Points As
scan label and the m/z and
intensity values for annotated
centroids to the Clipboard.
Copy > Raw Points—Copies
the m/z and intensity values
for all centroids to the
Clipboard. Does not copy the
scan label.
Scatter Chart Copy > Image Copy > Data Export > Image As Export > Data As
Histogram Chart Copy > Image Copy > Data Export > Image As Export > Data As
Bar Chart Copy > Image Copy > Data Export > Image As Export > Data As
Pie Chart Copy > Image Copy > Data Export > Image As Export > Data As
Tip You can copy structures without opening the Structure column. But opening
the Structures column lets you confirm that the entry you want to copy has a
structure annotation.
4. Right-click the entry of interest and choose Copy Structure > As Mol, As InChi, or As
InchIKey.
Figure 105. Shortcut menu for a result table with a Structure column
5. Open the application where you want to paste the structure and press CTRL+V.
Figure 106. ChemSpider search box with pasted InChi string
Note Use the Result Charts view to create histograms, bar charts, pie charts, and scatter
plots of the result data.
For details about the Result Filters view, see “Filter the data for data reduction.” For details
about viewing a summary of the analysis parameters for a result file, see “View the result
summaries.”
Chromatograms view
For LC studies, when you initially open a result file, the Chromatograms view displays the
XIC traces for the compound in the first row of the Compounds table or the Expected
Compounds table. By default, the display zooms in on the detected peaks for the selected
rows.
The Chromatograms view consists of a collapsible pane on the left and the graphical view on
the right. Right-clicking the graphical view opens a shortcut menu (see Table 98).
View a chromatogram
In an opened result file, you can display chromatogram traces (a plot of intensity versus time)
by selecting a row in any of these result tables:
• Untargeted LC/MS workflow
– Compounds
– Compounds per File
– Features
• Targeted LC/MS workflow
– Expected Compounds
– Expected Features per File
– Expected Compounds per File
– Merged Features
– Expected Formulas
• General LC/MS traces
– FISh Trace Fragments
– Specialized Traces: TIC, BPC, XIC, isotope pattern, and FISh
– Manual Peaks
Y To view a chromatogram
Note When you select a row for a filtered study file, the Chromatograms view does
not display a trace. For example, if the Blank check box is clear under Filter By
Sample Type and you select a row for a Blank sample, the Chromatograms view
remains empty. By default, the Blank Sample Type check box is clear until you select
it.
• Click the Compounds per File tab or the Expected Compounds per File tab and
select a row.
The Chromatograms view displays the integrated chromatographic peak for the
selected table row (MW × RT dimensions). Each trace is a composite of its related
adducts, and the peak area for a compound is the sum of the areas for its related
adducts (parent area for the adducts listed in the Expected Features table or the
Features table).
• Click the FISh Trace Fragments tab and select a row.
The Chromatograms view displays a trace for the structure displayed in the selected
table row. The FISh Trace Fragments table appears in the main table set when you
select True for Individual Traces in the Create FISh Trace node of a processing
workflow.
• Click the Specialized Traces tab and select a row.
Note The Specialized Traces table contains traces generated by these nodes:
Create Mass Trace, Create Analog Trace, Create Pattern Trace, and Create FISh
Trace.
• Open the related tables, click the Filled Gaps tab, and select a row.
Tip By default, the Filled Gaps table is hidden. To display this table, open the
Select Visible Tables dialog box, select the Filled Gaps check box, and click OK.
For more information, see “Filled Gaps table.”
4. To determine the origin of a trace in a result file that includes multiple input files,
right-click the Chromatograms view and choose Display Options > Show Legend.
5. To decrease or increase the number of legends displayed, right-click the Chromatograms
view and choose Display Options > Legend Size > #Rows, where # is an integer value
from 1 to 10.
In Figure 107, the vertical red line indicates the peak apex of an integrated
chromatographic peak. The triangle below the retention time label indicates the
corresponding data point in the XIC trace.
Figure 107. Chromatograms view showing the shortcut menu (LC/MS data)
Red line indicating the apex of the Triangle indicating the data
integrated chromatographic peak point that corresponds to the
(RT in compounds table) retention time label in the plot
When the number of legend lines becomes too large for the available display space, the
application displays an empty view with the following text:
Not enough space for drawing the chart properly.
Note The size of the Chromatograms view limits the number of plots the view can
display. When the application can no longer draw the plots properly, the following
message appears:
Not enough space for drawing plot properly
To display more plots, you can resize the view or drag the view to a second monitor
and expand the view to fill the monitor.
In a Chromatograms view with more than one plot, a light blue bar on the left border
highlights the active plot (see Figure 108). If you right-click the Chromatograms view and
choose Plots > Remove Plot, the application removes the active plot.
Figure 108. Chromatograms view with two plots
Note Manual Peak Integration is available only for specialized traces in LC studies.
Tip Take care to avoid clicking an editable column in the result table, as doing so
undoes the row selection and sets the focus to the table cell.
Under Filter By, clear the check box for the study variable value.
Note These workflow nodes generate specialized traces: Create Mass Trace, Create
Analog Trace, Create Pattern Trace, and Create FISh Trace.
6. To add the manual peak to the Manual Peaks table, do one of the following:
• Click Apply.
• On the keyboard, press A.
If the result file did not already contain a Manual Peaks table, the new table appears in the
main table tab set. The Manual Peaks table contains a row for the new manual peak.
Table 98. Shortcut menu commands for the Chromatograms view (Sheet 2 of 5)
Command Description
Copy > Image Copies an image of the Chromatograms view (including the legend) to the Clipboard.
You can paste the image into a Microsoft Office document as a raster image or into a
vector-drawing program as a vector image.
Copy > Points Copies the data as a two-column list of data points and copies the scan header. The first
column lists the retention time and the second column lists either the relative intensity or
the counts.
Export > Image As Opens the Save As dialog box where you can specify a file name and save the contents of the
Chromatograms view as one of these selectable image formats: EMF, PNG, GIF, JPG, TIFF,
or BMP. The EMF format is a vector image.
Exports > Points As Opens the Save As dialog box where you can specify a file name and save the contents of the
Chromatograms view as a text file. The default file name is Chart.txt.
When the chromatogram is a plot of relative intensity versus retention time, the application
saves the data points as a two-column list. The first column lists the retention time and the
second column lists the relative intensity (%). When the chromatogram is a plot of area (in
counts) versus retention time, the second column lists the area for the peaks.
Plots > Add Plot Adds an empty, active plot to the Chromatograms view. Only the screen size limits the
maximum number of displayed plots. When you reach the screen’s limit, the
Chromatograms view appears to be empty and the following message appears:
Not enough space for drawing chart properly.
Plots > Remove Plot Removes the active plot, which has a gray border.
Table 98. Shortcut menu commands for the Chromatograms view (Sheet 3 of 5)
Command Description
Plots > Freeze Content Keeps the chromatogram of the currently selected row in the view when you select another
row (in the current table or another result table). The application uses dashes to distinguish
the frozen chromatogram trace.
Frozen peak with dashes
Plots > Clear Frozen Clears the frozen chromatogram from the view.
Content
Plots > Distribute Updates all of the plots simultaneously as you select different table rows. When Plots >
Selection to All Distribute Selection to All is not enabled, only the active plot updates.
Display Options > Displays a pop-up box with the following information, from top to bottom, when you place
Show Tooltips the cursor, +, over a chromatographic peak:
• Parent compound and any applicable transformations for an expected compound
• Chemical formula for an expected compound
• Adduct ion (feature)
• m/z value (feature)
• Molecular weight (compound or expected compound)
• Selected retention time, in minutes
• Intensity (height), in counts, of the selected point on the chromatogram trace
• File name of the input file
Table 98. Shortcut menu commands for the Chromatograms view (Sheet 4 of 5)
Command Description
Chromatogram with a tooltip for an entry in the Compounds per File table.
Chromatogram with a tooltip for an entry in the GC EI Compounds per File table
Table 98. Shortcut menu commands for the Chromatograms view (Sheet 5 of 5)
Command Description
Display Options > Displays the y-axis scale as relative intensity (0 to 100%). The legend changes to Intensity
Relative Intensity [%].
The default y-axis scale is an absolute scale; the y-axis legend is intensity [counts].
Display Options > Redraws only the chromatographic peaks of the displayed traces. Does not redraw the
Crop to Detected Peaks baseline portions of the traces as you select different table rows.
Manual Peak Use to add a manual peak to a specialized trace. Adding a manual peak adds the Manual
Integration Peaks table to a result file. See “Manually integrate chromatographic peaks.”
Selecting a trace in the Specialized Traces table makes this command available.
Table 99 describes the traces that you can display in the Chromatograms view.
Table 99. Chromatograms view traces (Sheet 1 of 2)
Trace type Description
UV Displays a chromatogram created from the UV signal from a
UV-Vis detector or the analog channel of a PDA detector.
Analog Displays a trace of response versus time.
Raw data files can contain analog data from a device that is
hard-wired to the analog channels of a Thermo Scientific mass
spectrometer.
PDA total scan Displays a chromatogram of the total absorbance for the entire
scan wavelength range for each time point.
PDA spectrum Displays a chromatogram of the highest absorbance reading in the
maximum wavelength range for each time point.
PDA wavelength range Displays a chromatogram of the total absorbance for the specified
wavelength range for each time point.
Base peak Displays a chromatogram of the most intense mass spectral peak
chromatogram (BPC) in the specified mass range for each time point.
Total ion Displays a chromatogram of the total intensity from all the mass
chromatogram (TIC) spectral peaks in the specified mass range for each time point.
The XIC trace is made up of the mass spectral peaks that match
the specified mass value within the specified mass tolerance.
Pattern trace Displays a TIC trace of the summed intensities of the mass
spectral peaks that match a specified pattern for each time point.
FISh trace Displays a TIC trace of the summed intensities of the mass
spectral peaks in a fragmentation scan (MS/MS or MS3) that
match the predicted fragments of the selected library compound
and its transformation products for each time point.
LC studies
• Compounds table • Features table
• Compounds per File table • Expected Features table
• Expected Compounds table • mzCloud Results table
• Expected Compounds per File table • Structure Proposals table
The Mass Spectrum view displays the matching spectral tree and a zoomed-in view of
the full MS scan.
b. Select the scan of interest from the spectral tree.
Y To review the matching fragment structures predicted by the FISh scoring algorithm
1. Open a result file generated by a processing workflow that included the Expected
Compounds node and the FISh Scoring node. Then, select a compound in the Expected
Compounds table.
2. (Optional) Enlarge the Mass Spectrum view.
3. In the spectral tree, select an MS2 or higher scan.
4. Review the color-coded mass spectral peaks, the theoretical fragment structures, and the
transformations for the shifted mass spectral peaks.
The FISh Scoring node annotates centroids that match the m/z value of a theoretical
fragment ion with its theoretical structure and color-codes the centroids in a
fragmentation scan as follows.
Color Meaning
( ) Green Direct match—Matches the m/z value of a theoretical fragment ion.
( ) Blue Shifted match—Matches the m/z value of a theoretical fragment ion
with at least one transformation applied.
Y To run the FISh scoring algorithm on an untargeted compound and view the structure
annotations in the Mass Spectrum view
1. Open a result file from an untargeted analysis that returns structures.
For LC studies, the processing workflow must include the Detect Compounds node,
Group Compounds node, Predict Compositions node, Assign Compound Annotations
node, and any of the identification or pathway mapping nodes that return structures. In
addition, the input files must include fragmentation scans.
2. Display the FISh Coverage and Structure columns as follows:
a. Open the Field Chooser dialog box by clicking the icon, , in the upper-left corner
of the Compounds table.
b. Select the FISh Coverage and Structure check boxes, and then close the Field Chooser
dialog box.
3. Right-click a compound of interest that has a structure and fragmentation spectra and
choose Apply FISh Scoring.
The Specify FISh Scoring Settings dialog box opens.
4. Specify the fragmentation settings and click OK.
The FISh Scoring Queue opens, and the application starts processing the MS2 scans
against the structure. When processing is completed, the calculated FISh coverage score
appears in the FISh Coverage column.
5. If the Mass Spectrum view is closed, open it by choosing View > Mass Spectrum from
the menu bar.
6. In the spectral tree at the left of the Mass Spectrum view, select an MS2 scan.
The annotated spectrum appears in the Mass Spectrum view. The spectrum header
displays the number of centroids that the FISh scoring algorithm matched to theoretical
fragments, the number of theoretical fragments that had no matching centroid, and the
number of fragments that it did not attempt to match (skipped) because their expected
intensity was below the signal-to-noise setting.
The FISh scoring algorithm annotates each centroid that matches the m/z value of a
theoretical fragment ion with its theoretical structure, m/z value, and formula. In
addition, the application displays matching centroids in green with a green circle at the
top.
Note The Use As Reference command is unavailable when the Show Library Spectra
As Reference command is available.
Note The mzCloud database is compatible only with the Internet Explorer™ web
browser. To access the mzCloud spectral database from the Compound Discoverer
application, set Internet Explorer as your default Internet browser.
2. In the spectral tree pane of the Mass Spectrum view, select a fragmentation scan.
3. Right-click the spectrum plot and choose Submit To mzCloud.
The online mzCloud application opens to the Select Spectrum dialog box and displays
the selected query spectrum.
4. Optimize the settings and click OK.
Figure 111. mzCloud search for a query spectrum of Chlorpyrifos
–or–
• From the start time to the end time for the chromatographic peak as defined by the peak
detection algorithm
For MS1 scans, the tree lists the scan number, the retention time at the apex of the
chromatographic peak, the mass analyzer, and the scan polarity.
For MSn scans, the tree lists the scan number, the retention time at the apex of the
chromatographic peak, the mass analyzer, the scan polarity, and the fragmentation
information, including the scan power, collision cell type, and the fragmentation type
(data-dependent acquisition [DDA] or data-independent acquisition [DIA]).
If the analysis predicts a formula for the compound or feature, the MS1 scan shows the
isotope pattern fit for the detected compound. Colored rectangles highlight the mass spectral
peaks (centroids) that match the theoretical isotope pattern. These rectangles have a
minimum display width to ensure that they are still visible when you zoom out or use the
Undo All Zoom/Pan shortcut menu command.
Note The isotope pattern fit algorithm is “resolution aware”; that is, in addition to the list
of elemental compositions provided by the Generate Expected Compounds node or the
Predict Compositions node, it uses the resolution information provided with the scan data
to perform an isotope pattern fit and calculate a spectral distance score.
If the resolution information is unavailable, it uses the setting for the Unrecognized MS
Resolution parameter in the Select Spectra node.
Figure 112. MS1 scan with color-coded mass peaks and the shortcut menu (LC study)
Table 101 describes the color coding for the centroids in an MS1 spectrum.
Table 101. Color coding for the centroids in an MS1 spectrum for a compound with a formula
annotation
Color Meaning
( ) Lavender Indicates the most intense centroid in the spectrum.
Note The A0 isotope (monoisotopic ion) is always the isotope with the
lowest m/z value, but it is not necessarily the isotope with the highest
intensity. For example, for compounds with more than one bromine
atom, a bromine atom and a chlorine atom, or more than four chlorine
atoms, the M + 2 (A2) isotope is the most intense isotope.
( ) Green The labeled centroid matches the delta mass and the relative intensity of
the theoretical isotope pattern for the formula annotation within the
specified tolerances.
( ) Red The expected centroid for this m/z value is missing or its intensity does not
fall within the tolerance range for the theoretical isotope pattern for the
formula annotation.
( ) Light blue The expected centroid for this m/z value (for the formula annotation)
might be missing because its theoretical intensity is at the level of the
baseline noise.
For details about the Result Charts view, see the following topics:
• Open the Result Charts view
• Use the copy, export, and zoom commands for the Result Charts view
• Display and pin the Options Pane for the Result Charts view
• Histogram charts
• Bar charts
• Pie charts
• Scatter plots
1. Open a result file. See “Open, close, and update result files.”
2. In the menu bar, choose View > Result Charts.
By default, the Result Charts view opens as a floating window. You can resize the window,
drag the window to another screen, or dock the window. See “Rearrange the tabbed pages
and graphical views.”
The Options pane for modifying the appearance of a chart view is a collapsible pane to
the left of each chart.
Figure 113. Floating Result Charts view with the collapsed (unpinned) Options pane at the left
Use the copy, export, and zoom commands for the Result Charts view
Y To access and use the shortcut menu commands for a Result Charts page
1. On the selected page of the Result Charts view, select the variables that you want to plot.
2. Click Refresh to plot the data.
3. Right-click the chart and choose one of the commands listed in Table 103.
Table 103. Common shortcut menu commands for the Result Chart viewsa
Command Availability Function
Undo All Pan/Zoom Scatter Chart, Histogram Undoes all panning and zooming.
Chart, and Bar Chart Returns the view to the default
magnification.
Zoom Out Scatter Chart, Histogram Undoes the last zoom-in action.
Chart, and Bar Chart
Copy > Image All Copies a raster image of the plot to
the Clipboard.
Copy > Data All Copies a list of the data points to the
Clipboard.
Export > Image As All Saves the plot in the selected file
format.
Display and pin the Options Pane for the Result Charts view
The Options pane of the Result Charts view contains the formatting options for the grid
lines, fonts, and so on.
Histogram charts
Use the Histogram Chart page of the Result Charts view to plot the frequency distribution of
a variable (result table column). You can display the data as a column chart, a bar chart, a line,
or a stepped line chart.
Table 104 describes the parameters for the Histogram Chart page.
Table 104. Histogram Chart parameters (Sheet 1 of 6)
Parameters Description
Data Source Specifies the source of the data that you want to plot. The
available data sources are the result table columns in the result file.
Refresh Refreshes the display with data points from the selected data
source.
Options pane commands
Load Loads the Option pane settings that you saved with the Save
command.
Save Saves the settings that you selected in the Options pane.
Factory Defaults Resets the settings of the options in the Options pane to the
defaults in effect when you installed the application.
Default: 0.8
Column Label Font Specifies the font of the column labels that appear on top of the
bars in the histogram. These labels are visible if you set Show
Column Amount to True.
The default x-axis title is the category selected from the Data
Source list.
Y-Axis Type Specifies the axis type (scale) of the y axis.
• (Default) Linear: Plots the data on a linear scale.
• Log: Plots the data on a logarithmic scale.
• Percent: Plots the data as a percentage of the number of items.
Y-Axis Title Specifies the label for the y axis. The default y-axis title is Count.
Reduce Number of Determines whether the application increases readability by
Axis Labels reducing the maximum number of axis labels to 30. If the chart
includes more than 30 values, it displays only every second or
every third label.
• (Default) True: Reduces the maximum number of axis labels
to 30.
• False: Does not reduce the maximum number of axis labels to
30.
Axis Title Font Specifies the font used to denote the labels of the x and y axes.
Default: 20
Note For discrete numbers, the actual group number might be
different.
Default: 1
Note For discrete numbers, the actual group number might be
different.
Use Full Series Value Determines the range of data values that the application uses to
Range compile the histogram.
• (Default) True: Uses all data for the histogram.
• False: Uses only the data between the values specified by the
Minimum Value option and the Maximum Value option.
Minimum Value Specifies the minimum value of the displayed data range. When
you use this parameter and the Maximum Value parameter, set
Use Full Data Range to False. Use these two parameters when you
want to show only a subrange of the data in a histogram.
Maximum Value Specifies the maximum value of the displayed data range. When
you use this option and the Minimum Value option, set the Use
Full Data Range parameter to False.
5. Legend Options
Show Legend Determines whether a legend appears and where it appears.
• (Default) None: Does not display a legend.
• Top: Displays a legend at the top of the histogram.
• Left: Displays a legend to the left of the histogram.
• Bottom: Displays a legend at the bottom of the histogram.
• Right: Displays a legend to the right of the histogram.
Legend Font Specifies the font for the legend.
Default: Firebrick
Bar charts
Bar charts plot categorical and ordinal data types in columns with the count of the data types
as the column height.
The Data Source list contains the numerical data categories that are available for the bar chart.
The Options pane contains the different options that you can use to customize the bar chart.
Moving the cursor over the columns in the chart activates a tooltip with information about
the data category.
Table 105 describes the parameters on the Bar Chart page of the Result Charts view.
Table 105. Bar Chart parameters (Sheet 1 of 3)
Command or option Description
Options pane
Load Loads the Option pane settings that you saved with the Save
command.
Save Saves the settings that you selected in the Options pane.
Factory Defaults Resets the options in the Options pane to the default settings in
effect when you installed the application.
Chart Options
Chart Type Determines how the application plots the data.
• (Default) Column: Displays the data as columns extending
from bottom to top.
• Bar: Displays the data as bars extending from left to right.
Horizontal Grid Lines Determines whether the bar chart displays horizontal grid lines
and specifies the style of these lines.
• (Default) None: Displays no horizontal grid lines in the bar
chart.
• Solid: Displays solid horizontal grid lines in the bar chart.
• Dotted: Displays dotted horizontal grid lines in the bar chart.
Axis Options
X-Axis Title Specifies the label for the x axis. The default x-axis title is the
category selected from the Data Source list.
Y-Axis Title Specifies the label for the y axis. The default y-axis title is Count.
Y-Axis Type Specifies the axis type (scale) of the y axis.
• (Default) Linear: Plots the data on a linear scale.
• Log: Plots the data on a logarithmic scale.
• Percent: Plots the data as a percentage of the number of items.
Axis Title Font Specifies the font used to denote the labels of the x and y axes.
Default: 30
Bar Display Specifies the appearance of the columns or bars.
• (Default) Flat: Displays the columns or bars as flat rectangles.
• Cylinder: Displays the columns or bars as cylinders.
• Emboss*: Displays the columns or bars as three-dimensional
rectangles.
• LightToDark: Displays the columns or bars as shaded
rectangles.
Range: 0.1–1.0
Default: 0.8
Show Amount Determines whether the amount, or count of items, is displayed in
the chart.
• True: Displays the amount or count of items in the chart.
• (Default) False: Does not the display the amount or count of
items in the chart.
Show Percentage Determines whether percentage values are displayed above the
columns or to the right of the bars.
• True: Displays percentage values above the columns or to the
right of the bars.
• (Default) False: Does not display percentage values above the
columns or to the right of the bars.
Default: CornflowerBlue
Data Source Displays the result category used to plot the data.
Pie charts
The Pie Chart page shows several categories of data as a solid circle composed of slices (a pie)
or as a ring (a doughnut). You can use a pie chart or a doughnut chart to indicate the relative
size of quantities of data.
1. With an active result file, choose View > Result Charts from the menu bar.
2. Click the Pie Chart tab in the Result Charts view.
3. From the Data Source list, select the type of data to display.
4. Click Refresh to draw the chart.
5. (Optional) To change the chart’s appearance, do the following:
a. Open the Options pane and adjust the colors, labels, and legends of the display and
specify how the chart displays small slices. Or, click Load to load the option settings
that you most recently saved.
–or–
• Change the Small Slice Threshold (%) setting.
Table 106 describes the parameters on the Pie Chart page of the Result Charts view.
Table 106. Pie Chart page parameters (Sheet 1 of 3)
Command or Option Description
Options pane
Load Loads the Option pane settings that you saved with the Save
command.
Save Saves the settings that you selected in the Options pane.
Factory Defaults Resets the options in the Options pane to the default settings in
effect when you installed the application.
Chart Options
Chart Type Specifies the type of chart to display:
• (Default) Pie: Displays the chart as a solid circle composed of
slices.
• Doughnut: Displays the chart as a ring.
Default: 0
Angle of First Slice Specifies the location of the first chart slice.
Default: 0
Pie Slice Options
Show Type Name with Determines whether the application displays the name of the data
Value group next to the value.
• True: Displays the name of the data group next to the value.
• (Default) False: Does not display the name of the data group
next to the value.
Show Slice Amount Determines whether the application displays the amount of each
slice.
• (Default) True: Displays the amount of each slice.
• False: Does not display the amount of each slice.
Show Slice Percentage Determines whether the application displays the percentage of
each slice.
• (Default) True: Displays the percentage of each slice.
• False: Does not display the percentage of each slice.
Label Style Specifies the label style of the chart segments.
• Disabled: Does not display a label.
• (Default) Inside: Displays the label on top of the chart
segment.
• Outside: Displays the label outside the chart segment.
Labels Font Specifies the font of the data set labels.
Default: 5%
Small Slice Collection Specifies the label for the pie segment composed of consolidated
Label segments.
Default: Other
Small Slice Collection Specifies the color of the pie segment composed of consolidated
Color segments.
Default: Gray
Show as Supplemental Determines whether to display small segments as a supplemental
Pie pie chart.
• True: Displays small segments as a supplemental pie chart.
• (Default) False: Leaves small segments as is.
Supplemental Pie Size Specifies the size of the supplemental pie chart relative to the
original pie chart.
• Largest: Displays the supplemental pie chart as much larger
than the original pie chart.
• Larger: Displays the supplemental pie chart as larger than the
original pie chart.
• Comparable: Displays the supplemental pie chart as about the
same size as the original pie chart.
• (Default) Smaller: Displays the supplemental pie chart as
smaller than the original pie chart.
• Smallest: Displays the supplemental pie chart as much smaller
than the original pie chart.
Data Source Displays the result category used to plot the data.
Scatter plots
Use the Scatter Chart page to set up a scatter plot for visualizing whether there is a linear or
logarithmic relationship between two or three variables (columns) in a result table.
For details about working with scatter plots, see these topics:
• Set up a scatter plot
• Use a filter set to filter the scatter plot
• Customize the appearance of a scatter plot
• Customization options for a scatter chart plot
• Scatter Chart page parameters
• Scatter Chart page shortcut menu commands
Depending on your selections, a two- or three-dimensional linear plot of the data points
appears. With the default appearance settings, the data points appear as blue circles in a
2D plot and as circles of varying colors in a 3D plot. For a 3D plot, a color legend for the
lowest to the highest Z data value appears to the right of the scatter chart.
Figure 114. Three-dimensional scatter chart
Y To interactively filter the scatter plot by using the Result Filters view
1. Set up and apply a set of result filters in the Result Filters view.
On the Scatter Chart page of the Result Charts view, the Refresh button turns orange.
2. Click Refresh to refresh the scatter chart plot.
1. Open the Options pane on the Scatter Plot page of the Result Charts view and pin it. See
“Display and pin the Options Pane for the Result Charts view.”
2. In the Options pane, do any of the following:
• To change the scaling, colors, labels, and legends in the display, under Axis Options,
make the appropriate changes.
• The application applies the changes as you make them.
• To change the font size or font type of the axis labels, click the expand icon to the left
of Axis Scale Font or Axis Title Font. Then, make the appropriate selections or click
the browse icon to open the Font dialog box where you can make your selections.
• A browse icon and a set of font parameters appear.
• To return the option settings to the original default settings, click Factory Defaults.
By default, the application saves the file to the last open folder
and uses the following convention to name the file:
ResultTable_x_XDataSelection_Range_YDataSelection_Range
_z_ZDataSelection_Range_DataPointType.txt
Exporting data from a scatter plot creates three text files: File
name Filtered-out Points.txt, File name Points.txt, and File name
Selected Points.txt.
Select Item for Point Highlights the appropriate row in the result table (selected Data
Source).
Check Point By default, changes the selected point to a red diamond and
selects the check box in the Checked column for the selected
point (row in the selected Data Source result table). You can
change the appearance of selected points by making the
appropriate selections in the Series Options > Checked Points
area of the Options pane. When you save the data to a text file,
the Selected Points.txt file lists the selected points.
You define the sample groups by selecting one or more check boxes under Group By in the
collapsible pane to the left of the chart. In addition, you can change the sort order of each
sample group and the hierarchy of the sample groups.
You can use the Trend Chart view to plot the data for a compound (or feature) in these result
tables: Compounds, Expected Compounds, and Merged Features. For a single compound, the
chart can plot the data as a trendline plot or a box-and-whisker plot. For two or more
compounds, the chart displays the data only as a trendline plot, with one trendline for each
compound.
Note The trendline plot can plot sample groups that include only one data point;
however, to plot the error bars for a group, it requires a minimum of two data points.
With two data points, the circle represents the calculated median and the error bars
represent the minimum and maximum areas.
The box-and-whisker plot requires a minimum of two data points to plot the box for a
sample group. It does not plot sample groups that include only one data point. If none of
the sample groups includes the minimum number of data points, the following text
appears in the chart area: No results available to plot.
By default, the trendline plot type plots the median peak area for the selected component on
the y axis against equally spaced sample groups on the x axis and connects the data points with
straight lines. Each data point appears as a solid circle with two error bars. The circle
represents the median. If the group includes a least three data points, the application uses the
following equation to calculate the standard error of the median and draw the error bars:
standard error of the median = π ⁄ 2 × std. dev (x) ⁄ ( N )
Figure 115. Trendline display of the data distribution
The box-and-whisker chart plots the peak area for a selected component on the y axis as a
rectangle against equally spaced sample groups on the x axis. The height of the rectangle
represents the peak areas in the interquartile range (see Figure 116). The application uses the
following equations to calculate the upper and lower whiskers:
Interquartile range (IQR) = Quartile 3 (Q3) – Quartile 1 (Q1)
Upper whisker = Q3 + IQR × 1.5
Lower whisker = Q1 – IQR × 1.5
When the data set contains a small number of data points, the whiskers typically end at the
highest and lowest data points. If the data set does not include a data point between the top of
the interquartile range and the calculated value for the upper whisker, the application does not
draw an upper whisker. If the data set does not include a data point between the bottom of the
interquartile range and the calculated value for the lower whisker, the application does not
draw a lower whisker.
Figure 116. Box-and-whisker display of the data distribution
Upper whisker
Third quartile
Interquartile
range Median
First quartile
Lower whisker
Note To calculate the quartiles, the application uses a method that is similar to the type 6
method in the R statistical computing software.
For details about working with the Trend Chart view, see the following topics:
• Open the Trend Chart view
• Define the sample groups to compare
• Compare the peak areas for a single compound by sample group
• Change the sort order of the defined groups
• Compare the peak areas for multiple compounds by group
• Change the hierarchy of the variables used for grouping
• Show the error bars in a trendline chart
• Trend Chart view parameters
1. Open the result file of interest, and then open the Trend Chart view.
2. In the left pane, under Group By, select the appropriate check boxes to define the sample
groups.
1. Open a result file, open the Trend Chart view, and define the sample groups that you
want to compare.
2. In the Compounds, Expected Compounds, or Merged Features table, select the
compound (or feature) of interest.
3. Right-click the graph area and choose Show Legend.
The legend displays the sample group colors.
4. In the Plot Type list, select Trendline Chart or Box Whisker Chart.
Depending on the selection, either a trendline plot or a box-and-whisker plot appears in
the graph area. A tooltip opens when you place the cross-hair cursor anywhere on a box or
whisker in the box-and-whisker plot or on data point in a trendline plot.
Figure 117 shows a box-and-whisker plot. Placing the cross-hair cursor on a box or
whisker opens a tooltip with descriptive statistics.
Figure 117. Box-and-whisker plot for one compound
Y To change the sort order of the defined groups in the Trend Chart view
In the left pane, under Group By, click the sorting icon, , next to the study variable to
sort and choose Sort Ascending or Sort Descending.
Changing the sort order changes the order of the groups on the x axis (Figure 118).
Figure 118. Trend chart with compounds grouped by solvent and sorted in ascending or
descending order by chromatographic peak area
Sorted in
ascending
order
Sorted in
descending
order
1. Open the result file of interest, open the Trend Chart view, and define the sample groups.
2. To select the compounds to plot, press the CTRL key and select rows in the result table
(Compounds, Expected Compounds, or Merged Features), taking care to avoid clicking
an editable table cell.
The plot changes to a scaled trendline. The data points represent the group median. The
legend displays the name (if available), molecular weight, and retention time of each
selected compound.
Figure 119 shows a scaled trendline chart. To view descriptive statistics for the data
points, click anywhere in the plot to activate the cross-hair cursor, and then place the
cross-hair cursor on each data point of interest.
Use the handle ( ) next to the variable to drag the variable up or down in the list.
1. In the Scaling list above a trendline plot in the Trend Chart view, select Unscaled.
2. Right-click the plot and choose Show Standard Errors.
The application plots the group median with error bars for the standard error of the
median (Figure 120).
The compound’s name, elemental composition, MW, and RT appear above the graph
(Figure 121). The bars are colorized by input file.
Figure 121. Isotopologues Distribution Chart with no grouping selected
Tip By default, the Show Position Tooltips feature is turned on. If a tooltip does
not appear, click the chart. If a tooltip still does not appear, right-click the chart
and choose Show Position Tooltips.
• To group the isotopologues by input file, under Group By, select the File check box.
• To group the isotopologues by sample type, under Group By, select the Sample Type
check box.
Figure 123. Isotopologues grouped by sample type
Y To set up the mass defect plot and examine it for similar compounds
1. Open a result file from a processing workflow that included the Detect Compounds
node.
2. From the application menu bar, choose View > Mass Defect Plot.
The compounds listed in the Compounds table appear as blue circles in the Mass Defect
plot. The y-axis label displays the mass defect type, and the red dashed lines indicate the
valid range of the mass defect values.
3. To specify how to calculate the mass defect, do the following:
• From the Type list, select the mass defect calculation. Then, if you selected Kendrick
Mass Defect, enter the Kendrick formula.
• From the Rounding list, select Ceiling, Floor, or Round.
4. To highlight the compounds that have an assigned name in orange, select the Highlight
Named Compounds check box.
5. To work interactively with the plot, do the following:
• Use the shortcut menu commands for the view to do any of the following:
– Zoom in or out of the plot.
– Copy the image or the data points to the Clipboard.
– Export the image or the data points to an external file.
– Check or clear the visible data points. Checking a data point highlights the data
point in red and places a check mark in the Checked column for the compound
in the Compounds table.
• Double-click a data point to navigate to the compound in the Compounds table.
• Point to a compound to display a tooltip with information about the compound’s
mass defect, name, elemental composition, molecular weight, and retention time.
Figure 124. Mass Defect Plot view with its shortcut menu displayed
For more information, see “Mass defect types and visualization techniques.”
The Principal Component Analysis view contains three pages: Scores Plot, Loadings Plot, and
Variances Plot. The scores plot shows the correlation among the observations. The loadings
plot shows the relationship among the variables for a given pair of principal components. The
variance plot shows the percentage and cumulated percentage of the variance that a principal
component accounts for. In general, as the proportion of variance increases for the first two or
three principal components, the dissimilarity between the sample groups increases.
For details about working with the Principal Component Analysis view, see the following
topics:
• Set up a principal component analysis
• Interpret the scores plot
• Interpret the loadings plot
• Work interactively with the loadings plot
The Principal Component Analysis view is available for result files that include any of these
tables: Compounds, Expected Compounds, or Merged Features.
• Check the percentage values for the principal components. The labels on the x and y axes
include the proportion of variance that the principal components add to the total variance
as a percentage.
• Place the cross-hair cursor on a data point to view its coordinates.
Figure 125 shows the scores plot for the compounds that leach out of four o-ring types
soaked in ethanol. The black, brown, and white O-rings show a similar variance for PC1,
while the red O-rings show a significant variance in the other direction for PC1. The
brown and white O-rings show a similar variance for PC2.
Figure 125. Scores plot for four O-ring types (white, brown, red, and black)
Figure 126 shows the scores plot for the compounds found in two sources of oregano.
The principal components for the two groups are in opposite quadrants, which means the
two groups are significantly different.
1. To determine the relative correlation between various data points, review their relative
location in the plot.
• Data points that are near each other are similar.
• Data points that are on opposite sides of the origin have a negative correlation.
• Data points in the corners of the plot have a strong contribution to both principal
components—that is, these data points differentiate between groups.
2. Place the cross-hair cursor on a data point to display a tooltip information about the
compound.
For LC studies the tooltip displays the principal component coordinates, molecular
weight, retention time, maximum peak area, and number of adduct ions. See Figure 127.
Figure 127. Loadings plot with its shortcut menu displayed (LC study)
Shortcut menu
Y To select a data point or check one or more data points in the Loadings Plot
• To check multiple points, drag the mouse pointer across a rectangular area of the plot to
zoom in on that area, then right-click the plot and choose Check All Visible Points.
The application selects the check boxes for the checked data points in the data source
table and changes the color of the selected points to blue in the loadings plot. The color
of the checked data points also changes to blue in the interactive views.
Figure 128. Loadings plot with checked points shown in blue
• To clear all the selected check boxes in the result table, right-click the result table and
choose Uncheck All > In This Table.
Compare the contribution of the first two or three principal components to the
cumulative variance.
For example, Figure 129 shows the variance plot for the compounds leached from red
O-rings soaked either in water or an aqueous solution of sodium chloride.
Figure 130 shows the variance plot for the same type of O-rings soaked either in water or
ethanol.
The PC1–PC3 components make up more of the cumulative variance when comparing
the extraction strength of water to ethanol than when comparing water to a salt solution.
From these results, you can infer that the extraction strengths of water and ethanol differ
more than the extraction strengths of water and a salt solution.
Figure 129. Water versus an aqueous solution of sodium chloride (extraction strength)
A box-and-whisker plot displays the data for a variable as a rectangular box with a set of
whiskers at each end. The line through the rectangle represents the median value in the data
set. The lower portion of the rectangle represents the data points that fall within the second
quartile and the upper portion represents the data points that fall within the third quartile.
The circles that fall outside the fence whiskers are outliers.
The application uses the following equations to calculate the upper and lower fences:
Interquartile range (IQR) = Quartile 3 (Q3) – Quartile 1 (Q1)
Upper fence = Q3 + IQR × 1.5
Lower fence = Q1 – IQR × 1.5
Note To calculate the quartiles, the application uses a method that is similar to the type 6
method in the R statistical computing software.
Third quartile
Interquartile
range Median
First quartile
Lower fence
By default, the graph displays the data for all the samples and duplicates the grouping on the
Grouping and Ratios page of the analysis. Each group appears in a different color. The legend
shows the colors of the sample groups.
Use the Group By check boxes in the collapsible pane to change the grouping. Use the Filter
By check boxes to remove samples from the plot.
For more information about the shortcut menu commands, see “Copy or save graphical views
for publication.”
Table 112. Common tasks for the Descriptive Statistics view
Task Procedure
Copy or save the data as an image. 1. Right-click the plot and choose Show Legend.
2. Right-click the plot and choose Copy > Image to copy an image to the
Clipboard.
–or–
Right-click the plot, choose Export > Image As, and select an image type
to save the data to an image file.
Copy or save the data as editable Right-click the plot and choose Copy > Data to copy the text to the
text. Clipboard.
You can paste this text to Notepad, an Excel spreadsheet, and so on. The data
appears in a columnar format.
–or–
Right-click the plot and choose Export > Data As to save the data to a text file.
The file contains two data sets. The first set consists of these columns from left
to right: Groups, Name, Minimum Value, Maximum Value, Std. Deviation,
Mean, Median, Q1 Value, Q2 Value, and Q3 Value. The second set lists the
outlier data points and consists of these columns from left to right: Groups,
Name, and Outlier.
View the entry in the result table for In the plot, double-click the data point.
an outlier data point.
Select the check box for an outlier In the plot, right-click the data point and choose Check Point.
data point.
In the result table, the check box is selected for the corresponding compound
or expected compound.
Export the outlier data points to a 1. In the view, zoom in on the outlier points so that they are the only visible
spreadsheet. points on the screen.
2. Right-click the plot and choose Check All Visible Points to select the
rows for these outliers in the compounds table.
3. In the compounds table, sort the checked rows by the variable of interest.
For example, sort the rows by molecular weight, retention time, or both.
4. Right-click the compounds table and choose Export > Export to Excel.
5. In the Export to Excel dialog box, select a folder and name the file, select
the Checked Items Only check box, and click Export.
Selecting this check box transforms the area counts to the log10
scale.
Use Normalized Areas Select to display normalized chromatographic peak areas.
For details about working with the Differential Analysis view, see the following topics:
• Review the initial differential analysis
• Change the analysis settings for a differential analysis
• Run a new differential analysis
• Differential Analysis view parameters and shortcut menu commands
1. Open the result file for an analysis that included at least two raw data files.
2. In the menu bar, choose View > Differential Analysis.
If the processing workflow included the Differential Analysis node, the Differential
Analysis view opens with the differential analysis from data processing. The ratios in the
Comparison list match the ratios on the Grouping and Ratios Summary page of the
Summaries view. The initial p-value setting is 0.05 (–log10 0.05 = 1.3) and the initial
Log2 Fold change setting is 1 (a ratio of 2 to 1). Depending on the setting for the Log10
Transform parameter in the Differential Analysis node, the y-axis scale spans the p-value
range (0–1) or the –log10 p-value range (–log10 0 = Infinity, –log10 1 = 0).
3. To show the legend, right-click the plot and choose Show Legend.
The data points are color-coded, as defined by the legend.
Figure 132 shows a comparison between two extraction solvents—water and an aqueous
solution of 50% ethanol.
Figure 132. Differential Analysis view with the analysis run during data processing (LC study)
Analysis summary for the negative fold Analysis summary for the positive fold
changes (down-regulated compounds) changes (up-regulated compounds)
• Select a different ratio from the Comparison list. This changes the data points in the plot.
• Change the p-value setting by dragging the slider. This changes the y-axis range of the
shaded areas.
• Change the fold change setting by dragging the slider. This changes the x-axis range of the
shaded areas.
Figure 134 shows the effect of changing the Log2 Fold Change setting.
Figure 134. Fold change increased from 1 to 8
1. In the Differential Analysis view, click the Perform New ANOVA Calculation icon, ,
to the right of the Data Source box.
The Perform New ANOVA Calculation dialog box opens.
Figure 135. Perform New ANOVA Calculation dialog box in front of the Differential Analysis
view
The Perform New ANOVA Calculation dialog box closes, and the Pending Results area
appears at the bottom of the view. The Pending Results area displays the data source
(result table), the reference group, and the progress of the new calculation.
When the calculation is complete, the new ratio (sample group vs. reference group) appears
in the Comparison list.
9. To view the analysis, select the data source in the Data Source list and the new ratio in the
Comparison list.
The volcano plot updates with the selected analysis.
Use the slider to change the value in the corresponding box. Data
points that fall outside the upper and lower thresholds are in the
shaded regions.
The region shaded in green identifies the data points that are
significantly different (populations differ based on the p-value
setting) and that fall outside the upper fold change threshold.
Legend By default, the legend is hidden.
Table 115 describes the shortcut menu commands for the Differential Analysis view.
Table 115. Shortcut menu commands for the Differential Analysis view (Sheet 1 of 2)
Command Description
Show Position Tooltips Displays information about the compound.
Zoom Out Undoes the last zoom.
Undo All Zoom/Pan Zooms out to the full plot range.
Table 115. Shortcut menu commands for the Differential Analysis view (Sheet 2 of 2)
Command Description
Copy and Export See “Export spectral data to a new or existing mzVault
commands library.”
Show Legend Displays the legend for the color-coded data points.
Note The color-coded data points (circles) represent compounds in the selected data
source. Selecting a data point turns it blue and selects the check box for the corresponding
compound in the data source table.
Check Point To activate this command, point to a data point.
Selects the data point in the view and the check box for the
corresponding compound in the data source table.
Uncheck Point To activate this command, point to a selected compound.
The Partial Least Squares Discriminant Analysis view opens to the right of the tabbed
result tables.
3. In the collapsible pane on the left, under Discriminate By, select the study factors that you
want to discriminate by.
4. On the right above the plot, select the data source from the Data Source list.
5. In the #sPLS-DA Compounds box, type the number of compounds that you want to use
to differentiate the selected study factors.
6. To update the plot, click anywhere in the plot.
The orange circles represent the discriminating compounds.
7. Right-click the plot and choose Check All sPLS-DA Points.
The application selects the check boxes of the corresponding compounds in the selected
data source. The blue circles represent the checked discriminating compounds.
Figure 136. PLS-DA view with five checked discriminating compounds (LC study)
8. To view the list of compounds that you can use to differentiate the selected experimental
variables, set up a result filter to display only the checked compounds in the selected result
table.
Selection: 1 to 6
#sPLS-DA Specifies the number of compounds that when used together can
Compounds discriminate between the values for the study variables selected
under Discriminate By.
Use Normalized Areas Uses the normalized areas in the result file.
1. Open a result file that contains mapped KEGG pathways—that is, a result file from an
analysis that included the Map to KEGG Pathways node.
2. In the main compounds table, select a compound.
3. Below the main compounds table, click Show Related Tables to show the related tables.
4. In the related tables pane, click the KEGG Pathways tab to make it the active result table.
5. In the KEGG Pathways table, select the pathway that you want to view.
6. In the menu bar, choose View > KEGG Pathways.
The selected KEGG pathway opens. The blue circle indicates the selected compound in
the KEGG pathway. The red circles indicate related compounds that were not found in
the mzCloud database. The green circles indicate related compounds that were found in
the mzCloud database.
Figure 137 shows the selections of caffeine in the Compounds table and caffeine
metabolism in the KEGG Pathways table. The related Compounds tables shows the
compounds related to caffeine, which appear as red circles in the pathway diagram.
Figure 137. Caffeine metabolism pathway where caffeine appears as a blue circle (LC study)
1. Open a result file for an analysis that maps the detected compounds to their BioCyc
pathways.
2. In the main table pane, click the BioCyc Pathways tab.
3. Select a pathway of interest.
4. Below the BioCyc Pathways table, click Show Related Tables.
5. Click the BioCyc Results tab.
6. In the BioCyc Results table, select a compound of interest.
7. From the application menu bar, choose View > BioCyc Pathways.
The BioCyc Pathways view opens to the right of the result tables.
8. In the Omics-Overlay list, select Area.
Figure 139 shows a BioCyc pathway with an Area overlay.
1. Open a result file from an analysis that included the Map to Metabolika Pathways node.
2. In the Compounds table, select the compound.
3. Click Show Related Tables.
4. In the Related Tables pane, click the Metabolika Pathways tab.
5. Select the pathway that you want to view.
6. From the application menu bar, choose View > Metabolika Pathways.
The view opens to the right of the Compounds table and displays the selected pathway.
The structure for the selected compound is blue, the structures for other detected
compounds are red, and the structures for undetected compounds in the pathway are
black.
7. To overlay a data source, select the source from the Overlay Data Source list and type or
select a cell size from 6 to 30 pixels in width in the Overlay Cell Size box.
The data source selections depend on the processing workflow, but always include the
annotation source, chromatographic peak area, or Metabolika pathways graphic.
Figure 140. Retention Time Correction view with the regression curve for one input file
• To view overlaid regression curves for multiple input files, select multiple input files.
Figure 141. Retention Time Correction view with regression curves for multiple input files
For information about acquiring a set of raw data files with interspersed quality control
samples, see “Using quality control samples to compensate for batch effects.”
1. Open a result file for an analysis with batch normalization—that is, an analysis that
includes QC samples and a processing workflow with a QC correction node.
2. In the compounds table, select one or more compounds.
3. From the menu bar, choose View > Compound Area Corrections.
In the default layout, the Compound Area Correction view opens to the right of the
tabbed result tables. The Compound Area Correction view displays a scatter plot of the
areas for the selected compound on the y axis against the acquisition time for each input
file on the x axis.
The legend below the plot describes the data point symbols and the regression curve or
curves.
The Apply QC Correction node generates two regression lines: a blue curve for the
original areas and an orange curve for the corrected areas.
Table 117 describes the data point symbols. The legend is not static. If a compound does not
include any gap-filled QC samples, the legend does not include the diamond symbols for that
compound.
Table 117. Compound Area Corrections view legend (Sheet 1 of 2)
Data point symbol Description
QC samples
( ) Blue triangle Original area for the QC sample
( ) Orange triangle Corrected area for the QC sample
( ) Blue diamond Original area for a gap-filled QC sample
( ) Orange diamond Corrected area for a gap-filled QC sample
Non-QC samples
( ) Blue circle Original area for a non-QC sample
( ) Orange circle Corrected area for a non-QC sample
Pointing to a data point (circle, triangle, or diamond) displays a tooltip with the input file
name, the compound area, and the status—original or corrected.
Figure 142 shows the Compound Area Corrections view for a compound with 12 QC
samples. The chromatographic peak area for three of the QC samples is gap-filled.
Figure 142. Compound Area Corrections view for an analysis with the Apply QC Correction node, 12
QC samples, and a compound with three gap-filled QC samples
For an LC study, if you use the Apply SERRF QC Correction node and the analysis includes
input files from more than one batch, a blue vertical line separates the batches in the plot.
Figure 143 shows an analysis with the Apply AC Correction node for a set of input files from
two batches.
Figure 143. Compound Area Corrections view (Apply SERRF QC node for LC studies))
Figure 144. Compound Area Corrections view for a selected compound (daidzein)
The application uses an agglomerative (bottom-up) approach to find the similarities between
samples and compounds. Initially, the hierarchical cluster analysis assigns each compound to
its own singleton cluster. The analysis then proceeds iteratively, at each stage joining the two
most similar clusters into a new cluster, continuing until there is one overall cluster
represented by a dendrogram.
The dendrogram to the left of the heat map represents the distance (or dissimilarity) between
the compound clusters. The width of each node is proportional to the distance (or
dissimilarity) between two compounds at the lowest level or two compound clusters at the
higher levels. The dendrogram above the heat map represents the distance (or similarity)
between samples. The height of each node is proportional to the distances (or similarity)
between two samples at the lowest level or two sample clusters at the higher levels.
Pointing to a dendrogram node displays the distance between clusters, and pointing to a heat
map cell displays the compound and sample information.
By default, under Filter By, all the input files are selected, and under Other Settings, the Show
Labels check box for the compound labels is clear.
Figure 145. Hierarchical cluster analysis for the compounds detected in six samples from lean and
fat ZDF rats (LC studies)
Zoom Out (Rows) Zoom Out (Columns)
Display and Dendrogram that represents the distance
analysis (or similarity) between samples
Heat map legend (scaled areas)
options
Column
labels
To determine how to cluster the data, the application provides a choice of commonly used
distance functions and linkage methods (see Table 120 and Table 121).
To set up a hierarchical clustering analysis and work interactively with the heat map’s cells, see
these topics:
• Set up a hierarchical clustering analysis
• Work interactively with the heat map grid
• Hierarchical Cluster Analysis view parameters
1. Filter the result table that you want to use as the data source. The analysis uses the only
visible compounds in the table.
Note When you apply a filter that displays only the checked compounds in the
selected result table, a red bar appears to the right of the heat map. If only a subset of
the compounds is checked, the bar is discontinuous, with red indicating a checked
compound and gray indicating an unchecked compound.
Note By default, under Other Settings, the following check boxes are selected:
Column Settings: Show Clusters and Show Labels—Displays a dendrogram above the
heat map and sample labels below the heat map.
Row Settings: Show Clusters—Displays a dendrogram to the left of the heat map.
6. In the left pane, under Other Settings, select these analysis options:
• Under Distance Function, select how the analysis determines the distance between
the data points, where the data points are files (columns) or compounds (rows)
(Table 120).
• Under Linkage Method, select how the analysis performs the clustering analysis
(Table 121).
7. In the left pane, select these labeling options:
• Under Color By, select whether to display color bars above the heat map to visually
differentiate the samples by their study factor variables.
• Under Other Settings > Row Settings, select whether to display the compound labels.
Selecting the Show Labels check box under Row Settings displays the compound
labels to the right of the heat map.
• Under Other Settings > X-Axis Label, select the label for the sample columns.
8. To run the analysis, click Refresh.
Table 118. Common tasks for the heat map grid (Sheet 2 of 2)
Task Procedure
View information about a heat Point to the cell.
map cell.
The tooltip displays the compound’s name (if available),
its MW, RT, file ID, and the study factor value (if
available).
View the distance value for a Point to the node.
dendrogram node.
The tooltip displays the distance value.
Default:
• Available values
• Missing values white
Selection:
Available values Missing values
Distance Function Specifies the distance function to use for calculating the distance
between data points (see Table 120).
Default: Euclidean
Linkage Method Specifies the method to use for hierarchical clustering (see
Table 121).
Default: Complete
X-Axis Label Specifies the labels for the sample columns.
Selections: All Factors, Selected Factors, File ID, Full File Name,
and Sample Name
Note The application does not save the results from this mzLogic analysis to the result
file; however, you can add suitable candidates to the Structure Proposals table for a
compound and apply FISh scoring.
The ranking score provided by an mzLogic analysis is not a probability score. It is only a
measure of how similar a putative structure is to closely matching structures in the
mzCloud spectral database.
1. In the mzLogic Analysis view, open the Similar Structures from mzCloud area.
2. In the Candidates area, click a candidate to select it.
In the Similar Structures from mzCloud area, the matching portions of the similar
structures are highlighted in blue.
Available when the Auto Refresh check box is clear and you select
a different compound in the Compounds table.
Refresh Refreshes the Candidates area when you select another
compound.
Default: Medium
Note The Statistical Methods table is new in the Compound Discoverer 3.3 application.
For information about specific result tables, see the following topics:
• Common result tables
• Expected Compounds result tables
• Compound detection result tables
• Compound Identification result tables
• Pathway Mapping result tables
• Compound Scoring tables
• Statistical Methods table
• Differential analysis columns
• Descriptive statistics columns
• QC Correction columns
• Annotations Source column in a compounds table
• Peak Rating columns
• Peak quality factor (PQF) columns in the result tables
Adducts table
Use the Adducts table to view the list of adducts in the Adducts library. By default, the
Adducts table is hidden. See “Show or hide result tables.”
Peak Model Displays the peak model for the chromatographic peak and
includes information about the width and symmetry of the peak.
The retention time alignment node creates the File Alignments table. See “Align Retention
Times node.”
If you did not select a reference input file for the node’s Reference
File parameter, the node automatically selects the first sample file
(assigned the Sample Type of Sample) in the Files for Analysis
area.
#Landmarks For the original Align Retention Times node, this column displays
the number of features that the analysis used to align the specified
file to the reference file.
The Create FISh Trace node creates the FISh Trace Fragments table when the Individual
Traces parameter for this node is set to True. See “Create FISh Trace node.”
Table 125 describes the columns in the FISh Trace Fragments table.
Table 125. FISh Trace Fragments table
Column Description
Checked Use this column to select the rows that you want to display in the
result table and in reports after you apply result filters.
Tags Use this column to label the entries with custom tags that you
created by using the Custom Tags Filter view. You can filter the
result table on these tags.
Parent Compound Displays the selected compound in the Create FISh Traces node.
Formula Displays the elemental composition of the fragment ion.
Ion Displays the ion description.
m/z Displays the mass-to-charge ratio of the ion.
TIC Displays the total ion current for the fragment ion.
Mode Displays the fragmentation mode.
File ID Displays the integer that the application assigned to the input file.
Study File ID Displays the study file ID (F#) of the input file.
Structure Displays the ion’s molecular structure.
For a result file from an analysis of multiple input files from an LC/MS acquisition sequence,
use the related File Alignments table to check the chromatographic alignment of the features
in the input files.
Table 126 describes the information displayed in the Input Files table.
Table 126. Input Files table
Column Description
Alignment Score For the Align Retention Times (ChromAlign) node, displays the
alignment score (0.000 to 1.000).
Creation Date Displays the acquisition time stamp from the data system.
File Name Displays the file name of the input file.
Instrument Hardware Displays the hardware version of the Thermo Scientific mass
(hidden) spectrometer or analog detector used to acquire the raw data file.
Instrument Name Displays the mass spectrometer type used to acquire the raw data
file.
Max. Mass [Da] Displays the maximum mass that the analysis processed.
Min. Mass [Da] Displays the minimum mass that the analysis processed.
RT Range [min] Displays the data acquisition time for the raw data file.
Software Revision Displays the software version of the instrument control software
used to acquire the raw data file.
Ref. File ID For LC studies, this column displays the reference file that the
retention time alignment algorithm used.
Sample Type Displays the sample type.
Study File ID Displays the file identification number (F#) assigned by the
Compound Discoverer application.
Study factor columns Each study factor column displays the study factor value (item).
The Manual Peaks table is a main table. It is also related to the main Specialized Traces table.
To create this table, you must add a manual peak to a specialized trace. See “Manually
integrate chromatographic peaks.”
The Merge Features Node adds the Merged Features table to the result file. The Merged
Features table has the following primary related tables: Expected Compounds, Expected
Compounds per File, Compounds, Compounds per File, and Manual Peaks.
Tip When you add the Merge Features node to the processing workflow, the Find
Expected Compounds node and the Detect Compounds node automatically connect to it.
The Merge Features node consolidates the chromatographic peaks from these two input
nodes.
m/z × Area
-------------------------------
m/z
RT [min] Displays the area weighted average retention time of all related
features (same m/z × RT dimensions within the specified
tolerances).
RT × Area
-------------------------------
RT
Max. Area Displays the area of the largest chromatographic peak found in the
data set for the current m/z × RT dimensions.
Ion Conflict Status Indicates whether there is a conflict between the Detect
Compounds and Find Expected Compounds nodes.
( ) Green No conflict—Both nodes assigned the same ion to this
chromatographic peak.
( ) Orange Not found by the Detect Compounds node.
The Specialized Traces table lists the specialized traces that you requested in the processing
workflow. For information about manually integrating chromatographic peaks in a specialized
trace, see “Manually integrate chromatographic peaks.”
Tip To view a trace in the Chromatograms view, select the trace of interest in the
Specialized Traces table.
For information about adding structure proposals to the table, see “Add or delete proposed
structures for a compound.”
To edit this entry, click the table cell and type text in the box.
Formula Displays the elemental composition formula of the compound.
Molecular Weight Displays the molecular weight of the compound.
FISh Coverage Displays the FISh coverage score that is based on the proposed
structure. See “FISh scoring for proposed structures.”
Comments Displays an application-generated comment or a user-specified
comment for the compound.
To edit this entry, click the table cell and type text in the box.
The Group Expected Compounds node adds the Expected Compounds table to the result
file.
Table 132 describes most of the columns in the main Compounds table and the first-level
related Compounds tables. For information about the differential analysis, descriptive
statistics, and quality control columns, see the applicable topic:
• Differential analysis columns
Note The Differential Analysis node generates the following columns: Group Areas,
Group CV[%], Ratio, Log2 Fold Change, P-value, and Adj. P-value.
Note The Descriptive Statistics node generates the following columns: Min. Area, Q1
Area, Median Area, Q3 Area, Mean Area, Area SD, and Area CV [%].
• QC Correction columns
Note The QC correction node generates the following columns: #Usable QC, RSD
QC Areas [%], RSD Corr. QC Areas [%], and Norm. Area.
( ) Green—No Gap
Available when Search MSn Tree you set Search MSn Tree to
True. 'mzCloud Best Tree Match' and '# Spectra with MSn
(n>2) Hit' columns will be available when Search MSn Tree is
set to 'True' in the Search mzCloud node
mzCloud Library Search mzCloud Indicates whether the search found matches in the selected
Matches (hidden) libraries.
• ( ) Gray—No matches found
• ( ) Green—Single match found
• ( )Orange—Multiple matches found
mzVault Best Match Search mzVault Displays the score for the best hit from the mzVault mass
spectrum library.
Name Group Expected Displays the user-specified compound name.
Compounds
To populate this cell, you can type a name or use the Edit
Compound Annotation command.
Parent Compound Group Expected Displays the targeted compound.
Compounds
For more information about the peak quality factors and the color-coded table cells, see “Chromatographic peak
rating filter.”
Peak Rating Group Expected Displays the calculated peak rating for the compound.
Compounds
PQF: FWHM2Base Group Expected Displays the calculated peak quality factor for the compound’s
Compounds chromatographic peak.
PQF: Jaggedness Group Expected Displays the calculated value for the jaggedness of the
Compounds compound’s chromatographic peak.
Clicking a row in the Expected Compounds per File table displays an XIC trace for the
selected compound. The XIC trace is a summation of the related ion traces. The integrated
peak area is shaded, the vertical red line indicates the chromatographic peak apex, and the
triangle indicates the data point that corresponds to the retention time (RT) label. When the
data has been chromatographically aligned (by using the Align Retention Times node), the
RT values for the labeled data points might differ slightly from those for the closest MS1 scan
in the spectral tree.
The Find Expected Compounds node creates the Expected Compounds per File table. The
primary tables related to this table are as follows: Expected Compounds, Input Files, Expected
Formulas, Merged Features, Expected Features, and Related Structures.
Table 133 describes the columns in the Expected Compound per File table. By default, some
of these columns are hidden.
Table 133. Expected Compounds per File table (Sheet 1 of 3)
Column Description
Checked Use this column to select the rows that you want to display in the
result table and in reports after you apply result filters.
Tags Use this column to add custom tags to the result table items. You
can filter the result table by these tags. See “Custom color-coded
tags for result table entries.”
Parent Compound Displays the user-specified parent compound or compounds.
You can filter the data by using this integer; for example, the
following filter reduces the table to the expected compounds
found in one input file: Study File ID is equal to F1, F2, F3...F11,
and so on.
The Find Expected Compounds node creates the Expected Features per File table.
Table 134 describes the columns in the Expected Features per File table.
Table 134. Expected Features per File table (Sheet 1 of 3)
Column Description
#MI Displays the number of matched isotopes for the ion.
Area Displays the summed area of all the related peaks (same expected
compound) in the current input file.
Charge Displays the charge of the ion.
Checked Use this column to select the rows that you want to display in the
result table and in reports after you apply result filters.
File ID (hidden) Displays the integer that the application assigned to the input file.
FWHM [min] Displays the width of the chromatographic peak at its half-height
in minutes.
Intensity (hidden) Displays the maximum intensity of all the related peaks per input
file.
Ion Displays the ionized form of the compound.
m/z Displays the mass-to-charge ratio of the ion.
Molecular Weight Displays the molecular weight of the monoisotopic neutral
compound.
Clicking a row in the Expected Formulas table displays overlaid XIC traces for the selected
expected compound, with one XIC trace for each input file where the compound is detected.
Each XIC trace is a summation of the ion traces for the same neutral elemental composition
(same molecular weight). By default, the Chromatograms view zooms in on the x-axis range of
the detected peaks for the same expected compound.
The Find Expected Compounds Node creates the Expected Formulas table, which has the
following primary related tables: Expected Compounds, Input Files, Expected Compounds
per File, Related Structures, and Transformations.
For information about targeted processing workflows, see “Targeted processing workflows for
expected compounds.”
1. Open a result file from a targeted analysis (Find Expected Compounds node).
2. Select a row in one of these tables—Expected Features or Expected Compounds per File.
3. Click Show Related Tables.
4. Click the Related Structures tab.
Transformations table
Use the Transformations table to review the transformations for each formula in the Expected
Formulas table.
Tip For information using the shortcut menu commands for the Compounds table, see
“Shortcut menu commands for the result tables.”
Tip Clicking a row in the Compounds table displays XIC traces for the selected unknown
compound, with one XIC trace for each input file. Each XIC trace is the summation of all
the related ion traces. The summed XIC trace is made up of the data points with the
highest intensity at each time point.
To view the adduct ion traces for a compound detected in a specific input file, do the
following:
1. Select the compound in the Compounds table.
2. In the first set of related tables for the selected compound, select the input file in the
Compounds per File table.
3. In the second set of related tables (for the compound detected in the selected input
file), open the Features table and select the adduct ion of interest.
Table 138 describes most of the columns in the main Compounds table and the first level
related Compounds tables. For information about the differential analysis, descriptive
statistics, and quality control columns, see the applicable topic:
• Differential analysis columns
Note The Differential Analysis node generates the following columns: Group Areas,
Group CV[%], Ratio, Log2 Fold Change, P-value, and Adj. P-value.
Note The Descriptive Statistics node generates the following columns: Min. Area, Q1
Area, Median Area, Q3 Area, Mean Area, Area SD, and Area CV [%].
• QC Correction columns
Note The QC correction node generates the following columns: #Usable QC, RSD
QC Areas [%], RSD Corr. QC Areas [%], and Norm. Area.
Table 138. Compounds table columns listed in alphabetical order (Sheet 1 of 10)
Column Node Description
Symbol = first character
#Adducts (hidden) Group Compounds Displays the number of adduct ions that the analysis found for the
compound.
#BioCyc Pathways Map to BioCyc Displays the number of BioCyc pathways that include the current
(hidden) Pathways compound.
#ChemSpider Results Search ChemSpider Displays the number of matching compounds found by the
ChemSpider search for the current composition or molecular
weight.
Table 138. Compounds table columns listed in alphabetical order (Sheet 2 of 10)
Column Node Description
#mzVault Results Search mzVault Displays the number of matching compounds found by the
mzVault search for this molecular weight.
Table 138. Compounds table columns listed in alphabetical order (Sheet 3 of 10)
Column Node Description
Annot. ΔMass [Da] Assign Compound Displays the mass difference in daltons between the experimental
(hidden) Annotations mass and the annotated mass.
Annot. ΔMass [ppm} Assign Compound Displays the mass difference in ppm between the experimental
Annotations mass and the annotated mass.
Annotation MW Assign Compound Displays the molecular weight of the matched compound from the
(hidden) Annotations specified annotation source.
Area (one subcolumn for Group Compounds Displays an area column for each input file.
each input file) (hidden if
the analysis includes study To display the areas for each input file, click the expand icon to the
groups) right of the column name.
Area (Max.) Group Compounds Displays the maximum chromatographic peak area from all of the
input files for compounds with the same retention time and
molecular weight (within the user-specified RT and mass
tolerances).
Avg. Exchange Analyze Labeled Average number of atoms exchanged for compound detected in
Compounds input file.
Background (hidden) Mark Background Displays a selected or clear check box that indicates whether the
Compounds compound was also found in the Blank sample above the
user-specified Sample/Blank or Blank/Sample level.
• Selected—Indicates that the compound is a background
compound.
• Clear—Indicates that the compound is not a background
compound.
BioCyc Pathways Map to BioCyc Displays whether the current compound is present in the named
(hidden) Pathways pathway.
To display the pathway names, click the expand icon to the right of
the column name.
Calc. MW Group Compounds Displays the calculated molecular weight of the neutral compound.
Checked Group Compounds Selecting this check box specifies that the compound is a selected
item.
Table 138. Compounds table columns listed in alphabetical order (Sheet 4 of 10)
Column Node Description
Class Coverage Compound Class Displays the class coverage score for the compound by individual
Scoring compound classes. Expand the header to display the names of the
compound class libraries.
Color-coded cells:
• Light green to dark green for values from 0.00 to 100.00 or
gray
• Gray / N/A means no MS2 spectra are available for the
compound or there are 0 matched and 0 unmatched centroids
in the MS2 spectra.
Table 138. Compounds table columns listed in alphabetical order (Sheet 5 of 10)
Column Node Description
Gap Status (hidden) Fill Gaps Indicates whether chromatographic peak for the compound was
completely missing or whether some of the user-specified ions were
not detected.
To display the pathway names, click the expand icon to the right of
the column name.
Table 138. Compounds table columns listed in alphabetical order (Sheet 6 of 10)
Column Node Description
Labeling Status (per file) Analyze Labeled The Analyze Labeled Compounds Node evaluates the measured
Compounds isotope pattern versus the fitted isotope pattern (for the expected
isotopologues) to determine the presence of contaminating masses.
It also evaluates the distribution of the measured exchange rates for
the expected isotopologues.
Table 138. Compounds table columns listed in alphabetical order (Sheet 7 of 10)
Column Node Description
Mass List Matches Search Mass Lists Indicates the match status for each mass list.
• ( ) Green—Single match found
• ( ) Orange—Multiple matches found
• ( ) Gray—No matches found
Use the related Mass List Search Results table to investigate the
matching compounds.
Metabolika Pathways Map to Metabolika Displays whether the current compound is present in the named
(hidden) Pathways Metabolika pathways.
To display the pathway names, click the expand icon to the right of
the column name.
m/z Group Compounds Displays the m/z value of the leftmost isotope peak of the most
common adduct ion of for this compound across the input files
(area-weighted average)
MS Depth (hidden) Group Compounds Displays the maximum depth of the mass spectral tree that the
analysis assigned to the compound.
This column is not available when the input files do not have any
fragmentation scans.
MS2 Group Compounds Displays whether the analysis found data-dependent fragmentation
scans for the compound.
• ( ) Red—No MSn—There are no available MSn scans.
• ( ) Green—ddMS2 for preferred ion—There is at least one
data-dependent MS2 scan for the preferred adduct ion.
• ( ) Blue—ddMS2 for other ion—There is at least one
data-dependent MS2 scan, but the scans are not for the
preferred adduct ion.
• ( ) Orange—DIA only—Only data-independent [all ions
fragmentation (AIF)] scans are available.
mzCloud Best Match Search mzCloud Displays the best match score (from 0 to 100) from the mzCloud
identity search for the compound.
Table 138. Compounds table columns listed in alphabetical order (Sheet 8 of 10)
Column Node Description
mzCloud Best Sim Match Search mzCloud Displays the best similarity score from the mzCloud similarity
search for the compound.
Table 138. Compounds table columns listed in alphabetical order (Sheet 9 of 10)
Column Node Description
PQF: Modality Group Compounds Displays the calculated peak quality factor for modality of the
detected compound’s chromatographic peak.
PQF: Zig-Zag Index Group Compounds Displays the calculated peak quality factor for the zig-zag index of
the detected compound’s chromatographic peak.
QC Fill Status (hidden) Apply QC Indicates the status for each QC sample.
Correction
Possible states:
• ( ) Green—Filled by re-detected peak
• ( ) Gray—N/A
• ( ) Orange—Filled by matching ion
• ( ) Blue—Filled by simulated peak
Rel. Exchange [%] Analyze Labeled Average exchange relative to the maximum exchange rate.
Compounds
100 × Average Exchange/Max. Exchange
Reference Ion Group Compounds Displays the most common adduct ion for this compound across
the input files
RT [min] Group Compounds Displays the retention time of the chromatographic peak for the
compound.
RT Tolerance [min] Group Compounds Displays the retention time tolerance specified in the Group
(hidden) Compounds node.
Structure (hidden) Group Compounds Displays the structure of the compound.
or structure
proposal The structure field is populated, the searches return a structure, or
you edit the annotations.
Tags – Use this column to add custom tags to the result table items. You
can filter the result table by these tags. See “Custom color-coded
tags for result table entries.”
The following three columns appear in the related Compounds table for the main KEGG Pathways table. Each
column contains an ordered list for the same KEGG compounds—that is, the first item in the KEGG Compound
IDs column corresponds to the first item in the KEGG Compound Names column and the first item in the KEGG
Compound Formula column, and so on.
KEGG Compound Map to KEGG Displays a list of the KEGG compound IDs in ascending order
IDs Pathways from left to right.
KEGG Compound Map to KEGG Displays a list of the KEGG compound names.
Names Pathways
KEGG Compound Map to KEGG Displays a list of the KEGG compound formulas.
Formulas Pathways
Table 138. Compounds table columns listed in alphabetical order (Sheet 10 of 10)
Column Node Description
The following two columns appear in the related Compounds table for the main KEGG Pathways table and the
search results tables.
Max. ΔMass [Da] Displays the maximum mass difference between the measured mass
for the compound in the Compounds table and the theoretical
mass of the compound, in daltons.
Max. ΔMass [ppm] Displays the maximum mass difference between the measured mass
for the compound in the Compounds table and the theoretical
mass of the compound, in parts per million.
This column appears in related Compounds table for the main mzCloud Search Result table.
Scan Number Search mzCloud Displays the scan number of the scan that matches the reference
scan in the mzCloud database.
This column appears in the related Compounds table for the main mzVault Results table.
ΔRT [min] Search mzVault Displays the difference between the measured retention time of the
compound and the retention time of the mzVault library entry in
minute.
Table 139 describes the columns in the Compounds per File table.
Table 139. Compounds per File table (Sheet 1 of 2)
Column Description
#Adducts Displays the number of adduct ions.
Area (All Ions) Displays the chromatographic peak area in counts * minutes for
(hidden) the summed peaks of all the adduct ions in this input file.
Area (Reference Ion) Displays the chromatographic peak area in counts * minutes in the
XIC trace for the most common adduct ion across the input files.
Calc. MW Displays the calculated molecular weight of the neutral
compound.
Checked Use this column to select the rows that you want to display in the
result table and in reports after you apply result filters.
Related Topics
• Shortcut menu commands for the result tables
• Show or hide table columns
The Detect Compounds Node adds the Features per File table to the result file.
Table 140 describes the columns in the Features per File table.
Table 140. Features per File table (Sheet 1 of 3)
Column Description
# Good Peaks (hidden) Number of chromatograms peaks (one or more for each isotopic
ion for the feature) that pass the peak quality factor thresholds
specified in the Detect Compounds node under Isotope Pattern
Detection.
#MI Displays the number of matching isotopes for the unknown
compound ion.
# Poor Peaks (hidden) Displays the number of chromatographic peaks that do not pass
the peak quality factor thresholds specified in the Detect
Compounds node under Isotope Pattern Detection.
By default, the Filled Gaps table is hidden. To display the table, click the Select Table
Visibility icon, , select the Filled Gaps check box, and click OK.
For information about the status flags, see “Labeling Status (per
file).”
Exchange Rate [%] Displays the contribution of individual isotopologues to the final
measured pattern.
File ID Displays the integer that the application assigned to the input file.
Study File ID Displays the study file ID (F#) of the input file.
The Similar Compounds table is generated by the Generate Molecular Networks node. See
“Generate Molecular Networks node.”
( ) Green—Full Match
( ) Orange—Partial Match
( ) Red—No Match
ΔMass [Da] Displays the mass difference in daltons between the search mass
and the mass of the matching compound in the ChemSpider
database.
ΔMass [ppm] Displays the mass difference in ppm between the search mass and
the mass of the matching compound in the ChemSpider database.
SFit[%] Displays the spectral similarity score from the Apply Spectral
Distance node.
The Search Mass Lists Node creates the Mass List Search Results table.
The main Mass List Search Results table displays all of the compounds in the selected mass
lists that match the compounds detected by the untargeted analysis. The related Mass List
Search Results table lists information about the compound selected in the main Compounds
table.
Table 146 describes the columns in the Mass List Search Results table.
Table 146. Mass List Search Results table (Sheet 1 of 2)
Column Description
Tags Use this column to add custom tags to the result table items. You
can filter the result table by these tags. See “Custom color-coded
tags for result table entries.”
Formula Displays the elemental composition of the compound in the mass
list.
Molecular Weight Displays the molecular weight of the compound in the mass list.
RT [min] Displays the chromatographic retention time (when available) of
the compound in the mass list.
Structure Displays the structure (when available) of the compound in the
mass list.
Name Displays the name of the compound in the mass list.
Miscellaneous These columns display additional information about the
annotations compound in the mass list.
Reference List Name Displays the name of the mass list that contains the matching
compound.
Additional columns in the related Mass List Search Results table
Compound Match (LC Indicates the match status between the current item and the
studies only) assigned compound annotation.
• ( ) Green—Full Match
• ( ) Orange—Partial Match
• ( ) Red—No Match
Max. ΔMass [Da] Displays the maximum mass difference between the measured
mass for the compound in the Compounds table and the
theoretical mass of the compound, in daltons.
Selecting a row in the mzCloud Results table displays a mirror plot with the selected
fragmentation scan on the top and the matched reference scan from the mzCloud database on
the bottom. The centroids for the matching fragments are displayed as green sticks with a
green circle at the end. Red circles on the x-axis indicate the m/z values of the missing
fragments.
In the collapsible spectrum tree pane to the left of the spectrum plot, the coverage bars
indicate whether the matching library spectrum is from an identity hit or a similarity hit.
Figure 147. Mirror plot with annotations for a matching spectrum in the mzCloud Results table
Coverage bar—dark blue for an identity hit and light blue for a similarity hit
Reference spectrum
from the mzCloud mass
Query spectrum from a sample spectral database
(Scan #2176 in input file F1)
Tip To automatically annotate the matching fragments in the mirror plot, you must set
the following advanced parameter in the Search mzCloud node to True—Annotate
Matching Fragments. By default, this parameter is hidden and set to False.
Table 148 describes the columns in the mzCloud Results Hits table.
Table 148. mzCloud Results Hit table (Sheet 1 of 2)
Column Description
Confidence Displays the confidence of the match.
Intensity Threshold Displays the relative intensity threshold for the search spectrum.
Library Spectrum ID Displays the spectrum ID of the library spectrum.
Table 150 describes the columns in the mzVault Results Hits table.
Table 150. mzVault Results Hit table
Column Description
Intensity Threshold Displays the relative intensity threshold for the search spectrum.
Match Displays the match quality of this search spectrum to the library
spectrum.
Name Displays the compound name in the mzVault library file.
Scan # Displays the scan number from the input files that contains the
best matching fragmentation spectrum.
Use the Predicted Compositions table to review the possible chemical formulas for the
selected compound in the Compounds table. The Predicted Compositions tables lists the
possible chemical formulas based on the compound’s molecular weight.
Table 151 describes the columns in the Predicted Compositions result table.
Table 151. Predicted Compositions table (Sheet 1 of 3)
Column Description
Checked Use this column to select the rows that you want to display in the
result table and in reports after you apply result filters.
Tags Use this column to add custom tags to the result table items. You
can filter the result table by these tags. See “Custom color-coded
tags for result table entries.”
Compound Match Displays whether the predicted composition matches the currently
assigned compound annotation.
( ) Red—No Match
( ) Green—Full Match
Formula Displays the predicted elemental composition.
Molecular Weight Displays the molecular weight.
Range: 0 to 1
Pattern Cov. (%) Displays the summed intensity of the matching isotope peaks in
the measured MS1 spectrum relative to the summed intensity of
the theoretical isotope pattern.
The Map to BioCyc Pathways node creates the BioCyc Pathways table. The columns that are
available in the main BioCyc Pathways table and the related BioCyc Pathways table depend on
which nodes provide data to the Map to BioCyc Pathways node.
Table 152 describes the columns in the BioCyc Pathways main table. The BioCyc Pathways
table lists all the BioCyc pathways that include at least one of the compounds in the main
compounds table.
Table 152. BioCyc Pathways table (Sheet 1 of 2)
Column Description
Checked Use this column to select the rows that you want to display in
the result table and in reports after you apply result filters.
Tags Use this column to add custom tags to the result table items.
You can filter the result table by these tags.
Pathway Name Displays the names of the mapped Metabolika pathways that
include matching structures (by formula, mass, or both) for
at least one compound in the Compounds table.
#Mapped Compounds Displays the number of detected compounds (in the
Compounds table) mapped onto the BioCyc pathway.
#Mapped Expected Displays the number of expected compounds (in the
Compounds Expected Compounds table) mapped onto the BioCyc
pathway.
#Matched Compounds Displays the number of compounds in the pathway that map
to the Compounds table.
#Matched Expected Displays the number of compounds in the pathway that map
Compounds to the Expected Compounds table.
Total #Matched Displays the number compounds in the pathway that map to
Compounds the Compounds table, the Expected Compounds table, or
both tables.
#Compounds in Pathway Displays the total number of compounds in the pathway.
The Map to BioCyc Pathways node creates the BioCyc Results table.
The Map to KEGG Pathways Node creates the KEGG Pathways table.
Table 154 describes the columns in the main KEGG Pathways table and the KEGG Pathways
table for a compound. The KEGG Pathways table lists all the KEGG pathways that include at
least one of the compounds detected by the untargeted search.
Table 154. KEGG Pathways table (Sheet 1 of 2)
Column Description
Checked Use this column to select the rows that you want to display in the
result table and in reports after you apply result filters.
Tags Use this column to add custom tags to the result table items. You
can filter the result table by these tags. See “Custom color-coded
tags for result table entries.”
Pathway ID Displays the identification number of the mapped KEGG
pathway.
Pathway Name Displays the name of the mapped KEGG pathway that includes at
least one compound in the Compounds table.
#Referenced Compounds (LC studies) Displays the number of detected compounds in the Compounds
table that are referenced in the identified KEGG pathway.
The Map to Metabolika Pathways node creates the Metabolika Pathways table.
Table 155 describes the columns in the Metabolika Pathways table. The main Metabolika
Pathways table lists all the Metabolika pathways that include at least one of the compounds in
any of the main compounds tables (Compounds table, Expected Compounds table, or both).
The related table displays the search results for the selected compound in the main
compounds table.
Table 155. Metabolika Pathways table (Sheet 1 of 2)
Column Description
Checked Use this column to select the rows that you want to display in
the result table and in reports after you apply result filters.
Tags Use this column to add custom tags to the result table items.
You can filter the result table by these tags.
Pathway Name Displays the names of the mapped Metabolika pathways that
include matching structures (by formula, mass, or both) for
at least one compound in the Compounds table.
#Mapped Compounds Displays the number of detected compounds (in the
Compounds table) mapped onto the Metabolika pathway.
#Mapped Expected Displays the number of expected compounds (in the
Compounds Expected Compounds table) mapped onto the Metabolika
pathway.
#Matched Compounds Displays the number of compounds in the pathway that map
to the Compounds table.
#Matched Expected Displays the number of compounds in the pathway that map
Compounds to the Expected Compounds table.
Total #Matched Displays the number compounds in the pathway that map to
Compounds the Compounds table, the Expected Compounds table, or
both tables.
#Compounds in Pathway Displays the total number of compounds in the pathway.
Additional columns in a Metabolika Pathways table for a compound in the main compounds
table
Metabolika Compound IDs Displays Metabolika compound ID.
(related table)
The Map to Metabolika Pathways node creates the Metabolika Results table.
( )—Full Match
( )—Partial Match
( )—No Match
The Compound Class Scoring node adds the Class Coverage column to the Compounds table
and creates the Compound Class Matches table, which is related to the Compounds table.
Table 157 describes the columns in the Compound Class Matches table.
Table 157. Compound Class Matches table
Column Description
Checked Use this column to select the rows that you want to display in the
result table and in reports after you apply result filters.
Tags Use this column to add custom tags to the result table items. You
can filter the result table by these tags. See “Custom color-coded
tags for result table entries.”
Name Displays the name of the library that contains the matching
structure.
Description Displays the user-specified description of the library.
FISh Coverage Displays the FISh coverage score (see “FISh scoring for proposed
structures.” ).
Class Coverage Displays the number of matching centroids in the best
fragmentation scan divided by the total number of fragments in
the selected compound class libraries.
# Matched Fr. Displays the number of library fragments that match the centroids
in the best fragmentation scan for a compound.
# Missed Fr. Displays the number of library fragments that do not match the
centroids in the best fragmentation scan for a compound.
Figure 149. Fragmentation scan and compound class match result for a detected compound
#Matched fragments
The Pattern Scoring node adds the Matched Patterns table to the result file.
Table 159 describes the columns in the main and related Neutral Losses tables.
Table 159. Neutral Losses table (Sheet 1 of 2)
Column Description
Main table
Checked Use this column to select the rows that you want to display in the
result table and in reports after you apply result filters.
Tags Use this column to add custom tags to the result table items. You
can filter the result table by these tags. See “Custom color-coded
tags for result table entries.”
• P-value
• Adj. P-value.
In addition, the following columns appear in the Merged Features table, Group Areas, Ratio,
and Log2 Fold Change. The background colors of the table cells provide visual information
about the numeric values in these columns.
When the compound is not found, the cells have a gray background.
The Differential Analysis node bins the group areas in the current
result file on the Log10 scale, with one bin for each order of
magnitude (1e4, 1e5, 1e6, and so on), and uses a different background
color for each bin. The values in the lowest bin have a pale-yellow
background. The values in the highest bin have a green background.
Color-coding:
• Lowest values:
• Highest values:
Note When a result table includes the Group Areas column, the
Area column is hidden, by default.
Group CV(%) Displays the coefficient of variation for the groups. Groups with a high
degree of variation (20% or greater) have a red background.
Color-coding:
( ) Values equal to or greater than 20 have a red background.
The cells for compounds with defined ratios greater than 0 have the
following background colors:
• Compounds with ratios between 0 and 0.5 are divided into five
equal bins. The background color for the table cells is a
progressively darker blue hue as the ratio approaches zero.
• Compounds with ratios greater than 2 are divided into five equal
bins. The background color for the tables cells is a progressively
darker red hue as the ratio increases.
0.5 2
0 < Ratio < 0.5 Ratio ≥ 2
Log2 Fold Change Displays the fold change (ratio) in the log base 2 scale.
The cells for compounds with log2 fold change values between
– infinity and infinity have the following background colors:
• Compounds with log2 fold change values more negative than –
1.00 are divided into five equal bins. The background color for
the table cells is a progressively darker blue hue as the value
becomes more negative.
• Compounds with log2 fold change values greater than 1.00 are
divided into five equal bins. The background color for the table
cells is a progressively darker red hue as the value increases.
Log2 Fold Change Log2 Fold Change
negative –1 1 positive
QC Correction columns
Table 163 describes the columns that the Apply QC Correction node (in combination with
QC samples) adds to the various compounds tables.
Table 163. QC correction columns (Sheet 1 of 2)
Column Description
Norm. Area (hidden) Displays the normalized areas of the QC corrected
compounds per input file.
Possible states:
( ) Gray—N/A
For LC data, the Assign Compound Annotations node determines the validity of the
annotations from the annotation sources that are selected in the Assign Annotations node
when the processing workflow includes these annotation sources. You can select up to six
compound identification and pathway mapping sources in the Assign Annotations node
The Annot. Source column indicates the match status for the selected compound from the
compound identification nodes and pathway mapping node in the processing workflow. The
expanded column heading displays the annotation sources.
Table 164. Match states from the annotation source nodes in a processing workflow
Possible states Meaning
( ) Green—Full Match The current formula and structure annotations match
the best available item from the particular source
(online database or local mass list).
Gray—No Results Retrieved no data from the particular source.
Orange—Not the Top Hit Current compound annotation matches one of the
hits, but not the top one.
Orange—Partial Match Only the formula for the current compound
annotation matches the items retrieved from the
particular source.
Orange—Unused Retrieved items from the particular source, but did not
assign any annotations.
Red—Invalid mass The best available item from the particular source has
a molecular weight that does not match the molecular
weight of the compound within the specified mass
tolerance.
Red—No match The particular source does not have an item that
matches the current annotations for the compound.
An analysis that includes the following workflow nodes generates a Peak Rating column in
each of the following result tables.
• The Group Compounds node generates the Peak Rating column in the Compounds
table.
• The Group Expected Compounds node generates the Peak Rating column in the
Expected Compounds table.
• The Differential Analysis node generates the (recalculated) Peak Rating column in the
GC CI Compounds table.
Orange 0 to 2.5
When you open a report template for editing, it opens as a tabbed page with a workspace area
on the left. You can use the standard report templates provided with the application or you
can create your own custom report templates.
Note The following reporting features are new in the Compound Discoverer 3.3
application:
• The name of the study and the file name of the result file (analysis result)—By
default, these items appear in the upper-middle of the page header section and
cover-page section of the report.
• mzCloud mirror plot—You can now add an mzCloud mirror plot to a report
template. See “Add mzCloud mirror plots to a report template.”
Reporting workflow
The following flowchart shows the reporting workflow (Figure 150 and Figure 151).
Yes Yes
From the Reporting menu, From the Reporting menu, From the Reporting menu,
choose Create Report choose Edit Report Template choose Create Report Template.
and select a report template. and select a report template.
The Customize Report dialog box
The report resolution page opens. The report template page opens. opens.
No
Does the template display
the data correctly?
Previous
page
Yes
For LC studies, the application comes with the nine defined report templates.
Table 167. LC report templates provided with the application (Sheet 1 of 2)
Report template name Description
Compounds No Graphs A4 Generates single row for each visible compound in the
Compounds table. Reports the structure, name, and
formula when available. Reports the calculated molecular
weight and group areas.
Compounds with Graphs A4 Generates a single page for each visible compound in the
Compounds table. Reports the structure, name, and
formula when available. Reports the calculated molecular
weight and group areas. The report includes a
chromatogram plot, an MS1mass spectrum plot, and an
MS2 spectrum plot.
Expected Compounds No Graphs A4 Generates a single row for each visible compound in the
Expected Compounds table. Reports the parent
compound, formula, calculated molecular weight,
retention time, dealkylations, transformations,
composition change, FISh coverage, and group areas.
Expected Compounds per File No Graphs A4 Generates a single row for each visible compound in the
Expected Compounds table. Reports the parent
compound, formula, calculated molecular weight,
retention time, ions and the m/z values of the ions,
composition change, chromatographic peak area, and
study file ID.
Expected Compounds per File with Graphs A4 Generates a separate page for each visible compound in
the Expected Compounds per File table. Includes the
following graphs: MS1 spectrum, chromatogram, and
MS2 spectrum.
Expected Compounds with Structures No Graphs A4 Generates a single row for each visible compound in the
Expected Compounds table. Reports the parent
compound, formula, calculated molecular weight,
composition change, retention time, FISh coverage, and
group areas. Reports the structure and name when
available.
1. Open a result file. See “Open, close, and update result files.”
In the application window, the reporting menu commands and the reporting toolbar
icons ( ) become available.
2. Determine which main table you want to include in the report and filter the data in this
table as appropriate. See “Filter the data for data reduction.”
For LC studies, you can use the report templates provided with the application with one
of these tables as described in the template name: Compounds table, Expected
Compounds table, or Expected Compounds per File table.
3. To select an existing report template, choose Reporting > Create Report from the menu
bar or click the Create Report icon, .
The Open Report Design Template dialog box opens to the Report Templates folder.
To select an appropriate report template, you must know what data the report template is
designed to resolve. Typically, a report template resolves the filtered data from one of the
main tables and one or more of the graphs associated with the table. A report template
can also resolve data from one or more related tables. The predefined report templates
resolve the data in the Compounds table or the Expected Compounds table.
4. Select the appropriate report template and click Open.
The report resolution page opens with the thumbnail pane on the right and a report
preview on the left.
The tab format for the report resolution page is as follows:
Report Template Name
As the application resolves the data with the report template, the following icon displays
the progress.
When the data is resolved, the progress icon disappears, and the application begins
rendering the report pages. The current page/estimated pages box lists the progress.
If the selected template does not contain ReportInfo items, the application displays the
pages as it renders them. If the selected template contains a ReportInfo item, the
application does not display the rendered pages until it has rendered all of the report
pages. ReportInfo items include the time stamp in the upper left and the page number at
the center bottom of the defined report templates.
If the report contains too many pages, the application cancels the report generation and
the following message box appears.
5. If the application cancels the report generation, repeat step 2 to step 4. This time, reduce
the number of reported items by modifying the filters.
6. Review the contents of the report.
7. On the report resolution page, click the Print icon, , in the toolbar to print the
report.
The Print dialog box opens.
8. Select the appropriate printer and the page range that you want to print.
The report templates that come with the application default to printing on A4 paper.
9. If you are not printing on A4 paper, change the printer setting.
10. Click OK to print the report.
Create a new report template by using the Customize Report dialog box
When you create a new report template by using the Customize Report dialog box, the
application automatically adds a date-and-time stamp on the left side of the report header, the
Compound Discoverer logo on the right side of the report header, and the page number in the
report footer. By default, the application displays information from the currently selected
main result table on A4 paper in the portrait orientation with the data columns listed left to
right. You can select a different main table, add related tables and graphs (chromatogram,
mass spectrum, mirror plot), and change the paper type, page orientation, and logo image.
1. Open a result file. See “Open, close, and update result files.”
2. Do one of the following:
• From the menu bar, choose Reporting > Create Report Template.
• In the toolbar, click the Create a New Report Template icon, .
The Customize Report dialog box opens in front of the New Report Template page.
Figure 152. Customize Report dialog box for the Compounds table
3. In the Reported Table list at the top of the dialog box, select the main table for the report
template.
A list of data items for the selected table appears. By default, the Columns list is
expanded, and the Graphs and Related Tables lists are collapsed.
4. To select the columns for the main table, any of the associated graphs, and any of the
main table’s related tables, do the following:
a. Under the selected table name, click each expand icon, , to open these sections:
• Columns
• Graphs
• Related Tables
b. In the expanded sections, select the check box for each column, graph, or related
table (and associated columns) that you want to include in the generated reports.
5. To add a separator line above the column heading row for each result table, select the
Draw Lines check box.
Tip For result tables with a large number of data columns, transpose the columns by
selecting the Transpose Data check box. Otherwise, you might need to resize the data
columns to fit the page.
Note Selecting another table clears the Transpose Data check box.
Each selected data column appears as a two-column row in the report template. The first
column displays the column heading and the second column displays the data from a
table row.
b. In the Indenting box, type the indentation value from 0.00 to 1.00 inch.
Figure 153 shows a report template for selected columns in the Compounds per File table and
two associated graphs. The default report template uses the Compound Discoverer
application icon. You can select a different image and resize its picture container as
appropriate.
Note The Customize Report dialog box automatically adds the file name and study name
fields to the page header section of the report template.
Figure 153. Report template with data from the Compounds per File table
Date-and-time stamp (automatically added) Logo image’s Default
picture logo image
Selected main table name container
Selected
columns
Selected
graphs
Note The Customize Report dialog box automatically adds the file name and study name
fields to the page header section of the report template.
Y To modify the current color scheme or to add or remove color schemes from the list
1. Open the Customize Report dialog box. See “Create new report templates.”
2. Select the main result table and the related tables for the report.
3. Click Modify.
Two color selection lists appear below the Color Scheme list. The list on the left changes
the background color for the column headings. The list on the right changes the
background color for the data columns.
4. From the color lists, select one or two background colors, and then click Add.
The application displays the effect of the color scheme in the data item list, adds the new
color scheme to the Color Scheme list, and makes the Remove button available.
Note Accepting the settings in the Customize Report dialog box adds the new color
scheme to the ColorScheme.xml file that is located in the following folder:
C:\Users\Public\Public Documents\Thermo\Compound Discoverer 3.3
\Common Templates\ReportTemplates
If you remove the new color scheme before you click OK at the bottom of the
Customize Report dialog box to accept the settings, the application does not add the
new color scheme to the ColorScheme.xml file.
• If you do not want to keep the new color scheme, click Remove.
The application undoes the color selections, leaves the color lists open, and removes
the color scheme from the Color Scheme list.
Default: Selected
Transpose Data Specifies the layout of the data in the result table columns.
The default layout (check box cleared) matches the result table
layout, with columns displayed from left to right and rows
displayed from top to bottom. Select this check box to transpose
the columns to rows.
Default: Cleared
Tip When you select a table item in the data item list, the
application automatically clears the Transpose Data check box.
For each table that you want to transpose, select the table name
and make sure that it is highlighted in blue. Then select the
Transpose Data check box.
Indenting Specifies the indentation of the selected related table data from the
left edge of the page, from 0.00 to 1.00 inch.
Each color scheme consists of two colors. The first color is the
background of the column headings. The second color is the
background of the table rows.
Default: Transparent/Transparent
Note Accepting the settings in the Customize Report dialog box
adds the new color schemes to the ColorScheme.xml file that is
stored in the same folder as the common report templates.
General Settings
PaperKind Specifies the size of the paper for printing the report. Select the
appropriate paper size before sending the report to the printer.
Default: A4
Orientation Specifies the orientation of the report, either Portrait or
Landscape.
Default: Portrait
Logo Image Specifies the logo image to appear by default in the upper-right
corner of each report page.
Applies the new color scheme to the selected table and adds the
new color scheme to the Color Scheme list.
Remove Removes the selected color scheme from the Color Scheme list.
Note You can open more than one report template page in the application window.
Tip For reference information about the toolbars, shortcut menus, and so on, see
“Reference information for the report template page.” For information about changing
the paper type and page orientation, see “Select the paper type, print width, page
orientation, and watermark for a report template.”
Figure 154 shows the Compounds No Graphs template on the report template page.
Figure 154. Report template page
1. Open a result file. See “Open, close, and update result files.”
2. From the application window, do one of the following:
a. Choose Reporting > Create Report Template from the menu bar, or click the
Create a New Report Template icon, , in the toolbar.
The Customize Report dialog box opens.
b. Make the appropriate selections and click OK.
The report template page opens as a tabbed document. The tab format is as follows:
Main Result Table Name
1. Open the report template that you want to edit. See “Open a report template for editing.”
The report template opens as a tabbed page in the application window. Typically, the
cover page section is closed.
2. (Optional) To view the items in the cover page section that you might want to edit, drag
down the sizing handle to the left of PageHeader. Then, edit the items as applicable.
Cover page
properties
b. In the properties pane, select True for Visible in the Behavior section.
1. Open the report template. See “Open a report template for editing.”
The report template opens as a tabbed page in the application window.
2. Select the logo image container.
The picture properties appear in the Properties pane.
Note The Compound Discoverer icon is the default logo for the common templates
and the templates that you create with the Customize Reports dialog box.
3. In the Data area, click the browse icon, , to the right of the Image property. You might
have to click the row to make the browse icon appear.
The Open dialog box opens with a setting of All image files for the file type.
4. Browse to the folder where you stored the logo of interest, select the logo, and click
Open.
The selected image appears in the container.
5. Modify the Layout properties as appropriate.
Change the format of the date and time field in a report template
The templates that come with the application include a DateTimeInfo field (date and time
stamp) in the Cover Page section and the Page Header section.
1. Open the report template. See “Open a report template for editing.”
The report template opens as a tabbed page in the application window.
2. Select the DateTimeInfo item.
The properties for this item appear in the Properties pane.
Figure 156. Output Format box for the Date Time Info field in the report templates
3. Under Appearance, click the OutputFormat box to make the browse icon appear. Then
click the browse icon.
The Output Format dialog box opens.
Figure 157. Output Format dialog box
1. Open the report template. See “Open a report template for editing.”
The report template opens as a tabbed page in the application window.
2. From the Section Reports pane to the right of the workspace area, drag an item to the
appropriate location on a workspace section of the page.
Some of the items appear as containers (boxes) where you can add text or images.
3. Select the item to open its properties below the Section Reports pane.
4. Edit the properties of the item as necessary.
1. Open a report template for editing. See “Open a report template for editing.”
2. From the Section Reports pane to the right of the workspace area, drag the RichTextBox
item to the appropriate location on a workspace section of the page.
3. Do one of the following:
• Type text in the rich text box.
• Click the Load File link below the properties pane to open the Open dialog box.
Then, select the file type (RTF, TXT, HTML, or HTM), browse to the file location,
select the file, and click Open. Figure 158 shows the properties for the rich text box
and the Load File link below the properties pane.
1. Open the report template. See “Open a report template for editing.”
The report template opens as a tabbed page in the application window.
2. Depending on the orientation of the table columns, do the following:
• If the result table columns are arranged horizontally from left to right, go to step 3.
• If the result table columns are arranged vertically from top to bottom, go to step 5.
3. To add a main table column to a column set that is arranged horizontally, do either of the
following:
• To place the new column to the right of the current column set, select the
PageHeader bar.
• To place the new column to the right of a specific column, select the column heading.
Y To add a data graph that is associated with the main table to the report template
1. Open the report template. See “Open a report template for editing.”
The report template opens as a tabbed page in the application window.
2. Do one of the following:
• Right-click the DetailSection_Main_Table_Name bar and choose the Add Field >
Data Graph of interest from the shortcut menu.
–or–
a. Click the DetailSection_Main_Table_Name bar.
b. In the toolbar, click the Add Items icon, , to open a list of data graphs and
related table column and graph selections.
The available items in the list include the data graphs associated with the current
main table and the related tables and graphs for the current main table.
c. Select an available data graph from the list.
The new data graph appears below the right of the last graph currently in the template.
Add the mirror plots for each mzCloud hit for a compound to a report
1. Add a sub report for the mzCloud results table to the details section as follows:
a. Click the DetailSection_Main_Table_Name bar to select it.
b. In the toolbar, click the Add Items icon, , and choose mzCloud Results > Name
(or any table columns that you want to report). See Figure 159.
Sub Report
2. Double-click SubReport_mzCloud_Results.
The DetailSection mzCloud_Results area appears below the bottom toolbar. See
Figure 160.
Figure 160. Detail section for the mzCloud Results table selections
Create a report template for the mzCloud Results table that includes a mirror plot
Y To create a report template for the mzCloud hits in the mzCloud Results table
1. From the application menu bar, choose Reporting > Create Report Template.
The Customize Report Dialog box opens.
2. In the Reported Table list, select mzCloud Results.
3. Under Columns select the check boxes for the table columns you want to include in the
report.
4. Expand Graphs and select the MS2 Spectrum check box.
Note The MS2 Spectrum under Graphs for the mzCloud Results table is a mirror
plot of the best MS2 spectrum for the compound and its best match spectrum from
the mzCloud mass spectral database.
5. Make the appropriate selections for the appearance, color scheme, paper kind,
orientation, and logo image.
For example, select the Draw Lines and Transpose Data check boxes. And, select
Landscape for the paper orientation. See Figure 161.
Figure 161. Customize Report box for the mzCloud Results table
Mirror Plot
6. Click OK.
The template appears on the report designer page. The default print width for the
landscape orientation on A4 paper is 10.993 inches. See Figure 162.
Figure 162. Report designer page with a report template for the mzCloud Results table
Figure 163. Modified report template for the mzCloud Results table
1. Open the report template. See “Open a report template for editing.”
The report template opens as a tabbed page in the application window.
Note The available items in the first list include the data graphs associated with
the current main table and the related tables for the current main table. When
you select a related table from the first list, the second list includes the available
columns for that table.
3. If the DetailSection is collapsed, click the expand icon to open the section.
Expand icon
The added table column appears at the bottom of the DetailSection (data area). The
application automatically adds a line above the column heading.
Figure 167. Expected Compounds per File table with the ΔMass [ppm] column from the related
Expected Features per File table
1. Open the report template. See “Open a report template for editing.”
The report template opens as a tabbed page in the application window.
2. To open the subreport editor area, do one of the following:
• Double-click the Subreport_Related_Table_Name box.
–or–
a. Select the Subreport_Related_Table_Name box.
The Edit Sub-Report icon, , becomes available.
b. Click the Edit Sub-Report icon.
The TextBox item for the related table column appears in a separate section. The
container for the item is sized to the full width of the page.
Figure 168. Report template page with the subreport section open
3. To change the properties of the TextBox item in the subreport section, select it.
The properties for the selected item appear in the properties pane to the right of the
workspace.
4. Make changes as necessary in the properties pane, or click the property dialog link below
the properties pane to open the TextBox dialog box and make similar changes.
5. To close the subreport section, click the Close Sub-Report icon in the report designer
toolbar.
1. Open the report template. See “Open a report template for editing.”
The report template opens as a tabbed page in the application window.
2. Resize the column heading container for the subreport column. Then, move it to an
appropriate location in the set of labels for the main table columns.
3. Resize the Subreport_Related_Table_Name container. Then, move it to the appropriate
location in the Detail Section.
4. Double-click the Subreport_Related_Table_Name container that you moved to the Detail
Section.
The DetailSection_Related_Table_Name appears below the bottom toolbar.
5. In the related table’s Detail Section, resize the containers for the data items from the
related table.
Figure 169 shows a report template for the Expected Compounds per File table with an
additional column from the related Expected Features per File table.
Figure 169. Related table column moved up to the main table
Column heading for the
related table column
1. Open the report template. See “Open a report template for editing.”
The report template opens as a tabbed page in the application window.
2. Select the item.
The properties for the selected item appear in the properties pane, at the bottom right of
the report template page. For information about the properties of each item, see
“Property settings for the sections and items in a report template.”
3. Do any of the following:
• To move the item to another position, do either of the following:
– Move the item by dragging it (or you can use the arrow keys on the keyboard).
– In the Properties pane, under Layout, expand Location, and then change the X
and Y values.
• To resize the container for the item, do either of the following:
– Resize the item by dragging the handle points of the container.
– In the Properties pane, under Layout, expand Size, and then change the Width
and Height values.
• To change other properties for the item, do either of the following:
– Modify the properties in the Properties pane.
– Click the Property Dialog link at the bottom right of the report template page,
and then modify the property in the item-specific dialog box.
Note For the RichTextBox item, in addition to the Property Dialog link, you can
click the Load File link to load text from a file. See “Add a rich text box to a report
template.”
1. Open the report template. See “Open a report template for editing.”
The report template opens as a tabbed page in the application window.
2. On the report template page, drag the PageBreak item from the Section Reports pane to
the bottom of the Detail Section of the report template.
When you select one of the paired sections to delete, the application removes both
sections. You cannot delete one section without deleting the other, and you cannot delete
the Detail Section section.
1. Open the report template. See “Open a report template for editing.”
The report template opens as a tabbed page in the application window.
2. Select one of the workspace sections by clicking the section header or by clicking within
the section area.
3. From the shortcut menu, choose Delete.
Tip Make sure to enlarge the workspace section enough to hold all of the items that you
want to add to that section of the template.
The size of a section on the report template page is not necessarily the same as its size in
the generated report.
1. Open the report template. See “Open a report template for editing.”
The report template page opens as a tabbed page in the application window.
2. Do any of the following:
• For the CoverPage, PageHeader, DetailSection, and PageFooter sections, do the
following
– In the sizing bar, drag the sizing handle.
– To vertically enlarge a workspace section, drag down the handle that is aligned
with the header of the subsequent section. To reduce a workspace section, drag
the handle up.
• To enlarge the Appendix section, drag it down by the bottom edge of the report
template page. To reduce this workspace section, drag the bottom edge up.
Add, align, and transpose columns in a report template by using the shortcut menu
commands
See the following table for information about using the shortcut menu for the report template
page to modify a report template.
Table 169. Using the shortcut menu commands on the report template page (Sheet 1 of 2)
Task Do the following
Open the shortcut Right-click the report template page.
menu for the report
template page.
Add an item to a Right-click the section bar of a section area and choose Add Field > Item of Interest.
section of the report
template.
Table 169. Using the shortcut menu commands on the report template page (Sheet 2 of 2)
Task Do the following
Align a column 1. In the PageHeader section, right-click the column heading (Label item) that you want
heading to its to align with its associated data field (TextBox item).
associated data field.
The Align Columns icon becomes available.
Align Columns icon
Figure 170. Format Border dialog box with the selection of a coral, double-line border
3. To set up the border, click one of the icons in the Presets area or select the line style in the
Line Styles area, and click the appropriate sides of the square in the Preview area.
Sizing
bar
By default, the workspace on the report template page has five sections. Table 170 lists these
sections, from top to bottom.
For more information about the report template page, see “Edit existing report templates.”
Table 171. Toolbars on the report template page (Sheet 1 of 4)
Icon Description
Top toolbar
Save Active Item—Saves the report template using the same file
name.
By default, the template file name is the same name as the main
table that you selected in the Customize Report dialog box. See
“Create new report templates.”
Save As—Saves the report template using a different file name.
Show Dots—Shows the main grid lines and the small dots within
the grid.
Show Lines—Shows the main grid and the smaller lines within the
grid.
Snap Lines—When you move an item on the page, blue alignment
lines appear. When the selected item (Item 1 below) is horizontally
aligned with another item, two vertical lines bracket the aligned
items. When the selected item is vertically aligned with another
item, two horizontal lines bracket the aligned items.
Item 2
Item 1 Item 3
To select multiple items, press the SHIFT key while you select the
items.
CrossSectionLine A line that can span across multiple workspace sections on the
report template page.
Note You cannot add this item to the Detail Section workspace
section. However, you can add it to another section (for
example, the Page Header section) and have it span across the
Detail Section section.
CrossSectionBox A box that can span across multiple workspace sections on the
report template page.
Note You cannot add this item to the Detail Section workspace
section. However, you can add it to another section (for
example, the Page Header section) and have it span across the
Detail Section section.
If it is available, click the expand icon, , to open the settings for a particular property or the
collapse icon, , to close the settings.
The properties pane contains these property groups, from top to bottom:
• Appearance properties
• Behavior properties
• Data properties
• Design properties
• Layout properties
• Miscellaneous properties
Tip To open a report template for editing, see “Open a report template for editing.”
For information about opening the properties dialog box for a report item, see “Open the
property dialog box for a workspace section or a specific report item.”
Appearance properties
Table 174 describes the Appearance properties for report template items.
Table 174. Appearance properties in the Properties pane (Sheet 1 of 3)
Property Description
AnchorBottom (For the Line item only) Specifies whether the line is anchored to
the bottom of the workspace section.
Selections:
• False—Does not anchor the line to the bottom of the
workspace section.
• True—Anchors the line to the bottom of the workspace
section.
Alignment (For the Label, TextBox, ReportInfo, and Barcode items) Specifies
the horizontal alignment of the text within the container.
Selections:
• False—Does not enable a text caption to be associated with
the bar code.
• True—Enables a text caption.
CaptionPosition (For the Barcode item only) Sets the position of the caption
container relative to the bar code symbol.
Selections:
• None—Hides the caption.
• Above—Sets the position above the symbol.
• Below—Sets the position below the symbol, as in the figure
above.
CharacterSpacing (For the Label and TextBox items) Specifies the spacing between
the characters in the text, in points.
Default: 0
Default: Normal
Font (For the Label, TextBox, and CheckBox items) Specifies the name
of the font and other font characteristics such as the style, size,
effects, and script. Clicking the browse icon, , opens the Font
dialog box where you can specify the font characteristics. Clicking
the expand icon, , expands the settings.
Font > Specifies the GDI character set to use. For a list of valid values,
GdiCharSet refer to the GdiCharSet Property in the Microsoft Developer
Network (MSDN) Library.
Font > Specifies that the font is derived from a GDI vertical font.
GdiVerticalFont
ForeColor (For the Label, TextBox, and CheckBox items) Specifies the font
color.
FormatString (For the ReportInfo item only) Specifies the format of the
generated content as a page number or a date-and-time string.
LineColor Specifies the color of a line or border.
LineSpacing Specifies the spacing between multiple lines of content, in points.
LineStyle Specifies the style of a line or border.
LineWeight Specifies the thickness of a line or border, in pixels.
NarrowBarWidth (For the Barcode item only) Specifies the width of the narrow bars
in the bar code (a value of 1.0 equals 0.864 points).
Tip At a thicker width for the narrow bars, the entire bar code
might be too large for the container. In this case, enlarge the size
of the container to see the entire bar code.
NWRatio (For the Barcode item only) Specifies the ratio of the width of the
wide bars relative to the width of the narrow bars in the bar code.
The larger the ratio, the thicker the wide bars appear.
Tip At a thicker width for the wide bars, the entire bar code
might be too large for the container. In this case, enlarge the size
of the container to see the entire bar code.
Behavior properties
Table 175 describes the Behavior properties for report template items.
Table 175. Behavior properties in the Properties pane (Sheet 1 of 3)
Behavior property Description
Angle Specifies the slope of the text within the container, in degrees.
AutoReplaceFields (For the RichTextBox item only) Specifies whether the data in the
container is automatically replaced with the data from the data
source as specified by the Data Field property selection.
• False—Does not automatically replace the fields of the object
with the fields in the data source that are assigned to the
current workspace section.
• True—Automatically replaces the fields of the object with the
fields in the data source that are assigned to the current
workspace section.
AutoSize (For the Barcode item only) Specifies whether the barcode item
stretches to fill its container.
Selections:
• False—Shows the check box selected.
• True—Shows the check box cleared.
CheckSumEnabled (For the Barcode item only) Specifies whether the application
computes and includes a checksum in the bar code.
Selections:
• False—Places the Appendix design items immediately after
the DetailSection and before the page footer information.
• True—Places the Appendix design items at the bottom of the
current page just above the page footer information.
RepeatToFill (For the Detail Section workspace section only)
Data properties
Table 176 describes the Data properties.
Table 176. Data properties in the Properties pane (Sheet 1 of 3)
Data property Description
ColumnCount (For the Detail Section workspace section only) Specifies the
number of columns in the report, similar to a newspaper layout.
Default: 1
When the template contains data columns that match the layout
of the main table (data columns displayed from left to right), use
the default value of 1 (ColumnCount = 1) for the number of
columns.
Tip Increasing the number of columns per page works with
transposed data columns (transposed from rows to columns).
CountNullValues (For the Textbox item only)
Default: Inherit
Default settings:
• CoverPage—After
• DetailSection—None
• Appendix—Before
CoverPage section:
• None—No page break between the cover page and the next
section.
• Before—No effect.
• After—Adds a page break between the cover page and the
next section.
• BeforeAfter—Adds a page break between the cover page and
the next section.
DetailSection:
• None—No page break between the table rows.
• Before—Adds a page break between each table row.
• After—Adds a page break between each table row.
• BeforeAfter—Adds a page break between each table row.
Appendix section:
• None—No page break.
• Before—Adds a page break before the Appendix section.
• After—No effect.
• BeforeAfter—Adds a page break before the Appendix section.
Tag Displays information associated with an object on the page.
IMPORTANT Do not modify or delete the Tag property.
Text For a Label item, you can type text here. For the column headings,
this box displays the column heading text. For the TextBox item,
this box displays the name of the data source (table column in the
result file).
Title (For the Picture item only) Not implemented.
Design properties
Table 177 describes the Design property.
Table 177. Design properties in the Properties pane
Design property Description
(Name) Displays the internal name of an object on the report template
page, used by the application to uniquely identify each individual
object.
Layout properties
Table 178 describes the Layout properties.
Table 178. Layout properties in the Properties pane (Sheet 1 of 2)
Layout property Description
End (For the CrossSectionLine and CrossSectionBox items only)
Specifies the X and Y coordinates of the end of the line or the
bottom-right corner of the box, based on the rulers at the top and
left side of the report template page.
Height (For a workspace section only) Specifies the height of the section,
based on the ruler to the left of the template workspace.
Location Specifies the X and Y coordinates of the upper-left corner of an
object, based on the rulers at the top and to the left of the
template workspace.
Padding Specifies the values in points for the space to the left, top, right,
and bottom of the textual content within the container.
Radius (For the CrossSectionBox item only) Specifies the percentage
value for the roundness of the corners of the box. The default
value of 0 creates corners with no rounding. A value of 100 creates
top and bottom sides that look like half circles.
RoundingRadius (For the Shape item only) Specifies the percentage value for the
roundness of the corners when you select RoundRect (rectangle
with rounded corners) for the Style property.
Size Specifies the width and height of an object on the page, in inches.
SizeMode (For the Picture item only) Specifies how the report designer sizes
the image to fit in the container.
Selections:
• Clip—Clips images that are larger than the container.
• Stretch—Stretches images to fit the container.
• Zoom—Decreases the image size to fit the container.
Miscellaneous properties
Table 179 describes the Miscellaneous properties.
Table 179. Miscellaneous properties in the Properties pane (Sheet 1 of 2)
Miscellaneous property Description
Code128 (For the Barcode item only) When you select Code_128_x for the
Style property in the Appearance area, use this property to define
the code.
Code49 (For the Barcode item only) When you select Code49 for the Style
property in the Appearance area, use this property to define the
code.
DataMatrix (For the Barcode item only) When you select the DataMatrix
option for the Style property in the Appearance area, use this
property to define the code.
PDF417 (For the Barcode item only) When you select Pdf417 for the Style
property in the Appearance area, use this property to define the
code.
Open the property dialog box for a workspace section or a specific report item
The reporting feature for the application includes a property dialog box for each item in the
Section Reports pane. Using this dialog box, you can modify the formatting parameters for a
selected item on the report template page. The available properties vary depending on the
selected workspace section or item. Most of these parameters are similar to the properties
listed in the properties pane, although some have slightly different names.
2. At the bottom-right corner of the report template page, click the Property Dialog link.
The property dialog box for the selected object opens (Figure 174).
Figure 174. Property dialog box for the Label item
Note For the RichTextBox item, in addition to the Property Dialog link, you can click the
Load File link to load text from a file into the box. See “Add a rich text box to a report
template.”
The parameters in the property dialog boxes have equivalent parameters in the properties
pane.
Select the paper type, print width, page orientation, and watermark
for a report template
On the report template page, you can add a watermark, change the print width and the
orientation of the report template (portrait or landscape), and select the type of paper that you
want the printer to use.
1. Open a report template for editing. See “Open a report template for editing.”
2. Click the unfilled square in the upper-left corner (above the vertical sizing bar) of the
report template page.
Figure 175. Square in upper-left corner of the report template grid
Opens the report template properties in the Properties pane
The watermark properties appear in the Appearance group, and the orientation and paper
size properties appear in the Miscellaneous group.
Figure 176. Report template properties (in the Properties pane)
1. Open the property settings for the report template. See “Open the property settings for a
report template.”
2. Under Appearance, click the browse icon, , to the far right of the Watermark property.
The Open dialog box opens with the file type setting for all image files.
3. Find and select the image file of interest and click Open.
4. In the WatermarkAlignment list, select the appropriate alignment.
5. In the WatermarkPrintOnPages box, type the page number of the page where you want
the watermark to appear.
6. In the WatermarkSizeMode list, select whether you want to clip the image if it is larger
than the container, stretch the image to fit the container, or reduce the image size to fit
the container.
1. Open the property settings for the report template. See “Open the property settings for a
report template.”
2. Under Behavior, type a numeric value in the PrintWidth box.
Tip When you change the page layout from portrait (IsLandscape = False) to landscape
(IsLandscape = True) or landscape to portrait, the numeric value in the PrintWidth box
automatically updates, but the grid ruler does not update.
To update the grid ruler, type the print width in the PrintWidth box.
1. Open the property settings for the report template. See “Open the property settings for a
report template.”
2. Under Misc, select True for Landscape or False for Portrait in the IsLandscape list.
1. Open the property settings for the report template. See “Open the property settings for a
report template.”
2. Under Misc, select the appropriate paper size from the PaperKind list.
To open the list to the paper size of interest, type the first letter of the paper-size name.
For example, if you want to change the size from A4 to Letter, type an L in the box. If the
paper size does not appear in the box, continue typing the next letter of its name. The
application cycles through the paper sizes that begin with this letter or letters.
You can print all of the report pages from the report resolution page or the Report Print dialog
box. You can print up to five pages from the Report Preview dialog box.
1. Open a report template for editing. See “Open a report template for editing.”
2. In the toolbar on the report template page, click the Preview Report icon, .
The application opens the Report Preview dialog box and resolves the current result file
with the current template on the report template page. The Page Thumbnails pane
displays up to five resolved pages.
A red line in the right page margin indicates that the print width does not fit on the
selected page size, and the printer will print an empty page with every report page it
prints, unless you resize the template. See “Change the print width of a report template.”
Preview a report for all the visible rows in the main result table
To preview a report for all the visible rows in the selected main result table, use the Report
Preview dialog box.
1. On the report template page, resolve the report by clicking the Print Report icon, , or
the Preview Report icon, .
2. In the toolbar of the Report Print dialog box or the Report Preview dialog box, click the
Find icon, .
1. Open the report resolution page or the Report Print dialog box.
2. In the toolbar, choose Export > File Type, where the File Type is one of the following:
• Text—text file
• PDF—portable document format file
• RTF—rich text format file
• Excel—Microsoft spreadsheet
Print a report
After you review the items in your Compound Discoverer result file and reduce the number of
table rows to report to a reasonable number, open an existing report template or create a new
report template, resolve the template, and then print a report.
1. Open the result file of interest. See “Open, close, and update result files.”
2. Review and filter the data. See “Filter the data for data reduction.”
3. Do one of the following:
a. From the menu bar, choose Reporting > Create Report.
The Open Report Design Template dialog box opens.
b. Select an appropriate template and click Open.
The report resolution page opens.
–or–
• From the report template page, open the Report Print dialog box by clicking the
Print Report icon, .
4. In the toolbar, click the Print icon, .
The Print dialog box opens.
5. Select the appropriate printer and the page range that you want to print.
6. Click OK to print the report.
Table 181 describes the icons in the Page Thumbnails pane, from top left to bottom right.
Table 181. Page Thumbnails pane icons
Icon Description
Enlarge—Enlarges the size of the thumbnails.
Search Results—Displays the Search Results pane where you can search for
a particular word or phrase in the report. The found instances appear in
the list of results.
Not all the task buttons are available for every view. The Edit button is unavailable for the
Spectral Libraries view, and the Replace button is visible only for the Mass Lists, Spectral
Libraries, and Metabolika Pathways views.
Each view in the Lists & Libraries Manager page includes a table. For information about
freezing panes, hiding and showing columns, freezing rows, sorting, and filtering the
application tables, see “Common operations for manipulating data tables.”
Tip To open the Expected Compounds view, choose Lists & Libraries > Expected
Compounds from the application menu bar.
In a targeted analysis, the Generate Expected Compounds node generates a list of expected
compounds by using the parent compounds and transformations that you specify. The Find
Expected Compounds node then searches for these compounds in the raw data files that you
submit for analysis.
The initial expected compounds list contains omeprazole, the targeted compound used in the
Compound Discoverer Metabolism Tutorial.
Note To run a targeted processing workflow that includes the Generate Expected
Compounds node, the Create FISh Trace node, or both of these nodes, you must first add
the compounds of interest to the Expected Compounds list.
Figure 178. Expected Compounds view on the Lists & Libraries page
Button bar Filters row
For information about generating an Xcalibur inclusion list, see “Generate an Xcalibur
inclusion list.”
Use this row to filter the table as appropriate. See “Filter the tables on a study page or a list
or library view.”
Columns
Name Displays the user-specified compound name.
Description Displays the user-specified description.
Elemental Displays the elemental composition that the application
Composition determines from the compound’s structure.
Molecular Weight Displays the molecular weight that the application calculates from
the compound’s elemental composition.
Structure Displays the structure created by using the drawing tools or by
importing a structure file.
Note Similar Compound Editor dialog boxes open from the Mass List and Metabolika
Pathways editors. For information about using the structure drawing tools, see “Structure
drawing tools and commands.”
1. Open the Expected Compounds view on the Library & Lists page.
2. Do one of the following:
• To edit the definition of a compound, double-click the entry or select the entry and
click Edit.
The Compound Editor dialog box opens with information for the selected entry.
• To add a compound to the Expected Compounds list, click New.
The Compound Editor dialog box opens. As indicated by the red borders you must
name the compound and enter its elemental composition to save it.
Drawing area
3. To enter or edit the elemental composition of the compound, do any of the following:
• Draw or edit the structure in the drawing area. See “Structure drawing tools and
commands.”
• Paste an InChi or MOL string from the Clipboard.
• Open a structure file. See “Load a structure from a structure file.”
• Click ChemSpider to find the structure. See “Find a structure in the ChemSpider
database.”
The application automatically populates the Elemental Composition and Molecular
Weight boxes with read-only values. To change the elemental composition and the
molecular weight, you must modify the chemical structure in the drawing area.
4. In the Name box, type the name of the compound.
5. (Optional) In the Description box, type a description for the compound.
6. Click Save.
The dialog box closes and the new compound or the edited information for the existing
compound appears in the expected compounds list.
Note The application creates the inclusion list from the parent compounds (compounds
in the Expected Compounds list), transformations, and adduct ions that you select. A
parent compound is the beginning compound in a set of metabolic reactions.
Tip For instructions on how to export inclusion and exclusion lists from the result tables
of a result file, see “Export an Xcalibur inclusion or exclusion list from a compounds
table.”
1. From the application menu bar, choose Lists & Libraries > Expected Compounds.
2. If the parent compounds of interest are not in the list, add them. See “Add and edit
expected compounds with the Compound Editor.”
3. Select the parent compounds that you want to add to the Xcalibur inclusion list.
4. In the command bar of the Expected Compounds view, click Generate Inclusion List.
The Generate Xcalibur Inclusion List dialog box opens.
Figure 180. Generate Xcalibur Inclusion List dialog box
5. Browse to the folder where you want to store the TXT file that this dialog box generates,
and name the file.
6. Do one of the following:
• Click Load Settings, select the inclusion settings file that includes all the parameter
settings for generating the inclusion list, and then click Open.
• Specify all the reactions that you want to use to generate the inclusion list, and select
the adduct ions to generate.
Tip If you expect to use these parameter settings for generating other inclusion
lists, click Save Settings and save the settings to an inclusion settings file
(.inclSet).
7. Click Generate.
8. At the prompt, verify the directory and file name for the inclusion list file, and then click
OK.
The application generates a list of adduct ions by using the user-specified parent
compounds, and transformations, and ions. It exports the generated TXT file to the
user-specified folder.
The TXT file includes two columns. The first columns lists the formulas and the second
column lists the m/z values of the ions that you want the mass spectrometer to include in
a targeted DDA experiment. You can import the information in this file into the
instrument method for your LC/MS system.
Figure 181. Xcalibur inclusion list for caffeine and its metabolites
Note The Generate Xcalibur Inclusion List dialog box retains the new parameter settings
until you change them.
Adducts view
Use the Adducts view of the Lists & Libraries page to define the expected adducts in your
samples. Adducts are part of the ion definition–that is, to create a new ion definition that uses
an adduct that is not currently in the adducts list, you must first add the adduct to the adducts
list.
Tip To open the Adducts view, choose Lists & Libraries > Adducts from the application
menu bar.
The adducts list provided with the application contains 20 adducts. You can modify this list.
Figure 182. Adducts view
Adducts list
Use this row to filter the table as appropriate. See “Filter the tables on a study page or a list
or library view.”
Table columns
Name Displays the specified name of the adduct.
Adduct Mass Displays the mass that the application calculates from the specified
elemental composition.
Elemental Displays the specified elemental composition of the adduct.
Composition
Charge Displays the specified charge of the adduct.
1. From the application menu, choose Lists & Libraries > Adducts.
2. In the Adducts view, do one of the following:
• To edit the definition of an adduct, double-click the entry or select the entry and
click Edit.
The Compound Editor dialog box opens with information for the selected entry.
• To add an adduct to the list, click New.
The Adduct Editor dialog box opens. As indicated by the red borders, you must name
the adduct and enter its elemental composition to save it.
The application uses the entries in the Ion Definitions list in the following workflow nodes:
Detect Compounds and Generate Expected Compounds.
Tip To open the Ion Definitions view, choose Lists & Libraries > Ion Definitions from
the application menu bar.
Table 187 lists the common adducts and dimers for the positive ionization mode.
Table 187. Common adducts and dimers in the positive ionization mode
Adducts Adducts
Ion definition Charge Ion definition Charge
total mass total mass
M+H–H2O –17.00329 1 M+H+Na 23.9965 2
M+H–NH3 –16.01927 1 M+H+MeOH 33.03349 1
M+H 1.00728 1 M+K 38.96316 1
2M+H 1.00728 1 2M+K 38.96316 1
M+2H 2.01455 2 M+H+K 39.97044 2
M+3H 3.02183 3 M+H+ACN 42.03383 1
M+NH4 18.03383 1 2M+H+ACN 42.03383 1
2M+NH4 18.03383 1 M+2H+ACN 43.0411 2
M+H+NH4 19.0411 2 M+Na+ACN 64.01577 1
M+Na 22.98922 1 2M+Na+ACN 64.01577 1
2M+Na 22.98922 1 M+H+DMSO 79.02121 1
Table 188 lists the common adducts and dimers for the negative polarity mode.
Table 188. Common adducts and dimers in the negative ion mode
Adducts Adducts
Ion definition Charge Ion definition Charge
total mass total mass
M–H –1.00728 –1 M–H+FA 44.9982 –1
2M–H –1.00728 –1 2M–H+FA 44.9982 –1
M–2H –2.01455 –2 2M–H+HAc 59.01385 –1
M–H–H2O –19.01784 –1 M–H+HAc 59.01385 –1
M+Cl 34.9694 –1 M–H+TFA 112.98559 –1
M–2H+K 36.94861 –1
1. From the application menu bar, choose Lists & Libraries > Ion Definitions.
2. Select the ion definition and click Delete.
3. At the prompt, click Yes.
1. From the application menu bar, choose Lists & Libraries > Ion Definitions.
2. Click Import.
3. Locate the file and click Open.
A message opens with a tally of the number of imported ion definitions versus the
number of skipped ion definitions. The application only imports new entries; it skips
entries that are already in the list.
4. At the prompt, click OK.
1. From the application menu bar, choose Lists & Libraries > Ion Definitions.
2. Click Export All.
3. Select a folder, name the file, and click Save.
The application automatically adds the file name extension (.xml).
4. At the prompt, click OK.
1. From the application menu bar, choose Lists & Libraries > Ion Definitions.
2. Do one of the following:
• To add a new ion definition, click New.
The Ion Definition Editor dialog box opens. The Ion Definition box contains only
an M for the uncharged molecule. Because the ion definition must include at least
one additional component, a charge, or both, the box has a red border, and the Save
button is unavailable.
Note The Detect Compounds node uses the weight factor value for the ion
definitions. With the exception of the protonated molecule [M+H]+ in the positive
polarity mode and the deprotonated molecule [M–H]– in the negative polarity mode,
if you set the weight factor to 0, the Detect Compounds node does not look for the
specified adduct in the mass spectrum.
Table 189 describes the parameters in the Ion Definition Editor dialog box.
As you edit the ion definition by using the component list, the application
automatically updates the ion definition.
Weight Factor Specifies the weighting factor for the ion definition.
Available Use this list to create custom ion definitions.
Adducts list
Transformations view
The Transformations view contains a table of possible transformations. The Generate
Expected Compounds node uses a selection of entries from this table and the information in
the expected compounds list to generate a table of expected transformations for a known
(parent) compound.
With the addition of the PFAS Chain Shortening transformation in the Compound
Discoverer SP2 release, the default Transformations list now contains 35 transformations.
Tip To open the Transformations view, from the application menu bar, choose Lists &
Libraries > Transformations.
For details using the Transformation Editor to add or edit transformations, see “Add or edit
transformations with the Transformation Editor.”
Delete a transformation
Y To delete an entry from the transformations list
1. From the menu bar, choose Lists & Libraries > Transformations.
2. Do one of the following:
• To add a new transformation, go to step 3.
• To edit an existing transformation, go to step 4.
3. To add a transformation to the transformations list, do the following:
a. Click New.
The Transformation Editor dialog box opens. For details about the editor, see
“Transformation Editor dialog box.”
The empty Name, Arriving Group, and Leaving Group boxes have a red outline. You
must enter information in the Name box and in at least one of the group boxes.
Default: 1; range: 1 to 10
Tip To open the Neutral Loss Editor, click New in the Neutral Loss view or select an item
in the neutral loss list and click Edit.
Figure 190. Neutral Loss Editor
Tip To open the Mass Lists view, choose Lists & Libraries > Mass Lists from the
application menu bar.
Tip If you uninstall a previous version of Compound Discoverer 3.3 from your data
processing computer before you install the latest service pack, your mass list will
include the obsolete version of the Extractables and Leachables HRAM Compound
Database and the current 2023 update of this mass list. To delete the obsolete mass
list, select it and click Delete.
• FCCDB_2022
For more information, go to https://doi.org/10.5281/zenodo.3240108.
• GC Orbitrap Contaminants Library Compound Database
• GC Orbitrap Flavor and Fragrances Compound Database
• GC Orbitrap Metabolomics Library Compound Database
• LipidMaps Structure Database 2021-09-13
For more information, go to http://ncbi.nlm.nih.gov/pubmed/17098933.
• Natural Products Atlas 2021_08
• PFAS_NEG
• PFAS_NIST
For more information, go to https://data.nist.gov/od/id/mds2-2387.
Note The Example Mass List file, which contains four amino acids, is specifically
designed for the metabolomics tutorial for LC studies that you can access from the Help
menu.
Note The Export > As Plain Text shortcut menu command saves the data in the
result table to a CSV file (ChemSpider Results.csv or Compounds.csv).
– CSV files created by exporting data from the Thermo Scientific ToxID application
For information about creating, editing, importing, exporting, and replacing mass list files, see
these topics:
• Flowchart for creating and editing mass lists
• Delete or replace mass list files
• Import a mass list from a CSV file
• Import a mass list from a massList file, an XML file, or an SDF file
• Export a mass list file
• Mass Lists view parameters
Figure 192. Flowchart for creating, importing, and editing mass lists within the application
To open the Mass Lists view, from
the application menu bar, choose
Lists & Libraries > Mass Lists.
Edit an
Create a new
No existing Yes
mass list?
mass list?
Yes No
In the Mass Lists view,
select a mass list and
click Edit.
Import a mass
list?
Yes
In the Mass Lists view, click New.
In the Mass Lists view, In the Mass Lists view, In the Mass Lists view,
In the Create New Mass List click Import and select a click Import and select click Import and select
dialog box, name the mass list and CSV file. an SDF file or an XML a mass list file.
click Create. file.
Then, define the essential The mass list appears
columns in the Define CSV as a new item in the
File Format dialog box and Mass Lists view.
click OK.
Do either of the following in the Lists & Libraries > Mass Lists view:
• To delete mass list files, select the entries to delete and click Delete. Then, at the
prompt, click Yes.
• To replace a mass list file, select the entry and click Replace. Then, browse to the
appropriate folder, select a massList file, and click Open.
Tip You can create CSV files by exporting compound entries as plain text from the
following result tables: Compounds, GC EI Compounds, or GC PCI Compounds.
1. In the Lists & Libraries > Mass Lists view, click Import.
2. Locate the CSV file of interest and click Open.
The Define CSV Format dialog box opens.
Figure 193. Define DSV File Format for ‘Filename.csv’ dialog box
Note When the column header in the CSV file matches the name of an essential
column, the application recognizes it in the Essential Columns area. A red border
around a column name indicates that you must define the column.
The OK button remains unavailable until you define the Name column and at least
one column that provides the compound’s mass.
Note Double (for decimal numbers) is the value type for the Molecular Weight
column.
• To define the column in the CSV file that provides the chromatographic retention
times of the compounds, select the column name from the Retention Time list.
4. Click OK.
The Edit ‘File Name.massList (# Compounds)’ dialog box opens.
5. To edit the mass list, see “Create and edit mass list files.”
Import a mass list from a massList file, an XML file, or an SDF file
You can import mass lists from CSV, massList, SDF, or XML files.
Y To import a mass list from a massList file, an XML file, or an SDF file
1. In the Libraries & Lists > Mass Lists view, click Import.
2. Locate the massList file, XML file, or SDF file and click Open.
The Edit ‘File Name.massList (# Compounds)’ dialog box opens.
3. To edit the mass list, see “Create and edit mass list files.”
1. In the Libraries & Lists > Mass Lists view, select the file.
2. Click Export.
3. Select a folder for the file, rename the file if applicable, and click Save.
When you select a CSV file, the Define CSV File Format ‘File
Name.csv’ dialog box opens.
Export Opens the Save As dialog box for renaming and saving the selected
mass list file to another folder.
Replace Replaces the selected file with the replacement file. Use this
command when the replacement file has the same name as the
current file.
If you remove a mass list from the ServerFiles folder or edit a mass
list in the ServerFiles Folder, and then restart the application, the
mass list’s state changes to Missing or Corrupted, respectively.
Y To create a new mass list from scratch or edit the compounds in an existing or
imported mass list
1. From the application menu bar, choose Lists & Libraries > Mass Lists.
2. Do one of the following:
• To create a new mass list from scratch, click New. Then, in the Create New Mass List
dialog box, name the file and click Create.
The Edit File Name (No Compounds) dialog box opens with an empty mass list and
the new file name appears in the table in the Mass Lists view.
• To import a mass list from an external .massList file or SDF file, click Import, select
the file, and click Open.
The Edit ‘File Name (# Compounds)’ dialog box opens with editable entries for the
selected mass list or an empty mass list.
• To edit an existing mass list file, select it and click Edit. Or, double-click it.
The Edit ‘File Name (# Compounds)’ dialog box opens with editable entries for the
selected mass list or an empty mass list.
3. To modify the opened mass list, perform any of the tasks in Table 196.
Note You can add, rename, and delete nonessential columns. If you add a column,
you cannot delete it until you save your changes and reopen the mass list for editing.
Table 196. Editing a mass list that is opened in the Edit ‘File Name’ dialog box (Sheet 1 of 2)
Task Procedure
Add a compound. Click New, and follow the instructions in “Add and edit mass
list compounds with the Compound Editor.”
Delete a compound. 1. Select the compound and click Delete.
2. At the prompt, click Yes.
Import compounds Click Import, locate the XML file, and click Open.
from an XML file.
Import compounds Click Import, locate the SDF file, and click Open.
from an SDF file.
Import compounds 1. Click Import, locate the CSV file, and click Open.
from a CSV file.
2. Define the essential columns as described in “Import a
mass list from a CSV file.”
3. Click OK.
Add a nonessential 1. Click Add Column.
column.
2. In the Add Column dialog box, do the following:
a. Name the column.
b. (Optional) Select the data type:
• String: Alphanumeric and special characters
• Double: Decimal numbers
• Integer: Integers
c. Click Add.
Table 196. Editing a mass list that is opened in the Edit ‘File Name’ dialog box (Sheet 2 of 2)
Task Procedure
Rename a nonessential 1. Click Rename Column.
column.
2. In the Old Name list, select the column.
3. In the New Name box, type the new name.
4. Click Rename.
Delete a nonessential 1. Click Delete Column.
column.
2. In the Column list, select the column.
3. Click Delete.
Table 197 describes the buttons at the top of the Edit ‘File Name.massList’ (#Compounds)
dialog box.
Table 197. Edit ‘File Name.massList’ (# Compounds) dialog box parameters
Feature Description
Buttons
New Opens the Compound Editor dialog box for defining a new
compound.
Edit Opens the Compound Editor dialog box for editing the selected
compound in the mass list.
When you select a CSV file, the Define CSV File Format ‘File
Name.csv’ dialog box opens.
Add Column Opens the Add Column dialog box for naming and defining an
additional table column.
Rename Column Opens the Rename Column dialog box for renaming a
nonessential column.
Delete Column Opens the Delete Column dialog box for deleting a nonessential
column.
Add and edit mass list compounds with the Compound Editor
Use the Compound Editor dialog box to add entries to and edit entries in a mass list.
Note The application includes three similar Compound Editor dialog boxes. All three
dialog boxes include the same structure drawing tools, but their data entry fields differ.
The Compound Editor dialog box that opens for editing mass list compounds includes
the following data entry fields: Molecular Weight, Name, Formula, and RT [min].
1. From the Libraries & Lists > Mass Lists view, open the mass list for editing. See “Create
and edit mass list files.”
2. In the Edit ‘File Name (# Compounds)’ dialog box, do one of the following:
• To add a compound, click New.
• To edit a compound, double-click it (or select it and click Edit).
The Compound Editor dialog box opens (Figure 195). The dialog box is unpopulated for
a new compound. The minimum required information is the compound’s molecular
weight.
Figure 195. Compound Editor dialog box for editing mass list compounds
3. To change the structure in or add a structure to the drawing area, use the structure tools,
load a structure file, or click ChemSpider to find a structure file.
For details, see these topics:
• Structure drawing tools and commands
• Load a structure from a structure file
• Find a structure in the ChemSpider database
Note If you are editing the structure for an existing entry in the mass list, the
Molecular Weight field is unavailable, and any formula or structure that you enter
must match the displayed molecular weight.
The chemical structure appears in the drawing pane and its molecular weight and formula
appear in their respective fields.
4. (Optional) In the Name box, type or edit the name of the compound.
5. (Optional) In the RT [min] box, enter a chromatographic retention time.
The application rounds the retention time to three decimal places.
6. Click Save.
The Edit ‘File Name (# Compounds)’ dialog box appears.
7. Review the new or edited compound. If necessary, use the filter row to display only the
new entry. See “Filter the tables on a study page or a list or library view.”
8. To save the changes to the mass list that you are editing, click Save.
Note If you attempt to close the dialog box without saving the changes, an Unsaved
Changes prompt appears.
You can create your own custom mzVault libraries by using the mzVault 2.3 SP1 application
or by exporting the spectral information from a Compound Discoverer result file from an LC
study. See “Export spectral data to a new or existing mzVault library.”
Tip The Search mzVault node does not require the mzVault application, so installing the
mzVault 2.3 software is optional. However, Thermo Fisher Scientific recommends
installing the software to do any of the following:
• Edit existing spectral libraries.
• Create new spectral libraries with curated spectra.
• Convert existing legacy mzVault spectral libraries.
You can create new mzVault libraries with the Compound Discoverer 3.3 application, but
you are limited to exporting spectra from these result tables: Compounds and Expected
Compounds.
Note You cannot edit spectral libraries in the Compound Discoverer application. The
Edit button in the command bar is unavailable.
Use the Metabolika Pathways view to create new pathways, edit existing pathways, import and
export pathways, and replace pathways.
Tip To open the Metabolika Pathways view, from the application menu bar, choose Lists
& Libraries > Metabolika Pathways.
For information about creating and editing Metabolika pathways, see “Edit new and existing
Metabolika pathways.”
1. From the application menu bar, choose Lists & Libraries > Metabolika Pathways.
1. In the Metabolika Pathways view, click New.
The Create New Metabolika Pathway dialog box opens.
2. Name the file and click Create.
The Edit Metabolika Pathway ‘File Name.metabolika’ dialog box opens.
1. In the Metabolika Pathways view, select the pathway and click Edit.
The Metabolika pathway editor opens and displays the selected pathway or the beginning
section of the pathway in the left pane and the entire pathway at a lower zoom level in the
right pane. Use the tools in the upper right of the left pane to change the zoom level, and
the selection window in the right pane to display a different section of the pathway in the
left pane.
2. To modify the pathway, do any of the following:
• To edit a structure, right-click it and choose Edit Structure.
The Compound Editor opens.
• To add a structure, right-click it anywhere in the left pane and choose Add Structure.
The Compound Editor opens.
• To edit the arrows, see “Modify the arrows in a Metabolika pathway.”
3. To undo a change, use the CTRL+Z keys.
You can undo up to six changes.
1. Open the Compound Editor for editing and adding Metabolika pathway structures by
doing one of the following:
• Right-click the workspace in the Edit Metabolika Pathway ‘pathway name’ dialog box
and choose Add Structure.
Figure 198. Edit Metabolika Pathway ‘pathway name’ dialog box with shortcut menu
Figure 199. Compound Editor dialog box for editing Metabolika pathway structures
Drawing area
When the processing workflow includes the Compound Class Scoring node with one or more
user-specified compound class lists of fragments, the application determines the probability
that an unknown compound belongs to the compound classes by comparing the ions detected
in the fragmentation scans to the fragments in the compound class lists.
Tip To open the Compound Classes view, choose Lists & Libraries > Compound
Classes from the application menu bar.
Note For information about identifying compounds in the PFAS family, see “PFAS
identification.”
Use the Compound Classes view, the Edit ‘Named Compound Class’ dialog box, and the
Fragmentation Editor to create your own compound class files.
Figure 200. Compound Classes view
The process for editing the fragment structures in a compound class file is as follows:
1. From the application menu bar, choose Lists & Libraries > Compound Classes.
The Compound Classes view opens.
2. In the Compound Classes view, double-click the file that you want to edit.
The Edit Compound Class ‘File name’ dialog box opens.
3. In the Edit Compound Class dialog box, double-click the fragment that you want to edit.
The Fragment Editor opens.
4. Edit the fragment and click Save. Then, click Save to save the compound class file.
For more information about the Compound Classes view, see the following topics:
• Compound Classes view parameters
• Add new files to the Compound Classes library
• Delete, import, and export compound class files
• Edit compound class files
• Edit Compound Class dialog box parameters
If you remove or edit a compound class file from its storage folder,
and then restart the application, the mass list’s state changes to
Missing or Corrupted, respectively.
1. Open the Compound Classes view by choosing List & Libraries > Compound Classes
from the application menu bar.
2. In the Compound Classes view, click New.
The Create Compound Class dialog box opens.
3. Name the compound class, optionally provide a brief description, and click Create.
The named compound class appears as a new row in the Compound Classes view and the
Edit Compound Class ‘Name’ dialog box opens. For information about this dialog box,
see “Edit Compound Class dialog box parameters.”
Figure 202. Edit Compound Class ‘Name’ dialog box
1. In the Compound Classes view of the List & Libraries page, double-click the compound
class file that you want to edit. Or, select the file and click Edit.
The Edit Compound Class ‘File name’ dialog box opens with a list of fragments.
Figure 203. Edit Compound Class dialog box for the Omeprazole Compound Class file
2. In the Edit Compound Class ‘File name’ dialog box, do the following as applicable:
• To add a new fragment by using the Fragment Editor, click New. Then, use the
Fragment Editor to define the fragment (see Use the Fragment Editor to define
fragment ions).
• To add new fragments by importing their structures from a CSV file or an SDF file,
click Import.
• To edit a fragment, select it and click Edit. Then, use the use the Fragment Editor to
edit the fragment (see Use the Fragment Editor to define fragment ions).
• To delete a fragment, select it and click Delete.
3. Click Save to save your edits and return to the library.
For information about using the compound class editor, see these topics:
• Add new files to the Compound Classes library
• Edit compound class files
Note For information about using the drawing tools, see “Structure drawing tools and
commands.”
1. Open the Compound Classes view from the application menu by choosing Lists &
Libraries > Compound Classes.
2. In the Compound Classes view, do one of the following:
• To edit the fragments list in a existing compound class file, select a compound class
file and click Edit. Then, in the Edit Compound Class ‘name’ (# fragments) dialog
box, select a fragment and click Edit.
• To create a new compound class file, click New. In the Create Compound Class
dialog box, name the library, type a description, and click OK. Then, in the Edit
‘Named Compound Class’ (No Fragments) dialog box, click New.
The Fragment Editor dialog box opens.
Opens a structure.
Drawing
area
The application
populates these
boxes as you draw
the structure.
Figure 205. Atom Properties dialog box with an available Charge check box
c. In the Atom Properties dialog box, select the Charge check box.
d. Select the positive (+) or negative (–) option.
e. Click OK.
1. In the editor’s toolbar, click the Load Structure from Disk button, .
The Open Structure dialog box opens.
2. In the Known Structure Formats list, select the format of the structure file: MOL Format
(.mol), Compressed Structure (.mcs), or Template (.tml).
3. Locate the structure file and click Open.
The chemical structure appears in the drawing pane, and the application automatically
populates the Elemental Composition and Molecular Weight boxes.
If the structure is not visible or it is only partially visible in the pane, right-click the pane and
choose Select All. Then, while pressing the SHIFT key, drag the structure into the pane.
Note For an LC study, you can also access the ChemSpider database from the mzLogic
Analysis view.
Tip For information about the editors, see “Add and edit expected compounds with the
Compound Editor.” For information about the mzLogic Analysis view, see “mzLogic
Analysis view.”
Y To load a structure into the drawing area of the editor or mzLogic Analysis view
1. Click ChemSpider.
The ChemSpider Search dialog box opens.
2. In the Input box, enter a name, formula, molecular weight, or CSID.
3. Click Search.
4. Select a compound from the search results.
5. Click Select.
The chemical structure appears in the drawing pane and the molecular weight, name, and
formula of the compound appear in their respective fields.
Use the toolbar for these structure editors and the shortcut menu for the drawing area to draw,
manipulate, and save structures as described in the following topics:
• Toolbar for the structure editors
• Shortcut menu commands for the drawing area of the structure editors
• Use the structure icons in the toolbar
• Create template structures and adding them to a drawing
• Check the validity of a structure
• Manipulate structures
• Select atoms and bonds
• Move structures
• Save a structure to a structure file
• Edit atom properties
Note The application has three different Compound Editor dialog boxes and one
Fragment Editor dialog box that all have the same structure drawing tools. You can open
these dialog boxes from the following Lists & Libraries views: Expected Compounds,
Mass Lists, Metabolika Pathways, and Compound Classes.
Check
Redo Clean
Undo Atom Property
Templates
Copy to Clipboard
n-Membered
Ring
5-Membered Ring
6-Membered Ring
Save Structure to Disk Benzene Ring
Chain
Load Structure from Disk
Triple Bond
Selection Tool Double Bond
Single Bond
Shortcut menu commands for the drawing area of the structure editors
The structure editors include a drawing area for adding a two-dimensional structure.
1. Open a structure file, paste a structure (InChi or MOL string), or draw a structure in the
drawing area of the structure editor.
2. In the toolbar, click the Save Structure to Disk button, .
3. In the Save Structure dialog box, do the following:
a. Browse to the folder where you want to store the file.
b. Name the file.
c. In the Save As Type list, select Template (*.tml).
d. Click Save.
4. To add a template structure to the drawing area, click the Templates icon, .
The Templates dialog box opens.
5. In the Explorer view of the Templates dialog box, browse to and select the folder where
you store your structure files.
The title bar of the Templates dialog box changes from Templates to Templates – Folder
name, and the 2D structures appear above the Explorer view. The application displays all
of the structures in the folder. It does not differentiate between MOL files and Template
files.
Figure 207. Templates dialog box with a view of the stored structures
6. On the structure that you want to open, click any atom or bond.
The templates cursor, , appears in the drawing area of the Compound Editor dialog
box.
7. To place the selected structure in the drawing area, click the drawing area.
Note The check structure tool does not perform quantum mechanical or
thermodynamical calculations that address possible structural stability.
Y To check a structure
2. Click OK.
The application automatically selects the atoms and bonds that it considers incorrect. The
application considers structures that are not connected as mixtures and reports them as
errors, but it does not select the mixtures.
Manipulate structures
You can resize, rotate, mirror, clean, and verify structures in the drawing area of a structure
editor. After finishing a structure drawing, always check for errors before proceeding.
Move structures
In the structure editors, you can move all or part of a structure or all the structures.
Table 209. Moving structures
Task Procedure
Move a structure. 1. Select the atoms or bonds that you want to move.
2. Drag the selected structures to a new location.
Move all the structures in the 1. Right-click the drawing area and choose Select All.
drawing area.
2. Click any atom or bond in the drawing area, and
then drag the structures to a new location.
Click , , or , and then click the bond that you want to change.
IMPORTANT The application does not support compounds with a radical. It also does
support the R-Substituent feature.
Note For LC studies, the editor that opens depends on whether you are editing a
compound definition, a compound annotation, or a fragment definition:
• The Compound Editor dialog box opens when you add a new compound to or
edit a compound in the expected compounds list or a mass list.
• The Compound Annotation Editor dialog box opens when you edit the
annotations for a compound in the Compounds table, the Expected Compounds
table, or a Structure Proposals table.
• The Fragment Editor dialog box opens when you add a new fragment to or edit a
fragment in the Edit Compound Class dialog box.
Tip In the Compound Editor dialog box, you can save compounds with charges. You
can also save compounds with radicals, but the application ignores radicals for data
processing.
Figure 209. Atom Properties dialog box, showing the properties for a charged nitrogen atom
Tip For example, to create a compound that is labeled with one carbon-14 atom,
double-click the labeling site—the atom that you want to change. In the Atom
Properties dialog box, select the Isotope check box, and then select 14 in the Nucleon
Number list.
Note After you install the Compound Discoverer application on your computer, you can
use the application without activating the license for up to 60 days.
After you activate the software license on one computer, you can deactivate the license and
transfer it to another computer.
2. If you have not already received your activation code, do the following:
a. Check your Junk Email folder.
b. If the email is not in your Junk Email folder, log in to your account at the following
URL. In the left navigation pane, under Software & Services, click Order History.
Then, in the list of ordered products, click the order number.
https://thermo.flexnetoperations.com
c. If you cannot find your account, send an email message to Licensing at
[email protected].
Provide the following information in the body of the message:
• Software application: Compound Discoverer
• Sales order number or purchase order number:____________________
• End user name:______________________
• End user email:______________________
3. In the License Activation dialog box (Figure 211), enter the following:
• Your company name
• Your full name
• Your contact email address
• The product ID for the Compound Discoverer 3.3 application.
There are five possible product IDs. Four of the product IDs are for software
upgrades (see Table 211).
• The activation code. You can type or paste the activation code.
Table 211. Product IDs for the Compound Discoverer 3.3 software
Material Order No. Product ID Description
OPTON-31055 XCALI-65161 SW, Compound Discoverer 3.3 SP2
(single license)
OPTON-31056 XCALI-65161 SW, Compound Discoverer 3.3 SP2
(2 or more licenses)
OPTON-31060 XCALI-65162 SW, Compound Discoverer (CD) 3.3 SP2
upgrade from CD 1.0, CD 2.0, CD 2.1
OPTON-31061 XCALI-65163 SW, Compound Discoverer 3.3 SP2
upgrade from Compound Discoverer 3.0,
3.1, and 3.2
OPTON-31062 XCALI-65164 SW, Compound Discoverer 3.3 SP2 and
Mass Frontier 8.0 SR1 with Curator
OPTON-31063 XCALI-65165 SW, Compound Discoverer 3.3 SP2 and
Mass Frontier 8.0 SR1 with Curator
(2 or more licenses)
4. Depending on whether you are activating the license on an online or offline computer,
continue with the appropriate topics:
• Activate the license on an online computer
• Activate the license on an offline computer
1. If you have not already entered the licensing information, enter it in the Activation Code
view of the License Activation dialog box.
Note Activating the license on an offline computer is a three-step process that requires
access to an online computer.
1. Create an activation request file (Activation-Activation Code.req) on the offline
computer.
2. Transfer the activation request file to an online computer where you upload it to the
licensing portal to obtain a response file (activation.xml).
3. Transfer the response file (activation.xml) to the offline computer, and then process it
by clicking Process Response File in the License Activation dialog box.
1. If you have not already entered the licensing information in the Activation Code view of
the License Activation dialog box, enter it now.
2. In the License Activation dialog box, click Offline Activation.
The Save Request File dialog box opens.
3. Save the activation request file (Activation-Activation Code.req).
Figure 212. Save Request File dialog box with activation code
4. Click OK.
The Activation Code view of the License Activation dialog box reappears.
Tip When you close the Offline Activation dialog box, the application automatically
saves the URL to the Clipboard, so that you can save the URL to a file for use on an
online computer.
IMPORTANT If you accidentally close the License Activation dialog box, start over at
step 1.
7. To download the response file (activation.xml) from the licensing server, do the following
on an online computer:
a. Transfer the activation request file ((Activation-Activation Code.req) to the online
computer.
b. Go to the following URL (case sensitive):
https://thermo.flexnetoperations.com/control/thmo/offlineActivation
The Life Sciences Mass Spectrometry Software Download and Licensing Portal
opens.
c. Click Choose File or Browse, browse to and select the request file
(Activation Activation Code.req), click Open, and then click Process.
Note Whether the page includes a Choose File button or a Browse button
depends on the web browser.
This completes the offline license activation process. The License Manager indicates that the
license for the specified product (XCALI-XXXXX) is permanent.
1. On the License Manager page, select the software product that you want to deactivate,
and then click Deactivate.
The License Deactivation dialog box opens. The Product ID box is populated with the
selected product’s ID, and Activation Code box is populated with the activation code for
your license.
Figure 215. License Deactivation dialog box populated with the product ID and activation code
Note Deactivating the license on an offline computer is a three-step process that requires
access to an online computer.
1. Create a deactivation request file (Deactivation-Activation Code.req) on the offline
computer. This step only starts the license deactivation process.
2. Transfer the deactivation request file (Deactivation-Activation Code.req) to an online
computer where you upload it to the licensing portal to obtain a response file
(activation.xml).
3. Transfer the response file to the offline computer, and then process it to complete the
deactivation process.
c. Click Choose File, browse to and select the Deactivation Activation Code.req file,
click Open, and then click Process.
The server downloads the response file.
d. Save the response file (activation.xml).
3. Complete the deactivation process on the offline computer as follows:
a. Transfer the response file (activation.xml) to this computer.
b. On the License Manager page, select the license that you are in the process of
deactivating, and click Deactivate.
The License Deactivation dialog box opens.
c. Click Process Response File.
The Open Response File dialog box opens.
d. Select the response file (activation.xml) and click Open.
The license disappears from the License Manager page.
1. Download the executable files and store them in the appropriate folder on the computer
where you are running the application.
2. Open the License Manager page.
3. Click Scan for Missing Features.
4. Close and reopen the application.
5. Choose Help > About.
The About Compound Discoverer dialog box opens with the Patent and Legal Notices
page displayed.
6. Expand the Nodes list and verify that it lists the new node.
Note You must contact Pathway Solutions for a KEGG license. Kanehisa Laboratories
does not provide KEGG licenses for the Compound Discoverer application.
*If you are applying for an academic license, you must include the following statement in your
email:
I certify that I work for an academic institution, and that I will not use KEGG for any
commercial purposes (including collaborative research with a commercial entity).
1. From the menu bar of the Compound Discoverer application, choose Help > License
Manager.
2. Click Activate.
3. In the License Activation dialog box, enter the name of your company, your full name,
your email address, the product ID number (XCALI-23456), and the activation code that
you received from Pathway Solutions.
4. Follow the instructions for online or offline activation as appropriate.
After you install the license, it might appear under Compound Discoverer_KEGG. If the
License Manager displays an expiration date, the license is valid.
2. In the left pane of the Configuration page, select Parallel Options under Server Settings.
The Parallel Options view opens.
3. In the Maximum Number of Processing Workflows in Parallel Execution box, type or
select an integer from 1 to 4.
The default value is equal to half the number of CPU cores in the processing computer.
4. Click Save Current Settings.
3. Click the browse button next to New Scratch Directory and locate the new directory.
4. Click Save Current Settings.
5. Restart the application.
6. Reopen the Scratch Directory Options view and make sure that the Current Scratch
Directory box lists the new scratch directory.
3. Clear the Save Study Automatically On Close and After Analysis Submit check box.
4. Click Save Current Settings.
5. Restart the application.
For details about setting up the global color palette, see the following topics:
• Open the color schema view of the configuration page
• Select a standard color palette
• Create new custom color palettes
• Proteome Discoverer
• 60 Distinct Colors
Figure 221. Color Palette Configuration dialog box with the basic options view
Selected color
Selected position for insertion
5. Click Insert.
In the Palette Colors area, the new color appears to the left of the currently selected color.
Figure 222. Custom color palette with the color red inserted to the left of the color green
Y To select a color for a custom color palette in the gradient color chart
• To edit an existing custom color palette, select it in the Selected Palette list, and then
click Edit.
The Color Palette Configuration dialog box opens. The Palette Name box displays
the name of the custom color palette that you selected.
3. Do any of the following:
• Click a color in the hexadecimal color gradient chart.
• Enter an RGB formula or an HSB formula by typing numeric values or using the
slider.
• Select the Advanced Options check box and use the separate sliders for red, green,
blue, hue, saturation, and brightness.
Figure 223. Advanced Options view of the Color Palette Configuration dialog box
Table 212 describes the options for submitting single scans to the mzCloud database.
Table 212. Submit Single Spectrum to mzCloud Options view
Parameter Description
FT Mass Tolerance Specifies the mass tolerance for scans acquired with an FT mass
analyzer.
Default: 12 ppm
IT Mass Tolerance Specifies the mass tolerance for scans acquired with an ion trap
mass analyzer.
Default 0.4 Da
Use mzCloud Node When this check box is selected, the application uses the settings
Settings in the Search mzCloud processing workflow node if the analysis
included this node.
Default: Selected
To obtain access to all the BioCyc databases, you must do one of the following:
• If you do not have a BioCyc user account, you must create a new BioCyc account on the
Sign Up page of the BioCyc website, and then enter, test, and save your account
information in the BioCyc User Login view. When you create a new account, you
automatically have 30 days of trial access to all the BioCyc databases.
• If you already have an existing account, you can request a 30-day trial access period to all
the BioCyc databases. Or, you can purchase an individual subscription or an institutional
subscription. If you do not request a 30-day trial access period or purchase a subscription,
you will have access to only the EcoCyc and MetaCyc databases.
Note To access the BioCyc website where you set up a user account, purchase a
subscription, or request 30 days of free trial access to all the BioCyc database, you must
have Internet access.
Figure 224. BioCyc User Login page for a user without a subscription
Instructions
You can log in to the BioCyc website by clicking LOGIN at the top of most of the Web
site pages. See Figure 225.
Currently, there is only one available fragmentation database. This view is reserved for future
use.
In the left pane of the Configuration page, under Miscellaneous Settings, select
HighChem Fragmentation Options.
1. Click a column header to sort the rows between ascending order (A, B, C …) and
descending order (Z, Y, X …), based on the contents of the column.
Note The application treats formulas the same as text strings and sorts them by the
order of the characters in the formula string, not by the actual number of elements in
the formula.
2. To sort the data by a second column, hold down the CTRL key and click the second
column heading.
1. Click the expand icon to display the vertical headings of the subordinate columns.
Column
heading
Expand
icon
Subordinate
column headings
2. Select the heading of the subordinate column that you want to sort by.
The selected subordinate column heading appears in bold text.
3. Click the column heading to sort the table rows.
Click the pin icon, , next to the row number of the row that you want to freeze.
The row moves to the top of the table, its pin icon changes to the pinned position, , and
a blue bar that defines the bottom of the freeze pane appears below the fixed row.
As you fix additional rows, they move up to the freeze pane in the order selected and their
icons change to pinned, . The row just above the blue bar is the last fixed row.
When you scroll the table, the freeze pane remains at the top. Figure 227 shows a
compound library with caffeine in the freeze pane.
Figure 227. Compound library with a freeze pane
Column heading
3. To ungroup the table rows, drag the column heading out of the Group by Area bar.
Figure 229 shows the study factor (Phenotype) column heading inside the Group by Area
bar, and the table rows grouped by the study factor value (Lean, Fatty, or n/a).
4. To turn off the group by row feature, right-click the page and choose Disable Row
Grouping.
1. To move a column to the left of its current position, drag the column header to the left.
Release the mouse button when the cursor ( ) appears over the column delineator.
Note If you release the mouse button when the column-stacking cursor ( )
appears instead of when the column-reordering cursor ( ) appears, the application
stacks the two columns. See Stack two table columns into one column.
Dragging
Column 1 Heading Column 2 Heading Column 3 Heading
column 3
Column 3 Heading
to the left
2. To save the current layout for a result file, choose File > Save from the menu bar.
Note To save the current layout to a layout file that you can apply to other result files,
choose Window > Save Layout. Then, in the Save Result Layout dialog box, name
the layout and click OK.
Figure 231 shows an example where the Area and Norm. Area columns of a compounds table
are stacked.
Figure 231. Stacked columns
1. Drag the column header of the column that you want to stack below the column header
of the column that you want on top. Release the mouse button when the cursor ( )
appears over the column heading.
2. To save the current layout for a result file, choose File > Save from the menu bar.
Note Except for the Input Files table, the Checked column is, by default, the first column
in every result table—that is, Checked is the text in first column’s heading row.
Note You can hide or show columns in these tables that are available from the Lists &
Libraries menu—Expected Compounds, Transformations, Neutral Losses, Adducts, and
Ion Definitions.
1. Click the Field Chooser icon, , in the upper-left corner of the table.
Figure 233. Field chooser icon in the upper-left corner of the main Study Information table
Field Chooser icon
The Field Chooser dialog box opens with a list of all of the column headers for the
current table in alphabetical order.
Figure 234. Field Chooser dialog box for the Study Information table
2. In the Field Chooser dialog box, clear the check box for each column that you want to
hide. To show those columns again, select their corresponding check boxes.
The table updates and shows or hides your chosen columns immediately.
Note The application does not copy the compound structure in the Structure column of
the Expected Compounds library to the Clipboard.
2. To select multiple table cells, use the CTRL key or the SHIFT key.
The selected cells turn a lighter blue than the other cells in the row, as shown in the
following figure.
Note For information about filtering the tables in the result files, see “Filter the data for
data reduction.”
To set up single-condition filters for the table columns, see these topics:
• Set up single-condition filters for the table columns
• Set up a single-condition wild card filter
• Set up a single-condition filter for numeric data
For information about setting up custom filters with the Custom Filter Selection dialog box,
see “Set up a custom filter with multiple conditions.”
1. In the table’s filter row, do the following for each column that you want to filter by:
a. Click the operator symbol (Aa or =) and select an operator from the list.
b. Set up the operand by selecting or typing a value in the operand box.
After you set up a filter, the applied filter icon, , appears to the right of the operand
box, and the table displays only those rows with entries that fulfill the filter condition.
2. To remove a single filter, click the filter icon, , to the right of the operand box.
1. In the operator list in the filter row, select Like (Wildcards) or Not Like (Wildcards).
2. In the operand box, select or type text and use an asterisk “*” to replace more than one
character or use a question mark “?” to replace only one character.
Tip For example, to filter the entries in the transformations library by the presence of
nitrogen in the arriving group, do the following in the Arriving Group column:
• Select * Like (Wildcards) in the operator list.
• Type *N* in the operand box.
• To display the top n percentile of entries, select Top Percentile in the operator list
and type a numeric value in the operand box.
• To display the bottom n percentile of entries, select Bottom Percentile in the
operator list and type a numeric value in the operand box.
The selections in the operator list depend on whether the column contains text or numeric
entries. After you select an operator, the operator symbol appears in the filter row to the left of
the operand box. For more information about the operator lists, see Table 214 on page 748
and Table 215 on page 749.
For all columns, the operand list includes the following: Custom, Blanks, NonBlanks, and the
column entries. For numerical-entry columns, the operand list also includes the following:
Above Average, Below Average, Top 10, Top 10 Percentile, Bottom 10, and Bottom 10
Percentile. For more information, see Table 213.
After you set up a column filter, the applied filter icon, , appears to the right of the operand
box. Figure 235 shows a filtered Ion Definitions list that reduces the number of displayed
entries to 10 by using the total adduct mass. The filter row of the Adducts Total Mass column
displays the equals symbol ( )for the mathematical operator, the selection of Top 10 for the
operand, and the applied filter icon, .
Figure 235. Ion Definitions list that is filtered by the adduct mass
Operator
Operand
Applied filter icon
Filter
row
Table 213 describes the available operand selections and the valid typed operand entries for
both text and numeric columns.
Table 213. Operands for the table columns on a study page or Lists & Libraries view (Sheet 1 of 3)
Operand Description
All table columns
(Custom) Applies the custom filter that you set up by using the Custom
Filter Selection dialog box.
Table 213. Operands for the table columns on a study page or Lists & Libraries view (Sheet 2 of 3)
Operand Description
(NonBlanks) Compatible operators: Equals and Not Equals
Filters the table rows by using the selected entry and operator.
Typed alphanumeric Table 214 describes the compatible operators for text entries.
text or numeric value Table 215 describes the compatible operators for numeric entries.
Filters the table rows by using the typed text entry and the selected
operator.
Additional selections for numeric value columns
Above Average Compatible operators: Equals and Not Equals
Table 213. Operands for the table columns on a study page or Lists & Libraries view (Sheet 3 of 3)
Operand Description
Top 10 Percentile Compatible operators: Equals and Not Equals
Table 214 describes the operators for columns with text entries.
Table 214. Operators for text columns
Symbol Text selection Effect
Equals Displays the text entries that exactly match the selected or
typed operand.
Not equals Displays the text entries that do not exactly match the
selected or typed operand.
Less than For alphabetic text entries, displays the text entries that
begin with a letter in the alphabet that comes before the
selected or typed operand.
Less than or equal –
to
Greater than –
Greater than or –
equal to
Contains Displays the text entries that contain the text in the selected
or typed operand.
Does not contain Displays the text entries that do not contain the text in the
selected or typed operand.
Like (wildcards) Displays the text entries that contain the selected or typed
text and any additional text represented by an asterisk.
Not like Hides the text entries that contain the selected or typed text
(wildcards) and any additional text represented by an asterisk.
Match (regular Displays the text entries that contain the same text as the
expression) selected or typed operand.
Does not match Displays the text entries that do not contain the same text as
(regular the selected or typed operand.
expression)
Starts with Displays the text entries that start with the selected or typed
operand.
Does not start Displays the text entries that do not start with the selected
with or typed operand.
Ends with Displays the text entries that end with the selected or typed
operand.
Does not end Displays the text entries that do not end with the selected or
with typed operand.
Table 215 describes the operators for columns with numeric entries.
Table 215. Operators for numeric columns
Symbol Text selection Effect
Equals Displays the numerical entries that equal the selected operand.
Not equals Displays the numerical entries that are not equal to the selected
operand.
Less than Displays the numerical entries that are less than the selected
operand.
Less than or Displays the numerical entries that are less than or equal to the
equal to selected operand.
Greater than Displays the numerical entries that are greater than the selected
operand.
Greater than Displays the numerical entries that are greater than or equal to
or equal to the selected operand.
Top Displays the n highest entries in the table, where n equals the
integer typed in the operand box.
Bottom Displays the n lowest entries in the table, where n equals the
integer typed in the operand box.
Top Displays the entries in the top nth percentile, where n equals the
percentile percentage typed in the operand box.
Bottom Displays the entries in the bottom nth percentile, where n equals
percentile the percentage typed in the operand box.
2. Do the following for each condition that you want to add to a group:
a. Click Add Condition.
A new table row appears.
b. Select an operator from the Operator list and an operand from the Operand list.
As you add conditions to the group, the application updates the group filter in the
gray area below the table.
3. To add the last condition to the group, click its row number.
Figure 237. Clicking the row number in the last row to add the row to the group
The last condition appears in the group filter area. By default, the application applies the
AND operator to all of the conditions in the group (Figure 238). A vertical blue bar to
the left of the condition rows indicates an AND group.
Figure 238. Group filter with three conditions and the AND group operator
Third condition
4. To change the group operator from AND to OR or from OR to AND, click Toggle.
An orange bar to the left of the condition rows indicates an OR group.
Figure 239. Group filter with three conditions and the OR group operator
6. To remove conditions from a group, select the conditions and click Ungroup.
7. To apply the filter, click OK.
The Custom Filter Selection dialog box closes, the text (Custom) appears in the operand
box, and the application applies the custom filter to the entries in the selected filter
column.
See Table 214 on page 748 for a list of the operators for the
text entry columns. See Table 215 on page 749 for a list of the
operators for the numerical entry columns.
Operand column Use to select or type an operand for the filter condition.
See Table 213 on page 745 for a list of the operands for the
library columns.
Third column Displays comments about the filter condition. For example,
this box displays “Condition is empty” until you define the
operator and the operand for a condition.
Filter description area
This area, which is highlighted in gray, displays the group filter conditions.
For an LC study, a minimum processing workflow that includes the Generate Molecular
Networks node consists of untargeted compound detection, grouping, and annotation
assignment. To validate the elemental composition differences, the workflow must contain at
least one search node or the Predict Compositions node.
Figure 242. Example processing workflow with the Generate Molecular Networks node
The input to the Generate Molecular Networks node is a list of annotated compounds, and
the output from the node is a table of similar compounds for each detected compound.
Note If you do not select any transformations for the Generate Molecular Networks node,
the node does not consider the elemental compositions of the detected compounds for
scoring.
• If there are compounds with the same elemental composition (isomers with different
retention times), the node adds a pathway of length 0.
2. For each pair of compounds that have assigned elemental compositions, the node finds a
matching pathway that explains the elemental composition difference between the two
compounds.
3. For each pair of compounds that have fragmentation data, the node calculates the spectral
similarity by FISh Scoring as follows:
a. Matches the fragments for both compounds by directly comparing their masses.
b. For the remaining unmatched fragments, the node does the following to match the
fragments:
• Uses the mass shift of the assigned pathway to match “shifted” fragments (if
enabled). In addition, it uses all possible permutations of the individual pathway
steps (if enabled).
• Uses the mass shift between the two compounds to match “shifted” fragments (if
enabled).
c. Calculates the similarity scores.
4. Applies specified rules and thresholds to the connections (matched pairs).
5. Stores valid connections to the results file.
6. Applies specified view filters on the results table.
You can consider each stored connection as a reaction, where one compound is a substrate
that is converted into a product through a specific transformation pathway. The Direction
column indicates the direction of the reaction between the selected compound in the main
compounds table and the similar compound in the related Similar Compounds table.
A Forward connection indicates that the selected compound is a substrate, to which the
transformation has been applied to generate the similar compound. A Reverse connection
indicates that the selected compound is a product of applying the transformation to the
similar compound.
Figure 243. Caffeine and one of its demethylation products (the Tags column is hidden)
Note For an LC study, you can export compounds to the molecular networks viewer from
the Compounds table.
1. (Optional) To display only the compounds of interest, filter the compounds table.
2. (Optional) Sort the compounds table by the columns of interest.
Note The application only exports compounds that appear in the table in the order
that they appear in the table, beginning with row 1. It does not export compounds
that are hidden by applied result filters or compounds in row numbers greater than
the Compounds Limit value.
3. Right-click the compounds table and choose Molecular Networks > Send to Viewer.
The Export Molecular Networks dialog box opens with the following default settings:
• Destination: Study folder where the current result file resides
• Name: Name of the current result file
• Compounds Limit: 3000
• Open Viewer After Export: Selected
• To set the viewer to NOT automatically open in the default browser after you click
Export, clear the Open Viewer After Export check box.
5. Click Export.
If you selected the Open Viewer After Export check box and Internet Explorer is not the
default browser, the viewer opens in the Move mode ( ) in the default browser.
6. If you did not select the Open Viewer After Export check box, browse to the destination
folder. Then, right-click the index.html file and choose a browser.
Note For the Internet Explorer browser, click Allow Blocked Content.
The viewer opens in the Move mode ( ) in the browser that you selected.
The browser does not connect to the Internet. The application derives the information
that it plots in the browser from the current result file.
Note To deselect a single node, click it. To deselect multiple nodes, hold down
the CTRL+SHIFT keys and drag the mouse cursor across all the selected nodes.
When you select a node it turns pink, and its molecular weight and retention time
appear in the Selected Items pane.
Figure 246. Molecular networks viewer with information about selected nodes in the
Selected Items pane
b. Click Copy.
The viewer copies the list of selected compounds to the Clipboard and displays
instructions about how to mark the selected compounds.
Figure 247. Instructions in the right pane about marking the compounds
4. Right-click the Compounds table for LC studies and choose Molecular Networks >
Mark Selected > one of the tags.
The Compound Discoverer application marks the compounds in the result table that
match the selected items list from the Clipboard with the selected tag.
Figure 249. Marked compounds in the Compounds table
Note For an LC study, you can visualize the similarity between the compounds in the
Compounds table for an untargeted workflow.
When you send compounds from a Compounds table to the molecular networks viewer, the
following occur:
1. The Compound Discoverer application sends the user-specified number of compounds to
the viewer. The application sends only compounds that are visible in the table, it does not
send compounds that are currently filtered out (with any of the result filters). In addition,
it sends the compounds in order of their row numbers, which can vary depending on how
you sort the table.
2. The viewer opens in a local browser window and displays the data in the current result
file; it does not connect to the Internet. The viewer processes the data and displays an
initial molecular network simulation using filter and threshold settings that you specified
in the Generate Molecular Networks workflow node.
In the left pane of the viewer, you can change the settings for the filters and the thresholds, the
node style, and the link style. In addition, you can search for a named compound or
transformation and isolate the display to a specific cluster.
In the right pane of the viewer, you can view information for each node or link that you point
to in the graph area.
The viewer has three independent modes—Move, Seek, or Selection. When the viewer is in
one of the basic modes, you can turn the fourth mode—Isolation—on or off by
double-clicking a node. By default, the viewer opens in the Move mode ( ) where you can
move and rearrange the nodes.
The functions in the left pane are available in all three modes.
Move mode
The molecular networks viewer opens to the Move mode ( ) for moving and
rearranging the nodes.
Seek mode
When you turn on the Seek mode ( ), you can mark nodes as parent compounds.
Then, view connection details in the right pane. A parent compound is the starting
compound that undergoes chemical transformations.
1. Click the Seek icon, , in the toolbar at the top left of the graph pane to turn on the
Seek mode.
The right pane changes to the Seek Links pane.
2. Click a node of interest in the graph pane.
A red outline indicates the selected node. Information about the selected node appears in
the Seek Links pane.
3. Point to another node in the graph pane.
In the graph pane, a link appears between the parent node and the query node. If the two
nodes are linked, the link is black. If the nodes are not linked, the link is red.
A description of the query link appears in the Seek Links pane at the right.
• If the compounds are related through a transformation reaction, the Seek Links pane
describes the transformation with a black arrow showing the direction of the
transformation.
• If the compounds are not related through a transformation reaction, the Seek Links
pane describes the nonexistent link between the two compounds with a red X over
the black arrow.
Parent node
(2, 6-dimethyl-nonane)
The red X
indicates that
the link does
not exist.
Selection mode
When you turn on the Selection mode ( ), you can select individual nodes
(compounds) and add them to the export list in the right pane of the view.
Table 217. Selection mode tasks
Task Procedure
Turn on the Selection mode. Click the Selection icon, , in the upper-left
corner of the plot area.
Select nodes. Turn on the Selection mode. Then, click the
nodes, one by one. Or, hold down the SHIFT
key and drag the mouse cursor across the nodes
that you want to select.
Copy selected nodes to the Clipboard. In the Selected Items pane, click Copy.
Clear the export list. In the Select Items pane, click Clear.
Deselect nodes. Click each selected node one-by-one. Or, hold
down the CTRL+SHIFT keys and drag the
mouse cursor across the selected nodes.
Y To view the backbones of the clusters in the molecular networks graph area
Color-coded nodes
In the molecular networks viewer, the identification status for a compound is indicated by the
color of its node.
Table 218. Node colors for compounds in the Compounds table
Color Meaning
Light green The processing workflow identified the compound by its fragmentation
spectra during an mzCloud or mzVault library search.
Dark green The processing workflow identified the compound by its formula and a
database match (but not an mzCloud match).
Figure 253 shows the molecular networks display for the 13 compounds in the Compounds
table shown in Figure 254. Pointing to one of the gray nodes displays the information for the
unknown compound.
Figure 253. Pointing to one of the gray nodes shows the information for the unknown compound
without a formula
Figure 254 shows the Compounds table for the 13 compounds that were sent to the
molecular networks viewer.
No. Description
1 Light green nodes represent the eight compounds that are identified by name and
formula and have mzCloud matches.
2 A dark green node represents the compound that is identified by formula and a
ChemSpider database match but does not have any mzCloud matches.
3 A blue node represents the unknown compound that has a formula annotation but
no database matches.
4 Gray nodes represent one compound that is identified only by its molecular weight
and two compounds that are identified by formula and molecular weight, but the
mass annotation for each of these compounds does not match the formula
annotation (Annot. ΔMass > 5 ppm).
1 2 3 4
1. In the left pane of the molecular networks viewer, click Node Style.
2. Do one of the following:
• Select the Show Confidence option.
Each node appears as a solid circle. The color of the node reflects the confidence of
the match.
• Select the Show Pie Charts option.
Each node appears as a pie chart of the relative areas of the compound in the study
groups. The color of the node’s border reflects the confidence of the match.
• Select the Show Structures option.
Each node appears as a circle with a molecular weight, formula, or structure. The
color of the node’s border reflects the confidence of the match.
Regardless of the node style, pointing to a node displays the following information about the
compound at the top of the right pane: name, formula, RT, maximum area (across the input
files), and number of fragments in the fragmentation spectrum.
The bottom of the right pane displays a pie chart for the relative areas of the compound in the
study groups or the separate input files when the analysis does not include study groups.
Figure 255 shows a pie chart for the relative areas for caffeine in samples collected from a
human subject in the morning versus the evening.
Y To select whether the node size is proportional to the compound’s peak area or MW
1. In the left pane of the molecular networks viewer, click Node Style.
2. Select one of these options: By Area or By Mass.
1. In the left pane of the molecular networks viewer, click Link Style.
2. Select one of these options: By Score, By Coverage, or By Fragments.
Tip By default, the link color is a function of the MSn score. The length of each link has
no meaning. To investigate the clusters, try colorizing the links by the other options.
1. In the left pane of the molecular networks viewer, click Link Style.
2. Click the Show Arrows check box.
3. To display the transformation in the right pane, point to the arrow.
Figure 256. Pointing to a directional arrow for a link
Reset Pan and Zoom Reset, pan, and zoom in the graph area.
Tip To open or close a pane, point to the upper-right corner of the pane to display the
Hide text or the Show text. When text appears, click it.
Search pane
Use the Search pane of the molecular networks viewer to highlight nodes or links in the graph
area by searching for the name of a compound or the name of a transformation, respectively.
Task Procedure
Highlight nodes, in red, for In the Search pane, type the compound’s name in the
compounds with a specified text Compound box.
string in their names.
The viewer highlights all compounds with the
specified text string in the graph area.
Highlight links, in red, for the In the Search pane, type the transformation’s name in
same named transformation. the Transformation box.
The two bar colors and the background color behind the bars represent the following:
• Blue bar—represents the relative portion of the currently visible items.
• Dark gray bar—represents the relative portion of all items of a particular type.
• Light-gray background—represents the total number of items.
Figure 258. Graph Info pane of the molecular networks viewer
Table 221. Graph Info pane of the molecular networks viewer (Sheet 1 of 2)
Parameter Description
Nodes Indicates the number of nodes in the graph area.
Orphans Indicates the number of nodes without any connection (according to
the current filters and thresholds).
Identified Indicates the number of compounds with an assigned name.
Table 221. Graph Info pane of the molecular networks viewer (Sheet 2 of 2)
Parameter Description
Unknown Indicates the number of compounds without an assigned name.
Links Indicates the number of all connections.
Transform. Indicates the number of connections with assigned transformations.
MSn Indicates the number of connections with fragmentation data.
Both Indicates the number of connections with assigned transformations
and fragmentation data.
Isolation pane
If the main graph is in the Isolation mode, the Isolation pane of the molecular networks
viewer provides dynamic control over the maximum depth of the graph.
Figure 259. Isolation pane of the molecular networks viewer
For information about turning on the Isolation mode, see “Use the Isolation mode to display
specific clusters.”
Filters pane
Use the filters in the Filters pane of the molecular networks viewer to limit the amount of
visible data.
Figure 260. Filters pane of the molecular networks viewer (default settings)
Default: Enabled
Require MSn Clear to display links without fragmentation data.
Default: Enabled
Thresholds pane
To show or hide low confidence relationships between compounds in the molecular networks
viewer, adjust the settings in the Thresholds pane. By default, the thresholds are set to those
specified in the processing workflow.
Figure 261. Thresholds pane of the molecular networks viewer
Range: 0 to 100
Coverage Specifies the minimum Forward or Reverse Coverage for a
connection to be visible.
Range: 0 to 100
Matched Fragments Specifies the minimum number of matched fragments for a
Forward or Reverse search for a connection to be visible.
Minimum: 0
Clusters pane
Use the Clusters pane to limit the number of nodes per cluster. The viewer applies the Node
Links limit first and then applies the Cluster size limit. By default, the thresholds are set to
those specified in the processing workflow.
Figure 262. Clusters pane of the molecular networks viewer
Range: 1 to 50
Cluster Size Specifies the maximum number of nodes within a single
cluster. As you reduce the cluster size, the algorithm
excludes nodes connected by links with lower scores.
Reducing the cluster size to 2 reduces the number of nodes
per cluster to two nodes with one shared link.
Range: 2 to 100
You can select either Show Confidence or Show Pie Charts and either Size by Area or Size by
Mass.
Figure 263. Node Style pane of the molecular networks viewer
The molecular networks viewer displays links in the following colors: gray > orange >
dark-red. If a connection has no fragmentation data, the link appears as a gray-dashed line.
Color by Score The link color is proportional to the MSn Score.
Color by Coverage The link color is proportional to the Forward or Reverse Coverage
(max).
Color by Fragments The link color is proportional to the number of Forward or
Reverse Matches (max).
The information automatically disappears from the right pane when you move the pointer
away from the node or link.
Tip To prevent the information about the selected node or link from disappearing or
changing when you move the mouse pointer, hold the SHIFT key while you move the
pointer.
When the molecular networks view is in the Selection mode and you are not pointing to a
node or connector, the Selected Items pane appears to the right of the graphical display. For
details about creating a list of compounds to mark, see “Mark selected compounds in the
main compounds table.”
The following topics describe how to test and troubleshoot the application’s access to the
online mass spectrum databases:
• Troubleshoot access to the online databases
• Run the communication tests
• Check the URLs for the online databases in your browser
• Specify the IP address of the proxy server
• Set the correct time and time zone on the processing computer
• If any of the other communication tests also fail, check the access to the URLs for the
online databases. See “Check the URLs for the online databases in your browser.”
If you can access the URLs for the online databases through your browser, but the
communication tests still fail, the firewall or proxy setting for your company network
is blocking the application’s access to the online databases.
3. If the communication tests fail, but you can access the URLs for the online databases, do
the following as applicable:
• If a firewall is blocking the application’s access to the online databases, ask your IT
department to make sure that the company firewall is not blocking “Thermo
Compound Discoverer” or “Thermo Compound Discoverer Server” from accessing
the URLs. The application uses the following protocol: http port 80.
• If a proxy setting is blocking access, see “Specify the IP address of the proxy server.”
4. Replace the text that is highlighted in yellow in Figure 267 with your company proxy
address.
Figure 267. Default proxy address highlighted in yellow
Set the correct time and time zone on the processing computer
The mzCloud communication test includes a validation of the date and time settings on the
processing computer. If the mzCloud communication test fails, but the other communication
tests succeed, check the date and time settings for the processing computer.
1. Open the Date and Time dialog box or view by typing Date in the Windows search box.
2. Make sure that both the date and time and time zone settings are correct.
Tip If your computer is not part of a network domain that synchronizes the
computer’s clock to the network server, you can use an Internet server to synchronize
the computer’s clock.
3. If the Internet Time tab is available, click it and synchronize the computer’s clock with an
Internet server.
Technical replicates are replicate samples from the same entity under the same conditions.
Technical replicates from the same entity under the same conditions provide a measure of the
sampling error, and replicate injections from the same sample solution provide a measure of
the instrument error.
You can add only one biological replicate factor to a study. The application treats study factors
nested under a biological replicate factor as technical replicates.
In non-nested experiments, different sets of biological replicates are used for each condition.
Figure 268 shows a non-nested experiment where replicate samples are taken from two sets of
rats under two conditions. Tom, Jerry, and Hector are fed a normal diet; and Tai, Fernando,
and Pierre are fed a low-calorie diet.
Figure 268. Non-nested design with independent sets of rats for the two dietary conditions
1 2 3 1 2 3 1 2 3 1 2 3 1 2 3 1 2 3
Three technical replicates for each rat
In nested experiments, the same set of biological replicates are used for each condition.
Figure 269 shows a nested experiment where replicate samples are taken from the same three
rats under two conditions—a normal diet and a low-calorie diet. Tom, Jerry, and Pierre are
the biological replicates.
Figure 269. Nested design with the same three rats under two dietary conditions
Tom
Rat1 Jerry2
Rat Hector
Rat 3 Tom1
Rat Jerry2
Rat Hector
Rat 3
1 2 3 1 2 3 1 2 3 1 2 3 1 2 3 1 2 3
Three technical replicates for each rat
To set up the comparison ratios for a nested design with technical replicates, add the following
study factors:
• A factor for the variable being studied with an itemized list of the variable states
• A biological replicate factor with an itemized list of the entities being studied
• (Optional) A factor for the technical replicates
Figure 270 shows the study factors for the experiment shown in Figure 269.
Figure 270. Study factors for the nested design experiment
Numerical factor
Categorical factor
Figure 271 shows the generated sample groups and ratios for the nested design. The sample
groups—Low-cal Diet and Normal Diet—are highlighted in green. The two sample groups
contain the same values for the Rat Biological Replicate factor—Tom, Jerry, and Pierre. The
technical replicates for each biological replicate are grouped together and the biological
replicates are highlighted in yellow. As shown in the Generated Ratios area, for each selected
denominator, the Differential Analysis node calculates one group ratio and individual ratios
for each biological replicate. In the result file, the Compounds table includes a Ratio column
for the group ratio and Bio. Rep. Ratio columns for the biological replicate ratios.
Index
A alignment features 63
Alignment Model parameter 228
Abort command, Job Queue 150
Alignment property, reporting 614
Activate button, License Manager 705
activation code, licensing 705 alignment, chromatographic 196, 205, 237, 470
activation request (.req) file 708 Allow Aromatic Cleavage parameter 362
ammonium adduct 272, 279
activation response file (activation.xml) 708
analyses
Activation Type parameter 234, 241
reprocessing 154, 156
Actual Size icon, reporting 608
reviewing 153
Add Column command 669 sample groups and ratios for 127, 142
Add Files command, study 173 submitting to the job queue 144
Add Items icon, reporting 607 troubleshooting 143
Add Plot command 402 Analysis pane 158
Add Structure Proposal command 377 Analysis Results page of a study 152
Add Tag command 376 analysis results, opening 334
adding Analysis Sequence Details window 153
adducts to the library 644 analysis summary 371
biological replicate factors to a study 39, 114 Analysis Validation Issues prompt 145
categorical factors to a study 112
Analyze Labeled Compounds node 270
charges to a structure 690
AnchorBottom property, reporting 614
compounds to the library 638
custom explanations to a result file 356 AND logical conjunction, filtering with 367–368, 370
fragments lists to the Compound Classes list 210 Angle property, reporting 617
input files to a new study 107 Annotate Full Spectrum Tree check box 362
input files to the Analysis pane 140 annotations
ion definitions to the library 649 copying 414
manual peaks to a specialized trace 400 editing 355
numeric factors to a study 113 fragmentation scan 408
page break after the page footer of a report 600 Appearance properties, reporting 614
transformations to the library 654 Appendix section, reporting 606
Adduct Editor 644 application
adducts menu bar 43
library 642 toolbar 48
Align Columns icon, reporting 607 applications, installing
Align Retention Times (ChromAlign) node Compound Discoverer 26
limitations 237 mzVault 28
Align Retention Times node Apply Dealkylation parameter 264
limitations 237 Apply Dearylation parameter 264
using 228 Apply Filters button, Result Filters view 371
Apply FISh Scoring check box 357 biological replicate factor 114
Apply FISh Scoring command 377 blanks
Apply Missing Value Imputation node 285 Blank sample type 98, 119, 184
Apply mzLogic node 319 Exclude Blanks parameter 293–294
Apply QC Corrections node Mark Background Compounds node 199, 206, 290
parameter descriptions 286 Normalize Areas node 293–294
Apply SERRF QC Correction node 288 blocked IP addresses 783
arranging windows, tool for 58 blue circles, KEGG database 465
Arriving Group blue coverage bars in spectrum tree pane 537
data entry box 655 bond multiplicity, changing 701
parameter 656 borders, adding to a report template item 602
result table column 513 borders, adding to design items 602
Transformations library column 653 bracket, using in elemental composition formula 161
arrows Bulk Ratio Generation area 133
layout guide tool 58 By File check box, Analysis pane 144, 158
Metabolika pathway 678
molecular networks viewer 773
Assign command, Input File Characterization page 119
C
Assign Compound Annotations node 295 Calculate Score button 485
Atom Properties dialog box CanGrow property, reporting 617
using for charged fragments 691 CaptionPosition property, reporting 614
using for neutral compounds 702 carbon-14 labeled compounds 703
Auto Layout command, Workflows page 190 categorical study factors 39
AutoReplaceFields property, reporting 617 Caution symbol 191
auto-save option for studies 718 Caution symbol, Analysis pane 141
Autosize property, reporting 617 Caution symbol, Analysis view 143
Average Peak Width parameter 198, 257, 261 Ceiling selection, mass defect 231, 320
Avg. Exchange column 517 Cell Selection Mode command 375
axis labels Cell Selection Mode shortcut menu command 741
Hierarchical Cluster Analysis view 482 Character spacing property, reporting 614
Histogram Chart 421 Charge parameter 163–164, 167, 646
Scatter Chart 432–433 charge state parameter 239, 243
charge, adding to a structure 702
B charged structures 690
BackColor property, reporting 614 Chart Type parameter 427
Bar Chart page, displaying 423 Check All Down-Regulated Points command 463
Barcode design item 611 Check All Up-Regulated Points command 463
BarHeight property, reporting 614 Check icon, structure checking tool 696
base ions 208, 272, 279 Check Point command 434
batch effects, time dependent 84 Check Structure message box 696
Behavior properties, reporting 617 CheckAlignment property, reporting 617
best fit MS scan 505 CheckBox design item, reporting 610
best MS1 scan for a compound 71 Checked property filter, Result Filters view 365
best MS2 scan for a compound 72 Checked property, reporting 617
BioCyc credentials, saving 731 CheckSumEnabled property 617
BioCyc Pathways table 545 ChemSpider Results table 533
BioCyc Pathways view 467 Chromatogram Peaks table 489
BioCyc Results table 546, 550 Chromatograms view
shortcut menu commands 403
BioCyc User Login view 727
working with 405
Biological Replicate Factor 39, 114, 175, 788
chromatographic peak apex, red line indicating 505
Clear All button, Result Filters view 371 Copy Image command 402
Clear Compound Annotation command 377 Copy Points command 402
Clear Frozen Content command 403 Copy with Headers command 375
Clipboard copying
pasting result table rows 375 Chromatograms view as a table of data points 402
pasting structures 375 Chromatograms view as an image 402
Close Sub-Report icon, reporting 607 Copy to Clipboard dialog box 163
Collapse All Column Headers command 376 elemental composition of a compound 161
collapsible left pane result tables 375
Chromatograms view 340 structures to the Clipboard 375
Mass Spectrum view 411 study factors 115, 175
Options pane, Result Charts view 416 cover page, report template 580, 605
Collect Small Segments parameter, Pie Chart 428 coverage bars in spectrum tree pane 537
Color legend, Scatter Chart view 433 CoverPage section, reporting 605
color map, selecting 719 Create Analog Trace node
color palettes, standard 720 connections 220
color schemes, selecting for a report template 573 parameter descriptions 246
color-coded named tags 345 Create FISh Trace node, parameter descriptions 248
ColumnCount property, reporting 619 Create Mass Trace node, parameter descriptions 250
ColumnDirection property, reporting 618 Create Pattern Trace node, parameter descriptions 252
columns, application table Create Report icon 568
changing the order 737 creation date, result file 372
stacking 738 CrossSectionBox design item 612
columns, report CrossSectionLine design item 612
aligning 602, 607 CSV files 61, 163, 375, 662
transposing 572 CTRL key, using 399
ColumnSpacing property, reporting 619 CTRL+SHIFT keys, deselect nodes by using 760
communication tests 783 Culture property, reporting 619
compatibility, software 25 Current Workflow Issues pane 143
Compound Annotation Editor dialog box 355 Custom Filter Selection dialog box, features 753
Compound Area Corrections view 472 Custom Isotope Ratio Pattern option 164
compound class editor 688 custom layout 354
compound class files, managing 687 custom tags
Compound Class Matches table 551 defining 345
Compound Class Scoring node 321 exporting 352
Compound Classes library 684 filtering by 351
Compound Classes library, modifying 682 importing 352
Compound Discoverer application window 43 saving to a TAGS file 352
Custom Tags Editor 345, 352
Compound Editor dialog box
for editing mass list compounds 671 Customize Report dialog box 570
shortcut menu 694 CV switching, FAIMS experiment 63, 242–243
using 638
compounds labeled with carbon-14 703 D
Compounds table 514 data points, minimum number for peak detection 257, 261
compounds, library of expected 635 Data properties, reporting 619
Configuration page 716 Data Source parameter 429
contacting us 30 Data Source, Scatter Chart 433
context menus for the result tables 374 DataField property, reporting 620
Continuous View icon, reporting 632 date and time information, reporting 570
Control sample type 98, 184 deactivating the software license 711
Copy command 434
Export > Compounds to a New mzVault Library automatic peak width detection algorithm 237
command 378 Find icon, reporting 629
Export > Single Compound to an Existing mzVault Library Find Missing Files command 181
command 378 firewalls 784
Export As Xcalibur Inclusion/Exclusion List command 378 FISh coverage score 79
Export Image As command 402 FISh scoring
Export Points As command 402 applying to targeted compounds 408
Export Spectra node applying to untargeted compounds 409
connections 220 FISh Scoring node
parameter descriptions 226 connections 220
Export to Existing Mass List dialog box 389 using 253
Export to New Mass List dialog box 388 FISh Scoring Queue view 486
Export to Text File command 375 FISh Trace Fragments table 492
exporting Fit Page icon, reporting 632
compounds to a new mzVault library 387 Fit Width icon, reporting 632
custom tags 352 Floor selection, mass defect 231, 320
data to a CSV file 381 flowchart, reporting 564
mass list for the TraceFinder application 385 fold change 457
reports 630
folder structure 99
single compound to an existing mzVault library 386
Font property, reporting 615
study information 100
Xcalibur inclusion/exclusion lists 382 Food Research folder 210
ForeColor property, reporting 615
Extract Sample Information From Sample Names
dialog box 120 Forensics folder 211
Format Border dialog box, report template 602
FormatString property, reporting 615
F Forward connection, molecular networking 757
Factor Unit box, numeric factor 113 Fractional Mass selection, mass defect 231
factory default result file layout 353 Fragment Editor 689
Factory Defaults command 418, 423, 427, 432 Fragment Mode parameter 249
FAIMS-MS experiment 63 FreeStyle data-visualization application 163
Features table 525 Freeze Content command, Chromatograms view 403
Field Chooser icon, using to show or hide columns 740 freeze pane 735
file path, input files 177 full gap
Files for Analysis area 140 Missing Value Imputation node 285
Fill Gaps node
connections 218
parameter descriptions 280
G
Filter By check boxes 341 GdiCharSet property, reporting 615
Filter By check boxes, Chromatograms view 340 GdiVerticalFont property, reporting 615
Filter By Mass Defect node 230 gear icon, workflow nodes with 155
Filter By Scan Event node 233 Generate Expected Compounds node
Filter Centroids node 236 connections 221
using 261
filter set file
saving 366 Generate Molecular Networks node 322
summary 373 Get from Clipboard button, Isotope Ratio Editor 162
Filter Summary page 373 Get from Composition Formula button, Isotope Ratio
filters summary 373 Editor 162
Find dialog box, reporting 630 global settings 716
Find Expected Compounds (Legacy) node Go to Same Item in Main Table command 378
automatic peak width detection algorithm 237 gray status rectangle 495–496
Find Expected Compounds node 255, 258 green background, table cell with 559
Sample Groups and Ratios page 127 Shape design item, reporting 611
sample groups and ratios, analysis summary 371 SHIFT key, using 733
sample types 184 Shift key, using 399, 760
Samples page of a study 182 shifted match annotations 408
Save As icon, report designer page 606 shortcut menu commands
Save Common command, Workflows page 190 Chromatogram View 401
Save Custom Tags Editor Settings dialog box 348 Compound Editor dialog box 694
Save Filter dialog box 371 for result tables with a Structure column 375
saving for result tables with a Tags column 376
all changes to open pages 44 Mass Spectrum View 414
analysis templates 158 molecular network viewer 377
auto-save option for studies 718 report designer page 609
BioCyc credentials 731 result tables 374
current page 44 tabbed documents 57
custom tags to a TAGS file 352 Show All Tables check box, Result Filters view 364, 371
data points to a text file 61 Show Details icon, Input Files page 177, 180
data points to image files 61 Show Detected Peaks command 404
filter sets 366 Show Dots icon, reporting 608
processing workflow templates 190, 224 Show Gridlines command 404
report templates 606 Show Legend command 404
result file layout 353 Show Lines icon, reporting 608
study factors 113, 115, 175 Show Standard Errors command 440
Scale Areas node 293 Show Tooltips command 403
scaling factor, chromatographic peak area 294 ShrinkToFit property, reporting 619
Scan for Missing Features button, License Manager 713 side-by-side view arrangement 59
Scan Polarity parameter 249 Single Page View icon, reporting 632
Scan Type parameter 235, 241 Size property, reporting 622
Scripting Node 331 SizeMode, reporting 622
Search ChemSpider node 301 sizing bar, report designer page 600, 604
Search Mass Lists node 302 Small Slice Collection parameter, Pie Chart 429
Search mzCloud node 304 Small Slice Threshold (%) parameter 429
Search mzVault node 309 Snap Lines icon, reporting 608
Search Neutral Losses node 327 Snap to Grid icon, reporting 608
Search Results icon, reporting 633 Snapshot Mode icon, reporting 630, 633
sections, report designer page 604 software requirements 23
Seek tool 774 sorting
Select All Visible Points command 435 analyses on the Job Queue page 149
Select Mode icon, reporting 608 distribution maps, result table 734
Select Spectra node result table rows 734
connections 220 study variables 131
parameter descriptions 237 Sorting icon, study variable 131
using 236 Specialized Traces table 497
Select/Deselect Item in All Groups icon 133 Specify FISh Scoring Settings dialog box 360
Selection mode 766 Spectrum Maximum parameter 247
Selection Mode icon, reporting 630, 633 spreadsheet
selection mode, choosing 700 exporting data to 375, 379
Selection tool 774 exporting the contents of a report 630
Selection Tool, using 700 stable isotope labeling experiments 87, 270, 442, 520
Set As Input File To shortcut menu command 184 Stable Isotope Labeling folder 212
Set Sample Type To command 119, 183 stacking table columns in a result file 738
Set Tags command 376 stacking the views 59
Standard Mass Defect parameter selection 231 changing the sort order 131
Standard sample type 98, 184 hierarchy 131
Start Page Style property, reporting 616
hiding 53 SubReport design item, editing 607
icon 49 Summaries view 371
pin icon 53 summaries, processing 371
recent file lists 53 Summed Trace parameter 248
shortcut menu 57 SupplementOptions property, reporting 616
Start property, reporting 623 symbol, Caution 191
Statistical Methods table 556 symbols
status columns exclamation mark 192
Expected Finder, Consolidated Peaks table 496 synchronizing the computer’s clock 787
Unknown Detector, Consolidated Peaks table 496
systematic error removal with random forests (SERRF) 85
structure
cleaning 699
copying to the Clipboard 391 T
creating a mirror image 699 table rows
moving 701 copying 741
resizing 698 grouping 736
rotating 698 pinning 735
saving 703 sorting 734
structure drawing tools, using 693 tables
Structure Editor module filtering 742
Atom Properties dialog box 701 freezing columns 739
bond multiplicity 701 sorting 734
bond properties 701 Tag property, reporting 621
clean structure 699 Tags column 376
structure files, opening 691 TAGS file 352
Structure Proposals table 499 tags, named and color-coded 345
structures tags, result table 345
checking validity 696 template tool, using 695
mirror 699 text file 61
resizing 698
Text property, reporting 621
rotating 698
TextJustify property, reporting 616
Structures column 392
Thermo MS Licensing, email from 705
structures, charged 690
Thermo Scientific website, user documents 22
structures, checking the validity 696
Thumbnails Pane icon, reporting 633
studies
time zone settings 787
Analysis Results page 152
TML files 695
Input Files page 180
Samples page 182 Toggle Sidebar icon, reporting 631
Study Definition page 175 tool, guide or position 58
Studies Folder parameter 99 toolbar, application 48
study factors, three type of 39 tooltips 439
Study Name and Processing Workflow page 102 Total Intensity Threshold parameter 239
study name for a result file, finding 373 Total Scan parameter 246
Study Summary page 373 Trace Type parameter 246, 250
study template files TraceFinder application, mass list for 385
creating 100 Transformation Editor 654
selecting 102 transformations
study variables adding 654
assigning 109 editing 656
X
X Axis Type parameter, Scatter Chart 432
X marks 403
Xcalibur Inclusion/Exclusion list, exporting 381
Y
Y Axis Type parameter, Scatter Chart 432
yellow background, table cell with 559
yellow-green background, table cell with 559
Z
Z Axis Type parameter, Scatter Chart 432
Zoom In icon, reporting 608
Zoom Out command 434
Zoom Out icon, reporting 608, 632
z-score transformation 481
The integration of Metabolika pathways involves mapping detected compounds against those in pathway databases. The Metabolika Results table shows whether compounds from experimental data match pathway structures fully, partially, or not at all. This integration aids in understanding biochemical interactions and validating compound identities .
Report templates can be customized by opening the report template page, which allows modification using the Customize Report dialog box. Users can add cover pages, change logos, alter date formats, add rich text boxes, and insert data graphs or mzCloud plots. It is also possible to edit subreport columns, make selections for the main table, and adjust page breaks for a tailored report layout .
The LC report templates include options like 'Compounds No Graphs' and 'Compounds with Graphs', which report structural data, names, formulas, calculated molecular weights, and group areas. Templates like 'Expected Compounds No Graphs' focus on parent compounds, formulas, retention times, and transformations. These templates facilitate comprehensive data reporting through structured output formats .
To add input files to a new study, you need to open the Input File Selection page of the New Study and Analysis Wizard. You can then click 'Add Files', browse to the appropriate folder, select the Xcalibur RAW files, and click 'Open'. The selected file names will appear in the Files box, and the Next button will be enabled once files are added .
The Study Information table provides a structured view of sample identifiers, study file IDs, file names, sample types, and study factor columns. It shows the associated sample group, replicate group, and any calculated ratios, allowing for comprehensive review and reporting of processed input files .
The Group Compounds node in the application aims to consolidate unknown compounds across input files by molecular weight and retention time. It chooses the best representative MS1 scan and fragmentation tree for further identification processes. The default parameters include a mass tolerance of 5 ppm and an RT tolerance of 0.05 minutes. This node supports aligning chromatographic peaks, setting preferred ions, and determining peak area integration .
The Find Expected Compounds node generates extracted ion chromatograms (XICs) using specified mass tolerance, and compares isotope patterns based on intensity tolerance. It evaluates intensity thresholds for isotopic peaks and uses a minimum number of isotopes to match the theoretical isotope pattern of a compound. Parameters include mass tolerance (default: 5 ppm), intensity tolerance (default: 30%), intensity threshold, and minimum number of isotopes .
Chromatographic peak areas can be determined based on the most common ion or by summing the areas of all detected adduct ions in an input file. This is managed under the Group Compounds node parameters, allowing for flexibility in analyzing peak intensities and group areas, crucial for an accurate analysis of compounds .
New input files can be characterized by setting up sample information on the Input File Characterization page of the New Study and Analysis Wizard. Alternatively, if input files are added to an existing study, the Input File Characterization dialog box is used. The characterization includes selecting delimiters for parsing file names, adding or editing study factors, and assigning study factor values to samples .
The Study Definition page allows you to set up study factors, view the study name, file location, creation date, modification date, and description. It is used to edit existing study factors, set up new study factors, and add or edit a description of the study. This setup is essential for organizing and documenting data and analysis parameters .