1 of 6

Analysis Output

When the analysis run completes, the DRAGEN TruSight Oncology 500 ctDNA Analysis Software generates an analysis output folder in a specified location.

To view analysis output, navigate to the analysis output folder and select the files that you want to view.

Analysis Output Folder Structure

Single output folder structure is as follows.

Logs_Intermediates
- AdditionalSarjMetrics
- Annotation—Contains outputs for small variant annotation.
  - Subfolders per sample ID—Contains the aligned small variants JSON.
Results
- Metrics Output TSV (all Sample IDs)
- Sample ID—The following outputs are produced for each sample:

ICA Output Folder Structure

This section describes each output folder generated during analysis and where to find metric and analytic files when the pipeline is executed. The same output folder structure and content exist in ICA and BaseSpace Sequence Hub.

High-Level Folder Structure

Run ID
- TSO500_Nextflow_logs
- _manifest.json

TSO500_Nextflow_logs Folder Structure

The TSO_500_Nextflow_Logs provides information related to the execution of the pipeline on ICA as a whole and for specific nodes (when an analysis is split across multiple nodes). It contains files used to execute parts of the workflow on different nodes as well as records of the nextflow execution on those nodes.

TSO_500_Nextflow_Logs
- _manifest.json

Results Folder Structure

Contains the aggregated MetricsOutput.tsv file at the root level. Additionally, the Results folder contains a subfolder for each sample ID.

Results
- MetricsOutput.tsv
- Sample_1

The Results subfolder contains the following files:

Results
- MetricsOutput.tsv
- <Sample_id>

Logs_intermediates Folder Structure

Contains folders for each submodule in the DRAGEN TSO 500 ctDNA on ICA pipeline. The folders contain a copy of all the relevant files required to create the metric output files and report files, as well as the combined log files at the root level and subfolders for each sample.

Logs_intermediates
- AdditionalSarjMetrics
- Annotation

All logs in Logs_Intermediates are generated from the running analysis software. Inputs to the running Docker container (for example, the run folder, sample sheet, and FASTQ folder) are mapped from native locations on the server to the following locations in the container:

Input

Running Docker Container Location

The paths in the log messages refer to paths within the running docker container, not paths on the server.

Errors Folder Structure

Contains Errors.tsv. This file contains the summary of all the errors encountered during pipeline execution.

Errors
- Errors.tsv

NovaSeq 6000Dx Analysis Application Output Folder Structure

The following files and folders are created during analysis by NovaSeq 6000Dx Analysis Application:

analysisResults.json
CopyComplete.txt
edgeos.nextflow.config
inputs/

When the analysis run completes, the analysis application generates an analysis output in a specified location. To view analysis output, follow the steps below:

On the “Completed” runs tab, select the run
Review the run details page, and this will give the information to access the output folder
External Location: is the input for the run

Combined Variant Output

File name: {SampleID}_CombinedVariantOutput.tsv

The combined variant file contains the variants and biomarkers in a single file. The output contains the following variant types and biomarkers:

Small variants (including EGFR complex variants)
Copy number variants
Tumor Mutational Burden (TMB)
MSI
DNA Fusions

The combined variant output file also contains Analysis Details and Sequencing Run Details sections. The details of each are listed in the following table:

Analysis Details

Sequencing Run Details

Variant Filtering Rules

Combined variant output produces small variants with blank fields in the following situations:

The variant has been matched to a canonical RefSeq transcript on an overlapping gene not targeted by TruSight Oncology 500 ctDNA.
The variant is located in a region designated iSNP, indel, or Flanking in the TST500_Manifest.bed file located in the Resources folder.
Small Variants - All variants with the FILTER field marked as PASS and which have a canonical RefSeq transcript are present in the combined variant output.

Metrics Output

File Name: MetricsOutput.tsv

The metrics output file is a final combined metrics report that provides sample status, key analysis metrics, and metadata in a tab-separated values (TSV) file. Sample metrics within the report indicate guideline‑suggested lower specification limits (LSL) and upper specification limits (USL) for each sample in the run.

One metrics output file is generated for the entire run. An additional file is generated for each sample.

All metrics and guidelines are applicable to all versions of DRAGEN TSO 500 ctDNA analysis software (v2.1 and above).

Run Metrics

Run metrics from the analysis module indicate the quality of the sequencing run.

Review the following metrics to assess run data quality:

Metric

Description

Recommended Threshold

The values in the Run Metrics section are listed as NA in the following situations:

The analysis was started from FASTQ files.
The analysis was started from BCL files and the InterOp files are missing or corrupt.
[NovaSeqX Plus only] There is no PCT_PF_READS value in NovaSeqX Plus runs, so the PCT_PF_READS value will always be NA.

Sample QC Metrics

Review the following metrics to assess sample data quality:

Metric

Description

Recommended Threshold

Variant Class

*The recommended threshold of 0.059 for GENE_SCALED_MAD only applies to real cell‑free DNA.

For troubleshooting information, refer to

DNA Expanded Metrics

DNA expanded metrics are provided for information only. They can be informative for troubleshooting but are provided without explicit specification limits and are not directly used for sample quality control. For additional guidance, contact Illumina Technical Support.

Metric

Description

Troubleshooting

TOTAL_PF_READS (count)

Total number of non-supplementary, non-secondary, and passing QC reads after alignment to the whole genome sequence.

Primarily driven by data output of sequencer, quality of library and balancing of library in library pool. If TOTAL_PF_READS is in line with other samples, but coverage metrics are more may suggest non-specific enrichment.

Low values for all samples indicate a poor quality run with possible low cluster numbers or low numbers of Q30 and PF%.

A low value for an individual sample indicates poor pooling of this library into the final pool.

MEAN_FAMILY_SIZE (count)

A UMI Family is a group of reads that all have the same UMI barcode. The family size is the number of reads in family. MEAN_FAMILY_SIZE is the mean of the entire population of reads assembled into UMI families.

The mean UMI family size decreases with increased unique read numbers, and more input DNA leads to more unique reads. Conversely over sequencing of a fixed population of unique DNA molecules leads to increased family size.

As a guide, for a good run with optimal cluster density, passing specs, even sample pooling, and good quality DNA we usually observe values <10.

UMI family size = 1 is not ideal as it is harder to correct for errors.

UMI family size of 2 to 5 enables efficient error correction without wasting sequencing capacity on high percentages of duplicate reads.

Coverage Reports

The gene and exon coverage report files are tab-separated value (TSV) files with coverage values matching respectively the exons and genes specified in the manifest file.

Block List

The following table lists the genes that have associated block listed sites. For the exact location of the block listed site, contact Illumina Technical Support.

Gene

Block List Sites

Gene

Block List Sites

Gene

Block List Sites

ABL1

FGFR2

144

Analysis Output

When the analysis run completes, the DRAGEN TruSight Oncology 500 ctDNA Analysis Software generates an analysis output folder in a specified location.

To view analysis output, navigate to the analysis output folder and select the files that you want to view.

Analysis Output Folder Structure

Single output folder structure is as follows.

Logs_Intermediates
- AdditionalSarjMetrics
- Annotation—Contains outputs for small variant annotation.
  - Subfolders per sample ID—Contains the aligned small variants JSON.
Results
- Metrics Output TSV (all Sample IDs)
- Sample ID—The following outputs are produced for each sample:

ICA Output Folder Structure

High-Level Folder Structure

Run ID
- TSO500_Nextflow_logs
- _manifest.json

TSO500_Nextflow_logs Folder Structure

TSO_500_Nextflow_Logs
- _manifest.json

Results Folder Structure

Contains the aggregated MetricsOutput.tsv file at the root level. Additionally, the Results folder contains a subfolder for each sample ID.

Results
- MetricsOutput.tsv
- Sample_1

The Results subfolder contains the following files:

Results
- MetricsOutput.tsv
- <Sample_id>

Logs_intermediates Folder Structure

Logs_intermediates
- AdditionalSarjMetrics
- Annotation

Input

Running Docker Container Location

The paths in the log messages refer to paths within the running docker container, not paths on the server.

Errors Folder Structure

Contains Errors.tsv. This file contains the summary of all the errors encountered during pipeline execution.

Errors
- Errors.tsv

NovaSeq 6000Dx Analysis Application Output Folder Structure

The following files and folders are created during analysis by NovaSeq 6000Dx Analysis Application:

analysisResults.json
CopyComplete.txt
edgeos.nextflow.config
inputs/

When the analysis run completes, the analysis application generates an analysis output in a specified location. To view analysis output, follow the steps below:

On the “Completed” runs tab, select the run
Review the run details page, and this will give the information to access the output folder
External Location: is the input for the run

Analysis Output

hashtagAnalysis Output Folder Structure

hashtagICA Output Folder Structure

hashtagHigh-Level Folder Structure

hashtagTSO500_Nextflow_logs Folder Structure

hashtagResults Folder Structure

hashtagLogs_intermediates Folder Structure

hashtagErrors Folder Structure

hashtagNovaSeq 6000Dx Analysis Application Output Folder Structure

Combined Variant Output

hashtagVariant Filtering Rules

Metrics Output

hashtagRun Metrics

hashtagSample QC Metrics

DNA Expanded Metrics

Coverage Reports

Block List

Combined Variant Output

hashtagVariant Filtering Rules

Analysis Output

hashtagAnalysis Output Folder Structure

hashtagICA Output Folder Structure

hashtagHigh-Level Folder Structure

hashtagTSO500_Nextflow_logs Folder Structure

hashtagResults Folder Structure

hashtagLogs_intermediates Folder Structure

hashtagErrors Folder Structure

hashtagNovaSeq 6000Dx Analysis Application Output Folder Structure

Coverage Reports

Metrics Output

hashtagRun Metrics

hashtagSample QC Metrics

DNA Expanded Metrics

Block List

Analysis Output Folder Structure

ICA Output Folder Structure

High-Level Folder Structure

TSO500_Nextflow_logs Folder Structure

Results Folder Structure

Logs_intermediates Folder Structure

Errors Folder Structure

NovaSeq 6000Dx Analysis Application Output Folder Structure

Variant Filtering Rules

Run Metrics

Sample QC Metrics

Variant Filtering Rules

Analysis Output Folder Structure

ICA Output Folder Structure

High-Level Folder Structure

TSO500_Nextflow_logs Folder Structure

Results Folder Structure

Logs_intermediates Folder Structure

Errors Folder Structure

NovaSeq 6000Dx Analysis Application Output Folder Structure

Run Metrics

Sample QC Metrics