Analysis Output

When the analysis run completes, the DRAGEN TruSight Oncology 500 Analysis Software generates an analysis output folder in a specified location.

To view analysis output, navigate to the analysis output folder and select the files that you want to view.

Metrics Output

The MetricsOutput.tsv file contains the following quality control metrics for all samples:

  • DNA library QC metrics for:

    • Small variant calling

    • TMB

    • MSI

    • CNV

    • [HRD] GIS

  • RNA library QC metrics

  • Run QC metrics, analysis status, and contamination

This TSV file also includes expanded DNA library QC metrics per sample, based on total reads, collapsed reads, chimeric reads, and on-target reads. Analysis using RNA samples also produces RNA library QC metrics and expanded RNA library QC metrics per sample based on total reads and coverage.

The MetricsOutput.tsv file is a final combined metrics report with sample status, key analysis metrics, and metadata. Sample metrics within the report include suggested lower specification limits (LSL) and upper specification limits (USL) for each sample in the run.

For troubleshooting information, refer to [Troubleshooting](ADD LINK TO TROUBLESHOOTING)

Single Node Analysis Output Folder Structure

Single output folder structure is as follows.

  • Logs_Intermediates

    • AdditionalSarjMetrics— Contains per pair ID calculations to support the PCT_TARGET_250X metric.

    • Annotation—Contains outputs for small variant annotation.

      • Subfolders per sample ID—Contains the aligned small variants JSON.

  • CombinedVariantOutput

    • Subfolders per pair ID—Contains the combined variant output TSV files.

    • A combined output log file.

  • Contamination

    • Subfolders per DNA sample ID—Contains the contamination metrics JSON file and output logs.

  • DnaDragenCaller

    • Subfolders per sample ID—Contains the aligned BAM and index files, small variant VCF and gVCF, copy number variant VCF, MSI JSON, and QC outputs in CSV format.

  • DnaDragenExonCNVCaller

    • Subfolders per DNA sample ID—Contains the exon-level CNV JSON,the supporting calculation, and the QC files.

  • DnaFastqValidation—Contains the FASTQ validation output log for DNA samples.

  • FastqDownsample

    • Subfolders per RNA sample ID—Contains FASTQ files and output logs.

    • FastqDownsample output

  • Gis—Contains GIS-related files for HRD samples.

    • Subfolders per HRD sample ID—Contains the GIS JSON, the supporting calculation, and the QC files.

  • LrAnnotation

    • Subfolders per DNA sample ID—Contains the annotated exon-level CNV JSON.

  • LrCalculator

    • Subfolders per DNA sample ID—Contains the exon-level CNV VCF.

  • MetricsOutput

    • Subfolders per pair ID—Contains the metrics output TSV files.

    • A combined output log file.

  • ResourceVerification—Contains the resource file checksum verification logs.

  • RnaAnnotation

    • Subfolders per RNA sample ID—Contains the annotated splice variant JSON.

  • RnaDragenCaller

    • Subfolders per sample ID—Contains the aligned BAM, fusion candidates CSV, and QC outputs in CSV format.

  • RnaFastqValidation—Contains the FASTQ validation output log for RNA samples.

  • RnaFusion

    • Subfolders per RNA sample ID—Contains the All Fusions CSV and Fusion Processor logs.

  • RnaQcMetrics

    • Subfolders per RNA sample ID—Contains the RNA QC metrics JSON.

  • RnaSpliceVariantCalling

    • Subfolders per RNA sample ID—Contains the splice variants VCF.

  • Run QC—Contains the Run QC metrics JSON, Intermediate Run QC metrics JSON, and log file.

  • SampleAnalysisResults

    • Subfolders per pair ID—Contains the Sample Analysis Results JSON and detailed log file.

    • SampleSheetValidation—Contains the Intermediate sample sheet and validation log.

  • Tmb

    • Subfolders per DNA sample ID—Contains the TMB metrics CSV, TMB trace TSV, and related files and logs. passing_sample_steps.json —Contains the steps passed for each sample ID. pipeline_trace.txt—Contains a summary and troubleshooting file that lists each Nextflow task executed and the status (for example, COMPLETED or FAILED). run.log—Contains a complete trace-level log file describing the Nextflow pipeline execution. run_report.html—Contains high-level run statistics (performance, usage, etc.) run_timeline.html —Contains timeline-related information about the analysis run.

  • Results

    • Metrics Output TSV (all pair IDs)

  • Pair ID—The following outputs are produced for each sample:

    • Combined Variant Output TSV

    • Metrics Output TSV

    • TMB Trace TSV

    • Small Variant Genome VCF

    • Small Variant Genome Annotated JSON

    • Copy Number Variant VCF

    • GIS JSON

    • MSI JSON

    • Exon-level CNV VCF

    • Exon-level CNV Annotated JSON

    • All Fusion CSV

    • Splice Variant VCF

    • Splice Variant Annotated JSON

Multiple Node Analysis Output Folder Structure

Multiple output folder structure is as follows.

  • Demultiplex_Output

    • Logs_Intermediates—Contains FASTQ files per sample.

  • NodeX_Output—The following outputs are produced for each node used:

    • Logs_Intermediates

    • Results—Contains results only for the samples run on the node.

  • Gathered_Output

    • Results—Contains results for all samples from all nodes.

Last updated