Analysis Output

When the analysis run completes, the software generates an analysis output in a specified location with the folder name /staging/DRAGEN_Solid_WGS_Tumor_Normal_Pipeline_{version}_Analysis_{datetimestamp}. In ICA, analysis output is listed in the Output section of the analysis, where the folder name is a combination of user reference, pipeline name, and a UUID.

Within the analysis folder, each analysis step generates a subfolder within the Logs_Intermediates folder.

Output Folders

๐Ÿ“‚ Results - Contains the final result files from the pipeline.

๐Ÿ“„ MetricsOutput.tsv - Contains summary metrics for all samples.

๐Ÿ“‚ Case1

๐Ÿ“„ Case1_MetricsOutput.tsv - Contains summary metrics for tumor and normal samples for Case1.

๐Ÿ“‚ TumorSample1

๐Ÿ“„ TumorSample1.hard-filtered.vcf.gz - Contains somatic small variant calls.

๐Ÿ“„ TumorSample1.cnv.vcf.gz - Contains somatic copy number variant calls.

๐Ÿ“„ TumorSample1.sv.vcf.gz - Contains somatic structural variant calls.

๐Ÿ“„ TumorSample1_SNV_Tumor_Annotated.json.gz - Contains somatic small variant annotations.

๐Ÿ“„ TumorSample1_CNV_Tumor_Annotated.json.gz - Contains somatic copy number variant annotations.

๐Ÿ“„ TumorSample1_SV_Tumor_Annotated.json.gz - Contains somatic structural variant annotations.

๐Ÿ“„ TumorSample1.tmb.metrics.csv - Contains the TMB result and metrics.

๐Ÿ“„ TumorSample1.microsat_output.json - Contains the MSI result and metrics.

๐Ÿ“„ TumorSample1.hrdscore.csv - Contains the HRD result and metrics.

๐Ÿ“„ TumorSample1.tn.bw - Contains tangent normalized somatic coverage in BigWig format.

๐Ÿ“„ TumorSample1.tumor.baf.bedgraph.gz - Contains somatic b-allele frequency in BedGraph format.

๐Ÿ“„ TumorSample1.bam - Contains aligned somatic reads in BAM format.

๐Ÿ“„ TumorSample1.bam.bai - Contains index of aligned somatic reads in BAI format.

๐Ÿ“‚ NormalSample1

๐Ÿ“„ NormalSample1.hard-filtered.vcf.gz - Contains germline small variant calls.

๐Ÿ“„ NormalSample1.cnv.vcf.gz - Contains germline copy number variant calls.

๐Ÿ“„ NormalSample1.sv.vcf.gz - Contains germline structural variant calls.

๐Ÿ“„ NormalSample1.repeats.vcf.gz - Contains germline short tandem repeat calls.

๐Ÿ“„ NormalSample1.vntr.vcf.gz - Contains germline variable number tandem repeat calls.

๐Ÿ“„ NormalSample1.targeted.vcf.gz - Contains germline targeted (star allele) calls.

๐Ÿ“„ NormalSample1.targeted.json - Contains germline targeted (star allele) data in JSON format.

๐Ÿ“„ NormalSample1_SNV_Normal_Annotated.json.gz - Contains germline small variant annotations.

๐Ÿ“„ NormalSample1_CNV_Normal_Annotated.json.gz - Contains germline copy number variant annotations.

๐Ÿ“„ NormalSample1SV_Normal_Annotated.json.gz - Contains germline structural variant annotations.

๐Ÿ“„ NormalSample1.hla.tsv - Contains germline HLA typing calls.

๐Ÿ“„ NormalSample1.bam - Contains aligned germline reads in BAM format.

๐Ÿ“„ NormalSample1.bam.bai - Contains index of aligned germline reads in BAI format.

๐Ÿ“‚ Logs_Intermediates - Contains all intermediate files for each step of the pipeline (BAMs moved to the Results folder).

๐Ÿ“‚ ResourceVerification

๐Ÿ“‚ SampleSheetValidation

๐Ÿ“‚ NormalFastqValidation

๐Ÿ“‚ TumorFastqValidation

๐Ÿ“‚ DragenCaller

๐Ÿ“‚ TumorNormalVariantCaller

๐Ÿ“‚ Tmb

๐Ÿ“‚ Annotation

๐Ÿ“‚ SampleAnalysisResults

๐Ÿ“‚ AdditionalSarjMetrics

๐Ÿ“‚ MetricsOutput

๐Ÿ“‚ Work - (DRAGEN server only) Contains information and files related to Nextflow execution.

File Overview

This section describes the summary output files generated during analysis.

Metrics Output

The metrics output file is a final combined metrics report that provides sample status, key analysis metrics, and metadata in a tab-separated values (TSV) file. Sample metrics within the report indicate guideline-suggested lower specification limits (LSL) and upper specification limits (USL) for each sample in the run. One metrics output file is generated for the entire run. An additional file is generated for each case.

Normal DNA Input QC Metrics

Metric (UOM)

LSL Guideline

USL Guideline

TOTAL_INPUT_READS (Count)

NA

NA

PCT_MAPPED_READS (%)

90.00

NA

PCT_PROPERLY_PAIRED_READS (%)

90.00

NA

PCT_Q30_BASES (%)

80.00

NA

PCT_SOFT_CLIPPED_BASES_R1 (%)

NA

10.0

PCT_SOFT_CLIPPED_BASES_R2 (%)

NA

10.0

PCT_SUPPLEMENTARY_(CHIMERIC)_ALIGNMENTS (%)

NA

15.0

ESTIMATED_READ_LENGTH (bp)

NA

NA

MEAN_INSERT_LENGTH (bp)

NA

NA

MEDIAN_INSERT_LENGTH (bp)

NA

NA

INPUT_BASES_OVER_REFERENCE_GENOME_SIZE (Count)

NA

NA

ESTIMATED_SAMPLE_CONTAMINATION (%)

NA

2.00

Normal DNA Dedup/UMI QC Metrics

Metric (UOM)

LSL Guideline

USL Guideline

PCT_DUPLICATE_MARKED_READS (%)

NA

20.00

PCT_READS_WITH_VALID_OR_CORRECTABLE_UMIS (%)

NA

NA

PCT_READS_IN_DISCARDED_FAMILIES (%)

NA

NA

PCT_READS_FILTERED_OUT (%)

NA

NA

PCT_READS_WITH_UNCORRECTABLE_UMIS (%)

NA

NA

TOTAL_NUMBER_OF_FAMILIES (Count)

NA

NA

FAMILIES_DISCARDED (Count)

NA

NA

DUPLEX_FAMILIES (Count)

NA

NA

MEAN_FAMILY_DEPTH (Count)

NA

NA

Normal DNA Coverage QC Metrics

Metric (UOM)

LSL Guideline

USL Guideline

AVERAGE_GENOME_COVERAGE (Count)

20.00

NA

PCT_UNIFORMITY_OF_COVERAGE_OVER_20PCT_OF_MEAN (%)

50.00

NA

PCT_GENOME_20X (%)

80.00

NA

Tumor DNA Input QC Metrics

Metric (UOM)

LSL Guideline

USL Guideline

TOTAL_INPUT_READS (Count)

NA

NA

PCT_MAPPED_READS (%)

90.00

NA

PCT_PROPERLY_PAIRED_READS (%)

90.00

NA

PCT_Q30_BASES (%)

80.00

NA

PCT_SOFT_CLIPPED_BASES_R1 (%)

NA

10.0

PCT_SOFT_CLIPPED_BASES_R2 (%)

NA

10.0

PCT_SUPPLEMENTARY_(CHIMERIC)_ALIGNMENTS (%)

NA

15.0

ESTIMATED_READ_LENGTH (bp)

NA

NA

MEAN_INSERT_LENGTH (bp)

NA

NA

MEDIAN_INSERT_LENGTH (bp)

NA

NA

INPUT_BASES_OVER_REFERENCE_GENOME_SIZE (Count)

NA

NA

ESTIMATED_SAMPLE_CONTAMINATION (%)

NA

2.00

Tumor DNA Dedup/UMI QC Metrics

Metric (UOM)

LSL Guideline

USL Guideline

PCT_DUPLICATE_MARKED_READS (%)

NA

20.00

PCT_READS_WITH_VALID_OR_CORRECTABLE_UMIS (%)

NA

NA

PCT_READS_IN_DISCARDED_FAMILIES (%)

NA

NA

PCT_READS_FILTERED_OUT (%)

NA

NA

PCT_READS_WITH_UNCORRECTABLE_UMIS (%)

NA

NA

TOTAL_NUMBER_OF_FAMILIES (Count)

NA

NA

FAMILIES_DISCARDED (Count)

NA

NA

DUPLEX_FAMILIES (Count)

NA

NA

MEAN_FAMILY_DEPTH (Count)

NA

NA

Tumor DNA Coverage QC Metrics

Metric (UOM)

LSL Guideline

USL Guideline

AVERAGE_GENOME_COVERAGE (Count)

20.00

NA

PCT_UNIFORMITY_OF_COVERAGE_OVER_20PCT_OF_MEAN (%)

50.00

NA

PCT_GENOME_20X (%)

80.00

NA

Tumor DNA T/N Sample Match QC Metrics

Metric (UOM)

LSL Guideline

USL Guideline

OUTLIER_BAF_FRACTION (NA)

NA

0.90

Tumor DNA Purity QC Metrics

Metric (UOM)

LSL Guideline

USL Guideline

ESTIMATED_PURITY (%)

20.00

NA

Last updated

Was this helpful?