DNA Output
Refer to DNA Analysis Methods for more information.
Small Variant Genome VCF
File name: {SAMPLE_ID}_hard-filtered.gvcf.gz
The small variant genome VCF file includes the variant call status for all targeted intervals, left-padded by 25 bp.
The Epidermal Growth Factor Receptor (EGFR) complex variant VCF includes phased EGFR variants. The FILTER column in the genome VCF determines the variant status. Refer to the following table for more information.
.
PASS
WT.
., A, C, G, etc1
low_depth²
Reference or filtered variant candidate with depth < 1000.
A, C, G, etc1
PASS
PASS variants.
A, C, G, etc1
weak_evidence
Filtered variant candidate with the following conditions:
Filtered variant candidate with low SQ score (< 2).
A, C, G, etc1
excluded_regions³
Position with high background noise. Not available for variant detection.
A, C, G, etc1
systematic_noise
Filtered variant candidate with low AQ score (< 20 for hotspots, < 60 for nonhotspots).
A, C, G, etc1
mapping_quality
Filtered variant candidate with low median mapping quality (< 30).
A, C, G, etc1
read_position
Filtered variant candidate showed bias clustered at fragment ends.
A, C, G, etc1
multiallelic
Filtered if there are two or more ALT alleles at this location.
A, C, G, etc1
low_frac_info_reads
Filtered if the fraction of informative reads is low (< 0.5).
¹ Etc refers to other variant types not mentioned in the table.
² Reference positions and nonpassing variants with coverage below 1000X directly translate into low_depth. For variant calls, low_depth is not applied when a position has a PASS filter.
³ This is a static list of regions compiled by Illumina. Email Illumina Technical Support for more information.
Small Variant Annotated JSON
File name: {SAMPLE_ID}_DNAVariants_Annotated.json.gz
The small variants annotated file provides variant annotation information for all non-reference positions in the VCF, which includes non-pass variants. The variant consequence definition is available on the Sequence Ontology website.
All pass variant calls are annotated using the Illumina Annotation Engine (IAE), also known as Nirvana, with the following information (using the RefSeq transcript):
HGNC Gene
Transcript
Exon
Consequence
c.HGVS
p.HGVS
COSMIC
TMB Trace
File name: {Sample_ID}.tmb.metrics.csv
The TMB metrics file contains the tumor mutational burden metrics for each DNA sample. The file format uses the following CSV column convention, similar to other metric CSV files.
Filtered Variant Count
Remaining variants after variant and germline filters.
Eligible Region (MB)
The specified custom regions, in megabases, that meet the minimum coverage threshold.
TMB
Filtered variants normalized by the eligible regions.
Sample Analysis Results JSON
File name: {SampleID}_SampleAnalysisResults.json
The sample analysis results file (SARJ) is an aggregated results file created for each sample. The SARJ file is used for the generation of downstream outputs. The file contains passing variants and passing variant annotations.
Copy Number Variants VCF
File name: {Sample_ID}_cnv.vcf
The copy number variants (CNV) file contains calls for DNA libraries of the CNV genes targeted by TruSight Oncology 500 ctDNA v2 and v1 assays. The CNV call indicates fold change results for each gene classified as reference, deletion, or amplification.
The value in the QUAL column of the copy number variants VCF is a Phred transformation of the p-value represented by the following equation:\
The p-value is derived from the t-test between the fold change (FC) of the gene against the rest of the genome. Higher Q-scores indicate higher confidence in the CNV call.
In the VCF notation, <DUP> indicates the detected FC is greater than a predefined amplification cutoff. <DEL> indicates the FC is less than a predefined deletion cutoff for that gene. This cutoff can vary from gene to gene.
Each copy number variant is reported as the fold change on normalized read depth in a testing sample relative to the normalized read depth in diploid genomes. Given tumor purity, the ploidy of a gene in the sample can be inferred from the reported fold change.\
Given tumor purity X%, for a reported fold change Y, the copy number n can be calculated by using the following equation:
For example, in a testing sample of tumor purity at 30%, MET with a fold change of 2.2x indicates that 10 copies of MET DNA are observed.
Copy Number Variants Metrics CSV
Copy number variant metrics are reported on a per sample level.
Sex Genotyper—The predicted sex of the sample.
Number of alignment records—The number of alignment records in the sample.
Bases in reference genome—The number of bases in the reference genome.
Average alignment coverage over genome—The average alignment coverage across the reference genome.
Number of filtered records (total)—The number of total filtered records.
Number of filtered records (duplicates)—The number of duplicated filtered records.
Number of filtered records (MAPQ)—The number of MAPQ filtered records.
Number of filtered records (unmapped)—The number of unmapped filtered records.
Number of target intervals—The number of target intervals in the sample.
Number of segments—The number of segments in the sample. Applicable only to CNV SLM mode.
Number of amplifications—The number of amplifications in the sample. Applicable only to CNV SLM mode.
Number of deletions—The number of deletions in the sample. Applicable only to CNV SLM mode.
Number of passing amplifications—The number of passing amplifications in the sample. Applicable only to CNV SLM mode.
Number of passing deletions—The number of passing deletions in the sample. Applicable only to CNV SLM mode.
Small Variant VCF
File name: {SampleID}_hard-filtered.vcf
The small variant file contains both phased variants and all other small variants. The header sections from both the phased variant (complex) VCF and the small variant VCF are included in this merged VCF. Variants that are found for both phased variants and small variants are only displayed as phased variants.
Fusions CSV
File name: {Sample_ID}_Fusions.csv
The fusions file contains all candidate fusions identified by the analysis pipeline.
The fusion columns are described in the following table. If you use Microsoft Excel to view this file, genes that are convertible to dates (for example, MARCH1) automatically convert to dd‑mm format (1-Mar).
Sample
Input sample ID.
Name
Fusion name as reported by the DRAGEN fusion caller.
Chr1
The chromosome of the first breakend.
Pos1
The position of the first breakend.
Chr2
The chromosome of the second breakend.
Pos2
The position of the second breakend.
Direction
The direction of how the breakends are joined.
Alt_Depth
The number of read-pairs supporting the fusion call.
Total_Depth
Max number of read-pairs aligned to a fusion breakend.
BP1_Depth
Number of read-pairs aligned to the first breakend.
BP2_Depth
Number of read-pairs aligned to the second breakend.
VAF
Variant allele frequency.
Gene1
Genes that overlap the first breakend.
Gene2
Genes that overlap the second breakend.
Contig
The fusion contig.
Filter
Indicates whether the fusion has passed all of the fusion filters.
Is_Cosmic_GenePair
Indicates whether the gene pair has been reported by COSMIC (True/False).
Fusion Directionality Known
Indicates whether the fusion direction is known, and indicated by the order of the genes (True/False).
The following table lists the meaning of the values in the direction column. The values are in the format used by Samtools.
L1R2
t[p[
The left of breakend1 is joined with the right of breakend2.
L1rL2
t]p]
The left of breakend1 is joined with the reverse complement of the left of breakend2.
L2R1
]p]t
The left of breakend2 is joined with the right of breakend1.
rR2R1
[p[t
The reverse complement of the right of breakend2 is joined with the right of breakend1.
Last updated