DNA Output
Refer to DNA Analysis Methods for more information.
Small Variant gVCF
File name: {SAMPLE_ID}_hard-filtered.gvcf.gz
The small variant genome variant call file contains information on all candidate small variants evaluated, including complex variants up to 15 bp from phased variant calling across the entire TSO 500 panel.
The variant status is determined by the FILTER column in the genome VCF as follows.
Filter | Note |
---|---|
PASS | PASS variants. |
base_quality | Site filtered because median base quality of alt reads at this locus does not meet threshold. |
filtered_reads | Site filtered because the fraction of reads is too large. |
fragment_length | Site filtered because absolute difference between the median fragment length of alt reads and median fragment length of ref reads at this locus exceeds threshold. |
low_depth | Site filtered because the read depth is too low. |
low_frac_info_reads | Site filtered because the fraction of informative reads is below threshold. |
long_indel | Site filtered because the indel length is too long. |
mapping_quality | Site filtered because median mapping quality of alt reads at this locus does not meet threshold. |
multiallelic | Site filtered because more than two alt alleles pass tumor LOD. |
no_reliable_supporting_read | Site filtered because no reliable supporting somatic read exists. |
read_position | Site filtered because median of distances between start/end of read and this locus is below threshold. |
str_contraction | Site filtered due to suspected PCR error where the alt allele is one repeat unit less than the reference. |
too_few_supporting_reads | Site filtered because there are too few supporting reads in the tumor sample. |
weak_evidence | Somatic variant score (SQ) does not meet threshold. |
systematic_noise | Site filtered based on evidence of systematic noise in normal sample. |
excluded_regions | Site overlaps with VC excluded regions bed. |
Small Variant Annotated JSON
File name: {SAMPLE_ID}_DNAVariants_Annotated.json.gz
The small variants annotated file provides variant annotation information for all nonreference positions from the genome VCF including pass and nonpass variants.
TMB Trace
The TMB trace file provides comprehensive information on how the TMB value is calculated for a given sample. All passing small variants from the small variant filtering step are included in this file. To calculate the numerator of the TmbPerMb value in the TMB JSON, set the TSV file filter to use the IncludedInTMBNumerator with a value of True.
The TMB trace file is not intended to be used for variant inspections. The filtering statuses are exclusively set for TMB calculation purposes. Setting a filter does not translate into the classification of a variant as somatic or germline.
Column | Description |
---|---|
Chromosome | Chromosome |
Position | Position of variant |
RefCall | Reference base |
AltCall | Alternate base |
VAF | Variant allele frequency |
Depth | Coverage of position |
CytoBand | Cytoband of variant |
GeneName | Name of gene if applicable. A semicolon delimited list is used for multiple genes. |
VariantType | Type of the variant: SNV, insertion, deletion, MNV |
CosmicIDs | Cosmic IDs, if multiple concatenated by “;” |
MaxCosmicCount | Maximum Cosmic study count |
AlleleCountsGnomadExome | Variant allele count in gnomAD exome database |
AlleleCountsGnomadGenome | Variant allele count in gnomAD genome database |
AlleleCounts1000Genomes | Variant allele count in 1000 genomes database |
MaxDatabaseAlleleCounts | Maximum variant allele count over the three databases |
GermlineFilterDatabase | TRUE if variant was filtered by the database filter |
GermlineFilterProxi | TRUE if variant was filtered by the proxi filter |
CodingVariant | TRUE if variant is in the coding region |
Nonsynonymous | TRUE if variant has any transcript annotations with nonsynonymous consequences |
IncludedinTMBNumberator | TRUE if variant is used in the TMB calculation |
Copy Number VCF
The copy number VCF file contains CNV calls for DNA libraries of the amplification genes targeted by DRAGEN TruSight Oncology 500 Analysis Software. The CNV call indicates fold change results for each gene classified as reference, deletion, or amplification.
The value in the QUAL column of the VCF is a Phred transformation of the p-value where Q=-10xlog10(p-value). The p-value is derived from the t-test between the fold change of the gene against the rest of the genome. Higher Q-scores indicate higher confidence in the CNV call.
In the VCF notation, <DUP> indicates the detected fold change (FC) is greater than a predefined amplification cutoff. <DEL> indicates the detected FC is less than a predefined deletion cutoff for that gene. This cutoff can vary from gene to gene.
In analysis versions prior to v2.5, <DEL> calls in the VCF are marked as LowValidation. The LowValidation filter indicates that the calls have been validated only with in silico data sets and are provided as information only.
Each copy number variant is reported as a fold change on normalized read depth in a testing sample relative to the normalized read depth in diploid genomes. Given tumor purity, you can infer the ploidy of a gene in the sample from the reported fold change.
Given tumor purity X%, for a reported fold change Y, you can calculate the copy number n using the following equation:
For example, a tumor purity at 30% and a MET with fold change of 2.2x indicates that 10 copies of MET DNA are observed.
Last updated