1 of 12

Variants Tab

The variant grid can be found under the Variants tab within a case. Each tab represents a filtered list of the variants, and each row contains data for one variant. The data include biological information associated with the variant in various annotation sources. The variant grid provides a set of tools to sort, filter, flag, and review variants.

By default, the page displays a list of variants that meet the criteria of the filter configured in the test definition. Adding new filters creates additional tabs, with each having a list of variants meeting the criteria of the corresponding filters. For more information about applying filters, refer to .

The following table lists the contents of each column in the variant grid.

Variant Grid Columns

Modify Columns

The variant grid provides a configurable view of variants in a case. You can create new filtered views and customize the column configuration in the grid.

The configured view applies only to the selected case.

Modifying how data appear in the variant grid only affects how information is arranged in the table. Modifying views does not change the underlying data.

The test definition specifies one or more default tabs that provide filtered variant views. Each view resides on a different tab with a unique set of filters. The tab lists the name of the filter and the number of variants passing the filter. Refer to to learn how to create and modify variant filters.

You can create an unlimited number of tabs for as many additional views as necessary.

Show, Hide, or Reorder Columns

Show, hide, and reorder columns in the grid for a custom view of data in the tab. Changing the column order changes the view for all users.

Drag a column to move it.
Columns with additional data can be expanded by selecting (+More).
Select the menu next to the column name and change any of the following options: – Show or hide columns. – Pin a column. – Resize columns.\

The Interpret, Flag, and IGV columns in This Case section are always visible and cannot be hidden or moved.

Sort Columns

Each tab can support a different custom sort that includes up to 10 columns. Custom sort configuration is per-tab and persists across sessions.

An arrow in the column heading indicates that the column is sorted. If more than one column is sorted, a number indicates the sort level.

To sort a column, select the column heading.
To add a level of sort, hold the shift key and select a different column heading. Select the heading again to reverse the sort for the column.
To clear the sort, select any column without holding the shift key.

Save Column Configuration

Column configurations can be saved to appear in the same order for every case. Column configurations are saved together with a filter as part of the tab in the variant grid.

Show, hide, and organize columns as desired. If filtering logic is focused on a certain variant category, columns could be configured to view data relevant to that variant category.
Select Edit Variant Filters.
Select Save As.
Use the filter and column configuration for other cases by loading the filter or by adding the filter to default filters in Test Definitions.

Apply Variant Filters

Variants filters provide options for applying any combination of inclusion and exclusion criteria to the variants in a case. Filter criteria can vary depending on the selected variant categories. If filters are applied to more than one variant category in the same filter group, only filters relevant for all variant categories are available. For more information, refer to .

Each filter combination resides on a different tab in the variant grid. Default filter views are defined in the test definition. You can create filter tabs in the grid for as many additional views as necessary. Filters applied in the variant grid are specific to the selected case.

For more information about filter options, refer to and

Filters in the Test Definition

When you configure a new test, you can add one or more specific filters to the test definition. The filters become default filters and are applied to every case in your workgroup. The default filters are locked and shown in the first tabs of the variant grid. For more information, refer to .

The default filter tabs, indicated by a lock, cannot be altered or deleted for the cases already processed. To change or delete default filters, you must update the filters that are used in the test definition and reprocess the case and upcoming cases through the updated test definition.

Understanding the Demo Filter

Included with Connected Insights, a demonstration of the filtering is provided as a template for you to define your own filter views. In the primary filter group, the filter is set up to return all variants categories (ie small variants, copy number variants, structural variants, RNA fusion variants, and RNA splice variants) and requiring that these are called with a PASS by each of the variant callers. In the subsequent filter groups, the filter is set up to apply the following logic for each of the variant categories:

Small variants with coding consequences and population frequency ≤ 0.05 in gnomAD for AFR, AMR, EAS, NFE, SAS
Any copy number variants
Structural variants resulting in a unidirectional gene fusion with at least three supporting reads
RNA fusion variants resulting in a unidirectional gene fusion
RNA splice variants resulting in an exon loss

Create New Filters in the Variant Grid

Configure and modify case-specific filter views in new or unlocked variant grid tabs. The default filter tabs, indicated by a lock, cannot be altered or deleted.

Create a tab using one of the following methods:
- To create a filter, select New Filter.
- To copy an existing filter, select the tab drop-down arrow and then select Duplicate Filter.
- To load a new filter, select Load Filter.
  - Select or search for a filter to load from the list of compatible filters created and saved by all users in your workgroup. Filters with variant flags are only compatible to the cases using the same flag list. Select Apply.
[Optional] Double-click the tab label and enter a new name.
Select Edit Variant Filters.
Select Apply.

Lock a Filter

To lock a filter view, select the tab drop-down arrow, and then select Lock Filter. Locked filter views are indicated by a blue lock and cannot be deleted.

Edit Filter Name

To edit a filter name, select the tab drop-down arrow, and then select Edit Filter Name.
After editing the name, select Save.

Save Filter and Column Configurations

To save a filter view, select Edit Variant Filters, then select Save As. The filter is saved and can be configured in the test definitions to be a default filter and be used across cases. Column configurations and filter dependencies are saved with the filter.

Remove a Filter

To remove a filter view, delete the tab. Default tabs are indicated by a lock and cannot be deleted.

Export Filtered Variants

Export a list of variants and variant data to a tab-delimited file.

The maximum number of exported variants in a list is 7500. If the list exceeds the maximum, only the first 7500 results are included in the exported file.

Configure the filters and flags to show only the variants to export.
Select the tab drop-down arrow, and then select Export Grid as TSV.

Filter by Variant Category

Connected Insights provides users with the flexibility to apply a selection of filter criteria to each variant category supported in the software. The selection of variant categories impacts the set of filtering criteria that can be selected for a given filter group.

The following table summarizes filters available for each variant category:

Variant Categories

Variant Details Filters

This page summarizes filters related to variant details. Filter availability can vary depending on the selected variant categories. If filters are applied to more than one variant category in the same condition group, only filters relevant for all variant categories are available. For more information, refer to Filter by Variant Category.

Origin

Filters variants by Suspected Somatic or Predicted Germline origin.

You can select these options when creating or editing a variant filter by updating the Origin criterion. For example, if you do not want predicted germline variants, then add or update the Origin criterion to only include Suspected Somatic. For more information, refer to Variant Filters.

You can also add or edit a test definition to include either somatic or predicted germline variants through selecting the applicable variant filters in the Variant Filter(s) field. For more information, refer to Test Definition Setup.

For tumor-only analyses, when enabled in DRAGEN, variant origin is determined for small variants based on population frequency databases.

For tumor-normal analysis, when enabled in DRAGEN, variant origin is determined for small variants based on the presence or absence of the variant in the normal sample.

LOH Overlap

Filters small variants by overlap with an LOH event when LOH data is provided.

Genes

Filters variants by genes. There are two ways to create gene lists in the filter.

Using a list of gene names. To create a gene list, type or paste gene names in the Additional Genes field.
Using gene-disease associations from several sources. For more information, refer to Disease Association Filters.

Variant Type

Filters small variants, structural variants, and copy number variants by their types.

❗ The variant type is only selectable in a filter group with a single selected variant category as the variant types are tied to specific variant categories.

Consequences

Filters data by specific consequences.

❗ The consequence filter is only selectable in a condition group with a single selected variant category as the consequences are tied to specific variant categories.

Small Variant Consequences

When annotating transcripts with terms, Connected Insights uses the most specific term supported by the variant annotator.

Consequence filters return only the specified term and do not automatically include child terms. Specify the exact terms to include in the filter results.

Functional Consequences

Consequence

Description

Gain of Function Variant

The variant results in gain of function.

Loss of Function Variant

The variant results in loss of function.

These consequences are annotated when a variant has a biological assertion from any source with these consequences (for example, JAX-CKB or MyKnowledge Base).

Coding Sequences

Start and Stop Alterations

Filters data by the presence and location of start and stop alterations.

Start and Stop Alterations

Consequence

Description

Start Loss

The loss of a start codon in the coding sequence.

Stop Gained

The gain of a stop codon in the coding sequence.

Stop Loss

The loss of a stop codon in the coding sequence.

Incomplete Terminal Codon

A change to at least one base of the final codon of an incomplete annotated transcript.

Feature Elongation

The variant causes the extension of the genomic feature.

Feature Truncation

The variant causes the reduction of a genomic feature.

Splice Site

Filters data by the affected splice site.

Type

Description

Splice Acceptor Variant

The variant affects the canonical splice acceptor site (last two bases of the 3' end of the intron).

Splice Donor Variant

The variant affects the canonical splice donor site (first two bases of the 5' of the intron).

Splice Region Variant

An indel or substitution in a non coding splice region of the gene.

Indels

Type

Description

Frameshift Variant

An insertion or deletion in which the number of base pairs is not divisible by 3, causing a frame disruption.

Inframe Deletion

A deletion that does not disrupt the reading frame.

Inframe Insertion

An insertion that does not disrupt the reading frame.

Other

Type

Description

Missense Variant

A single base pair substitution that results in the translation of a different amino acid at the position.

Protein Altering Variant

The variant has a protein-altering coding consequence.

Coding Sequence Variant

The variant changes the coding sequence.

Silent Consequences

Filters data by the variant relationship to a gene.

Type

Description

Intergenic Variant

The variant position is not covered by any gene transcript.

Upstream Gene Variant

The variant position is within 5 kb upstream of the defined transcript start coordinate.

Downstream Gene Variant

The variant position is within 5 kb downstream of the defined transcript end coordinate.

Intron Variant

The variant occurs within an intron region.

3-prime UTR Variant

The variant is in the 3' untranslated region of a gene.

5-prime UTR Variant

The variant is in the 5' untranslated region of a gene.

Noncoding Transcript Exon Variant

The variant changes the noncoding exon sequence in a noncoding transcript.

Noncoding Transcript Variant

The variant occurs in a noncoding RNA gene.

Synonymous Variant

The variant does not affect the primary amino acid sequence of the translated protein.

Start Retained Variant

At least one base in the start codon is changed, but the start codon remains.

Stop Retained Variant

At least one base in the terminator code is changed, but the terminator remains.

Mature miRNA Variant

The variant occurs within a mature miRNA sequence.

NMD Transcript Variant

The variant is in a transcript and is the target of nonsense-mediated decay (NMD).

Regulatory Region Ablation

A deletion of a region that contains a regulatory region.

Regulatory Region Amplification

An amplification of a region that contains a regulatory region.

Regulatory Region Variation

The variant occurs in a regulatory region.

Structural Variant Consequences

When annotating transcripts with terms, Connected Insights uses the most specific terms supported by the variant annotator. Consequence filters return only the specified term and do not automatically include child terms. Specify the exact terms to include in the filter results.

Functional Consequences

Consequence

Description

Gain of Function Variant

The variant results in gain of function.

Loss of Function Variant

The variant results in loss of function.

These consequences are annotated when a variant has a biological assertion from any source with these consequences (for example, JAX-CKB or MyKnowledge Base).

Transcript Consequences

Filters data by the transcript consequence.

Consequence

Description

Transcript Variant

The variant changes the structure of the transcript.

Intron Variant

The variant is completely within the intron region of the gene.

Exon Variant

The variant is completely within the exon region of the gene.

Transcript Ablation

A deletion of the region that contains a transcript feature.

Transcript Amplification

An amplification of a region that contains a transcript.

Feature Elongation

The variant causes the extension of a genomic feature.

Feature Truncation

The variant causes the reduction of a genomic feature.

5-Prime Duplicated Transcript

A partially duplicated transcript in which the 5' end of the transcript is duplicated.

3-Prime Duplicated Transcript

A partially duplicated transcript in which the 3' end of the transcript is duplicated.

Gene Fusion Consequence

Filters data by the gene fusion consequence.

Consequence

Description

Unidirectional Gene Fusion

A fusion of two genes on the same strand.

Bidirectional Gene Fusion

A fusion of two genes on the opposite strand.

Gene Fusion

A fusion of two genes with ambiguous or unknown strand.

Copy Number Variant Consequences

When annotating transcripts with terms, Connected Insights uses the most specific term supported by the variant annotator. Consequence filters return only the specified term and do not automatically include child terms. Specify the exact terms to include in the filter results.

Consequence

Description

Gain of Function Variant

The variant results in gain of function.

Loss of Function Variant

The variant results in loss of function.

These consequences are annotated when a variant has a biological assertion from any source with these consequences (for example, JAX-CKB or MyKnowledge Base).

Transcript Consequences

Filters data by the transcript consequence.

Consequence

Description

Transcript Variant

The variant changes the structure of the transcript.

Intron Variant

The variant is completely within the intron region of the gene.

Exon Variant

The variant is completely within the exon region of the gene.

Transcript Ablation

A deletion of a region that contains a transcript feature.

Transcript Amplification

An amplification of a region that contains a transcript.

Transcript Truncation

A truncation of a region that contains a transcript.

Feature Elongation

The variant causes the extension of a genomic feature.

Feature Truncation

The variant causes the reduction of a genomic feature.

5-Prime Duplicated Transcript

A partially duplicated transcript in which the 5' end of the transcript is duplicated.

3-Prime Duplicated Transcript

A partially duplicated transcript in which the 3' end of the transcript is duplicated.

Loss of Heterozygosity

The variant results in loss of heterozygosity of the transcript.

Copy Number Variant Consequence Filters

Filters data by the copy number consequence.

Type

Description

Copy Number Increase

The copy number is increased relative to the reference sequence.

Copy Number Decrease

The copy number is decreased relative to the reference sequence.

Copy Number Change

The copy number is increased or decreased.

Intron

The variant is completely within the intron region of the gene.

Exon

The variant is completely within the exon region of the gene.

RNA Splice Variant Consequences

Consequence

Description

Gain of Function Variant

The variant results in gain of function.

Loss of Function Variant

The variant results in loss of function.

The functional consequences are annotated when a variant has a biological assertion from any source with these consequences (for example, JAX-CKB or My Knowledge Base).

RNA Splice Variant

Consequence

Description

Exon Loss Consequence

A loss of one or more exons in a gene.

RNA Fusion Variant Consequences

Consequence

Description

Gain of Function Variant

The variant results in gain of function.

Loss of Function Variant

The variant results in loss of function.

The functional consequences are annotated when a variant has a biological assertion from any source with these consequences (for example, CKB or My Knowledge Base).

RNA Splice Variant

Consequence

Description

Unidirectional Gene Fusion

A fusion of two genes on the same strand.

Transcript Variant

The variant changes the structure transcript.

Position (Chromosome)

Filters by specified chromosomes. If no chromosome is selected, the chromosome filter is not applied.

Position (Genomic Regions)

Filters by specified regions. The input format is chr#: start-stop, within multiple regions separated by spaces or new lines.

Change (Copy Number)

These values indicate a reference, deletion, or amplification of copy number variants.

❗ The change (copy number) filter is only selectable in a condition group with only the copy number variant category.

Change (Fold Change)

With copy number variants, the fold change value is derived from the normalized read depth of the gene in a sample. This depth is relative to the normalized ready depth of diploid regions in the same sample.

❗ The change (fold change) filter is only selectable in a condition group with only the copy number variant category.

Length

Filters data by variant length with resolution up to one bp.

Custom Annotations

Filters variants by Custom Annotations.

Variant Quality Filters

This page summarizes filters related to variant quality. Filter availability can vary depending on the selected variant categories. If filters are applied to more than one variant category in the same condition group, only filters relevant for all variant categories are available. For more information, refer to .

Filters

Filters data by the value provided for the variant in the FILTER column of the VCF file. Refer to the variant caller documentation to confirm possible values and recommended thresholds.

Quality

Filters data by the value provided for the variant in the QUAL column of the VCF file. Refer to the variant caller documentation to confirm possible values and recommended thresholds.

Sample Metrics

Filters data by read metrics, for example, VAF, allele depth, paired and read counts, split read count, and others.

Low Complexity Region

Excludes small variants in low complexity regions (LCRs).

For more information, refer to .

Variant Details Filters

Origin

Filters variants by Suspected Somatic or Predicted Germline origin.

For tumor-only analyses, when enabled in DRAGEN, variant origin is determined for small variants based on population frequency databases.

For tumor-normal analysis, when enabled in DRAGEN, variant origin is determined for small variants based on the presence or absence of the variant in the normal sample.

LOH Overlap

Filters small variants by overlap with an LOH event when LOH data is provided.

Genes

Filters variants by genes. There are two ways to create gene lists in the filter.

Using a list of gene names. To create a gene list, type or paste gene names in the Additional Genes field.
Using gene-disease associations from several sources. For more information, refer to Disease Association Filters.

Variant Type

Filters small variants, structural variants, and copy number variants by their types.

❗ The variant type is only selectable in a filter group with a single selected variant category as the variant types are tied to specific variant categories.

Consequences

Filters data by specific consequences.

❗ The consequence filter is only selectable in a condition group with a single selected variant category as the consequences are tied to specific variant categories.

Small Variant Consequences

When annotating transcripts with terms, Connected Insights uses the most specific term supported by the variant annotator.

Consequence filters return only the specified term and do not automatically include child terms. Specify the exact terms to include in the filter results.

Functional Consequences

Consequence

Description

Gain of Function Variant

The variant results in gain of function.

Loss of Function Variant

The variant results in loss of function.

These consequences are annotated when a variant has a biological assertion from any source with these consequences (for example, JAX-CKB or MyKnowledge Base).

Coding Sequences

Start and Stop Alterations

Filters data by the presence and location of start and stop alterations.

Start and Stop Alterations

Consequence

Description

Start Loss

The loss of a start codon in the coding sequence.

Stop Gained

The gain of a stop codon in the coding sequence.

Stop Loss

The loss of a stop codon in the coding sequence.

Incomplete Terminal Codon

A change to at least one base of the final codon of an incomplete annotated transcript.

Feature Elongation

The variant causes the extension of the genomic feature.

Feature Truncation

The variant causes the reduction of a genomic feature.

Splice Site

Filters data by the affected splice site.

Type

Description

Splice Acceptor Variant

The variant affects the canonical splice acceptor site (last two bases of the 3' end of the intron).

Splice Donor Variant

The variant affects the canonical splice donor site (first two bases of the 5' of the intron).

Splice Region Variant

An indel or substitution in a non coding splice region of the gene.

Indels

Type

Description

Frameshift Variant

An insertion or deletion in which the number of base pairs is not divisible by 3, causing a frame disruption.

Inframe Deletion

A deletion that does not disrupt the reading frame.

Inframe Insertion

An insertion that does not disrupt the reading frame.

Other

Type

Description

Missense Variant

A single base pair substitution that results in the translation of a different amino acid at the position.

Protein Altering Variant

The variant has a protein-altering coding consequence.

Coding Sequence Variant

The variant changes the coding sequence.

Silent Consequences

Filters data by the variant relationship to a gene.

Type

Description

Intergenic Variant

The variant position is not covered by any gene transcript.

Upstream Gene Variant

The variant position is within 5 kb upstream of the defined transcript start coordinate.

Downstream Gene Variant

The variant position is within 5 kb downstream of the defined transcript end coordinate.

Intron Variant

The variant occurs within an intron region.

3-prime UTR Variant

The variant is in the 3' untranslated region of a gene.

5-prime UTR Variant

The variant is in the 5' untranslated region of a gene.

Noncoding Transcript Exon Variant

The variant changes the noncoding exon sequence in a noncoding transcript.

Noncoding Transcript Variant

The variant occurs in a noncoding RNA gene.

Synonymous Variant

The variant does not affect the primary amino acid sequence of the translated protein.

Start Retained Variant

At least one base in the start codon is changed, but the start codon remains.

Stop Retained Variant

At least one base in the terminator code is changed, but the terminator remains.

Mature miRNA Variant

The variant occurs within a mature miRNA sequence.

NMD Transcript Variant

The variant is in a transcript and is the target of nonsense-mediated decay (NMD).

Regulatory Region Ablation

A deletion of a region that contains a regulatory region.

Regulatory Region Amplification

An amplification of a region that contains a regulatory region.

Regulatory Region Variation

The variant occurs in a regulatory region.

Structural Variant Consequences

Functional Consequences

Consequence

Description

Gain of Function Variant

The variant results in gain of function.

Loss of Function Variant

The variant results in loss of function.

These consequences are annotated when a variant has a biological assertion from any source with these consequences (for example, JAX-CKB or MyKnowledge Base).

Transcript Consequences

Filters data by the transcript consequence.

Consequence

Description

Transcript Variant

The variant changes the structure of the transcript.

Intron Variant

The variant is completely within the intron region of the gene.

Exon Variant

The variant is completely within the exon region of the gene.

Transcript Ablation

A deletion of the region that contains a transcript feature.

Transcript Amplification

An amplification of a region that contains a transcript.

Feature Elongation

The variant causes the extension of a genomic feature.

Feature Truncation

The variant causes the reduction of a genomic feature.

5-Prime Duplicated Transcript

A partially duplicated transcript in which the 5' end of the transcript is duplicated.

3-Prime Duplicated Transcript

A partially duplicated transcript in which the 3' end of the transcript is duplicated.

Gene Fusion Consequence

Filters data by the gene fusion consequence.

Consequence

Description

Unidirectional Gene Fusion

A fusion of two genes on the same strand.

Bidirectional Gene Fusion

A fusion of two genes on the opposite strand.

Gene Fusion

A fusion of two genes with ambiguous or unknown strand.

Copy Number Variant Consequences

Consequence

Description

Gain of Function Variant

The variant results in gain of function.

Loss of Function Variant

The variant results in loss of function.

These consequences are annotated when a variant has a biological assertion from any source with these consequences (for example, JAX-CKB or MyKnowledge Base).

Transcript Consequences

Filters data by the transcript consequence.

Consequence

Description

Transcript Variant

The variant changes the structure of the transcript.

Intron Variant

The variant is completely within the intron region of the gene.

Exon Variant

The variant is completely within the exon region of the gene.

Transcript Ablation

A deletion of a region that contains a transcript feature.

Transcript Amplification

An amplification of a region that contains a transcript.

Transcript Truncation

A truncation of a region that contains a transcript.

Feature Elongation

The variant causes the extension of a genomic feature.

Feature Truncation

The variant causes the reduction of a genomic feature.

5-Prime Duplicated Transcript

A partially duplicated transcript in which the 5' end of the transcript is duplicated.

3-Prime Duplicated Transcript

A partially duplicated transcript in which the 3' end of the transcript is duplicated.

Loss of Heterozygosity

The variant results in loss of heterozygosity of the transcript.

Copy Number Variant Consequence Filters

Filters data by the copy number consequence.

Type

Description

Copy Number Increase

The copy number is increased relative to the reference sequence.

Copy Number Decrease

The copy number is decreased relative to the reference sequence.

Copy Number Change

The copy number is increased or decreased.

Intron

The variant is completely within the intron region of the gene.

Exon

The variant is completely within the exon region of the gene.

RNA Splice Variant Consequences

Consequence

Description

Gain of Function Variant

The variant results in gain of function.

Loss of Function Variant

The variant results in loss of function.

The functional consequences are annotated when a variant has a biological assertion from any source with these consequences (for example, JAX-CKB or My Knowledge Base).

RNA Splice Variant

Consequence

Description

Exon Loss Consequence

A loss of one or more exons in a gene.

RNA Fusion Variant Consequences

Consequence

Description

Gain of Function Variant

The variant results in gain of function.

Loss of Function Variant

The variant results in loss of function.

The functional consequences are annotated when a variant has a biological assertion from any source with these consequences (for example, CKB or My Knowledge Base).

RNA Splice Variant

Consequence

Description

Unidirectional Gene Fusion

A fusion of two genes on the same strand.

Transcript Variant

The variant changes the structure transcript.

Position (Chromosome)

Filters by specified chromosomes. If no chromosome is selected, the chromosome filter is not applied.

Position (Genomic Regions)

Filters by specified regions. The input format is chr#: start-stop, within multiple regions separated by spaces or new lines.

Change (Copy Number)

These values indicate a reference, deletion, or amplification of copy number variants.

❗ The change (copy number) filter is only selectable in a condition group with only the copy number variant category.

Change (Fold Change)

❗ The change (fold change) filter is only selectable in a condition group with only the copy number variant category.

Length

Filters data by variant length with resolution up to one bp.

Custom Annotations

Filters variants by Custom Annotations.

Disease Association Filters

This page summarizes filters related to variant – disease and gene – disease associations. Filter availability can vary depending on the selected variant categories. If filters are applied to more than one variant category in the same condition group, only filters relevant for all variant categories are available. For more information, refer to Filter by Variant Category.

Genes

In addition to filtering by the gene list, variants can be filtered by genes based on their associations with diseases and phenotypes.

How to Create Disease/Phenotype-Based List

Select the genes filter.
- In the displayed dialog box window, select the Include genes from the diseases checkbox.
- Start typing a phenotype or disease to display a list of potential matches to add to the list.
- Select a checkbox next to Resource to include it in the list.
- Select a high, medium, or low confidence score.
- Select an overlap distance between 0.00 and 1.00.
- The disease and related diseases display in the Related Diseases area with the distance and gene count. Deselect any unnecessary related diseases.
- Add any other genes to the gene list.
- Add additional genes in the Additional genes area.
Select Apply to save changes to the gene list.

Resources of Gene - Disease Associations

The following tables detail the ontology sources that Connected Insights uses to determine relationships between genes and diseases.

Gene Disease Relationship Resources

Resource

Description

OMIM

Online Mendelian Inheritance in Man

HPO

Human Phenotype Ontology

Phenopedia

Human Genome Epidemiology (HuGE)

GEL PanelApp

Genomics England PanelApp

ILMN

• Clinvar – NCBI ClinVar •MedGen – NCBI portal to information about conditions and phenotypes related to Medical Genetics. •GTR – NCBI Genetic Testing Registry •GeneRIF – NCBI Gene Reference into Function

Disease Relationship Resources

Resource

Description

ICD-9

International Classification of Diseases, Ninth Revision

ICD-10

International Classification of Diseases, Tenth Revision

MeSH

Medical Subject Headings

UML

Unified Medical Language system. A repository of ontology resources.

SNOMEDCT

Systematized Nomenclature of Medicine Clinical Terms

Overlap Distance

Phenotype to gene search finds similar phenotypes and diseases across various ontology sources, independent of the underlying vocabulary in each source. If an equivalent concept does not exist across the sources, Connected Insights calculates the distance between nodes in the ontological hierarchies and assigns a score from 0 to 1, where:

Values closer to 0 indicate that the concepts are more equivalent. A value of 0 indicates that the concepts are the same.
Values closer to 1 indicate that the concepts are more dissimilar. A value of 1 indicates that the concepts can only be connected at the root node and are therefore excluded from query results.

The determination of distance accounts for the fact that sibling concepts on leaf nodes (eg, hypertrophic cardiomyopathy, and dilated cardiomyopathy) are more closely related to each other than siblings close to the root (eg, abnormal vascular morphology, and abnormal heart morphology).

Confidence

Confidence scores for gene - disease associations are calculated using the following rules:

Expert-curated data from OMIM, HPO, Phenopedia, and ClinVar are assigned a high confidence score.
High, moderate, or low confidence scores are converted from GeL PanelApp strong, medium, and low scores, respectively.
GTR confidence scores are based on information content metrics, which measure the specificity of a genetic test for a particular phenotype and a gene.
GeneRIF associations, which are derived using data mining, and assigned medium confidence.

COSMIC Cancer Gene Census

Filters variants based on gene role in cancer annotated by COSMIC Cancer Gene Census (CGC).

Cancer Gene Census

Role in Cancer

Description

TSG

Known tumor suppressor gene (TSG).

Oncogene

Known oncogene.

Fusion

Known fusion gene.

OMIM

Filters variants based on genes with known gene-disease associations in the OMIM database.

Present in OMIM — An OMIM entry exists for the gene.
Has associated OMIM phenotypes (including ?) — A relationship exists between the phenotype and a matching gene at the transcript level. Provisional relationships, indicated by "?" in OMIM, are included.
Has associated OMIM phenotypes (excluding ?) — A relationship exists between the phenotype and a matching gene at the transcript level. Provisional relationships, indicated by "?" in OMIM, are excluded.

Selecting associated phenotypes enables options to refine the filter by mode of inheritance.

Mode of Inheritance Descriptions

Mode of Inheritance

Description

Autosomal Dominant

Autosomal Recessive

(X-linked)

XLD

(X-linked Dominant)

XLR

(X-linked Recessive)

(Y-linked)

Mitochondrial

Multifactorial

Digenic Dominant

Digenic Recessive

SMu

Somatic Mutation

SMo

Somatic Mosaicism

Isolated Cases

COSMIC

Filters by presence in the COSMIC database. For more information, refer to Acronyms and Terms.

❕ The COSMIC filter is only selectable for small variants.

Cancer Hotspots

Filters by number of samples in cancer hotspots.

ClinVar

Filters on interpretation categories and the review status provided in the ClinVar database.

Interpretation Category Descriptions

Interpretation Category

Definition in Connected Insights

Pathogenic

The variant has at least one aggregate variant record (VCV entry) or aggregate variant –disease record (RCV) with classification category Pathogenic in the ClinVar database.

Likely Pathogenic, UncertainSignificance, Likely Benign, Benign

The variant has at least one aggregate variant record (VCV entry) or aggregate variant –disease record (RCV) with classification category Pathogenic in the ClinVar database.

None

The variant has no records in ClinVar or has at least one aggregate variant record (VCV entry)or aggregate variant – disease record (RCV) with interpretation categories Drug Response,Protective, and others (any other categories excluding Pathogenic, Likely Pathogenic,Uncertain Significance, Likely Benign, and Benign).

To filter by ClinVar review status, use the definitions provided from the ClinVar status review guidelines on the National Center for Biotechnology Information website.

Interpretation Category Descriptions

Number of Stars

Definition in Connected Insights

Review Status Descriptions

Four

The highest review status across all VCV and RCV records for the variant is four stars.

Practice guideline. For more information,refer to the ClinVar status review guidelines on the National Center for Biotechnology Information website.

Three

The highest review status across all VCVand RCV records for the variant is three stars.

Reviewed by export panel. For more information, refer to the ClinVar status review guidelines on the National Center for Biotechnology Information website.

Two

The highest review status across all VCVand RCV records for the variant is two stars.

Criteria provided, multiple submitters, no conflicts. Two or more submitters with assertion criteria and evidence (or a public contact) provided the same interpretation.

One

The highest review status across all VCV and RCV records for the variant is one star.

Criteria provided, conflicting interpretations. Multiple submitters provided assertion criteria and evidence(or a public contact) but there are conflicting interpretations. The independent values are enumerated for clinical significance.

None

The highest review status across all VCV and RCV records for the variant is no stars.

No assertion criteria provided. For more information, refer to the ClinVar status review guidelines on the National Center for Biotechnology Information website.