Disease Association Filters
This page summarizes filters related to variant – disease and gene – disease associations. Filter availability can vary depending on the selected variant categories. If filters are applied to more than one variant category in the same condition group, only filters relevant for all variant categories are available. For more information, refer to Filter by Variant Category.
Genes
In addition to filtering by the gene list, variants can be filtered by genes based on their associations with diseases and phenotypes.
How to Create Disease/Phenotype-Based List
Select the genes filter.
In the displayed dialog box window, select the Include genes from the diseases checkbox.
Start typing a phenotype or disease to display a list of potential matches to add to the list.
Select a checkbox next to Resource to include it in the list.
Select a high, medium, or low confidence score.
Select an overlap distance between 0.00 and 1.00.
The disease and related diseases display in the Related Diseases area with the distance and gene count. Deselect any unnecessary related diseases.
Add any other genes to the gene list.
Add additional genes in the Additional genes area.
Select Apply to save changes to the gene list.
Resources of Gene - Disease Associations
The following tables detail the ontology sources that Connected Insights uses to determine relationships between genes and diseases.
Gene Disease Relationship Resources
Resource
Description
OMIM
Online Mendelian Inheritance in Man
HPO
Human Phenotype Ontology
Phenopedia
Human Genome Epidemiology (HuGE)
GEL PanelApp
Genomics England PanelApp
ILMN
• Clinvar – NCBI ClinVar •MedGen – NCBI portal to information about conditions and phenotypes related to Medical Genetics. •GTR – NCBI Genetic Testing Registry •GeneRIF – NCBI Gene Reference into Function
Disease Relationship Resources
Resource
Description
ICD-9
International Classification of Diseases, Ninth Revision
ICD-10
International Classification of Diseases, Tenth Revision
MeSH
Medical Subject Headings
UML
Unified Medical Language system. A repository of ontology resources.
SNOMEDCT
Systematized Nomenclature of Medicine Clinical Terms
Overlap Distance
Phenotype to gene search finds similar phenotypes and diseases across various ontology sources, independent of the underlying vocabulary in each source. If an equivalent concept does not exist across the sources, Connected Insights calculates the distance between nodes in the ontological hierarchies and assigns a score from 0 to 1, where:
Values closer to 0 indicate that the concepts are more equivalent. A value of 0 indicates that the concepts are the same.
Values closer to 1 indicate that the concepts are more dissimilar. A value of 1 indicates that the concepts can only be connected at the root node and are therefore excluded from query results.
The determination of distance accounts for the fact that sibling concepts on leaf nodes (eg, hypertrophic cardiomyopathy, and dilated cardiomyopathy) are more closely related to each other than siblings close to the root (eg, abnormal vascular morphology, and abnormal heart morphology).
Confidence
Confidence scores for gene - disease associations are calculated using the following rules:
Expert-curated data from OMIM, HPO, Phenopedia, and ClinVar are assigned a high confidence score.
High, moderate, or low confidence scores are converted from GeL PanelApp strong, medium, and low scores, respectively.
GTR confidence scores are based on information content metrics, which measure the specificity of a genetic test for a particular phenotype and a gene.
GeneRIF associations, which are derived using data mining, and assigned medium confidence.
COSMIC Cancer Gene Census
Filters variants based on gene role in cancer annotated by COSMIC Cancer Gene Census (CGC).
Cancer Gene Census
Role in Cancer
Description
TSG
Known tumor suppressor gene (TSG).
Oncogene
Known oncogene.
Fusion
Known fusion gene.
OMIM
Filters variants based on genes with known gene-disease associations in the OMIM database.
Present in OMIM — An OMIM entry exists for the gene.
Has associated OMIM phenotypes (including ?) — A relationship exists between the phenotype and a matching gene at the transcript level. Provisional relationships, indicated by "?" in OMIM, are included.
Has associated OMIM phenotypes (excluding ?) — A relationship exists between the phenotype and a matching gene at the transcript level. Provisional relationships, indicated by "?" in OMIM, are excluded.
Selecting associated phenotypes enables options to refine the filter by mode of inheritance.
Mode of Inheritance Descriptions
Mode of Inheritance
Description
AD
Autosomal Dominant
AR
Autosomal Recessive
XL
(X-linked)
XLD
(X-linked Dominant)
XLR
(X-linked Recessive)
YL
(Y-linked)
MI
Mitochondrial
Mu
Multifactorial
DD
Digenic Dominant
DR
Digenic Recessive
SMu
Somatic Mutation
SMo
Somatic Mosaicism
IC
Isolated Cases
COSMIC
Filters by presence in the COSMIC database. For more information, refer to Acronyms and Terms.
❕ The COSMIC filter is only selectable for small variants.
Cancer Hotspots
Filters by number of samples in cancer hotspots.
ClinVar
Filters on interpretation categories and the review status provided in the ClinVar database.
Interpretation Category Descriptions
Interpretation Category
Definition in Connected Insights
Pathogenic
The variant has at least one aggregate variant record (VCV entry) or aggregate variant –disease record (RCV) with classification category Pathogenic in the ClinVar database.
Likely Pathogenic, UncertainSignificance, Likely Benign, Benign
The variant has at least one aggregate variant record (VCV entry) or aggregate variant –disease record (RCV) with classification category Pathogenic in the ClinVar database.
None
The variant has no records in ClinVar or has at least one aggregate variant record (VCV entry)or aggregate variant – disease record (RCV) with interpretation categories Drug Response,Protective, and others (any other categories excluding Pathogenic, Likely Pathogenic,Uncertain Significance, Likely Benign, and Benign).
To filter by ClinVar review status, use the definitions provided from the ClinVar status review guidelines on the National Center for Biotechnology Information website.
Interpretation Category Descriptions
Number of Stars
Definition in Connected Insights
Review Status Descriptions
Four
The highest review status across all VCV and RCV records for the variant is four stars.
Practice guideline. For more information,refer to the ClinVar status review guidelines on the National Center for Biotechnology Information website.
Three
The highest review status across all VCVand RCV records for the variant is three stars.
Reviewed by export panel. For more information, refer to the ClinVar status review guidelines on the National Center for Biotechnology Information website.
Two
The highest review status across all VCVand RCV records for the variant is two stars.
Criteria provided, multiple submitters, no conflicts. Two or more submitters with assertion criteria and evidence (or a public contact) provided the same interpretation.
One
The highest review status across all VCV and RCV records for the variant is one star.
Criteria provided, conflicting interpretations. Multiple submitters provided assertion criteria and evidence(or a public contact) but there are conflicting interpretations. The independent values are enumerated for clinical significance.
None
The highest review status across all VCV and RCV records for the variant is no stars.
No assertion criteria provided. For more information, refer to the ClinVar status review guidelines on the National Center for Biotechnology Information website.
Last updated