This walk-through is intended to represent a typical workflow when building and studying a cohort of oncology cases.
Click Create Cohort
button.
Select the following studies to add to your cohort:
TCGA – BRCA – Breast Invasive Carcinoma
TCGA – Ovarian Serous Cystadenocarcinoma
Add a Cohort Name
= TCGA Breast and Ovarian_1472
Click on Apply
.
Expand Show query details
to see the study makeup of your cohort.
Charts
will be open by default. If not, click Show charts
Use the gear icon in the top-right to change viewable chart settings.
Tip:
Disease Type
,Histological Diagnosis
,Technology
,Overall Survival
have interesting data about this cohorts
The Subject
tab with all Subjects list is displayed below Charts with a link to each Subject by ID and other high-level information, like Data Types measured and reported. By clicking a subject ID, you will be brought to the data collected at the Subject level.
Search for subject TCGA-E2-A14Y
and view the data about this Subject.
Click the TCGA-E2-A14Y
Subject ID link to view clinical data for this Subject that was imported via the metadata.tsv file on ingest.
Note: the Subject is a 35 year old Female with vital status and other phenotypes that feed up into the
Subject
attribute selection criteria when creating or editing cohorts.
Click X
to close the Subject details.
Click Hide charts
to increase interactive landscape.
Click the Marker Frequency
tab, then click the Somatic Mutation
tab.
Review the gene list and mutation frequencies.
Note that PIK3CA has a high rate of mutation in the Cohort (ranked 2nd with 33% mutation frequency in 326 of the 987 Subjects that have Somatic Mutation data in this cohort).
Do Subjects with PIK3CA mutations have changes in PIK3CA RNA Expression?
Click the Gene Expression
tab, search for PIK3CA
PIK3CA RNA is down-regulated in 27% of the subjects relative to normal samples.
Switch from normal
to disease
Reference where the Subject’s denominator is the median of all disease samples in your cohort.
The count of matching vs. total subjects that have PIK3CA up-regulated RNA which may indicate a distinctive sub-phenotype.
Click directly on PIK3CA
gene link in the Gene Expression
table.
You are brought to the Gene
tab under the Gene Summary
sub-tab that lists information and links to public resources about PIK3CA.
Click the Variants
tab and Show legend and filters
if it does not open by default.
Below the interactive legend you see a set of analysis tracks: Needle Plot, Primate AI, Pathogenic variants, and Exons.
The Needle Plot allows toggling the plot by gnomAD frequency
and Sample Count
. Select Sample Count
in the Plot by
legend above the plot.
There are 87 mutations distributed across the 1068 amino acid sequence, listed below the analysis tracks. These can be exported via the icon into a table.
We know that missense variants can severely disrupt translated protein activity. Deselect all Variant Types
except for Missense
from the Show Variant Type
legend above the needle plot.
Many mutations are in the functional domains of the protein as seen by the colored boxes and labels on the x-axis of the Needle Plot.
Hover over the variant with the highest sample count in the yellow PI3Ka
protein domain.
The pop-up shows variant details for the 64 Subjects observed with it: 63 in the Breast Cancer study and 1 in the Ovarian Cancer Study.
Use the Exon zoom bar from each end of the Amino Acid sequence to zoom in to the PI3Ka
domain to better separate observations.
There are three different missense mutations at this locus changing the wildtype Glutamine at different frequencies to Lysine (64), Glycine (6), or Alanine (2).
The Pathogenic Variant
Track shows 7 ClinVar entries for mutations stacked at this locus affecting amino acid 545. Pop up details with pathogenicity calls, phenotypes, submitter and a link to the ClinVar entry is seen by hovering over the purple triangles.
Note the Primate AI
track and high Primate AI score.
Primate AI
track displays Scores for potential missense variants, based on polymorphisms observed in primate species. Points above the dashed line for the 75th percentile may be considered likely pathogenic as cross-species sequence is highly conserved; you often see high conservancy at the functional domains. Points below the 25th percentile may be considered "likely benign".
Click the Expression
tab and notice that normal Breast and normal Ovarian tissue have relatively high PIK3CA RNA Expression in GTex RNAseq tissue data but ubiquitously expressed.