Oncology Walk-through
This walk-through is intended to represent a typical workflow when building and studying a cohort of oncology cases.
Create a Cancer Cohort and View Subject Details
Click
Create Cohort
button.Select the following studies to add to your cohort:
TCGA – BRCA – Breast Invasive Carcinoma
TCGA – Ovarian Serous Cystadenocarcinoma
Add a
Cohort Name
= TCGA Breast and Ovarian_1472Click on
Apply
.Expand
Show query details
to see the study makeup of your cohort.Charts
will be open by default. If not, clickShow charts
Use the gear icon in the top-right to change viewable chart settings.
Tip:
Disease Type
,Histological Diagnosis
,Technology
,Overall Survival
have interesting data about this cohortsThe
Subject
tab with all Subjects list is displayed below Charts with a link to each Subject by ID and other high-level information, like Data Types measured and reported. By clicking a subject ID, you will be brought to the data collected at the Subject level.Search for subject
TCGA-E2-A14Y
and view the data about this Subject.Click the
TCGA-E2-A14Y
Subject ID link to view clinical data for this Subject that was imported via the metadata.tsv file on ingest.Note: the Subject is a 35 year old Female with vital status and other phenotypes that feed up into the
Subject
attribute selection criteria when creating or editing cohorts.Click
X
to close the Subject details.Click
Hide charts
to increase interactive landscape.
Data Analysis, Multi-Omic Biomarker Discovery, and Interpretation
Click the
Marker Frequency
tab, then click theSomatic Mutation
tab.Review the gene list and mutation frequencies.
Note that PIK3CA has a high rate of mutation in the Cohort (ranked 2nd with 33% mutation frequency in 326 of the 987 Subjects that have Somatic Mutation data in this cohort).
Do Subjects with PIK3CA mutations have changes in PIK3CA RNA Expression?
Click the
Gene Expression
tab, search forPIK3CA
PIK3CA RNA is down-regulated in 27% of the subjects relative to normal samples.
Switch from
normal
todisease
Reference where the Subject’s denominator is the median of all disease samples in your cohort.The count of matching vs. total subjects that have PIK3CA up-regulated RNA which may indicate a distinctive sub-phenotype.
Click directly on
PIK3CA
gene link in theGene Expression
table.You are brought to the
Gene
tab under theGene Summary
sub-tab that lists information and links to public resources about PIK3CA.Click the
Variants
tab andShow legend and filters
if it does not open by default.Below the interactive legend you see a set of analysis tracks: Needle Plot, Primate AI, Pathogenic variants, and Exons.
The Needle Plot allows toggling the plot by
gnomAD frequency
andSample Count
. SelectSample Count
in thePlot by
legend above the plot.There are 87 mutations distributed across the 1068 amino acid sequence, listed below the analysis tracks. These can be exported via the icon into a table.
We know that missense variants can severely disrupt translated protein activity. Deselect all
Variant Types
except forMissense
from theShow Variant Type
legend above the needle plot.Many mutations are in the functional domains of the protein as seen by the colored boxes and labels on the x-axis of the Needle Plot.
Hover over the variant with the highest sample count in the yellow
PI3Ka
protein domain.The pop-up shows variant details for the 64 Subjects observed with it: 63 in the Breast Cancer study and 1 in the Ovarian Cancer Study.
Use the Exon zoom bar from each end of the Amino Acid sequence to zoom in to the
PI3Ka
domain to better separate observations.There are three different missense mutations at this locus changing the wildtype Glutamine at different frequencies to Lysine (64), Glycine (6), or Alanine (2).
The
Pathogenic Variant
Track shows 7 ClinVar entries for mutations stacked at this locus affecting amino acid 545. Pop up details with pathogenicity calls, phenotypes, submitter and a link to the ClinVar entry is seen by hovering over the purple triangles.Note the
Primate AI
track and high Primate AI score.Primate AI
track displays Scores for potential missense variants, based on polymorphisms observed in primate species. Points above the dashed line for the 75th percentile may be considered likely pathogenic as cross-species sequence is highly conserved; you often see high conservancy at the functional domains. Points below the 25th percentile may be considered "likely benign".
Click the
Expression
tab and notice that normal Breast and normal Ovarian tissue have relatively high PIK3CA RNA Expression in GTex RNAseq tissue data but ubiquitously expressed.
Last updated