1 of 1

Cohorts Data in ICA Base

ICA Cohorts data can be viewed in an ICA Project Base instance as a shared database. A shared database in ICA Base operates as a database view. To use this feature, enable Base for your project prior to starting any ICA Cohorts ingestions. See Base for more information on enabling this feature in your ICA Project.

ICA Cohorts Base Tables

After ingesting data into your project, select Phenotypic and Molecular data are available to view in Base. See Cohorts Import for instruction on importing data sets into Cohorts.

Post ingestion, data will be represented in Base.
Select BASE from the ICA left-navigation and click Query.
Under the New Query window, a list of tables is displayed. Expand the Shared Database for Project \<your project name\> .
Cohorts tables will be displayed.
To preview the table and fields click each view listed.
Clicking any of these views then selecting PREVIEW on the right-hand side will show you a preview of the data in the tables.

If your ingestion includes Somatic variants, there will be two molecular tables: ANNOTATED_SOMATIC_MUTATIONS and ANNOTATED_VARIANTS. All ingestions will include a PHENOTYPE table.

The PHENOTYPE table includes a harmonized set that is collected across all data ingestions and is not representative of all data ingested for the Subject or Sample. Sample information is also displayed in this table, if applicable. Sample information drives the annotation process if molecular data is included in the ingestion. That data is stored in the PHENOTYPE table.

Phenotype Data

Field Name

Type

Description

Sample Information

Field Name

Type

Description

Sample Attribute

This table is an entity-attribute value table of supplied sample data matching Cohorts accepted attributes.

Field Name

Type

Description

Study Information

Field Name

Type

Description

Subject

Field

Type

Description

Subject Attribute

This table is an entity-attribute value table of supplied subject data matching Cohorts accepted attributes.

Field

Type

Description

Disease

Field

Type

Description

Drug Exposure

Field

Type

Description

Measurement

Field

Type

Description

Procedure

Field

Type

Description

Annotated Variants

This table will be available for all projects with ingested molecular data

Annotated Somatic Mutations

This table will only be available for data sets with ingested Somatic molecular data.

Annotated Copy Number Variants

This table will only be available for data sets with ingested CNV molecular data.

Annotated Structural Variants

This table will only be available for data sets with ingested SV molecular data. Note that ICA Cohorts stores copy number variants in a separate table.

Raw RNAseq data tables for genes and transcripts

These tables will only be available for data sets with ingested RNAseq molecular data.

Table for gene quantification results:

The corresponding transcript table uses TRANSCRIPT_ID instead of GENE_ID and GENE_HGNC.

Differential expression tables for genes and transcripts

These tables will only be available for data sets with ingested RNAseq molecular data.

Table for differential gene expression results:

The corresponding transcript table uses TRANSCRIPT_ID instead of GENE_ID and GENE_HGNC.

Cohorts Data in ICA Base

ICA Cohorts Base Tables

After ingesting data into your project, select Phenotypic and Molecular data are available to view in Base. See Cohorts Import for instruction on importing data sets into Cohorts.

Post ingestion, data will be represented in Base.
Select BASE from the ICA left-navigation and click Query.
Under the New Query window, a list of tables is displayed. Expand the Shared Database for Project \<your project name\> .
Cohorts tables will be displayed.
To preview the table and fields click each view listed.
Clicking any of these views then selecting PREVIEW on the right-hand side will show you a preview of the data in the tables.

If your ingestion includes Somatic variants, there will be two molecular tables: ANNOTATED_SOMATIC_MUTATIONS and ANNOTATED_VARIANTS. All ingestions will include a PHENOTYPE table.

Phenotype Data

Field Name

Type

Description

Sample Information

Field Name

Type

Description

Sample Attribute

This table is an entity-attribute value table of supplied sample data matching Cohorts accepted attributes.

Field Name

Type

Description

Study Information

Field Name

Type

Description

Subject

Field

Type

Description

Subject Attribute

This table is an entity-attribute value table of supplied subject data matching Cohorts accepted attributes.

Field

Type

Description

Disease

Field

Type

Description

Drug Exposure

Field

Type

Description

Measurement

Field

Type

Description

Procedure

Field

Type

Description

Annotated Variants

This table will be available for all projects with ingested molecular data

Annotated Somatic Mutations

This table will only be available for data sets with ingested Somatic molecular data.

Annotated Copy Number Variants

This table will only be available for data sets with ingested CNV molecular data.

Annotated Structural Variants

This table will only be available for data sets with ingested SV molecular data. Note that ICA Cohorts stores copy number variants in a separate table.

Raw RNAseq data tables for genes and transcripts

These tables will only be available for data sets with ingested RNAseq molecular data.

Table for gene quantification results:

The corresponding transcript table uses TRANSCRIPT_ID instead of GENE_ID and GENE_HGNC.

Differential expression tables for genes and transcripts

These tables will only be available for data sets with ingested RNAseq molecular data.

Table for differential gene expression results:

The corresponding transcript table uses TRANSCRIPT_ID instead of GENE_ID and GENE_HGNC.

Cohorts Data in ICA Base

hashtagICA Cohorts Base Tables

hashtagPhenotype Data

hashtagSample Information

hashtagSample Attribute

hashtagStudy Information

hashtagSubject

hashtagSubject Attribute

hashtagDisease

hashtagDrug Exposure

hashtagMeasurement

hashtagProcedure

hashtagAnnotated Variants

hashtagAnnotated Somatic Mutations

hashtagAnnotated Copy Number Variants

hashtagAnnotated Structural Variants

hashtagRaw RNAseq data tables for genes and transcripts

hashtagDifferential expression tables for genes and transcripts

Cohorts Data in ICA Base

hashtagICA Cohorts Base Tables

hashtagPhenotype Data

hashtagSample Information

hashtagSample Attribute

hashtagStudy Information

hashtagSubject

hashtagSubject Attribute

hashtagDisease

hashtagDrug Exposure

hashtagMeasurement

hashtagProcedure

hashtagAnnotated Variants

hashtagAnnotated Somatic Mutations

hashtagAnnotated Copy Number Variants

hashtagAnnotated Structural Variants

hashtagRaw RNAseq data tables for genes and transcripts

hashtagDifferential expression tables for genes and transcripts

ICA Cohorts Base Tables

Phenotype Data

Sample Information

Sample Attribute

Study Information

Subject

Subject Attribute

Disease

Drug Exposure

Measurement

Procedure

Annotated Variants

Annotated Somatic Mutations

Annotated Copy Number Variants

Annotated Structural Variants

Raw RNAseq data tables for genes and transcripts

Differential expression tables for genes and transcripts

ICA Cohorts Base Tables

Phenotype Data

Sample Information

Sample Attribute

Study Information

Subject

Subject Attribute

Disease

Drug Exposure

Measurement

Procedure

Annotated Variants

Annotated Somatic Mutations

Annotated Copy Number Variants

Annotated Structural Variants

Raw RNAseq data tables for genes and transcripts

Differential expression tables for genes and transcripts