# Correlation Engine for Gene Discovery

Connected Multiomics (ICM) enables you to explore and analyze your own multiomic datasets. [Correlation Engine](https://www.illumina.com/products/by-type/informatics-products/connected-analytics/modules/correlation-engine.html) (CE) complements this by aggregating signals across experiments, tissues, and conditions to uncover robust co-expression patterns. Together, they allow for context-specific network analysis, helping identify gene modules, regulatory relationships, and potential biomarkers.

Once you’ve identified differentially expressed genes in Connected Multiomics, we recommend importing them into Correlation Engine for deeper biological interpretation.

Here's how:

### **Step 1: Export Genes or Proteins of Interest from ICM**

1. In the Task Graph of your Analysis in Connected Multiomics, navigate to and click the data node containing the significantly differentially expressed genes (Figure 1A)
2. Click the "Download data" button at the bottom of task menu (Figure 1B).
3. Uncheck the “Include counts” option before clicking "Download." This ensures compatibility with CE, which does not require raw counts.
4. Open the downloaded text file on your local computer. To simplify import into CE, retain only the following columns:

   1. Gene ID
   2. P-value
   3. Fold change

   *(See Figure 1C for example formatting.)*

<figure><img src="https://580316046-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FWMxqQAMFOJtu98OBk9KN%2Fuploads%2Fgit-blob-05686d4f92f82e3138bd9a8f492546f5f2a00458%2FFig1%20(1).png?alt=media" alt=""><figcaption></figcaption></figure>

Figure 1. Download the file of genes of inter*est from ICM.*

### ***Step 2: Upload to Correlation Engine***

Now you are ready to upload your genes or proteins of interest to CE.

To switch to CE is easy

1. From the ICM Studies page, click the Product dashboard icon <img src="https://580316046-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FWMxqQAMFOJtu98OBk9KN%2Fuploads%2Fgit-blob-a8849db2c1353f24d691eed97618bf67baca1838%2FScreenshot%202025-09-26%20at%202.40.06%E2%80%AFPM.png?alt=media" alt="" data-size="line"> on the top right to open all the applications in your current workgroup
2. Select Correlation Engine from your list of applications.
3. On the CE homepage, click “Import Your Data”. (See Figure 2A)
4. In the import wizard:

   1. follow the prompts and provide the requested metadata
   2. Click “Next Step” to proceed (Figure 2B).

   *Note: it is possible to upload multiple files simultaneously, but column headers must match exactly.*
5. On the next page, map your columns using the dropdowns:
   1. Gene ID
   2. P-value
   3. Fold change
6. Review your submission and click “Finish”.

<figure><img src="https://580316046-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FWMxqQAMFOJtu98OBk9KN%2Fuploads%2Fgit-blob-5a2097edb7c2e99f6ed68776ae901e7439f3ac26%2FFig2%20(1).png?alt=media" alt=""><figcaption></figcaption></figure>

Figure 2. Interface of data uploading in CE.

### Step 3: Explore Results in CE

You will receive an email notification once CE completes the import and analysis. A new study will be created in your private project. You can start exploring the ‘Pathway Enrichment’ and more (Figure 3, top panel) after clicking on the “BIOSET DATA” in the study.

<figure><img src="https://580316046-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FWMxqQAMFOJtu98OBk9KN%2Fuploads%2Fgit-blob-2a3be0b25a3557fe9690199719e5720833561292%2FFig3.png?alt=media" alt=""><figcaption></figcaption></figure>

Figure 3. Explore CE with your own analysis results from ICM.

### Example Use Case

This tutorial uses data from HT-29 colon cancer cells treated by 5-aza-deoxy-cytidine (5 AZA) with different dosages (10um,5 um, 0um). Without any other background information provided, CE correctly infers the most correlated public study *"Colorectal cancer HT29 cell line treated with 5uM and 10uM 5-aza-deoxy-cytidine for 5d” (See Figure 4)*

<figure><img src="https://580316046-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FWMxqQAMFOJtu98OBk9KN%2Fuploads%2Fgit-blob-3794ff89c4ce8415a648c3d4410419ba6835fd38%2FFig%204.png?alt=media" alt=""><figcaption></figcaption></figure>

Figure 4. The most correlated study by CE is linked to the query data used.

Querying in Knockdown Atlas also shows that CE identifies DNMT1 as the top perturbed gene, consistent with 5 AZA’s role as a DNA methyltransferase inhibitor. (See Figure 5)

This demonstrates the precision and power of combining Connected Multiomics with Correlation Engine

<figure><img src="https://580316046-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FWMxqQAMFOJtu98OBk9KN%2Fuploads%2Fgit-blob-1615fac4149305ddc715f458adfe8ccb7a6c2fe9%2FFig%205.png?alt=media" alt=""><figcaption></figcaption></figure>

Figure 5. The top 5 pertured genes infered by CE with the example query data.

### Summary

By exploring the same datatset in both Connected Multiomics and Correlation engine, you can:

* Move from differential analysis to biological interpretation
* Leverage curated public datasets for context
* Discover novel gene functions and biomarkers
