# Methylation Array

This guide provides instructions for analyzing Illumina Infinium Methylation array data. The data set is [Infinium Methylation Screening Array Demo Data Set](https://support.illumina.com/downloads/infinium-methylation-demo-data-set.html). There are 60 human samples with two different groups and two dose levels in each group.

The input data for methylation array is two .idat files per sample:

* `<Sample name>_Grn.idat`
* `<Sample name>_Red.idat`

### Create analysis

After[ login to Connected Multiomics](https://help.multiomics.illumina.com/icm/introduction/#log-in-to-connected-multiomics), [create study and add the IDAT files to the study](https://help.connected.illumina.com/icm/studies/create-study) to create 60 samples, add [sample meta data](https://help.connected.illumina.com/icm/studies/view-studies/sample-metadata).

<div align="left"><figure><img src="https://580316046-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FWMxqQAMFOJtu98OBk9KN%2Fuploads%2Fgit-blob-bd1bd45e64bc5e308883768ac77d388adbaf59b6%2Fstudy.png?alt=media" alt="" width="563"><figcaption></figcaption></figure></div>

Click New Analysis button (<img src="https://580316046-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FWMxqQAMFOJtu98OBk9KN%2Fuploads%2Fgit-blob-315d1d86f14027f22d3e77bb3e8caced9b1789a7%2Fimage%20(486).png?alt=media" alt="" data-size="line">), specify an analysis name, choose ***Custom: Illumina Infinium Methylation*** from *Analysis Type* drop-down list, and use **All Illumina Infinium Methylation samples**, click **Run Analysis.**

<div align="left"><figure><img src="https://580316046-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FWMxqQAMFOJtu98OBk9KN%2Fuploads%2Fgit-blob-d78b5a7b4413ff640a9bb9f43c7300355f80c07c%2Fanalysis%20(3).png?alt=media" alt="" width="563"><figcaption></figcaption></figure></div>

A Microarray methylation data node is generated in the Analysis. Click on the data node and choose [**Generate beta value task**](https://help.connected.illumina.com/icm/analyses/analysis-functionality/task-menu/pre-analysis-tools/generate-beta-value)**.**

<div align="left"><figure><img src="https://580316046-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FWMxqQAMFOJtu98OBk9KN%2Fuploads%2Fgit-blob-996d7d6b4f918c49b7812f39f43e9e32bc4816e8%2Fbeta%20(3).png?alt=media" alt="" width="563"><figcaption></figcaption></figure></div>

### Generate beta value

Choose **Infinium Methylation Screening Array** as the *Chip name* from the drop-down list. Users have the options to exclude probes on XY chromosomes depends on their study goal. To be more conservative, users can also exclude probes if their detect p-values are higher than a cutoff value in certain number of samples. Filter probes will remove noise and speed up the downstream analysis.

<div align="left"><figure><img src="https://580316046-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FWMxqQAMFOJtu98OBk9KN%2Fuploads%2Fgit-blob-0af420ca3c4d2818604e8698f7dbb5559acb5bbe%2Fbeta%20(2).png?alt=media" alt="" width="563"><figcaption></figcaption></figure></div>

After configure the dialog, click **Finish** to run the task.

### Generate PCA

Click on Methylation beta data node, choose [**PCA**](https://help.connected.illumina.com/icm/analyses/analysis-functionality/task-menu/exploratory-analysis/pca) in the *Exploratory analysis* section on the menu.

<div align="left"><figure><img src="https://580316046-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FWMxqQAMFOJtu98OBk9KN%2Fuploads%2Fgit-blob-5ba19a888da1172a79ecf6c94a5c86073687728d%2Fpca.png?alt=media" alt="" width="563"><figcaption></figcaption></figure></div>

Keep the default settings and click **Finish**.

Double click on the PCA data node to open the report in Data viewer.

<div align="left"><figure><img src="https://580316046-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FWMxqQAMFOJtu98OBk9KN%2Fuploads%2Fgit-blob-2ded50f1d9fb93cb56e5850814dfe7fe35ac25c7%2Fscatterplot.png?alt=media" alt="" width="563"><figcaption></figcaption></figure></div>

### Detect differential methylation

Single click the *Methylation beta* data node and perform the [**Detect differential methylation**](https://help.connected.illumina.com/icm/analyses/analysis-functionality/task-menu/statistics/detect-differential-methylation) task under *Statistics* in the task menu.

This task converts the beta values to m-values and performs [ANOVA](https://help.connected.illumina.com/icm/analyses/analysis-functionality/task-menu/statistics/differential-analysis/anova-limma-trend-limma-voom) differential expression analysis on both beta value and m-value matrixes.

Follow along with the task to make one-way or two-way ANOVA comparisons. The configured ANOVA model is performed on both beta value and m-value matrices. Click **Finish**.

This outputs the *Detect differential methylation* task report list. Open the **Task report** from the task menu or double-click the data node.

<div align="left"><figure><img src="https://580316046-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FWMxqQAMFOJtu98OBk9KN%2Fuploads%2Fgit-blob-5d555c2722a6e7fdf0cf762a2164855f031aa0ce%2Fdge.png?alt=media" alt=""><figcaption></figcaption></figure></div>

The outputs of this task include significance as *P-value* and *FDR step up* which is from the M-values. The *LSMeans* of the groups and the *Difference* are of the Beta-values.

When the difference is greater than or equal to 0.2, it site is labeled as **Hypermethylated** in *Methylation* field; if the difference is less than or equal to -0.2, it site is labeled as **Hypomethylated** in *Methylation* field. If the difference is between -0.2 to 0.2, it is labeled "**?**" in *Methylation* field.<br>

Click the Optional columns button to add more column data including annotation from the Illumina manifest file. For more information on these optional columns, see *Infinium Methylation Screening Array Manifest Column Headings* on [Illumina support site](https://support.illumina.com/downloads/infinium-methylation-screening-manifest-files.html).

Use the left filter panel to filter the results then click **Generate filtered node**. Detailed information on how to use the filter panel can be found [here](https://help.connected.illumina.com/icm/analyses/analysis-functionality/task-menu/statistics/differential-analysis/gsa).

### Visualize filtered results with Hierarchical clustering / heatmap

If we use FDR <=0.05 and difference is greater that |0.7| on Cell Line vs Coriell as an example filter criteria to generate a filtered result.

<figure><img src="https://580316046-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FWMxqQAMFOJtu98OBk9KN%2Fuploads%2Fgit-blob-2044724a9e0fc2234d839f29e28b3b3efda06406%2FfilterDGE.png?alt=media" alt=""><figcaption></figcaption></figure>

Select the generated *Filtered feature list* data node and use the *Exploratory analysis* task menu dropdown to perform the [**Hierarchical clustering / heatmap**](https://help.connected.illumina.com/icm/analyses/analysis-functionality/task-menu/exploratory-analysis/hierarchical-clustering) task.

<div align="left"><figure><img src="https://580316046-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FWMxqQAMFOJtu98OBk9KN%2Fuploads%2Fgit-blob-3ac085924c7d9ee8d96fd087cd16901cc74efebd%2FhcD.png?alt=media" alt="" width="563"><figcaption></figcaption></figure></div>

Leave the settings in the dialog as default and click **Finish**.

<div align="left"><figure><img src="https://580316046-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FWMxqQAMFOJtu98OBk9KN%2Fuploads%2Fgit-blob-5808fc61913361854a379589925b4b87816fa18d%2FhcD1.png?alt=media" alt="" width="563"><figcaption></figcaption></figure></div>

Double click on the Hierarchical clustering/heatmap to open the report.

### Biological interpretation

Select the *Filtered feature list* data node within the Analyses tab and choose the [**Gene set enrichment**](https://help.connected.illumina.com/icm/analyses/analysis-functionality/task-menu/biological-interpretation/gene-set-enrichment) task under *Biological interpretation* in the task menu. Choose the KEGG database, with other settings as default, click **Finish**. Double click on the output *Pathway enrichment* data node to open the report.

<div align="left"><figure><img src="https://580316046-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FWMxqQAMFOJtu98OBk9KN%2Fuploads%2Fgit-blob-6defe5fcc443b08e15da6220766af93d657dad12%2Fpw.png?alt=media" alt="" width="563"><figcaption></figcaption></figure></div>

When the table is filtered down to less than 100 row, the report can be viewed in Data viewer. The filter can be performed on the report table, or click on the *Pathway enrichment* data node, perform **Filter gene sets** in *Filtering* section. Click on <img src="https://580316046-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FWMxqQAMFOJtu98OBk9KN%2Fuploads%2Fgit-blob-a2c9d858a75743b7f791e43aba82b12970529c98%2Fimage%20(474).png?alt=media" alt="" data-size="line"> when it is available to open the plots.

<div align="left"><figure><img src="https://580316046-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FWMxqQAMFOJtu98OBk9KN%2Fuploads%2Fgit-blob-404998bcc8b1c2248e710fc50ab181ff9c8e07c8%2FpwPlot.png?alt=media" alt="" width="563"><figcaption></figcaption></figure></div>

This completes the example analyses pipeline.

<div align="left"><figure><img src="https://580316046-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FWMxqQAMFOJtu98OBk9KN%2Fuploads%2Fgit-blob-c882e3bc3c39516fc85f63853e58c587f02f4a0e%2Fpipeline%20(1).png?alt=media" alt="" width="563"><figcaption></figcaption></figure></div>
