Microarray Toolkit

Last updated 7 months ago

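With the Partek Flow Microarray Toolkit, you can import and analyze microarray data with the same ease as any sequencing analysis pipeline. This document covers supported platforms, data import, conversion to aligned reads, advanced conversion options, visualization of microarray probes, and downstream tasks.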

Supported Platforms

Partek Flow supports popular gene expression microarray platforms including Affymetrix GeneChips (.CEL) and Illumina BeadChips (.idat). You can also import text output from Illumina Genome Studio in the form of tab-delimited text files with probe IDs and AVGSignal values. For the latter, each sample should correspond to one text file.
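As a rough illustration of the per-sample text layout described above, the following Python sketch reads one tab-delimited file into a probe-to-signal mapping. The column headers `ProbeID` and `AVGSignal` and the probe IDs in the example are assumptions made for illustration; check the headers in your own Genome Studio export, which may differ.

```python
import csv
import io


def read_avg_signal(handle, id_col="ProbeID", signal_col="AVGSignal"):
    """Return {probe_id: signal} from one tab-delimited per-sample file.

    The column names are hypothetical defaults, not a documented format.
    """
    reader = csv.DictReader(handle, delimiter="\t")
    missing = {id_col, signal_col} - set(reader.fieldnames or [])
    if missing:
        raise ValueError(f"missing columns: {sorted(missing)}")
    return {row[id_col]: float(row[signal_col]) for row in reader}


# One sample corresponds to one file; here the file is simulated in memory.
example = io.StringIO(
    "ProbeID\tAVGSignal\n"
    "ILMN_0001\t152.4\n"
    "ILMN_0002\t87.0\n"
)
signals = read_avg_signal(example)
```

A check like this can be useful before upload to confirm that each file holds exactly one sample and that both columns are present.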

For custom-made Affymetrix .CEL files, Illumina .idat files, or AVGSignal data generated from Genome Studio, refer to the documentation on Importing Custom Microarrays for additional instructions regarding the probe files.

Data Import

To import microarray files into a project, go to the Data tab of the project and select the Import Data button. If the data is not already in an existing Partek Flow project, select the Automatically create samples from files button. You will be presented with three different ways to upload your data files:

  • Directly from your Partek Flow server

  • From My Computer

  • From a URL

The most efficient way to import array data is directly from your Partek Flow server. When you select this option, navigate to the folder containing your data files. Valid file types will be selectable for upload. Click on the files you would like to import into the project and select the Create sample button (Figure 1).

During the upload process, Partek Flow will determine the specific platform of your dataset. If it is a commonly used chip, such as an Affymetrix HTA array, the gene and transcript annotations associated with the platform will automatically be downloaded in the Library File Manager.

Once the samples have been uploaded, the Data tab will list each sample along with the Platform associated with each sample. If Affymetrix .CEL files were uploaded, it will also include the date that the array was scanned (Figure 2). This information may be helpful in assessing possible batch effects.

You can now add additional sample attributes to your microarray project. For more information on how to set up your project, please refer to the documentation on Creating a project.

Custom annotations

Conversion to Aligned Reads

Once the microarray data has been uploaded, the Microarray intensity data node will appear in the Analyses tab. To convert this data to aligned reads, select the Microarray intensity data node and select the aligner you would like to use for conversion (Figure 3).

Selecting an aligner will bring up the Microarray Conversion options page (Figure 4). Under the Select probe sequence file section, make sure that the correct Chip name is specified. For non-custom arrays, this should be set automatically to the platform detected upon importing the array.

If for some reason you wish to override that selection, then click the Change selection link and select the chip name in the drop-down menu or select New chip… if you would like to upload a different chip annotation (Figure 5).

In the Select coverage depth section, you can set the nominal read coverage depth for your array. This is by default set at 20 million reads. However, you can adjust this number for better accuracy or if you are evaluating low-expressing genes. If you are comparing the microarray data with a corresponding NGS dataset, you can use this option to scale your data to a more comparable order of magnitude. Take note that setting this to a higher number will result in longer computation times overall.
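To make the idea of a nominal coverage depth concrete, here is a minimal sketch that scales probe intensities proportionally so that the resulting counts sum to a chosen depth. This is an assumption made for illustration only: the source does not describe how Partek Flow actually derives read counts from intensities, and the function name and inputs are hypothetical.

```python
def intensities_to_counts(intensities, depth=20_000_000):
    """Scale intensities so the counts sum to roughly `depth`.

    Simple proportional scaling, assumed for illustration; this is not
    Partek Flow's documented conversion.
    """
    total = sum(intensities.values())
    if total <= 0:
        raise ValueError("total intensity must be positive")
    return {
        probe: round(value * depth / total)
        for probe, value in intensities.items()
    }


# Hypothetical probes; a small depth keeps the arithmetic easy to follow.
counts = intensities_to_counts(
    {"p1": 100.0, "p2": 300.0, "p3": 600.0}, depth=1000
)
```

Under this kind of scaling, a larger depth gives low-intensity probes more whole counts to work with, which matches the note above about evaluating low-expressing genes.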

Advanced options

Click the Configure link under the Advanced options section to view the advanced options that can be modified. These are generally transformations applied to microarray data (Figure 6). Note that both interrogating and control (if present) probes are used during fitting and adjustment.

The Specify annotation sequence format drop-down menu allows you to specify the format of the probe sequences associated with your microarrays. Please consult with the manufacturer of your microarrays to determine the exact specification of their probe annotation files. The options available for sequence annotation include:

  • None: Assumes that the sequences in the microarray annotation are the probe sequences printed on the array

  • Complement: Assumes that the sequence is for the region being interrogated by the array

  • Reverse complement: Assumes that the sequence is the reverse complement of the interrogated sequence. This is particularly useful if you have primer sequences as annotations
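The three format options correspond to three string relationships between an annotated sequence and a DNA strand. The sketch below shows those operations in plain Python; the function name and mode strings are hypothetical, and the source does not state how Partek Flow applies them internally.

```python
# Translation table for DNA bases (N is kept as N); case is preserved.
_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def transform_sequence(seq, mode="none"):
    """Return seq unchanged, its complement, or its reverse complement."""
    if mode == "none":
        return seq
    if mode == "complement":
        return seq.translate(_COMPLEMENT)
    if mode == "reverse complement":
        return seq.translate(_COMPLEMENT)[::-1]
    raise ValueError(f"unknown mode: {mode!r}")


as_is = transform_sequence("ATGC", "none")
comp = transform_sequence("ATGC", "complement")
revcomp = transform_sequence("ATGC", "reverse complement")
```

Comparing a few annotated sequences against the reference genome in each of these forms is one way to confirm which format the manufacturer's annotation file uses.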

The Consolidation of identical probe sequences option accounts for probes that have identical sequences within the same array. This is particularly common in microRNA arrays, where identical mature miRNA sequences can exist for multiple organisms.

By default, this option takes the mean of the intensities of the identical probes and uses it to calculate the read count for the probe sequence. Clicking on the drop-down menu allows you to either turn off this feature (none) or use the median instead (Figure 7).
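The consolidation choices (mean, median, or none) can be pictured as a group-by over probe sequence. The following is a minimal sketch with hypothetical sequences and intensities; it illustrates the idea and is not Partek Flow's implementation.

```python
from collections import defaultdict
from statistics import mean, median


def consolidate(probes, method="mean"):
    """Collapse probes that share a sequence into one intensity.

    probes: iterable of (sequence, intensity) pairs.
    method: "mean", "median", or "none" (leave probes untouched).
    """
    if method == "none":
        return list(probes)
    aggregate = {"mean": mean, "median": median}[method]
    groups = defaultdict(list)
    for sequence, intensity in probes:
        groups[sequence].append(intensity)
    return [(seq, aggregate(values)) for seq, values in groups.items()]


# Hypothetical probes: three share one sequence, one is unique.
probes = [("ACGT", 10.0), ("ACGT", 20.0), ("ACGT", 60.0), ("TTGA", 5.0)]
by_mean = dict(consolidate(probes, "mean"))
by_median = dict(consolidate(probes, "median"))
```

In this example the median is less affected by the single high intensity than the mean, which is the usual reason to prefer it when one duplicate probe is an outlier.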

Selecting the Sequence correction transform check box corrects intensity values for sequence-specific effects. It is turned off by default, and we advise selecting this option only if there are wide differences in the GC content of your probes.

Both the GCCN sequence correction transform and the Scale intensity transform were developed by Affymetrix [1]. GCCN scales the intensities with respect to the difference in probe affinity associated with guanine and cytosine (GC) content. The Scale intensity transform, on the other hand, was developed to improve inter-platform comparisons between microarray and RNA-seq data. Specifically, the algorithm stretches the intensity distribution to a common range with a power law mapping that decompresses fold change ratios. This ultimately simplifies fold change comparisons between different technologies.

The RMA background transform check box applies the background correction from the robust multi-array average (RMA) method to your microarray samples [2]. This is a well-accepted method of background correction for microarray intensities.

The Quantile normalization check box performs quantile normalization on all microarray intensities. This is important in making sure that signals from any two arrays are comparable.
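For readers unfamiliar with the technique, the sketch below shows textbook quantile normalization in plain Python: each array's values are replaced by the mean, across arrays, of the values at the same rank. It assumes arrays of equal length and breaks ties by position rather than averaging them; it illustrates the general method and is not Partek Flow's code.

```python
def quantile_normalize(arrays):
    """Give every array the same intensity distribution.

    arrays: list of equal-length lists, one list of intensities per array.
    Returns new lists; the inputs are not modified.
    """
    n = len(arrays[0])
    if any(len(a) != n for a in arrays):
        raise ValueError("all arrays must have the same number of probes")
    sorted_arrays = [sorted(a) for a in arrays]
    # Mean across arrays of the values at each rank.
    rank_means = [
        sum(s[i] for s in sorted_arrays) / len(arrays) for i in range(n)
    ]
    result = []
    for a in arrays:
        order = sorted(range(n), key=a.__getitem__)  # probe indices by rank
        normalized = [0.0] * n
        for rank, index in enumerate(order):
            normalized[index] = rank_means[rank]
        result.append(normalized)
    return result


# Two hypothetical arrays with three probes each.
normalized = quantile_normalize([[5.0, 2.0, 3.0], [4.0, 1.0, 6.0]])
```

After normalization both arrays contain the same set of values, only assigned to different probes, which is what makes signals from any two arrays directly comparable.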

Any combination of these advanced options can be saved as its own option set. Once the option set has been created, it will be available in the Option set drop-down menu for use in future projects.

After you have finished configuring all microarray conversion options, click the Next button to configure the parameters for the aligner you have chosen to use. Please refer to the documentation on Alignment for more information about configuring aligners.

Visualizing Microarray Probes

To visualize microarray probes in Partek Flow's Chromosome viewer, click the Select tracks button at the top left corner of the viewer. Select the Probe intensities check box to display microarray probes (Figure 8).

This will display microarray probes aligned to the reference genome (Figure 9). By default, the probes are colored by their intensities with darker shades signifying higher probe intensities.

Clicking a specific probe will display additional information about that probe including mapping position and sequence length. If the probe intensities are consolidated for specific categorical attributes, the average probe intensity is also displayed. For more details on how to use the Chromosome viewer, please consult its documentation.

Downstream Tasks

The Convert to aligned reads task generates an Aligned reads data node. You can then perform most tasks available for that data node type in Partek Flow, including Quantify to transcriptome, Normalize counts, and Differential gene expression (GSA) analysis. Please consult the documentation for each task for instructions on its use.

References

[1] Affymetrix whitepaper on Microarray normalization using Signal Space Transformation with probe Guanine Cytosine Count Correction. Accessed February 17, 2016.

[2] Irizarry RA, Hobbs B, Collin F et al. Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostat. 2003; 4:249-264.

Additional Assistance

Partek Flow also supports custom-made Affymetrix and Illumina arrays. Once the data files have been uploaded, you will also need to supply a custom annotation file that contains information on the probe sequences associated with the array. Please refer to the document on Importing Custom Microarrays for additional information on how to create and upload these annotations.

If you need additional assistance, please visit our support page to submit a help ticket or find phone numbers for regional support.

Figure 1. Importing microarray files into Partek Flow
Figure 2. Imported microarrays will automatically include Platform and, in some cases, Scan date in the Attributes page
Figure 3. Converting Microarray intensity to Aligned reads
Figure 4. The Microarray Conversion options page
Figure 5. Selecting a different probe sequence file
Figure 6. Advanced options for Microarray conversion
Figure 7. Changing the way identical probes are consolidated
Figure 8. Displaying microarray probes in Chromosome viewer
Figure 9. Viewing probe intensities