Supported Data Types

The following Illumina and third-party data types are supported in Connected Multiomics:

Spatial Transcriptomics

Illumina Spatial Solution

Vendor

Illumina

Assay Name(s)

Illumina Spatial technology

Secondary Analysis Pipeline

DRAGEN Spatial Transcriptome

File Types

.h5ad

.ome.tiff

Samples Per File

One sample per set of files

Datasets

Included in demo data

Additional information

At least one .ome.tiff and multiple .h5ad files are required.

10x Genomics Visium Space Ranger Output

Vendor

10x Genomics

Assay Name(s)

Visium assays including:

HD WT Panel Gene Expression

HD 3’ Gene Expression

Spatial Gene Expression

Secondary Analysis Pipeline

File Types

.h5

or

barcodes.tsv.gz

features.tsv.gz

matrix.mtx.gz

or

barcodes.csv.gz

features.csv.gz

matrix.mtx.gz

with _spatial.tar.gz

Optional: .tif

Samples Per File

One sample per set of files

Additional information

Provide count matrix data either as one filtered .h5 file per sample or as three sparse matrix files per sample (two .csv files with one .mtx file, or two .tsv files with one .mtx file). The spatial output files should be in compressed format (.zip). The high-resolution image (.tif) is optional and can also be uploaded. The spatial result file name must begin with the sample name. Only one sample can be ingested at a time.
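The file-set rules above can be checked locally before upload. The following is a minimal sketch and not part of Connected Multiomics: the function name `check_visium_files` is hypothetical, and the way the spatial archive is recognized (a file name that begins with the sample name and contains `spatial`) is an assumption made for illustration.

```python
def check_visium_files(sample, filenames):
    """Return a list of problems; an empty list means the set looks complete."""
    names = list(filenames)
    problems = []

    def has(suffix):
        return any(name.endswith(suffix) for name in names)

    # Option 1: a single filtered .h5 count matrix.
    h5_files = [name for name in names if name.endswith(".h5")]
    # Option 2: barcodes + features (both .tsv.gz or both .csv.gz) + matrix.
    tsv_pair = has("barcodes.tsv.gz") and has("features.tsv.gz")
    csv_pair = has("barcodes.csv.gz") and has("features.csv.gz")
    sparse_trio = (tsv_pair or csv_pair) and has("matrix.mtx.gz")

    if len(h5_files) > 1:
        problems.append("more than one .h5 file")
    elif not h5_files and not sparse_trio:
        problems.append("no count matrix")

    # Assumption: the spatial archive is recognized by "spatial" in its name.
    if not any(n.startswith(sample) and "spatial" in n for n in names):
        problems.append("no spatial archive named after the sample")
    return problems
```

Calling the function with one sample's file names returns an empty list when either count matrix option and a spatial archive are present. The optional .tif image is not checked.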

10x Genomics Xenium

Vendor

10x Genomics

Assay Name(s)

Xenium assays including:

In Situ Gene Expression

Secondary Analysis Pipeline

File Types

cell_feature_matrix.h5

cells.csv.gz

cell_boundaries.csv.gz

nucleus_boundaries.csv.gz

transcripts.csv.gz / transcripts.parquet.csv.gz

morphology_focus.ome.tif

Samples Per File

One sample per set of files

Additional information

Includes the unzipped Xenium Output Bundle with the preferred input image file (.tiff) for each sample. The .h5 file name must begin with the sample name. Only 1 sample can be ingested at a time.

NanoString CosMx

Vendor

NanoString

Secondary Analysis Pipeline

N/A

File Types

exprMat_file.csv

metadata_file.csv

polygons.csv

fov_positions_file.csv

Optional: tx_file.csv

Images contained in CellComposite or CellOverlay folder:

.tiff

.jpeg

.jpg

Samples Per File

One sample per set of files

Additional information

NanoString CosMx data should include up to 5 files per sample (exprMat_file.csv, metadata_file.csv, polygons.csv, fov_positions_file.csv, and the optional tx_file.csv) plus the images contained in the CellComposite or CellOverlay folder. The exprMat file name must begin with the sample name. Only one sample can be ingested at a time.

Single-cell RNA-Seq

Illumina DRAGEN Single Cell

Vendor

Illumina

Secondary Analysis Pipeline

File Types

.barcodes.tsv.gz

.features.tsv.gz

.matrix.mtx.gz

Samples Per File

One sample per set of files

Datasets

Included in demo data

Additional information

Any additional descriptors before the main extensions are supported (e.g., .scRNA.filtered.matrix.mtx). Each file name must begin with the sample name. Multiple samples can be ingested at the same time. Both _matrix/_features/_barcodes and .matrix/.features/.barcodes are accepted.
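The naming rules above can be sketched as a small grouping step. This is an illustration under stated assumptions, not the ingestion logic of Connected Multiomics: the helper names `group_by_sample` and `incomplete_samples` are hypothetical, the sample names are supplied by the caller, and files are expected to carry the `.gz` extensions shown under File Types.

```python
import re

# Main extension preceded by "." or "_"; any descriptors may come before it.
# Assumption: files are gzip-compressed, as listed under File Types.
KIND_RE = re.compile(r"[._](matrix\.mtx|features\.tsv|barcodes\.tsv)\.gz$")


def group_by_sample(sample_names, filenames):
    """Map each sample name to {kind: filename} for the files that belong to it."""
    groups = {sample: {} for sample in sample_names}
    # Try longer names first so a file for "S10" is not claimed by "S1".
    ordered = sorted(sample_names, key=len, reverse=True)
    for name in filenames:
        match = KIND_RE.search(name)
        if match is None:
            continue  # not one of the three expected files
        kind = match.group(1).split(".")[0]  # "matrix", "features" or "barcodes"
        for sample in ordered:
            if name.startswith(sample):
                groups[sample][kind] = name
                break
    return groups


def incomplete_samples(groups):
    """Return the samples that lack any of the three files."""
    return sorted(s for s, files in groups.items() if len(files) < 3)
```

Because descriptors may sit between the sample name and the main extension, the sample name cannot be recovered reliably from the file name alone, which is why the sketch takes the sample names as input.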

scRNA feature-barcode-matrix

Vendor

10x Genomics

Parse Biosciences

Assay Name(s)

Chromium assays including:

Universal 3' Gene Expression

Parse assays including:

Evercode WT

File Types

.tsv.gz or .csv.gz

.mtx.gz

Samples Per File

One sample per set of files

Additional information

Sparse matrix output. Each sample has 3 files: two .csv files with one .mtx file, or two .tsv files with one .mtx file. Each file name must begin with the sample name. Multiple samples can be ingested at the same time. Both _matrix/_features/_barcodes and .matrix/.features/.barcodes are accepted.

10x Genomics Cell Ranger counts h5

Vendor

10x Genomics

Assay Name(s)

Chromium assays including:

Universal 3' Gene Expression

Secondary Analysis Pipeline

File Types

.h5

Samples Per File

One sample per file

Additional information

This compressed binary format is preferred for 10x Genomics Cell Ranger output. One filtered .h5 file per sample. Multiple samples can be ingested at the same time.

h5ad

Vendor

Various

Assay Name(s)

Various

Secondary Analysis Pipeline

File Types

.h5ad

Samples Per File

One sample per file

Datasets

Not available

Additional information

AnnData object in the h5ad file format

Seurat (RNA)

Vendor

Various

Assay Name(s)

Various

Secondary Analysis Pipeline

File Types

.qs or .rds

Samples Per File

One sample per file

Datasets

Gene Expression Omnibus - Search GSE186892

Additional information

R object for data processed by Seurat (RNA)

Single-cell ATAC-Seq

10x Genomics Cell Ranger

Vendor

10x Genomics

Secondary Analysis Pipeline

File Types

.h5

.csv

fragments.tsv.gz

fragments.tsv.gz.tbi

peaks.bed

Samples Per File

One sample per set of files

Additional information

Each file name must begin with the sample name. Multiple files can be ingested at the same time.

Seurat (ATAC)

Vendor

Various

Assay Name(s)

Various

Secondary Analysis Pipeline

File Types

.qs or .rds

Samples Per File

One sample per file

Datasets

Not available

Additional information

R object for data processed by Seurat (ATAC)

Bulk RNA-Seq

Illumina DRAGEN RNA

Vendor

Illumina

Assay Name(s)

Illumina Stranded Total RNA Prep Ligation with Ribo-Zero Plus

Illumina Stranded mRNA Prep Ligation

TruSeq Stranded Total RNA Library Prep Gold

TruSeq Stranded Total RNA Library Prep Globin

TruSeq Stranded mRNA Library Prep

Secondary Analysis Pipeline

File Types

.sf

.sf.gz

Samples Per File

One sample per file

Datasets

Included in demo data

Additional information

Gene Counts in sf Format

Vendor

Various

Assay Name(s)

Various

Secondary Analysis Pipeline

File Types

quant.genes.sf

Samples Per File

One sample per file

Datasets

Gene Expression Omnibus - Search GSM7103647

Additional information

Bulk ChIP/ATAC-Seq

Region Count Matrix

Vendor

Various

Assay Name(s)

Various

Secondary Analysis Pipeline

Various including:

MACS

File Types

.txt

Samples Per File

One sample per file

Datasets

Not available

Additional information

Each region name contains the genomic location in the format chromosome:start-stop.
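As an illustration of the chromosome:start-stop format, the sketch below parses a region name into its three parts. It is a hedged example rather than the product's parser: the function name `parse_region` is hypothetical, and the assumptions that coordinates are plain integers and that start does not exceed stop are not stated in this page.

```python
import re

# Assumption: coordinates are plain digits with no thousands separators.
REGION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)-(?P<stop>\d+)$")


def parse_region(name):
    """Split 'chromosome:start-stop' into (chromosome, start, stop)."""
    match = REGION_RE.match(name.strip())
    if match is None:
        raise ValueError(f"not in chromosome:start-stop format: {name!r}")
    start, stop = int(match["start"]), int(match["stop"])
    if start > stop:
        raise ValueError(f"start is greater than stop: {name!r}")
    return match["chrom"], start, stop
```

Running this over the first column of a region count matrix before upload is one way to catch names that use a different separator.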

Bulk DNA-Seq

VCF

Vendor

Various

Assay Name(s)

Various

Secondary Analysis Pipeline

Various including:

DRAGEN

File Types

.vcf

.vcf.gz

.bgz

Samples Per File

One sample per file

Datasets

Included in demo data

Additional information

Bulk Proteomics

Illumina Protein Prep

Vendor

Illumina

Assay Name(s)

Illumina Protein Prep 9.5K Plasma

Illumina Protein Prep 9.5K Serum

Secondary Analysis Pipeline

File Types

.adat

Samples Per File

Multiple samples per file

Datasets

Included in demo data

Additional information

SomaLogic ADAT

Vendor

SomaLogic

Assay Name(s)

SomaScan 11K and 7K assays

Secondary Analysis Pipeline

N/A

File Types

.adat

Samples Per File

Multiple samples per file

Additional information

Bulk Methylation

Illumina 5-base Solution

Vendor

Illumina

Assay Name(s)

Illumina 5-base DNA Prep

Illumina 5-base DNA Prep with Enrichment

Secondary Analysis Pipeline

File Types

.CX_report.txt.gz

.methyl_metrics.csv

.mapping_metrics.csv

.wgs_coverage_metrics.csv

.M-bias.txt

Samples Per File

One sample per set of files

Datasets

Included in demo data

Additional information

Bulk miRNA

Illumina miRNA Prep

Vendor

Illumina

Assay Name(s)

Secondary Analysis Pipeline

File Types

.txt

Samples Per File

Multiple samples per file

Datasets

Included in demo data

Additional information

Microarray Methylation

Illumina Infinium Methylation

Vendor

Illumina

Secondary Analysis Pipeline

N/A

File Types

.idat

Samples Per File

One sample per set of files

Datasets

Included in demo data

Additional information

Requires two .idat files per sample: Red.idat and Grn.idat.
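The two-file requirement can be verified before upload by pairing files on their shared prefix. This is a minimal sketch under assumptions: the function name `pair_idats` is hypothetical, and it assumes each file name ends in `Red.idat` or `Grn.idat` with the rest of the name identifying the sample.

```python
def pair_idats(filenames):
    """Group .idat files by shared prefix and report samples missing a channel."""
    channels = {}
    for name in filenames:
        for suffix, channel in (("Red.idat", "red"), ("Grn.idat", "green")):
            if name.endswith(suffix):
                # Drop the channel suffix and any trailing underscore.
                prefix = name[: -len(suffix)].rstrip("_")
                channels.setdefault(prefix, {})[channel] = name
    complete = {p: c for p, c in channels.items() if len(c) == 2}
    missing = sorted(p for p, c in channels.items() if len(c) < 2)
    return complete, missing
```

The function returns the complete red/green pairs and a sorted list of prefixes for which only one channel was found.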

Library File

scType

Vendor

Various

Assay Name(s)

Various

Secondary Analysis Pipeline

Various

File Types

.tsv

.csv

Samples Per File

Multiple samples per file

Datasets

N/A

Additional information

See Sample Metadata for more detail on format.
