LogoLogo
Illumina KnowledgeIllumina SupportSign In
Partek
  • Home
Partek
  • Overview
  • Partek Flow
    • Frequently Asked Questions
      • General
      • Visualization
      • Statistics
      • Biological Interpretation
      • How to cite Partek software
    • Quick Start Guide
    • Installation Guide
      • Minimum System Requirements
      • Single Cell Toolkit System Requirements
      • Single Node Installation
      • Single Node Amazon Web Services Deployment
      • Multi-Node Cluster Installation
      • Creating Restricted User Folders within the Partek Flow server
      • Updating Partek Flow
      • Uninstalling Partek Flow
      • Dependencies
      • Docker and Docker-compose
      • Java KeyStore and Certificates
      • Kubernetes
    • Live Training Event Recordings
      • Bulk RNA-Seq Analysis Training
      • Basic scRNA-Seq Analysis & Visualization Training
      • Advanced scRNA-Seq Data Analysis Training
      • Bulk RNA-Seq and ATAC-Seq Integration Training
      • Spatial Transcriptomics Data Analysis Training
      • scRNA and scATAC Data Integration Training
    • Tutorials
      • Creating and Analyzing a Project
        • Creating a New Project
        • The Metadata Tab
        • The Analyses Tab
        • The Log Tab
        • The Project Settings Tab
        • The Attachments Tab
        • Project Management
        • Importing a GEO / ENA project
      • Bulk RNA-Seq
        • Importing the tutorial data set
        • Adding sample attributes
        • Running pre-alignment QA/QC
        • Trimming bases and filtering reads
        • Aligning to a reference genome
        • Running post-alignment QA/QC
        • Quantifying to an annotation model
        • Filtering features
        • Normalizing counts
        • Exploring the data set with PCA
        • Performing differential expression analysis with DESeq2
        • Viewing DESeq2 results and creating a gene list
        • Viewing a dot plot for a gene
        • Visualizing gene expression in Chromosome view
        • Generating a hierarchical clustering heatmap
        • Performing biological interpretation
        • Saving and running a pipeline
      • Analyzing Single Cell RNA-Seq Data
      • Analyzing CITE-Seq Data
        • Importing Feature Barcoding Data
        • Data Processing
        • Dimensionality Reduction and Clustering
        • Classifying Cells
        • Differentially Expressed Proteins and Genes
      • 10x Genomics Visium Spatial Data Analysis
        • Start with pre-processed Space Ranger output files
        • Start with 10x Genomics Visium fastq files
        • Spatial data analysis steps
        • View tissue images
      • 10x Genomics Xenium Data Analysis
        • Import 10x Genomics Xenium Analyzer output
        • Process Xenium data
        • Perform Exploratory analysis
        • Make comparisons using Compute biomarkers and Biological interpretation
      • Single Cell RNA-Seq Analysis (Multiple Samples)
        • Getting started with the tutorial data set
        • Classify cells from multiple samples using t-SNE
        • Compare expression between cell types with multiple samples
      • Analyzing Single Cell ATAC-Seq data
      • Analyzing Illumina Infinium Methylation array data
      • NanoString CosMx Tutorial
        • Importing CosMx data
        • QA/QC, data processing, and dimension reduction
        • Cell typing
        • Classify subpopulations & differential expression analysis
    • User Manual
      • Interface
      • Importing Data
        • SFTP File Transfer Instructions
        • Import single cell data
        • Importing 10x Genomics Matrix Files
        • Importing and Demultiplexing Illumina BCL Files
        • Partek Flow Uploader for Ion Torrent
        • Importing 10x Genomics .bcl Files
        • Import a GEO / ENA project
      • Task Menu
        • Task actions
        • Data summary report
        • QA/QC
          • Pre-alignment QA/QC
          • ERCC Assessment
          • Post-alignment QA/QC
          • Coverage Report
          • Validate Variants
          • Feature distribution
          • Single-cell QA/QC
          • Cell barcode QA/QC
        • Pre-alignment tools
          • Trim bases
          • Trim adapters
          • Filter reads
          • Trim tags
        • Post-alignment tools
          • Filter alignments
          • Convert alignments to unaligned reads
          • Combine alignments
          • Deduplicate UMIs
          • Downscale alignments
        • Annotation/Metadata
          • Annotate cells
          • Annotation report
          • Publish cell attributes to project
          • Attribute report
          • Annotate Visium image
        • Pre-analysis tools
          • Generate group cell counts
          • Pool cells
          • Split matrix
          • Hashtag demultiplexing
          • Merge matrices
          • Descriptive statistics
          • Spot clean
        • Aligners
        • Quantification
          • Quantify to annotation model (Partek E/M)
          • Quantify to transcriptome (Cufflinks)
          • Quantify to reference (Partek E/M)
          • Quantify regions
          • HTSeq
          • Count feature barcodes
          • Salmon
        • Filtering
          • Filter features
          • Filter groups (samples or cells)
          • Filter barcodes
          • Split by attribute
          • Downsample Cells
        • Normalization and scaling
          • Impute low expression
          • Impute missing values
          • Normalization
          • Normalize to baseline
          • Normalize to housekeeping genes
          • Scran deconvolution
          • SCTransform
          • TF-IDF normalization
        • Batch removal
          • General linear model
          • Harmony
          • Seurat3 integration
        • Differential Analysis
          • GSA
          • ANOVA/LIMMA-trend/LIMMA-voom
          • Kruskal-Wallis
          • Detect alt-splicing (ANOVA)
          • DESeq2(R) vs DESeq2
          • Hurdle model
          • Compute biomarkers
          • Transcript Expression Analysis - Cuffdiff
          • Troubleshooting
        • Survival Analysis with Cox regression and Kaplan-Meier analysis - Partek Flow
        • Exploratory Analysis
          • Graph-based Clustering
          • K-means Clustering
          • Compare Clusters
          • PCA
          • t-SNE
          • UMAP
          • Hierarchical Clustering
          • AUCell
          • Find multimodal neighbors
          • SVD
          • CellPhoneDB
        • Trajectory Analysis
          • Trajectory Analysis (Monocle 2)
          • Trajectory Analysis (Monocle 3)
        • Variant Callers
          • SAMtools
          • FreeBayes
          • LoFreq
        • Variant Analysis
          • Fusion Gene Detection
          • Annotate Variants
          • Annotate Variants (SnpEff)
          • Annotate Variants (VEP)
          • Filter Variants
          • Summarize Cohort Mutations
          • Combine Variants
        • Copy Number Analysis (CNVkit)
        • Peak Callers (MACS2)
        • Peak analysis
          • Annotate Peaks
          • Filter peaks
          • Promoter sum matrix
        • Motif Detection
        • Metagenomics
          • Kraken
          • Alpha & beta diversity
          • Choose taxonomic level
        • 10x Genomics
          • Cell Ranger - Gene Expression
          • Cell Ranger - ATAC
          • Space Ranger
          • STARsolo
        • V(D)J Analysis
        • Biological Interpretation
          • Gene Set Enrichment
          • GSEA
        • Correlation
          • Correlation analysis
          • Sample Correlation
          • Similarity matrix
        • Export
        • Classification
        • Feature linkage analysis
      • Data Viewer
      • Visualizations
        • Chromosome View
          • Launching the Chromosome View
          • Navigating Through the View
          • Selecting Data Tracks for Visualization
          • Visualizing the Results Using Data Tracks
          • Annotating the Results
          • Customizing the View
        • Dot Plot
        • Volcano Plot
        • List Generator (Venn Diagram)
        • Sankey Plot
        • Transcription Start Site (TSS) Plot
        • Sources of variation plot
        • Interaction Plots
        • Correlation Plot
        • Pie Chart
        • Histograms
        • Heatmaps
        • PCA, UMAP and tSNE scatter plots
        • Stacked Violin Plot
      • Pipelines
        • Making a Pipeline
        • Running a Pipeline
        • Downloading and Sharing a Pipeline
        • Previewing a Pipeline
        • Deleting a Pipeline
        • Importing a Pipeline
      • Large File Viewer
      • Settings
        • Personal
          • My Profile
          • My Preferences
          • Forgot Password
        • System
          • System Information
          • System Preferences
          • LDAP Configuration
        • Components
          • Filter Management
          • Library File Management
            • Library File Management Settings
            • Library File Management Page
            • Selecting an Assembly
            • Library Files
            • Update Library Index
            • Creating an Assembly on the Library File Management Page
            • Adding Library Files on the Library File Management Page
            • Adding a Reference Sequence
            • Adding a Cytoband
            • Adding Reference Aligner Indexes
            • Adding a Gene Set
            • Adding a Variant Annotation Database
            • Adding a SnpEff Variant Database
            • Adding a Variant Effect Predictor (VEP) Database
            • Adding an Annotation Model
            • Adding Aligner Indexes Based on an Annotation Model
            • Adding Library Files from Within a Project
            • Microarray Library Files
            • Adding Prep kit
            • Removing Library Files
          • Option Set Management
          • Task Management
          • Pipeline managment
          • Lists
        • Access
          • User Management
          • Group Management
          • Licensing
          • Directory Permissions
          • Access Control Log
          • Failed Logins
          • Orphaned files
        • Usage
          • System Queue
          • System Resources
          • Usage Report
      • Server Management
        • Backing Up the Database
        • System Administrator Guide (Linux)
        • Diagnosing Issues
        • Moving Data
        • Partek Flow Worker Allocator
      • Enterprise Features and Toolkits
        • REST API
          • REST API Command List
      • Microarray Toolkit
        • Importing Custom Microarrays
      • Glossary
    • Webinars
    • Blog Posts
      • How to select the best single cell quality control thresholds
      • Cellular Differentiation Using Trajectory Analysis & Single Cell RNA-Seq Data
      • Spatial transcriptomics—what’s the big deal and why you should do it
      • Detecting differential gene expression in single cell RNA-Seq analysis
      • Batch remover for single cell data
      • How to perform single cell RNA sequencing: exploratory analysis
      • Single Cell Multiomics Analysis: Strategies for Integration
      • Pathway Analysis: ANOVA vs. Enrichment Analysis
      • Studying Immunotherapy with Multiomics: Simultaneous Measurement of Gene and Protein
      • How to Integrate ChIP-Seq and RNA-Seq Data
      • Enjoy Responsibly!
      • To Boldly Go…
      • Get to Know Your Cell
      • Aliens Among Us: How I Analyzed Non-Model Organism Data in Partek Flow
    • White Papers
      • Understanding Reads in RNA-Seq Analysis
      • RNA-Seq Quantification
      • Gene-specific Analysis
      • Gene Set ANOVA
      • Partek Flow Security
      • Single Cell Scaling
      • UMI Deduplication in Partek Flow
      • Mapping error statistics
    • Release Notes
      • Release Notes Archive - Partek Flow 10
  • Partek Genomics Suite
    • Installation Guide
      • Minimum System Requirements
      • Computer Host ID Retrieval
      • Node Locked Installation
        • Windows Installation
        • Macintosh Installation
      • Floating/Locked Floating Installation
        • Linux Installation
          • FlexNet Installation on Linux
        • Installing FlexNet on Windows
        • License Server FAQ's
        • Client Computer Connection to License Server
      • Uninstalling Partek Genomics Suite
      • Updating to Version 7.0
      • License Types
      • Installation FAQs
    • User Manual
      • Lists
        • Importing a text file list
        • Adding annotations to a gene list
        • Tasks available for a gene list
        • Starting with a list of genomic regions
        • Starting with a list of SNPs
        • Importing a BED file
        • Additional options for lists
      • Annotation
      • Hierarchical Clustering Analysis
      • Gene Ontology ANOVA
        • Implementation Details
        • Configuring the GO ANOVA Dialog
        • Performing GO ANOVA
        • GO ANOVA Output
        • GO ANOVA Visualisations
        • Recommended Filters
      • Visualizations
        • Dot Plot
        • Profile Plot
        • XY Plot / Bar Chart
        • Volcano Plot
        • Scatter Plot and MA Plot
        • Sort Rows by Prototype
        • Manhattan Plot
        • Violin Plot
      • Visualizing NGS Data
      • Chromosome View
      • Methylation Workflows
      • Trio/Duo Analysis
      • Association Analysis
      • LOH detection with an allele ratio spreadsheet
      • Import data from Agilent feature extraction software
      • Illumina GenomeStudio Plugin
        • Import gene expression data
        • Import Genotype Data
        • Export CNV data to Illumina GenomeStudio using Partek report plug-in
        • Import data from Illumina GenomeStudio using Partek plug-in
        • Export methylation data to Illumina GenomeStudio using Partek report plug-in
    • Tutorials
      • Gene Expression Analysis
        • Importing Affymetrix CEL files
        • Adding sample information
        • Exploring gene expression data
        • Identifying differentially expressed genes using ANOVA
        • Creating gene lists from ANOVA results
        • Performing hierarchical clustering
        • Adding gene annotations
      • Gene Expression Analysis with Batch Effects
        • Importing the data set
        • Adding an annotation link
        • Exploring the data set with PCA
        • Detect differentially expressed genes with ANOVA
        • Removing batch effects
        • Creating a gene list using the Venn Diagram
        • Hierarchical clustering using a gene list
        • GO enrichment using a gene list
      • Differential Methylation Analysis
        • Import and normalize methylation data
        • Annotate samples
        • Perform data quality analysis and quality control
        • Detect differentially methylated loci
        • Create a marker list
        • Filter loci with the interactive filter
        • Obtain methylation signatures
        • Visualize methylation at each locus
        • Perform gene set and pathway analysis
        • Detect differentially methylated CpG islands
        • Optional: Add UCSC CpG island annotations
        • Optional: Use MethylationEPIC for CNV analysis
        • Optional: Import a Partek Project from Genome Studio
      • Partek Pathway
        • Performing pathway enrichment
        • Analyzing pathway enrichment in Partek Genomics Suite
        • Analyzing pathway enrichment in Partek Pathway
      • Gene Ontology Enrichment
        • Open a zipped project
        • Perform GO enrichment analysis
      • RNA-Seq Analysis
        • Importing aligned reads
        • Adding sample attributes
        • RNA-Seq mRNA quantification
        • Detecting differential expression in RNA-Seq data
        • Creating a gene list with advanced options
        • Visualizing mapped reads with Chromosome View
        • Visualizing differential isoform expression
        • Gene Ontology (GO) Enrichment
        • Analyzing the unexplained regions spreadsheet
      • ChIP-Seq Analysis
        • Importing ChIP-Seq data
        • Quality control for ChIP-Seq samples
        • Detecting peaks and enriched regions in ChIP-Seq data
        • Creating a list of enriched regions
        • Identifying novel and known motifs
        • Finding nearest genomic features
        • Visualizing reads and enriched regions
      • Survival Analysis
        • Kaplan-Meier Survival Analysis
        • Cox Regression Analysis
      • Model Selection Tool
      • Copy Number Analysis
        • Importing Copy Number Data
        • Exploring the data with PCA
        • Creating Copy Number from Allele Intensities
        • Detecting regions with copy number variation
        • Creating a list of regions
        • Finding genes with copy number variation
        • Optional: Additional options for annotating regions
        • Optional: GC wave correction for Affymetrix CEL files
        • Optional: Integrating copy number with LOH and AsCN
      • Loss of Heterozygosity
      • Allele Specific Copy Number
      • Gene Expression - Aging Study
      • miRNA Expression and Integration with Gene Expression
        • Analyze differentially expressed miRNAs
        • Integrate miRNA and Gene Expression data
      • Promoter Tiling Array
      • Human Exon Array
        • Importing Human Exon Array
        • Gene-level Analysis of Exon Array
        • Alt-Splicing Analysis of Exon Array
      • NCBI GEO Importer
    • Webinars
    • White Papers
      • Allele Intensity Import
      • Allele-Specific Copy Number
      • Calculating Genotype Likelihoods
      • ChIP-Seq Peak Detection
      • Detect Regions of Significance
      • Genomic Segmentation
      • Loss of Heterozygosity Analysis
      • Motif Discovery Methods
      • Partek Genomics Suite Security
      • Reads in RNA-Seq
      • RNA-Seq Methods
      • Unpaired Copy Number Estimation
    • Release Notes
    • Version Updates
    • TeamViewer Instructions
  • Getting Help
    • TeamViewer Instructions
Powered by GitBook
On this page
  • Exploratory Analysis Results
  • T cells
  • B cells
  • Additional Assistance

Was this helpful?

Export as PDF
  1. Partek Flow
  2. Tutorials
  3. Analyzing CITE-Seq Data

Classifying Cells

PreviousDimensionality Reduction and ClusteringNextDifferentially Expressed Proteins and Genes

Last updated 7 months ago

Was this helpful?

We will now examine the results of our exploratory analysis and use a combination of techniques to classify different subsets of T and B cells in the MALT sample.

Exploratory Analysis Results

  • Double click the merged UMAP data node

  • Under Configure on the left, click Style, select the Graph-based cluster node, and color by the Graph-based attribute (Figure 1)

The 3D UMAP plot opens in a new data viewer session (Figure 2). Each point is a different cell and they are clustered based on how similar their expression profiles are across proteins and genes. Because a graph-based clustering task was performed upstream, a biomarker table is also displayed under the plot. This table lists the proteins and genes that are most highly expressed in each graph-based cluster. The graph-based clustering found 11 clusters, so there are 11 columns in the biomarker table.

  • Click and drag the 2D scatter plot icon from New plot onto the canvas (Figure 2)

  • Drop the 2D scatter plot to the right of the UMAP plot

  • Click Merged counts to use as data for the 2D scatter plot (Figure 3)

A 2D scatter plot has been added to the right of the UMAP plot. The points in the 2D scatter plot are the same cells as in the UMAP, but they are positioned along the x- and y-axes according to their expression level for two protein markers: CD3_TotalSeqB and CD4_TotalSeqB, respectively (Figure 4).

  • In Select & Filter, click Criteria to change the selection mode

  • Click the blue circle next to the Add rule drop-down menu (Figure 5)

  • Click Merged counts to change the data source

  • Choose CD3_TotalSeqB from the drop-down list (Figure 6)

  • Click and drag the slider on the CD3D_TotalSeqB selection rule to include the CD3 positive cells (Figure 7)

As you move the slider up and down, the corresponding points on both plots will dynamically update. The cells with a high expression for the CD3 protein marker (a marker for T cells) are highlighted and the deselected points are dimmed (Figure 8).

  • Click Merged counts in Get data on the left under Setup

  • Click and drag CD8a_TotalSeqB onto the 2D scatter plot (Figure 9)

  • Drop CD8_TotalSeqB onto the x-axis configuration option

The CD3 positive cells are still selected, but now you can see how they separate into CD4 and CD8 positive populations (Figure 10).

The simplest way to classifying cell types is to look for the expression of key marker genes or proteins. This approach is more effective with CITE-Seq data than with gene expression data alone as the protein expression data has a better dynamic range and is less sparse. Additionally, many cell types have expected cell surface marker profiles established using other technologies such as flow cytometry or CyTOF. Let's compare the resolution power of the CD4 and CD8A gene expression markers compared to their protein counterparts.

  • Click the duplicate plot icon above the 2D scatter plot (Figure 11)

  • Click Merged counts in the Get Data icon under Setup

  • Search for the CD4 gene

  • Click and drag CD4 onto the duplicated 2D scatter plot

  • Drop the CD4 gene onto the y-axis option

  • Search for the CD8A gene

  • Click and drag CD8A onto the duplicated 2D scatter plot

  • Drop the CD8A gene onto the x-axis option

The second 2D scatter plot has the CD8A and CD4 mRNA markers on the x- and y-axis, respectively (Figure 12). The protein expression data has a better dynamic range than the gene expression data, making it easier to identify sub-populations.

  • Manually select the cells with high expression of the CD4_TotalSeqB protein marker (Figure 13)

More than 2000 cells show positive expression for the CD4 cell surface protein.

Let's perform the same test on the gene expression data.

  • Click on a blank spot on the plot to clear the selection

  • Manually select the cells with high expression of the CD4 gene marker (Figure 14)

This time, only 500 cells show positive expression for the CD4 marker gene. This means that the protein data is less sparse (i.e. there fewer zero counts), which further helps to reliably detect sub-populations.

T cells

Based on the exploratory analysis above, most of the CD3 positive cells are in the group of cells in the right side of the UMAP plot. This is likely to be a group of T cells. We will now examine this group in more detail to identify T cell sub-populations.

  • Draw a lasso around the group of putative T cells (Figure 15)

  • Click and drag the plot to rotate it around

This group of putative T cells predominantly consists of cells assigned to graph-based clusters 3, 4, and 6, indicated by the colors. Examining the biomarker table for these clusters can help us infer different types of T cell.

  • Click and drag the bar between the UMAP plot and the biomarker table to resize the biomarker table to see more of it (Figure 17)

Cluster 6 has several interesting biomarkers. The top biomarker is CXCL13, a gene expressed by follicular B helper T cells (Tfh cells). Another biomarker is the PD-1 protein, which is expressed in Tfh cells. This protein promotes self-tolerance and is a target for immunotherapy drugs. The TIGIT protein is also expressed in cluster 6 and is another immunotherapy drug target that promotes self-tolerance.

Cluster 4 expresses several marker genes associated with cytotoxicity (e.g. NKG7 and GZMA) and both CD3 and CD8 proteins. Thus, these are likely to be cytotoxic cells.

We can visually confirm these expression patterns and assess the specificity of these markers by coloring the cells on the UMAP plot based on their expression of these markers.

  • Click the duplicate plot icon above the UMAP plot

We will color the cells on the duplicate by their expression of marker genes, while keeping the original plot colored by graph-based cluster assignment.

  • Click and drag the CXCL13 gene from the biomarker table onto the duplicate UMAP plot

  • Drop the CXCL13 gene onto the Green (feature) option (Figure 18)

  • Click and drag the NKG7 gene from the biomarker table onto the duplicate UMAP plot

  • Drop the NKG7 gene onto the Red (feature) option

The cells with higher CXCL13 and NKG7 expression are now colored green and red, respectively. By looking at the two UMAP plots side by side, you can see these two marker genes are localized in graph-based clusters 6 and 4, respectively (Figure 19).

  • Click the blue circle next to the Add criteria drop-down list

  • Search for Graph to search for a data source

  • Select Graph-based clustering (derived from the Merged counts > PCA data nodes)

  • Click the Add criteria drop-down list and choose Graph-based to add a selection rule (Figure 20)

  • In the Graph-based filtering rule, click All to deselect all cells

  • Click cluster 6 to select all cells in cluster 6

  • Using the Classify tool, click Classify selection

  • Label the cells as Tfh cells (Figure 21)

  • Click Save

  • Click cluster 4 to select all cells in cluster 4

  • In the Classify icon, click Classify selection

  • Label the cells as Cytotoxic cells

  • Click Save

We can classify the remaining cells as helper T cells, as they predominantly express the CD4 protein marker.

  • Click on the invert selection icon in either of the UMAP plots (Figure 22)

  • In Classify, click Classify selection

  • Label the cells as Helper T cells

  • Click Save

Let's look at our progress so far, before we classify subsets of B-cells.

  • Click the Clear filters link in Select & Filter

  • Select the duplicate UMAP plot (with the cell colored by marker genes)

  • Under Configure on the left, open Style and color the cells by New classifications (Figure 23)

B cells

In addition to T-cells, we would expect to see B lymphocytes, at least some of which are malignant, in a MALT tumor sample. We can color the plot by expression of a B cell marker to locate these cells on the UMAP plot.

  • In the Get data icon on the left, click Merged counts

  • Scroll down or use the search bar to find the CD19_TotalSeqB protein marker

  • Click and drag the CD19_TotalSeqB marker over to the UMAP plot on the right

  • Drop the CD19_TotalSeqB marker over the Color configuration option on the plot

The cells in the UMAP plot are now colored from grey to blue according to their expression level for the CD19 protein marker (Figure 24). The CD19 positive cells correspond to several graph-based clusters. We can filter to these cells to examine them more closely,

  • Lasso around the CD19 positive cells (Figure 25)

The plots will rescale to include the selected points. The CD19 positive cells include cells from graph-based clusters 1, 2 and 7 (Figure 26).

  • Find the CD3_TotalSeqB protein marker in the biomarker table

  • Click and drag the CD3_TotalSeqB onto the UMAP plot on the right

  • Drop the CD3_TotalSeqB protein marker onto the Color configuration option on the plot (Figure 27)

While these cells express T cell markers, they also group closely with other putative B cells and express B cell markers (CD19). Therefore, these cells are likely to be doublets.

  • Select either of the UMAP plots

  • Click on the Select & Filter

  • Find the CD3_TotalSeqB protein marker in the biomarker table

  • Click and drag CD3_TotalSeqB onto the Add criteria drop-down list in Select & Filter (Figure 28)

  • Set the minimum threshold to 3 in the CD3_TotalSeqB selection (Figure 29)

  • Click the Classify icon then click Classify selection

  • Label the cells as Doublets

  • Click Save

The biomarkers for clusters 1 and 2 also show an interesting pattern. Cluster 1 lists IGHD as its top biomarker, while cluster 2 lists IGHA1 as the fourth most significant. Both IGHD (Immunoglobulin Heavy Constant Delta) and IGHA1 (Immunoglobulin Heavy Constant Alpha 1) encode classes of the immunoglobulin heavy chain constant region. IGHD is part of IgD, which is expressed by mature B cells, and IGHA1 is part of IgA1, which is expressed by activated B cells. We can color the plot by both of these genes to visualize their expression.

  • Click, drag and drop IGHD from the biomarker table onto the Green (feature) configuration option on the UMAP plot on the right

  • Click, drag and drop IGHA1 from the biomarker table onto the Red (feature) configuration option on the UMAP plot on the right (Figure 30)

We can use the lasso tool to select and classify these populations.

  • Lasso around the IGHD positive cells (Figure 31)

  • In the Classify icon on the left, click Classify selection

  • Label the cells as Mature B cells

  • Click Save

  • Lasso around the IGHA1 positive cells (Figure 32)

  • In the Classify icon on the left, click Classify selection

  • Label the cells as Activated B cells

  • Click Save

We can now visualize our classifications.

  • Click the Clear filters link in the Select & Filter icon on the left

  • Select the duplicate UMAP plot (with the cell colored by marker genes)

  • Under Configure on the left, click the Style icon and color the cells by New classifications (Figure 33)

  • Click Apply classifications in the Classify icon

  • Name the attribute Cell type

  • Click Run

  • Click OK to close the message about a classification task being enqueued

Additional Assistance

On the first 2D scatter plot (with protein markers), click in the top right corner

Click in the top right of the plot to switch back to pointer mode

On the second 2D scatter plot (with mRNA markers), click in the top right corner

Click in the top right corner of both 2D scatter plots, to remove them from the canvas

Click in the top right corner of the 3D UMAP plot

Click in the Select & Filter tool to include the selected points

Click in the top right of the plot to switch back to pointer mode

Add the Biomarkers table using the Table option in the New plot menu, you can drag and reposition the table using the button in the top left corner of the plot .

If you need to create more space on the canvas, hide the panel words on the left using the arrow .

In Select & Filter, click to remove the CD3_TotalSeqB filtering rule

Click in Select & Filter to exclude the cluster 6/Tfh cells

Click in Select & Filter to exclude the cluster 4/Cytotoxic cells

Click in the top right corner of the UMAP plot

Click in Select & Filter to include the selected points

Click in Select & Filter to exclude the selected points

Click in the top right corner of the UMAP plot

Optionally, you may wish to save this data viewer session if you need to go back and reclassify cells later. To save the session, click the icon on the left and name the session.

If you need additional assistance, please visit to submit a help ticket or find phone numbers for regional support.

our support page
Exploratory Analysis Results
T cells
B cells
Figure 1. Color the cells in the UMAP plot by their graph-based cluster assignment
Figure 2. Add a 2D scatter plot and place it to the right of the UMAP plot
Figure 3. Choose Merged counts data to draw the 2D scatter plot
Figure 4. The canvas now has a 2D scatter plot next to the UMAP
Figure 5. Click the blue circle to change the data source for the rule selector
Figure 6. Choose the CD3_TotalSeqB protein marker as a selection rule
Figure 7. Use the slider to select cells with positive expression for the CD3 protein marker
Figure 8. CD3+ cells are selected on both plots
Figure 9. Change the feature plotted on the x-axis to CD8_TotalSeqB
Figure 10. 2D scatter plot with CD4_TotalSeqB and CD8_TotalSeqB features on the axes
Figure 11. Click the duplicate plot icon to make a copy of the 2D scatter plot
Figure 12. The second 2D scatter plot (bottom) has the CD8 and CD4 genes plotted against each other
Figure 13. Draw a lasso to manually select CD4+ cells, based on protein expression
Figure 14. Draw a lasso to manually select CD4+ (mRNA) cells
Figure 15. Select the group of putative T cells
Figure 16. Group of putative T-cells
Figure 17. Resize plots to see more of the biomarker table
Figure 18. Click and drag the gene from the biomarker table onto the plot
Figure 19. The cells in the UMAP plot on the right are colored by their expression of CXCL13 (green) and NKG7 (red) marker genes. These cells belong to graph-based clusters 6 and 4, respectively, shown in the plot on the left
Figure 20. Change the data source to Graph-based clustering and choose Graph-based from the drop-down list
Figure 21. Select all cluster 6 cells and classify them as Tfh cells
Figure 22. Invert the selection to select all remaining cells
Figure 23. Color by New classifications (T cell subsets)
Figure 24. Cells in UMAP plot colored by their expression of CD19 protein
Figure 25. Lasso around CD19 positive cells
Figure 26. Filtered CD19 positive cells
Figure 27. Some cells within the CD19 positive clusters show signs of expressing T-cells markers
Figure 28. Click and drag the CD3 protein marker directly onto the Add criteria drop-down list to create a selection criteria
Figure 29. Select the remaining CD3 positive doublet cells
Figure 30. The B cells colored by IGHD (green) and IGHA1 (red) gene expression
Figure 31. Lasso around the IGHD positive cells
Figure 32. Select IGHA1 positive cells
Figure 33. UMAP with cells colored by cell types