Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
Loading...
To plan a successful sequencing run, a with details on run configuration (e.g., sequencer type, flowcell, and sample type) is required. Follow the instrument-specific steps below to create a sample sheet compatible with the Illumina Prep Kit and DRAGEN Protein Quantification.
Log in to and select your workgroup.
In the Run Planning tool, configure the settings described in the following table. Some settings are instrument specific. When you select the DRAGEN Protein Quantification application, the library prep kit and index adapter kit populate automatically, along with additional instrument-specific settings.
This table describes the possible configuration settings and values.
Instrument Platform
NovaSeq 6000/6000Dx
Secondary Analysis
[Cloud analysis] BaseSpace/Illumina Connected Analytics
[Local analysis] Local
Application
DRAGEN Protein Quantification (select the latest version)
Library Prep Kit
Illumina Protein Prep 9k (auto-populated)
Index Adapter Kit
Illumina DNA-RNA UD Indexes Set A B C D Tagmentation (auto-populated)
[NovaSeq 6000]
These settings are configured automatically and are not editable. - 2 indexes - Single Read - 15, 10, 10, 0
Lane Splitting is not supported for NovaSeq 6000 runs. Select "Repeat set of samples across all lanes."
Upload Illumina Protein Prep Automation System output file (*.csv) to BSSH Run Planner.
Select Import samples, select the CSV file type, and upload the Illumina Protein Prep Automation System output file. The interface highlights invalid values immediately after the file is rendered.
[Optional] To include a second plate in the run, repeat the import process and select Add to existing samples when prompted. The new samples are appended to the samples that were uploaded previously.
WARNING - Sample IDs and index sequences must be unique within a sequencing run. If combining libraries from multiple Illumina Protein Prep runs, avoid combining plates that contain the same sample IDs or index sequences.
[Optional] Multi-project analysis: Users may add Project
values either in the Illumina Protein Prep output file (prior to uploading) or add values in the BSSH Run Planner. See for more details.
WARNING - All samples from the same plate must have the same project (or no project) value.
[Optional] Enter an appropriate output file prefix. This value is used as a part of the prefix for the secondary analysis output file names. The first character must be alphanumeric. For the remaining characters, alphanumeric, hyphens, underscores, and spaces are permitted.
Proceed to the Run Review page and save the planned run.
[NovaSeq 6000, cloud analysis] Download the sample sheet and save it to a network location accessible to the sequencing instrument.
[NovaSeq 6000 Local analysis] Select Export to download the sample sheet. Save the file to a network location accessible to the sequencing instrument.
In the Run Planning tool, configure the settings described in the following table. Some settings are instrument specific. When you select the DRAGEN Protein Quantification application, the library prep kit and index adapter kit populate automatically, along with additional instrument-specific settings.
This table describes the possible configuration settings and values.
Instrument Platform
NovaSeq X Series
Secondary Analysis
[Cloud analysis] BaseSpace/Illumina Connected Analytics
[Local analysis] Local
[Local analysis]
FASTQ file compression format
DRAGEN
This setting is required by default. The setting does not impact DRAGEN Protein Quantification as no FASTQ files are output.
[Local analysis]
Generate FastQC metrics
Yes This setting is optional. The setting does not impact DRAGEN Protein Quantification as no FASTQ files are output.
Read Lengths
- Read 1: 15 - Index 1: 10 - Index 2: 10 - Read 2: 0
Application
DRAGEN Protein Quantification (select the latest version)
Library Prep Kit
Illumina Protein Prep 9k (auto-populated)
Index Adapter Kit
Illumina DNA-RNA UD Indexes Set A B C D Tagmentation (auto-populated)
Override Cycles
These settings are configured automatically and should not be edited. - Read 1: Y15 - Index 1: I10 - Index 2: I10
If lane splitting is not utilized, do not edit the Lane
column in the Illumina Protein Prep Automation System output file.
Upload Illumina Protein Prep Automation System output file (*.csv) to BSSH Run Planner.
Select Import samples, select the CSV file type, and upload the Illumina Protein Prep Automation System output file. The interface highlights invalid values immediately after the file is rendered.
[Optional] To include a second plate in the run, repeat the import process and select "Add to existing samples" when prompted. The new samples are appended to the samples that were uploaded previously.
Barcode mismatch 1 and 2—No action is required. The default value is set to 1. Do not change this value.
If lane splitting will not be utilized, indicate that each sample is present in all lanes. For the first sample, click on the Lanes box and select the first checkbox. This will populate the cell with "1,2,3,4,5,6,7,8". Then, select the Lanes header and click "Fill down". This will add these lane values for all samples.
WARNING - Sample IDs and index sequences must be unique within a sequencing run. If combining libraries from multiple Illumina Protein Prep runs, avoid combining plates that contain the same sample IDs or index sequences. When combining libraries with non-unique indexes, ensure they are loaded into different flow cell lanes, and that lane splitting is enabled during sample sheet creation.
[Optional] Enter an appropriate output file prefix. This value is used as a part of the prefix for the secondary analysis output file names. The first character must be alphanumeric. For the remaining characters, alphanumeric, hyphens, underscores, and spaces are permitted.
Proceed to the Run Review page and save the planned run.
[NovaSeq X Series, cloud analysis] No action is required. The sample sheet is automatically uploaded to the instrument.
[Local analysis] Select Export to download the sample sheet. Save the file to a network location accessible to the sequencing instrument.
Additional Notes:
DRAGEN Protein Quantification does not support Multiple Analysis on NovaSeq X.
For additional information on run planning, refer to the documentation.
Log in to and select your workgroup.
If lane splitting will be utilized with this sequencing run, it's recommended to add values to Lane
column to Illumina Protein Prep Automation System output files locally. See for more details.
[Optional] Multi-project analysis: Users may add Project
values either in the Illumina Protein Prep output file (prior to uploading) or add values in the BaseSpace Run Planner. See for more details.
WARNING - All samples from the same plate must have the same project (or no project) value.
For additional information on run planning, refer to the documentation.
The Illumina Protein Prep Automation System Output File (IPPAS output file) is a .csv file that's produced after automated library prep is completed. It contains the following fields:
Sample ID
Identifier per sample. Specified in Sample Manifest prior to automation.
Well position
Well position of the sample. Specified in Sample Manifest prior to automation.
Project
Optionally contains information about what project output a file should contain. See by Project for more details.
PlateBarcode
Barcode associated with plate. Specified in Sample Manifest prior to automation.
BatchID
User-provided batch identifier
InputType
Sample input type, either plasma, serum, plamsa_calibrator, serum_calibrator, plasma_QC, serum_QC, or blank. Specified in Sample Manifest prior to automation.
MatrixTubeBarcode
Barcode associated with matrix tube. Specified in Sample Manifest prior to automation.
ControlID
Lot number for associated calibrator, QC, or blank sample. Specified in Sample Manifest prior to automation.
ProbePlate
Proteomics Probe Plate barcode scanned during Proteomics Assay.
SOMAmerBeadPlate
SOMAmer-Bead Plate Dil 1 barcode.
Lanes
Optionally contains information about lane splitting. See by Project for more details.
NGS readout of more than 9K unique protein targets for a single plasma or serum sample.
From sample to processed results in under 2.5 days with just 4 hours of hands-on time.
Includes both local and cloud solutions for planning a run and for processing data with DRAGEN Protein Quantification.
The Illumina E2E Protein Prep solution integrates automation steps to perform sample preparation, protein capture, sequencing, and bioinformatics analysis. Once sequencing is finished, the DRAGEN Protein Quantification application automatically initiates in BSSH or ICA. The diagram below illustrates the E2E workflow.
For documentation on the assay and automation components of Illumina Protein Prep, please refer to the Illumina Protein Prep Product Documentation (document # 200045446).
Normalization corrects for the sources of confounding variation, such as overall protein concentration differences, minor deviations in volume transfer during the assay, or efficiency of library preparation steps. It's performed sequentially and produces an individual ADAT file with counts for each normalization step. See the metrics appendix page for a summary of all metrics.
Hybridization Normalization: This step corrects for biases that can occur during the hybridization and sequencing preparation stages of the assay. During hybridization, controls are spiked into each sample; during normalization, the counts for these controls are compared against an internal reference based on all the samples on the plate. A scale factor is calculated for each sample, and if the scale factor is outside of the specification range (0.4–2.5), the sample will receive a FLAG.
Internal Reference Median Normalization: This step corrects for differences in the total protein abundance measurement of a sample. It is performed for each dilution group and runs separately for blank and calibrator control samples. A scale factor is calculated for each dilution group, by comparing the observed protein measurements to a reference of expected values for each protein.
This reference is based on median protein counts across all samples of the same sample type on the same plate. If any of the scale factors for a sample are outside of the specification range (0.4–2.5), the sample will receive a FLAG.
Plate Scaling: This step corrects for possible changes in measured total protein counts between plates, when calibrator samples are present. The median of each SOMAmer measurement across the five calibrators is compared to an external calibration reference to calculate a plate scale factor.
The first plate scaling step compares the calibrator medians to a reference derived from the sequencing instrument (NovaSeq 6000 or NovaSeq X Series) and adjusts the entire plate accordingly.
The second step compares the scaled calibrator medians to a reference derived from the NovaSeq X Series (10B flow cell). There is no metric for plate scaling.
The references used in this step can be found in the SOMAmer metadata, under Ref.Bridging.<CalibratorId>.<Instrument>.<Flowcell>.<MasterMixLot#>. The reference used by cross-instrument plate scaling is Ref.Bridging.<PlateBarcode>.NovaSeqX.10B.AA.
Calibration: This step corrects for batch effects that impact individual SOMAmers.
The first calibration step (Platform Specific Calibration) compares the median of each SOMAmer measurement across the five Calibrator sample replicates to an external Calibration reference. This reference is derived from runs using the same instrument type, flow cell type, calibrator lot, and sample input type used in the run being analyzed. It then calculates a SOMAmer-specific scale factor and a Calibrator metric (PlatformSpecificCalibrationTailPercent). PlatformSpecificCalibrationTailPercent corresponds to the percentage of SOMAmers with scale factors outside the specification range (0.6–1.4). If 15% of SOMAmers fall outside of this specification, the plate receives a WARNING for the PlatformSpecificTailPercent_PassFlag metric.
The second calibration step (Cross Platform Calibration) compares the updated calibrator medians to a reference derived from the NovaSeq X Series (10B flow cell), using the same calibrator lot and sample input type as the run being analyzed. The scale factor used to align the median of the calibrators to the reference value is applied to all samples on the same plate.
The references used in this step can be found in the SOMAmer metadata, under Ref.Bridging.<CalibratorId>.<Instrument>.<Flowcell>.<MasterMixLot#>. The reference used by cross-instrument calibration is Ref.Bridging.<CalibratorId>.NovaSeqX.10B.AA.
External Reference Median Normalization: This step corrects for differences in the total protein signal for samples on each dilution plate. A scale factor is calculated by comparing the observed protein measurements to a reference of expected values for each protein. It's performed on plasma/serum samples and QC samples.
If any of the scale factors for a sample are outside of the specification range (0.4–2.5), the sample will receive a FLAG.
The reference used in this step can be found in the SOMAmer metadata, under Ref.MedNormExt.Plasma.QC or Ref.MedNormExt.Serum.QC (dependent on the input type).
End to End (E2E) (IPP) workflow combines Illumina chemistry, SOMAmer technology, and DRAGEN data analysis for a comprehensive, automated NGS-based proteomics solution. This E2E solution provides the following:
Integrated analysis via (BSSH), (ICA), and Illumina Connected Multiomics (powered by ).
Before setting up and running the Illumina Protein Prep End-to-End (E2E) solution, ensure that the necessary software, tools, and configurations are in place. These prerequisites can vary depending on the environment used to run the secondary analysis (e.g., via cloud or locally). Follow the steps below to configure instrument and software appropriately.
Control Software (v1.8.0 or later)
Illumina Protein Prep custom recipe XML file installed on the sequencing instrument. Illumina provides the file.
Illumina Protein Prep NovaSeq 6000 v1.0.xml
Illumina Protein Prep Automation System output files, depending on the sequencing set up.
NovaSeq 6000 (S4 flow cell)
Recommended: Two IPPAS output files / 192 reactions
Control Software (v1.3.0 or later)
Illumina Protein Prep custom recipe XML file installed on the sequencing instrument. Illumina provides the file.
Illumina Protein Prep NovaSeq X 10B v2.0.xml
Illumina Protein Prep NovaSeq X 25B 100 cycle v1.0.xml
Illumina Protein Prep NovaSeq X 25B 200 cycle v1.0.xml (upon request only)
Illumina Protein Prep NovaSeq X 25B 300 cycle v1.0.xml (upon request only)
Illumina Protein Prep Automation System output files, depending on the sequencing set up.
NovaSeq X (10B flow cell)
Recommended: Two IPPAS output files / 192 reactions
NovaSeq X (25B flow cell)
Recommended: Four IPPAS output files / 384 reactions
BaseSpace Sequence Hub (BSSH) or Illumina Connected Analytics (ICA) account.
All BSSH and ICA subscriptions come with access to the DRAGEN Protein Quantification application.
[ICA subscribers] To view results in ICA, select ICA Run Storage in BSSH Workgroup Settings.
For more information about which subscription is best for your use case, contact your FAS or TAM.
Note: When performing analysis with more than one plate, each plate must have their own unique plate barcode value.
DRAGEN Phase 4 server with the following space recommendations:
NovaSeq6000, S4 flow cell: ≥ 10GB
NovaSeqX, 10B flow cell: ≥ 20 GB
NovaSeqX, 25B flow cell: ≥ 35 GB
External storage drive mounted to a DRAGEN server. This drive must be mounted through a network share and support CIFS/SMB protocols. Read and write permissions are required to use this network share. For more information, please refer to the Illumina Protein Prep Product Documentation (document # 200045446).
Local Proteomics Sample Sheet Generator Excel Workbook.
One or more IPPAS output files (one per plate).
Access to a DRAGEN Server configured to the Illumina Protein Quantification Local Secondary Analysis (see prerequisites page for more information).
Download and open the Local Proteomics Sample Sheet Generator excel file.
Navigate to the Start tab and follow the instructions described in the upper left portion of the sheet:
Fill in the RunName
and output_file_prefix.
Change the InstrumentPlatform
and PrepKitName
as needed.
Copy and paste values from IPPAS output file(s) not including the headers into the specified fields.
Add the Flow Cell lanes for each sample as a comma separated list with no spaces as needed.
Ensure there are no errors displayed in the Row Check or Overall Check (a green cell is expected)
Navigate to the "SaveAsCSV" and save it as CSV (filename must not contain special characters).
This is a sample sheet compatible to local DRAGEN Protein Quantification.
(Recommended) Load this sample sheet onto the sequencer prior to sequencing.
If the sample sheet is not included prior to sequencing, the user must manually reference the sample sheet when running DRAGEN Protein Quantification later.
For information on registering a BaseSpace Sequence Hub or Illumina Connected Analytics account, refer to the or connect with an Illumina Account Manager.
The Run Planning web interface, accessible via , requires internet connection. The Proteomics Sample Sheet Generator tool (linked below) allows for offline creation of sample sheets compatible with the local DRAGEN Protein Quantification application.
Analyze the data locally using a DRAGEN Server with DRAGEN Protein Quantification Pipeline (see for more information).
The purpose of lane splitting is to enable the reuse of sample indexes (barcodes) on the same flow cell. To accomplish this, the samples indexed with the same barcodes must be physically separated by placing them on different lanes of the flow cell.
Currently, lane splitting functionality in Illumina Protein Prep is only supported on the NovaSeqX platform. To use lane splitting, use the sample sheet to indicate which lane(s) each sample is found in. Recommendation: Edit the "Lanes" column of the Illumina Protein Prep Automation System output file(s):
Find the Lanes
column in the Illumina Protein Prep Automation System output file.
Add lane numbers in which the sample is located as a comma-separated value. Example: '1,2,3,4' for a sample located in lanes 1, 2, 3, and 4.
The goal of multi-analysis by project is to improve flexibility by enabling multiple analysis outputs from a single sequencing run without needing to requeue the sample sheet. A single project must include at least one plate (with its controls) and can include multiple plates used in a sequencing run. Each project produces one set of output files (including normalized ADATs and a DRAGEN Report).
To use multi-analysis by project, include the project name in the Project
column of the Illumina Protein Prep Automation System output files. If no project information is provided, DRAGEN Protein Quantification will assume all plates belong to the same project and will assign the project name based on the user provided 'run name'. The multi-analysis by project feature is available for both the NovaSeq 6000 and NovaSeq X platforms.
Find the Project
column in the Illumina Protein Prep Automation System output file.
Edit Project
column in an Illumina Protein Prep Automation System output file by assigning a project name to each sample. Project name string should contain only alphanumeric characters and underscores.
Upload modified Illumina Protein Prep Automation System output file to the Run Planner.
Note: If you want all samples from a sequencing run to be analyzed together, DO NOT include any values in the "Project" column of the sample sheet.
Note: If project splitting is enabled, all samples on one plate must be included in the same project. Samples from one plate CANNOT be split across projects.
Here is an example of what the first lines of two individual IPPAS output files may look like after project and lanes values were added. The plate with Barcode "A" adds lanes "1,2,3,4" and project value "ProjectA". The plate with barcode "B" adds lanes "5,6,7,8" and project value "ProjectB". Note that, this configuration indicates that pro
Upload the modified Illumina Protein Prep Automation System output file to the .
The purpose of a flag is to highlight that a sample required a high degree of correction during the normalization process. This means a sample had sufficiently high or low signal, causing the normalization scale factors to be out of specification. The general recommendation is to exercise caution when using that sample in downstream analysis; normalization may not have been able to properly correct for the large changes in signal.
A flag in a non-blank sample indicates low SOMAmer read depth. DRAGEN Protein Quantification has a minimum read depth to ensure measurement precision is achieved for each sample. Flagged samples may have a decrease in measurement precision.
A flag in a blank sample may indicate increased background or contamination.
A flag in a blank sample may indicate plasma or serum contamination. In internal studies, uncontaminated blanks generally have low RefCorr values (~0.4) and blanks with greater than 2% plasma or serum contamination were observed to have a RefCorr values of at least 0.6.
A flag in a sample indicates that the hybridization controls in that sample had elevated or decreased signal compared to the plate reference. Potential reasons for a high or low hybridization control signal include:
Issues with the sample itself. These include poor sample quality, protein loss during hybridization (e.g., due to elevated temperature) or other issues that arise during library prep. In this case, other failure modes will likely also be flagged, such as low Raw Counts, or high / low MedNormExt scale factors.
High plate-wide plasma / serum variability. Because Hyb Norm is performed using an internal reference, the composition of the plate can impact the scale factors of other samples. Therefore, a flag in one sample may reflect the quality of the other samples on the plate rather than issues with the sample itself (for example, many hemolyzed samples or blanks). In this case, other error modes for that sample are not expected to be flagged.
A flag in a calibrator indicates that the flagged calibrator replicate had a large discrepancy in signal when compared to the median of all calibrators within that plate. A single flagged calibrator will have limited impact on the plate due to use of median calibrator values in analysis. Multiple flagged calibrators could impact plate performance; reach out to Illumina Support for additional information.
A flag in a sample indicates that it had a large discrepancy in signal when compared to the external plasma or serum reference. Low signal may be caused by sample degradation or sample dilution; high signal may be caused by hemolysis. A difference in signal could also be caused by the diseased-state of the sample.
QC Percent in Tails and Calibration Percent in Tails are used to determine plate quality by examining individual SOMAmers in each well. A QC Percent in Tails failure will likely require a repeat library preparation or re-sequencing for this plate; contact Illumina Support for additional information.
Review the following table for more details.
Calibration Percent in Tails
QC Percent in Tails
Interpretation
WARNING
The median of the calibrator replicates for that plate were sufficiently different from the expected.
FAIL The median of the QC replicates for that plate were sufficiently different from the expected QC reference.
Likely a failed run
Some samples may individually pass but should not be used for downstream analysis due to the plate failure
One or more normalization steps after Calibration were unable to rescue the run
Could be caused by a plate-wide issue, including sequencing run failure, automation failure, or reagent issue
WARNING
The median of the calibrator replicates for that plate were sufficiently different from the expected.
PASS The median of the QC samples were sufficiently similar to expected values from QC reference.
Successful Run
One or more normalization steps after the Calibration step have successfully removed the differences observed in calibrators
PASS
The median of the calibrator samples were sufficiently similar to expected values from calibration reference.
FAIL The median of the QC replicates for that plate were sufficiently different from the expected QC reference.
Technically a failed run, but could be a false positive
Relatively rare combination
May indicate problem with only QC samples and not entire plate (e.g., edge effect)
There are two additional metrics on run quality that evaluates blank samples in a plate (see below). The general recommendation is that, if warnings are observed in one of these metrics, the plate should be used with caution in downstream analysis since there may have been sample contamination or elevated background. If issues with run quality are consistently seen, there may be issues with the assay set up. Contact Illumina Support for additional information or support.
A warning on a plate may indicate plate-wide increased background or contamination.
A warning on a plate may indicate plate-wide plasma or serum contamination.
There are a number of quality control checks that are applied on a plate and sample level. See the metrics appendix page for a summary of all metrics.
Minimum SOMAmer Read Counts: Non-blank samples with less than 10 million reads will receive a FLAG for SOMAmerReads_PassFlag in the ADAT. These reads are counted in the raw counts step. Only human protein SOMAmers are part of this count, not controls. There is no specification for blank samples.
Maximum SOMAmer Read Counts: Blank samples with more than 40 million normalized reads will receive a FLAG for SOMAmerNormRead_PassFlag in the ADAT. These reads are counted in the plate scale normalization step. Only human protein SOMAmers are part of this count, not controls. There is no specification for non-blank samples. A plate where 70% of the blanks have a FLAG for this step will receive a WARNING.
Reference Correlation: This step produces a Spearman correlation coefficient describing how similar a sample is to a an external Plasma or Serum reference (see below). Blank samples with a correlation to the reference greater than 0.6 will receive a FLAG for RefCorr_PassFlag in the ADAT. A plate where 70% of the blanks have a FLAG for this step will receive a WARNING.
The reference used in this step can be found in the SOMAmer metadata, under Ref.MedNormExt.Plasma.QC or Ref.MedNormExt.Serum.QC (dependent on the input type).
QC Check: This step compares the median of each SOMAmer measurement, across the three QC sample replicates, to an external QC reference. It then calculates a SOMAmer-specific scale factor and a QC metric (QCCheckTailPercent). QCCheckTailPercent corresponds to the percentage of SOMAmers with scale factors outside the specification range (0.8–1.2). If more than 15% of the scale factors are outside of the specification range, the plate receives a FAIL.
The references used in this step can be found in the SOMAmer metadata, under Ref.QCCheck.Plasma or Ref.QCCheck.Serum (dependent on the input type).
The DRAGEN Protein Quantification application is designed to perform counting and normalization for proteomics data from the Illumina Protein Prep pipeline. It converts data from the binary base call (BCL) files, generated by Illumina NovaSeq 6000 or NovaSeq X Series systems, into the normalized proteomic counts. Upon completion of sequencing, the application is automatically initiated for analysis on BaseSpace Sequence Hub (BSSH) or Illumina Connected Analytics (ICA).
The sections below exemplify how to configure the instrument(s) for autolaunching the secondary analysis, manually initiate an analysis, and requeue an analysis.
On your instrument, log into your workgroup and select the following run setup options:
Workflow: NovaSeq Standard
Read length: 15, 10, 10, 0
[NovaSeq 6000] Upload the v2 sample sheet generated by the BSSH Run Planner tool to the sequencing instrument.
[NovaSeq X Series] No action is required. The v2 sample sheet automatically displays on the instrument as a planned run.
Select the appropriate Illumina Protein Prep custom recipe for your sequencing instrument and flowcell. During the run, data is uploaded automatically to BSSH. Primary analysis and secondary analysis are completed automatically through autolaunch.
[Optional] Use the BaseSpace Sequence Hub to monitor the run from start to completion.
If manual mode sequencing is performed, or autolaunch is unsuccessful, there is an option to manually upload a completed run folder to BSSH and kick off the autolaunch analysis. To use this method, the samplesheet must be in the uploaded run folder, and be named "SampleSheet.csv".
This is a non comprehensive list of fields.
Header
FileFormatVersion
Must be "2"
Header
InstrumentPlatform
Must be "NovaSeqXSeries" or "NovaSeq"
Header
RunName
User-provided value
Header
RunDescription
User-provided value (optional)
Reads
Read1Cycle
15
Reads
Index1Cycle
10
Reads
Index2Cycle
10
Sequencing_Settings
LibraryPrepKits
Must be "IlluminaProteinPrep9k"
BCLConvert_Data
Sample_ID
Alphanumeric name up to 100 characters. Letters, numbers, dashes only (or any combination of letters, numbers, and dashes)
BCLConvert_Data
Index
i7 index sequence, including A, C, T or G letters, 10 nucleotides long
BCLConvert_Data
Index2
i5 index sequence, including A, C, T or G letters, 10 nucleotides long
BCLConvert_Data
Lane
Lane value (shall be a number between 1 and 8 inclusive) (optional if instrument is NovaSeq 6k, required if instrument is NovaSeq X)
Cloud_Proteomics_Settings (Cloud Analysis) or Proteomics_Settings (Local Analysis)
SoftwareVersion
Three-digit version of SW used in secondary analysis. For example, "2.0.0"
Cloud_Proteomics_Settings (Cloud Analysis) or Proteomics_Settings (Local Analysis)
StartsFromFastq
Must be "false"
Cloud_Proteomics_Settings (Cloud Analysis) or Proteomics_Settings (Local Analysis)
output_file_prefix
User-provided prefix
Cloud_Proteomics_Data (Cloud Analysis) or Proteomics_Data (Local Analysis)
Sample_ID
Alphanumeric name up to 100 characters. Letters, numbers, dashes only (or any combination of letters, numbers, and dashes)
Cloud_Proteomics_Data (Cloud Analysis) or Proteomics_Data (Local Analysis)
PlateBarcode
For each plate, associated plate barcode
Cloud_Proteomics_Data (Cloud Analysis) or Proteomics_Data (Local Analysis)
MatrixTubeBarcode
For each plate, associated matrix tube barcode
Cloud_Proteomics_Data (Cloud Analysis) or Proteomics_Data (Local Analysis)
BatchID
For each plate, user-provided batchID
Cloud_Proteomics_Data (Cloud Analysis) or Proteomics_Data (Local Analysis)
InputType
For each sample, associated input type. Examples: Plasma_Calibrator, Plasma_QC, Plasma, Serum_Calibrator, Serum_ QC, Serum, Blank
Cloud_Proteomics_Data (Cloud Analysis) or Proteomics_Data (Local Analysis)
ControlID
ID of the calibrator, QC, and blank lot (applied to controls only).
Cloud_Proteomics_Data (Cloud Analysis) or Proteomics_Data (Local Analysis)
ProbePlate
For each plate, probe plate lot number, from library prep
Cloud_Proteomics_Data (Cloud Analysis) or Proteomics_Data (Local Analysis)
SOMAmerBeadPlate
For each plate, SOMAmer Bead Plate lot number, from library prep
Cloud_Proteomics_Data (Cloud Analysis) or Proteomics_Data (Local Analysis)
WellPosition
For each sample, well position (A1-H12)
Cloud_Settings
GeneratedVersion
Software version of the BSSH Run Planner tool that generated the sample sheet
Cloud_Settings
Cloud_Proteomics_Pipeline (Cloud Only)
ICA Path to the proteomics pipeline.
For example: urn:ilmn:ica:pipeline:6475f989-f9fc-4874-9ff6-10ac23d11681#DRAGEN_Protein_Quantification_2-0-0
Cloud_Data
Sample_ID
Alphanumeric name up to 100 characters. Letters, numbers, dashes only (or any combination of letters, numbers, and dashes)
Cloud_Data
ProjectName
(optional) user-provided plate-specific project
Cloud_Data
LibraryName
For each sample, must be <Sample_ID>_<index>_<index2>
Cloud_Data
LibraryPrepKitName
Must be "IlluminaProteinPrep9k"
Cloud_Data
IndexAdapterKitName
Must be "IlluminaDNARNAUDISetABCDTagmentation_Proteomics"
Examples of local and cloud sample sheets for NovaSeq 6000 and NovaSeq X are attached to this page.
DRAGEN Protein Quantification can be run locally after the installation of the local solution on a DRAGEN phase 4 server by an FAS.
To initiate a run:
The --analysisFolder
parameter is optional. If no path is provided, output files will be put in a subdirectory of the folder where the command is being run.
The -s
parameter is optional if the sample sheet file is included in the run folder.
For details on the parameters used with the script, execute the following command.
To view primary metrics:
Go to the relevant BaseSpace Sequence Hub (BSSH) workgroup.
Navigate to the "Runs" tab and select the run.
The "Summary" tab gives an overview of the sequencing run quality (e.g., average %Q30, %PF Yield).
Navigate to "Metrics" for detailed per lane information on all sequencing metrics.
To view the analysis associated with a specific sequencing run:
Navigate to the "Summary" tab of the relevant run.
Click on the link below "Latest Analysis" (displaying the results from the latest analysis processed to the run data). For re-queued/re-analyzed runs, the previously completed analyses can be found under the "Prior Analysis" section
Click on "Reports" and find the quality metrics associated with the secondary analysis on the run data.
Note: Those who have access to an ICA account and want to view results on ICA may either click on "View Files in ICA" in the top right corner of your BSSH Analysis page or directly access the analysis in ICA. The secondary analysis results in ICA will be in a BSSH-managed project with the same name as the BSSH workgroup where the analysis was performed.
For additional information on autolaunch, refer to the page.
Follow the instructions from the for to BSSH.
Follow the instructions described in .
A sample sheet is required to kick off secondary analysis. It can be made either using the BSSH Run Planner Tool (recommended), Excel Sample Sheet Generator, or manually. The following table describes the sample sheet fields and its values depending on the environment used to execute the DRAGEN Protein Quantification application. The pipeline can be executed via cloud using either or , or executed locally using a phase 4 .
For additional information, refer to the ICS .
For further information on tracking and viewing run and analysis results in BaseSpace Sequence Hub, refer to the BSSH documentation.
DRAGEN Reports is an HTML report that provides a quick overview of the quality of an E2E analysis. The report consists of three sections, which are displayed as tabs in the report. The following sections describe the tabs.
This section is subdivided into the following subsections:
Reagent Lot Summary: This table describes the reagents used per plate in the analysis.
Plate QC Summary: This table provides the plate level metrics, including Calibration % in Tails, QC % in Tails, Reference Correlation, and Blank Background metrics.
Calibration Scale Factors: This histogram illustrates the distribution of calibration scale factors for a given plate.
QC Scale Factors: This histogram illustrates the distribution of QC scale factors for a given plate.
This section of the report contains information on which samples passed or flagged specifications as well as SOMAmer count yield per sample. It is subdivided into the following subsections:
Sample QC Summary: The table describes the percentage of samples (organized by sample type) that passed Quality Control.
QC Summary: The heatmap shows the QC status of samples based on their position in the plate wells.
Flagged Samples: The table identifies samples that failed one or more sequencing or normalization specifications.
SOMAmer Read Counts: This graph represents the SOMAmer count for each sample included in the analysis.
This report section details the QC metrics and normalization steps.
DRAGEN Protein Quantification produces the following key output files in BaseSpace Sequence Hub:
DRAGEN_Protein_Quantification_<SW Version>
<project> (if no projects are provided, there shall be one folder titled with the RunName)
adat
<output file prefix>_FinalNormStep_MedNormExt.adat: This file contains the final normalized counts for the samples in this project.
other_normalization_steps: This folder contains all intermediate normalized ADATs with their respective counts.
quality metrics
<output file prefix>_run_qc_stats.csv: This file provides a summary of per sample run qc statistics.
DRAGEN Report
DRAGEN Report: Folder containing all projects DRAGEN report file/s
Extra Files
ICA Logs: Folder containing information on ICA logs
The example below contains an analysis configured with two Projects, "1_x" and "0_x". Analysis and outputs are processed for each project individually (check for details).
Sample level metrics are found in the sample/row-level metadata in the ADAT.
Minimum SOMAmer Read Count
SOMAmerReads; SOMAmerReads_PassFlag
Non-Blank Samples
SOMAmerReads > 10 million
PASS/FLAG
Maximum SOMAmer Normalized Read Count
SOMAmerNormReads; SOMAmerNormReads_PassFlag
Blank Samples
SOMAmerNormReads < 40 million reads
PASS/FLAG
Hybridization Normalization
HybNorm_1_ScaleFactor; HybNorm_PassFlag
Non-Blank Samples
0.4 ≤ HybNorm_1_ScaleFactor ≤ 2.5
PASS/FLAG
Internal Reference Median Normalization
MedNormInt_5e-05_ScaleFactor; MedNormInt_0.005_ScaleFactor; MedNormInt_0.2_ScaleFactor; MedNormInt_PassFlag
Individual Blank and Calibrator samples, split by dilution group
0.4 ≤ MedNormInt_5e-05_ScaleFactor ≤ 2.5 0.4 ≤ MedNormInt_0.005_ScaleFactor ≤ 2.5 0.4 ≤ MedNormInt_0.2_ScaleFactor ≤ 2.5
PASS/FLAG
External Reference Median Normalization
MedNormExt_5e-05_ScaleFactor; MedNormExt_0.005_ScaleFactor; MedNormExt_0.2_ScaleFactor; MedNormExt_PassFlag
Individual non-control and QC samples, split by dilution group
0.4 ≤ MedNormExt_5e-05_ScaleFactor ≤ 2.5 0.4 ≤ MedNormExt_0.005_ScaleFactor ≤ 2.5 0.4 ≤ MedNormExt_0.2_ScaleFactor ≤ 2.5
PASS/FLAG
Correlation with Reference
RefCorr; RefCorr_PassFlag
Blank Samples
RefCorr < 0.6
PASS/FLAG
TM Controls
EmpiricalHybTemp
All samples
N/A
N/A
Row Check
RowCheck_PassFlag
All samples
The row check shall be PASS is all Pass Flags in this row are PASS; otherwise it shall be FLAG
PASS/FLAG
SOMAmer level metrics are found in the SOMAmer/column-level metadata in the ADAT.
These PASS/FAIL designations are only used for plate level quality control. They should not be used to assess SOMAmer quality.
Platform Specific Calibration
PlatformSpecificCalibrate_<PlateBarcode>_ScaleFactor; PlatformSpecificCalibrate_<PlateBarcode>_PassFlag
0.6 < PlatformSpecificCalibrate_<PlateBarcode>_ScaleFactor < 1.4
PASS/FAIL
Cross Platform Calibration
CrossPlatformCalibrate_<PlateBarcode>ScaleFactor;
CrossPlatformCalibrate<PlateBarcode>_PassFlag
0.6 < PCrossPlatformCalibrate_<PlateBarcode>_ScaleFactor < 1.4
PASS/FAIL
QC Check
QCCheck_<PlateBarcode>ScaleFactor; QCCheck_<PlateBarcode>_PassFlag
0.8 < QCCheck_<PlateBarcode>ScaleFactor < 1.2
PASS/FAIL
Plate level metrics are found in the header metadata in the ADAT.
QC Percent in Tails
QCCheckTailPercent; QCCheckTailPercent_PassFlag
QCCheckTailPercent < 15%
PASS/FAIL
Calibration Percent in Tails
PlatformSpecificCalibrateTailPercent; PlatformSpecificCalibrateTailPercent_PassFlag
PlatformSpecificCalibrateTailPercent < 15%
PASS/WARNING
SOMAmer Normalized Reads
PlateSOMAmerNormReads_PassFlag
< 70% of blank samples flagged at SOMAmerNormReads
PASS/WARNING
Reference Correlation
PlateRefCorr_PassFlag
< 70% of blank samples flagged at RefCorr
PASS/WARNING
Platform Specific Plate Scale
PlatformSpecificPlateScale_ScaleFactor
N/A
N/A
Cross Platform Plate Scale
CrossPlatformPlateScale_ScaleFactor
N/A
N/A
Cross Platform Calibration
CrossPlatformCalibrationTailPercent
N/A
N/A
ADATs can be analyzed in R or Python using parsers created by Somalogic.
DRAGEN Protein Quantification v2.0.0 is compatible with SomaData v1.0.0 (Python - formerly called Canopy) and SomaDataIO v6.1.0 (R).
Somadata creates an ADAT object, which is an extension of a Pandas DataFrame.
rows correspond to samples
columns correspond to SOMAmers
values are normalized counts
Below are examples on parsing an ADAT in Python and R.
Illumina Connected Multiomics provides interactive visualizations and powerful statistics.
For product information, please reference the ICM product documentation
This is a walkthrough of an analysis that could be done on ICM with an example proteomic data set, produced by DRAGEN Protein Quantification. It covers the following features:
Adding samples to a study
Filtering samples
Data Transformation
t-SNE
PCA
Hierarchical clustering and creating heatmaps
Differential expression
Gene set enrichment analysis
Create two sample groups - one for female samples, and one for male samples. Filter by 'Sex', and select 'F' only. At the top of the page, click on ‘+ Create sample group’ to create a sample group.
Type in a group name and click on the ‘Create’ button. Then, change the filter from 'F' to 'M', and create another sample group.
Navigate to the ‘Sample Group’ tab and confirm the sample groups.
Click on ‘+ New Analysis’ to create a new analysis.
In the pop-up window, provide a name for the analysis, select ‘Advanced Analysis’ as the Analysis Type, choose the sample group to be included in the analysis ('All ADAT Samples' will be selected by default), and click on the ‘Run Analysis’ button.
Note: make sure there are no duplicated Sample IDs in the analysis groups.
A pop-up message will show up if the analysis creation is successful.
Refresh the page to get the latest status of the analysis.
When the Status is ‘Complete’, click on the analysis tile to enter the analysis module.
Click on the ‘Sex in F, M’ node generated from the previous step; select ‘Normalization and scaling’ > ‘Normalization’ in the right-hand-side Quantification toolbar.
In the normalization view, drag and drop ‘Add’ and ‘Log’ from the ‘Available methods’ column to the 'Selected methods' column. Select 10.0 in the dropdown menu for ‘Log’, and click on the ‘Finish’ button.
Click on the ‘Normalized counts’ node generated from the data transformation step, and select ‘Exploratory analysis’ > ‘PCA’ in the right-hand-side toolbar.
In the PCA view, adjust the corresponding PCA parameters and click on the ‘Finish’ button.
Compute biomarkers contributing to sex variable:
Click on the ‘PCA’ node generated from the previous step, then select ‘Statistics’ > ‘Compute biomarkers’.
In the compute biomarkers view, select corresponding parameters and click on the ‘Finish’ button. Double click on 'Biomarkers' to explore further.
Click on the ‘PCA’ node generated from the previous step, and select ‘Exploratory analysis’ > ‘t-SNE’ in the right-hand-side toolbar.
In the t-SNE view, select corresponding parameters and click on the ‘Finish’ button. Select the t-SNE node for additional information.
Click on the ‘Normalized counts’ node generated from the data transformation step. In the right-hand-side toolbar, select ‘Statistics’ > ‘Differential analysis’.
In the differential analysis view, select a method (using Limma-trend in this example) and click on the ‘Next’ button.
Select the factors to be included in the model and click on the ‘Add factors’ button. Confirm the selected factors show up in the ‘Selected factor(s)’ section, and proceed to the ‘Next’ button.
Note: For paired or longitudinal analysis, check ‘Random’ column for certain variables to make them a random effect in the mixed effect model.
If a categorical factor is selected (for example, 'Sex'), the next step will be to 'Define comparisons'. Add ‘F’ to the Numerator box and add ‘M’ to the Denominator box. Click ‘Add comparison’ button to create this comparison.
The specified comparisons will appear in the ‘Comparisons’ section. Click on the ‘Finish’ button.
Filtering significant differential analysis results
Click on the ‘F vs M’ node generated from the differential analysis step, and in the right-hand-side of toolbar, select ‘Filtering’ > ‘Differential analysis filter’.
Select ‘Metadata’ as the filter type, and specify the filter criteria. In this example, include features with false discovery rate (FDR) less than 0.05 for the Female vs Male comparison. Double click the filtered feature list node to view the Limma-trend report.
Click on the ‘Filtered feature list’ node generated from the filtering differential analysis results step, and select ‘Exploratory analysis’ > ‘Hierarchical clustering / heatmap’ from the right-hand-side toolbar.
In the Hierarchical clustering / heatmap view, select ‘Assign order’ by Sex in the Sample order section, and click on the ‘Finish’ button.
Double click on the ‘Hierarchical clustering / heatmap’ node generated from the previous step to view the heatmap with hierarchical clustering.
In the left-hand-side toolbar, select ‘Annotations’ and add 'Sex' as a row annotation.
View the heatmap - two major clusters can be observed: a cluster of protein signals with higher abundance in females compared to males, and a cluster with higher abundance in males compared to females.
Select ‘Save as’ in the left-hand-side toolbar to save the heatmap to the data viewer.
Click on the ‘Filtered feature list’ node generated from the filtering differential analysis results step, and select ‘Biological interpretation’ > ‘Gene set enrichment’ in the right-hand-side toolbar.
Select ‘KEGG data base’ in the Database section and select a corresponding library version. If none are available, click "New library" and the respective species. Then, click on the ‘Finish’ button.
Python:
R:
Filter Samples and Create Sample GroupsFilter out samples that were flagged during QC during secondary analysis by filtering with the ‘Row Pass’ filter. Optionally filter out samples with other metadata by selecting the corresponding inclusion/exclusion criteria in the right-hand-side ‘Filters’ tab.​​
​​
​​
​​
​​
For additional information on PCA, review the following documentation:
For additional information on t-SNE, view the following documentation:
For additional information on hiearcharical clustering, view the following documentation:
Double click on the ‘Pathway enrichment’ node generated from the previous step to check the analysis results. Options to download the result table and gene sets are available.
Click on the hyperlinks in the ‘Gene set’ column to view the related genes in a certain pathway.
To view a subset of pathways in the data viewer, filter the pathways further (for example, by filtering for pathways with a P-values of less than 5e-8) and repeating the above gene set enrichment steps on the filtered list. There will be an option to use the data viewer.
March 2025
Initial Release
How can I share data with a collaborator in BSSH?
I sequenced in manual mode/need to kick of autolaunch after sequencing is complete. How can I do this?
Use the BSSH CLI to upload the run folder to BSSH. Make sure the samplesheet is named "SampleSheet.csv" and you are using an up-to-date version of the tool - at least v1.6.1.
For additional information, see the following documentation:
How do I download/export the samplesheet?
Go to the BSSH Run Planner:
Go to BSSH, select the intended workgroup on the top right, and then select the Runs tab
For a new run:
Select the "New Run" button on the right and plan a new run with the desired pipeline.
After ingesting the IPPAS output files in the BSSH Run Planner, the user will reach the following Run Review page:
Select the "Export" option on the bottom right to download the samplesheet created
Select the "Save as Draft" or "Save as Planned" options to save this run as a draft or planned run respectively\
For an "Active" or "Planned" run:
Select the intended tab mentioned under the Runs header.
Select the required run from the list of runs:
Next, click File > Download > Sample Sheet:
How can I requeue a run on BSSH with a new samplesheet/new version of the pipeline?
Go to the BSSH Run Planner tool and plan a new run with the desired pipeline.
Save the exported samplesheet (see the above step for additional information).
Go to the original run, and click Status > Requeue > Planned Run.
Select "Use a new Sample Sheet".
Upload the new sample sheet.
On the next page, click "requeue".
Can multi-analysis by project split samples on the same plate?
No. All samples on the same plate must have the same project.
What is DRC_Level and why is this used?
DRC stands for Dynamic Range Compression. It is used to even out the SOMAmer concentrations from a 5-log dynamic range to 2-log. Without DRC, the most abundant SOMAmers would occupy the majority of sequencing readout capacity. Each sample would require extremely high sequencing depth to cover the unabundant SOMAmers and accurately quantify them. In using DRC, probes are grouped based on their observed abundances under no DRC condition, and each group is compressed by a different set ratio. The compression level of each SOMAmer is included in the DRC_Level
row of SOMAmer metadata.
What is the LoD, and what does it indicate?
LoD stands for Limit of Detection. Each SOMAmer has an LoD for both Plasma and Serum. The LoD is a noise floor for that SOMAmer's fully normalized (MedNormExt) counts. In other words, normalized counts above the SOMAmer's LoD reflect the relative abundance of that SOMAmer's target protein, whereas normalized counts below its LoD may reflect assay uncertainty.
How should LoD be used to discern signal?
The normalized count of a given protein is considered signal if it is above the LoD for that protein for it's given sample type.
How do Somalogic and Illumina metrics compare?
Metrics and Normalization steps with different names between Somalogic's tool and Illumina's DRAGEN Protein Quantification. See the table below for the approximate mapping.
Raw RFU
Raw
Hyb Normalization
HybNorm
medNormInt
MedNormInt
plateScale
PlatformSpecificPlateScale
Calibration
PlatformSpecificCalibrate
-
CrossPlatformPlateScale,
Applied to normalize data across instruments
-
CrossPlatformCalibrate
Applied to normalize data across instruments
-
MedNormExt
anmlQC
-
qcCheck
QCCheck
anmlSMP
-
Filtered
-
Where are the FASTQs?
DRAGEN Protein Quantification does not produce FASTQs. By utilizing DRAGEN Counting, the software uses BCL Convert to demultiplex and count proteins per sample at the same time. This reduces analysis time but also removes FASTQ as a pipeline output.
What if the PF or Q30 values are lower than expected?
First, check secondary metrics. If secondary metrics are passing, it's acceptable to proceed with further analysis. If secondary metrics have warnings or failures, consider re-pooling or repeating the dilute and denature and sequencing of the saved pool. If the data still looks poor, reach out to tech support for further guidance.
What organisms does the SOMAmer metadata cover in the 9.5k assay?
Human
10326
Mouse
230
Gila monster
3
Hornet
3
Jellyfish
3
African clawed frog
3
Thermus thermophilus
3
European elder
2
Common eastern firefly
2
E. coli
1
HIV-1
1
HIV-2
1
Bacillus stearothermophilus
1
Ensifer meliloti
1
Red alga
1
The primary output file of DRAGEN Protein Quantification is the ADAT. One ADAT is produced for each normalization step performed. The final ADAT produced (<output_file_prefix>_Step7_FinalNormStep_MedNormExt.adat) has the normalized counts at the end of the full normalization process.
There are four key components to the ADAT: the header, SOMAmer metadata, sample metadata, and counts.
Version
Version of the software used during analysis
Title
User reference for the anlaysis
SOMAmerReferenceSource
SOMAmer metadata version used during analysis
AssayType
Assay types used
AssayVersion
Version of the Illumina Protein Prep assay
AssayRobot
Liquid handling robot
RunId
Unique identifier for the run (created by sequencing instrument)
Instrument Type
Instrument used for sequencing
Flowcell
Flowcell used for sequencing
YieldDemux
Total number of reads that are demultiplexed
YieldQ30Demux
Total number of reads that are demultiplexed with a passing QC score
Q30WeightedMean
Q30 primary sequencing metric, comuted as the Q30 weighted mean across lanes
CreatedDate
Date the run was performed
StudyOrganism
Sample organism (human)
StudyMatrix
Matrix used in study (serum or plasma)
CalibratorId
Lot number(s) of the calibrator samples
ProcessSteps
The process steps that occurred to produced the counts in the current ADAT
PlateRefCorr_PassFlag
PASS/FLAG for each plate. If greater than 70% of Blank Samples have RefCorr_PassFlag = FLAG, PlateRefCorr_PassFlag shall be WARNING. Otherwise, it shall have the value "PASS".
PlateSOMAmerNormReads_PassFlag
PASS/FLAG for each plate. If greater than 70% of Blank Samples have SOMAmerNormReads_PassFlag = FLAG, PlateSOMAmerNormReads_PassFlag shall be WARNING. Otherwise, it shall have the value "PASS".
PlatformSpecificPlateScale_ScaleFactor
Scale factors for the PlateScale normalization step, when compared to a platform specific reference. The reference is based on data from the sequencing instrumented used for the run.
PlatformSpecificCalibrateTailPercent
Percent of Platform Specific Calibration scale factors in tails (outside of the acceptable range of 0.6-1.4) for each plate when compared to a reference. The platform specific reference is based on data from the sequencing instrument used for the run.
PlatformSpecificCalibrateTailPercent_PassFlag
PASS/WARNING for each plate. If PlatformSpecificCalibrateTailPercent for a plate is greater than .15, this value shall be "WARNING". If it's less than or equal to .15, it shall be "PASS".
CrossPlatformPlateScale_ScaleFactor
Scale factors for the PlateScale normalization step, when compared to a universal reference made with NovaSeq X data.
CrossPlatformCalibrateTailPercent
Percent of Cross Platform Calibration scale factors in tails (outside of the acceptable range of 0.6-1.4) for each plate when compared to a reference. The cross platform reference is based on data from NovaSeq X runs.
QCCheckTailPercent
Percent of QC scale factors in tails for each plate.
QCCheckTailPercent_PassFlag
PASS/FAIL for each plate. If QCCheckTailPercent for a plate is greater than .15, this value shall be "FAIL". If it's less than or equal to .15, it shall be "PASS".
MedianPercentCV
Median Percent CV metric for each plate, providing:
(sample Standard Deviation of calibrator samples/Mean of calibrator samples)*100
(sample Standard Deviation of QC samples/Mean of QC samples)*100
MedianS2B
Median Signal to Background metric for each plate, providing:
Median Calibrator/Median Blank
Median QC/Median Blank
GeneratedBy
Version of SomaData parser used to write ADAT (ex. SomaData_1.0.0)
SampleID
Sample identifier
PlateId
Unique plate identifier
MatrixTubeBarcode
Matrix tube barcode scanned during library prep
BatchID
User-provided batch identifier
InputType
Sample type specified in manifest file.
Examples: Plasma_Calibrator, Plasma_QC, Plasma, Serum_Calibrator, Serum_ QC, Serum, Blank
MatrixTube
Matrix used.
Examples: Plasma, Serum
SampleType
Type of sample processed.
Examples: Plasma, Serum, Blank, QC, Calibrator
ControlID
ID of the calibrator, QC, or blank lot (applied to controls only).
ProbePlate
Probe plate lot number, from library prep
SOMAmerBeadPlate
SOMAmer bead plate lot number. Last two digits indicate the master mix lot number.
WellPosition
Location of the sample on the 96 well plate (A1-H12)
Project
(Optional) User-provided project identifier
SOMAmerReads
Number of raw counts human SOMAmer reads (excluding control reads)
SOMAmerReads_PassFlag
This flag indicates whether a non-blank sample meets specifications for number of SOMAmer reads. For non-blank samples:
SOMAmerReads ≥ 10 million = PASS
SOMAmerReads < 10 million = FLAG
Not applied to blank samples.
SOMAmerNormReads
Number of normalized counts human SOMAmer reads at plate scale normalization step (excluding control reads)
SOMAmerNormReads_PassFlag
This flag indicates whether a blank sample meets the specifications for number of normalized SOMAmer reads. For blank samples:
SOMAmerNormReads ≤ 40 million = PASS
SOMAmerNormReads > 40 million = FLAG
Not applied to non-blank samples.
RefCorr
Spearman correlation value for comparing samples to the plasma or serum reference.
RefCorr_PassFlag
This flag indicates if a blank sample meets the specification for comparison to a serum or plasma reference. For blank samples:
RefCorr < 0.6 = PASS
RefCorr > 0.6 = FLAG
Not applied to non-blank samples.
EmpericalHybTemp
This indicates the actual hybridization temperature of a sample, obtained empirically from a set of 78 temperature controls.
HybNorm_1_ScaleFactor
The hybridization control scale factor
HybNorm_PassFlag
This flag indicates whether the HybNorm_1_ScaleFactor is within the specified acceptance criteria range of 0.4–2.5.
MedNormInt_5e-05_ScaleFactor
The MedNormInt scale factor for the 0.005% dilution group
MedNormInt_0.005_ScaleFactor
The MedNormInt scale factor for the 0.5% dilution group
MedNormInt_0.2_ScaleFactor
The MedNormInt scale factor for the 20% dilution group
MedNormInt_PassFlag
This flag indicates whether all dilution group scale factors are within the specified acceptance criteria range of 0.4–2.5 for the MedNormInt step.
MedNormExt_5e-05_ScaleFactor
The MedNormExt scale factor for the 0.005% dilution group
MedNormExt_0.005_ScaleFactor
The MedNormExt scale factor for the 0.5% dilution group
MedNormExt_0.2_ScaleFactor
The MedNormExt scale factor for the 20% dilution group
MedNormExt_PassFlag
This flag indicates whether all dilution group scale factors are within the specified acceptance criteria range of 0.4–2.5 for the MedNormExt step.
RowCheck_PassFlag
This flag indicates whether all row scale factors are within the specified acceptance criteria range.
SeqId
Unique sequence identifier for the SOMAmer
SeqIdVersion
SOMAmer sequence version
SomaId
Somalogic-provided identifier
Target
Protein target identifier
TargetFullName
Protein target full name
Type
Target type (protein)
UniProt
UniProt identifier
EntrezGeneID
Entrez Gene identifier
EntrezGeneSymbol
Entrez Gene symbol
HybControl
True is the SOMAmer is a hyb control, else False
LoD.Plasma
Limit of detection for plasma SOMAmers
LoD.Serum
Limit of detection for serum SOMAmers
Organism
Organism (e.g. Human, Mouse) of the SOMAmer
PlatfromSpecificCalibrate_<PlateId>_PassFlag
Pass Flag for Platform Specific Calibration
PlatformSpecificCalibrate_<PlateId>_ScaleFactor
Scale Factor for Platform Specific Calibration
CrossPlatformCalibrate_<PlateId>_ScaleFactor
Scale Factor for Cross Platform Calibration
QCCheck_<PlateId>_PassFlag
Pass Flag for QC Check
QCCheck_<PlateId>_ScaleFactor
Scale Factor for QC Check
Dilution
Dilution group classification for the SOMAmer
DRC_Level
Compression of SOMAmer range that occurred during the assay.
References
References used in the analysis.
Units
Units of matrix content
See the following documentation:
Additional information on the BSSH Run Planner: