๐ŸŒ€to set up and run an analysis

  1. Launch the DRAGEN Microbial Enrichment Plus BaseSpace application. The application is found in the "DRAGEN" dropdown and the "Infectious Disease and Microbiology" dropdown in BaseSpace.

  2. The DRAGEN Microbial Enrichment Plus only supports Biosample inputs.

  3. After choosing a name for the Analysis, choose either โ€œBiosampleโ€ or โ€œProjectโ€ as input type. Selecting โ€œProjectโ€ will result in all biosamples in the selected project being analyzed.

  4. Select the enrichment panel library from the dropdown. Only one panel can be selected per analysis. There is no need to open the "Custom panel specification" tab if selecting one of the pre-set panels. Note that the user must first select the Enrichment Panel for the appropriate downstream analysis options to populate.

  5. Under "Enrichment Panel Microorganism Reporting List" the default option is "All microorganisms". Optional: the microorganism reporting list can be subset from the full set of organisms targeted by the selected panel if desired. If the user is analyzing with the RPIP or UPIP panel, they can select "Pre-defined specification" or "User-defined specification". For VSP, VSP V2 or RVOP/RVEK, there is no "Pre-defined specification" option and only "User-defined specification" is available. If selecting "User-defined specification", the following steps must be taken. We recommend pre-loading your "User-defined specification" file before starting the analysis.

    • Download the template with all the microorganisms as a starting point.

    • Remove any rows of organisms that are not wanted in the analysis/report

    • DO NOT remove any columns or alter their names

    • DO NOT delete the header row or alter the names of the header row

    • Optional but recommended: Rename the file to indicate it is altered. Note that in BaseSpace the file name is text before a period.

    • Upload the file to a BaseSpace Project

    • Once the data file is uploaded, select the "Dataset File(s)" under the "User-defined specification" tab **Once uploaded to a project, the user can reuse the same "User-defined specification" sheet across different analyses.

  6. Analysis Options:

  • Perform read QC (Quality Control)

    • If checked, reads are pre-processed using quality metrics before analysis.

    • If unchecked, read quality metrics are calculated, but reads are not trimmed or filtered before analysis.

  • Report bacterial AMR markers only

    • If checked, only bacterial AMR markers but no microorganisms are reported

    • This option is disabled if RVOP, VSP, VSP V2 or Custom Panel is selected

    • This option is disabled if the 'Report bacterial AMR markers only when an associated microorganism is reported' option is enabled

  • Report bacterial AMR markers only when an associated microorganism is reported

    • If checked, detected bacterial AMR markers are reported if the bacterial AMR marker passes a minimum reporting threshold and one or more associated microorganisms are also detected and reported

    • If unchecked, detected bacterial AMR markers are reported if the bacterial AMR marker passes a minimum reporting threshold

    • This option is disabled if RVOP, VSP, VSP V2 or Custom Panel is selected

  • Report microorganisms and/or AMR markers that are below threshold

    • If checked, microorganisms and/or AMR markers below reporting thresholds are included in reports

    • If unchecked, only microorganisms and/or AMR markers above reporting thresholds are included in reports

    • This option is disabled if Custom Panel is selected

    • This option is disabled if the 'Report bacterial AMR markers only when an associated microorganism is reported' option is enabled

  1. The 'Read classification sensitivity' default value is 5. This is a pre-filtering step only valid for VSP, VSP v2 or RVOP/RVEK. Decreasing the default may considerably slow the run time.

  2. Pangolin is by default Enabled, expect if analyzing the UPIP panel. Note that Pangolin only runs if SARS-CoV-2 is detected.

  3. Nextclade is by default Disabled. This can be Enabled. Note that Nextclade only runs if the viruses listed below are detected:

  • Human immunodeficiency virus 1 (HIV-1)

  • Human respiratory syncytial virus A (HRSV-A)

  • Human respiratory syncytial virus B (HRSV-B)

  • Influenza A virus (H1N1pdm09)

  • Influenza A virus (H3N2)

  • Influenza A virus (H5N1)

  • Influenza A virus (H5N6)

  • Influenza A virus (H5N8)

  • Influenza B virus (B/Victoria/2/87-like)

  • Influenza B virus (B/Yamagata/16/88-like)

  • Monkeypox virus (MPV)

  • Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2)

  1. Select an Internal Control (IC). The default is none. Note that RVOP/RVEK and VSP do not have internal controls in their panel. VSP V2 can be analyzed with the following ICs - Enterobacteria Phage T7, Escherichia virus T4, Escherichia Virus MS2, or Armored RNA Quant Internal Process Control. The IC must be added to the workflow before library preparation - otherwise select "None" for IC. The Internal Control concentration needs to be normalized and in the correct format.

  2. Select the Project where the Analysis Output should be saved.

Running analysis on Custom enrichment Panels or Custom reference databases

Users may define their own reference database through the Custom Panel analysis option. This option can be used to analyze samples enriched with standard Illumina Infectious Disease and Microbiology panels (RPIP, UPIP, VSP, VSP V2, RVOP/RVEK) if the user wishes to use a specific set of references rather than the built-in databases for each of these panels. This option also enables users to analyze samples enriched with other targeted enrichment kits through the DRAGEN Microbial Enrichment Plus app.

  1. Launch the DRAGEN Microbial Enrichment Plus BaseSpace application. The application is found in the "DRAGEN" dropdown and the "Infectious Disease and Microbiology" dropdown in BaseSpace.

  2. The DRAGEN Microbial Enrichment Plus only supports Biosample inputs.

  3. After choosing a name for the Analysis, choose either โ€œBiosampleโ€ or โ€œProjectโ€ as input type. Selecting โ€œProjectโ€ will result in all biosamples in the selected project being analyzed.

  4. Select the "Custom Panel" option in the Enrichment Panel dropdown.

  5. A custom reference FASTA must be uploaded to BaseSpace before launching the analysis. See https://help.basespace.illumina.com/manage-data/import-data for more information about importing files into BaseSpace. Information on formatting the FASTA file is provided here.

  6. A BED file is optional. If provided, the BED file must also be uploaded to BaseSpace before launching the analysis. Information on formatting the bed file is provided here..

  7. The only Analysis Options for a custom panel is the ability to turn on or off read QC.

  • Perform read QC (Quality Control)

    • If checked, reads are pre-processed using quality metrics before analysis.

    • If unchecked, read quality metrics are calculated, but reads are not trimmed or filtered before analysis.

  1. Pangolin will run on custom reference sequences with at least 3% coverage that meet these naming conventions:

  • If only a FASTA file is provided, Pangolin will run on sequences that have a header containing either SARS-CoV-2 or NC_045512

  • If both a FASTA and BED file are provided, Pangolin will run on sequences where the first column (chrom) contains NC_045512 or the fourth column (genomeName) contains SARS-CoV-2

  1. Nextclade DOES NOT work for custom panel analysis - if this is "Enabled" it will not run.

  2. For Custom Analysis, the only option for "Internal Control" is "NONE".

  3. Select the Project where the Analysis Output should be saved.

Last updated