End-to-End User Flow: DRAGEN Analysis
This tutorial demonstrates major functions of the ICA platform, beginning with setting up a project with instrument run data to prepare to use pipelines, and concluding with viewing pipeline outputs in preparation for eventually ingesting outputs into available modules.
In the following example, we start from an existing ICA project with demultiplexed instrument run data (fastq), use the DRAGEN Germline pipeline, and view output data.
Set-up
This tutorial assumes you already have an existing project in ICA. To create a new project, please see instructions in the Projects page.
Additionally, you will need the DRAGEN Demo Bundle linked to your existing ICA project. The DRAGEN Demo Bundle is an an entitled bundle provided by Illumina with all standard ICA subscriptions and includes DRAGEN pipelines, references, and demo data.
Linking an Entitled Bundle
For general steps on creating and linking bundles to your project, see the Bundles page. This tutorial explores the DRAGEN Germline Published Pipeline, so we will need to link the DRAGEN Demo Bundle to our existing project.
Steps:
Go to your project's
DetailspageClick the
EditbuttonClick the
+button, under LINKED BUNDLESClick on the
DRAGEN Demo Bundle, then click theLink BundlesbuttonThere may be multiple versions of DRAGEN Demo Bundles. This tutorial details steps for
DRAGEN Demo Bundle 3.9.5.Steps for other versions after 3.9.5 should be similar.
Click the
Savebutton
DRAGEN Demo Bundle assets should be available now in your projects Data and Pipelines pages.
Pipelines
After setting up the project in ICA and linking a bundle, we can run various pipelines.
DRAGEN Germline Published Pipeline
This example demonstrates how to run the DRAGEN Germline Published Pipeline (version 3.9.5) in your ICA project using the demo data from the linked DRAGEN Demo Bundle.
The required pipeline input assets for this tutorial include:
Under
DatapageIllumina DRAGEN Germline Demo Data folder
Illumina DRAGEN Enrichment Demo Data folder
Illumina References folder
Under
PipelinespageDRAGEN Germline
Launching the DRAGEN Germline Published Pipeline with Demo Data
From the Pipelines page, select DRAGEN Germline 3.9.5, and then click Start Analysis. Initial set-up details require a User Reference (pipeline run name meaningful to the user) and an Entitlement Bundle from the drop-down menu under Pricing.
Running the DRAGEN Germline pipeline uses the following inputs which are to be added in the Input Files section:
FASTQ files
Select the FASTQ files in the
Illumina DRAGEN Enrichment Demo Datafolder and select Add.
Reference:
Select a reference genome from the
Illumina Referencesfolder (do not select a methyl-converted reference genome for this tutorial)Ie: hg38_altaware_nohla-cnv-anchored.v8.tar (suggested, if enabling CNV analysis)
The DRAGEN Germline Settings to be selected are:
Enable germline small variant calling: Set to
trueEnable SV (structural variant) calling: Set to
trueIf true, Enable map align output must also be set to
true
Enable repeat genotyping: Set to
trueEnable map align: Set to
trueWhen using FASTQ files as input, as in this example, set this to
trueas default.When using BAM files as input, set to
trueto realign reads in input BAMs; set tofalseto keep alignments in input BAM files.
Enable CNV calling: Set to
trueEnabling Copy Number Variant calling requires one of the following:
Enable CNV self normalization is set to
trueA panel of normals (PON) is provided in the Input Files
Output format: Set to
CRAMOther available options for alignments output are BAM and SAM format.
Enable CNV self-normalization: Set to
trueRequired if Enable CNV calling is set to
trueand no panel of normals (PON) is provided in the Input Files.
Enable duplicate marking: Set to
trueEmit Ref Confidence: Set to
GVCFto enable banded gVCF generation for this exampleTo enable base pair resolution in the gVCF, set to
BP_RESOLUTION
Additional DRAGEN args: Leave Empty
Users can provide additional DRAGEN arguments here (see the DRAGEN user guide for examples), but we will leave this blank for this example run.
Sample sex: Leave blank
Users may specific the sex of the sample here if known, but the user will omit this setting for this example run.
Enable HLA: Set to
trueto enable HLA typingEnable map align output: Set to
trueThe format for alignment output was selected previously in the "Output format setting" above
Resources
Use the default resources settings:
Storage size: Set to
smallFPGA Medium Tier: Set to
StandardFPGA Medium Resources: Set to
FPGA Medium
Once all parameters have been set, click Start analysis
Monitoring analysis run status
You can monitor the status of analysis pipeline runs from the Flow > Analysis page in your project. See Analysis Lifecycle for more details.
Click the refresh button (
) in upper right corner of the ICA environment page to update the status.
Click on the run to view more information about it. The various tabs under a given run provide additonal context regarding the status of the completed run.
If you encounter a failed run, you can find more information in the Projects > your_project > Flow > Analyses > your_analysis > Details tab and on the execution report tab.
Analysis run logs can be found on the Steps tab. Use the sliders next to Stderr and Stdout for more details. Check the box next to "Show technical steps" to view additional log files.
Viewing DRAGEN analysis output
DRAGEN analysis output folders are found on the project's Data page, along with all other data loaded to the project (such as assets from a linked entitled bundle). Output analysis will be grouped into folders, so users can click through the folder structure to explore outputs.
Related Resources
DRAGEN Support Site: https://support.illumina.com/sequencing/sequencing_software/dragen-bio-it-platform.html
ICA Pricing: https://help.ica.illumina.com/reference/r-pricing
Last updated
Was this helpful?
