Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Table of Contents

Installation

  • requires fakeroot feature of Singularity to be working

    • must build the image on a compute node

Getting the source

  • Goto https://developer.nvidia.com/clara-parabricks and choose appropriate Licensing method:

    • I chose “standard evaluation license” for testing

    • Fill in the requested details and the link to tarball containing the installation script and license will be emailed by NVIDIA

  • It appears that Parabricks is a container image, which when installed using the provided installer.py in the tarball, creates a Singularity sandbox as a installation directory and builds a statically built executable(s) pbrun.

Intall command

Code Block
salloc --gres=gpu:1 --time=01:00:00 

...

Code Block
srun --pty ./installer.py --container singularity --ngc --install-location /path/to/installation

Testing

I am using the testing recipe from https://ngc.nvidia.com/catalog/containers/hpc:parabricks

Getting data

Code Block
$ wget https://s3.amazonaws.com/parabricks.sample/parabricks_sample.tar.gz
$ tar xf parabricks_sample.tar.gz
$ ls parabricks_sample
Data Ref

Typical Jobscript

Code Block
#!/bin/bash

#SBATCH --job-name=parabricks-test
#SBATCH --time=02:00:00
#SBATCH --gres=gpu:1
#SBATCH --constraint=gpu_ai
#SBATCH --ntasks=1
#SBATCH --cpus-per-task=4
#SBATCH --mem=20G


module load parabricks/3.0.0.2

export DATA_DIR=/ibex/scratch/shaima0d/scratch/parabricks-testing

pbrun fq2bam \
       --ref ${DATA_DIR}/parabricks_sample/Ref/Homo_sapiens_assembly38.fasta \
       --in-fq ${DATA_DIR}/parabricks_sample/Data/sample_1.fq.gz parabricks_sample/Data/sample_2.fq.gz \
       --out-bam output.bam

Output

Code Block
Loading module for Singularity
Singularity 3.5 modules now loaded
Loading module for parabricks-3.0.0.2
parabricks-3.0.0.2 modules now loaded

[Parabricks Options Mesg]: Checking argument compatibility
[Parabricks Options Mesg]: Automatically generating ID prefix
[Parabricks Options Mesg]: Read group created for /ibex/scratch/shaima0d/scratch/parabricks-
testing/parabricks_sample/Data/sample_1.fq.gz and /ibex/scratch/shaima0d/scratch/parabricks-
testing/parabricks_sample/Data/sample_2.fq.gz
[Parabricks Options Mesg]: @RG\tID:HK3TJBCX2.1\tLB:lib1\tPL:bar\tSM:sample\tPU:HK3TJBCX2.1
------------------------------------------------------------------------------
||                 Parabricks accelerated Genomics Pipeline                 ||
||                             Version v3.0.0.2                             ||
||                       GPU-BWA mem, Sorting Phase-I                       ||
||                  Contact: Parabricks-Support@nvidia.com                  ||
------------------------------------------------------------------------------
[M::bwa_idx_load_from_disk] read 0 ALT contigs

GPU-BWA mem
ProgressMeter	Reads		Base Pairs Aligned
[16:21:48]	5043564		580000000
[16:22:36]	10087128	1160000000
[16:23:25]	15130692	1740000000
[16:24:13]	20174256	2320000000
[16:25:01]	25217820	2900000000
[16:25:49]	30261384	3480000000
[16:26:38]	35304948	4060000000
[16:27:26]	40348512	4640000000
[16:28:16]	45392076	5220000000
[16:29:05]	50435640	5800000000

GPU-BWA Mem time: 525.787852 seconds
GPU-BWA Mem is finished.

GPU Sorting, Marking Dups, BQSR
ProgressMeter	SAM Entries Completed

Total GPU-BWA Mem + Sorting + MarkingDups + BQSR Generation + BAM writing
Processing time: 570.052968 seconds

[main] CMD: PARABRICKS mem -Z ./pbOpts.txt /ibex/scratch/shaima0d/scratch/parabricks-testing/parabricks_sample/Ref/Homo_sapiens_assembly38.fasta /ibex/scratch/shaima0d/scratch/parabricks-testing/parabricks_sample/Data/sample_1.fq.gz /ibex/scratch/shaima0d/scratch/parabricks-testing/parabricks_sample/Data/sample_2.fq.gz @RG\tID:HK3TJBCX2.1\tLB:lib1\tPL:bar\tSM:sample\tPU:HK3TJBCX2.1
[main] Real time: 576.339 sec; CPU: 2121.508 sec
------------------------------------------------------------------------------
||        Program:                      GPU-BWA mem, Sorting Phase-I        ||
||        Version:                                          v3.0.0.2        ||
||        Start Time:                       Mon Jul 27 16:20:45 2020        ||
||        End Time:                         Mon Jul 27 16:30:21 2020        ||
||        Total Time:                           9 minutes 36 seconds        ||
------------------------------------------------------------------------------
------------------------------------------------------------------------------
||                 Parabricks accelerated Genomics Pipeline                 ||
||                             Version v3.0.0.2                             ||
||                             Sorting Phase-II                             ||
||                  Contact: Parabricks-Support@nvidia.com                  ||
------------------------------------------------------------------------------
progressMeter - Percentage
[16:30:24]	0.0	 0.00 GB
[16:30:34]	47.6	 0.27 GB
[16:30:44]	94.0	 0.11 GB
Sorting and Marking: 30.002 seconds
------------------------------------------------------------------------------
||        Program:                                  Sorting Phase-II        ||
||        Version:                                          v3.0.0.2        ||
||        Start Time:                       Mon Jul 27 16:30:24 2020        ||
||        End Time:                         Mon Jul 27 16:30:54 2020        ||
||        Total Time:                                     30 seconds        ||
------------------------------------------------------------------------------
------------------------------------------------------------------------------
||                 Parabricks accelerated Genomics Pipeline                 ||
||                             Version v3.0.0.2                             ||
||                         Marking Duplicates, BQSR                         ||
||                  Contact: Parabricks-Support@nvidia.com                  ||
------------------------------------------------------------------------------
progressMeter -	Percentage
[16:31:05]	9.0	 5.95 GB
[16:31:15]	22.3	 13.18 GB
[16:31:25]	38.2	 11.80 GB
[16:31:35]	52.4	 9.11 GB
[16:31:45]	64.0	 6.49 GB
[16:31:55]	76.7	 3.97 GB
[16:32:05]	87.1	 1.24 GB
[16:32:15]	100.0	 0.00 GB
BQSR and writing final BAM:  80.932 seconds
------------------------------------------------------------------------------
||        Program:                          Marking Duplicates, BQSR        ||
||        Version:                                          v3.0.0.2        ||
||        Start Time:                       Mon Jul 27 16:30:55 2020        ||
||        End Time:                         Mon Jul 27 16:32:16 2020        ||
||        Total Time:                            1 minute 21 seconds        ||
------------------------------------------------------------------------------

Benchmarks

Child pages (Children Display)
alltrue

...