Is the parallelization working properly?

I have a simple pipeline that just concatenates FASTQ files, runs FastQC, and collects the results for MultiQC:

workflow CHAMPLAIN {

    take:
    ch_samplesheet // channel: samplesheet read in from --input

    main:

    ch_versions = Channel.empty()
    ch_multiqc_files = Channel.empty()

    ch_samplesheet
        .map { sample_ID, fastqList ->
            // split the comma-separated FASTQ list into individual files
            def files = fastqList[0].split(',')
            [sample_ID, tuple(files)]
        }
        .set { ch_samples }

    // one concatenation task per sample
    CAT_FASTQ (ch_samples)

    // one FastQC task per sample
    FASTQC (CAT_FASTQ.out.reads)

    ch_multiqc_files = ch_multiqc_files.mix(FASTQC.out.zip.collect{ it[1] })
    ch_versions = ch_versions.mix(FASTQC.out.versions.first())
    // etc.
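To be clear about what I expect: the map step emits one tuple per sample, so CAT_FASTQ and FASTQC should each get one task per sample, and those tasks should be able to run concurrently. Here is the transform in isolation, as a standalone script with made-up sample IDs and file names:

    // standalone sketch of the samplesheet transform (inputs are made up)
    workflow {
        Channel
            .of(['sample1', ['run1_R1.fastq.gz,run2_R1.fastq.gz']],
                ['sample2', ['run3_R1.fastq.gz']])
            .map { sample_ID, fastqList ->
                [sample_ID, tuple(fastqList[0].split(','))]
            }
            .view() // prints one [sample_ID, files] tuple per sample
    }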

But I don’t think it’s parallelizing correctly. Attached is the file from pipeline_info (the run is still going, so I’m attaching the .txt file, and some tasks were cached), but the point is that the execution seems to be sequential:
execution_trace_2024-08-08_10-01-07.txt (9.9 KB)

It’s two steps, concatenation and then FASTQC, and everything seems to be running sequentially.
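In the trace, each task’s start timestamp seems to come after the previous task’s complete timestamp, which is why I think nothing is overlapping. For reference, those columns come from Nextflow’s standard trace options, with something like this in nextflow.config:

    // nextflow.config -- trace columns used to check for task overlap
    trace {
        enabled = true
        fields  = 'task_id,name,status,submit,start,complete,%cpu,peak_rss'
    }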

The SLURM parameters I have are:

#!/bin/bash
#SBATCH --partition=short
#SBATCH --nodes=1
#SBATCH --cpus-per-task=32
#SBATCH --mem=64G
#SBATCH --time=3:00:00
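Inside that job I just launch Nextflow directly, roughly like this (the script name here is just a placeholder):

    # launched from inside the sbatch script above
    nextflow run main.nf --input samplesheet.csv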

Any suggestions?

Well, what are your resource allocations for these processes? If each task requests anywhere close to 32 GB of memory or 16 CPUs (the Nextflow runner process will also use part of your total allocation), only one task will be able to run at a time inside that single 64 GB / 32-CPU job. Why don’t you use the slurm executor instead?
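If you switch, something along these lines in nextflow.config would make Nextflow submit each task as its own SLURM job instead of running them all inside one allocation (a sketch; the queue name is taken from your sbatch script, and the queueSize value is illustrative):

    // nextflow.config -- sketch of a slurm executor setup
    process {
        executor = 'slurm'
        queue    = 'short'   // your partition
    }
    executor {
        queueSize = 50       // max jobs queued/running at once
    }

Then run the nextflow command itself from a login node or a small long-running job; it only needs a couple of CPUs to orchestrate the tasks.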

Thank you!! And this was the issue, very basic indeed. I had no idea that the slurm executor existed, so I will use that.
