Here is a toy example of a pipeline architecture with a somewhat counterintuitive behavior:
workflow A {
    take:
    ch

    main:
    ch | filter{it.containsKey("b")} | view

    emit:
    ch
}

workflow B {
    take:
    ch

    main:
    ch | map{it.b = true} | set{ch}
}

workflow {
    ch = channel.fromList([[:], [b:false]])
    ch | A | B
}
Output
[b:true]
[b:true]
In this example, an alteration that a later subworkflow makes to a channel item is visible in an earlier subworkflow. On reflection this makes some sense: channel items are object references, so B's map{it.b = true} mutates the very hashmap instances that A passes along, and because A doesn't alter the items itself, B fires immediately and its changes land before A's filter and view operators happen to run.
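The mechanism is ordinary Groovy reference semantics rather than anything Nextflow-specific. A minimal plain-Groovy sketch of the distinction between mutating a map in place and building a new one (variable names here are illustrative, not from the pipeline):

```groovy
def item = [b: false]

// In-place mutation, as in B's map{it.b = true}:
// the assignment changes the one shared map instance.
def mutated = { m -> m.b = true; m }(item)
assert item.b == true        // the original item was altered

// Copy-on-write alternative: `m + [b: true]` builds a new map,
// leaving the original untouched for anything else holding a reference.
def item2 = [b: false]
def copied = item2 + [b: true]
assert item2.b == false      // original unchanged
assert copied.b == true
```

This is why both items print as [b:true] in A's view even though A sits "before" B in the pipeline graph: the print happens after the shared maps have already been mutated.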
The context in which this occurs in practice is a QC-launching subworkflow that is called multiple times on the same channel. The channel items are hashmaps, each of which may carry several keys pointing at QCable stats files. Later subworkflows may add a key and a filename as setup for calling a process that will produce the stats file itself. But the earlier QC subworkflow (analogous to A, above) can see that the key and filename were added by the later subworkflow (analogous to B), so it tries to pass that not-yet-created stats file to its QC process call, which produces an error.
I have workarounds, but it seems like there ought to be a more elegant solution. Using the toy example above, what would be the right way to achieve the intended behavior, where A displays [b:false] by filtering out the empty hashmap and calling view before B runs?