We have a Process with a large amount of variation in the input file sizes, so we have retry with increasing memory:
process {
withLabel:IndexDedup {
queue='regular'
cpus = 5
memory={ 10.GB * task.attempt }
time='8h'
errorStrategy ={ 'retry' }
maxRetries= 5
}
}
When I resume, the result from the input that worked with 10 GB is successfully retrieved from cache, but the result for a large input file that had 2 failures before success, with memory = { 10.GB * 3 } = 30.GB, is not retrieved from cache.
Increasing the initial memory so that the process works first time fixed the issue. However I don’t particularly want to allocate 30 GB for all inputs, and I want to resume from the success.
I see on Caching and resuming — Nextflow documentation that Task attempt
is part of the cache hash. Do I need a way of setting it to be the final, successful task attempt?
Thanks,
Jocelyn