S3 work folder

Hi, when trying to use an S3 folder as a work dir when launching a pipeline from Seqera, I get this error:

Pipeline work directory must start with slash character - offending value: s3://scratch/work 

I tried defining workDir in the nextflow.config, but it is still using the one defined in the launch form (where I had to put a non-S3 one).
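
For reference, this is roughly what I have in nextflow.config (the bucket and path are the ones from the error above):

```groovy
// nextflow.config -- the value I'm trying to use as the work directory
workDir = 's3://scratch/work'
```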

Hi Miguel

I’m not sure I totally understand your situation. Some quick questions:

  1. These are runs launched from Seqera Platform, correct?
  2. What is the work directory defined in the Compute Environment (CE) in which you are executing the run?
  3. What was the value you were trying to set the work directory to?
  4. It sounds like you’d like to override that work directory for a particular pipeline or a particular run. Is that correct?

In general, if you want to override the CE’s work directory for a particular run, it would be sensible to use the “Work directory” text field as shown below. Is that an option for your situation?

Hi robsyme,

  1. Yes.
  2. It is a general work folder on the NetApp.
  3. The value is something similar to your screenshot: s3://scratch/work/
  4. Yes
    That’s exactly my situation. When I try to override it with an S3 folder, I get the error “Pipeline work directory must start with slash character - offending value: s3://scratch/work” and can’t submit the run. I also tried setting the work folder not in the “Run parameters” tab but in the nextflow.config under Advanced settings, but that doesn’t work either.

Thanks for the info, Miguel.

Are you launching the run using a compute environment configured with an HPC executor (SLURM/PBS/SGE, etc)?

Using an S3 path as the work directory only makes sense when using the AWS Batch executor. It would be more efficient to use a networked filesystem on your HPC rather than reading from and writing to a (potentially) remote object store like S3.
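
As a rough sketch (the bucket names and paths below are placeholders, not values from your setup), the two configurations would look something like this:

```groovy
// With the AWS Batch executor, an S3 work directory is the expected setup:
// process.executor = 'awsbatch'
// workDir = 's3://my-bucket/work'     // placeholder bucket

// With an HPC executor such as SLURM, workDir should point at a path that
// is visible to both the head node and the compute nodes:
process.executor = 'slurm'
workDir = '/shared/scratch/work'       // placeholder shared-filesystem path
```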

Hi robsyme,

  • Yes, we are using SLURM.
  • We are using MinIO configured as a local S3. Since it is local, the performance should be enough, right? (Rough sketch of our setup below.)
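
For context, our nextflow.config points Nextflow at MinIO roughly like this (the endpoint URL below is a placeholder, not our real address):

```groovy
// Rough sketch of our MinIO setup -- the endpoint URL is a placeholder
aws {
    client {
        endpoint = 'http://minio.local:9000'   // local MinIO endpoint (placeholder)
        s3PathStyleAccess = true               // MinIO typically needs path-style access
    }
}
```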

Just to clarify, I’m using SLURM with the Tower Agent.

Hi @robsyme. Any update on this?

Apologies, Miguel.

When using a non-awsbatch executor, the work directory must be a path on a shared filesystem rather than an S3 bucket. It is difficult for Platform to verify that both the head and compute nodes have the required permissions to read from and write to that remote location, and using one is likely to cause errors, even if the performance is sufficient.

Is there a shared filesystem that you can use on this cluster?

I have also created a feature request on your behalf at: Demote message about workdir prefix from error to warning when using s3 buckets as workdir on SLURM | Voters | Seqera Feedback Forum

You can upvote and/or follow this feature for updates.

Thanks, Rob! Already upvoted :slight_smile:
Yes, we have a shared filesystem for the work folder (which we are using as the default one), but our input data is on a local S3 (MinIO). Some runs involve a huge amount of data, so we were interested in using Fusion to avoid the staging step and speed up pipeline execution in general.
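
Something along these lines is what we had in mind (an untested sketch on our side, assuming Fusion can be enabled for a SLURM compute environment):

```groovy
// Untested sketch of the Fusion setup we were hoping for.
// Fusion requires Wave to be enabled as well.
fusion {
    enabled = true
}
wave {
    enabled = true
}
workDir = 's3://scratch/work'   // the MinIO bucket holding our data
```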

I’ve never used S3 or the Tower environment, but did you try the obvious? Did you try converting the relative path you are trying to use to an absolute one (one that starts with a slash)?

I am not using relative paths, @Poshi. These are S3 storage paths.

Looking at the first message you posted, I saw:

Pipeline work directory must start with slash character - offending value: s3://scratch/work

That is a relative path. Or at least, it is a relative path according to the standard. Maybe S3 works differently and doesn’t follow the standard, but I don’t think so.

Please check how an S3 path looks, e.g. File paths in Amazon S3 - Media2Cloud on AWS

Oh! OK, I assumed that “scratch” was part of the path, not the bucket name. Then everything seems fine with the naming. Forget my comment :frowning:
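
i.e. the URI breaks down as:

```
s3://scratch/work
     │       └── "work"    = key prefix (the "folder" inside the bucket)
     └────────── "scratch" = bucket name
```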