Apologies if these are beginner questions for Nextflow. I am trying to configure Nextflow (Cutandrun in particular) to run on my local cluster, but unfortunately Java is not available as a module, so I cannot use Nextflow directly on it. They do offer Singularity, so I’m trying to start out with Docker and figure out how to get Nextflow to work entirely from containers. So far, I have been able to fetch both the nextflow/nextflow and the nfcore/tools containers from dockerhub and run them successfully, but I am unable to successfully run a pipeline. For example I start the Cutanrun pipeline using this command (on a PC, WSL2 Ubuntu):
docker run -itv “/home/jcreamer”:“/home/jcreamer/” nextflow/nextflow:latest nextflow run nf-core/cutandrun -r 3.2.2 -profile docker --input /home/jcreamer/test.csv --genome GRCh38 --outdir /home/jcreamer/test_1
Nextflow initiates, and starts the pipeline, but it fails after this error:
ERROR ~ Error executing process > ‘NFCORE_CUTANDRUN:CUTANDRUN:INPUT_CHECK:SAMPLESHEET_CHECK (test.csv)’
Caused by:
Process NFCORE_CUTANDRUN:CUTANDRUN:INPUT_CHECK:SAMPLESHEET_CHECK (test.csv) terminated with an error exit status (125)
Command executed:
check_samplesheet.py test.csv samplesheet.valid.csv true
cat <<-END_VERSIONS > versions.yml
“NFCORE_CUTANDRUN:CUTANDRUN:INPUT_CHECK:SAMPLESHEET_CHECK”:
python: $(python --version | grep -E -o “([0-9]{1,}.)+[0-9]{1,}”)
END_VERSIONS
Command exit status:
125
Command output:
(empty)
Command error:
docker: Cannot connect to the Docker daemon at unix:///var/run/docker.sock. Is the docker daemon running?.
See ‘docker run --help’.
Work dir:
/work/84/1730c9b57b7a2ba7bff69fb42b75e3
Tip: you can replicate the issue by changing to the process work dir and entering the command bash .command.run
– Check ‘.nextflow.log’ file for detail
It seems like the initial container cannot call docker further, how do I direct the container to use the existing docker installation on the machine? Does Singularity fix these issues? There’s no clear cut instructions for how to accomplish a pure containerized Nexftlow from what I’ve found. Also, using the “conda” profile does not fix it, instead I get an issue that says “conda” cannot be found. Is there a container source that has ALL the dependencies required for nextflow/nf-core that runs out of the gate? How would I go about constructing this if nextflow running inside the container requires further container calls to complete a pipeline?