I have 5000-50,000 cells DNA, using nextflow I aligned them to genome then I use picard to get some stats. I have the metadata file go with it.
I have been plotting the summary stat distribution in R (violin) or python. It would be great if I can use multiQC to do it and generate a report.
if I just use multiqc directly, it will have 5,000 rows and not grouped by samples, so it is not very helpful.
Is there metadata groupby integration from multiqc?
Thanks!
There’s no obvious way to do this currently, though one way could be to run the Picard multiple times using path filters. Then at least you’d get separate plots for each group of samples:
Note that you can also use MultiQC within scripts, which may be a better approach:
Phil