Is there a way to change the sorting behaviour in multiqc

mpampuch · August 22, 2025, 12:12am

I recently ran nf-core/rnaseq on my RNA-seq data and was wondering whether there is a way to change how MultiQC sorts the samples.

My samples form a time series. They are sorted top-to-bottom and left-to-right alphanumerically, but I would like to sort them sensibly by time (i.e. 0h, 2h, 4h, etc. instead of 0h, 10h, 12h, etc.). Is there a way to achieve this, or to pass in a custom sort function for sample names?

Also, is this a problem at the MultiQC level or at the DESeq2 level? Which program would need to be modified to make this work?

ewels · August 22, 2025, 6:41am

Lazy answer: Honestly, the easiest fix is probably to rename your samples to have a leading 0 so that they sort alphabetically (e.g. 02h etc). This is especially true given that you’re running within the nf-core/rnaseq pipeline, which makes modification more awkward / undesirable.
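
As a rough illustration of the renaming idea, here is a minimal Python sketch that zero-pads the hour value in each sample name so that plain alphabetical sorting matches time order. The sample names and the `pad_time_label` helper are hypothetical, and it assumes the time point is written as digits followed directly by `h`; you would still need to apply the new names in your samplesheet yourself.

```python
import re


def pad_time_label(name: str, width: int = 2) -> str:
    """Zero-pad every run of digits that is directly followed by 'h'."""
    return re.sub(r"(\d+)(?=h)", lambda m: m.group(1).zfill(width), name)


# Hypothetical sample names, for illustration only.
samples = ["ctrl_0h", "ctrl_2h", "ctrl_10h", "ctrl_12h"]
renamed = [pad_time_label(s) for s in samples]

# Plain alphabetical sorting now follows time order.
print(sorted(renamed))
```

If the series runs past 99h, a larger `width` would be needed so that all labels have the same number of digits.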

Note that there is a button above the plot to switch between "Sorted by sample" and "Clustered". Assuming some biological similarity, I’d kind of hope that the clustered view might make more sense in your case.

That said, let’s try to look for some sorting logic...

I can’t see any sorting of samples within the pipeline. It seems to just cat all the files together, so the order will likely be random / POSIX within that file (you can check that source file to confirm):

github.com/nf-core/rnaseq, modules/local/deseq2_qc/main.nf (b971ba18d):

          if [ -f "R_sessionInfo.log" ]; then
              # Handle PCA files
              sed "s/deseq2_pca/${label_lower}_deseq2_pca/g" <$pca_header_multiqc > pca_header.tmp
              sed -i -e "s/DESeq2 PCA/${label_upper} DESeq2 PCA/g" pca_header.tmp
              cat pca_header.tmp *.pca.vals.txt > ${label_lower}.pca.vals_mqc.tsv
              rm pca_header.tmp
          
              # Handle clustering files
              sed "s/deseq2_clustering/${label_lower}_deseq2_clustering/g" <$clustering_header_multiqc > clustering_header.tmp
              sed -i -e "s/DESeq2 sample/${label_upper} DESeq2 sample/g" clustering_header.tmp
              cat clustering_header.tmp *.sample.dists.txt > ${label_lower}.sample.dists_mqc.tsv
              rm clustering_header.tmp
          fi
          
          cat <<-END_VERSIONS > versions.yml
          "${task.process}":
              r-base: \$(echo \$(R --version 2>&1) | sed 's/^.*R version //; s/ .*\$//')
              bioconductor-deseq2: \$(Rscript -e "library(DESeq2); cat(as.character(packageVersion('DESeq2')))")
          END_VERSIONS
          """

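To check the sample order in that concatenated file, something like the following sketch could be used. It is only an assumption about the file layout: lines starting with `#` are treated as the MultiQC header, the first remaining line as the column-header row, and the first column of every later row as the sample name. The `sample_order` helper is hypothetical.

```python
import csv


def sample_order(tsv_path):
    """Return the first-column values in file order.

    Assumes '#' lines are header comments and that the first
    non-comment line is a column-header row (both are assumptions).
    """
    names = []
    with open(tsv_path, newline="") as handle:
        rows = (line for line in handle if not line.startswith("#"))
        reader = csv.reader(rows, delimiter="\t")
        next(reader, None)  # skip the assumed column-header row
        for row in reader:
            if row:
                names.append(row[0])
    return names
```

Calling `sample_order(...)` on the `*.sample.dists_mqc.tsv` file in the process work directory should show whether the samples are already in alphabetical order before MultiQC reads them.
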
I also don’t think that we sort within the custom content module for heatmaps. I guess that the only sort we do is here in the heatmap code:

github.com/MultiQC/MultiQC, multiqc/plots/heatmap.py (997f52e7e):

          # Get unique row and column categories in the order of indices
          row_cats_df = df.select(["row_idx", "row_cat"]).unique().sort("row_idx")
          col_cats_df = df.select(["col_idx", "col_cat"]).unique().sort("col_idx")

So yeah, I don’t think that there is any way to customise this currently, sorry. Feel free to put in a GitHub issue requesting it as a new feature.

vlad.savelyev · August 22, 2025, 8:02am

We do natural sample name sorting (i.e. 1, 2, 10, 20 instead of 1, 10, 2, 20) for other plot types, so it would be very straightforward to do that for heatmap. Will do that update!
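
For anyone curious what natural sorting means in practice, here is a minimal Python sketch of the general technique. It is not MultiQC's actual implementation, and `natural_key` is a hypothetical helper: it splits each name into text and digit chunks and compares the digit chunks as integers.

```python
import re


def natural_key(name):
    """Build a sort key in which runs of digits compare numerically."""
    # re.split with a capture group alternates text, digits, text, ...
    # so odd positions always hold the digit runs.
    parts = re.split(r"(\d+)", name)
    return [int(p) if i % 2 else p.lower() for i, p in enumerate(parts)]


samples = ["10h", "0h", "2h", "12h", "4h"]

print(sorted(samples))                   # lexicographic order
print(sorted(samples, key=natural_key))  # natural order
```

With these names, the plain sort gives 0h, 10h, 12h, 2h, 4h, while the natural key gives 0h, 2h, 4h, 10h, 12h.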
