X axis ordered by date?

rolivella · March 26, 2024, 4:01pm

Hello! I’m testing MultiQC for proteomics data and I’m wondering if there’s a way to order the X axis by date (for instance, dd/MM/YY hh:mm) from older to newer. So what we want is to see how the QC parameters changes across time (a classical time series).
Thanks!
Roger

ewels · March 27, 2024, 3:23pm

Hi @rolivella!

Could you give a little more context please? X axis of what? A screengrab or something would help.

Thanks!

Phil

rolivella · March 27, 2024, 3:38pm

Hi @ewels !

Yes, I’m would like to ask if we can do plots like this:

The X axis is time, for instance “2024-03-23 11:40:36” for a particular data point. The Y axis is a measure if the area of some QC peptides (EYE, YIC, etc.). We have similar QC plots for other parameters but always with the same time X axis.

Is it more clear now?

Thanks!

Roger

ewels · March 27, 2024, 3:56pm

Understood, thanks I don’t think that there are any existing plots that have a datetime axis like this, so I think it’s unlikely that it’s supported at present. Best bet is to put in a GitHub issue requesting it. The underlying plotting library definitely supports it, so hopefully it shouldn’t be a big task.

Short term, two workarounds spring to mind - if all the points are shared and the data points are evenly distributed, you could treat the x axis as categorical data and it should sort of work. Alternatively, you could convert the dates / times into a unix timestamp so that they behave as a simple integer and can be treated as a numerical axis. But then the labels will be nonsensical. So neither approach is perfect.

Tagging @vlad.savelyev for visbility.

Phil

rolivella · March 28, 2024, 8:27am

OK, thank you very much! It’s not urgent so I can put it in a GitHub issue requesting it.

Roger

vlad.savelyev · March 28, 2024, 11:25am

Values like “2024-03-23 11:40:36” should work well as X-axis values, and get the right sorting. Plotly will even recognize it as a timestamp and render nicely:

lineplot_data = {
    "sample1": {
        str(datetime.datetime.today()): 1,
        str(datetime.datetime.today() + datetime.timedelta(days=1)): 2,
        str(datetime.datetime.today() + datetime.timedelta(days=2)): 3,
        str(datetime.datetime.today() + datetime.timedelta(days=3)): 5,
        str(datetime.datetime.today() + datetime.timedelta(days=4)): 2,
        str(datetime.datetime.today() + datetime.timedelta(days=5)): 3,
        str(datetime.datetime.today() + datetime.timedelta(days=6)): 4,
        str(datetime.datetime.today() + datetime.timedelta(days=7)): 5,
    },
}

self.add_section(
    name="Line Plot",
    anchor="test-line-plot",
    description="Line plot test.",
    plot=linegraph.plot(lineplot_data, pconfig={"id": "test_line_plot"}),
)

vlad.savelyev · March 28, 2024, 11:33am

And the X axis orderred automatically as well by date.

lineplot_data = {
    "sample1": {
        str(datetime.datetime(2024, 3, 10)): 2,
        str(datetime.datetime(2024, 3, 7)): 10,
        str(datetime.datetime(2024, 3, 5, 1, 30)): 1,
        str(datetime.datetime(2024, 3, 6, 18, 10)): 4,
        str(datetime.datetime(2024, 3, 8, 12, 2)): 2,
    },
}

So not sure if there is an issue here. Let me know if that’s not working for you or if I misunderstood the question, @rolivella.

rolivella · March 28, 2024, 11:44am

Thanks @vlad.savelyev ! Looks promising. As I’m working with non-standard multiqc data (proteomics data) I’m using a customized JSON like this test:

{
  "id": "custom_data_lineplot_2",
  "section_name": "Mass accuracy",
  "description": "This is the mass accuracy.",
  "plot_type": "linegraph",
  "pconfig": {
    "id": "custom_data_linegraph2",
    "title": "Mass accuracy",
    "ylab": "Mass accuracy (ppm)",
    "xDecimals": false,
    "ymax": "2",
    "ymin": "-2"

  },
  "data": {
    "LVN": { "24/03/2024": -0.20262053803095767,"25/03/2024": -0.50262053803095767},
    "HLV": { "24/03/2024": -0.30262053803095767,"25/03/2024": -0.10262053803095767}
  }
  }

How should I apply those Plotly timestamps to this JSON? Thanks!

vlad.savelyev · March 28, 2024, 11:51am

Your JSON works for me:

multiqc test_mqc.json

What are you getting? How are you running MultiQC?

rolivella · March 28, 2024, 11:59am

Yes yes, is working, I meant now I’m using this string “24/03/2024”. Should I put it in another format in order Plotly to get it as timestamp? Now is a “string”, I’m wondering if I have put it as a “timestamp”, do you know what I mean?

ewels · March 28, 2024, 12:23pm

Going on @vlad.savelyev’s suggestion above:

Python 3.12.1 | packaged by conda-forge | (main, Dec 23 2023, 08:01:35) [Clang 16.0.6 ] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>> import datetime
>>> str(datetime.datetime.today())
'2024-03-28 13:22:17.508050'

So 24/03/2024 → 2024-03-24 00:00:00.000000

I suspect that Plotly will be relatively forgiving and that you can cut some / all of the timestamp off the end. But maybe start with this and see if it works first.

Phil

vlad.savelyev · March 28, 2024, 2:55pm

Right on, in order for Plotly to recognize a string as a date, it needs to be in ISO format:

  "data": {
    "LVN": { "2024-03-24": -0.20262053803095767,"2024-03-25": -0.50262053803095767},
    "HLV": { "2024-03-24": -0.30262053803095767,"2024-03-25": -0.10262053803095767}
  }

Your original “24/03/2024” format sort of works as Plotly interprets it as a plain string category, but since it would sort it alphanumerically - date first - month second - this is not ideal. So if you change the format to “2024-03-24”, it should work as expected.

rolivella · March 28, 2024, 4:16pm

Yes, the format did the trick. By putting dates in this format “2024-03-24 11:00” the X axis worked as expected.

Thank you both for the help!

ewels · March 28, 2024, 4:24pm

Great! Thanks for letting us know!

rolivella · March 28, 2025, 3:05pm

Hi again

I’m working on the nf-core/ribomsqc pipeline and need to implement the feature we discussed last year. Since the pipeline will process one file at a time as the instrument generates data, I was wondering if this setup would work:

File 1:

{
  "id": "custom_data_lineplot_2",
  "section_name": "Mass accuracy",
  "description": "This is the mass accuracy.",
  "plot_type": "linegraph",
  "pconfig": {
    "id": "custom_data_linegraph2",
    "title": "Mass accuracy",
    "ylab": "Mass accuracy (ppm)",
    "ymax": "2",
    "ymin": "-2"
  },
  "data": {
    "LVN": { "2024-03-24": -0.20262053803095767 },
    "HLV": { "2024-03-24": -0.30262053803095767 }
  }
}

File 2:

{
  "id": "custom_data_lineplot_2",
  "section_name": "Mass accuracy",
  "description": "This is the mass accuracy.",
  "plot_type": "linegraph",
  "pconfig": {
    "id": "custom_data_linegraph2",
    "title": "Mass accuracy",
    "ylab": "Mass accuracy (ppm)",
    "ymax": "2",
    "ymin": "-2"
  },
  "data": {
    "LVN": { "2024-03-25": -0.50262053803095767 },
    "HLV": { "2024-03-25": -0.10262053803095767 }
  }
}

Can MultiQC merge these and show both time points on the x-axis?

I tried placing both files in a folder and running MultiQC, but only the one with the earlier date (2024-03-24) shows up in the report.

Thanks!

ewels · March 28, 2025, 7:28pm

Hi @rolivella,

I’m not sure that this is possible, at least with Custom Content. You can progressively add more samples with more files, but merging data points for the same samples across multiple files isn’t possible, I think.

What might work is writing a script that imports MultiQC, this gives a lot more flexibility in what is possible. With this method you could build the data for the plot however you want, including parsing whatever series of data structures you want, before passing back to MultiQC to generate a plot and report.

Phil

rolivella · March 29, 2025, 8:53am

Thanks, @ewels, for replying so quickly!

Got it — I’ll take a look at what you mentioned about importing the MultiQC library within a script. That said, for this initial version of the pipeline, I’d really like to wrap things up as soon as possible, since we have a major mass spectrometry conference in the US during the first week of June. I’d like to have the first version of the pipeline functional and published in the official repo by then, if possible.

So for now, I’d prefer to focus on presenting the results with MultiQC as-is, without developing additional scripts. I was thinking about reordering the samples in a different way, as shown in this (admittedly rough!) plot. But I think the idea comes across. P1 and P2 are two different parameters, and S1, S2, S3 are the samples being processed, each generating an output file that MultiQC picks up.

I believe this structure fits better within MultiQC’s framework, and I wouldn’t need to develop another script, right?

Thanks again!

ewels · March 31, 2025, 1:25pm

A little tough to see but yes, if different plots for each data input then I think it should be fine

ewels · April 7, 2025, 6:00am

This topic was automatically closed after 6 days. New replies are no longer allowed.

Topic		Replies	Views
Sample names order in tables and plots Ask for help multiqc	7	203	July 30, 2024
Multiqc not sorting descendingly with custom_plot_config Ask for help multiqc	5	29	May 26, 2025
Custom content Ask for help multiqc	6	165	June 24, 2024
Custom content table how to disable default sorting Ask for help multiqc	4	74	June 19, 2024
Adding legend to somalier PCA plots Ask for help multiqc	3	150	April 22, 2024

X axis ordered by date?

Related topics