Performant, interactive Plots.jl backend for Jupyter notebooks

jupyter
glvisualize

#1

I’ve been using Plots.jl with plotlyjs() for interactive plots and automated ‘report’ generation. I appreciate the work that’s gone into Plots.jl and PlotlyJS.jl - I use them every day!

Some of the individual plots have performance issues in the generated html files, so I play tricks like resampling my data (thanks to DSP.jl) and plotting at lower sample rates, etc. But even at lower rates, with multiple plots, the generated pages and the interactive javascript can become rather slow.

For the curious, one of my data signals is typically 8M points, and for plotting I will usually resample to around 200k points. With spectrograms and multiple time series on multiple plots, that’s enough to slow things down on my relatively powerful desktop machine.
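As a concrete sketch of that resampling trick (not the poster’s actual code), here is how one might cut an 8M-point signal down to roughly 200k points with DSP.jl’s `resample`; the signal contents and variable names are made up for illustration:

```julia
# Illustrative only: stand-in data, not the poster's real signal.
using DSP

x = randn(8_000_000)            # placeholder for the 8M-point signal

ratio = 200_000 // length(x)    # 1//40 - target roughly 200k points
y = resample(x, ratio)          # polyphase FIR resampling from DSP.jl

length(y)                       # ≈ 200_000
```

The resampled vector can then be handed to any Plots.jl backend, trading some bandwidth for much lighter pages.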

I know of GLVisualize.jl and know that it’s a work-in-progress backend for Plots.jl. Will GLVisualize.jl use WebGL, or something else, to render interactive plots in a more efficient way than relying on javascript to do all the heavy lifting?


#2

You should check out Makie - http://makie.juliaplots.org/stable/
It’s the future of GLVisualize, and I believe @sdanisch (the primary developer) intends for this to end up replacing Plots, providing the same functionality (backends, recipes, terse syntax) in pure Julia.
It’s particularly focused on interactivity but still in a relatively early stage of development (first release coming up soon).


#3

@standarddeviant: I had the exact same problem, and that’s why I developed InspectDR:

InspectDR: https://github.com/ma-laforge/InspectDR.jl
Gallery: https://github.com/ma-laforge/FileRepo/blob/master/InspectDR/sampleplots/README.md

  • 200k points is only considered a "moderate" dataset for InspectDR. With 200k points, you still get very good interactivity - almost no lag when you drag plot data around to pan the plot and alter its limits.
    • You might want to bypass the Plots.jl module if you really need speed for automated ‘report’ generation, though.
  • Plot times remain good even with much larger datasets (e.g. 2 GB).
    • Again: you should bypass Plots.jl (use InspectDR directly) if you want to minimize plot times.

Conditions

  • Performance is good even on a 7-year-old computer running a VM session.
  • InspectDR runs at optimum speeds when the x-axis data is sorted in increasing order.

Caveats

  • Does not support 3D plots.
  • Performance is better if Plots.jl is bypassed (InspectDR used directly).
  • Not directly interactive in Jupyter - but static plot images are supported (.png works best), and you can still launch the Gtk GUI for interactive mode.

I myself am looking forward to GLVisualize/Makie, but last time I checked, it was not yet able to do what I needed… But I am looking forward to the first release as well.


#4

Yes, I should have pointed out InspectDR as well.


#5

Thanks @mkborregaard - I will keep a watch on Makie.jl.

Thanks @MA_Laforge for writing InspectDR! I’ve used InspectDR as a Plots.jl backend, but will look at direct usage.

Is InspectDR doing something similar to the Python package “datashader”?


#6

@standarddeviant: Thanks for pointing me to datashader - I was unaware of this package.

To a certain extent, yes: InspectDR does something similar to what datashader is doing:

  1. It performs a reduction of the data for each zoom level, parsing the entire dataset each time.

    • It only “downsamples” the data for that particular zoom level… so the intent is to never drop data points (as long as we have a line plot - not a scatter plot).

    • Data reduction is simply used to minimize the commands sent to the (slower) drawing subsystem.
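For readers curious what this kind of reduction can look like, here is a minimal sketch of per-pixel-column min/max decimation - a common technique for line plots. This is my own illustration of the general idea, not InspectDR’s actual algorithm; all names are invented:

```julia
# Sketch of min/max decimation for line plots: each pixel column keeps the
# bucket's minimum and maximum sample, so extremes are never visually lost
# while the draw-command count drops to ~2 points per column.
# NOT InspectDR's implementation - an illustration of the technique only.
function minmax_decimate(x::AbstractVector, y::AbstractVector, ncols::Int)
    n = length(y)
    n <= 2ncols && return (collect(x), collect(y))   # nothing to reduce
    xr = Float64[]; yr = Float64[]
    for c in 1:ncols
        lo = div((c - 1) * n, ncols) + 1   # sample bucket for this column
        hi = div(c * n, ncols)
        seg  = view(y, lo:hi)
        imin = lo - 1 + argmin(seg)
        imax = lo - 1 + argmax(seg)
        # keep min and max in x-order so the drawn line spans the bucket
        for i in sort([imin, imax])
            push!(xr, x[i]); push!(yr, y[i])
        end
    end
    return (xr, yr)
end

x = collect(range(0, 2pi; length = 1_000_000))
y = sin.(100 .* x) .+ 0.01 .* randn(1_000_000)
xr, yr = minmax_decimate(x, y, 1000)   # 2000 points to draw instead of 1M
```

Because every bucket keeps its extreme values, the global minimum and maximum of the signal survive the reduction, which is why a line plot drawn from the decimated data looks the same at that zoom level.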

Differences:

  1. Data reduction in InspectDR is much less flexible than datashader - especially with scatter plots.

    • InspectDR does not do fancy/customizable normalization of pixel intensity, etc… because it is optimized for line plots, not scatter plots.
  2. It sounds like the datashader algorithm might even be faster than the one in InspectDR (time per # of points), but I have not verified this.

NOTE

  • There are probably other differences, but I don’t understand datashader well enough… and I have not had a chance to study the algorithm they use.
  • I might eventually want to make use of datashader from within InspectDR… but I probably should see if there are licensing issues with this before I start (I am not quite certain what its license entails).