[pre-ANN] DataViewer.jl: explore data files with the power of Makie

ffevotte · October 25, 2023, 6:11pm

Hello,

I’ve recently been working on DataViewer.jl, a GUI that helps explore and visualize the structure of data contained in datafiles (such as HDF5, JLD2 or JSON files).

This tool focuses on understanding how data files are structured, and therefore provides by default a tree-like view of the data. When a “leaf data” is of a supported type^[1], basic Makie-based visualizations are provided, to further understand what the data file contains.

DataViewer.jl can be used as a normal Julia package, whose main API is the DataViewer.view function:

julia> using DataViewer
julia> using JLD2
julia> DataViewer.view("my_file.jld2")

It can also be “installed” in the system, i.e. the DataViewer.install function creates a launcher script^[2] that can be called from the command line. (This is what is demoed in the screencast above)

I arguably lacked inspiration for the package name: although it is free in the General registry, a cursory look on github revealed at least 3 other Julia projects by that name. So please feel free to bikeshed on the name (or anything else!)

Implementationwise, this is based on JSServe.jl and WGLMakie, everything being rendered in an Electron.jl window. All my gratitude goes to @sdanisch and @jules who helped me a lot!

Currently: a 1D 2D or 3D array of numbers, or a dictionary of numbers ↩︎
By default, this also compiles a system image so that everything feels more snappy ↩︎

Jake · October 25, 2023, 6:40pm

That demo looks really nice. Without having yet tried it, does it also do:

Mark the curser position?
Better yet, mark the closest point to the curser position?
Allow me to transform the abscissa? i.e. k*(n-1) where k is a constant and n is the sample number.
Read arrow files? And maybe even Matlab files? (The new Matlab format is a version of HDF5)
Save (or copy) the output graph?

Having something that competes with Matlab’s interactive graphing/data exploration facilities is really exciting.

rafael.guerra · October 25, 2023, 8:23pm

Thanks and congratulations on a great job.

One question and one suggestion:

How to increase the size of the graphic area (it doesn’t increase when we enlarge the Electron window)?
Perhaps: DatafileViewer.jl ?

jar1 · October 25, 2023, 8:35pm

Would it be possible to put the gui in a vscode tab?

fdekerme · October 25, 2023, 9:25pm

Nice work !
3d slice visualization could be very useful for medical images in DICOM format.

gvdr · October 26, 2023, 5:03am

Yeah, the 3D slice would be a very cool addition also to GeoStats viz functions. Is that implemented somewhere like in Makie? Gonna search for it now.

ffevotte · October 26, 2023, 6:26am

These aren’t implemented yet, but should be doable and in the spirit of the package

Currently, DataViewer is more meant as a tool to quickly understand/check what’s in a data file. I.e answer questions like “did I correctly put all results in my output file?” or “does this user-provided input file contain the data I need, with the right structure?”.
It’s not really meant as a plotting tool, and I feel like adding UI to transform axes would go too far in that direction (and where do we stop? do we also want some UI to label axes? or add a title?)
Saving the output graph could be a nice, generic feature, though.

AFAIU, Arrow files store flat tabular/columnar data à la DataFrames? For now, DataViewer has rather been designed to work with tree-like data structures (like Dicts of Dicts of Arrays of Dicts), but I guess we could also support DataFrame-like data structures by looking at them either as a Dictionary of columns, or an Array of rows. Once this is done, reading arrow files would be very natural.

ffevotte · October 26, 2023, 6:32am

Thanks!

Good question, I’d like to know that as well! AFAIU, this is a limitation of WGLMakie (or maybe it’s simply that I didn’t manage to resize the WGLMakie figure?)

github.com/MakieOrg/Makie.jl

WGLMakie Resize Feature

opened 11:49PM - 16 Sep 21 UTC

closed 04:29PM - 23 Aug 24 UTC

bradcarman

enhancement WGLMakie

I was wondering if it would be possible to auto-resize a WGLMakie figure? For e…xample, when the figure is displayed in a VSCode Plot Pane window, would it be possible to: 1. Fill the plot pane window based on the window initial size 2. Auto-resize the figure based on a change to the plot pane size? Basically, I'm looking for similar functionality as the PlotlyJS.jl package, which fills and resizes based on the Blink window or the VSCode Plot Pane size.

Yes, good idea! I like that it conveys the idea of working with datafiles rather than raw data. Maybe the command-line tool could be called dfv then?

ffevotte · October 26, 2023, 6:48am

The short answer is that it’s a perfectly sensible thing to do, but I didn’t manage to do it.

I originally thought it would be as simple as asking the JSServe app to display in VSCode, but this only partly works: all features implemented with Makie Observables work fine in this context, but navigation in the data fields doesn’t work. It might be because this is coded like a plain old web app: in the Electron window clicking on a link asks the JSServe server for the document associated to a new URL; in a VSCode pane, clicking on a link doesn’t seem to do anything…

If someone knowledgeable wants to chime in about this, I’d really appreciate it!

I’m not at all familiar with the DICOM format, but a cursory look at DICOM.jl seems to indicate that all keywords are there: tree-like data structure, Dict-like access…
It might be really easy to at least try adding support to DICOM in DataViewer.

Not sure I understand: are you asking whether 3D slices are implemented in Makie? If so, there’s for example this snippet in the WGLMakie documentation, which shows how to do it:

https://docs.makie.org/dev/explanations/backends/wglmakie/#record_a_statemap

fdekerme · October 26, 2023, 7:56pm

A DICOM file consists of two parts: a header and data. The header contains information on the patient, the equipment used, image parameters and other metadata. To visualize metadata, you can see (self-promotion here ) my package DICOMTree.

Data can take many forms, but the most common (a DICOM file from a CT scanner) is a 2D matrix of grayscale pixels, corresponding to a slice of the 3D image. To make up the complete image, there are several separate DICOM files, each containing a slice. If the image has 256 slices, there will be 256 DICOM files. Notice that this may vary for some DICOM files, in radiotherapy for instance. For example, the RT Dose (patient irradiation dose map) contains the 3D matrix directly in a single file. The RT Structure, on the other hand, contains points (coordinates in physical space) that describe the contours of certain organs/regions of interest. But the most common is the CT scanner.

Hope it was helpful

LaurentPlagne · October 27, 2023, 9:00am

@davidanthoff and @pfitzseb may have an opinion on this.

pfitzseb · October 27, 2023, 9:31am

That’s expected behaviour and unlikely to change.

sdanisch · October 27, 2023, 2:15pm

Really cool package!

I think it should be possible to make links, that jus updates the APP…
But I will need to think about how to actually structure that.

You might be happy to see SpecApi by SimonDanisch · Pull Request #3281 · MakieOrg/Makie.jl · GitHub, which fixed quite a few memory leaks in WGLMakie, together with JSServe#sd/fixes!

jar1 · October 27, 2023, 5:30pm

Would you mind elaborating on this?

Seeing the video above I imagined VSCode Julia could someday support a user-customizable interactive gui system which would be very powerful, something like https://gtoolkit.com/. What stands in the way of this?

ffevotte · October 27, 2023, 8:47pm

Glad to hear that! Please let me know if I can help.

Topic		Replies	Views
DataViewer.jl supported data file formats Visualization	15	751	October 31, 2023
[ANN] Announcing ElectronDisplay.jl Package Announcements announcement	2	1780	February 14, 2019
[ANN] Julia Data Science Book and Books.jl Package Announcements announcement , book	15	2847	August 15, 2022
[ANN] MakieLayout.jl: A layout manager for Makie.jl Package Announcements	25	4041	December 22, 2019
ANN: JuliaDB.jl Community	40	9692	November 13, 2018

[pre-ANN] DataViewer.jl: explore data files with the power of Makie

Related topics