Is there a lazy version of graph?

tk3369 · December 15, 2021, 7:25am

I came across an interesting problem recently that can be solved with the shortest path algorithms from Graphs.jl. The only problem is that it takes a “long time” (julia scale ) to build the graph because there are 250K vertices and ~1mm edges.

In my situation, the existence of these edges can be calculated on the fly. I wonder if it’s possible to build a lazy graph without having to materialize all the edges upfront. Perhaps I just pass a function that will return true if there’s an edge between the two vertices. My vertices are just numbered sequentially 1:n.

Does anyone know if Graphs.jl can support that or if there’s another package with such functionality? Thanks

etienne_dg · December 15, 2021, 8:46am

As far as I’m aware, there is still no package for building lazy graphs (and I think it would be an awesome feature). You can totally make a new graph type extending AbstractGraph, and use the functions of Graphs.jl.
You just need to implement all the API functions, you can check the documentation to create an alternate graph type. vertices are labelled by a subtype of Integer. Despite what claims the documentation, you should have the neighbors functions return vertices in ascending order and the vertices must form a continuous range starting from one (each vertex must be lower than nv(g)), otherwise some functions will not work as expected. This is especially true for astar implementation, where vertices are directly used for indexing.

You can implement has_edge, outneighbors and inneighbors lazily.

mschauer · December 15, 2021, 8:51am

What is wrong in the docs? Doesn’t it say “ascending order”?

etienne_dg · December 15, 2021, 8:55am

Yes, but it does not speak of continous range starting from one, and ascending order is only for AbstractSimpleGraph, so in theory, AbstractGraph doesn’t require it, but I’m not sure how well this is respected in the codebase…

mschauer · December 15, 2021, 9:01am

Ok, let’s update that

etienne_dg · December 15, 2021, 9:06am

Also, if your performance problem is only in building the graph, how do you construct it? If you add every single edge one by one, this is a very inefficient way to build it, because SimpleGraphs maintain the adjacency lists in ascending order, so it does recompute a lot of things for each edge addition. Edge addition is not constant time for SimpleGraph because of it’s internal structure. You can improve this by batching edge additions, or directly building the fadjlist (and badjlist for directed graphs) yourself.

tk3369 · December 16, 2021, 7:41am

Quick update:
It turns out that my performance problem came from populating the sparse array rather than constructing the graph. I use a sparse matrix to keep track of the edge distances, and the setindex! calls to the array took the majority of time. That’s actually a little surprising though… I will check further when I have time later.

Thanks for the ideas above and sorry about the noise

Tom

P.S. I create all edges in an array and make a single call to create the graph.

mschauer · December 16, 2021, 3:55pm

Creating an empty sparse matrix and filling it with setindex! isn’t particular fast because the CSC format is optimised for efficient arithmetic operations and not for changes to the sparsity structure.

Topic		Replies	Views
[WIP] SimpleLazyGraphs.jl - add vertices and edges as needed Package Announcements question , package , graphs	6	660	July 18, 2022
Graph construction performance Graphs lightgraphs	2	723	May 16, 2019
Inefficiency of add_edge! in SimpleWeightedGraphs New to Julia	7	1249	June 22, 2020
Looping over edges in LightGraphs New to Julia	8	2004	June 7, 2020
Creating a Weighted Graph Graphs lightgraphs , graphs	33	5944	June 16, 2020

Is there a lazy version of graph?

Related topics