How does the performance of GraphNeuralNetworks.jl compare to PyTorch Geometric?

I’ve been playing around with the excellent GraphNeuralNetworks.jl and am thinking about applying it to the biochemistry domain, specifically drug discovery. I was wondering if anyone has tried to benchmark CPU and/or GPU training speed for the same network against PyTorch Geometric?

Also, are there any gotchas I should be on the lookout for? The tasks I have in mind involve both node-level and full-graph predictions.
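For concreteness, the two model shapes I mean look roughly like this (just a sketch using GraphNeuralNetworks.jl layer names; the feature sizes are placeholders):

using GraphNeuralNetworks, Flux
using Statistics: mean

nin, nh = 16, 64  # placeholder feature sizes

# node-level prediction: one output per node
node_model = GNNChain(GCNConv(nin => nh, relu),
                      GCNConv(nh => nh, relu),
                      Dense(nh, 1))

# full-graph prediction: pool node states into one vector per graph
graph_model = GNNChain(GCNConv(nin => nh, relu),
                       GCNConv(nh => nh, relu),
                       GlobalPool(mean),  # aggregate nodes -> graph embedding
                       Dense(nh, 1))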

Sorry for the vague question. :slight_smile:

//Mike


It would be nice to compare performance. PyG has plenty of examples; we should try to translate some of them and compare times. Do you have a specific dataset in mind?
Maybe this example of graph regression on the QM9 dataset could be a good start?

Set2Set hasn’t been implemented yet, but it should be easy.
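
On the Julia side, the timing itself could be as simple as this (a rough sketch on a random toy graph; a real benchmark would of course run over QM9 batches):

using GraphNeuralNetworks, Flux, BenchmarkTools
using Statistics: mean

# toy graph: 100 nodes, 400 edges, 16 node features
g = rand_graph(100, 400, ndata = (; x = rand(Float32, 16, 100)))
y = rand(Float32, 1, 1)                     # fake graph-level target
model = GNNChain(GCNConv(16 => 64, relu),
                 GlobalPool(mean),
                 Dense(64, 1))
loss(m) = Flux.mse(m(g, g.ndata.x), y)

@btime Flux.gradient(loss, $model)          # time one backward pass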


Yes, QM9 is super relevant. I’ll try to port it. :blush::pray:t2:

We can omit the Complete transformation.
I opened a PR for Set2Set.

The other ingredient is QM9. Hopefully it can be loaded from

If that doesn’t work, you can try with

julia> using HuggingFaceDatasets

julia> d = load_dataset("lisn519010/QM9", split="full").with_format("julia")
Dataset({
    features: ['x', 'edge_index', 'edge_attr', 'y', 'pos', 'z', 'name', 'idx'],
    num_rows: 130831
})
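
Rows come back as dictionaries, so converting one record to a GNNGraph should be something like this (untested sketch, assuming edge_index arrives as two vectors of 0-based indices and x as a vector of per-node feature vectors):

using GraphNeuralNetworks

row = d[1]
s = row["edge_index"][1] .+ 1           # 0-based -> 1-based indices
t = row["edge_index"][2] .+ 1
x = Float32.(reduce(hcat, row["x"]))    # (nfeatures, nnodes) matrix
g = GNNGraph((s, t), ndata = (; x))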

I found that QM9 is already available through MLDatasets.jl

using MLDatasets
data = TUDataset("QM9")  # downloads the TUDataset release of QM9

But it doesn’t have the number of rows I would expect: I get 129,433 instead of 133,886, and I’m not sure why. The file qm9.csv, which I think is the original data, has 133,886 rows.

But the TUDataset website does indeed list 129,433 graphs for qm9.zip.

I’ll double-check which one PyTorch Geometric is using in their example.
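
In the meantime, the conversion side looks straightforward, since GraphNeuralNetworks.jl provides a converter for the MLDatasets.jl graph datasets (untested sketch, if I read the docs right):

using MLDatasets, GraphNeuralNetworks

data = TUDataset("QM9")
graphs = mldataset2gnngraph(data)   # one GNNGraph per molecule
g = graphs[1]                       # inspect the first molecule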
