Using pretrained PyTorch models in Flux

Hi all. I’m excited to be able to dip my toes back into the Julia pond. I’ve missed the community these last few years.

My question is about how to best leverage existing PyTorch resources, such as the extensive pretrained NLP models in 🤗 Transformers, while doing research using Flux and other Julia packages.

I understand I can always call PyTorch directly through PyCall as a backup, but I’m curious about the best Julia-only approach to exploiting the massive amount of resources going into PyTorch packages.
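For concreteness, the PyCall fallback I have in mind looks roughly like the sketch below (assuming `torch` and `transformers` are installed in whatever Python environment PyCall points at; the model name is just an example):

```julia
using PyCall

transformers = pyimport("transformers")

tokenizer = transformers.AutoTokenizer.from_pretrained("bert-base-uncased")
model     = transformers.AutoModel.from_pretrained("bert-base-uncased")

inputs  = tokenizer("Hello from Julia!", return_tensors="pt")
outputs = model(inputs["input_ids"])                     # forward pass runs on the Python side
# recent transformers versions return a dict-like ModelOutput
hidden  = outputs["last_hidden_state"].detach().numpy()  # comes back as a Julia Array
```

That works, but everything interesting still runs on the Python side, which is what I’d like to avoid.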

Thanks in advance, and great work with the language and package ecosystem!


Not sure there is a very simple or clear answer yet, but I’ll point to Torch.jl and Peter’s ongoing GSoC project.


Thanks! The GSoC link looks very relevant :grinning:. Do you have more details on the planned scope and timing of that project?

Another way to do this is to export the PyTorch net to ONNX (torch.onnx — PyTorch 1.12 documentation) and read it in with Flux (GitHub - FluxML/ONNX.jl: Read ONNX graphs in Julia).
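Roughly like this (an untested sketch; the export is driven from Julia via PyCall just to keep it in one snippet, and ONNX.jl’s entry point has changed between versions, so check its README):

```julia
# PyTorch side: trace the model and write an .onnx file
using PyCall
torch       = pyimport("torch")
torchvision = pyimport("torchvision")

model = torchvision.models.resnet18(pretrained=true)
model.eval()
dummy = torch.randn(1, 3, 224, 224)                # example input used for tracing
torch.onnx.export(model, dummy, "resnet18.onnx")

# Julia side: read the graph with ONNX.jl (in the version I used, this wrote
# out model.jl and weights.bson next to the file — check the current README)
using ONNX
ONNX.load_model("resnet18.onnx")
```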


Hi @tbreloff,
The currently planned approach for the GSoC project is to read the serialized state_dict and rebuild the model according to the weight names. PyTorch has several ways to save a model; one of them is to extract the model’s state_dict and save it in a pickle-like format. I can release some draft code for loading the state_dict next week if you need it.
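To give a feel for the idea, here is a hand-written sketch (not the GSoC code itself): load the state_dict with Pickle.jl and copy tensors into a Flux layer by name. The file name and layer name are made up for illustration.

```julia
using Flux, Pickle

# assumes the file was written in Python with `torch.save(model.state_dict(), "model.pt")`
state = Pickle.Torch.THload("model.pt")    # Dict of weight name => array

# e.g. an `nn.Linear(768, 2)` registered as "classifier" on the PyTorch side
W = state["classifier.weight"]             # PyTorch stores Linear weights as (out, in);
b = state["classifier.bias"]               # depending on memory layout a permutedims may be needed

dense = Dense(size(W, 2), size(W, 1))
Flux.loadparams!(dense, [W, b])            # copy the loaded arrays into the Flux layer
```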


Hi there, is there an update on this idea/project? How should one go about doing this right now?

https://github.com/chengchingwen/Transformers.jl reads pre-trained weights from HuggingFace and more using GitHub - chengchingwen/Pickle.jl: An experimental package for loading and saving objects in Python Pickle format. You should be able to build off of the latter for other models (e.g. torch.hub).
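For example, with Transformers.jl’s HuggingFace support it looks roughly like this (a sketch based on the `hgf"..."` string macro from its README; double-check the current docs for the exact form):

```julia
using Transformers
using Transformers.HuggingFace

# downloads and caches the checkpoint on first use, returns a Julia model
model = hgf"bert-base-uncased:model"
```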