I would like to create an implementation of the Transformer NLP architecture; more specifically, I am trying to implement a BERT summarizer. I have been using the Transformers.jl package (v0.1.7) and have followed the documentation tutorial, but I am unable to get the example to run, for the following reasons:
- The line in the tutorial

```julia
enable_gpu(true) # make todevice work on gpu
```

returns "CUDA not functional", even though it is functional (running `CUDA.functional()` returns `true`). I have made sure that `libcuda.so` is visible.
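To make the problem easier to reproduce, here is a minimal sketch of what I am running; the ordering of the `using` statements is my assumption of what the tutorial implies, and the comments describe what I observe on my machine:

```julia
using CUDA
using Transformers

# CUDA itself reports as functional in this session:
CUDA.functional()  # returns true for me

# ...yet this call fails with "CUDA not functional":
enable_gpu(true)   # intended to make `todevice` move data to the GPU
```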
- The code

```julia
vocab = Vocabulary(labels, unksym)
```

references `Transformers.Basic.Vocabulary`, which does not exist. I have scoured the Transformers.jl source code and cannot find the `Vocabulary` function.
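For completeness, this is the context in which I am calling it; `labels` and `unksym` here are placeholder values of my own (the tutorial builds them from its dataset), and the comments record where things go wrong for me:

```julia
using Transformers
using Transformers.Basic  # fails for me: I cannot find the Basic submodule in v0.1.7

labels = ["[UNK]", "[CLS]", "[SEP]", "a", "b"]  # placeholder vocabulary entries
unksym = "[UNK]"                                # symbol for out-of-vocabulary tokens

vocab = Vocabulary(labels, unksym)  # Vocabulary is not defined in my installation
```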
I am assuming that the available documentation may not be up to date with the current version of the package, hence this request for assistance. Any feedback or advice would be greatly appreciated.