Basic example for using Transformers.jl for sequential autoencoding

I am interested in using Transformers.jl for sequence-to-sequence autoencoding, and I am hoping for help with a minimum working example that builds and trains a suitable transformer-based sequence-to-sequence autoencoder with Transformers.jl on a synthetic dataset of sequences of varying length.

For example, consider the following dataset, which consists of sequences of random samples from a Gaussian distribution:

# Set parameters for synthetic sequence data generation
elem_dim = 5; # The number of dimensions in a sequence's element
mean_seq_length = 10; # Desired average length of sequences
std_seq_length = 3; # Standard deviation in length of sequences
num_seqs = 100; # Number of sequences to generate

# Each sequence is a vector of elem_dim-dimensional elements; lengths are
# drawn from a Gaussian, rounded, and clamped to be at least 1
seqs = [
    [randn(elem_dim) for j in 1:max(1, round(Int, std_seq_length*randn() + mean_seq_length))]
    for i in 1:num_seqs
]
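
To make question 1 below concrete: my rough guess (which may well be wrong) is that the sequences have to be padded to a common length and stacked into a dense array following the usual Flux convention of (features, time, batch), together with a mask marking the real positions. A base-Julia sketch of that assumption:

max_len = maximum(length.(seqs))

# Pad with zeros into a (elem_dim, max_len, num_seqs) array
padded = zeros(Float32, elem_dim, max_len, num_seqs)
mask = falses(max_len, num_seqs) # true where a real (non-padded) element exists

for (i, s) in enumerate(seqs)
    for (j, x) in enumerate(s)
        padded[:, j, i] .= x
        mask[j, i] = true
    end
end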

Given data of this format, I am interested in the answers to the following questions:

  1. How does the variable seqs need to be formatted in order to be used as input to a transformer built with Transformers.jl? (Is the padded-array guess above on the right track?)
  2. What is the smallest, most basic transformer-based architecture for sequence-to-sequence autoencoding of data of this type? (A tentative sketch of what I have in mind follows this list.)
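
For question 2, here is the kind of minimal encoder-decoder model I have in mind, written against what I understand to be the Transformers.Basic API. The Transformer, TransformerDecoder, and PositionEmbedding constructors, their argument meanings, and the calling conventions below are all my assumptions from skimming the documentation, so please correct anything that is off:

using Flux
using Transformers
using Transformers.Basic # assumed to provide Transformer, TransformerDecoder, PositionEmbedding

hidden = 32 # model dimension (arbitrary choice)

input_proj = Dense(elem_dim, hidden) # lift 5-dim elements to the model dimension
pos_emb = PositionEmbedding(hidden)  # assumed positional-embedding layer

# One encoder layer and one decoder layer: 4 heads, 8 dims per head,
# 64-unit feed-forward (argument order is my guess from the README)
encoder = Transformer(hidden, 4, 8, 64)
decoder = TransformerDecoder(hidden, 4, 8, 64)

output_proj = Dense(hidden, elem_dim) # map back to the original element dimension

function autoencode(x) # x assumed to be a (elem_dim, seq_len, batch) array
    h = input_proj(x)
    h = h .+ pos_emb(h)
    enc = encoder(h)
    dec = decoder(h, enc) # decoder attends to the encoder output
    return output_proj(dec)
end

# Naive reconstruction loss on a padded batch (masking of the padded
# positions is omitted here for simplicity)
loss(x) = Flux.mse(autoencode(x), x)

In particular, I am unsure whether the decoder should receive the same embedded input or some shifted version of it, and how the mask from the padding sketch above should be passed in.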

Thank you very much for your time and help. I apologize that this question is quite basic, but I haven’t been able to find a suitable answer online.