Flux seq2seq

I’ve made some progress and put all my code, with some explanations, in a notebook. The model does seem to learn something… more often than not, the subject of the sentence is correct, but the remaining words are gibberish.

Also I notice a big difference in performance with different hyperparameters but I’m not sure how I could choose the optimal ones.

I’d really appreciate someone providing me with some feedback.

Thanks,
Jules

1 Like