Hi, I noted the used of “record” for BPTT for LSTM/Recurrence based Models. Is this in used? I don’t see it being documented anywhere, i also could not find it in the code.
How do we do BPTT?
Hi, I noted the used of “record” for BPTT for LSTM/Recurrence based Models. Is this in used? I don’t see it being documented anywhere, i also could not find it in the code.
How do we do BPTT?
Here’s a good article on Neural Differential Equations that might give some pointers. Neural Differential Equations