Neural ODE with unkown time steps

Hey! , I am trying to model 3D points in space using a Neural ODE, those points form a trajectory since they originated from the same particle and are ‘snapshots’ of it throughout time, I want to use a Neural ODE to model the dynamics but my data has only the X, Y ,Z , Px (for momentum on the X-axis) , Py and Pz , and I don’t have any data related to time, I thought of using the distance in 3D space as a replacement for time but the trajectories are curved (and since In the end , the model will be used to classify points to different trajectories so time isn’t that important in the inference phase),
what can I do in this case?
thanks for you time and help!

I’m not sure I quite understand your problem. Illustrate it with an example?

Oh hi Chris! , thanks for your effort towards the community, I learnet diffeq.jl from your tutorial :slight_smile: ,so here is the thing, I have data on particle collisions, the data is the position of the particles throughout the detectors, each position of the same particle is a “hit”, so as they move in the detector from the collision point to the outside, the positions are documented (x y z ) but the timestamps of when each hit happened are not documented, so i have points in space only, that they form a trajectory of the movement of that particle, my question is, how can i model this using a neural ODE if i don’t have the exact time stamps (time step) for the loss function to work since it compares each point in space from the real data to its corresponding from the neural net (at that timestamp).
Note that i have momentum values too along with the coordinates(xyz).
P.S.i usually dont share screenshots but we are discussing theory not code.

Do you know if these values are meant to be sequential, i.e. is id 2 in time after id 1 and you just don’t know how long it took? Do you have an idea for the maximum time it could’ve taken?

Yes, they are sequential and i know exactly what hits belong to what trajectory , each trajectory has a max of 21hits , average ~10 , and min 3/4 hits, i do have all the info about what hit to what track ,hence ,to which particle it belongs , while i do not have the time span or the maximum time

You can do a form of data shooting:

For each subiteration, sample a random data point. Solve it over (0,tmax), where tmax is the maximum time to the next point. Now calculate the closest point along this trajectory to the next data point. Take that as the time point for the observation and calculate the loss against that data point. Repeat that between a bunch of data point. This is parallelizable and should have robust properties with respect to local minima.

@ChrisRackauckas HI!, I am doing what you told me, but i am having some difficulty parallelizing it, i tried using Threads.@threads (with atomics too)and @floop inside of my loss function, but they dont work with the AD backends.

Code:


function loss_func(_ , ps , st , tracks)
    """local total_loss::Float32 = 0 

    for track_id in 1:Batch_size
        track = tracks[track_id , : ]
        number_of_hits = count(!ismissing, track)
            
        if number_of_hits <= num_of_groups * 2 
            continue 
        end

        local per_group = number_of_hits ÷ num_of_groups 
        ss_lock = Threads.SpinLock()
"""
        #THE IMPORTANT PART 
        @floop for ind_group in 1:num_of_groups

            local distance = 0f0
            local min_dist = 1e10
            local plus::Int16 = 0

            pred , st= predict_neuralode( track[((ind_group-1)*per_group)+1] , ps) 
            if ind_group == num_of_groups 
                plus = number_of_hits % num_of_groups
            end
            for step in 1:numsteps
                for  data_point in ((ind_group-1)*per_group)+1:ind_group*per_group+plus

                    distance = norm(pred[:, step]' .- track[data_point])    
                    if min_dist > distance 
                        min_dist = distance  
                    end 

                end 
            end     
            lock(ss_lock) do     
                total_loss += min_dist^2 
            end 

            if ind_group < num_of_groups 
                total_loss += continuity_term * sum(abs,pred[: , end]' .- track[(ind_group*per_group)+1])
            end 

        end 

    end 

    return total_loss, st, zero(Float32)
end 

That should do just fine. If not, you can use the ensembles interface.

Hey again! : ) @ChrisRackauckas
now after i trained the model, how do i use it correctly in the inference phase

like since i still don’t have timesteps values saved, and cant use (0,tmax) since not all of them are equally distanced.
i tried to do constant time stepping and let the neural net learn extra dynamics,but the loss values is in the ~e8, i realized it was a bad idea.

P.S i try to think on a solution on my own before i ask you each time, thank you for your time, really appreciated :slight_smile:

You’d recreate the ODE using the trained neural network but now with different time spans. If the neural network is only dependent on phase space then time shifts do not effect the result.