Gaussian Processes with missing data

michielver · January 27, 2021, 12:36pm

Good to know! I will check out if ODE’s make sense for my application (and if I can make sense of them :P). I also encountered Linear Dynamical Systems in the book by Bishop (Pattern Recognition and Machine Learning), which actually almost seems to be a synonym for continuous HMM’s anyway.

Here is my attempt to sort of ‘formalize’ my problem using mathematical notation:

Let D be the number of dogs, let T_d be the number of timepoints for a dog d, let P be the number of features/attributes (i.e. test scores) at every timepoint.

Then for a dog d we have \{z_{d,t}\}_{t=1}^{T_d} with only z_{d,1} being observed, where 0 \leq z \leq 4 and z could potentially be discretized as well if desired (because in the original data z are multiples of 0.25 because it is calculated based on the answers to a questionnaire), but I believe it being continuous might work to our advantage anyway.
Additionally, for a dog d we have \{X_{d,t}\}_{t=1}^{T_d} where X is P-dimensional, because in actuality there are multiple test scores at each timepoint (whereas in my initial post I assumed there to be only 1 for simplicity’s sake). X is an integer with 0 \leq X \leq 100.

I thought it might also be helpful to include a diagram:

Lastly, I did an attempt at forming a bit of an equation, inspired by Arthur’s reply in another topic.

x_{d,t,p} = \beta_{0,p} + z_{d,t} \cdot \beta_{p} + \epsilon_{d,t,p} for d=1,\dots,D, and t=1,\dots,T_d, and p=1,\dots,P.

Where:

x_{d,t,p} is test score p for dog d at time point t
\beta_{0,p} is the intercept for test score p
z_{d,t} is the latent behavioral score for dog d at time t
\beta_p is the coefficient for the behavioral score z on the test score x_p
\epsilon_{d,t,p} are the residuals

Additionally, for the time dependency:

z_{d,t} = f(z_{d, t-1}) for d=1,\dots,D, and t=2,\dots,T_d

where f is some function of time.

Topic		Replies	Views
Gaussian Process Model with Turing Probabilistic Programming regression , turing , gaussian-process	84	11126	November 28, 2020
Faster gp fitting on million observations? Probabilistic Programming gaussian-process	15	1490	May 4, 2021
Imputing Discrete Data Probabilistic Programming question , turing	14	869	April 7, 2021
Conceptual Gaussian Process Questions Probabilistic Programming	15	1911	August 17, 2020
How to implement a model with missing observations in Turing? Probabilistic Programming	6	626	March 12, 2023

Gaussian Processes with missing data

Related topics