Consider a situation where there are multiple subjects and each subject has multiple measurements (response, covariates) over time. The goal is to identify a statistical regression model which allows past responses to influence future responses without creating statistical problems.
When the response is discrete, there are many modelling options for stochastic processes (discrete time and continuous time):
- Multistate model regression for modelling the time based hazard of moving between State i and State j: https://www.researchgate.net/publication/49597683_Multi-State_Models_for_Panel_Data_The_msm_Package_for_R
- Logistic regression for modelling odds of transitioning to State i given currently State j : https://hesim-dev.github.io/hesim/articles/mlogit.html
When the response is continuous, some choices include:
- Time series models with exogenous variables
- Dynamic Panel Models such as those based on the Arellano-Bond GMM estimator
But is it common to base a regression model on a Stochastic Differential Equation (SDE)?
For example, I found this interesting paper (https://gatesopenresearch.org/articles/4-71/v2) here where the authors seem to have adapted a SDE where the drift and diffusion components are themselves regression models.
Using SDEs seems to be the most natural approach as it directly models the response dynamics in continuous time/space as a function of changing response/covariates.
I would be interesting in reading more references on this and see how prevalent this is in statistics.