I am currently working on a longitudinal dataset in which I aim to cluster individuals based on the trajectory of a single continuous variable measured repeatedly across time (e.g., daily values). The goal is to identify distinct trajectory-based subgroups within the study population and subsequently compare these subgroups in terms of clinical or other downstream outcomes.
Initially, I considered using factor analysis and latent class/profile analysis (LCA/LPA), but I soon realized that these methods are typically designed for situations involving multiple observed variables, often for purposes such as dimensionality reduction or latent construct identification. While these methods might technically accommodate repeated measurements as separate variables, I am uncertain whether they are appropriate or statistically valid when applied to only one construct measured at multiple time points.
As I continued my search, I came across Latent Class Growth Analysis (LCGA) and Group-Based Trajectory Modeling (GBTM), which seem specifically designed for clustering individuals based on their longitudinal patterns of one specific variable. From what I understand, GBTM is essentially a special case of LCGA which assumes that error variance is the same for all classes and all time points. I also encountered Growth Mixture Modeling (GMM), which allows for within-class variability (i.e., random effects).
I believe that LCGA/GBTM or GMM are likely the most appropriate approaches for my aim, but as someone new to this modeling framework, I find the distinctions and assumptions somewhat overwhelming.
Could anyone with experience in trajectory-based clustering kindly provide guidance or point me toward best practices for:
◦ Choosing between LCGA, GBTM, and GMM
◦ Whether using LCA/LPA/FA with repeated measures (as separate indicators) is fundamentally flawed in this context
Any insights, references, or shared experiences would be greatly appreciated.
