Let's say that the analytical infrastructure at my place of work heavily centers around t-tests. I'd like to use this platform to perform a t-test on correlated data, e.g. the extent to which an order placed by a customer was 'satisfactory' or 'unsatisfactory' where the treatment variable is at the level of the customer and the continuous dependent variable - 'order is satisfactory' [0,1.0] - is at the order level and is clustered on customers. Our systems are set up to perform an independent two-sample t-test, i.e. $$ t = \displaystyle\frac{\bar{X_1}-\bar{X_2}}{S_p\sqrt{\frac{2}{n}}} \text{ where } S_p=\sqrt{\frac{s^2_{x_1} + s^2_{x_2}}{2}} $$ ...but my understanding is that given the clustered nature of the data, $s^2_{x_1}$ and $s^2_{x_2}$ are no longer unbiased, and more likely than not, will underestimate the true variance.
I've been advised that I can use the Delta Method to approximate the true variance, thus enabling a t-test where the standard errors are approximately correct. I've tried to go over the Delta Method, but I've struggled to understand how I can apply the insight that $$ {\displaystyle {{\sqrt {n}}[g(X_{n})-g(\theta )]\,{\xrightarrow {D}}\,{\mathcal {N}}(0,\sigma ^{2}\cdot [g'(\theta )]^{2})}} $$ .. to the estimation of $s^2_{x_1}$ and $s^2_{x_2}$.
I'm definitely a bit out of my league here, so any help (e.g. example walkthroughs of the Delta Method, reading material appropriate for beginners) would be greatly appreciated.
Thanks!