I'm taking a statistics class. The question has been graded, but I question the validity of the official answer. We are asked whether, based solely on the confidence intervals, the mean BMI differs for individuals with no money worries vs individuals with a lot of money worries. There are 178 individuals with no money worries, and 8 with a lot of money worries. The 95%CI of the lot of money worries completely encloses the 95%CI of the no money worries. On this basis, the official answer is that since the 95% CIs overlap, p is assumed to be >.05.
I am not so convinced that we can say that. The numbers for the two samples are: No worries: N = 178, mean = 30.7264, SE = .36042, 95% CI = 30.010151-31.4376, VAR = 23.122, SD = 4.8057, skew and kurtosis both < 1, outliers: 2, both on the high end but not massively so.
Lot of worries: N = 8, mean = 35.7514, SE = 3.88778, 95% CI = 26.5583 - 44.9446, VAR = 120.919, SD = 10.99630, skew < 1, kurtosis = 1.426, outliers: 1 possible - a single data point much higher than the other 7. Is it actually an outlier? With only 8 data points, who knows.
My general feeling is that the very tiny size of the lot of worries sample combined with the possible outlier among those 8 data points makes it impossible to meaningfully compare the two sample means purely on the basis of whether the 95% confidence intervals overlap. Am I wrong?
Edit: Thanks to everyone. Dave, thanks for the simulation. Jginestet, thanks for clarifying the glitch in that simulation. This was my first post on StackExchange and the system is not allowing me to change my initial acceptance of the simulation as the best answer. But I would like to clarify that I still accept it, only with the modifications suggested by Jginestet. In which case, the consensus does seem to be that I was indeed wrong and it is highly unlikely that p < .05 if the CIs are fully overlapped. Interesting...I think I need to go update my math skills so I can better understand why. And I am still somewhat skeptical that this sort of finding on such a small sample should have public health policy implications, even if it is statistically true. I would still like to see a more "normal" size data set for the lots of money category, to have more confidence that this is an actual reflection of population characteristics.
