Newest 'survey-sampling' Questions

0 votes

1 answer

27 views

references on how to include nonsampling errors in the measure of uncertainty

On the page https://www.ons.gov.uk/methodology/methodologytopicsandstatisticalconcepts/uncertaintyandhowwemeasureit, there's a section where the UK Office for National Statitics talks about non-...

Coris

3

asked Nov 7 at 10:45

9 votes

5 answers

3k views

A national poll of 1000 French returns 25% of "Yes". Is it enough to say that it's quite impossible locally to be 0%, if we don't know the variance?

Ipsos returned the result of a poll of 1000 people in France, telling that 25% of them have already practiced naturism. When we tell mayors of French cities this poll result, they are usually ...

Marc Le Bihan

351

asked Jul 26 at 13:05

0 votes

1 answer

51 views

How to predict household income in census sample using income & expenditure survey data in R? [closed]

I have two datasets at the individual level, which I aggregated to the household level: Census Sample: A 2% sample of the total population from a national census. This includes household-level ...

Mohammad Haddadi

101

asked Jul 18 at 17:40

0 votes

0 answers

23 views

Solving Lagrange multipliers in an Empirical log-likelihood function in the context of stratified survey sampling

How can I find the value of the next function $l_{max } (\theta ) $ for a certain $\theta$ fixed? I have a sampling problem in which I have two variables 𝑦𝑖 and 𝑤𝑖 that represent totals. My idea ...

xenuti

13

asked May 26 at 21:58

1 vote

0 answers

57 views

Sequential sampling for estimating a proportion

I am conducting a systematic review of studies and there are thousands of studies that are eligible for coding. Of these studies I am looking to estimate the proportion of a binary characteristic in ...

Vefeagins

1,106

asked Apr 21 at 21:09

8 votes

1 answer

496 views

Does Bayesian estimation need finite population correction?

Does Bayesian estimation assume an infinite population and does it not require a finite population correction? Say, we want to estimate the mean of a finite population, assuming that the iid values ...

Nip

671

asked Apr 5 at 23:52

0 votes

0 answers

71 views

Approach to Bootstrapping for Survey Data Analysis

I performed a survey and collected $k$ Likert-scale responses for $n$ respondents. My analysis involves averaging the responses for each respondent and comparing the averaged response vector to a ...

TSP

619

asked Jan 6 at 20:55

0 votes

0 answers

31 views

Comparing two identical surveys with different samples sizes and response levels

Silly question, but my mind has gone blank. I’m trying to undertake a simple comparison between two samples (organisational performance) using an online survey. To obtain a CI of 80% with a margin of ...

A Wilson

1

asked Dec 5, 2024 at 10:38

5 votes

1 answer

123 views

Would withholding marks until students respond to a survey bias the responses?

My university is running an anonymous survey, mostly to check if we understand how we are going to be assessed, if we are comfortable with the material, and if we find the material well organised. ...

Porter

53

asked Nov 18, 2024 at 20:22

0 votes

0 answers

48 views

Proper Difference-in-Difference Model for Time Variant Groups

Take the following example... I have two areas: Area A and Area B. Area A are individuals in a geographic area who are exposed to a health intervention. The health intervention is applied to the ...

LeslieKish

238

asked Aug 26, 2024 at 21:37

1 vote

0 answers

92 views

Combining Survey Weights

I have an annual survey that is somewhat complex in design. Sample frames are pulled quarterly and overlap. Each quarter's sample is removed from the subsequent quarter's frame. Samples are stratified ...

ReliableResearch

426

asked May 29, 2024 at 14:43

0 votes

1 answer

69 views

Sample size for survey

My interest is to perform a statistically significant survey on a population of 1700 people, that can be described in different categories, so each person belongs to only one category. I have two ...

user412668

asked May 21, 2024 at 10:12

1 vote

0 answers

87 views

Multi-level Model and Multi-level Data

I have a question about multi-level models with multi-level survey data. I am working with survey data that has a two-stage sampling design with primary sampling units defined as schools randomly ...

UT_Max

11

asked May 9, 2024 at 21:54

3 votes

1 answer

76 views

Statistical Non-Response and Drop Out

In statistical studies, it is possible that there might be biases: Someone groups of people are more likely to be represented compared to others groups of people (e.g. poorer people have difficult ...

user412241

asked May 9, 2024 at 18:12

0 votes

0 answers

84 views

Is naive mean estimator uniformly worse than HT (IPW) or Hajek estimators in survey sampling? If not, why is it less discussed in the literature?

Consider a toy example: we are interested in the average height of $n$ students $\bar{\tau}=\frac{1}{n}\sum_{i=1}^n\tau_i$, but for some reason, we can only access a random subset $S$ of it. Every ...

Voyager

295

asked Apr 22, 2024 at 7:23

1 vote

1 answer

124 views

Sample a random subgraph from an undirected, unweighted graph, what's the probability of "every two nodes's distance is at least 3 in the subgraph"?

This may be a problem in sampling theory or graph theory. I have done many research but I still didn't find valid solutions. I know a simple random sample is representative of the population. Now I ...

Voyager

295

asked Apr 17, 2024 at 2:40

2 votes

0 answers

59 views

Use calibration weights to correct for unit non-response bias?

I have a question about how calibration weights can be used to sufficiently correct for unit non-response bias. Suppose the sample is s and the response set is r. Calibration is applied to the ...

Willi Zhang

410

asked Mar 5, 2024 at 8:53

5 votes

3 answers

351 views

In stratified sampling, why is the stratum population variance obtained by dividing by 1 less than the stratum size

I am aware that flavors of this question get asked a lot, for e.g., here. I am fine with the sample variance being divided by $n-1$ and that is what makes it an unbiased estimator of the population ...

Tryer

307

asked Feb 7, 2024 at 16:15

0 votes

2 answers

302 views

Reliability of online surveys

I'm trying to get an idea about reliability of online surveys: I found some indication that "internet-based surveys produce data that is at least as reliable, valid, and of equal quality as data ...

Mauro

103

asked Jan 20, 2024 at 8:43

1 vote

0 answers

50 views

Sampling inquiry for thesis [closed]

I have a mixed-method thesis ongoing and I plan collecting data on my own college (namely college X), specifically from students and faculty members on my department. Evidently, that would be ...

Kenzo

41

asked Jan 9, 2024 at 5:22

0 votes

1 answer

359 views

comparing two samples drawn using two different sampling methods

This is a hypothetical question, so I don't have a lot of additional details to give. However my question is pretty straightforward: Is it theoretically valid to conduct tests (e.g. for comparing ...

Daniela

57

asked Dec 13, 2023 at 5:36

0 votes

0 answers

56 views

Can I add a variable to a complex sample, and run a regression?

In a survey, a complex sample was collected, and the sample was designed to provide estimates at national level. In other words, individuals from one state were more likely to be sampled due to ...

Oalvinegro

439

asked Dec 6, 2023 at 12:10

1 vote

0 answers

106 views

Measuring the reliability of a survey data

I have a survey data and I applied KR20 on it. The KR20 score is 0.63 which means this survey result is not consistent and reliable, at least not in a reliable range with the definition of a reliable ...

Moh-Spark

31

asked Nov 20, 2023 at 19:53

0 votes

0 answers

80 views

Sampling error for proportion with finite population - which correction to use?

I am trying to calculate sampling error for a questionnaire that was answered by some of the participants in a program (say about . I want to calculate the sampling error for the proportion of the ...

eli-k

134

asked Nov 13, 2023 at 20:13

0 votes

1 answer

77 views

How to treat age-eligibility thresholds in household surveys (e.g. HRS)?

Most household surveys have age-eligibility thresholds. The HRS interviews individuals aged 51 and older, plus their spouse (if any) using PPS sampling. Do I need to drop individuals who are younger ...

cascom

41

asked Nov 9, 2023 at 15:46

1 vote

0 answers

82 views

Unbiased estimate of mean test score of pupils in a country (sampling frame of schools is avaible only)

My primary goal is to get unbiased estimate of mean test score of every pupil in a country. I have no sampling frame of all pupils to randomly sample from. But I have a sampling frame for every school....

Nothingman

101

asked Nov 7, 2023 at 11:17

1 vote

1 answer

368 views

When to use replicate weights in complex survey analysis

I am curious about when it is recommended to use replicate weights in survey analysis. I compared the usual survey analysis with using replicate weights, as illustrated below. Based on the paper "...

Willi Zhang

410

asked Oct 30, 2023 at 10:44

2 votes

1 answer

129 views

Different ways to define survey design object under MCAR assumption

I have a stratified random sample, and would like to conduct complete-case analysis, assuming Missing Completely At Random. However, I find that there seem to be two ways to define survey design ...

Willi Zhang

410

asked Oct 18, 2023 at 8:58

1 vote

2 answers

273 views

Is it reasonable to subset a survey design object by dependent (outcome) variable and fit a weighted logistic regression model?

I would like to study which factors are associated with an outcome which has more than two categories. After considering multinomial logistic regression model (which I find is very challenging to ...

Willi Zhang

410

asked Oct 17, 2023 at 15:31

0 votes

0 answers

200 views

What is the difference between a repeated cross-sectional survey design and a trend survey design?

Most of the references I have checked for repeated cross-sectional design and trend design (a type of longitudinal design) have said that they are one and the same. However, my professor says that ...

abetebatebs

1

asked Sep 24, 2023 at 11:34

1 vote

1 answer

155 views

Domain (subgroup) estimation in a stratified random sample

I want to ask a question about domain estimation (i.e., estimation of a parameter among subpopulations) in a stratified random sample. It seems to me that in a stratified random sample, domain ...

Willi Zhang

410

asked Sep 17, 2023 at 8:30

0 votes

0 answers

243 views

How can I reweight survey data

I have survey data from a complex survey with stratification, weights and clustering. I'm using the survey package in R to run regressions: ...

dash2

216

asked Sep 16, 2023 at 10:03

2 votes

1 answer

69 views

Cluster sampling result in larger sample-to-sample variability

I'm reading STATA's Survey Data Reference Manual. There is written that: Cluster sampling typically results in larger sample-to-sample variability than sampling individuals directly. Do you have an ...

robertspierre

3,403

asked Aug 24, 2023 at 9:53

0 votes

1 answer

150 views

Can I make a proportional-to-size without replacement sample (PPS WOR) self weighted?

Let's say I have 100 schools and each has a different number of students. I want to estimate which % of students are in schools with electricity. Simulation and theory indicate it is more efficient to ...

Fernando Irarrázaval G

131

asked Jul 14, 2023 at 22:08

3 votes

1 answer

329 views

Appropriate way to use post-stratification weights when running statistical tests SPSS

I have used Complex Samples in SPSS (and SUDAAN in SAS, Survey in R) when working with survey data that were collected using a sampling design that was not random. For example, when an oversample was ...

Brett Wyker

31

asked May 15, 2023 at 19:45

1 vote

0 answers

46 views

Small Area Estimation techniques when no micro information is available

Small area estimation (SAE) techniques combine information from household surveys with existing auxiliary information at population level to make inferences of certain indicators for population groups ...

RJ-mac

11

asked Apr 4, 2023 at 13:51

7 votes

1 answer

396 views

Who created the "soup analogy" for sampling

The soup analogy is, You only need a single spoon to sample the soup, provided it is well stirred. It has been used several times here Sampling distributions of sample means and What is your ...

James K

597

asked Apr 3, 2023 at 11:28

6 votes

1 answer

175 views

What are the differences and common points, if any, between oversampling as a survey design method and oversampling in a machine learning context?

I've seen the term "oversampling" used in a survey design methodology context and in a machine learning context (e.g. methods like SMOTE). I'm intrigued by the differences between the two. ...

Kap

63

asked Mar 31, 2023 at 19:24

2 votes

0 answers

146 views

What is the (Ratio estimator for the) covariance of two weighted means? [closed]

In a previous question I've asked How to estimate the (approximate) variance of the weighted mean?, specifically, how to prove the following formula: $$ \widehat{\sigma_{\bar{y}_w}^2} = \frac{1}{(\sum{...

Tal Galili

22.1k

asked Mar 26, 2023 at 18:09

5 votes

1 answer

373 views

Why does the survey package in R and SPSS complex samples add-on give different standard errors?

I was comparing results that I generated in R for complex survey analysis using the survey package to results from SPSS using the complex samples analysis add-on. The sample size is large ~ N=5500 ...

s.stats

485

asked Mar 10, 2023 at 16:40

1 vote

0 answers

53 views

Conformal prediction for model-assisted survey estimation

In model assisted survey estimation, one typically uses the generalized difference estimator: $$ \hat{t}_{ma} = \sum_{k \in U} \hat{m}(\mathbf{x}_k) + \sum_{k \in S} \frac{y_k - \hat{m}(\mathbf{x}_k)}{...

user191413

11

asked Mar 2, 2023 at 1:32

5 votes

0 answers

240 views

Question concerning svydesign and svyglm in R

I have a complicated data set which was made by a multistage stratified cluster design. I had originally analysed this using glm, however now realise that I have to use svyglm. I'm not quite sure ...

Ian Holdroyd

63

asked Feb 18, 2023 at 18:19

0 votes

0 answers

48 views

Definition of rotated panel sampling

I am doing exercises and I come across a question that asks me to describe sampling with rotated panel. What does rotated panel sampling mean?

iStats7238

83

asked Feb 11, 2023 at 15:52

2 votes

1 answer

81 views

Can I CUT a sample to become representative?

Suppose that, from a finite population, we estimated the minimum sample size as 1000 to reach our desired confidence level and error. Data was collected using an online survey and the survey remained ...

Oalvinegro

439

asked Feb 9, 2023 at 1:33

0 votes

0 answers

105 views

Is it appropriate to pre-stratify and post-stratify along different delineations of the same variables in a single survey?

I am working in the context of opt-in, web-based surveys. Often the desire is for accurate population estimates, and often at a country-wide level. The standard approach at this organization is to ...

spathartic

1

asked Feb 7, 2023 at 20:04

1 vote

0 answers

48 views

How can I show probability of selection change when adding stratification to a survey design

I have a survey that uses a stratified sampling approach with optimal allocation. The team conducting the survey has asked that we make two changes: Subdivide one of the strata into smaller pieces. ...

B. Bogart

123

asked Jan 23, 2023 at 18:31

2 votes

0 answers

220 views

Survey with two simple random samples without repetition

I have an particular exercise of sampling survey, or sampling theory, which I report below. One is interested in knowing the price per gram of gold produced by 100 companies. A monthly survey of a ...

iStats7238

83

asked Jan 14, 2023 at 15:05

0 votes

1 answer

59 views

Population bias in survey leading to inaction

This isn’t exactly an academic statistics question, but it is a real problem that I’m trying to understand with regards to bias in survey statistics leading to issues in real-world decision making. I’...

Concerned Sampling Person

1

asked Jan 8, 2023 at 21:13

3 votes

1 answer

174 views

Is post-stratification inherently non-Bayesian?

It is increasingly common to employ regression with post-stratification. Since probability-weighting is incoherent in Bayesian inference (thus why sampling/survey weights and weighted psuedo-...

socialscientist

889

asked Nov 29, 2022 at 3:29

0 votes

0 answers

119 views

(i.i.d) Random sampling assumption in practical situations

This is a practical question. Assume that there are two finite populations X and Y in the real world. For example, we want to compare $\bar{X}$ and $\bar{Y}$. We can use a probability sampling scheme ...

Neuchâtel

135

asked Nov 25, 2022 at 0:09

Questions tagged [survey-sampling]