Questions tagged [qq-plot]
A Q–Q plot (or quantile quantile plot) is a scatterplot of the quantiles of two distributions. Q–Q plots are useful for comparing distributions.
342 questions
5
votes
1
answer
305
views
Determining best mixed effects model for the prediction of ordinal data, from a continuous non-normally distributed variable
How do I choose a good model for this analysis? I'm going to describe the scenario below, and outline several options I have brainstormed.
First, the scenario:
I have data from 50 technicians that ...
13
votes
2
answers
956
views
How normal do ANOVA residuals have to be?
A collaborator analyzed some data with a one-way ANOVA. But when I looked at the data, I had this residuals QQ Plot. It doesn't look very normal. But my collaborator went ahead with the ANOVA. I've ...
0
votes
0
answers
90
views
Trouble interpreting qqplot from gam.check()
I'm having trouble interpreting the diagnostic plots obtained from a gam modeled with family="scat".
The data seem to adjust reasonably well to the 45 degree line, but the red reference line ...
3
votes
1
answer
396
views
Two definitions of the ECDF - why use 1/(n+1) instead of 1/n, especially for QQ-plots?
In the context of QQ-plots I encountered two different definitions of the ECDF: The first definition is$$F(x)=\frac{1}{n}\sum_{i=1}^n1_{[X_i,\infty[}(x)$$
and the second definition is
$$F(x)=\frac{1}{...
3
votes
0
answers
155
views
Randomized quantile residuals in this paper
I am reading the article "The Unit-improved second-degree Lindley distribution: inference and regression modeling" by Emrah Altun and Gauss M. Cordeiro. And I want to replicate one of their ...
4
votes
2
answers
491
views
Normality assumption - qqplot interpretation
I am currently working on a project that involves evaluating the distribution of several variables, and I am using Q-Q plots as part of the analysis. While I have generated the Q-Q plots for these ...
4
votes
1
answer
196
views
Non normal data in a LMM; beginners question
I am new to statistics and am seeking guidance on analyzing the effects of earthworms on litter-derived carbon using R.
I conducted an experiment to assess the impact of earthworm presence (with three ...
1
vote
1
answer
114
views
Interpretation of scatter and qqplot to apply regression [closed]
I am new to applying the machine learning models. I have to find a correlation between 1 continuous dependent variable and 27 continuous independent variables.
In the beginning, I was confused about ...
3
votes
2
answers
218
views
Justifying Residual Histograms and QQ Plots for Linear Regression
Conceptually, I am having a hard time as to why we consider the quantile-quantile plot for linear regression diagonistics, and I cannot seem to get a clear answer after searching extensively.
The ...
0
votes
0
answers
71
views
Assessing whether a dataset follows a distribution using Q-Q Plot
I recently found myself answering a question on Stack Overflow about adjusting a dataset to a unknown distribution.
Adding my two cents to the community, I have provided a script to draw a Q-Q Plot in ...
1
vote
0
answers
66
views
Distribution of the model vs. Distribution of the Residuals
Let's say I'm going to do an analysis where my response variable has a gamma distribution. I perform the analysis pointing to the distribution in my model (eg. using the lme4 package, m1<-glmer(Y~...
1
vote
1
answer
166
views
Ambiguity between Statistic Normality Test vs Visual Normality Check
I'm learning some basic EDA using the Boston housing price dataset and I want to filter out outliers in the feature columns. To do that I first wanted to understand what distribution each of my ...
0
votes
0
answers
137
views
Are GLM response residuals supposed to be centered on 0?
I'm struggling with the idea of residuals and error terms in GLMs. I've gathered that there are no explicit error terms in GLMs because the distributions modelled don't allow the decomposition between ...
13
votes
6
answers
2k
views
Why GLM doesn't have an error term and why shouldn't residuals be i.i.d?
I've read dozens on post on the subject but I cannot figure this out. From what I've gathered, GLMS don't include an error term in their formulation unlike linear models (LM). I was wondering why (or ...
1
vote
1
answer
167
views
How do I interpret this QQ plot and residual vs fitted plot?
I have a model in R looking at infectious disease spread on social networks, and I am running into a problem where my data are clearly not normally-distributed when I try to run a linear regression ...
1
vote
0
answers
343
views
How does plotting QQ plot on ggplot work?
I am new to r programming and have ran into an odd situation while plotting a QQ plot for studentised residuals with ggplot2. See code and plot below:
...
1
vote
1
answer
129
views
Are there any heavy tailed distributions available for GAMM?
I have a gamm that looks to be heavy tailed according to the qqplot so I'd like to account for this. According to this page things like scaled t distributions for heavy tailed data are only available ...
10
votes
3
answers
2k
views
Why does creation of a Q–Q plot in Excel need an adjustment by 0.5?
I am aware that different statistical packages provide Q–Q plots using code or via a black box. For example, minitab with R integration for Q–Q plot from here.
I am trying to do this manually via ...
3
votes
2
answers
568
views
Normality test using normal Q-Q plot and histogram
I have plotted a normal Q-Q Plot and a histogram to check the normality of this set of discrete data. My interpretation is the data are not normally distributed since they do not fall on the linear ...
8
votes
4
answers
1k
views
An interesting observation regarding the log transformation of data
I stumbled upon something interesting while attempting to do a log transformation for some data (with zeros) today. It seems that there must be a good reason for this that I'm just not seeing. I'm ...
0
votes
1
answer
448
views
Impact of outliers to QQ plot
I'm trying to build an GLM regression (10k samples and 50 dimensions). I ran an analysis of the dependent variable since the regression has a normality assumption for the dependent variable.
The QQ ...
1
vote
0
answers
47
views
Why my GWAS p-value QQ-plot falls far above diagonal? [duplicate]
I'm trying to run GWAS pipeline using plink, but the results I got look really off. The QQ-plot of the p-values is far above the diagonal.
I'm pretty sure I followed the correct QC process, and the ...
3
votes
1
answer
763
views
I have applied many statistical tests to my data, but still cannot determine normality
I have run multiple tests to determine normality on my dataset, but I am unsure which one to adhere to, especially since my histograms, density plots, and QQ plots leave much to be desired in terms of ...
0
votes
0
answers
28
views
Distribution looks roughly normal on a q-q plot, but has a p-value of 0.0 for the Shapiro-Wilk normality test. How to interpret? [duplicate]
The distribution is as follows:
However the Shapiro-Wilk test yields a p-value of 0.0 and a W statistic of 0.9. There are over 7,000 values in the sample.
Note, the quantile values have been ...
2
votes
1
answer
505
views
Trouble selecting q-q plot settings with statsmodels. Do any of these plots properly compare the sample quantiles to theoretical normal quantiles?
I have an array of over 6,000 data points and am trying to show whether they follow a normal distribution. Statsmodels (the library I'm using to generate plots) gives the option of using a 45-degree ...
2
votes
2
answers
184
views
Distribution and variable analysis
I am doing a statistical test (program used is SPSS). On the basis of distribution and sample size, I have to chose the correct variable analysis. I also have to justify every decision. I have two ...
0
votes
0
answers
95
views
Does this need to be transformed? If, yes, how?
My data is collecting deposition of particles from the atmosphere once a month for 11 months at two sites. I am testing to see if my two sites' data are normally distributed so I can determine what T-...
3
votes
1
answer
535
views
Help me understand this qqplot
I have plotted the qqplot of the residuals that my model generates with
the python module statsmodel
sm.qqplot(data, line ='r') and it looks like this
The points are placed on a straight line but ...
0
votes
0
answers
75
views
How do I interpret this QQ plot?
I am calculating a multiple regression with a sample of 128 and I was wondering, what distribution would best describe this residuals qq plot? It seems like a a Poisson-distribution to me, is it ...
2
votes
1
answer
263
views
Q-Q plots and normality: Can I use ANOVA?
I want to use an ANOVA for my analysis (2x3 design). I can decide if I can safely use parametric tests. The two samples results: Shapiro-Wilk p<.001) and Q-Q plots don't seem to be normally ...
3
votes
1
answer
596
views
Evaluating goodness-of-fit for GARCH models in R with QQ-plots (rugarch package)
I'm currently working with multivariate GARCH representations of time-series for financial data using the rmgarch R package. This package in turn uses the well-...
5
votes
1
answer
4k
views
How to define the line to fit in Q-Q plot?
I'm trying to figure out if my data follows a normal distribution and if it contains outliers. I have plotted the histogram and now I would like to plot the quantile-quantile (Q-Q) plot. My point is, ...
0
votes
1
answer
135
views
Goodness-of-fit Tests
Continuing from my previous question here.
Furthermore, I intend to perform the chi-squared test and plot QQ-plots to test the hypothesis $H_0:\lambda=1$. I do not get to see the actual data though; I ...
4
votes
1
answer
312
views
Goodness-of-fit Tests
I wish to test whether a large number of observations $X_i$ follows an exponential distribution with parameter $\lambda=1$. I also wish to test this hypothesis exactly, and intend that if the ...
0
votes
0
answers
2k
views
Different Calculation Methods for Theoretical Quantiles of Q-Q Plot
There seem to be at least two different methods to calculate the theoretical quantiles in a Q-Q plot. In the following, the normal distribution is assumed to be the theoretical distribution.
Split ...
0
votes
1
answer
115
views
How can the author get the following conclusion from the QQ plot?
In this paper: https://www.tandfonline.com/doi/pdf/10.1080/02664763.2021.1940109, the authors have two actual datasets (e.g., 59 observations showing continuous annual flood data) and the authors want ...
0
votes
0
answers
108
views
Frequentists tests to check for normality
Let $X_1,...,X_n\sim X$ be $n$ i.i.d. random variables. I want to to test if they follow a normal distribution, in other words, check if their distribution belongs to the Gaussian family.
These are ...
1
vote
1
answer
1k
views
Interpreting QQ plot
Can we say that the assumption for linearity is met? I'm confused because the tails are heavy, and deviations have a bow-shaped pattern. Still, I think that the linearity has met because the majority ...
1
vote
1
answer
1k
views
Interpreting 2 residuals plots [closed]
Hello. Can anyone help me with interpreting these plots? I would like to know what assumptions of the linear model are not being met and what method should be used to fix the problems. I think there ...
5
votes
2
answers
4k
views
How to choose between ordered logit and ordered probit regression?
If the dependent variable is discrete ordinal, like 0-10 then an ordered logit or ordered probit is appropriate to use. They are both similar but their interpretation are different and their error is ...
0
votes
1
answer
128
views
Can we compare a standardized version of a variable with a standard normal distribution to check a variable's normality?
I am currently exploring ways to check the normality of a given variable in the dataset. Since most algorithms assume a variable's gaussian distribution, it is important to check it.
A Q-Q Plot
Came ...
2
votes
1
answer
154
views
Is there normality in my data? Which statistical test should I use?
I runned two GLMs using the same dependent and independent variables, but modelling each analysis according a different type of distribution. Then, I compared its AIC values to find what distribution ...
0
votes
1
answer
173
views
Does this plot indicate the data is normal distributed?
I use qqnorm to plot my data as the photo attached. Does this plot indicate the data is normal distributed?
0
votes
1
answer
52
views
Checking interaction between one dependent continuous variable and two independent continuous variable
I am trying to figure out if there is a way that we can perform some statistical test to check the interaction between two independent continuous variables and a dependent variable in R.
I have three ...
0
votes
0
answers
85
views
How do I interpret this plot?
I'm finding it hard to interpret this plot. Is it skewed, bimodal, or what is it? What do the points lying in the same line and rising suddenly mean? Is it exponential?
0
votes
0
answers
132
views
Theoretical q-q plot: What does the f-value mean in this example?
I am looking at this article on theoretical q-q plots and am trying to understand it in its entirety. The part where I get lost is when the author writes:
We first find the f-values for alto
What do ...
1
vote
1
answer
4k
views
What are the main difference between a QQ plot and a probability plot for measuring nomality? [duplicate]
I am trying to evaluate the normality of the distribution of my model's residuals.
I have been using statsmodels.api.qqplot and ...
2
votes
1
answer
616
views
Is my data normally distributed? (QQ plot and histogram analysis) [duplicate]
I am trying to create a regression model for prediction. I need to generate prediction/confidence intervals for my model.
I am trying to decide whether to use a quantile regression or linear ...
4
votes
1
answer
689
views
Calculation of quantiles with fitted parameters in Python
I am trying to make two-sample Q-Q plots in Python.
A Python function that is used for calculating quantiles has the option of fitting parameters for the calculation of quantiles. These parameters are ...
4
votes
1
answer
426
views
Is it possible to make a confidence envelope for a two sample Q-Q plot in R (or Python)? If so, what is the simplest method?
I want to show the confidence envelope for a two sample Q-Q plot in R (or Python). The aim is to use the Q-Q plot to give an indication of whether my two samples are drawn from the same population
The ...