Skip to main content

Questions tagged [qq-plot]

A Q–Q plot (or quantile quantile plot) is a scatterplot of the quantiles of two distributions. Q–Q plots are useful for comparing distributions.

Filter by
Sorted by
Tagged with
5 votes
1 answer
305 views

How do I choose a good model for this analysis? I'm going to describe the scenario below, and outline several options I have brainstormed. First, the scenario: I have data from 50 technicians that ...
ahach's user avatar
  • 169
13 votes
2 answers
956 views

A collaborator analyzed some data with a one-way ANOVA. But when I looked at the data, I had this residuals QQ Plot. It doesn't look very normal. But my collaborator went ahead with the ANOVA. I've ...
Nuria's user avatar
  • 231
0 votes
0 answers
90 views

I'm having trouble interpreting the diagnostic plots obtained from a gam modeled with family="scat". The data seem to adjust reasonably well to the 45 degree line, but the red reference line ...
Mar's user avatar
  • 1
3 votes
1 answer
396 views

In the context of QQ-plots I encountered two different definitions of the ECDF: The first definition is$$F(x)=\frac{1}{n}\sum_{i=1}^n1_{[X_i,\infty[}(x)$$ and the second definition is $$F(x)=\frac{1}{...
Filippo's user avatar
  • 283
3 votes
0 answers
155 views

I am reading the article "The Unit-improved second-degree Lindley distribution: inference and regression modeling" by Emrah Altun and Gauss M. Cordeiro. And I want to replicate one of their ...
daniel's user avatar
  • 311
4 votes
2 answers
491 views

I am currently working on a project that involves evaluating the distribution of several variables, and I am using Q-Q plots as part of the analysis. While I have generated the Q-Q plots for these ...
Javier Hernando's user avatar
4 votes
1 answer
196 views

I am new to statistics and am seeking guidance on analyzing the effects of earthworms on litter-derived carbon using R. I conducted an experiment to assess the impact of earthworm presence (with three ...
Janus Den Toonder's user avatar
1 vote
1 answer
114 views

I am new to applying the machine learning models. I have to find a correlation between 1 continuous dependent variable and 27 continuous independent variables. In the beginning, I was confused about ...
Manar's user avatar
  • 13
3 votes
2 answers
218 views

Conceptually, I am having a hard time as to why we consider the quantile-quantile plot for linear regression diagonistics, and I cannot seem to get a clear answer after searching extensively. The ...
LateGameLank's user avatar
0 votes
0 answers
71 views

I recently found myself answering a question on Stack Overflow about adjusting a dataset to a unknown distribution. Adding my two cents to the community, I have provided a script to draw a Q-Q Plot in ...
jlandercy's user avatar
  • 141
1 vote
0 answers
66 views

Let's say I'm going to do an analysis where my response variable has a gamma distribution. I perform the analysis pointing to the distribution in my model (eg. using the lme4 package, m1<-glmer(Y~...
Graciliano Santos's user avatar
1 vote
1 answer
166 views

I'm learning some basic EDA using the Boston housing price dataset and I want to filter out outliers in the feature columns. To do that I first wanted to understand what distribution each of my ...
joesan's user avatar
  • 173
0 votes
0 answers
137 views

I'm struggling with the idea of residuals and error terms in GLMs. I've gathered that there are no explicit error terms in GLMs because the distributions modelled don't allow the decomposition between ...
Boussens-Dumon Grégoire's user avatar
13 votes
6 answers
2k views

I've read dozens on post on the subject but I cannot figure this out. From what I've gathered, GLMS don't include an error term in their formulation unlike linear models (LM). I was wondering why (or ...
Boussens-Dumon Grégoire's user avatar
1 vote
1 answer
167 views

I have a model in R looking at infectious disease spread on social networks, and I am running into a problem where my data are clearly not normally-distributed when I try to run a linear regression ...
mxseabat's user avatar
1 vote
0 answers
343 views

I am new to r programming and have ran into an odd situation while plotting a QQ plot for studentised residuals with ggplot2. See code and plot below: ...
Yat-Hon's user avatar
  • 13
1 vote
1 answer
129 views

I have a gamm that looks to be heavy tailed according to the qqplot so I'd like to account for this. According to this page things like scaled t distributions for heavy tailed data are only available ...
adkane's user avatar
  • 1,061
10 votes
3 answers
2k views

I am aware that different statistical packages provide Q–Q plots using code or via a black box. For example, minitab with R integration for Q–Q plot from here. I am trying to do this manually via ...
Tryer's user avatar
  • 307
3 votes
2 answers
568 views

I have plotted a normal Q-Q Plot and a histogram to check the normality of this set of discrete data. My interpretation is the data are not normally distributed since they do not fall on the linear ...
user281667's user avatar
8 votes
4 answers
1k views

I stumbled upon something interesting while attempting to do a log transformation for some data (with zeros) today. It seems that there must be a good reason for this that I'm just not seeing. I'm ...
knrumsey's user avatar
  • 9,402
0 votes
1 answer
448 views

I'm trying to build an GLM regression (10k samples and 50 dimensions). I ran an analysis of the dependent variable since the regression has a normality assumption for the dependent variable. The QQ ...
cat's user avatar
  • 53
1 vote
0 answers
47 views

I'm trying to run GWAS pipeline using plink, but the results I got look really off. The QQ-plot of the p-values is far above the diagonal. I'm pretty sure I followed the correct QC process, and the ...
Celia L.'s user avatar
3 votes
1 answer
763 views

I have run multiple tests to determine normality on my dataset, but I am unsure which one to adhere to, especially since my histograms, density plots, and QQ plots leave much to be desired in terms of ...
Kimber's user avatar
  • 53
0 votes
0 answers
28 views

The distribution is as follows: However the Shapiro-Wilk test yields a p-value of 0.0 and a W statistic of 0.9. There are over 7,000 values in the sample. Note, the quantile values have been ...
NominalSystems's user avatar
2 votes
1 answer
505 views

I have an array of over 6,000 data points and am trying to show whether they follow a normal distribution. Statsmodels (the library I'm using to generate plots) gives the option of using a 45-degree ...
user395052's user avatar
2 votes
2 answers
184 views

I am doing a statistical test (program used is SPSS). On the basis of distribution and sample size, I have to chose the correct variable analysis. I also have to justify every decision. I have two ...
Chester's user avatar
  • 21
0 votes
0 answers
95 views

My data is collecting deposition of particles from the atmosphere once a month for 11 months at two sites. I am testing to see if my two sites' data are normally distributed so I can determine what T-...
NickW's user avatar
  • 1
3 votes
1 answer
535 views

I have plotted the qqplot of the residuals that my model generates with the python module statsmodel sm.qqplot(data, line ='r') and it looks like this The points are placed on a straight line but ...
Alucard's user avatar
  • 325
0 votes
0 answers
75 views

I am calculating a multiple regression with a sample of 128 and I was wondering, what distribution would best describe this residuals qq plot? It seems like a a Poisson-distribution to me, is it ...
Migle's user avatar
  • 1
2 votes
1 answer
263 views

I want to use an ANOVA for my analysis (2x3 design). I can decide if I can safely use parametric tests. The two samples results: Shapiro-Wilk p<.001) and Q-Q plots don't seem to be normally ...
Audere Semper's user avatar
3 votes
1 answer
596 views

I'm currently working with multivariate GARCH representations of time-series for financial data using the rmgarch R package. This package in turn uses the well-...
OJK's user avatar
  • 31
5 votes
1 answer
4k views

I'm trying to figure out if my data follows a normal distribution and if it contains outliers. I have plotted the histogram and now I would like to plot the quantile-quantile (Q-Q) plot. My point is, ...
JCV's user avatar
  • 153
0 votes
1 answer
135 views

Continuing from my previous question here. Furthermore, I intend to perform the chi-squared test and plot QQ-plots to test the hypothesis $H_0:\lambda=1$. I do not get to see the actual data though; I ...
pecer10012's user avatar
4 votes
1 answer
312 views

I wish to test whether a large number of observations $X_i$ follows an exponential distribution with parameter $\lambda=1$. I also wish to test this hypothesis exactly, and intend that if the ...
pecer10012's user avatar
0 votes
0 answers
2k views

There seem to be at least two different methods to calculate the theoretical quantiles in a Q-Q plot. In the following, the normal distribution is assumed to be the theoretical distribution. Split ...
keezar's user avatar
  • 35
0 votes
1 answer
115 views

In this paper: https://www.tandfonline.com/doi/pdf/10.1080/02664763.2021.1940109, the authors have two actual datasets (e.g., 59 observations showing continuous annual flood data) and the authors want ...
Hermi's user avatar
  • 757
0 votes
0 answers
108 views

Let $X_1,...,X_n\sim X$ be $n$ i.i.d. random variables. I want to to test if they follow a normal distribution, in other words, check if their distribution belongs to the Gaussian family. These are ...
pecer10012's user avatar
1 vote
1 answer
1k views

Can we say that the assumption for linearity is met? I'm confused because the tails are heavy, and deviations have a bow-shaped pattern. Still, I think that the linearity has met because the majority ...
Dan's user avatar
  • 11
1 vote
1 answer
1k views

Hello. Can anyone help me with interpreting these plots? I would like to know what assumptions of the linear model are not being met and what method should be used to fix the problems. I think there ...
Mdddl's user avatar
  • 11
5 votes
2 answers
4k views

If the dependent variable is discrete ordinal, like 0-10 then an ordered logit or ordered probit is appropriate to use. They are both similar but their interpretation are different and their error is ...
rr19's user avatar
  • 105
0 votes
1 answer
128 views

I am currently exploring ways to check the normality of a given variable in the dataset. Since most algorithms assume a variable's gaussian distribution, it is important to check it. A Q-Q Plot Came ...
badc0ffee's user avatar
  • 134
2 votes
1 answer
154 views

I runned two GLMs using the same dependent and independent variables, but modelling each analysis according a different type of distribution. Then, I compared its AIC values to find what distribution ...
user avatar
0 votes
1 answer
173 views

I use qqnorm to plot my data as the photo attached. Does this plot indicate the data is normal distributed?
lily zhu's user avatar
0 votes
1 answer
52 views

I am trying to figure out if there is a way that we can perform some statistical test to check the interaction between two independent continuous variables and a dependent variable in R. I have three ...
Ranji Raj Nair's user avatar
0 votes
0 answers
85 views

I'm finding it hard to interpret this plot. Is it skewed, bimodal, or what is it? What do the points lying in the same line and rising suddenly mean? Is it exponential?
Amreesh Karthikeyan's user avatar
0 votes
0 answers
132 views

I am looking at this article on theoretical q-q plots and am trying to understand it in its entirety. The part where I get lost is when the author writes: We first find the f-values for alto What do ...
willpkay's user avatar
1 vote
1 answer
4k views

I am trying to evaluate the normality of the distribution of my model's residuals. I have been using statsmodels.api.qqplot and ...
Archie's user avatar
  • 215
2 votes
1 answer
616 views

I am trying to create a regression model for prediction. I need to generate prediction/confidence intervals for my model. I am trying to decide whether to use a quantile regression or linear ...
Archie's user avatar
  • 215
4 votes
1 answer
689 views

I am trying to make two-sample Q-Q plots in Python. A Python function that is used for calculating quantiles has the option of fitting parameters for the calculation of quantiles. These parameters are ...
weakboneman's user avatar
4 votes
1 answer
426 views

I want to show the confidence envelope for a two sample Q-Q plot in R (or Python). The aim is to use the Q-Q plot to give an indication of whether my two samples are drawn from the same population The ...
weakboneman's user avatar

1
2 3 4 5
7