Questions tagged [boxplot]
A graphical display to summarize the distribution of a sample. It displays five numbers plus (possibly) some outliers: the median, the two hinges (approximate quartiles), and the largest and smallest values not marked as outliers.
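As a rough illustration of the five numbers described above, here is a minimal sketch using only the Python standard library. The function name `boxplot_stats`, the parameter `k`, and the sample data are hypothetical. The sketch assumes Tukey-style hinges (medians of the lower and upper halves) and the common 1.5 × spread fence rule; plotting libraries may use other quartile conventions.

```python
from statistics import median


def boxplot_stats(values, k=1.5):
    """Return the five boxplot numbers plus any flagged outliers.

    Assumptions (illustrative only): hinges are the medians of the lower
    and upper halves, and fences sit k * (hinge spread) beyond the hinges.
    """
    xs = sorted(values)
    n = len(xs)
    half = (n + 1) // 2  # the middle value joins both halves when n is odd
    lower_hinge = median(xs[:half])
    upper_hinge = median(xs[n - half:])
    spread = upper_hinge - lower_hinge
    low_fence = lower_hinge - k * spread
    high_fence = upper_hinge + k * spread
    # Whiskers end at the most extreme points that are still inside the fences.
    inside = [x for x in xs if low_fence <= x <= high_fence]
    return {
        "whisker_low": inside[0],
        "lower_hinge": lower_hinge,
        "median": median(xs),
        "upper_hinge": upper_hinge,
        "whisker_high": inside[-1],
        "outliers": [x for x in xs if x < low_fence or x > high_fence],
    }


stats = boxplot_stats([1, 2, 3, 4, 5, 6, 7, 8, 9, 100])
print(stats)
```

For this made-up sample the hinges are 3 and 8, the median is 5.5, the whiskers stop at 1 and 9, and 100 is marked as an outlier.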
267 questions
1
vote
0
answers
58
views
Covariances among the five box-plot statistics for a sample from a normal population
For $X_1,\ldots,X_n\sim\operatorname N(\mu,\sigma^2),$ how much is known about the $5\times5$ matrix of covariances among the box-plot summary statistics: the minimum, the three quartiles, and the ...
1
vote
1
answer
31
views
Bayesian Bernoulli model - getting marginal effects plots based on group, not overall dataset
I have a Bayesian model with a Bernoulli distribution as follows. The dataset is based on site visits (sites have different numbers of visits) with over 800 observations.
...
1
vote
0
answers
60
views
How Can I Match 0.75 Percentile Results From Database with Pandas?
I am making calculations for a box and whisker chart in my database and want to validate results against Pandas results. I can't figure out why the ...
7
votes
6
answers
1k
views
Does a boxplot assume interval data?
Does a boxplot assume interval data? If not, is it then fine to use a box plot to represent Likert-scale (ordinal) data?
0
votes
1
answer
59
views
How can I filter outliers in data that is manually recorded?
Different people have to write down values of a certain type of parameter in order to fill out a table, and people obviously tend to make mistakes, sometimes by a factor of 1000. This creates a lot of ...
3
votes
0
answers
28
views
How can you calculate Q1 and Q3 for even numbers? [duplicate]
I've searched for this question everywhere, but I've found different answers, and none of them gives the same result NumPy gave me.
I have the following data: [0, 1, 2, 3, 4, 4, 5, 5, 6, 8]
When using ...
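Disagreements like this usually come from the quartile convention rather than from an arithmetic error. The sketch below applies three common conventions to the data above using only the standard library: `statistics.quantiles` with `method="inclusive"` (linear interpolation between order statistics, which is also what NumPy's default linear method does), the same function with `method="exclusive"`, and medians of the two halves. Which call the question actually used is not shown in the excerpt, so this is only an illustration.

```python
from statistics import median, quantiles

data = [0, 1, 2, 3, 4, 4, 5, 5, 6, 8]

# Convention 1: interpolate on positions (n - 1) * p of the sorted data.
inclusive = quantiles(data, n=4, method="inclusive")

# Convention 2: interpolate on positions (n + 1) * p of the sorted data.
exclusive = quantiles(data, n=4, method="exclusive")

# Convention 3: medians of the lower and upper halves (Tukey-style hinges).
xs = sorted(data)
half = (len(xs) + 1) // 2
hinges = (median(xs[:half]), median(xs[len(xs) - half:]))

print("inclusive Q1, Q3:", inclusive[0], inclusive[2])
print("exclusive Q1, Q3:", exclusive[0], exclusive[2])
print("hinges    Q1, Q3:", hinges[0], hinges[1])
```

On this data the three conventions give Q1 = 2.25, 1.75, and 2, and Q3 = 5.0, 5.25, and 5, so none of them is "wrong"; they simply answer slightly different definitions.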
3
votes
2
answers
193
views
Intro to Stats- professor's mistake in an exercise on types of diagrams?
The Question
Parking at a university has become a problem. University administrators are interested in determining the average time it takes students to find a parking spot. An administrator ...
2
votes
2
answers
203
views
Reading Boxplots in Multiple Logistic Regression
Assuming a paradigm of logistic regression, I'm having some trouble understanding what particular model is suggested by some parallel boxplots. For example, here:
I'm told the model suggested is:
$$\...
2
votes
2
answers
96
views
Using boxplots of previous day’s percentage change in S&P index to predict today's return based on yesterday's movement
In the book "An Introduction to Statistical Learning with Applications in Python, Trevor Hastie et al., Springer", there's the following paragraph:
The left-hand panel of Figure 1.2 ...
1
vote
1
answer
127
views
Box plots of monthly averaged water flow
I am working with a model that models water flows in a certain area. These flows can be influenced by taking certain measures, resulting in multiple water management scenarios. I would like to compare ...
2
votes
1
answer
779
views
Seaborn Boxplot's quartiles
I'm trying to understand how to properly interpret seaborn-generated boxplots.
Consider the following code:
...
1
vote
1
answer
1k
views
Boxplot with a symmetric box, but with whiskers of different lengths [duplicate]
From Figure 2 of Ferreira et al. (2016) "Graphical representation of chemical periodicity of main elements through boxplot", we can see the taxonomy of some common cases of symmetrical and ...
0
votes
0
answers
44
views
How to visually check for homoscedasticity? [duplicate]
I want to know what to look for in a boxplot, when we want to check for homogeneity of variances among groups, which is an assumption in ANOVA.
I used this code to get a boxplot:
'''boxplot(log(...
1
vote
0
answers
57
views
Can there be no outliers as per calculation with IQR formula while the boxplot shows there are outliers in the dataset?
While plotting a box plot, the plot shows that the columns in the dataset have outliers, but when I try to calculate them with the IQR formula, it shows 0 outliers in the columns of the ...
1
vote
0
answers
107
views
BEST POSSIBLE WAY to determine significantly high values within zero-inflated univariate continuous distributions
I have more than 50 different distributions, corresponding to 50 different kinds of customers, who spend their money in a certain way within a period, the amount spent being the single variable of interest.
...
5
votes
1
answer
360
views
Plots to judge data that is a bit asymmetric
I have data that goes from negative to positive.
When plotted in a histogram it looks like this
The data is the "error".
If I have another set of data I want to learn how to judge which ...
2
votes
1
answer
101
views
Identical interquartile ranges for distributions with different means and numbers of data points
Consider these two distributions, representing ratings (in the range 0-9) given in an experiment for two different conditions A and B:
...
0
votes
1
answer
148
views
1-way ANOVA contradicts boxplot?
I'm trying to find out if there's a significant difference between relative abundance (response variable: rel.abund) and habitat (predictor variable with levels lagoon, bank, shelf: Habitat).
This is ...
1
vote
1
answer
760
views
Interpreting box plots with categorical variables
How do you explain a box plot with categorical variables on the x-axis? For example, I have these two box plots; how do you interpret the relative comparison of each category within the box plot?
Sample ...
0
votes
0
answers
140
views
How to find all the data on a box-and-whisker plot? Most importantly the mean
I am super stuck on this question. I looked up how to find the mean on a box-and-whisker plot, and never got a clear answer.
2
votes
1
answer
789
views
Boxplot or Scatterplot?
I created boxplots for 5 traits to show the spread of true age for each variant of each trait in both samples between 2 observers:
A professor suggested that I should use scatterplot of Price vs Kim ...
2
votes
1
answer
322
views
R Tukey Anova: Can non-overlapping boxplots share the same letter of significance in Anova / Tukey Test?
I conducted a one-way ANOVA followed by a Tukey test in RStudio and used a compact letter display to add letters of significance to a ggplot.
After a positive Grubbs-outlier-test I removed an outlier ...
1
vote
0
answers
105
views
Grouped Boxplots in R [closed]
I want to draw the same exact graph in R. However, I want to consider two options:
(1) with one x-axis for each of the genders &
(2) two different x-axes for each of the genders. Here is also the ...
0
votes
0
answers
36
views
Testing hypothesis that mean of grouped quantitative data is increasing in one direction
My predictor variable is quantitative, but falls into nine discrete groups (i.e., values are either 60, 125, 200 etc.). I have a responder variable that is quantitative and continuous. I have made a ...
3
votes
4
answers
2k
views
Are there any simpler alternatives to boxplots and barplots that can faithfully represent data with outliers and strongly varying ns?
I have a dataset of passwords that looks like this:
My goal is to make a graph that accurately showcases the strength of each category so my initial plan was to group the data by category and ...
5
votes
1
answer
368
views
Boxplot | 5-Number-Summary
I have a question regarding the boxplot. On some web pages, the Minimum and the Maximum of the 5-Number-Summary correspond to the whiskers. However, regarding this definition, my question is:
how is ...
2
votes
1
answer
465
views
Boxplots: valid method for visualizing collinearity?
I recently heard one can detect collinearity between a factor (Species) and a continuous covariate (TL) simply by making a boxplot. If the plots don't overlap, there is evidence of collinearity. I'm ...
1
vote
0
answers
170
views
T-test and linear regression are significant but a grouped boxplot shows similar medians
I ran a test for a difference in means: the t-test gave me a p-value of 0.034, and a regression gave me significant differences as well. But when I plot my data with a boxplot, I see that the medians are ...
6
votes
3
answers
5k
views
Why use bar chart with error whiskers instead of box plot?
Bar charts with error whiskers, like the one below (taken from What type of plot is it?), seem to be quite common in some communities.
I wonder, however, why is a bar chart used, and not a box plot? ...
3
votes
2
answers
689
views
What type of plot is it?
Here is a plot; I do not know what type of plot it is.
It is very similar to a box plot, but it does not show the first quartile. How can we find the mean and standard deviation using this plot?
UPDATE:
...
1
vote
1
answer
747
views
Boxplot whiskers don't match the calculated values [closed]
Here are my data points:
...
1
vote
1
answer
109
views
Boxplots t tests and Bonferroni correction
I'm comparing three groups: people that have had two infections, people that have had one infection and people that have had zero infections.
I'm showing them on a boxplot (regarding age for example) ...
8
votes
2
answers
893
views
Is a boxplot useful, when it doesn't even look like a box?
I want to visualize some data with a boxplot -- and I am wondering, if the boxplot is even the correct way for visualizing the data.
I want to compare three datasets (40 entries each) using boxplots. ...
1
vote
1
answer
957
views
Visualize distribution (box plots like) with many and extreme outliers?
I have values with extreme outliers and want to visualize that. But the box plot doesn't seem a good choice for my data as you can see here.
Most of the values are less than 50,000. But some of them are ...
24
votes
5
answers
8k
views
How should we do boxplots with small samples?
This question is inspired by this posting, plus a comment by @StephanKolassa and an answer by @dipetkov who point out that the boxplots presented in that question are misleading. As is pointed out, ...
2
votes
1
answer
1k
views
How to use box plots to detect outliers?
Suppose for simplicity that we have Gaussian distributed data with some outliers, whose typical characteristic is getting values that are far from the mean. Suppose my sample size is ...
1
vote
1
answer
331
views
What is the best plot I can do to understand true values vs errors in prediction in a regression problem?
I am facing a regression problem and I would like to understand how my errors are distributed along the true values I have. For the moment, I have done this scatterplot:
but I want a plot that directly ...
9
votes
4
answers
5k
views
Visualize a continuous variable against a binary variable
I want to visualize a continuous variable, BMI index, against a binary variable, heart disease.
I want to check by visualization ...
1
vote
1
answer
452
views
May the cumulative distribution function be used to calculate the inter-quartile range and vice versa?
May the cumulative distribution function be used to estimate the inter-quartile range?
I am drawing their similarities.
The IQR is found via a boxplot which has percentiles. A percentile comes from ...
2
votes
1
answer
105
views
IQR range in boxplot
I'm having a look at the Altair docs, and I don't understand why the last two groups (85 and 90), which have a very low IQR, have whiskers similar to those of the first group, where the IQR of the ...
1
vote
0
answers
327
views
How to use side-by-side boxplots to assess assumptions of a 2 sample test?
Here is the context:
So we can analyse this using a two-sample t-test. The assumptions of a t-test comparing the means of two independent samples are that the populations being compared should follow normal ...
0
votes
1
answer
135
views
Interpreting boxplot VS stacked histogram in example
I understand what a boxplot and histogram are supposed to show, but I'm a bit confused about how it is being presented in this graphic printout from R. The picture is taken from Beyond Multiple Linear ...
4
votes
1
answer
2k
views
Box Plot Upper Fence is greater than Max
Hi, is it normal for the upper fence to be greater than the max? If not, what might have gone wrong? I am using the empirical rule and calculating mean + 3 * IQR.
Thanks
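A fence above the maximum is what you would expect whenever there are no high outliers. The sketch below uses the common Q3 + 1.5 × IQR convention, which is an assumption here and not the formula quoted in the question, together with a made-up sample. It shows that the fence is only a cutoff, while the drawn whisker stops at the largest observation inside it.

```python
from statistics import quantiles

# Hypothetical sample with no extreme values.
sample = [1, 2, 3, 4, 5, 6, 7, 8, 9]

# Quartiles by linear interpolation between order statistics.
q1, _, q3 = quantiles(sample, n=4, method="inclusive")
iqr = q3 - q1

# The fence is a threshold for flagging outliers, not a data value.
upper_fence = q3 + 1.5 * iqr

# The whisker ends at the largest observation that is not beyond the fence.
upper_whisker = max(x for x in sample if x <= upper_fence)

print("upper fence:", upper_fence)
print("sample max: ", max(sample))
print("whisker end:", upper_whisker)
```

Here the fence is 13.0 while the maximum and the whisker end are both 9, so the fence exceeding the maximum simply means nothing is flagged on the high side.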
6
votes
1
answer
776
views
How can I test whether one empirical CDF is to the left or right of another?
I am currently working with a box plot (shown below) that consists of two boxes per value of one of the independent variables (call it $x$). The other independent variable is indicated by the two ...
3
votes
2
answers
1k
views
Significant p-value with overlapping boxplots
I'm confused about some of the results I got after plotting my data.
I have a data set that includes test scores and a binary group assignment of either polypharmacy or non-polypharmacy. The scores ...
0
votes
1
answer
580
views
How do I determine if differences between medians are statistically significant when notches are very close (see "C" and "D")?
Regarding "C" and "D":
Is there a way other than visually inspecting the notches from the boxplots? I know the range of values for "D" is lower than "C" and the ...
0
votes
0
answers
67
views
Graphic representation of averages
I have a series of averages. One of these, however, takes too high a value because it is based on a very low N, equal to 1.
Let me explain better by means of an example: I have to calculate the averages of the ...
0
votes
0
answers
115
views
Representing MAE and standard deviation for multiple models over multiple bins in python?
I have 21 regression models. I am plotting the MAE and standard deviation of the models using an error-bar plot in Python. However, if I bin the data (say 10 bins), then for each bin I can calculate MAE, ...
1
vote
1
answer
5k
views
mean + 95 % confidence interval versus boxplots
I am wondering about the use of boxplots versus the mean with a 95% confidence interval in a figure. A boxplot will give you the median, and you can add a notch to show the 95% CI for the median, so it is quick and easy to compare ...
2
votes
2
answers
2k
views
How can I see Kurtosis in a box plot?
How do I know whether a distribution is leptokurtic or platykurtic by only having the box plot?