Skip to main content

Questions tagged [boxplot]

A graphical display to summarize the distribution of a sample. It displays five numbers plus (possibly) some outliers - those five points being the median, hinges (approximate quartiles), and the largest and smallest value not counting any points marked as outliers.

Filter by
Sorted by
Tagged with
1 vote
0 answers
58 views

For $X_1,\ldots,X_n\sim\operatorname N(\mu,\sigma^2),$ how much is known about the $5\times5$ matrix of covariances among the box-plot summary statistics: the minimum, the three quartiles, and the ...
Michael Hardy's user avatar
1 vote
1 answer
31 views

I have a Bayesian model with a Bernoulli distribution as follows. The dataset is based on site visits (sites have a different n visits) with over 800 observations. ...
bluerabbit 's user avatar
1 vote
0 answers
60 views

I am making calculations for a box and whisker chart in my database and want to validate results against Pandas results. I can't figure out why the ...
Python_Learner's user avatar
7 votes
6 answers
1k views

Does a boxplot assume interval data? If not, is it then fine to use a box plot to represent Likert-scale (ordinal) data?
Ronald's user avatar
  • 105
0 votes
1 answer
59 views

Different people have to write down values on a certain type of parameter in order to fill out a table, and people obviously tend to write wrong. Sometimes, by a factor of 1000. This creates a lot of ...
Huragok's user avatar
3 votes
0 answers
28 views

I've searched this question everywhere, but I've found different answers and none of them result in the same answer Numpy gave me. I have the following data:[0, 1, 2, 3, 4, 4, 5, 5, 6, 8] When using ...
trder's user avatar
  • 700
3 votes
2 answers
193 views

The Question Parking at a university has become a problem. University administrators are interested in determining the average time it takes students to find a parking spot. An administrator ...
nickalh's user avatar
  • 133
2 votes
2 answers
203 views

Assuming a paradigm of logistic regression, I'm having some trouble understanding what particular model is suggested by some parallel boxplots. For example, here: I'm told the model suggested is: $$\...
Disc03's user avatar
  • 21
2 votes
2 answers
96 views

In the book "An Introduction to Statistical Learning with Applications in Python, Trevor Hastie et al., Springer", there's the following paragraph: The left-hand panel of Figure 1.2 ...
Tran Khanh's user avatar
1 vote
1 answer
127 views

I am working with a model that models water flows in a certain area. These flows can be influenced by taking certain measures, resulting in multiple water management scenarios. I would like to compare ...
Nathan's user avatar
  • 11
2 votes
1 answer
779 views

I'm trying to understand how to properly interpret seaborn-generated boxplots. Consider the following code: ...
Evan Aad's user avatar
  • 1,453
1 vote
1 answer
1k views

From Figure 2 of Ferreira et al. (2016) "Graphical representation of chemical periodicity of main elements through boxplot", we can see the taxonomy of some common cases of symmetrical and ...
Ommo's user avatar
  • 454
0 votes
0 answers
44 views

I want to know what to look for in a boxplot, when we want to check for homogeneity of variances among groups, which is an assumption in ANOVA. I used this codes to get a boxplot: '''boxplot(log(...
scholar101's user avatar
1 vote
0 answers
57 views

While plotting a box plot, the plot is showing the columns in the dataset has outliers, but while trying to calculate it by IQR formula, it is showing there are 0 outliers in the columns of the ...
Taniya Pal's user avatar
1 vote
0 answers
107 views

I have more than 50 different distributions, corresponding to 50 different kind of customers, who spend their money in a certain way within a period, being this amount the single variable of interest. ...
0xGolovkin's user avatar
5 votes
1 answer
360 views

I have data that goes from negative to positive. When plotted in an histogram it looks like this The data is the "error". If I have another set of data I want to learn how to judge which ...
KansaiRobot's user avatar
2 votes
1 answer
101 views

Consider these two distributions, representing ratings (in the range 0-9) given in an experiment for two different conditions A and B: ...
z8080's user avatar
  • 2,372
0 votes
1 answer
148 views

I'm trying to find out if there's a significant difference between relative abundance (response variable: rel.abund) and habitat (predictor variable with levels lagoon, bank, shelf: Habitat). This is ...
FlyingDutch's user avatar
1 vote
1 answer
760 views

How do you explain a box plot with categorical variables on the x-axis? For example, I have these two box plots, how do you interpret relative comparison of each category within the box plot? Sample ...
kms's user avatar
  • 590
0 votes
0 answers
140 views

I am super stuck on the question. I looked up on how to find the mean on a Box-and-whisker plot, and never got a clear answer.
brogan brown's user avatar
2 votes
1 answer
789 views

I created boxplots for 5 traits to show the spread of true age for each variant of each trait in both samples between 2 observers: A professor suggested that I should use scatterplot of Price vs Kim ...
user34930's user avatar
  • 121
2 votes
1 answer
322 views

I conducted a one way anova followed by a tukey-test in Rstudio and used a compact letter display to add letters of significance to a ggplot. After a positive Grubbs-outlier-test I removed an outlier ...
runald's user avatar
  • 21
1 vote
0 answers
105 views

I want to draw the same exact graph in R. However, I want to consider two options: (1) with one x axis for each of the genders & (2) two different xaxes for each of the gender. Here is also the ...
MK25's user avatar
  • 31
0 votes
0 answers
36 views

My predictor variable is quantitative, but falls into nine discrete groups (i.e., values are either 60, 125, 200 etc.). I have a responder variable that is quantitative and continuous. I have made a ...
Rosie's user avatar
  • 1
3 votes
4 answers
2k views

I have a dataset of passwords that looks like this: My goal is to make a graph that accurately showcases the strength of each category so my initial plan was to group the data by category and ...
wageeh's user avatar
  • 241
5 votes
1 answer
368 views

I have a question regarding the boxplot. On some web pages, the Minimum and the Maximum of the 5-Number-Summary correspond to the whiskers. However, regarding this definition, my question is: how is ...
Made's user avatar
  • 121
2 votes
1 answer
465 views

I recently heard one can detect collinearity between a factor (Species) and a continuous covariate (TL) simply by making a boxplot. If the plots don't overlap, there is evidence of collinearity. I'm ...
Nate's user avatar
  • 2,537
1 vote
0 answers
170 views

I did a mean differences test, the t-test gave me a p-value of 0.034 and a regression test gave me significant differences as well. But when I plot my data with a boxplot, I see that the medians are ...
Gerald Vasquez Aleman's user avatar
6 votes
3 answers
5k views

Bar charts with error whiskers, like the one below (taken from What type of plot is it?), seem to be quite common in some communities. I wonder, however, why is a bar chart used, and not a box plot? ...
Igor F.'s user avatar
  • 10.3k
3 votes
2 answers
689 views

Here is the plot ,which I do not know what type of plot it is. It is so similar to box plot, but it does not have first quartile. How we can find mean and standard deviation using this plot? UPDATE: ...
mohammad rezza's user avatar
1 vote
1 answer
747 views

Here are my data points: ...
Innuendo's user avatar
1 vote
1 answer
109 views

I'm comparing three groups: people that have had two infections, people that have had one infection and people that have had zero infections. I'm showing them on a boxplot (regarding age for example) ...
FluidMechanics Potential Flows's user avatar
8 votes
2 answers
893 views

I want to visualize some data with a boxplot -- and I am wondering, if the boxplot is even the correct way for visualizing the data. I want to compare three datasets (40 entries each) using boxplots. ...
Andre's user avatar
  • 183
1 vote
1 answer
957 views

I have values with extreme outliers and want to visualize that. But the box plot doesn't seem a good choice for my data as you can see here. Most of the values are less than 50,000. But some them are ...
buhtz's user avatar
  • 282
24 votes
5 answers
8k views

This question is inspired by this posting, plus a comment by @StephanKolassa and an answer by @dipetkov who point out that the boxplots presented in that question are misleading. As is pointed out, ...
Russ Lenth's user avatar
  • 22.2k
2 votes
1 answer
1k views

Suppose for simplicity that we have Gaussian distributed data with some outliers, whose typical characteristic is getting values that are far from the mean. Suppose my sample size is ...
Thomas's user avatar
  • 1,137
1 vote
1 answer
331 views

I am facing a regression problem and I would like to understand how my errors distrubuite along the true values I have. For the moment, I have done this scatterplot: but I want a plot that directly ...
Luigi D'Amico's user avatar
9 votes
4 answers
5k views

I want to visualize a continuous variable, BMI index, against a binary variable, heart disease. I want to check by visualization ...
PythonNoob's user avatar
1 vote
1 answer
452 views

May the cumulative distribution function be used to estimate the inter-quartile range? I am drawing their similarities. The IQR is found via a boxplot which has percentiles. A percentile comes from ...
user avatar
2 votes
1 answer
105 views

I'm having a look at altair docs, and I don't understand why, in the last two groups (85 and 90), that have a very low IQR, they have similar whiskers than in the first group, where the IQR of the ...
David Masip's user avatar
1 vote
0 answers
327 views

Here is the context: So we can analyse this using a 2 sample t tests. The assumptions of t-test comparing the means of two independent samples are populations being compared should follow normal ...
CountDOOKU's user avatar
0 votes
1 answer
135 views

I understand what a boxplot and histogram are supposed to show, but I'm a bit confused on how it is being presented in this graphic printout from R. The picture is taken from Beyond Multiple Linear ...
D.C. the III's user avatar
4 votes
1 answer
2k views

Hi is it normal for the upper fence to be greater than max? If not, what might have gone wrong? I am using Empirical Rule and doing calculation mean +3 * IQR Thanks
Nathan's user avatar
  • 43
6 votes
1 answer
776 views

I am currently working with a box plot (shown below) that consists of two boxes per value of one of the independent variables (call it $x$). The other independent variable is indicated by the two ...
Mahmoud's user avatar
  • 5,345
3 votes
2 answers
1k views

I'm confused about some of the results I got after plotting my data. I have a data set that includes tests scores and a binary group assignment of either polypharmacy or non-polypharmacy. the scores ...
Pharma's user avatar
  • 31
0 votes
1 answer
580 views

Regarding "C" and "D": Is there a way other than visually inspecting the notches from the boxplots? I know the range of values for "D" is lower than "C" and the ...
sumthymes's user avatar
0 votes
0 answers
67 views

I have a series of averages. One of these however turns out to take too high a value due to a N. low equal to 1. Let me explain better by means of an example: I have to calculate the averages of the ...
Laura Santangelo's user avatar
0 votes
0 answers
115 views

I have 21 regression models. I am plotting MAE and Standard deviation of models using a errorbar plot in python. However, if I bin the data (say 10 bins), and for each bin now I can calculate MAE, ...
srinivas's user avatar
  • 101
1 vote
1 answer
5k views

I am wondering about the use of boxplots versus mean with 95% in a figure? A boxplot will give you median and you can add a notch to show the 95 % CI for the median so it is quick and easy to compare ...
Siri's user avatar
  • 11
2 votes
2 answers
2k views

How do I know whether a distribution is leptokurtic or platykurtic by only having the box plot?
StatisticsNoobie's user avatar

1
2 3 4 5 6