Quartiles are a measure of dispersion used in statistics to divide the elements of a data set into four parts according to their magnitude. The lower quartile, or first quartile, is the middle number between the smallest number in a data set and its median. The second quartile, or median, is the middle number of the data set. The third quartile, or third quartile, is the middle number between the largest number in a data set and its median.
Quartiles are used to compare different sets of data to grade a student population, compare wages of different nationalities and professions, or evaluate the performance of investments. Quartiles are often used to plot box-and-whisker plots on a graph. This plot shows the distribution of data visually so that a reader can easily grasp the overall spread of the data.
The first quartile, or the lower quartile, is a number that divides the first 25 percent of the data set from the rest of the set. Similarly, the third quartile is a number that divides the top 75 percent of the data set from the remaining 25 percent. For example, if a data set contains 5 numbers (2, 7, 8, 10, 18), the first quartile or lower quartile will be the middle number between the minimum number (2) and its median (8) which is 7. Similary, the third quartile or upper quartile will be the middle number between the maximum number (18) and the median (8) which is 10.
The median of a data set is the middle number and it can be calculated by adding the two middle numbers of the data set and then dividing them by two. For example, if there are 8 numbers in a data set (2, 4, 6, 8, 10, 12, 14, 16), the median will be (10+12)/2=11.
Quartile deviation is a measure of spread used to measure the distance between the quartiles of a data set. It is calculated by subtracting the first quartile from the third quartile. For example, if a data set contains 5 numbers (1, 2, 4, 6, 8), the first quartile is 2, the third quartile is 6 and the quartile deviation is 6-2=4.
Quartiles can also be used to identify outliers in a data set. Outliers are values that are much higher or lower than the rest of the data set and can skew the data. The first quartile and third quartile can then be used to determine which values should be considered outliers. For example, if a data set contains the numbers 5, 7, 9, 11, 15, 17 and 20, the lower quartile is 9, the upper quartile is 15, and any numbers higher than 15 are considered outliers.
Overall, quartiles are an important measure of dispersion used to compare different data sets, grade populations, and identify outliers.