Find top 1-on-1 online tutors for Coding, Math, Science, AP and 50+ subjects
Tutoring
Tutors by Subject
Computer Science
Math
AP (Advanced Placement)
Courses
Coding Classes for Kids
Robotics Classes for Kids
Design Classes for Kids
Resources
AP (Advanced Placement)
Calculators
Length Calculators
Weight Calculators
Tools
Tutorials
Scratch Tutorial
Learn
Math Tutorials
AP Statistics Tutorials
Python Tutorials
Blog
Central tendencies are a measure of centralizing values in a data set, i.e., they measure some sort of average value of the observations in a survey or an experiment. There are three most basic measures of central tendencies, Mean, Median, and Mode. In this article, we are going to learn about Mean and Median, how to find them, their properties, and their applications.
Want to learn AP Statistics from experts? Explore Wiingy’s Online AP Statistics tutoring services to learn from top mathematicians and experts.
Data is a collection of observation values in a survey or an experiment. In a data set usually, the frequency of the observations tends to have higher density towards the center, i.e., the frequency-observation graph peaks near the center. Thus, we can assume the values near the center tend to give a lot of information about the data, and thus we define central tendencies.
Central Tendencies are quantities about a data set that tell a certain central value of the data. Central Tendencies may or may not actually belong to the data set.
Mean is defined as the average value of the observations in a data set, i.e., it represents the center of the data set values by the average of all the data points. There are 3 different types of mean in mathematics, Arithmetic mean, Geometric mean, and Harmonic mean, each corresponding to a different relation between the data points.
2 data points and their arithmetic mean form an arithmetic sequence, i.e., the difference between two consecutive terms have a constant difference, similarly geometric mean forms a geometric sequence, and harmonic mean forms a harmonic sequence. For the sake of simplicity and practicality, we mainly focus on the arithmetic mean in statistics.
The arithmetic mean is defined as the average value. It is calculated by dividing the sum of all the observations by the total number of observations.
The median is the middle term of all the observations arranged in ascending or descending order.
To find the mean of discrete data, i.e., ungrouped data, we simply add all the observations and divide that sum by the number of observations, thus the formula for the mean of ungrouped data is given by,
To find the mean of grouped data, we have two different types of grouping of data.
Where, ’s are the observations, and
’s are their respective frequencies.
The formula for mean using the direct method is,
Where, ’s are the class marks, and
’s are their respective frequencies.
The formula for the Assumed mean method is
Where, ’s are the differences between class marks and assumed mean
, and
’s are their respective frequencies.
The formula for the Step Deviation method is,
Where, ’s are the ratios of differences between class marks and assumed mean
with respect to the class height
, and
’s are their respective frequencies.
To find the median of ungrouped data we arrange the data in a monotone, i.e., either ascending or descending order, then we select the middlemost value from the arranged data. If the number of observations is odd then we simply pick out the middlemost data point. But if the number of observations is even then we have two middlemost data points, then we find the arithmetic mean of those two points.
To find the median of grouped data we have two methods, i.e., numerical and geometrical. Both methods have the same basis known as the cumulative frequency method.
We can make the cumulative frequency table in two ways, i.e., the ‘More than’ table and the ‘Less than’ table, median found numerically from both the tables is the same and we have a simple formula for finding the median.
Graphically More than cumulative frequency table results in a More than Ogive and the Less than the table in Less than Ogive, the value of the data at the intersection of both ogives gives the median.
Where, is the lower limit,
is the frequency of the median class and
is the cumulative frequency of the class preceding the median class and
is the total number of observations or simply the total frequency and
is the height of the class intervals. The median class is selected by choosing the class that has a cumulative frequency of just more than the value
.
Mean | Median |
Mean is the average value of all the data points, the mean of a data set may or may not actually belong to the data set. | Median is the middle point of a data set. The median of a data set always belongs to the data set. |
The Mean of a data set depends on all the data points. | Median of a data set does not actually depend on all the data points, just the center points. If all the right half is increased and all of the left half is decreased, the median remains the same. |
The Mean of a data set is the most sensitive central tendency out of the three. Mean of the data set changes if any value of the data set is changed. | Median of a data set is not too sensitive to the change in data points. The median of a data set does not necessarily change when any one of the data points is changed. |
Mean is the preferred central tendency when the data is distributed normally. | Median is the preferred central tendency when the data is formed by skewed distribution. |
Example 1: Find the mean and median for the following data set.
Solution:
The mean of the given data set is given by
And since the number of observations is even, the median is the arithmetic mean of the middlemost term in the arranged data set given by
Thus, median is given by
Example 2: Find the mean of the following grouped data by all three methods and verify your answers. Also, find the median using More than cumulative frequency distribution.
Class Intervals | 0-10 | 10-20 | 20-30 | 30-40 | 40-50 |
Frequency | 6 | 10 | 14 | 12 | 8 |
Solution:
To find the mean we will construct the following distribution table
Class Intervals | Class Marks![]() | Frequency ![]() | ![]() | ![]() | ![]() | ![]() | ![]() |
0-10 | 5 | 6 | 30 | -20 | -120 | -2 | -12 |
10-20 | 15 | 10 | 150 | -10 | -100 | -1 | -10 |
20-30 | 25![]() | 14 | 350 | 0 | 0 | 0 | 0 |
30-40 | 35 | 12 | 420 | 10 | 120 | 1 | 12 |
40-50 | 45 | 8 | 360 | 20 | 160 | 2 | 16 |
![]() | ![]() | ![]() | ![]() |
Now, we can find the mean of this data
Direct method,
Assumed mean method,
Step deviation method
Hence, the mean of the given grouped data is 26.2, and we have verified the answer using all three methods.
Now for median using more than cumulative frequency distribution we have the following table,
Class Intervals | Frequency | More Than | Cumulative Frequency |
0-10 | 6 | More than 0 | 50 |
10-20 | 10 | More than 10 | 44 |
20-30(Median Class) | 14(![]() | More than 20 | 34 |
30-40 | 12 | More than 30 | 20(![]() |
40-50 | 8 | More than 40 | 8 |
![]() |
In the given table, we have the following values,
And the median is given by,
Substituting the values we have,
Hence, the median of the given grouped data is 23.57.
In this article, we have learned about Mean and Median as central tendencies. Mean is the average value of all the observations of all the data points, whereas the median is the middlemost data point.
When the data is formed by the normal distribution, we prefer to use the mean as the central tendency as in a normal distribution the variance is optimum, whereas if the data is formed by skewed distribution, we prefer the median as the central tendency since the data is weighted to one of the sides, and hence, the mean is not near the actual center.
Want to learn AP Statistics from experts? Explore Wiingy’s Online AP Statistics tutoring services to learn from top mathematicians and experts.
Pham-Gia, T., & Hung, T. L. (2001). The mean and median absolute deviations. Mathematical and computer Modelling, 34(7-8), 921-936.
Lewis, J. R. (1993). Multipoint scales: Mean and median differences and observed significance levels. International Journal of Human‐Computer Interaction, 5(4), 383-392.