Mean Formula: Understanding the Basics of Central Tendency

Mean Formula: Understanding the Basics of Central Tendency

7 mins read324 Views Comment
Updated on May 22, 2024 13:36 IST

In this article, learn how to calculate the mean using the mean formula. The mean is a measure of central tendency that is widely used in statistics to describe the average value of a set of numbers. Discover the steps involved in computing the mean, as well as its strengths and limitations as a statistical measure.

2023_04_Feature-Image-Templates-11.jpg

Let us consider you are a teacher, and you have to find the average height or average score of all the students in a class, or you are a sales manager, and you have to find the average sales by all the employees for the last year. Then how will you calculate it?

Don’t worry; in statistics, there is a Mean statistics formula that will help you to get the answer to your above queries. It can also help you to make decisions or predictions.

This article will explore the concept of mean statistics, formulas to find the mean, its different types, calculation methods, some real-life applications, and its limitations.

What is a Mean?

The mean in statistics refers to the measure of central tendency representing the average value of a numerical data set. It is a useful tool to summarize (or describe) a dataset and to detect patterns( or trends) in the data. 

Apart from summarizing and detecting patterns, it is used in finance, economics, and healthcare for decision-making.

It is defined as the ratio of the sum of all observations to the total number of observations.

Must Read: What is Maths for Data Science? A Beginner Guide to learn Maths for Data Science in 2023

Statistics Interview Questions for Data Scientists
Statistics Interview Questions for Data Scientists
In this article, Statistics Interview Questions for Data Scientists are listed. It starts with defining Statistics and ends with describing Empirical Rule.
Measures of Dispersion: Range, IQR, Variance, Standard Deviation
Measures of Dispersion: Range, IQR, Variance, Standard Deviation
To describe the data, a measure of the central tendency is not just enough as it only gives information about the central values of the dataset. 
Introduction to Inferential Statistics
Introduction to Inferential Statistics
The branch of mathematics that deals with the collection, analysis, prediction, and presentation of numerical data is known as Statistics.

Mean Formula

Mean = Sum of all Observations / Total Number of Data Points (Observations)

Let’s take a simple example to calculate the mean of the data:

Example-1: Let the score of five students in a class is 10, 15, 20, 25, and 30. Find the mean score of the class.

Solution: 

Sum of all Observations = 10 + 15 + 20 + 25 + 30 = 100

Total Number of Observations = 5

=> Mean = Sum of all Observations / Number of Observations = 100 / 5 = 20

Hence, the Mean Score = 20

Whenever the mean is referred to in the context of statistics, it always means Arithmetic Mean. But two more types of mean exist, i.e., Geometric and Harmonic Mean.

Difference Between Covariance and Correlation
Difference Between Covariance and Correlation
Looking to understand the difference between covariance and correlation? This article breaks down the key differences between the two statistical measures, including their definitions, range of values, units, sensitivity to...read more
Standard Error vs. Standard Deviation
Standard Error vs. Standard Deviation
Standard Error quantifies the variability between samples drawn from the same population, whereas standard deviation quantifies the variability of values in a dataset. In this article, we will discuss Standard...read more
Difference between Median and Average
Difference between Median and Average
Average and median are two basic terms that are used in statistics very often. Median is the middle value in a set, whereas average is an arithmetic mean of set...read more

The next section will explore different types of mean in statistics.

Recommended online courses

Best-suited Statistics for Data Science courses for you

Learn Statistics for Data Science with these high-rated online courses

Free
12 weeks
– / –
12 weeks
– / –
2 months
Free
12 weeks
– / –
10 days
Free
23 hours
Free
8 hours
Free
3 hours
– / –
10 days

Types of Mean

There are 3 different types of Mean in Statistics: Arithmetic Mean, Geometric Mean, and Harmonic Mean.

We have already discussed arithmetic mean (the above formula and definition of mean), so now, we will explain only geometric and Harmonic Mean.

What is Geometric Mean?

A statistical measure which is used to find the central tendency of a set of values. Mainly it is useful for calculating the average rate of change. It can also provide a more accurate representation of central tendency when the data is skewed or has outliers.

Dissimilar to the arithmetic mean, it is calculated by multiplying all the data points (n-value) and then taking the n-th root of it.

Geometric Mean Formula

If x1, x2 , x3 , …, xn are n-data points, then

G.M = ( x1 * x2 * x3 * …. * xn )1/n

What is Harmonic Mean?

Similar to the arithmetic mean, it is a type of average that is defined as the reciprocal of the average of the reciprocal of the data values.

Confused!!

Let’s get the mathematical formula that will help you out.

Harmonic Mean Formula

If x1, x2 , x3 , …, xn are n-data points, then

HM = n / ( (1/x1) + (1/ x2 ) + (1/ x3 ) + …. + (1/ xn ))

Now, let’s take an example to find the harmonic mean value.

Example: Find the Harmonic Mean for the data: 2, 4, and 8.

Solution: First, take the reciprocal of all three data points and add them, i.e.

1/2 + 1/4 + 1/8 =  (4 + 2 + 1)/8 = ⅞

=> HM = 3 / (⅞) = 3 * (8/7) = 24/7 = 3.42

Hence, the Harmonic Mean for the data: 2, 4, and 8 is 3.42.

Difference between Median and Average
Difference between Median and Average
Average and median are two basic terms that are used in statistics very often. Median is the middle value in a set, whereas average is an arithmetic mean of set...read more
Nominal vs. Ordinal
Nominal vs. Ordinal
There are four levels of measurements: Nominal, Ordinal, Interval and Ratio. Nominal and Ordinal are qualitative data, whereas Interval and Ratio are quantitative data. In this article, we will discuss...read more
Difference between Variance and Standard Deviation
Difference between Variance and Standard Deviation
Variance and standard deviation are statistical measures of data dispersion. Variance quantifies the average squared deviation from the mean, while standard deviation is the square root of variance, providing a...read more

Relation Between Arithmetic Mean, Geometric Mean, and Harmonic Mean

Harmonic Mean <= Geometric Mean <= Arithmetic Mean

How to Calculate the Mean

To make it simple and understandable, we divide the data values (or dataset) into Ungrouped and Grouped data.

Mean for Ungrouped Data

Ungrouped data is raw data that is not organized or sorted into categories or intervals.

Example:

  • Height of student in a class: 150cm, 175cm, 130cm, 145cm, 160cm.
  • Marks of students: 10, 14, 15, 13, 12, 8, 7, 10.

To calculate the mean of ungrouped data, you have to follow three simple steps:

  • Add all data values
  • Count the number of data values
  • Divide the sum of data values by the number of data values.

It is as simple as you have done in the very first example.

Isn’t it??

Now, it’s time to take an example to understand better how to calculate the mean of ungrouped data.

Example: Calculate the mean score of 10 students: 

If the marks of 10 students in a math exam are: 85, 78, 92, 89, 93, 87, 81, 90, 95, 88.

Then, calculate the mean score.

Solution: 

Sum of Data Values = 85 + 78 + 92 + 89 + 93 + 87 + 81 + 90 + 95 + 88 = 878

Number of Data Values = 10

Mean = Sum of Data Values / Number of Data Values = 878/10 = 87.8

Hence, the mean score of the 10 students in the mathematics exam is 87.8.

Difference between Correlation and Regression
Difference between Correlation and Regression
Correlation measures the degree of relationship between two variables while regression is about how one variable affects the other. In this article, we will briefly discuss the difference between correlation...read more
Correlation vs Causation
Correlation vs Causation
Correlation and causation are one of the most important but confusing topics of statistics. Correlation gives the relationship between two variables, whereas causation means one event is cause due to...read more
Difference between Accuracy and Precision
Difference between Accuracy and Precision
Precision refers to the closeness of multiple reading of the same quantity, whereas accuracy refers to the measured value to the true value. In this article we will discuss difference...read more

Mean for Grouped Data

Grouped Data is a set of data values that have been grouped into categories or intervals.

Example: 

  • Sales in Each Quarter
  • The number of hours workers work per week can be grouped into intervals of 0-8 hours, 8-12 hours, and 12-16 hours.

To calculate the mean of the grouped data, you have to follow these simple steps:

  • If the data values (xi) are given in intervals, find the midpoint ((upper-class limit + lower-class limit)/2).
  • Multiply data values (or mid-points) by their corresponding frequency (fi), i.e., xi*fi
  • Add up all products from step-2, i.e., x1*f1 + x2*f2 + … + xn*fn.
  • The mean is calculated by dividing the above result by the number of observations (f1 + f2 + f3 + … + fn). 

Now, let’s take an example to understand better how to calculate the mean of grouped data.

Example: Consider the below data on the number of hours that students study per week.
Class Interval 0 – 4 4 – 8 8 – 12 12 – 16
Frequency 10 20 15 5

Find the mean.

Solution:

Class Interval Hour (xi) Frequency (fi) xi * fi
0 – 4 (4 + 0) / 2 = 2 10 2 * 10 = 20
4 – 8 (8 + 4) / 2 = 6  20 6 * 20 = 120
8 – 12 (8 + 12) / 2 = 10 15 10 * 15 = 150
12 – 16 (12 + 16) / 2 = 14 5 14 * 5 = 70

=> ∑xi* fix1*f1 + x2*f2 + x3*f3+ x4*f4 = 20 + 120 + 150 + 70 = 360 

and, ∑fi =  f1 + f2 + f3 + f4 =  10 + 20 + 15 + 5 = 50

=>  Mean = ∑xi* fi / ∑fi = 360 / 50 = 7.2

Hence, the mean number of hours students’ study per week is 7.2.

Real-life Application of Mean

Mean is used in different domains such as Finance, Medicine, Quality Control, and Sports to calculate average return on investment, to describe the average value of a group of measurements, to determine the average performance of a manufacturing process, and to calculate the average performance of the team or an individual player respectively.

Limitations of Mean

  • Influenced by Sample Size: Mean may not be a good estimate of the population mean if the sample size is small.
  • Sensitive to Outliers: Mean is extremely sensitive to outliers. Even a single outlier can significantly affect the value of the mean.
  • Inappropriate by Ordinal Data: Mean is inappropriate for the data where the values have a natural order but not numerical values. 
  • Limited Usefulness for Skewed Data: The mean is not a robust measure of central tendency for skewed distributions. If the data is highly skewed, the mean may not accurately reflect the typical value of the data.

Conclusion

In conclusion, the mean (arithmetic mean) formula is one of the fundamental tools in statistics, which is used to calculate the average value of the data points. While the mean is a useful measure of central tendency, it has limitations too. As it is highly vulnerable to outliers and skewed distributions. Therefore, it is essential to understand the importance of the mean formula.
Hope you will like the article.
Happy Learning!!

 

Understand the Basics of P-value
Understand the Basics of P-value
Discover what p-value means in statistics, why it’s important for scientific research, and how to interpret its results correctly. Learn how to avoid common misconceptions and pitfalls and improve your...read more
z-test : Definition and Example
z-test : Definition and Example
z-test is a statistical method for the comparison of mean in a sample from the normally distributed population or between two independent samples. In this article we will briefly discuss...read more
t-test : Definition and Example
t-test : Definition and Example
In this article, we will discuss t-test, types of t-test (one-sample, independent sample, and paired sample) with the help of examples. Statistical method for the comparison of the mean of...read more
Chi-Square Test: Definition and Example
Chi-Square Test: Definition and Example
Chi-square test is a statistically significant test for Hypothesis Testing.
Central Limit Theorem
Central Limit Theorem
Central limit theorem one of the most important and used theorem in statistics and data science, it is in the heart of Hypothesis testing. 
Probability Density Function: Definition, Properties, and Application
Probability Density Function: Definition, Properties, and Application
Probability Density function describes the probability distribution of the continuous random variable. In this article, we will briefly discuss what is probability density function, its properties, its application, and how...read more
Top 10 Probability Questions Asked in Interviews
Top 10 Probability Questions Asked in Interviews
Probability is defined as the likeliness of something to occur or happen. In this article, we will discuss top 10 probability questions that are asked in the interviews with their...read more
PDF vs. CDF: Difference Between PDF and CDF
PDF vs. CDF: Difference Between PDF and CDF
PDF describes the probability distribution of a continuous random variable, while PDF describes the probability distribution of both discrete and continuous random variables. In this article, we will learn what...read more
Decoding Probability Formulas: Understand Chance and Uncertainty
Decoding Probability Formulas: Understand Chance and Uncertainty
In this article, we will briefly discuss the probability formulas that can be used to calculate the probability of any event. The blog discusses some probability formulas that can be...read more
All About Probability Mass Function
All About Probability Mass Function
Discover the fundamentals of probability mass function in this comprehensive guide. Learn what a PMF is, and how to use PMFs to calculate probabilities. You’ll also explore the relationship between...read more
Total Probability Theorem: Definition, Example, and Applications
Total Probability Theorem: Definition, Example, and Applications
Explore the total probability theorem in this comprehensive guide. Learn how to calculate the probability of an event based on different conditions, using the formula for the total probability theorem....read more
Probability Distributions used in Data Science
Probability Distributions used in Data Science
Probability is defined as the likeliness of something to occur or happen and probability distributions are functions that give the relation between all the outcomes of random variable in any...read more
Normal Distribution: Definition and Examples
Normal Distribution: Definition and Examples
A probability distribution is a function that gives the relationship between the outcome of a random variable in any random experiment and its probable values.
Poisson Distribution: Definition and Examples
Poisson Distribution: Definition and Examples
A probability distribution is a function that gives the relationship between the outcome of a random variable in any random experiment and its probable values. In this article, we will...read more
Binomial Distribution: Definition and Examples
Binomial Distribution: Definition and Examples
A probability distribution is a function that gives the relationship between the outcome of a random variable in any random experiment and its probable values.
Difference between Permutation and Combination
Difference between Permutation and Combination
In mathematics, Permutation and Combination are one of the most confusing topics as they are closely related. Permutation counts the number of different arrangements from n objects; on the other...read more
Difference between Eigenvalue and Eigenvector
Difference between Eigenvalue and Eigenvector
Let A be a square matrix of order ‘n-by-n.’ A scalar k is called a eigenvalue of A, if there exist a non-zero vector v satisfying Av = kv, then...read more
All about Symmetric Matrix
All about Symmetric Matrix
A matrix is a rectangular arrangement of numbers (real or complex) or symbols arranged in rows and columns. The number in the matrix are called the elements, and if the...read more
About the Author