What is the sum of all the members of the set divided by the number of items in the set?

Published on October 9, 2020 by Pritha Bhandari. Revised on May 22, 2022.

The mean (aka the arithmetic mean, different from the geometric mean) of a dataset is the sum of all values divided by the total number of values. It’s the most commonly used measure of central tendency and is often referred to as the “average.”

Mean formulas for populations and samples

In research, you often collect data from samples and perform inferential statistics to understand the population they came from.

The formulas for the sample mean and the population mean only differ in mathematical notation. Population attributes use capital letters while sample attributes use lowercase letters.

Population mean

Population mean formulaExplanation
What is the sum of all the members of the set divided by the number of items in the set?
  • What is the sum of all the members of the set divided by the number of items in the set?
    = population mean
  • What is the sum of all the members of the set divided by the number of items in the set?
     = sum of each value in the population
  • What is the sum of all the members of the set divided by the number of items in the set?
    = number of values in the population

    The population mean can also be denoted as μ.

    Sample mean

    Sample mean formulaExplanation
    What is the sum of all the members of the set divided by the number of items in the set?
    • What is the sum of all the members of the set divided by the number of items in the set?
      = sample mean
    • What is the sum of all the members of the set divided by the number of items in the set?
       = sum of each value in the sample
    • What is the sum of all the members of the set divided by the number of items in the set?
      = number of values in the population

        The sample mean is also referred to as M.

        Steps for calculating the mean

        There are two steps for calculating the mean:

        1. Add up all the values in the data set.
        2. Divide this number by the number of values.

        We’ll walk through these steps with a sample data set.

        Let’s say you want to find the average amount people spend on a restaurant meal in your neighborhood. You ask a sample of 8 neighbors how much they spent the last time they went out for dinner, and find the mean cost.

        Data set
        Cost of dinner for two (USD)42 13 31 87 24 58 76 69

        Step 1: Find the sum of the values by adding them all up

        Because we’re working with a sample, we use the sample formula.

        FormulaCalculation
        What is the sum of all the members of the set divided by the number of items in the set?
         42 + 13 + 31 + 87 + 24 + 58 + 76 + 69 = 400

        Step 2: Divide the sum by the number of values

        In the formula, n is the number of values in your data set. Our data set has 8 values.

        FormulaCalculation
        What is the sum of all the members of the set divided by the number of items in the set?
        What is the sum of all the members of the set divided by the number of items in the set?
        = 8
        What is the sum of all the members of the set divided by the number of items in the set?
        = 400
        What is the sum of all the members of the set divided by the number of items in the set?
        = 400
        What is the sum of all the members of the set divided by the number of items in the set?
        8 = 50

        The mean tells us that in our sample, participants spent an average of 50 USD on their restaurant bill.

        Outlier effect on the mean

        Outliers are extreme values that differ from most values in the data set. Because all values are used in the calculation of the mean, an outlier can have a dramatic effect on the mean by pulling the mean away from the majority of the values.

        Let’s see what happens to the mean when we add an outlier to our data set.

        Data set
        Cost of dinner for two (USD)42 13 31 87 24 58 76 69 230

        Step 1: Find the sum of the values by adding them all up

        FormulaCalculation
        What is the sum of all the members of the set divided by the number of items in the set?
        42 + 13 + 31 + 87 + 24 + 58 + 76 + 69 + 230 = 630

        Step 2: Divide the sum by the number of values

        FormulaCalculation
        What is the sum of all the members of the set divided by the number of items in the set?
        What is the sum of all the members of the set divided by the number of items in the set?
        = 9
        What is the sum of all the members of the set divided by the number of items in the set?
        = 630
        What is the sum of all the members of the set divided by the number of items in the set?
        = 630
        What is the sum of all the members of the set divided by the number of items in the set?
        9 = 70

        As we can see, adding just one outlier to our data set raised the mean by 20 USD. In this case, a different measure of central tendency, like the median, would be more appropriate.

        When can you use the mean, median or mode?

        The mean is the most widely used measure of central tendency because it uses all values in its calculation. The best measure of central tendency depends on your type of variable and the shape of your distribution.

        Type of variable

        The mean can only be calculated for quantitative variables (e.g., height), and it can’t be found for categorical variables (e.g., gender).

        In categorical variables, data is placed into groupings without exact numerical values, so the mean cannot be calculated. For categorical variables, the mode is the best measure of central tendency because it tells you the most common characteristic or popular choice for your sample.

        But for continuous or discrete variables, you have exact numerical values. With these, you can easily calculate the mean or median.

        Distribution shape

        The mean is best for data sets with normal distributions. In a normal distribution, data is symmetrically distributed with no skew. Most values cluster around a central region, with values tapering off as they go further away from the center.

        The mean, mode and median are exactly the same in a normal distribution.

        What is the sum of all the members of the set divided by the number of items in the set?

        In skewed distributions, more values fall on one side of the center than the other, and the mean, median and mode all differ from each other. One side has a more spread out and longer tail with fewer scores at one end than the other.

        What is the sum of all the members of the set divided by the number of items in the set?
        What is the sum of all the members of the set divided by the number of items in the set?

        For skewed distributions and distributions with outliers, the mean is easily influenced by extreme values and may not accurately represent the central tendency. The median is a better measure for these distributions as it takes a value from the middle of the distribution.

        Alternatively, you can systematically review and remove outliers from your dataset in the data cleansing process.

        Frequently asked questions about the mean

        How do I find the mean?

        You can find the mean, or average, of a data set in two simple steps:

        • Find the sum of the values by adding them all up.
        • Divide the sum by the number of values in the data set.

        This method is the same whether you are dealing with sample or population data or positive or negative numbers.

        What are the different types of means?

        The arithmetic mean is the most commonly used mean. It’s often simply called the mean or the average. But there are some other types of means you can calculate depending on your research purposes:

        • Weighted mean: some values contribute more to the mean than others.
        • Geometric mean: values are multiplied rather than summed up.
        • Harmonic mean: reciprocals of values are used instead of the values themselves.

        Cite this Scribbr article

        If you want to cite this source, you can copy and paste the citation or click the “Cite this Scribbr article” button to automatically add the citation to our free Citation Generator.

        Bhandari, P. (2022, May 22). What Is the Mean | How to Find It & Examples. Scribbr. Retrieved November 1, 2022, from https://www.scribbr.com/statistics/mean/

        Is this article helpful?

        You have already voted. Thanks :-) Your vote is saved :-) Processing your vote...

        What is the sum of all the values in the data set divided by the number of values in the data set?

        The mean (aka the arithmetic mean, different from the geometric mean) of a dataset is the sum of all values divided by the total number of values. It's the most commonly used measure of central tendency and is often referred to as the “average.”

        What is the sum of the values of a group of items divided by the number of such items?

        The mean (also called arithmetic mean), in everyday language called the average, is the sum of the values of a group of numbers divided by the amount of numbers in the group.

        What is a sum of numbers in a data set divided by how many numbers are in the data set it's also called the average?

        The Mean of a Data Set The mean of a set of numbers, sometimes simply called the average , is the sum of the data divided by the total number of data.

        Is the sum of total divided by the number of data points?

        The arithmetic mean is the sum of all of the data points divided by the number of data points.