Write python code to find mean variance and standard deviation for a list of numbers
Mean and standard deviation are two essential metrics in Statistics. We can use the statistics module to find out the mean and standard deviation in Python. Standard deviation is also abbreviated as SD. The mean is the sum of all the entries divided by the number of entries. For example, if we have a list
of 5 numbers [1,2,3,4,5], then the mean will be (1+2+3+4+5)/5 = 3. Standard deviation is a measure of the amount of variation or dispersion of a set of values. We first need to calculate the mean of the values, then calculate the variance, and finally the standard deviation. Let’s say we have the data of population per square kilometer for different states in the USA. We can calculate
the standard deviation to find out how the population is evenly distributed. A smaller value means that the distribution is even whereas a larger value means there are very few people living in some places while some areas are densely populated. Let’s look at the steps required in calculating the mean and standard deviation. Let’s write the code to calculate the mean and standard deviation in Python. We will use the statistics module and later on try to write our own implementation. This module provides you the option of calculating mean and standard deviation directly. Let’s start by importing the module. Let’s declare a list with sample data. Now to calculate the mean of the sample data, use the following function: This statement will return the mean of the data. We can print the mean in the output using: print("Mean of the sample is % s " %(statistics.mean(data))) We get the output as: Mean of the sample is 13.666666666666666 If you are using an IDE for coding you can hover over the statement and get more information on statistics.mean() function. Alternatively, you can read the documentation here. To calculate the standard deviation of the sample data use: print("Standard Deviation of the sample is % s "%(statistics.stdev(data))) We get the output as: Standard Deviation of the sample is 15.61623087261029 Here’s a brief documentation of statistics.stdev() function. Complete Code to Find Standard Deviation and Mean in PythonThe complete code for the snippets above is as follows : import statistics data = [7,5,4,9,12,45] print("Standard Deviation of the sample is % s "% (statistics.stdev(data))) print("Mean of the sample is % s " % (statistics.mean(data))) 2. Write Custom Function to Calculate Standard DeviationLet’s write our function to calculate the mean and standard deviation in Python. def mean(data): n = len(data) mean = sum(data) / n return mean This function will calculate the mean. Now let’s write a function to calculate the standard deviation. This can be a little tricky so let’s go about it step by step. The standard deviation is the square root of variance. So we can write two functions:
The function for calculating variance is as follows: def variance(data): n = len(data) mean = sum(data) / n deviations = [(x - mean) ** 2 for x in data] variance = sum(deviations) / n return variance You can refer to the steps given at the beginning of the tutorial to understand the code. Now we can write a function that calculates the square root of variance. def stdev(data): import math var = variance(data) std_dev = math.sqrt(var) return std_dev Complete CodeThe complete code is as follows : import numpy as np #for declaring an array or simply use list def mean(data): n = len(data) mean = sum(data) / n return mean def variance(data): n = len(data) mean = sum(data) / n deviations = [(x - mean) ** 2 for x in data] variance = sum(deviations) / n return variance def stdev(data): import math var = variance(data) std_dev = math.sqrt(var) return std_dev data = np.array([7,5,4,9,12,45]) print("Standard Deviation of the sample is % s "% (stdev(data))) print("Mean of the sample is % s " % (mean(data))) ConclusionThe mean and Standard deviation are mathematical values used in statistical analysis. Python statistics module provides useful functions to calculate these values easily. What’s Next?
Resources
How do you find standard deviation and variance in Python?Coding a stdev() Function in Python
sqrt() to take the square root of the variance. With this new implementation, we can use ddof=0 to calculate the standard deviation of a population, or we can use ddof=1 to estimate the standard deviation of a population using a sample of data.
How do you find the mean and STD of a list in Python?You can use one of the following three methods to calculate the standard deviation of a list in Python:. Method 1: Use NumPy Library import numpy as np #calculate standard deviation of list np. ... . Method 2: Use statistics Library import statistics as stat #calculate standard deviation of list stat.. How do you find the variance of a list of numbers in Python?How to Get the Variance of a List in Python?. With External Dependency: Import the NumPy library with import numpy as np and use the np. var(list) function.. Without External Dependency: Calculate the average as sum(list)/len(list) and then calculate the variance in a list comprehension statement.. How do I find the SD of a list in Python?Calculate the Standard Deviation of a List in Python. Use the pstdev() Function of the statistics Module to Calculate the Standard Deviation of a List in Python.. Use the std() Function of the NumPy Library to Calculate the Standard Deviation of a List in Python.. |