Select Page

Standard deviation is a widely used measure of dispersion or variability in a dataset. It quantifies the average distance or spread of data points from the mean of the dataset. The standard deviation provides valuable insights into the consistency, reliability, and distribution of data around the mean.

Formula for Standard Deviation:

For a sample:

s=∑(x1−xˉ)2n−1

For a population:


∑()2n

Where:

  • represents each individual data point



  • represents the sample mean

    represents the population mean

  • represents the sample size
  • represents the population size

  • ∑(��−�ˉ)2

    or 


    represents the sum of squared differences between each data point and the mean



  • denotes the square root

Characteristics and Considerations:

  1. Sensitivity to Variability: Standard deviation measures the spread of data points around the mean. A higher standard deviation indicates greater variability, while a lower standard deviation indicates less variability.
  2. Units of Measurement: Standard deviation is expressed in the same units as the original data, making it interpretable and relevant in various contexts.
  3. Square Root of Variance: Standard deviation is the square root of variance, providing a measure of dispersion that considers the original units of measurement.
  4. Population vs. Sample: The formulas for standard deviation differ slightly depending on whether the data represents a population or a sample, with the sample formula (using

    ) providing an unbiased estimate of the population standard deviation.

  5. Comparison with Mean Deviation: Compared to mean deviation, which considers the absolute differences, standard deviation considers the squared differences, giving more weight to larger deviations and providing a comprehensive measure of data variability.

Example:

Consider the following dataset representing the daily sales (in units) of a product for a week:

100,105,110,115,120

To calculate the standard deviation:

  1. Calculate the sample mean:

    x=100+105+110+115+1205=5505=110 units


  2. 5100+105+110+115+120​=5550​=110 units

  3. Calculate the squared differences between each data point and the mean:

    (100−110)2=100

     


    (105−110)2=25

     


    (110−110)2=0

     


    (115−110)2=25

     


    (120−110)2=100

     

  4. Calculate the sum of squared differences:

    ∑(ˉ)2=100+25+0+25+100=250

     

  5. Calculate the sample standard deviation:

    s=2505−1=2504=62.5≈7.91 units

     

The sample standard deviation of the daily sales for the week is approximately 7.91 units, indicating the average distance of daily sales from the mean of 110 units.

Standard deviation is a fundamental measure of dispersion that quantifies the average spread of data points around the mean. By considering the squared differences, standard deviation provides a comprehensive and interpretable measure of data variability, making it a valuable tool in descriptive statistics, data analysis, and statistical inference.