Variance and Standard Deviation: Definition, Formulas
Variance and Standard Deviation: In statistics, the two most essential measurements are variance and standard deviation. The main difference between variance and the standard deviation is in the units they use. The variance is expressed in square units, while the standard deviation is expressed in the same units as the data. Here, we aim to discuss the relationship between Variance and Standard Deviation by knowing their definitions, and also the formulas for finding the values of Variance and Standard Deviation of various frequency distributions.
Variance vs. Standard Deviation
Standard deviation is a measure of the distribution of statistical data, whereas the variance of data points is a measure of how they deviate from the mean.
The variance of a variate \(X\) is the arithmetic mean of the squares of all deviations of \(X\) from the arithmetic mean of the observations and is denoted by \({\mathop{\rm Var}\nolimits} (X)\) or \({\sigma ^2}\).
The standard deviation of a variate \(X\) is the positive square root of its variance. Thus, Standard deviation \((\sigma ) = \sqrt {{\mathop{\rm Var}\nolimits} (X)} \)
Thus, variance and standard deviation of Individual Observations can be computed by applying any of the formulas given as equations \(\left( i \right),\,\left( {ii} \right)\), or \(\left( {iii} \right)\).
Variance and Standard Deviation Using Actual Mean:
Step 1: Compute the mean \(\overline X \) of the given observations \({x_1},\,{x_2},\, \ldots ,\,{x_n}\).
Step 2: Take the deviations of the observations from the mean i.e. find \({x_i} – \overline X ;\,i = 1,\,2,\, \ldots ,\,n\).
Step 3: Square the deviations obtained in step \(2\) and obtain the sum i.e., find \(\sum\limits_{i = 1}^n {{{\left( {{x_i} – \overline X } \right)}^2}} \).
Step 4: Divide the sum obtained in step \(3\) by \(n\). This gives the value of variance of \(X\).
Let \({x_1},\,{x_2},\,{x_3},\, \ldots ,\,{x_n}\) be \(n\) values of a variate \(X\). If these values are changed to \({x_1} + a,\,{x_2} + a,\, \ldots ,\,{x_n} + a\), where \(a \in R\) then the variance remains unchanged.
Let \({x_1},\,{x_2},\,{x_3},\, \ldots ,\,{x_n}\) be \(n\) values of a variate \(X\) and let a be a non-zero real number. Then, the variance of the observations \(a{x_1},\,a{x_2},\, \ldots ,\,a{x_n}\) is \({a^2}{\mathop{\rm Var}\nolimits} (X)\).
2. Discrete Frequency Distribution
If \({x_i}\) or \({f_i};i = 1,\,2,\, \ldots ,\,n\) is a discrete frequency distribution of a variate \(X\), then
If the values \({{x_i}}\) of variate \(X\) and/or frequencies \({{f_i}}\) are large the calculation of variance using the formulas \((iii)\), and \((v)\) is quite tedious and time consuming.
In such a case, we take deviations of the values of variable \(X\) from an arbitrary point \(A\) (say). If \({d_i} = {x_i} – A,\,i = 1,\,2,\, \ldots ,\,n\), then the above formula reduces to
Sometimes \({d_i} = {x_i} – A\) are divisible by a common number \(ℎ\) (say).If we define \({u_i} = \frac{{{x_i} – A}}{h} = \frac{{{d_i}}}{h},\,i = 1,\,2,\, \ldots ,\,n\) then we obtain the following formula for variance.
Thus, the formulas \((iv),\,(v),\,(vi)\), and \((vii)\) can be used finding the variance of a discrete frequency distribution.
Variance and Standard Deviation Using Actual Mean
Here, we use the formula, \({\mathop{\rm Var}\nolimits} (X) = \frac{1}{N}\left[ {\sum\limits_{i = 1}^n {{f_i}} {{\left( {{x_i} – \overline X } \right)}^2}} \right]\), and Standard Deviation \((\sigma ) = \sqrt {{\mathop{\rm Var}\nolimits} (X)} = \sqrt {\frac{1}{N}\left[ {\sum\limits_{i = 1}^n {{f_i}} {{\left( {{x_i} – \overline X } \right)}^2}} \right]} \)
Step 1: Write the given frequency distribution.
Step 2: Find the mean \(\overline X \) of the given frequency distribution.
Step 3: Compute deviations \(\left( {{x_i} – \overline X } \right)\) from the mean \(\overline X \).
Step 4: Find the squares of deviations obtained in step \(3\).
Step 5: Multiply the squared deviations by respective frequencies and obtain the total \(\Sigma {f_i}{\left( {{x_i} – \overline X } \right)^2}\).
Step 6: Divide the total obtained in step \(5\) by \(N = \Sigma {f_i}\) to obtain the variance.
Variance and Standard Deviation Using Assumed Mean
Here, we use the formula, \({\mathop{\rm Var}\nolimits} (X) = \left[ {\left( {\frac{1}{N}\Sigma {f_i}d_i^2} \right) – {{\left( {\frac{1}{N}\Sigma {f_i}{d_i}} \right)}^2}} \right]\) and Standard Deviation \((\sigma ) = \sqrt {{\mathop{\rm Var}\nolimits} (X)} = \sqrt {\left( {\frac{1}{N}\Sigma {f_i}d_i^2} \right) – {{\left( {\frac{1}{N}\Sigma {f_i}{d_i}} \right)}^2}} \)
Step 1: Let the assumed mean \( = A\). Calculate the deviations of observations from \(A\) i.e., \({d_i} = {x_i} – A\) where deviation \( = {d_i}\).
Step 2: Then, find \(\Sigma {f_i}{d_i}\) i.e., first multiply each deviation by their respective frequencies and then calculate the sum.
Step 3: Calculate the squares of deviations obtained in step \(1\) i.e., \(d_i^2\).
Step 4: Multiply the squared deviations by respective frequencies and obtain the total i.e., \(\Sigma {f_i}d_i^2\).
Step 5: Substitute the values in the formula, \({\mathop{\rm Var}\nolimits} (X) = \left( {\frac{1}{N}\Sigma {f_i}d_i^2} \right) – {\left( {\frac{1}{N}\Sigma {f_i}{d_i}} \right)^2}\) and simplify.
3. Grouped or Continuous Frequency Distribution:
Any of the strategies outlined above for a discrete frequency distribution may be applied in a grouped or continuous frequency distribution. We use the following algorithm for computing variance of a grouped or continuous frequency distribution.
Step 1: Find the mid-points of various classes.
Step 2: Take the deviations of these mid-points from an assumed mean. Denote these deviations by \({d_i}\)
Step 3: Divide the deviations in step \(2\) by the class interval \(ℎ\) and denote them by \({u_i}\), i.e \({u_i} = \frac{{{d_i}}}{h}\).
Step 4: Multiply the frequency of each class with the corresponding \({u_i}\) and obtain \(\Sigma {f_i}{u_i}\).
Step 5: Square the values of \({u_i}\) and multiply them with the corresponding frequencies and obtain \(\Sigma {f_i}u_i^2\).
Step 6: Substitute the values of \(\sum {{f_i}} {u_i},\,\Sigma {f_i}u_i^2h\) and \(N = \sum\limits_i {{f_i}} \) in the formula, \({\mathop{\rm Var}\nolimits} (X) = {h^2}\left\{ {\frac{1}{N}\sum {{f_i}} u_i^2 – {{\left( {\frac{1}{N}\sum {{f_i}} {u_i}} \right)}^2}} \right\}\). Simplify.
Relationship Between Variance and Standard Deviation
The square root of the arithmetic means of the squares of the deviations measured from the arithmetic mean of the data is the standard deviation. The mean of the squares of the deviations from the mean is the variance.
So, mathematically, we can say that the square root of variance is standard deviation, and the square of standard deviation is variance.
Q.1. Find the variance and standard deviation for the data: \({\rm{65,}}\,{\rm{68,}}\,{\rm{58,}}\,{\rm{44,}}\,{\rm{48,}}\,{\rm{45,}}\,{\rm{60,}}\,{\rm{62,}}\,{\rm{60,}}\,{\rm{50}}\)
Ans: Let \(\overline X \) be the mean of the given set of observations. Then,
Q.2. For a group of \(200\) candidates the mean and S.D. were found to be \(40\) and \(15\) respectively. Later on, it was found that the score \(43\) was misread as \(34\). Find the correct mean and correct S.D.
Ans: Given, \(n = 200,\,\overline X = 40,\,\sigma = 15\)
\(\overline X = \frac{1}{n}\Sigma {x_i} \Rightarrow \Sigma {x_i} = n\overline X = 200 \times 40 = 8000\)
Standard deviation and variance are statistical qualities that quantify dispersion around a central tendency, the arithmetic means in most cases. The higher the standard deviation and variance of a collection of scores, the more the observations (or data points) are spread out around the mean. This article explains and derives the various formulas to calculate it for three types of data: individual observations, discrete frequency distribution, continuous or grouped frequency distribution. Although they appear to be distinct values, they are related to each other. The square root of the variance is the standard deviation.
Frequently Asked Questions (FAQs)
Q.1. How do you calculate variance and standard deviation? Ans: If \({x_1},\,{x_2},\, \ldots ,\,{x_n}\) are \(n\) values of a variable \(X\), then \({\mathop{\rm Var}\nolimits} (X) = \frac{1}{n}\left\{ {\sum\limits_{i = 1}^n – {{\left( {{x_i} – \overline X } \right)}^2}} \right\}\), and Standard deviation \( = \sqrt {{\mathop{\rm Var}\nolimits} (X)} \) To calculate the variance, follow these steps: Step 1: Find the mean \((\overline X )\) of the given observations Step 2: Subtract \(\overline X \) from each observation Step 3: Find the square of each result Step 4: Find the average of all squared values, which is the required variance.
Q.2. Which is better standard deviation and variance? Ans: Standard deviation and variance are statistical qualities that quantify dispersion around the arithmetic mean. The higher the standard deviation and variance of data, the more the observations (or data points) are spread out around the mean. Although standard deviation and variance are closely related to descriptive statistics, the standard deviation is more commonly used because it is more intuitive in terms of units of measurement; the variance is reported in the squared values of units of measurement, whereas standard deviation is reported in the same units as the data.
Q.3. What is the relation between standard deviation and variance? Ans: The variance is the square of the standard deviation. In other words, the standard deviation is the positive square root of variance.
Q.4. Why do we need variance and standard deviation? Ans: Variance and standard deviation both assist in determining the distribution of data in a population from a mean, but standard deviation provides greater information regarding the deviation of data from a mean.
Q.5. What does the variance tell you? Ans. The variance is a measure for determining how variable a value is. The average of squared deviations from the mean is used to compute it. The degree of dispersion in a data collection is measured by variance. The bigger the variance in respect to the mean, the more spread out the data is about the central deviation.
Hope this detailed article on Variance and Standard Deviation helps you in your preparation. In case of any query, reach out to us in the comment section and we will get back to you at the earliest.