10.4: The One Variance Chi-Squared Procedure
( \newcommand{\kernel}{\mathrm{null}\,}\)
To create the One Mean Z and T Procedures and One Proportion Z Procedure, we used the Central Limit Theorem for sample means and proportions. While there is no Central Limit Theorem for sample variances, we can still construct a confidence interval using the fact that s2σ2/(n−1) follows a well-studied χ2-distribution with n−1 degrees of freedom, i.e.
s2σ2/(n−1)∼χ2df=n−1
The following is the equivalent of the Generalized Empirical Rule:
P(χ21−α/2<s2σ2/(n−1)<χ2α/2)=1−α
After manipulating the inequality inside the parenthesis, we can conclude that:
P(σ2χ21−α/2n−1<s2<σ2χ2α/2n−1)=1−α
Therefore, we have the following two interpretations:
We are (1−α)⋅100% confident that
(1) s2i is between σ2χ21−α/2n−1 and σ2χ2α/2n−1
OR
(2) σ2 is between s2i(n−1)χ2α/2 and s2i(n−1)χ21−α/2.
Hence, we are (1−α)⋅100% confident that σ2 is between s2iχ21−α/2n−1 and s2iχ2α/2n−1.
Unlike the previous procedures, the One Variance Chi-Square Procedure is based on a distribution that is not symmetric. As a result, the confidence interval is not symmetric. While the sample variance, s2i, is still an unbiased estimate of the population variance it is not the center of the interval. From the practical point of view, this means that the interval can be found by computing its lower and upper bounds directly with the following formulas:
LB=s2i(n−1)χ2α/2 and RB=s2i(n−1)χ21−α/2
Note that we can always turn a confidence interval for the population variance into a confidence interval for the population standard deviation by taking the square root of the boundaries!
LB=√s2i(n−1)χ2α/2 and RB=√s2i(n−1)χ21−α/2
As a result, the following template will be used to construct and interpret the confidence intervals:
Unknown Parameter |
σ2, population variance |
---|---|
Point Estimate |
s2i, the sample variance |
Confidence Level (of the point estimate) |
0% |
Confidence Level (for a new interval estimate) |
(1−α)100% α/2 1−α/2 |
Critical Value(s) | χ2α/2 and χ21−α/2 for df=n−1 |
Margin of Error |
n/a |
Lower Bound |
LB=s2i(n−1)χ2α/2 |
Upper Bound |
RB=s2i(n−1)χ21−α/2 |
Interpretation | We are (1−α)100% confident that the unknown population variance σ2 is between LB and UB. |
For the procedure to work the following assumptions must be made:
- The sample is simple random.
- The population has a normal distribution.
Consider the following example.
For a certain drug, based on standards set by the United States Pharmacopeia (USP) - an official public standards-setting authority for all prescription and over-the-counter medicines and other health care products manufactured or sold in the United States, a standard deviation of capsule weights of less than 2 mg is acceptable. A sample of 10 capsules was taken and the weights are provided below:
99.8, 100.6, 101.7, 100.8, 99.5, 98.9, 101.7, 100.8, 99.8, 98.2
Construct and interpret a 90% confidence interval for the standard deviation of the weights of all capsules. You may assume that the weights of capsules are normally distributed.
Solution
First, let’s check if all necessary assumptions are satisfied:
- The sample is simple random.
- The population has a normal distribution.
Also, note that the sample mean and variance are respectively:
ˉxi=100.18
s2i=1.32
Once the assumptions are verified, we may apply the procedure by using the template.
Example | Template | |
---|---|---|
Unknown Parameter |
σ2, population variance of capsule weights |
σ2, population variance |
Point Estimate |
s2i=1.32, the sample variance of a sample of size n=10 |
s2i, the sample variance |
Confidence Level (of the point estimate) |
0% |
0% |
Confidence Level (for a new interval estimate) |
CL=90% α/2=0.05 1−α/2=0.95 |
(1−α)100% α/2 1−α/2 |
Critical Value(s) | When df=9: χ20.95=3.325 and χ20.05=16.919 | χ2α/2 and χ21−α/2 for df=n−1 |
Margin of Error |
n/a |
n/a |
Lower Bound |
LB=1.32⋅916.919=0.70 | LB=s2i(n−1)χ2α/2 |
Upper Bound |
RB=1.32⋅93.325=3.57 | RB=s2i(n−1)χ21−α/2 |
Interpretation | We are 90% confident that the variance of capsule weights is between 0.70 and 3.57. | We are (1−α)100% confident that the unknown population variance σ2 is between LB and UB. |
Now we can interpret the results: We are 90% confident that the variance of capsule weights is between 0.70 and 3.57.
Remember that we can always convert a confidence interval for the population variance to a confidence interval for the population standard deviation by taking the square root of the boundaries!
In our example, we are 90% confident that the weight variance of capsules is between 0.70 and 3.57, therefore we are 90% confident that the weight standard deviation of capsules is between 0.83 and 1.89 mg - well within the standard 2 mg.