When samples are small, using a larger multiplier is crucial, since confidence intervals can have probabilities of actually containing the mean that are far below the reported confidence levels.
When samples are large, there is very little difference between the multiplier from a t distribution and the multiplier from a normal distribution.
An example demonstrates the use of the t distribution.
All of the t distributions are centered at 0 and bell-shaped, much like the standard normal curve. However, they are spread out farther. When the number of degrees of freedom is small, this spread is noticeable. When the number of degrees of freedom is relatively large (more than 30, or so) there is very little practical difference between the t distribution and the standard normal distribution.
The formula to use is
.
This formula is appropriate to use whenever the standard error is estimated and the underlying population is roughly symmetric and mound shaped. It need not be perfectly normal. The t distribution should never be used for proportions, since the sample data in proportions are all 0's and 1's, which looks nothing like a symmetric mound. For larger samples, the need to look approximately normal is not great. The central limit theorem is taking over, and the shape of the sampling distribution will be approximately normal, which is all that the theory relies on.
The shape of the sample indicates that
that the shape of the sampling distribution of
is sufficiently normal
to not worry about it.
We do not know the standard error for
exactly,
so we must estimate it by 18.4 / sqrt( 14 ) = 4.92.
Since we are estimating the SE, we should use the t distribution
instead of the normal.
There are 14 - 1 = 13 degrees of freedom.
The appropriate t
gives an answer 114.0 +/- 10.6.
This can interpreted as
"we are 95% confident that the unknown mean weight of all newborn infants
in Boston is between 103.4 and 124.6 ounces".
Bret Larget, larget@mathcs.duq.edu