Ethan P. Marzban
2023-05-04
Worked-Out Example 1
A veterinarian wishes to determine the true proportion of cats that suffer from FIV (Feline Immunodeficiency Virus). To that end, she takes a representative sample of 100 cats and finds that 3.2% of cats in this sample have FIV.
the population is the corpus of all cats in the world, since we the veterinarian seeks to describe the prevalence of FIV among all cats.
In this context, the sample is the set of 100 cats the veterinarian examined.
The population parameter of interest is \(p =\) “the true proportion of cats that suffer from FIV”.
The value of 3.2% is a sample statistic, because if the veterinarian had taken a different sample of 100 cats she likely would have observed a different proportion of FIV-positive cats.
Exercise 1
A group of (slightly bored) college students would like to determine the true average amount of soda (in liters) in 1-liter soda bottles. To that effect, they purchas 12 different 1-liter soda bottles and find the average amount of soda in these 12 bottles is 0.98L.
So far we’ve seen examples of two different population parameters:
Other population parameters exist! For instance, we could talk about the population median, the population variance, or even the population IQR.
Up until now, I’ve been pretty vague about what “inferences” mean. This is because “making inferences” is a broad term!
One part of making inferences is trying to estimate the value of a population parameter.
Parameter | Symbol | Estimator |
---|---|---|
Mean | \(\mu\) | \(\displaystyle \overline{x} = \frac{1}{n} \sum_{i=1}^{n}\) (the sample mean) |
Proportion | \(p\) | \(\widehat{p}\) (the sample proportion) |
Variance | \(\sigma^2\) | \(\displaystyle s_x^2 = \frac{1}{n - 1} \sum_{i=1}^{n} (x_i - \overline{x})^2\) (the sample variance) |
Std. Dev | \(\sigma\) | \(\displaystyle s_x = \sqrt{\frac{1}{n - 1} \sum_{i=1}^{n} (x_i - \overline{x})^2}\) (the sample std. dev.) |
Important
If we have reasonably representative samples taken from a population with true proportion \(p\) and let \(\widehat{P}\) denote the sample proportion, then \[ \widehat{P} \sim \mathcal{N}\left( p, \ \sqrt{ \frac{p(1 - p)}{n} } \right) \] provided that
Worked-Out Example 1
Suppose a recent study has revealed that 87% of Americans are in favor of offering more healthy options at fast-food restaurants. A surveyor takes a representative sample of size 120 Americans, and records the proportion of these Americans that support offering more healthy options at fast-food restaurants.
Exercise 2
At a certain company, it is known that 65% of employees are from underrepresented minorities (UMs). A representative sample of 80 employees is taken, and the proportion of people from UMs is recorded.
Due to the retroactive change in the success-failure conditions, I will post the written-out solutions to this exercise before Lecture 11.