Module 3: Interpreting Data
4. Using data to support an argument; making inferences
4.2 Using null and alternative hypotheses
Is some desired effect present in the population or not?
With this type of question we want to know whether or not a factor or a treatment has an effect on a population. Such questions are answered by conducting research that uses available evidence, observations and experimentation.
Have a look at the issue regarding Vitamin C.
You may believe or have heard people say that taking large doses of vitamin C prevents colds. Vitamin C is a nutrient that is essential to humans for normal functioning. However, some scientists postulated that Vitamin C also could help people to fight off colds and flu. This idea has often been attributed to the Nobel Prize winner for Chemistry, Dr. Linus Pauling.
So the question is: "Does Vitamin C have no effect on colds in the population or does it have an effect?"
The question is answered by developing null and alternative hypotheses.
EXAMPLE
In Toronto (Moore 1991) an experiment was conducted to discover if large doses of vitamin C prevented colds. 500 volunteers were randomly assigned, half each, to the treatment group and the control group. Members of the treatment group were given 1g/day of vitamin C (about one vitamin C tablet/day) and if they felt as though they might be getting a cold the dosage was increased to 4g/day. Those in the control group were given a placebo.
It is possible for this study to be framed in a particular way that indicates competing beliefs about the population.
- First we assume that Vitamin C makes no difference - has no effect. This is called the null hypothesis - the no change hypothesis. The symbol used is H0 - "H" for hypothesis and "0" for zero change. (The word "null" is another way of saying "zero".)
- Second we set up an alternate hypothesis Ha (or H1) which takes the opposite point of view - namely Vitamin C does make a difference.
- The data is used to produce a test value - a test statistic - in this case it measures the average number of colds caught by each group (treatment group and control group).
- Then the process evaluates whether the difference between the groups is significantly large.
So, in statistical inference the language used is to talk about alternative hypotheses. The current belief (no change situation) is written as an hypothesis called a null hypothesis (H0) and the alternative position is written as an hypothesis called the alternative hypothesis (Ha or H1).
We can write the hypotheses as follows:
| H0: | there is no difference between the Vitamin C group and the placebo group |
| Ha: | there is a significant difference between the Vitamin C group and the placebo group |
or:
| H0: | the Vitamin C group and the placebo group catch the (approximately) same number of colds |
| Ha: | the Vitamin C group catches a significantly smaller number of colds |
or, using symbols, where is the mean of the population taking Vitamin C and is the mean of the population not taking Vitamin C (control group) :
From the data, an appropriate test statistic can be calculated that assesses the strength of the evidence against the null hypothesis. If the statistic is significant then the data lead you to reject the null hypothesis and accept the alternative hypothesis. [This process is beyond the scope of our module.]
To be statistically significant the experimental evidence for Vitamin C needs to be so strong that it would appear by chance only rarely in many experiments.
That is, if there was no significant difference between Vitamin C and the placebo, the only differences occurring in the mean number of colds caught by each group would be due to random effects.
EXAMPLES
1. The General Manager of Fresh'n'Easy air conditioners tells an investigative reporter that at least 85% of its customers are "completely satisfied" with their overall purchase performance. What hypotheses will be used by the reporter to test the claim?
Answer:
Let be the true proportion of "complete satisfaction" in the population.
| H0: | there is no difference between the manager's claim and reality
- i.e., 85% or more are satisfied |
| H1: | less than 85% of customers are satisfied |
Or, using symbols:
| H0: | ≥ 0.85 |
| H1: | < 0.85 |
2. A consumers' advocate claims that the contents of only 70% of muesli packets produced by Big Oats have contents with the same mass as claimed on the packet. What hypotheses will be used to test the claim
Answer:
Let be the true proportion of packets in the population which weigh the same as the stated contents.
| H0: | there is no difference between the advocate's claim and reality
- i.e., 70% have the same mass as that printed on the packet |
| H1: | 70% do not match the stated contents |
Or, using symbols:
| H0: | = 0.70 |
| H1: | ≠ 0.70 |
This alternate hypothesis means some packets are overfilled, and some are underfilled. Naturally the overfilled ones do not generate consumer complaints, but the underfilled ones will!
3. A student counsellor claims that first year Science students spend an average 3 hours per week doing exercises in each subject. What hypotheses will be used by a lecturer to test the claim
Answer:
Let be the true mean number of hours spent per week
| H0: | there is no difference between the counsellor's claim and reality
- i.e., 3 hours are spent per week |
| H1: | the amount of exercise hours is not 3 (it could be more, it could be less) |
Or, using symbols:
| H0: | =3 |
| H1: | ≠ 3 |
4. This example deals with credit accounts held with a company. To conclude that the system is cost effective requires the manager to show that the mean account balance for all customers is greater than $1700 - so we set up hypothesis:
Answer:
Let be the true mean account balance
| H0: | the system is not effective |
| H1: | the system is effective |
Or, using symbols:
| H0: | ≤ 1700 |
| H1: | > 1700 |
|