1331.0 - Statistics - A Powerful Edge!, 1996
ARCHIVED ISSUE Released at 11:30 AM (CANBERRA TIME) 31/07/1998
Page tools: Print Page Print All | ||||||||||||||||||||||||||||||||||||||||||||||||||
INFORMATION - PROBLEMS WITH USING The previous section should have given you an idea of just how important statistical information is in modern society. Decisions that affect the lives of all Australians are often made by taking statistics into account. This places a large responsibility on people who make decisions. They should be aware of the traps one can fall into when using statistics. This section will outline some of the problems you may encounter if you are not careful in using statistics. The quotation below from H.G. Wells was made at the beginning of the 20th century, and few would disagree that it is relevant today. The modern citizen needs to have an awareness of the problems with using statistical information. “STATISTICAL THINKING WILL ONE DAY BE AS NECESSARY FOR EFFICIENT CITIZENSHIP AS THE ABILITY TO READ AND WRITE.” H. G. Wells MISINTERPRETATION OF STATISTICS Misinterpretation is a good example of a common problem in the use of statistical information. It may be caused by a number of factors such as:
The ABS released a labour force publication in November 1992 with the following main feature: Based on the above, a headline in a leading Australian newspaper read: This headline does not logically follow from the main feature above it. The headline represented a lack of understanding about the definition of unemployed. If you are not employed you may be unemployed: that is, in the labour force and actively seeking a job; OR, you may not be in the labour force, for example, you may be a student, retired or not actively looking for work. Just because a family has no member employed does not necessarily mean that those members are unemployed, because to be unemployed you have to be in the labour force: that is, you have to be actively seeking work. The headline showed a misinterpretation based on lack of understanding of an underlying definition.
However, there can be real problems in comparing statistics when the definitions, classifications or methods of collection underpinning them are different. Nowhere is this more apparent than with environmental statistics. Consider the table below.
“POLITICAL TACTICIANS ARE NOT IN SEARCH OF SCHOLARLY TRUTH OR EVEN SIMPLE ACCURACY. THEY ARE LOOKING FOR AMMUNITION TO USE IN THE INFORMATION WARS. DATA, INFORMATION, AND KNOWLEDGE DO NOT HAVE TO BE TRUE TO BLAST AN OPPONENT OUT OF THE WATER.” Alvin Toffler You might say this is an overly cynical quotation, but one does need to realise that information is open to manipulation by various forces, for example: This section outlined some problems you may encounter trying to understand and compare statistical information. Of course, you also have to be careful about how accurately statistics were collected in the first place. This leads to you being aware of sampling and non-sampling error, concepts outlined in the following pages. SAMPLING ERROR In any sample survey that you undertake you will experience sampling error. Sampling error refers to: THE DIFFERENCE BETWEEN AN ESTIMATE DERIVED FROM A SAMPLE SURVEY AND THE ‘TRUE’ VALUE THAT WOULD RESULT IF A CENSUS OF THE WHOLE POPULATION WAS TAKEN. Sampling error can be measured mathematically and is influenced by: Size of sample. In general, the larger the sample size (the number of people being surveyed) the smaller the sampling error. Many people are surprised by the small size of well-known sample surveys. Opinion polls about which party people will vote for are taken with sample sizes ranging from 600 to 2,000 people, with samples of about 1,000 the most likely. Television ratings of different programs and channels are taken from a sample survey of about 1,900 homes, out of an Australian total population of 6.5 million homes. Despite a perception that such polls are accurate, some statisticians would question their accuracy due to the small sample sizes. Design of sample. The method of sampling can also affect the size of sampling error. This concept is looked at in detail on the sections Random Sampling and Non-Random Sampling. NON-SAMPLING ERROR This concept refers to error apart from sampling error. Non-sampling error can occur at any stage of a sample survey or census, and unlike sampling error it is not generally easy and inexpensive to measure. There are two main types of non-sampling error: systematic error and variable error. Variable error is less serious than systematic error because, on average, it tends to balance out . Systematic error does lead to distortion of survey results, so it is important to be aware of how it occurs. SYSTEMATIC ERROR (BIAS) Later in this publication you will come across a technical definition of bias (see section Non-Random Sampling). For the purposes of this section, bias is defined as any influence that unreasonably affects or sways the results of a sample survey or census. There are a number of different sources of bias:
CHANGES IN GLOBAL TEMPERATURE (Degrees Celsius) The measurements that make up the graph have been taken at various weather stations around the world. You can regard the world’s surface as the population from which a sample survey can be taken. Scientists argue, therefore, that measurements should be taken to reflect the ratio of the world’s land mass to its sea mass. For example, if the land mass is half the sea mass, then twice as many measurements should come from the world’s seas as opposed to the land. In fact, in the graph above, there have been very few measurements taken from the world’s sea surfaces, whereas the great majority of measurements were taken from weather stations on land. But why might this bias the estimates from the sample survey? The reason is that temperatures on land tend to be naturally higher than on sea surfaces. This is due to a phenomenon known as urban heat island effect. Hence, if the sample is too heavily weighted towards land based temperatures, and the estimates do not take account of this (as some scientists claim), the results may not reflect a true global average.
For the 1986 Census it was requested that the ABS gather data on ethnicity. Initially a question: ‘What is your cultural background’ was framed. One of the replies to this question was simply ‘none’. When the respondent was contacted he was asked what he meant. He replied, ‘Look, leave me alone, I’m a regular sort of a bloke, I go to the footy every now and then, but I’ve never been to the opera and I’ve never taken up a musical instrument in my life.’ This example shows that people may interpret broad concepts such as ‘cultural background’ quite differently. The question was reframed and written as: ‘What is each person’s ancestry?’. E.g. Greek, Armenian, English... etc.
SUMMARY It is useful to have a checklist of questions ready for whenever you are presented with statistical information. This is not because there are always going to be problems with the statistics, but rather because it will give you confidence in judging their reliability. Some questions you might ask include:
EXERCISES
“ORATORY IS DYING, A CALCULATING AGE HAS STABBED IT IN THE HEART WITH INNUMERABLE DAGGER-THRUSTS OF STATISTICS.” Sir Keith Hancock Click here for answers
|