TECHNICAL NOTE SAMPLING VARIABILITY
ESTIMATION PROCEDURE
1 Estimates derived from this survey were obtained using a post-stratification procedure. Post-stratification is a method of stratifying or dividing a sample into groups after the responses have been received. It is used to improve the quality of results through stratifying by variables that were not used at the time of sample design. In this survey the procedure ensured that the survey estimates for persons conformed to independent estimates of the population by age, sex and part of state.
RELIABILITY OF ESTIMATES
2 Estimates in this publication are subject to non-sampling and sampling errors.
Non-Sampling errors
3 Non-sampling errors may arise as a result of errors in the reporting, recording or processing of the data and can occur even if there is a complete enumeration of the population. Non-sampling errors can be introduced through inadequacies in the questionnaire, non-response, inaccurate reporting by respondents, errors in the application of survey procedures, incorrect recording of answers and errors in data entry and processing.
4 It is difficult to measure the size of the non-sampling errors. The extent of these errors could vary considerably from survey to survey and from question to question. Every effort is made to minimise these errors by the careful design of questionnaires, training and supervision of staff and development of efficient data processing procedures.
Sampling errors
5 Sampling errors are the errors which occurs by chance because the data were obtained from a sample, rather than the entire population.
ESTIMATES OF SAMPLING ERROR
6 One measure of the variability of estimates which occurs as a result of surveying only a sample of the population is the standard error (see table on page 23).
7 There are about 2 chances in 3 (67%) that a survey estimate will differ by less than one standard error from the number that would have been obtained if all persons had been included in the survey. There are about 19 chances in 20 (95%) that the estimate will be less than two standard errors.
8 Linear interpolation is used to calculate the standard error of estimates falling between the sizes of estimates listed in the table.
9 The standard error (SE) can also be expressed as a percentage of the estimate. This is known as the relative standard error (RSE). The RSE is determined by dividing the standard error of an estimate SE(x) by the estimate x and expressing it as a percentage. That is, (where x is the estimate)
10 An example of the calculation and use of standard errors for estimates of households follows. Table 2 shows that the estimated number of households in NSW who used a Split system Air Conditioner as a cooling device during 2006 was 560,800. Since the estimate is between 500,000 and 1,000,000 the standard error (as shown in the table on page 23) will be between 16,216 and 21,662 and can be approximated as 16,800 using linear interpolation. Therefore, there are 2 chances in 3 that the value that would have been obtained, had all persons been included in the survey, lies between 544,000 and 577,600. Similarly, there are about 19 chances in 20 that the value lies between 527,200 and 594,400.
11 Proportions and percentages formed from the ratio of two estimates are also subject to sampling error. The size of the error depends on the accuracy of both the numerator and the denominator. The formula for the relative standard error (RSE) of a proportion or percentage is,
12 Only estimates with relative standard errors of 25% or less, and percentages based on such estimates, are considered sufficiently reliable for most purposes. However, estimates and percentages with a larger RSE have been included, preceded by * (RSE between 25% and 50%) or ** (RSE greater than 50%) to indicate that they are subject to high standard errors and should be used with caution.
Standard errors of estimates of NSW persons - October 2006 |
| |
| Size of estimate | Standard error | Relative standard error | |
| persons | no. | % | |
| |
| 1 000 | 645 | 64.5 | |
| 1 500 | 824 | 54.9 | |
| 2 000 | 977 | 48.8 | |
| 2 500 | 1 114 | 44.5 | |
| 3 000 | 1 238 | 41.3 | |
| 3 500 | 1 353 | 38.6 | |
| 4 000 | 1 460 | 36.5 | |
| 5 000 | 1 657 | 33.1 | |
| 8 000 | 2 151 | 26.9 | |
| 10 000 | 2 430 | 24.3 | |
| 20 000 | 3 514 | 17.6 | |
| 30 000 | 4 333 | 14.4 | |
| 50 000 | 5 602 | 11.2 | |
| 100 000 | 7 841 | 7.8 | |
| 200 000 | 10 822 | 5.4 | |
| 300 000 | 12 982 | 4.3 | |
| 500 000 | 16 216 | 3.2 | |
| 1 000 000 | 21 662 | 2.2 | |
| 2 000 000 | 28 533 | 1.4 | |
| 5 000 000 | 40 190 | 0.8 | |
| 10 000 000 | 51 235 | 0.5 | |
| |
13 Where differences between data items have been noted in the Summary of Findings, they are statistically significant unless otherwise specified. In this publication a statistically significant difference is one where there are 19 chances in 20 that the difference noted reflects a true difference between population groups of interest rather than being the result of sampling variability.