TECHNICAL NOTE 2 DATA RELIABILITY
INTRODUCTION
1 For 2006-07 the mining collection was a sample survey designed primarily to deliver national estimates at the industry subdivision and selected class level. Industry division estimates (excluding Subdivision 10 Exploration and Other Mining Support Services) for states and territories for key data variables are also produced, but the survey was not specifically designed for these purposes.
SAMPLING ERROR
2 The majority of data in this publication have been obtained from a sample of mining businesses. As such, these data are subject to sampling variability; that is, they may differ from the figures that would have been produced if the data had been obtained from all mining businesses in the population. One measure of the likely difference is given by the standard error, which indicates the extent to which an estimate might have varied by chance because the data were obtained from only a sample of units.
3 There are about two chances in three that a sample estimate will differ by less than one standard error from the figure that would have been obtained if the data had been obtained from all units, and about 19 chances in 20 that the difference will be less than two standard errors.
4 The standard error can also be expressed as a percentage of the estimate, and this is known as the relative standard error (RSE). RSEs at the industry subdivision and selected class level for Australia for selected data items representing the full range of data contained in this publication are shown in the table below. The relative standard errors of the selected estimates for the states and territories are mainly 5% or less. Detailed RSEs can be made available on request.
5 To illustrate the above, the estimate of sales and service income for total mining in 2006-07 was $115,480m. The RSE of the estimate is shown as 0.8%, giving a standard error of approximately $924m (rounded). This implies that there are two chances in three that, if all units had been included in the survey, an estimate in the range of $114,556m to $116,404m would have been obtained. Similarly, it implies that there are 19 chances in 20 (i.e., a confidence interval of 95%) that the estimate would have been within the range of $113,632m to $117,328m.
6 The size of the RSE may be a misleading indicator of the reliability of some of the estimates for trading profit, OPBT, EBITDA and IVA. Estimates of these variables may legitimately include positive and negative values, reflecting the financial performance of individual businesses. In these cases the aggregated estimate can be small relative to the contribution of individual businesses, resulting in a standard error which is large relative to the estimate.
Relative standard errors |
| |
| Employment at end of June | Wages and salaries | Sales and service income | Industry value added | |
| % | % | % | % | |
| |
06 Coal mining | 0.5 | 0.6 | 0.3 | 0.3 | |
07 Oil and gas extraction | 2.2 | 2.3 | 2.1 | 0.9 | |
0801 Iron ore mining | 0.3 | 0.1 | 0.4 | 0.5 | |
0803 Copper ore mining | 0.8 | 0.3 | 0.4 | 0.4 | |
0804 Gold ore mining | 4.5 | 4.1 | 7.5 | 12.4 | |
0805 Mineral sand mining | 2.9 | 4.7 | 2.1 | 2.7 | |
0807 Silver-lead-zinc ore mining | 4.1 | 6.9 | 3.6 | 3.6 | |
0802, 0806 and 0809 Bauxite
mining, nickel ore mining and
other metal ore mining | 2.5 | 3.2 | 1.5 | 1.7 | |
08 Metal ore mining | 1.2 | 1.4 | 1.2 | 1.4 | |
06-08 Total coal mining, oil and gas
extraction and metal ore mining | 0.7 | 0.7 | 0.7 | 0.6 | |
09 Non-metallic mineral mining
and quarrying | 8.0 | 8.8 | 4.7 | 5.2 | |
10 Exploration and other mining
support services | 4.5 | 5.0 | 6.0 | 13.0 | |
06-10 Total mining | 1.6 | 1.4 | 0.8 | 0.8 | |
| |
NON-SAMPLING ERROR
7 All data presented in this publication are subject to non-sampling error.
8 The imprecision due to sampling variability, which is measured by the standard error, should not be confused with inaccuracies that may occur because of inadequacies in available sources from which the population frame was compiled, imperfections in reporting by providers, errors made in collection such as in recording and coding data, and errors made in processing data. Inaccuracies of this kind are referred to collectively as non-sampling error and they may occur in any enumeration, whether a full census or a sample.
9 For the purpose of compiling the estimates in this publication, businesses in the ATO maintained population (see Technical Note 1) are coded to ANZSIC industry classes on the basis of the activity reported to the ATO when they registered for an ABN. There are a number of reasons why a business classified to any given ANZSIC industry on the ABS Business Register may not have been mainly engaged in activities associated with that industry during the 2006-07 reference year. This may be because of inaccurate or incomplete information at the time the business was registered or it may be because the business has changed activity, either temporarily or permanently.
10 Although it is not possible to quantify non-sampling error, every effort is made to reduce it to a minimum. Collection forms are designed to be easy to complete and assist businesses to report accurately. Efficient and effective operating procedures and systems are used to compile the statistics. The ABS compares data from different ABS (and non-ABS) sources relating to the one industry, to ensure consistency and coherence.
25/07/2008 Note: A minor formatting change has been made in paragraph 4.