USING THE CURF
ABOUT THE CURF
The data included in the PIAAC CURF are released under the provisions of the Census and Statistics Act 1905. This legislation allows the Australian Statistician to release unit record data, or microdata, provided this is done "in a manner that is not likely to enable the identification of a particular person or organisation to which it relates."
The ABS ensures the confidentiality of the data by:
- removing name, address and any other information that might uniquely identify any individual
- changing a small number of values - particularly unusual values - and removing very unusual records
- controlling the detail available for all records on the CURF
- perturbing or randomly adjusting income data
- excluding some data items that were collected
- controlling the modes of access to restrict access to more detailed data
- placing restrictions on how the data are used, supported by information in the User Manual: Responsible Use of ABS CURFs, the undertaking signed by the head of each organisation, and the terms and conditions signed by each user.
As a result, data on the CURF will not exactly match other previously published estimates. Any changes to the distribution of values are not significant and the statistical validity of aggregate data is not affected.
IDENTIFIERS
Each person record has a unique, randomly assigned identifier, ABSPID.
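The uniqueness of ABSPID can be checked after loading the data. The following is a minimal sketch in Python, assuming the records are read as dictionaries keyed by column name; the sample records and the ITEM_A column are invented for illustration and are not taken from the CURF.

```python
import csv
import io
from collections import Counter


def find_duplicate_ids(rows, id_field="ABSPID"):
    """Return the identifier values that occur on more than one record."""
    counts = Counter(row[id_field] for row in rows)
    return sorted(value for value, n in counts.items() if n > 1)


# Invented sample records (assumption); the real CURF has many more columns.
sample = io.StringIO("ABSPID,ITEM_A\n1001,1\n1002,2\n1003,1\n")
duplicates = find_duplicate_ids(csv.DictReader(sample))
print(duplicates)  # an empty list means every identifier is unique
```

The same function could be applied to rows read from the comma delimited data file with csv.DictReader, in place of the in-memory sample used here.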
CURF FILE NAMES
The PIAAC Basic CURF can be accessed on CD-ROM and is available in SAS, SPSS and STATA formats, as well as comma-delimited text. The CURF comprises the following files:
Data files
- PIAAC12B.csv contains the data for the CURF in comma-delimited ASCII text format
- PIAAC12B.sas7bdat contains the data for the CURF in SAS format
- PIAAC12B.sav contains the data for the CURF in SPSS format
- PIAAC12B.dta contains the data for the CURF in STATA format.
Information files
- The Data item list contains all the data items, including details of categories and code values, that are available on the Basic CURF.
- The Formats file is a SAS library containing formats.
- The Frequency file contains data item code values and category labels with weighted person frequencies of each value. This file is in plain text format.
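Weighted person frequencies of the kind listed in the Frequency file can be reproduced by summing the person weight over the records that share each code value. The sketch below illustrates the calculation in Python; the column names ITEM_A and WEIGHT and the sample values are hypothetical, so the actual data item and weight names should be taken from the Data item list.

```python
import csv
import io
from collections import defaultdict


def weighted_frequencies(rows, item, weight):
    """Sum the weight column for each distinct code value of a data item."""
    totals = defaultdict(float)
    for row in rows:
        totals[row[item]] += float(row[weight])
    return dict(totals)


# Invented sample records; ITEM_A and WEIGHT are hypothetical column names.
sample = io.StringIO(
    "ABSPID,ITEM_A,WEIGHT\n"
    "1001,1,250.0\n"
    "1002,2,100.5\n"
    "1003,1,300.0\n"
)
freqs = weighted_frequencies(csv.DictReader(sample), "ITEM_A", "WEIGHT")
print(freqs)
```

Comparing totals computed this way against the Frequency file is one way to confirm that the data have been read correctly.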
This page last updated 14 February 2013