4715.0.55.002 - Technical Manual: National Aboriginal and Torres Strait Islander Health Survey, Expanded CURF, 2004-05  
ARCHIVED ISSUE Released at 11:30 AM (CANBERRA TIME) 21/02/2014   
   Page tools: Print Print Page Print all pages in this productPrint All

CHAPTER 4 ABOUT THE CURF


FILE CONTENTS

The 2004-05 NATSIHS CURF contains the files listed below and is available through the RADL


Expanded CURF files

These files contain the data for the CURF in SAS for Windows format:

  • IHS05HHE.SD2 contains the Household level data
  • IHS05APE.SD2 contains (all) Persons in household level data
  • IHS05PNE.SD2 contains (selected) Person level data
  • IHS05ALE.SD2 contains Alcohol level data
  • IHS05CNE.SD2 contains Conditions level data
  • IHS05IDE.SD2 contains Injury damage level data
  • IHS05IPE.SD2 contains Body part injured level data

These files contain the data for the CURF in SPSS for Windows format:
  • IHS05HHE.sav contains the Household level data
  • IHS05APE.sav contains (all) Persons in household level data
  • IHS05PNE.sav contains (selected) Person level data
  • IHS05ALE.sav contains Alcohol level data
  • IHS05CNE.sav contains Conditions level data
  • IHS05IDE.sav contains Injury damage level data
  • IHS05IPE.sav contains Body part injured level data

These files contain the data for the CURF in Stata format:
  • IHS05HHE.dta contains the Household level data
  • IHS05APE.dta contains (all) Persons in household level data
  • IHS05PNE.dta contains (selected) Person level data
  • IHS05ALE.dta contains Alcohol level data
  • IHS05CNE.dta contains Conditions level data
  • IHS05IDE.dta contains Injury damage level data
  • IHS05IPE.dta contains Body part injured level data


Expanded CURF test files

The test files mirror the actual data files, but have random data. The test files can be utilised to trouble shoot SAS, SPSS or Stata code prior to submitting RADL jobs. Data from the test files will not match data from the actual 2004-05 NATSIHS CURF files.
  • Formats.SC2 is a SAS library containing formats for the test files

These files contain the test data in SAS for Windows format:
  • IHS05HHT.SD2 contains the Household level data
  • IHS05APT.SD2 contains (all) Persons in household level data
  • IHS05PNT.SD2 contains (selected) Person level data
  • IHS05ALT.SD2 contains Alcohol level data
  • IHS05CNT.SD2 contains Conditions level data
  • IHS05IDT.SD2 contains Injury damage level data
  • IHS05IPT.SD2 contains Body part injured level data

These files contain the test data in SPSS for Windows format
  • IHS05HHT.sav contains the Household level data
  • IHS05APT.sav contains (all) Persons in household level data
  • IHS05PNT.sav contains (selected) Person level data
  • IHS05ALT.sav contains Alcohol level data
  • IHS05CNT.sav contains Conditions level data
  • IHS05IDT.sav contains Injury damage level data
  • IHS05IPT.sav contains Body part injured level data

These file contain the test data in Stata format:
  • IHS05HHT.dta contains the Household level data
  • IHS05APT.dta contains (all) Persons in household level data
  • IHS05PNT.dta contains (selected) Person level data
  • IHS05ALT.dta contains Alcohol level data
  • IHS05CNT.dta contains Conditions level data
  • IHS05IDT.dta contains Injury damage level data
  • IHS05IPT.dta contains Body part injured level data


Information files
  • CONTENTS_CURF_EXPANDED.TXT contains the file contents of each of the eight levels, including type (whether character or numeric), length and labels.
  • FREQUENCIES_UNWEIGHTED_CURF_EXPANDED.TXT contains, for each of the eight levels, the data item code values and category labels with the unweighted frequencies of each value. This file is in plain text format.
  • FREQUENCIES_WEIGHTED_CURF_EXPANDED_SPERSON.TXT contains, for the (selected) Person level only, the data item code values and category labels with the weighted frequencies of each value. This file is in plain text format.
  • FREQUENCIES_WEIGHTED_CURF_EXPANDED_HHOLD.TXT contains, for the Household level only, the data item code values and category labels with the household weighted frequencies of each value. This file contains household weighted frequencies only for those households identified as including Indigenous persons. This file is in plain text format.
  • 47150.2004-05.pdf is an acrobat file that contains the National Aboriginal and Torres Strait Islander Health Survey, Australia, 2004-05, Summary publication.
  • 4715055002_2004-05.pdf is an acrobat file that contains this Technical Manual.
  • 4715055004_2004-05.pdf is an acrobat file that contains the National Aboriginal and Torres Strait Islander Health Survey: Users' Guide, 2004-05.


ABOUT THE MICRODATA

The 2004-05 NATSIHS Expanded CURF enables users to manipulate the data, produce tabulations and undertake statistical analysis to individual specifications.

Detailed information about the data collected, comments regarding data quality and other points to assist in using and interpreting the data are contained in the Users' Guide, which is available free of charge from the ABS web site. It is recommended that relevant parts of the Guide be read in conjunction with the use of the 2004-05 NATSIHS CURF.

The 2004-05 NATSIHS microdata is released under the Census and Statistics Act 1905 which has provision for the release of data in the form of unit records where the information is not likely to enable the identification of a particular person. Accordingly, there are no names or addresses of survey respondents on the CURF.

In addition there are some general points regarding the data appearing on the CURF which should be noted.

Adjustments have been made to the data in a small number of records in the course of confidentialising records included on the CURF. The nature of the changes made, and the relatively small number of records involved have ensured that the effects on the data for analysis purposes is considered negligible. The changes made include:
  • perturbation of reported $ income, with consequential effects on derived income variables;
  • swapping of some records between states/territories; and
  • removal of some non-selected person records for large household from the (all) Persons in household level.

In determining the long term medical conditions to be separately identified on the CURF, thresholds based on the number and type of observations in the survey have been used. The result was some collapsing of the categories for these items on the CURF as compared with the main data file. This in turn has effected the counts of some conditions.

Duplication may have occurred where, for example, a person reported two conditions, which on the main file are categorised separately (2 conditions) but these categories have been combined on the CURF (counted as one condition). Apparent duplicates, arising from this process, have been removed from the CURF where they are duplicates on both the ICD10 and ICPC. In cases where two or more conditions are collapsed in one item but are separate in the other, the multiple conditions have been left for the item which have separate categories, and a duplicate code (i.e. 999) has been produced for the collapsed item. For example, a person may have originally had two disc disorders. In the ICD10 they were collapsed to ICD10D=103. In the ICPC, they were coded to ICPCCI=47 (i.e. Musculoskeletal - Back and neck symptom/complaint) and ICPCCI=48 (i.e. Low back symptom/complaint). In order to keep both ICPC conditions, the second ICD10D code is given a duplicate code of 999. The ICPCCI also has a duplicate code of 999 for occasions where the ICD10 has two codes and the ICPC has one. These 'duplicate' codes should not be incorporated into analyses of conditions data.

Data items associated with the conditions data items (e.g. number of long term conditions) have been rederived as appropriate based on the classifications shown on the CURF. Where duplicates were removed, subsequent data items on that level were adjusted. Taking the condition example above with the original 2 conditions, if the item "Whether condition a result of an injury" was different then the entry with the lowest value (i.e. 1. Result of an injury) was kept.

Duplicate records have also been removed from the Injury damage and Body part injured levels due to the collapsing of the categories in the injury item.

As a consequence of steps undertaken to maintain confidentiality, aggregated data for Indigenous households/persons and non-Indigenous persons obtained from the 2004-05 NATSIHS CURF may be slightly different to that published or obtained through customised tables.

Steps to confidentialise the data made available on the CURF are taken in such a way as to insure the integrity of the data and optimise its content, while maintaining the confidentiality of respondents. Intending purchasers should ensure that the data they require are available on the 2004-05 NATSIHS CURF. Data collected in the survey but not contained on the CURF may be available in tabulated form on request.


DATA ITEM LIST STRUCTURE

A comprehensive list of data items for the CURF is available from the downloads tab. Data items have been grouped into broad topics for ease of use. Each data item is set up with the following categories:
      SAS name - This identifies the SAS name applied to the data item for the CURF.
      Label - This identifies the name of the data item.
      Categories - This identifies the data item category labels and numbering.
      File level - This identifies on which file level the data item is located (e.g. Household, Person, Alcohol)
      Main population - This identifies the target population of the data item.
      Multiple response item - This identifies whether an item is a multiple response item. See Chapter 3: Using the CURF Data for details regarding use of multiple response items.
      Remote/Non-remote release - This identifies whether an item has restricted geographic release. 'Non-remote' identifies items which can only be released for non-remote areas and are restricted in geography as per directions in Chapter 3: Using the CURF Data. 'Both' indicates that there are no geographic restrictions (other than usage restrictions i.e. no more than one geography can be used at a time).
      Indigenous/Non-Indigenous release - This identifies whether an item is available for the non-Indigenous population. 'Indigenous only' identifies items which are not available for the non-Indigenous population. 'Both' indicates that data is available for both Indigenous and non-Indigenous.
      Comments - This identifies issues the user should be aware of when using this item. Users should also consult the Users Guide for any quality or interpretation issues that should be considered.