4715.0.55.002 - Technical Manual: National Aboriginal and Torres Strait Islander Health Survey, Expanded CURF, 2004-05
ARCHIVED ISSUE Released at 11:30 AM (CANBERRA TIME) 21/02/2014
Page tools: Print Page Print All | ||
CHAPTER 4 ABOUT THE CURF
These files contain the data for the CURF in SPSS for Windows format:
These files contain the data for the CURF in Stata format:
Expanded CURF test files The test files mirror the actual data files, but have random data. The test files can be utilised to trouble shoot SAS, SPSS or Stata code prior to submitting RADL jobs. Data from the test files will not match data from the actual 2004-05 NATSIHS CURF files.
These files contain the test data in SAS for Windows format:
These files contain the test data in SPSS for Windows format
These file contain the test data in Stata format:
Information files
ABOUT THE MICRODATA The 2004-05 NATSIHS Expanded CURF enables users to manipulate the data, produce tabulations and undertake statistical analysis to individual specifications. Detailed information about the data collected, comments regarding data quality and other points to assist in using and interpreting the data are contained in the Users' Guide, which is available free of charge from the ABS web site. It is recommended that relevant parts of the Guide be read in conjunction with the use of the 2004-05 NATSIHS CURF. The 2004-05 NATSIHS microdata is released under the Census and Statistics Act 1905 which has provision for the release of data in the form of unit records where the information is not likely to enable the identification of a particular person. Accordingly, there are no names or addresses of survey respondents on the CURF. In addition there are some general points regarding the data appearing on the CURF which should be noted. Adjustments have been made to the data in a small number of records in the course of confidentialising records included on the CURF. The nature of the changes made, and the relatively small number of records involved have ensured that the effects on the data for analysis purposes is considered negligible. The changes made include:
In determining the long term medical conditions to be separately identified on the CURF, thresholds based on the number and type of observations in the survey have been used. The result was some collapsing of the categories for these items on the CURF as compared with the main data file. This in turn has effected the counts of some conditions. Duplication may have occurred where, for example, a person reported two conditions, which on the main file are categorised separately (2 conditions) but these categories have been combined on the CURF (counted as one condition). Apparent duplicates, arising from this process, have been removed from the CURF where they are duplicates on both the ICD10 and ICPC. In cases where two or more conditions are collapsed in one item but are separate in the other, the multiple conditions have been left for the item which have separate categories, and a duplicate code (i.e. 999) has been produced for the collapsed item. For example, a person may have originally had two disc disorders. In the ICD10 they were collapsed to ICD10D=103. In the ICPC, they were coded to ICPCCI=47 (i.e. Musculoskeletal - Back and neck symptom/complaint) and ICPCCI=48 (i.e. Low back symptom/complaint). In order to keep both ICPC conditions, the second ICD10D code is given a duplicate code of 999. The ICPCCI also has a duplicate code of 999 for occasions where the ICD10 has two codes and the ICPC has one. These 'duplicate' codes should not be incorporated into analyses of conditions data. Data items associated with the conditions data items (e.g. number of long term conditions) have been rederived as appropriate based on the classifications shown on the CURF. Where duplicates were removed, subsequent data items on that level were adjusted. Taking the condition example above with the original 2 conditions, if the item "Whether condition a result of an injury" was different then the entry with the lowest value (i.e. 1. Result of an injury) was kept. Duplicate records have also been removed from the Injury damage and Body part injured levels due to the collapsing of the categories in the injury item. As a consequence of steps undertaken to maintain confidentiality, aggregated data for Indigenous households/persons and non-Indigenous persons obtained from the 2004-05 NATSIHS CURF may be slightly different to that published or obtained through customised tables. Steps to confidentialise the data made available on the CURF are taken in such a way as to insure the integrity of the data and optimise its content, while maintaining the confidentiality of respondents. Intending purchasers should ensure that the data they require are available on the 2004-05 NATSIHS CURF. Data collected in the survey but not contained on the CURF may be available in tabulated form on request. DATA ITEM LIST STRUCTURE A comprehensive list of data items for the CURF is available from the downloads tab. Data items have been grouped into broad topics for ease of use. Each data item is set up with the following categories:
Label - This identifies the name of the data item. Categories - This identifies the data item category labels and numbering. File level - This identifies on which file level the data item is located (e.g. Household, Person, Alcohol) Main population - This identifies the target population of the data item. Multiple response item - This identifies whether an item is a multiple response item. See Chapter 3: Using the CURF Data for details regarding use of multiple response items. Remote/Non-remote release - This identifies whether an item has restricted geographic release. 'Non-remote' identifies items which can only be released for non-remote areas and are restricted in geography as per directions in Chapter 3: Using the CURF Data. 'Both' indicates that there are no geographic restrictions (other than usage restrictions i.e. no more than one geography can be used at a time). Indigenous/Non-Indigenous release - This identifies whether an item is available for the non-Indigenous population. 'Indigenous only' identifies items which are not available for the non-Indigenous population. 'Both' indicates that data is available for both Indigenous and non-Indigenous. Comments - This identifies issues the user should be aware of when using this item. Users should also consult the Users Guide for any quality or interpretation issues that should be considered. Document Selection These documents will be presented in a new window.
|