Page tools: Print Page Print All | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
This document was added or updated on 28/07/2015. USING THE EXPANDED CURF
The nature of the changes made, and the relatively small number of records involved ensure that the effects on data for analysis purposes is considered negligible. The changes mean that estimates produced from the CURF may differ from those published in Australian Aboriginal and Torres Strait Islander Health Survey: First Results, Australia - 2012-13 (cat. no. 4727.0.55.001) or subsequent NATSIHS-related publications. Detailed information about the data collected, comments regarding data quality and other points to assist in using and interpreting the data are contained in Australian Aboriginal and Torres Strait Islander Health Survey: Users’ Guide, 2012-13 (cat. no. 4727.0.55.002). The 2012-13 Aboriginal and Torres Strait Islander Health Survey, Detailed Conditions and Other Health Data is available via two Expanded Confidentialised Unit Record Files (CURFs):
The primary differences between these CURFs are the:
State/Territory Expanded CURF This CURF contains a data item (STATEE) which identified each state and territory separately, except Tasmania and the ACT. Due to confidentiality considerations, the samples from Tasmania and the ACT have been combined into a single category of Tas/ACT. This data item is located on the household level file. In addition, this CURF contains the Socio-economic Index of Relative Disadvantage (SEIFA - deciles) variable. This CURF contains items that were collected in both non-remote and remote areas. The structure of this CURF is: State by ASGS Remoteness Expanded CURF This CURF contains a broad National Remoteness data item (RAECURF) and a special data item (STATREM), consisting of 16 output categories which comprise selected cross-classifications of state/territory by remoteness, where sample and population estimate sizes permit. Output categories can be found in the Expanded CURF data item list located in the Downloads tab of this product. These two data items are available on the household level file. In addition, this CURF contains the Socio-economic Index of Relative Disadvantage (SEIFA - deciles) variable. This CURF contains items that were collected in both non-remote and remote areas as well as items collected in non-remote areas only or remote areas only. Data items are identified in the Data item list for which remoteness area they were collected in and users should ensure they reference this to ensure the population they are representing is correct. The structure of this CURF is: ACCESSING EXPANDED CURFS Expanded CURFs can only be accessed via the Remote Access Data Laboratory (RADL). Users must have applied for use of the RADL prior to using the Expanded CURF microdata. Details on the RADL can be found here - Remote Access Data Laboratory. COUNTS AND WEIGHTS NUMBER OF RECORDS BY LEVEL, NATSIHS 2012-13 EXPANDED CURF
(b) Comprising 9,317 people, including people who have no condition (c) Comprising 796 people who sustained an injury in most recent event in last 4 weeks (d) Comprising biomedical participants 18 years and over (e) Applicable to State/Territory by ASGS Remoteness CURF only (f) Comprising 3 days of data for 1,730 children aged 5-17 years living in non-remote areas (g) Comprising 3 days of activity data for 1,730 children aged 5-17 years living in non-remote areas, including those who did no physical activity Weights and Hierarchical Files Weight Variables There are three weight variables on the CURF files: Household Weight PAA (IHSFHHWT) - Household level - Benchmarked to produce Aboriginal and Torres Strait Islander household estimates. This weight is located on the Household level file Person Weight PAA (IHSFINWT) - Person level - Benchmarked to the total Aboriginal and Torres Strait Islander population. This weight is located on the Person level. Biomedical Weight PAA (IHMSPERW) - Biomedical level - Benchmarked to the total population aged 18 years and over. This weight is located on the Biomedical level. Note that this level also contains non-biomedical participant records however their biomedical weight is set to 0 so they won't contribute to estimates. When using biomedical variables in conjunction with other variables on the biomedical level or with variables from other levels, the biomedical weight should be used. There is no weight associated with the Persons in household level. This level is available in order to produce compositional information about the household (e.g. Number of persons in household aged 4-14 years) which can then either be used with the household weight to represent for example the number of households with at least two persons aged 4-14 years, or with the person weight to represent the number of people living in household that contain at least two persons aged 4-14 years. There are also no weights associated with the other levels. This is because the records are repeated for each person. If, for example, IHSFINWT is merged onto the Conditions level, it will be attached to each condition record and therefore be repeated for each person where they have more than one condition. This should be considered when producing tables. See Copying information across levels below for more information. For more information about weights see Reliability of Estimates below. Using Weights The NATSIHS is a sample survey. To produce estimates for the in-scope population you must use weight fields in your calculations. The 'Biomedical Weight PAA' must be used for all tables where a biomedical level data item is being used. This includes where biomedical items are being used with items from other levels. Which weight, if any, is used on data at non-benchmarked levels will affect the result as shown in the examples below:
(b) Each person selected in the survey has at least one record per level below the Person level. Weights produced for these levels, without any filtering to restrict to the applicable population, therefore includes the weights of persons (or households) who are not applicable to the level or characteristic. (c) Weighted estimate when using biomedical weight. (d) State/Territory by ASGS Remoteness CURF only. IDENTIFIERS Every record on each level of the file is uniquely identified. The identifiers ABSHID, ABSAID, ABSPID, ABSBID, ABSTID, ABSCID, ABSIID, ABSUID, ABSKID, and ABSDID appear on all levels of the file. Where the information for the identifier is not relevant for a level, it has a value of 0. See below for details on which IDs are relevant for which levels. Each household has a unique thirteen digit random identifier, ABSHID. This identifier appears on the household level and is repeated on each level on each record pertaining to that household. The combination of identifiers uniquely identifies a record at a particular level as shown below. State/Territory 1. Household = ABSHID 2. Persons in Household = ABSHID, ABSAID 3. Person = ABSHID, ABSAID, ABSPID 4. Alcohol Day = ABSHID, ABSAID, ABSPID, ABSBID 5. Alcohol Type = ABSHID, ABSAID, ABSPID, ABSBID, ABSTID 6. Conditions = ABSHID, ABSAID, ABSPID, ABSCID 7. Most Recent Injury = ABSHID, ABSAID, ABSPID, ABSIID 8. Biomedical = ABSHID, ABSAID, ABSPID, ABSUID State/Territory by ASGS Remoteness 1. Household = ABSHID 2. Persons in Household = ABSHID, ABSAID 3. Person = ABSHID, ABSAID, ABSPID 4. Alcohol Day = ABSHID, ABSAID, ABSPID, ABSBID 5. Alcohol Type = ABSHID, ABSAID, ABSPID, ABSBID, ABSTID 6. Conditions = ABSHID, ABSAID, ABSPID, ABSCID 7. Most Recent Injury = ABSHID, ABSAID ABSPID, ABSIID 8. Biomedical = ABSHID, ABSAID, ABSPID, ABSUID 9. Child 5-17 Years Physical Activity (NR Only) = ABSHID ABSAID ABSPID ABSKID 10. Child 5-17 Years Physical Activity Detailed (NR Only) = ABSHID ABSAID ABSPID ABSKID ABSDID ABSHID assists with linking together people of the same household and also with household characteristics such as geography (located on the household level). The combination of ABSHID, ABSAID, ABSPID and ABSCID identifies each individual condition record a person has. When merging data with a level above, only those identifiers relevant to the level above are required. However, when merging, for example, the conditions level with the person level, the data on the person level will duplicate for each condition. See Copying information across levels below for more information. COPYING INFORMATION ACROSS LEVELS For information regarding whether a level is higher or lower than another, refer to the structure picture located in the About the Expanded CURF section located above. Lower level to a higher level The following SAS code is an example of copying information from a lower level to a level above. PROC SORT DATA=IHS12SCO; /* Condition level */ BY ABSHID ABSAID ABSPID; DATA TTLLT (KEEP=ABSHID ABSAID ABSPID LONGTERM NOTCURR); SET IHS12SCO; BY ABSHID ABSAID ABSPID; /* This step will go through each Condition record within each unique combination of ABSHID, ABSAID, and ABSPID. Only requires identifier located on level moving up to */ RETAIN LONGTERM NOTCURR; IF FIRST.ABSPID THEN DO; LONGTERM=0; NOTCURR=0; END; /* Note as the file is sorted by three IDs, reference to FIRST is only needed for the last part of the ID */ IF CONDSTAT=1 THEN LONGTERM=LONGTERM+1; /*starts a count of the number of diagnosed long term conditions*/ IF CONDSTAT=3 THEN NOTCURR=NOTCURR+1; /*starts a count of the number of diagnosed conditions that are not current*/ IF LAST.ABSPID THEN OUTPUT; /* This outputs the totals found within each unique combination of ABSHID, ABSAID, ABSPID */ PROC SORT DATA=IHS12SSP; /* PERSONS level - the level above Condition */ BY ABSHID ABSAID ABSPID; DATA MRGFILES; MERGE TTLLT IHS12SSP; BY ABSHID ABSAID ABSPID; PROC FREQ DATA=MRGFILES; /*This procedure gives a weighted count of the data copied up from the Condition level to the Actions level */ TABLES LONGTERM NOTCURR*SEX; /* LONGTERM will be a weighted frequency table. NOTCURR will be in a weighted frequency table cross-tabbed by Sex */ WEIGHT IHSFINWT; RUN; The new variables LONGTERM and NOTCURR produce the number of collected conditions a person has that are either diagnosed/longterm or diagnosed/not current. So they are meaningful on the Person level, where only one value per Person is produced for each variable. Merging these new items onto the Person level now allows them to be analysed with any other items on the person level and for weighted estimates to be correctly produced. Higher level to a lower level The following SAS code is an example of copying information from a higher level to a level below DATA PERSON (KEEP=ABSHID ABSAID ABSPID AGEEC SEX IHSFINWT); SET IHS12SSP; PROC SORT DATA=PERSON; BY ABSHID ABSAID ABSPID; PROC SORT DATA= IHS12SCO; BY ABSHID ABSAID ABSPID; DATA MRGFILES2; MERGE IHS12SCO PERSON; BY ABSHID ABSAID ABSPID; PROC FREQ DATA=MRGFILES2; /*This procedure gives a weighted count of the CONDSTAT items located on the Condition level by the SEX variable and weight brought down from the Persons level. */ TABLES SEX*CONDSTAT; WEIGHT IHSFINWT; RUN; This merge matches one Person record to many Conditions records. So, the data items copied from the person level ('AGEEC' and 'SEX' and 'IHSFINWT' in the example) will be repeated for the counting unit of the level they have been added to, Conditions in this case. Each Conditions record will therefore receive the Age and Sex and Person Weight of the Person they belong to. Weighted estimates will now be influenced by people who have more than one condition as their weight will be applied to multiple conditions. RELIABILITY OF ESTIMATES As the survey was conducted on a sample of private households in Australia, it is important to take account of the method of sample selection when deriving estimates from the CURF. This is particularly important as a person's chance of selection in the survey varied depending on the state or territory in which the person lived. If these chances of selection are not accounted for, by use of appropriate weights, the results will be biased. For details on the weighting process see Weighting, Benchmarks and Estimation procedures in Australian Aboriginal and Torres Strait Islander Health Survey: Users' Guide, 2012-13 (cat. no. 4727.0.55.002). Each person record has a main weight (IHSFINWT). This weight indicates how many population units are represented by the sample units. When producing estimates of sub-populations from the CURF, it is essential that they are calculated by adding the weights of persons in each category and not just by counting the sample number in each category. If each person's weight were to be ignored when analysing the data to draw inferences about the population, then no account would be taken of a person's chance of selection or of different response rates across population groups, with the result that the estimates produced could be biased. The application of weights ensures that estimates will conform to an independently estimated distribution of the population by age, by sex, etc. rather than to the distributions within the sample itself. Each person record on the CURF contains 60 replicate weights in addition to the main weight. Replicate weights can be used to calculate measures of sampling error. For details on sampling error calculations and replicate weights see the Technical Note in the Australian Aboriginal and Torres Strait Islander Health Survey: Users' Guide, 2012-13 (cat. no. 4727.0.55.002). EXPANDED CURF FILES SAS files These files contain the data for the CURF in SAS format. STATE/TERRITORY IHS12SHH.sas7bdat contains the Household level data IHS12SAP.sas7bdat contains the Persons in Household level data (All Persons) IHS12SSP.sas7bdat contains the Person level data (Selected Person) IHS12SAD.sas7bdat contains the Alcohol Day level data IHS12SAT.sas7bdat contains the Alcohol Type level data IHS12SCO.sas7bdat contains the Condition level data IHS12SRI.sas7bdat contains the Most Recent Injury level data IHS12SBI.sas7bdat contains the Biomedical level data STATE/TERRITORY BY ASGS REMOTENESS IHS12RHH.sas7bdat contains the Household level data IHS12RAP.sas7bdat contains the Persons in Household level data (All Persons) IHS12RSP.sas7bdat contains the Person level data (Selected Person) IHS12RAD.sas7bdat contains the Alcohol Day level data IHS12RAT.sas7bdat contains the Alcohol Type level data IHS12RCO.sas7bdat contains the Condition level data IHS12RRI.sas7bdat contains the Most Recent Injury level data IHS12RCP.sas7bdat contains the Child 5-17 years Physical Activity level (NR Only) data IHS12RCD.sas7bdat contains the Child 5-17 years Physical Activity detailed level (NR Only) data IHS12RBI.sas7bdat contains the Biomedical level data SPSS files These files contain the data for the CURF in SPSS format. STATE/TERRITORY IHS12SHH.sav contains the Household level data IHS12SAP.sav contains the Persons in Household level data (All Persons) IHS12SSP.sav contains the Person level data (Selected Person) IHS12SAD.sav contains the Alcohol Day level data IHS12SAT.sav contains the Alcohol Type level data IHS12SCO.sav contains the Condition level data IHS12SRI.sav contains the Most Recent Injury level data IHS12SBI.sav contains the Biomedical level data STATE/TERRITORY BY ASGS REMOTENESS IHS12RHH.sav contains the Household level data IHS12RAP.sav contains the Persons in Household level data (All Persons) IHS12RSP.sav contains the Person level data (Selected Person) IHS12RAD.sav contains the Alcohol Day level data IHS12RAT.sav contains the Alcohol Type level data IHS12RCO.sav contains the Condition level data IHS12RRI.sav contains the Most Recent Injury level data IHS12RCP.sav contains the Child 5-17 years Physical Activity level data IHS12RCD.sav contains the Child 5-17 years Physical Activity detailed level data IHS12RBI.sav contains the Biomedical level data STATA files These files contain the data for the CURF in STATA format. STATE/TERRITORY IHS12SHH.dta contains the Household level data IHS12SAP.dta contains the Persons in Household level data (All Persons) IHS12SSP.dta contains the Person level data (Selected Person) IHS12SAD.dta contains the Alcohol Day level data IHS12SAT.dta contains the Alcohol Type level data IHS12SCO.dta contains the Condition level data IHS12SRI.dta contains the Most Recent Injury level data IHS12SBI.dta contains the Biomedical level data STATE/TERRITORY BY ASGS REMOTENESS IHS12RHH.dta contains the Household level data IHS12RAP.dta contains the Persons in Household level data (All Persons) IHS12RSP.dta contains the Person level data (Selected Person) IHS12RAD.dta contains the Alcohol Day level data IHS12RAT.dta contains the Alcohol Type level data IHS12RCO.dta contains the Condition level data IHS12RRI.dta contains the Most Recent Injury level data IHS12RCP.dta contains the Child 5-17 years Physical Activity level data IHS12RCD.dta contains the Child 5-17 years Physical Activity detailed level data IHS12RBI.dta contains the Biomedical level data Information files FORMATS.sas7bcat is a SAS library containing formats. There is one produced for the State/Territory CURF and one for the State/Territory by ASGS Remoteness CURF. Frequency files The following plain text format files contain data item code values and category labels at each level, with weighted and unweighted frequencies for each value. STATE/TERRITORY ECURF IHS12S Household Freq.txt contains frequencies for Household level items ECURF IHS12S Persons in Household Freq.txt contains frequencies for Persons in Household level items ECURF IHS12S Person Freq.txt contains frequencies for Person level items ECURF IHS12S Alcohol Day Freq.txt contains frequencies for Alcohol Day level items ECURF IHS12S Alcohol Type Freq.txt contains the weighted frequencies for the Alcohol Type level items ECURF IHS12S Condition Freq.txt contains frequencies for Condition level items ECURF IHS12S Most Recent Injury Freq.txt contains frequencies for the Most Recent Injury level items ECURF IHS12S Biomedical Weighted Freq.txt contains the weighted frequencies for the Biomedical level items ECURF IHS12S Biomedical Unweighted Freq.txt contains the unweighted frequencies for the Biomedical level items STATE/TERRITORY BY ASGS REMOTENESS ECURF IHS12R Household Freq.txt contains frequencies for Household level items ECURF IHS12R Persons in Household Freq.txt contains frequencies for Persons in Household level items ECURF IHS12R Person Freq.txt contains frequencies for Person level items ECURF IHS12R Alcohol Day Freq.txt contains frequencies for Alcohol Day level items ECURF IHS12R Alcohol Type Freq.txt contains the weighted frequencies for the Alcohol Type level items ECURF IHS12R Condition Freq.txt contains frequencies for Condition level items ECURF IHS12R Most Recent Injury Freq.txt contains frequencies for the Most Recent Injury level items ECURF IHS12R Biomedical Weighted Freq.txt contains the weighted frequencies for the Biomedical level items ECURF IHS12R Biomedical Unweighted Freq.txt contains the unweighted frequencies for the Biomedical level items ECURF IHS12R Child Physical Activity Freq.txt contains frequencies for Child 5-17 years Physical Activity level data ECURF IHS12R Child Physical Activity Detailed Freq.txt contains frequencies for Child 5-17 years Physical Activity detailed level data Document Selection These documents will be presented in a new window.
|