Page tools: Print Page Print All | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
USING THE CURF
Table 1 shows the change to the total estimated population, and population estimates by state or territory, as previously published and from the CURF. It can be seen that, proportionally, the largest impact of the confidentialising process is in relation to Northern Territory estimates. Table 1: Published (cat.no. 4430.0) population estimates (weighted) compared to the CURF estimates
However, Table 2 shows that the impact of the confidentialising actions on high level target population estimates was small. Table 2: Published (cat.no. 4430.0) disability and carer prevalence, and percentage of population aged 65 years or more, compared to the CURF
Steps to confidentialise the datasets made available on the CURF are undertaken in such a way as to ensure the integrity of the datasets and optimise the content, while maintaining the confidentiality of respondents. Intending purchasers should ensure that the data they require at the level of detail they require are available on the CURF; data obtained in the survey, but not contained on the CURF may be available in TableBuilder or in tabulated form on request. The Data Item Lists document on the Summary tab contains information about the list of data items, which is available as an Excel spreadsheet on the Downloads tab. RECORD COUNTS Table 3 shows the number of records on each level for the CURF dataset. Table 3: Number of records by level for the SDAC 2012 CURF
There is a series of identifiers that can be used on records at each level of the file. File level identifiers The identifiers ABSHID, ABSFID, ABSIID, ABSPID, ABSCID, ABSRSID, ABSSAID, ABSRCID, ABSBAID and ABSAPID appear on all levels of the file (as they are needed to create a hierarchical CSV file). Where the information for the identifier is not relevant for a level, it has a value of 0. Each household has a unique twelve digit random identifier, ABSHID. This identifier appears on the Household level and is repeated on every other level. On the family level, each family within the household is numbered sequentially. The item ABSFID containing this family number also appears on the income unit and person level. The combination of household and family identifier uniquely identifies the family. On the income unit level, each income unit within the family is numbered sequentially. This income unit number is also stored at the person level, as ABSIID. A combination of household, family and income unit identifiers uniquely identifies an income unit. Records from the establishments component do not exist at the family and income unit levels. A person identifier (ABSPID) is assigned to each person within a combination of household, family and income unit identifiers. It should be noted that this is NOT the person number within a household, hence there may be more than one person with ABSPID=1 in a household. On the person level, the combination of household, family, income unit and person identifier (ABSPID) uniquely identifies the person. This information is also recorded on each lower level. The broad areas of activity level and the assistance providers level also have an identifier (ABSBAID and ABSAPID, respectively) which, when combined with ABSHID, ABSFID, ABSIID and ABSPID, enables information to be copied between levels nine and ten.
Collection component identifiers The person level contains two items indicating whether the data were collected in the household or cared accommodation component. These items are POPESTAB and DWELTYPE. The item DWELTYPE has the categories 'Private dwelling' and 'Special dwelling' for all dwellings in the household component and 'Not applicable' for the cared accommodation component. The item POPESTAB has the category 'Living in an establishment' for the cared accommodation component and 'Not in this population' for the household component. These items can be copied from the person level to other levels of the file as required. Copying information across levels Identifiers can be used to copy information from one level of the file to another. The following SAS code (or equivalent) can be used to copy information from a lower level to a level above: PROC SORT DATA=DAC12RES; * Restrictions level file; BY ABSHID ABSFID ABSIID ABSPID ; RUN ; DATA SUMMARY (KEEP=ABSHID ABSFID ABSIID ABSPID MENTUND) ; SET DAC12RES; BY ABSHID ABSFID ABSIID ABSPID ; RETAIN MENTUND ; IF FIRST.ABSPID THEN MENTUND=2 ; * Initialise variable – Does not have learning or understanding difficulties or mental illness ; IF RESTRCT=7 OR RESTRCT=14 THEN MENTUND=1 ; * Has learning or understanding difficulties or mental illness ; IF LAST.ABSPID THEN OUTPUT ; RUN ; PROC SORT DATA = DAC12PER; * Person level file ; BY ABSHID ABSFID ABSIID ABSPID ; RUN ; DATA MERGE_FILE ; MERGE SUMMARY (IN=A) DAC12PER (IN=B) ; BY ABSHID ABSFID ABSIID ABSPID ; IF B AND NOT A THEN MENTUND=2 ; * Records which are not on the SUMMARY file ; RUN ; The SUMMARY file only keeps the last record for each person on the DAC12RES file so the merge is a one to one match of person records on the SUMMARY file with records on the DAC12PER file. Some DAC12PER records will have no matching records on the SUMMARY file (as the person has no restrictions). For these records the statement 'If B and not A then MENTUND=2 ;' will allocate the code 2 because they don't have any restrictions. This method allows summary information from one level (e.g. Restrictions) to be used on the level above it in the hierarchy (e.g. Person). The item generated above can now be cross classified by any number of items on the person level file. The following SAS code (or equivalent) can be used to copy information from a higher level to a level below: PROC SORT DATA=DAC12RES; * Restrictions level file; BY ABSHID ABSFID ABSIID ABSPID; RUN ; PROC SORT DATA = DAC12PER; * Person level file; BY ABSHID ABSFID ABSIID ABSPID; RUN; DATA MERGE_FILE ; MERGE DAC12RES (IN=A) DAC12PER (KEEP=ABSHID ABSFID ABSIID ABSPID SEX AGEPB IN=B); BY ABSHID ABSFID ABSIID ABSPID; IF A AND B THEN OUTPUT; * Only keeps records which are present on both files; RUN; Unlike the previous merge, this merge will match one DAC12PER record to many DAC12RES records. The statement 'If A and B then OUTPUT;' ensures that only records present on both files are kept. If this statement was not used then DAC12PER records without a corresponding DAC12RES record would appear with a missing value for all DAC12RES data items. Note that the data items copied from the DAC12PER level will now have the counting unit for the level they have been added to, restrictions in this case. Combining carer and recipient data Combining carer and recipient data can sometimes be confusing, both in selecting an appropriate item and in understanding the counting unit. For example, if your interest is in people with a disability, and you want to analyse them by their main condition, such as dementia or arthritis, etc., using the item 'Main disabling condition of main care recipient' and the person weight from the person level identifies a population of primary carers whose main recipient co-resides with them and has dementia, or arthritis, or some other condition as a main condition. Using an item such as 'Disability status' with this item provides information on the disability status of persons who are primary carers of people with the specified condition. It is, therefore, important when using items relating to carers and care recipients to pay particular attention to the populations for the items used. The recipient level includes identifiers for the carer (ABSHID, ABSFID, etc.) as well as for the recipient(s) of care (MAPHHDRX, MAPFAMRX, etc.) to allow for data to be copied from other levels. Person level carer data can be added using the ABS identifiers as per other levels. The following SAS code (or equivalent) provides an example of how to add person level data for the recipient of care. DATA DAC12PER; SET DAC12PER; * Person level file; RECIPHID = put(ABSHID,13.); RECIPFID = put(ABSFID,2.); RECIPIID = put(ABSIID,2.); RECIPPID = put(ABSPID,2.); RUN; DATA DAC12REC; SET DAC12ALR; * Recipient level file; RECIPHID = put(MAPHHDRX,13.); RECIPFID = put(MAPFAMRX,2.); RECIPIID = put(MAPINCRX,2.); RECIPPID = put(MAPPSNRX,2.); RUN; PROC SORT DATA=DAC12PER; BY RECIPHID RECIPFID RECIPIID RECIPPID; RUN; PROC SORT DATA=DAC12REC; BY RECIPHID RECIPFID RECIPIID RECIPPID; RUN; DATA PERSON_RECIP; MERGE DAC12REC (IN=A) DAC12PER (IN=B); BY RECIPHID RECIPFID RECIPIID RECIPPID; IF A AND B; * Only keeps records which are present on both files; RUN; MULTI-RESPONSE ITEMS ON THE CURF A number of questions included in the survey allowed respondents to provide one or more responses. Each response category for one of these 'multi-response questions' (or data items) is basically treated as a separate data item. On the CURF, these data items have the same general data item identifier (SASName) but are each suffixed with a letter – A for the first response, B for the second response, C for the third response, D for the fourth response and so on. For example, the multi-response data item 'Purpose for computer use at home in the last 12 months' (with a general SASName of COMPRCM – see data item list), has six response categories. Consequently, six data items have been produced - COMPRCMA, COMPRCMB, COMPRCMC, COMPRCMD, COMPRCME and COMPRCMF. Each data item in the series (i.e. COMPRCMA -- COMPRCMF) will have two response codes: A 'Yes' response (for the first in the series (code 1), for the second in the series (code 2) etc.) and a 'Null' response (code 0) indicating that the response was not relevant for the respondent. The last data item in the series will represent a 'Not Applicable' response (i.e. value of last character in series) which comprises the respondents not asked the questions (e.g. COMPRCMF with values of 0 or 6). It should be noted that the sum of individual multi-response categories will be greater than the population or number of people applicable to the particular data item as respondents are able to select more than one response. Multi-response data items can be identified in the data item list as SASNames followed by a range of letters in brackets; for example, COMPRCM(A-F). CURF DATA FILES The 2012 Basic CURF can be accessed on CD-ROM, or via the RADL, and is available in SAS, SPSS and STATA formats. The CURF comprises the following files: ASCII text format files These files contains the raw confidentialised survey data in hierarchical comma delimited ASCII text format. DAC12B.CSV contains all levels DAC12HH.csv contains the Household level data DAC12FAM.csv contains the Family level data DAC12IU.csv contains the Income Unit level data DAC12PER.csv contains the Person level data DAC12CON.csv contains the All Conditions level data DAC12RES.csv contains the Restrictions level data DAC12SPA.csv contains the Specific Activities level data DAC12ALR.csv contains the All Recipients level data DAC12BRA.csv contains the Broad Activities level data DAC12PAS.csv contains the Assistance Providers level data SAS files These files contain the data for the CURF in SAS format. DAC12HH.SAS7BDAT contains the Household level data DAC12FAM.SAS7BDAT contains the Family level data DAC12IU.SAS7BDAT contains the Income Unit level data DAC12PER.SAS7BDAT contains the Person level data DAC12CON.SAS7BDAT contains the All Conditions level data DAC12RES.SAS7BDAT contains the Restrictions level data DAC12SPA.SAS7BDAT contains the Specific Activities level data DAC12ALR.SAS7BDAT contains the All Recipients level data DAC12BRA.SAS7BDAT contains the Broad Activities level data DAC12PAS.SAS7BDAT contains the Assistance Providers level data SPSS files These files contain the data for the CURF in SPSS format. DAC12HH.SAV contains the Household level data DAC12FAM.SAV contains the Family level data DAC12IU.SAV contains the Income Unit level data DAC12PER.SAV contains the Person level data DAC12CON.SAV contains the All Conditions level data DAC12RES.SAV contains the Restrictions level data DAC12SPA.SAV contains the Specific Activities level data DAC12ALR.SAV contains the All Recipients level data DAC12BRA.SAV contains the Broad Activities level data DAC12PAS.SAV contains the Assistance Providers level data STATA files These files contain the data for the CURF in STATA format. DAC12HH.DTA contains the Household level data DAC12FAM.DTA contains the Family level data DAC12IU.DTA contains the Income Unit level data DAC12PER.DTA contains the Person level data DAC12CON.DTA contains the All Conditions level data DAC12RES.DTA contains the Restrictions level data DAC12SPA.DTA contains the Specific Activities level data DAC12ALR.DTA contains the All Recipients level data DAC12BRA.DTA contains the Broad Activities level data DAC12PAS.DTA contains the Assistance Providers level data INFORMATION FILES Data item list The Data item list contains all the data items, including details of categories and code values, that are available on the Basic CURF. Formats file This file is a SAS library containing formats. Example code DAC12B.SAS provides as example of how data can be loaded into SAS for Windows from DAC12B.CSV and can assist in creating code for your own analysis package. Frequency files Household (HH) A file containing documentation of the Household level data. Data item code values and category labels are provided with weighted household frequencies of each value. This file is in plain text format. Family (FAM) A file containing documentation of the Family level data. Data item code values and category labels are provided with weighted household frequencies of each value. This file is in plain text format. Income unit (IU) A file containing documentation of the Income Unit level data. Data item code values and category labels are provided with weighted household frequencies of each value. This file is in plain text format. Person (PER) A file containing documentation of the Person level data. Data item code values and category labels are provided with weighted person frequencies of each value. This file is in plain text format. All Conditions (CON) A file containing documentation of the All Conditions level data. Data item code values and category labels are provided with weighted person frequencies of each value. This file is in plain text format. Restrictions (RES) A file containing documentation of the Restrictions level data. Data item code values and category labels are provided with weighted person frequencies of each value. This file is in plain text format. Specific activities (SPA) A file containing documentation of the Specific Activities level data. Data item code values and category labels are provided with weighted person frequencies of each value. This file is in plain text format. All recipients (ALR) A file containing documentation of the All Recipients level data. Data item code values and category labels are provided with weighted person frequencies of each value. This file is in plain text format. Broad activity (BRA) A file containing documentation of the Broad Activities level data. Data item code values and category labels are provided with weighted person frequencies of each value. This file is in plain text format. Assistance providers (PAS) A file containing documentation of the Assistance Providers level data. Data item code values and category labels are provided with weighted person frequencies of each value. This file is in plain text format. Document Selection These documents will be presented in a new window.
|