Page tools: Print Page Print All | ||
1. INTRODUCTION
For further information on the multi-panel sample design refer to Section 1.2. The 2011 Panel size was increased slightly to 5.7%, to achieve a linked sample size of no greater than 5% of the population after allowing for missed links and people no longer being in scope due to death or overseas migration (note that the linked sample size for the 2006 Panel linked to the 2011 Census was only 4.2%.). The 2011 panel sample of over one million records (1,221,057) from the 2011 Census was linked to the 2016 Census, resulting in a linked sample size of 927,520 records (4.3%). Linking the ACLD Data from the 2011 ACLD Panel sample and the 2016 Census were brought together using data linkage techniques. Data linkage is typically undertaken using a combination of deterministic and probabilistic methods:
The 2011-2016 ACLD builds on the success of the ABS' data integration program from the past decade, benefitting from advances in linking methodology, technology and data availability to deliver a high quality integrated statistical resource. To protect the privacy of Census respondents, we used an ABS encoded Census name for linking 2011 and 2016 Census records in the ACLD. Encoding was undertaken in 2011 for the purpose of protecting privacy by anonymising name and improving the future quality and efficiency of the linking process. The codes are created by grouping people with a combination of letters from their first and last names using a secure one-way process, meaning that a code cannot be reversed to deduce the original name information. Each code represents approximately 2,000 people drawn from many different letter combinations, and therefore is not unique to an individual. Actual name information from the 2016 Census was not used to link to 2011 Census records. The codes are only accessible to those ABS officers creating the linked dataset, and will never be released outside the ABS. For many individuals the linkage process will have accurately matched their corresponding records between Censuses. In some cases, the link will represent different people who share a number of characteristics in common. Some inaccuracy in the linkage will not generally affect statistical conclusions drawn from the linked data, although care should be taken in the interpretation of results. For more information see Section 2 - Data Linking Methodology. 1.2 MULTI-PANEL SAMPLE DESIGN Without sample maintenance, the ACLD would decline in its ability to accurately reflect the Australian population over time due to:
Linkage bias in longitudinal datasets is unique to those created via data integration, as traditional longitudinal studies employ strategies to ensure they collect information about the same individual over time. In a linked longitudinal dataset, data integration is necessary due to a lack of a common identifier to identify a person's responses over time. Linkage bias occurs where certain populations are more difficult to link than others (e.g. Aboriginal and Torres Strait Islander people, young males), so links are more likely to not be identified for members of these groups and, if they are found, have a higher chance of being inaccurate. If left untreated, the representation of population groups suffering from linkage bias would worsen as each new Census is linked to the ACLD. The ACLD sample is maintained through application of the Multi-Panel framework, developed by Chipperfield, Brown & Watson (2017). This framework provides an approach for selecting records in the ACLD to create panels which maintain the longitudinal and cross-sectional representativeness of the dataset over time, while minimising the impact of accumulated linkage bias on longitudinal analysis. The Multi-Panel approach designs multiple overlapping panels, with each panel representing a single Census population (2006, 2011, 2016, etc.), which is then linked to subsequent Censuses. The sample selection strategy for each panel is designed to maintain a linked sample size of 5%, maximise sample overlap between the panels, and introduce new records to the dataset in each panel to account for new births, migrants and missed links in previous panels. This allows flexibility for users, who can draw on the most appropriate panel for their research question. The sample overlap between the 2006 and 2011 ACLD Panels is illustrated below: FIGURE 1 - SAMPLE OVERLAP BETWEEN THE 2006 AND 2011 ACLD PANELS Of the 800,758 2011 Census records linked in the original 2006 Panel, 730,756 (91%) are in the 2011 Panel sample. The exclusion of 70,002 2006 Panel records from the 2011 Panel is the result of the sample design used to ensure the representativeness of the 2011 Panel sample. 1.3 CENSUS DATA QUALITY In June 2017, a Report on the Quality of 2016 Census Data was released by the Census Independent Assurance Panel reviewing the quality of the 2016 Census data. The Panel determined that the 2016 Census data is of a comparable quality to previous Censuses, is useful and useable, and will support the same variety of uses of Census data as was the case for previous Censuses. The Report included a broad assessment of the key ACLD linking variables, name and date of birth. Although the quality of these variables was still high for the 2016 Census, there was a decrease in the quality of this information relative to the 2011 Census. The Report noted the following:
For further information on the quality of particular Census variables, please see Understanding the Census and Census Data, Australia, 2016 (cat. no. 2900.0). 1.4 BENEFITS OF THE ACLD Each five-yearly Census provides a rich set of information about Australian people and households at a point in time. The Census provides information on characteristics such as age, sex, Indigenous status, country of birth and year of arrival, together with topics such as family structure, education and qualifications, work, income and housing, and presence of a severe or profound disability. It is able to provide a rich picture of social and economic conditions at a particular point in time, and how these conditions are changing over time and across population groups. The ACLD adds the ability to study transitions in social and economic conditions at the individual level, giving insight into the pathways that tend to lead to particular outcomes, and how these pathways vary for different population groups. It also enables the study of likely consequences of certain socio-economic circumstances for different population groups, as evidenced by the patterns in the longitudinal data. The ACLD aims to help in the development of evidence based strategies to promote positive pathways and avoid negative ones, and assist policy makers in assessing both the social and financial benefits of related intervention strategies. Since the first release of the ACLD, policy makers and researchers have used evidence from the ACLD to:
Document Selection These documents will be presented in a new window.
|