Sources
The data used in this release were sourced from the following:
- Person Level Integrated Data Asset (PLIDA), formerly known as the Multi-Agency Data Integration Project (MADIP). It combines information on health, education, government payments, income and taxation, employment, visas and population demographics (including the Census).
- Census of Population and Housing (Census), which contains key demographic, social and economic data collected from the Australian population every 5 years.
- Australian Census and Migrants Integrated Dataset (ACMID), which links data from the Census with the Department of Home Affairs Settlement Database.
PLIDA data used to produce this release include:
- ATO Client Register, 2006-2021
- Income Tax Returns, 2020-21
- Payment Summaries, 2015-16 to 2020-21
- Census of Population and Housing, 2021
- Higher Education Information Management System 2005-2019
- Migrant data, 1990-2021.
The Migrant data are a suite of administrative datasets (client information, visa grants, and visa applications) from the Department of Home Affairs. These data pertain to permanent migrants and temporary entrants to Australia, as well as Australian citizens who have travelled into or out of Australia. The data provides information on a person’s visa information, citizenship status, and their movements into and out of Australia.
Refer to Data assets for more information about PLIDA and other data assets.
PLIDA is linked using the Person Linkage Spine. More information can be found here: Person linkage spine.
Scope
This release relates to permanent migrants in Australia.
The scope and coverage of these estimates are defined and constrained by the characteristics of the data sources from which they are produced. As such, users should note that some permanent migrants might not be captured within these estimates.
The scope of the Migrant data includes:
- clients listed during the period 1 January 1990* to 10 August 2021
- visas granted between 1 January 1990* to 10 August 2021
- visa applications lodged between 1 January 1990* to 10 August 2021.
* Includes incomplete data for records prior to 1 January 1990
The Migrant data were further scoped to create a population base for these indicators. This involved the following steps:
- retaining all people who arrived in Australia from 1 January 2000 to 10 August 2021 and held a permanent visa as their last held visa
- retaining records of those who held a resident return visa (subclasses 111, 151, 154 -159, 834, R, and K38) and Australian citizens who previously held a permanent visa within the period mentioned above.
Other information from the PLIDA asset is then linked to the scoped migrant population, via the Person Linkage Spine. If a specific characteristic cannot be linked or derived, records missing this information are excluded from the calculations included in the specific indicator.
The Visa Grants data contain a small number of cases where multiple visas were issued on the same date. To distinguish the most recent visa held, a variable (last on day flag) is used to resolve these cases.
For some records, arrival date information is missing. This is likely in part due to the following factors:
- the retention of resident return visa records and locating the last substantial permanent visa
- administrative error.
Once same day visa grants have been reconciled and duplicate records on the Person Linkage Spine have been de-duplicated, visa subclass (3-digit) information is then aggregated into visa streams (highest-level grouping) to produce the Skilled, Family, Humanitarian, Other and undetermined, and Total permanent groupings. Please refer to the Glossary for more information on visa types.
While every effort will be made to maintain the stability of these indicators into the future, they may be subject to revisions as linkage methods develop, the PLIDA asset matures, and more data become available.
The data used in these indicators have different reference periods. Information about this can be found within the footnotes of each table. This should be considered, and caution exercised, when trying to analyse relationships between the data.
Care has been taken to articulate the scope and coverage of the datasets used and the populations in the individual data items themselves.