Socio-Economic Indexes for Areas (SEIFA), Australia methodology

Latest release

Reference period

2021

Released

27/04/2023

Next release Unknown

First release

Release date and time

09/05/2023 11:30am AEST

Introduction

This publication outlines how SEIFA is constructed and provides guidance on how it should be used and understood.

See SEIFA for general information on SEIFA 2021, and to access the data.

For detailed technical specifications refer to the SEIFA Technical Paper.

SEIFA indexes

SEIFA is a collection of four indexes, each summarising a different aspect of the socio-economic conditions in an area using different Census data:

the Index of Relative Socio-economic Advantage and Disadvantage (IRSAD) focuses on both advantage and disadvantage
the Index of Relative Socio-economic Disadvantage (IRSD) focuses on relative socio-economic disadvantage
the Index of Education and Occupation (IEO) focuses on relative Education and Occupation advantage and disadvantage
the Index of Economic Resources (IER) focuses on Economic advantage and disadvantage.

The same area may score differently for each index due to their constituent variables.

Advantage and disadvantage

For SEIFA purposes, relative socio-economic advantage and disadvantage is defined as people's access to material and social resources, and their ability to participate in society. The dimensions are constrained by what is collected in the Census; they include income, education, employment, occupation, housing, and family structure.

SEIFA is area based

SEIFA is designed and constructed as an area-based measure. Area-based deciles are calculated by dividing the areas, ordered by disadvantage, into ten equally sized groups. Decile one contains the most disadvantaged areas and decile ten contains the most advantaged areas.

SEIFA geography

The SEIFA scores are initially calculated by Statistical Area Level 1 (SA1).

Scores for higher level geographic areas, such as Statistical Area Level 2 (SA2), Local Government Areas (LGA), Suburbs and Localities (SAL), and Postal Areas (POA) are calculated using population-weighted averages of the SA1 scores. This ensures that the higher-level geographic area scores reflect the size of the population within each contributing SA1.

Constructing SEIFA

For a detailed discussion on SEIFA’s construction, refer to the SEIFA Technical Paper.

Choosing the variables

Variables that relate to the concepts of advantage and disadvantage that SEIFA is trying to capture are included in the initial variable list. The main constraint on the initial variable list is that the variables can only be derived from Census data.

Data Proportions

The indexes are calculated from 2021 Census data, with variables calculated as proportions at the SA1 level. For example, the proportion of unemployed people is calculated as the number of unemployed people divided by the total number of people in the labour force in each SA1.

Areas without SEIFA Scores

Due to non-response and low population counts, not every SA1 can be given a meaningful index score. ABS maximises the number of areas that receive a score, while maintaining confidentiality and an acceptable level of quality. SA1s do not receive an index score if:

the SA1 has no usual addresses
the usual resident population is less than or equal to Ten
the area is classified as offshore or migratory (refer to special purpose codes)
there are fewer than six employed people
there are fewer than six classifiable occupied private dwellings
the proportion of people in private dwellings is less than or equal to 20 per cent
the denominator of a variable in the index is less than six.

Principal Component Analysis

The indexes are a weighted combination of Census derived variables. The method used to determine the weights and final variable composition is known as Principal Component Analysis (PCA). The weights describe the relative importance of each variable. If the weight determined by PCA for a variable is too low, the variable would not meaningfully contribute to the index, so is excluded.

The variable weights are derived from a consistent, data-driven method and are not determined subjectively.

Once the variable weights have been finalised, they are used together with the variable proportions to create index scores for SA1s across Australia.

Validation

The indexes are checked to ensure they are measuring the desired concept and that the results make intuitive sense. Validation includes:

investigating the correlations between the four indexes
comparing SEIFA 2021 rankings with 2016 rankings
identifying which variables are the drivers of change between SEIFA 2016 and 2021
peer review of the results.

Geography

SEIFA 2021 uses edition 3 of the Australian Statistical Geography Standard (ASGS).

Change between 2016 and 2021

SEIFA 2021 largely uses the same method as SEIFA 2016.
Where available, SEIFA 2021 uses the same candidate variables as SEIFA 2016.
The Dwelling Internet Connection data item was not collected in the 2021 Census, so the NONET variable could not be used.
Occupation variables are based on the Australian and New Zealand Standard Classification of Occupations (ANZSCO).
Variables using cut-off values as part of their specification, such as high and low income (INC_HIGH and INC_LOW), were updated.
The only processing change was to start from confidentialised (instead of unconfidentialised) area level counts. Using confidentialised area level counts did not have a significant impact on the results, and appropriate comparisons to 2016 results are still valid.

Understanding SEIFA

The indexes rank areas according to their relative socio-economic conditions and are useful in understanding the distribution of socio-economic conditions across different areas.

It is recommended that index rankings or quantiles (deciles and percentiles) are used for analysis, rather than index scores.

Area versus individuals

The indexes reflect the socio-economic characteristics of an area, rather than of individuals.

They are calculated at the SA1 level and reflect SA1 characteristics.

Because people within an SA1 are not identical, the index scores for an SA1 do not directly reflect the relative advantage or disadvantage of an individual residing in that SA1. For example, it is possible for a relatively advantaged person to live in an area with a low score or for a relatively disadvantaged person to live in an area with a high score.

Refer to SEIFA: Getting a handle on individual diversity within areas for more information.

Variables

The variables that contribute to each index should be considered when deciding which index to use (see sections on each index for lists of variables). For example:

if you are looking for areas of disadvantage for the purpose of allocating funds or services, the Index of Relative Socio-economic Disadvantage would be the most appropriate
if you are looking for areas with relatively high populations in unskilled jobs or with relatively low educational qualifications, the Index of Education and Occupation would be most suitable.

Time series

The indexes are designed to compare the relative socio-economic characteristics of areas at a point in time, not to compare areas over time. There are several issues that make longitudinal or time series analysis of SEIFA difficult to interpret.

The constituent indicators and indicator weights for each index are likely to have changed.
The geographic boundaries and numbers of relevant small areas may have changed.
The distribution of the standardised index values will have changed (e.g., a score of 800 does not represent the same level of disadvantage in different years).
There are changes in the way the indicators are defined.

If comparisons over time are made, the use of deciles or percentiles is recommended rather than ranks or scores.

Topics not represented in the indexes

Topics represented in SEIFA are limited to what is collected in the Census.

Measures relating to wealth and infrastructure may provide more information about the relative advantage or disadvantage within an area, but these measures are not collected in the Census.

Long-term health conditions, asked for the first time in the 2021 Census, were not included in SEIFA. This is to allow health researchers to analyse the relationship between health outcomes and socio-economic advantage/disadvantage. Adding health variables to SEIFA would make these relationships less clear.

Other potential topics that could be associated with advantage and disadvantage but are not captured in the Census include crime, and the environment.

Terminology

Scores

SA1 scores are created by adding together the weighted characteristics of that SA1. The scores for all SA1s are then standardised to a distribution where the average equals 1,000 and the standard deviation is 100.

Scores for larger geographic areas, such as Statistical Area Level 2 (SA2) and Local Government Area (LGA), are calculated using population-weighted averages of the SA1 scores.

Lower scores indicate areas of relative disadvantage compared to areas with higher scores.

Scores are an ordinal measure on an arbitrary scale and do not represent the quantity of advantage or disadvantage (i.e. it’s not accurate to say an area with a score of 1,000 is twice as advantaged as an area with a score of 500).

Rankings and quantiles (deciles and percentiles) should be used in most cases. Scores should be used for more technically complex analysis. For more information see Technical Paper.

Ranks

Every area is ordered from the lowest to highest score, with the area with the lowest score given a rank of one, the area with the second lowest score given a rank of two and so on, up to the area with the highest score which is given the highest rank.

Deciles

Every area is ordered from lowest to highest score, with the lowest 10 per cent of areas given a decile number of one, the next lowest 10 per cent of areas given a decile number of two and so on, up to the highest 10 per cent of areas which are given a decile number of 10. This means that areas are divided up into 10 equal sized groups, based on their score.

Percentiles

Every area is ordered from lowest to highest score, with the lowest one per cent of areas given a percentile number of one, the next lowest one per cent is given a percentile number two and so on, up to the highest one per cent of areas which are given a percentile number of 100. This means that areas are divided up into one hundred equal sized groups, based on their score.

Ranking within state or territory (rank, decile, and percentile)

Every area within a state or territory is ordered from lowest to highest score, then ranks, deciles and percentiles are assigned to each area within that state or territory.

State/territory ranks, deciles and percentiles can only be used to compare areas within a single state or territory.

Index of Relative Socio-economic Advantage and Disadvantage (IRSAD)

Included variables

This section outlines the variables used in the Index of Relative Socio-economic Advantage and Disadvantage (IRSAD).

The following variables are indicators of disadvantage. NOYR12ORHIGHER is the strongest indicator of disadvantage in the index.

IRSAD Indicators of disadvantage
Variable name	Variable description
NOYR12ORHIGHER	Per cent of people aged 15 years and over whose highest level of education is Year 11 or lower (Includes Certificate I and II)
INC_LOW	Per cent of people living in households with stated annual household equivalised income between $1 and $25,999 (approx. first and second deciles)
OCC_LABOUR	Per cent of employed people classified as 'labourers'
DISABILITYU70	Per cent of people aged under 70 who need assistance with core activities due to a long–term health condition, disability, or old age
CHILDJOBLESS	Per cent of families with children under 15 years of age who live with jobless parents
OCC_DRIVERS	Per cent of employed people classified as Machinery Operators and Drivers
LOWRENT	Per cent of occupied private dwellings paying rent less than $250 per week (excluding $0 per week)
SEPDIVORCED	Per cent of people aged 15 and over who are separated or divorced
ONEPARENT	Per cent of one parent families with dependent offspring only
UNEMPLOYED	Per cent of people (in the labour force) unemployed
OCC_SERVICE_L	Per cent of employed people classified as Low Skill (skill level 4 and 5) Community and Personal Service Workers
CERTIFICATE	Per cent of people aged 15 years and over whose highest level of educational attainment is a certificate III or IV qualification
OVERCROWD	Per cent of occupied private dwellings requiring one or more extra bedrooms (based on the Canadian National Occupancy Standard)
NOEDU	Per cent of people aged 15 years and over who have no educational attainment
OCC_SALES_L	Per cent of employed people classified as Low Skill (skill level 4 and 5) Sales

The following variables are indicators of advantage. INC_HIGH is the strongest indicator of advantage in the index.

IRSAD Indicators of advantage
Variable name	Variable description
INC_HIGH	Per cent of people living in households with stated annual household equivalised income greater than $91,000 (approx. 9th and 10th deciles)
OCC_PROF	Per cent of employed people classified as Professionals
HIGHMORTGAGE	Per cent of occupied private dwellings paying mortgage greater than $3,000 per month
OCC_MANAGER	Per cent of employed people classified as Managers
HIGHRENT	Per cent of occupied private dwellings paying rent greater than $500 per week
DIPLOMA	Per cent of people aged 15 years and over whose highest level of educational attainment is a diploma qualification
HIGHBED	Per cent of occupied private dwellings with four or more bedrooms
ATUNI	Per cent of people aged 15 years and over at university or other tertiary institution

Excluded variables

The following variable was initially considered for the index but was excluded due to being highly correlated with other variables.

IRSAD excluded indicators
Variable name	Variable description
DEGREE	Per cent of people aged 15 years and over whose highest level of educational attainment is a bachelor degree or higher qualification.

The following variables were initially considered for the index but were excluded when the analysis showed that they were weak indicators of relative advantage or disadvantage.

IRSAD excluded indicators
Variable name	Variable description
NOCAR	Per cent occupied private dwellings with no cars
SPAREBED	Per cent occupied private dwellings with one or no bedrooms
ENGLISHPOOR	Per cent people who do not speak English well
HIGHCAR	Per cent occupied private dwellings with three or more cars
OWNING	Per cent occupied private dwellings owning dwelling without a mortgage
FEWBED	Per cent occupied private dwellings with one or no bedrooms

For more information on the IRSAD variables, refer to the Technical Paper.

Index of Relative Socio-Economic Disadvantage (IRSD)

Included variables

This section outlines the variables used in the Index of Relative Socio-economic Disadvantage (IRSD).

All variables in this index are indicators of disadvantage. INC_LOW is the strongest indicator of disadvantage in the index.

IRSD Indicators of disadvantage
Variable name	Variable description
INC_LOW	Per cent of people with stated household equivalised income between $1 and $25,999 per year
CHILDJOBLESS	Per cent of families with children under 15 years of age who live with jobless parents
NOYEAR12ORHIGHER	Per cent of people aged 15 years and over whose highest level of education is Year 11 or lower (Includes Certificate I and II)
LOWRENT	Per cent of occupied private dwellings paying rent less than $250 per week (excluding $0 per week)
UNEMPLOYED	Per cent of people (in the labour force) who are unemployed
OCC_LABOUR	Per cent of employed people classified as Labourers
DISABILITYU70	Per cent of people aged under 70 who need assistance with core activities due to a long–term health condition, disability, or old age
ONEPARENT	Per cent of one parent families with dependent offspring only
OVERCROWD	Per cent of occupied private dwellings requiring one or more extra bedrooms.
OCC_DRIVERS	Per cent of employed people classified as Machinery Operators and Drivers
SEPDIVORCED	Per cent of people aged 15 years and over who are separated or divorced
NOEDU	Per cent of people aged 15 years and over who have no educational attainment
OCC_SERVICE_L	Per cent of employed people classified as low skill (skill level 4 and 5) Community and Personal Service workers
NOCAR	Per cent of occupied private dwellings with no cars
ENGLISHPOOR	Per cent of people who do not speak English well

Excluded variables

The following variables were initially considered for the index but were excluded when the analysis showed that they were weak indicators of relative disadvantage.

IRSD excluded indicators
Variable name	Variable description
OCC_SALES_L	Per cent of employed people classified as Low-Skill (skill level 4 and 5) Sales Workers
CERTIFICATE	Per cent of people aged 15 years and over whose highest educational attainment is a certificate III or IV qualification
FEWBED	Per cent occupied private dwellings with one or no bedrooms

For more information on the IRSD variables, refer to the Technical Paper.

Index of Education and Occupation (IEO)

Included variables

This section outlines the variables used in the Index of Education and Occupation (IEO).

The following variables are indicators of disadvantage. NOYR12ORHIGHER is the strongest indicator of disadvantage in the index.

IEO indicators of disadvantage
Variable name	Variable description
NOYR12ORHIGHER	Per cent of people aged 15 years and over whose highest level of education is Year 11 or lower (Includes Certificate I and II)
OCC_SKILL5	Per cent of employed people who work in a Skill Level 5 occupation
OCC_SKILL4	Per cent of employed people who work in a Skill Level 4 occupation
CERTIFICATE	Per cent of people aged 15 years and over whose highest level of educational attainment is a certificate III or IV qualification
UNEMPLOYED	Per cent of people (in the labour force) unemployed

The following variables are indicators of advantage. OCC_SKILL1 is the strongest indicator of advantage in the index.

IEO indicators of advantage
Variable name	Variable description
OCC_SKILL1	Per cent of employed people who work in a Skill Level 1 occupation
ATUNI	Per cent of people aged 15 years and over currently attending a university or other tertiary institution
DIPLOMA	Per cent of people aged 15 years and over whose highest level of educational attainment is a diploma qualification

Excluded variables

The following variable was initially considered for the index but was excluded due to being highly correlated with other variables.

IEO excluded indicators
Variable name	Variable description
DEGREE	Per cent of people aged 15 years and over whose highest level of educational attainment is a bachelor degree or higher qualification.

The following variables were initially considered for the index but were excluded when the analysis showed that they were weak indicators of relative advantage or disadvantage.

IEO excluded indicators
Variable name	Variable description
NOEDU	Per cent of people aged 15 years and over who have no educational attainment
OCC_SKILL2	Per cent of employed people who work in a Skill Level 2 occupation
ATSCHOOL	Per cent of people aged 15 years and over who are still attending secondary school

For more information on the IEO variables, refer to the Technical Paper.

Index of Economic Resources (IER)

Included variables

This section outlines the variables used in the Index of Economic Resources (IER).

The following variables are indicators of disadvantage. INC_LOW is the strongest indicator of disadvantage in the index.

IER indicators of disadvantage
Variable name	Variable description
INC_LOW	Per cent of people living in households with stated annual household equivalised income between $1 and $25,999 (approx. 1st and 2nd deciles)
LOWRENT	Per cent of occupied private dwellings paying rent less than $250 per week (excluding $0 per week)
NOCAR	Per cent of occupied private dwellings with no cars
LONE	Per cent of occupied private dwellings who are lone person occupied private dwellings
ONEPARENT	Per cent of one parent families with dependent offspring only
OVERCROWD	Per cent of occupied private dwellings requiring one or more extra bedrooms (based on Canadian National Occupancy Standard)
UNEMPLOYED_IER	Per cent of people aged 15 years and over who are unemployed
GROUP	Per cent of occupied private dwellings which are group occupied private dwellings

The following variables are indicators of advantage. HIGHBED is the strongest indicator of advantage in the index.

IER indicators of advantage
Variable name	Variable description
HIGHBED	Per cent of occupied private dwellings with four or more bedrooms
MORTGAGE	Per cent of occupied private dwellings owning dwelling (with a mortgage)
HIGHMORTGAGE	Per cent of occupied private dwellings paying mortgage greater than $3,000 per month
INC_HIGH	Per cent of people living in households with stated annual household equivalised income greater than $91,000 (approx. 9th and 10th deciles)
UNINCORP	Per cent of dwellings with at least one person who is an owner of an unincorporated enterprise
OWNING	Per cent of occupied private dwellings owning dwelling without a mortgage

Excluded variables

The following variable was initially considered for the index but was excluded when the analysis showed that it was a weak indicator of relative disadvantage.

IER excluded indicators
Variable name	Variable description
HIGHRENT	Per cent of occupied private dwellings paying rent greater than $500 per week

For more information on the variables, refer to the Technical Paper.

APA

Socio-Economic Indexes for Areas (SEIFA), Australia methodology

APA

Introduction

SEIFA indexes

Advantage and disadvantage

SEIFA is area based

SEIFA geography

Constructing SEIFA

Choosing the variables

Data Proportions

Areas without SEIFA Scores

Principal Component Analysis

Validation

Geography

Change between 2016 and 2021

Understanding SEIFA

Area versus individuals

Variables

Time series

Topics not represented in the indexes

Terminology

Scores

Ranks

Deciles

Percentiles

Ranking within state or territory (rank, decile, and percentile)

Index of Relative Socio-economic Advantage and Disadvantage (IRSAD)

Included variables

Excluded variables

Index of Relative Socio-Economic Disadvantage (IRSD)

Included variables

Excluded variables

Index of Education and Occupation (IEO)

Included variables

Excluded variables

Index of Economic Resources (IER)

Included variables

Excluded variables

Technical paper

Socio-Economic Indexes for Areas (SEIFA): Technical Paper