Socio-Economic Indexes for Areas (SEIFA), Australia methodology

Latest release
Reference period
2021
Released
27/04/2023
Next release Unknown
First release

Introduction

This publication outlines how SEIFA is constructed and provides guidance on how it should be used and understood.

See SEIFA for general information on SEIFA 2021, and to access the data. 

For detailed technical specifications refer to the SEIFA Technical Paper.

SEIFA indexes

SEIFA is a collection of four indexes, each summarising a different aspect of the socio-economic conditions in an area using different Census data:

  • the Index of Relative Socio-economic Advantage and Disadvantage (IRSAD) focuses on both advantage and disadvantage
  • the Index of Relative Socio-economic Disadvantage (IRSD) focuses on relative socio-economic disadvantage
  • the Index of Education and Occupation (IEO) focuses on relative Education and Occupation advantage and disadvantage
  • the Index of Economic Resources (IER) focuses on Economic advantage and disadvantage.

The same area may score differently for each index due to their constituent variables.

Advantage and disadvantage

For SEIFA purposes, relative socio-economic advantage and disadvantage is defined as people's access to material and social resources, and their ability to participate in society. The dimensions are constrained by what is collected in the Census; they include income, education, employment, occupation, housing, and family structure.

SEIFA is area based

SEIFA is designed and constructed as an area-based measure. Area-based deciles are calculated by dividing the areas, ordered by disadvantage, into ten equally sized groups. Decile one contains the most disadvantaged areas and decile ten contains the most advantaged areas.

SEIFA geography

The SEIFA scores are initially calculated by Statistical Area Level 1 (SA1).

Scores for higher level geographic areas, such as Statistical Area Level 2 (SA2), Local Government Areas (LGA), Suburbs and Localities (SAL), and Postal Areas (POA) are calculated using population-weighted averages of the SA1 scores. This ensures that the higher-level geographic area scores reflect the size of the population within each contributing SA1.

Constructing SEIFA

For a detailed discussion on SEIFA’s construction, refer to the SEIFA Technical Paper.

Choosing the variables

Variables that relate to the concepts of advantage and disadvantage that SEIFA is trying to capture are included in the initial variable list. The main constraint on the initial variable list is that the variables can only be derived from Census data.

Data Proportions

The indexes are calculated from 2021 Census data, with variables calculated as proportions at the SA1 level. For example, the proportion of unemployed people is calculated as the number of unemployed people divided by the total number of people in the labour force in each SA1.

Areas without SEIFA Scores

Due to non-response and low population counts, not every SA1 can be given a meaningful index score. ABS maximises the number of areas that receive a score, while maintaining confidentiality and an acceptable level of quality. SA1s do not receive an index score if:

  • the SA1 has no usual addresses
  • the usual resident population is less than or equal to Ten
  • the area is classified as offshore or migratory (refer to special purpose codes)
  • there are fewer than six employed people
  • there are fewer than six classifiable occupied private dwellings
  • the proportion of people in private dwellings is less than or equal to 20 per cent
  • the denominator of a variable in the index is less than six.

Principal Component Analysis

The indexes are a weighted combination of Census derived variables. The method used to determine the weights and final variable composition is known as Principal Component Analysis (PCA). The weights describe the relative importance of each variable. If the weight determined by PCA for a variable is too low, the variable would not meaningfully contribute to the index, so is excluded.

The variable weights are derived from a consistent, data-driven method and are not determined subjectively.

Once the variable weights have been finalised, they are used together with the variable proportions to create index scores for SA1s across Australia.

Validation

The indexes are checked to ensure they are measuring the desired concept and that the results make intuitive sense. Validation includes:

  • investigating the correlations between the four indexes
  • comparing SEIFA 2021 rankings with 2016 rankings
  • identifying which variables are the drivers of change between SEIFA 2016 and 2021
  • peer review of the results.

Geography

SEIFA 2021 uses edition 3 of the Australian Statistical Geography Standard (ASGS).

Change between 2016 and 2021

  • SEIFA 2021 largely uses the same method as SEIFA 2016.
  • Where available, SEIFA 2021 uses the same candidate variables as SEIFA 2016.
  • The Dwelling Internet Connection data item was not collected in the 2021 Census, so the NONET variable could not be used.
  • Occupation variables are based on the Australian and New Zealand Standard Classification of Occupations (ANZSCO).
  • Variables using cut-off values as part of their specification, such as high and low income (INC_HIGH and INC_LOW), were updated.
  • The only processing change was to start from confidentialised (instead of unconfidentialised) area level counts. Using confidentialised area level counts did not have a significant impact on the results, and appropriate comparisons to 2016 results are still valid.

Understanding SEIFA

The indexes rank areas according to their relative socio-economic conditions and are useful in understanding the distribution of socio-economic conditions across different areas.

It is recommended that index rankings or quantiles (deciles and percentiles) are used for analysis, rather than index scores.

Area versus individuals

The indexes reflect the socio-economic characteristics of an area, rather than of individuals.

They are calculated at the SA1 level and reflect SA1 characteristics.

Because people within an SA1 are not identical, the index scores for an SA1 do not directly reflect the relative advantage or disadvantage of an individual residing in that SA1. For example, it is possible for a relatively advantaged person to live in an area with a low score or for a relatively disadvantaged person to live in an area with a high score.

Refer to SEIFA: Getting a handle on individual diversity within areas for more information.

Variables

The variables that contribute to each index should be considered when deciding which index to use (see sections on each index for lists of variables). For example:

  • if you are looking for areas of disadvantage for the purpose of allocating funds or services, the Index of Relative Socio-economic Disadvantage would be the most appropriate
  • if you are looking for areas with relatively high populations in unskilled jobs or with relatively low educational qualifications, the Index of Education and Occupation would be most suitable.

Time series

The indexes are designed to compare the relative socio-economic characteristics of areas at a point in time, not to compare areas over time. There are several issues that make longitudinal or time series analysis of SEIFA difficult to interpret.

  • The constituent indicators and indicator weights for each index are likely to have changed. 
  • The geographic boundaries and numbers of relevant small areas may have changed. 
  • The distribution of the standardised index values will have changed (e.g., a score of 800 does not represent the same level of disadvantage in different years). 
  • There are changes in the way the indicators are defined.

If comparisons over time are made, the use of deciles or percentiles is recommended rather than ranks or scores.

Topics not represented in the indexes

Topics represented in SEIFA are limited to what is collected in the Census.

Measures relating to wealth and infrastructure may provide more information about the relative advantage or disadvantage within an area, but these measures are not collected in the Census.

Long-term health conditions, asked for the first time in the 2021 Census, were not included in SEIFA. This is to allow health researchers to analyse the relationship between health outcomes and socio-economic advantage/disadvantage. Adding health variables to SEIFA would make these relationships less clear.

Other potential topics that could be associated with advantage and disadvantage but are not captured in the Census include crime, and the environment.

Terminology

Scores

SA1 scores are created by adding together the weighted characteristics of that SA1. The scores for all SA1s are then standardised to a distribution where the average equals 1,000 and the standard deviation is 100.

Scores for larger geographic areas, such as Statistical Area Level 2 (SA2) and Local Government Area (LGA), are calculated using population-weighted averages of the SA1 scores.

Lower scores indicate areas of relative disadvantage compared to areas with higher scores.

Scores are an ordinal measure on an arbitrary scale and do not represent the quantity of advantage or disadvantage (i.e. it’s not accurate to say an area with a score of 1,000 is twice as advantaged as an area with a score of 500).

Rankings and quantiles (deciles and percentiles) should be used in most cases. Scores should be used for more technically complex analysis. For more information see Technical Paper.

Ranks

Every area is ordered from the lowest to highest score, with the area with the lowest score given a rank of one, the area with the second lowest score given a rank of two and so on, up to the area with the highest score which is given the highest rank.

Deciles

Every area is ordered from lowest to highest score, with the lowest 10 per cent of areas given a decile number of one, the next lowest 10 per cent of areas given a decile number of two and so on, up to the highest 10 per cent of areas which are given a decile number of 10. This means that areas are divided up into 10 equal sized groups, based on their score.

Percentiles

Every area is ordered from lowest to highest score, with the lowest one per cent of areas given a percentile number of one, the next lowest one per cent is given a percentile number two and so on, up to the highest one per cent of areas which are given a percentile number of 100. This means that areas are divided up into one hundred equal sized groups, based on their score.

Ranking within state or territory (rank, decile, and percentile)

Every area within a state or territory is ordered from lowest to highest score, then ranks, deciles and percentiles are assigned to each area within that state or territory.

State/territory ranks, deciles and percentiles can only be used to compare areas within a single state or territory.

 

Index of Relative Socio-economic Advantage and Disadvantage (IRSAD)

Included variables

This section outlines the variables used in the Index of Relative Socio-economic Advantage and Disadvantage (IRSAD).

The following variables are indicators of disadvantage. NOYR12ORHIGHER is the strongest indicator of disadvantage in the index.

IRSAD Indicators of disadvantage

Variable name

Variable description

NOYR12ORHIGHER

Per cent of people aged 15 years and over whose highest level of education is Year 11 or lower (Includes Certificate I and II)

INC_LOW

Per cent of people living in households with stated annual household equivalised income between $1 and $25,999 (approx. first and second deciles)

OCC_LABOUR

Per cent of employed people classified as 'labourers'

DISABILITYU70

Per cent of people aged under 70 who need assistance with core activities due to a long–term health condition, disability, or old age

CHILDJOBLESS

Per cent of families with children under 15 years of age who live with jobless parents

OCC_DRIVERS

Per cent of employed people classified as Machinery Operators and Drivers

LOWRENT

Per cent of occupied private dwellings paying rent less than $250 per week (excluding $0 per week)

SEPDIVORCED

Per cent of people aged 15 and over who are separated or divorced

ONEPARENT

Per cent of one parent families with dependent offspring only

UNEMPLOYED

Per cent of people (in the labour force) unemployed

OCC_SERVICE_L

Per cent of employed people classified as Low Skill (skill level 4 and 5) Community and Personal Service Workers

CERTIFICATE

Per cent of people aged 15 years and over whose highest level of educational attainment is a certificate III or IV qualification

OVERCROWD

Per cent of occupied private dwellings requiring one or more extra bedrooms (based on the Canadian National Occupancy Standard)

NOEDU

Per cent of people aged 15 years and over who have no educational attainment

OCC_SALES_L

Per cent of employed people classified as Low Skill (skill level 4 and 5) Sales

The following variables are indicators of advantage. INC_HIGH is the strongest indicator of advantage in the index.

IRSAD Indicators of advantage

Variable name

Variable description

INC_HIGH

Per cent of people living in households with stated annual household equivalised income greater than $91,000 (approx. 9th and 10th deciles)

OCC_PROF

Per cent of employed people classified as Professionals

HIGHMORTGAGE

Per cent of occupied private dwellings paying mortgage greater than $2,800 per month

OCC_MANAGER

Per cent of employed people classified as Managers

HIGHRENT

Per cent of occupied private dwellings paying rent greater than $470 per week

DIPLOMA

Per cent of people aged 15 years and over whose highest level of educational attainment is a diploma qualification

HIGHBED

Per cent of occupied private dwellings with four or more bedrooms

ATUNI

Per cent of people aged 15 years and over at university or other tertiary institution

 

Excluded variables

The following variable was initially considered for the index but was excluded due to being highly correlated with other variables.

IRSAD excluded indicators

Variable name

Variable description

DEGREE

Per cent of people aged 15 years and over whose highest level of educational attainment is a bachelor degree or higher qualification.

 

The following variables were initially considered for the index but were excluded when the analysis showed that they were weak indicators of relative advantage or disadvantage.

IRSAD excluded indicators 

Variable name

Variable description

NOCAR

Per cent occupied private dwellings with no cars

SPAREBED

Per cent occupied private dwellings with one or no bedrooms

ENGLISHPOOR

Per cent people who do not speak English well

HIGHCAR

Per cent occupied private dwellings with three or more cars

OWNING

Per cent occupied private dwellings owning dwelling without a mortgage

FEWBED

Per cent occupied private dwellings with one or no bedrooms

 

For more information on the IRSAD variables, refer to the Technical Paper.

Index of Relative Socio-Economic Disadvantage (IRSD)

Included variables

This section outlines the variables used in the Index of Relative Socio-economic Disadvantage (IRSD).

All variables in this index are indicators of disadvantage. INC_LOW is the strongest indicator of disadvantage in the index.

IRSD Indicators of disadvantage

Variable name

Variable description

INC_LOW

Per cent of people with stated household equivalised income between $1 and $25,999 per year

CHILDJOBLESS

Per cent of families with children under 15 years of age who live with jobless parents

NOYEAR12ORHIGHER

Per cent of people aged 15 years and over whose highest level of education is Year 11 or lower (Includes Certificate I and II)

LOWRENT

Per cent of occupied private dwellings paying rent less than $250 per week (excluding $0 per week)

UNEMPLOYED

Per cent of people (in the labour force) who are unemployed

OCC_LABOUR

Per cent of employed people classified as Labourers

DISABILITYU70

Per cent of people aged under 70 who need assistance with core activities due to a long–term health condition, disability, or old age

ONEPARENT

Per cent of one parent families with dependent offspring only

OVERCROWD

Per cent of occupied private dwellings requiring one or more extra bedrooms.

OCC_DRIVERS

Per cent of employed people classified as Machinery Operators and Drivers

SEPDIVORCED

Per cent of people aged 15 years and over who are separated or divorced

NOEDU

Per cent of people aged 15 years and over who have no educational attainment

OCC_SERVICE_L

Per cent of employed people classified as low skill (skill level 4 and 5) Community and Personal Service workers

NOCAR

Per cent of occupied private dwellings with no cars

ENGLISHPOOR

Per cent of people who do not speak English well

 

Excluded variables

The following variables were initially considered for the index but were excluded when the analysis showed that they were weak indicators of relative disadvantage.

IRSD excluded indicators

Variable name

Variable description

OCC_SALES_L

Per cent of employed people classified as Low-Skill (skill level 4 and 5) Sales Workers

CERTIFICATE

Per cent of people aged 15 years and over whose highest educational attainment is a certificate III or IV qualification

FEWBED

Per cent occupied private dwellings with one or no bedrooms

 

For more information on the IRSD variables, refer to the Technical Paper.

Index of Education and Occupation (IEO)

Included variables

This section outlines the variables used in the Index of Education and Occupation (IEO).

The following variables are indicators of disadvantage. NOYR12ORHIGHER is the strongest indicator of disadvantage in the index.

IEO indicators of disadvantage

Variable name

Variable description

NOYR12ORHIGHER

Per cent of people aged 15 years and over whose highest level of education is Year 11 or lower (Includes Certificate I and II)

OCC_SKILL5

Per cent of employed people who work in a Skill Level 5 occupation

OCC_SKILL4

Per cent of employed people who work in a Skill Level 4 occupation

CERTIFICATE

Per cent of people aged 15 years and over whose highest level of educational attainment is a certificate III or IV qualification

UNEMPLOYED

Per cent of people (in the labour force) unemployed

 

The following variables are indicators of advantage. OCC_SKILL1 is the strongest indicator of advantage in the index.

IEO indicators of advantage

Variable name

Variable description

OCC_SKILL1

Per cent of employed people who work in a Skill Level 1 occupation

ATUNI

Per cent of people aged 15 years and over currently attending a university or other tertiary institution

DIPLOMA

Per cent of people aged 15 years and over whose highest level of educational attainment is a diploma qualification

 

Excluded variables

The following variable was initially considered for the index but was excluded due to being highly correlated with other variables.

IEO excluded indicators

Variable name

Variable description

DEGREE

 Per cent of people aged 15 years and over whose highest level of educational attainment is a bachelor degree or higher qualification.

 

The following variables were initially considered for the index but were excluded when the analysis showed that they were weak indicators of relative advantage or disadvantage.

IEO excluded indicators

Variable name

Variable description

NOEDU

 Per cent of people aged 15 years and over who have no educational attainment

OCC_SKILL2

 Per cent of employed people who work in a Skill Level 2 occupation

ATSCHOOL

 Per cent of people aged 15 years and over who are still attending secondary school

 

For more information on the IEO variables, refer to the Technical Paper.

Index of Economic Resources (IER)

Included variables

This section outlines the variables used in the Index of Economic Resources (IER).

The following variables are indicators of disadvantage. INC_LOW is the strongest indicator of disadvantage in the index.

IER indicators of disadvantage

Variable name

Variable description

INC_LOW

Per cent of people living in households with stated annual household equivalised income between $1 and $25,999 (approx. 1st and 2nd deciles)

LOWRENT

Per cent of occupied private dwellings paying rent less than $250 per week (excluding $0 per week)

NOCAR

Per cent of occupied private dwellings with no cars

LONE

Per cent of occupied private dwellings who are lone person occupied private dwellings

ONEPARENT

Per cent of one parent families with dependent offspring only

OVERCROWD

Per cent of occupied private dwellings requiring one or more extra bedrooms (based on Canadian National Occupancy Standard)

UNEMPLOYED_IER

Per cent of people aged 15 years and over who are unemployed

GROUP

Per cent of occupied private dwellings which are group occupied private dwellings

 

The following variables are indicators of advantage. HIGHBED is the strongest indicator of advantage in the index.

IER indicators of advantage

Variable name

Variable description

HIGHBED

Per cent of occupied private dwellings with four or more bedrooms

MORTGAGE

Per cent of occupied private dwellings owning dwelling (with a mortgage)

HIGHMORTGAGE

Per cent of occupied private dwellings paying mortgage greater than $2,800 per month

INC_HIGH

Per cent of people living in households with stated annual household equivalised income greater than $91,000 (approx. 9th and 10th deciles)

UNINCORP

Per cent of dwellings with at least one person who is an owner of an unincorporated enterprise

OWNING

Per cent of occupied private dwellings owning dwelling without a mortgage

 

Excluded variables

The following variable was initially considered for the index but was excluded when the analysis showed that it was a weak indicator of relative disadvantage.

IER excluded indicators

Variable name

Variable description

HIGHRENT

Per cent of occupied private dwellings paying rent greater than $470 per week

 

For more information on the variables, refer to the Technical Paper.

Back to top of the page