1264.0 - Language Variables, 1997  
ARCHIVED ISSUE Released at 11:30 AM (CANBERRA TIME) 14/05/1997   
   Page tools: Print Print Page Print all pages in this productPrint All  
Contents >> Main Language Spoken at Home >> Classification and coding

The classification criteria

13. The Australian Standard Classification of Languages (ASCL), Second Edition (ABS cat. no. 1267.0) is used when collecting, aggregating and disseminating data on language usage in Australia. Two classification criteria are used in the ASCL to form the categories of the classification:

      • the relationships between languages owing to their evolution from a common ancestral language (genetic affinity); and
      • geographic proximity of languages in terms of the areas in which they originated. (This refers to the area where a language first became, or was first recognised as, a distinct entity.)

14. Further details regarding the classification criteria used in the language classification can be found in the ASCL.


The standard classification

15. When collecting, aggregating and disseminating data relating to the variable Main Language Spoken at Home, the ASCL is used. The term 'language' is used in the ASCL to describe languages, dialects, pidgins, creoles, and invented and sign languages.

16. With the exception of the Yolngu Matha languages (Narrow Group 82) this classification has a three-level hierarchical structure. For the Yolngu Matha languages, an extra level has been added between the narrow group and the language level of the classification in the second edition. These extra level categories are meaningful and useful groups of base level units which taken together form the narrow group. Nine 'extra level' categories have been included for the narrow group. The extra level is identified by a three-digit code, the first two digits of which identify the narrow group. The most detailed level of the classification consists of 364 (170 Australian Indigenous and 194 non-Australian Indigenous) base or third level units which are languages (as defined above). Included in the 364 base level units are 41 'not elsewhere classified' (n.e.c.) and three 'other' categories, which contain entities that are not listed separately in the classification.

17. The second level of the classification comprises 51 narrow groups of languages which are similar in terms of the classification criteria (genetic affinity and geographic proximity in terms of the areas in which they originated). Included in the 51 narrow groups are seven 'other' categories which consist of languages which do not fit into a particular narrow group.

18. The first and most general level of the classification comprises nine broad groups of languages including one 'other' category. Broad groups are formed by aggregating geographically proximate narrow groups.

19. Further details regarding the standard classification and code structure can be found in the ASCL.


The code structure

20. One, two and four-digit codes are assigned to the first, second and base level units of the classification respectively. The first digit identifies the broad group in which each language or narrow group is contained. The first two digits taken together identify the narrow group in which each language is contained. The four-digit codes represent each of the 364 Languages or base level units.In the exceptional Narrow Group 82 of the ASCL, Second Edition the first three digits identify the third level classification groupings of languages while the four digit code identifies each specific language (see 21 b).

21. The following examples illustrate the coding scheme:

a.Broad Group5Southern Asian Languages
Narrow Group52Indo-Aryan
Language5206Nepali

b.Broad Group8Australian Indigenous Languages
Narrow Group82Yolngu Matha
Extra Level824Dhuwala
Language8245Madarrpa

22. Further details regarding the code structure can be found in the ASCL.

Residual categories and codes

23. In narrow groups, a four-digit code, consisting of the two digits of the narrow group code, followed by the digits '99', is reserved as a residual 'not elsewhere classified' (n.e.c.) or 'other' category. Additionally, in the third level classification groups of Narrow Group 82, the three-digit group code is followed by '9' to denote a 'not elsewhere classified' (n.e.c.) or 'other' category. All languages which are not separately identified in the classification are notionally included in the residual 'n.e.c.' or 'other' category of the narrow group to which they relate.

24. In each broad group, two-digit codes are also reserved for residual categories at the narrow group level. These codes consist of the broad group code followed by '9'. These categories are termed 'other' and consist of separately identified languages which do not fit into any of the narrow groups contained within the broad group, on the basis of the classification criteria.

25. Further details regarding the residual categories 'n.e.c.' and codes used when coding language data can be found in the ASCL.

Supplementary codes

26. The supplementary codes are of two types:

      • four-digit codes ending with one, two or three zeros; and
      • four-digit codes commencing with three zeros.

27. Codes ending in zero are described as 'not further defined' (n.f.d.) codes and are used to code responses to a question about language which cannot be coded to the detailed (language) level of the classification but which can be coded to a higher level of the classification structure.

28. Four-digit codes commencing with '000' are supplementary codes included for operational purposes to facilitate the coding of responses such as inadequately described languages, etc., which present particular problems in that they cannot be allocated a language, narrow group or broad group code.

29. Further details regarding the supplementary codes (eg. 'not further defined' (n.f.d.), 'inadequately described' and 'not stated') used when coding language data can be found in the ASCL.


Scope of the variable

30. The variable Languages Spoken at Home applies to all persons.

Application of the classification to other variables

31. The ASCL can be used for a variety of variables, in particular First Language Spoken, Main Language Other Than English Spoken at Home, Language of Greatest Competency, Languages Spoken at Home and Preferred Language.


Coding procedures

32. Language responses to the Main Language Spoken at Home question are coded to the ASCL using the guidelines detailed in that classification. Each language response is matched with an entry in the ASCL Coding Index to determine the correct code.


Coding indexes

33. A coding index has been developed to assist in the implementation and use of the ASCL and should be used when coding responses to questions relating to Main Language Spoken at Home. It contains a comprehensive list of the most probable responses to questions relating to language and their correct classification codes. Use of the coding index enables responses to be coded accurately and quickly to the appropriate category of the classification.

34. Copies of the coding index sorted in alphabetical and numerical (code) order can be found in the ASCL, Second Edition publication. They can also be found separately in electronic form (.csv and .xls) on the ABS web site (www.abs.gov.au) via this link:

Coding Indexes

or by contacting the Assistant Director, Social Classifications and Standards, Classifications and Data Standards Section, Australian Bureau of Statistics on phone (02) 6252 5736 or e-mail social.classifications@abs.gov.au.



Previous PageNext Page