5202.0 - Spotlight on National Accounts, May 2011  
ARCHIVED ISSUE Released at 11:30 AM (CANBERRA TIME) 26/05/2011   
   Page tools: Print Print Page Print all pages in this productPrint All

SPOTLIGHT ON NATIONAL ACCOUNTS, AUSTRALIA
WHICH IS THE BEST SHORT-TERM MEASURE OF GROSS DOMESTIC PRODUCT?

INTRODUCTION

This spotlight reviews the decision to make GDP(A) Australia's official estimate of GDP in volume terms. The ABS currently publishes four measures of GDP (in chain volume terms) in the Australian National Accounts. These are the:

  • the expenditure based estimate (GDP(E));
  • the income based estimate (GDP(I));
  • the production based estimate (GDP(P)); and
  • the average of the above three measures, (GDP(A)), which is referred to simply as GDP.

The publication of GDP(A) followed research indicating that it was, on balance, the most satisfactory indicator of short term growth (Aspden 1990). (footnote 1) Quarterly estimates of GDP(A) were first published in the June quarter 1990 issue of Australian National Accounts: Gross Product, Employment and Hours Worked (cat. no. 5222.0). Prior to GDP(A) becoming the headline volume measure in December quarter 1991, GDP(I) was the official measure of GDP and a statistical discrepancy between GDP(I) and GDP(E) was shown on the expenditure side. GDP(I) remained the official measure for current price GDP until the implementation of SNA93 in September quarter 1998. Following the final issue of 5222.0 in September quarter 1992, GDP(A) remained the official measure as published in Australian National Accounts: National Income, Expenditure and Product (cat. no. 5206.0).


WHY THE GDP MEASURES DIFFER

In principle, the three measures of GDP should give the same result, but in practice they differ due to limitations of data sources. These limitations include sampling error, reporting error, incomplete coverage in the numerous individual data sources, and variations in the timing of recording of transactions.

Differences between the three alternative measures of GDP are an inevitable consequence of the capacity to derive such measures from combinations of independent data sources. Improvements in the quality of data sources and the use of data confrontation may reduce the magnitude of differences between the series, but such statistical discrepancies can never be eliminated entirely. In Australia's case, a statistical discrepancy was first identified in the national accounts with the compilation of quarterly estimates of GDP(E) and GDP(I) in the 1960's. With the publication of quarterly estimates of GDP(P) for the first time in 1988, it became possible to calculate differences between all three measures and to publish GDP(A).

As seen in Chart 1, the quarterly movements in the three measures of GDP do not exhibit a uniform pattern in relation to each other across time. The discrepancies in all three measures are offsetting at various times meaning that there is no one component of GDP that would be responsible for the statistical discrepancy.

GRAPH 1: PERCENTAGE CHANGES IN GDP CHAIN VOLUMES, Seasonally Adjusted
Graph: This graph shows the seasonally adjusted percentage change movements of GDP(E), GDP(I), GDP(P) and GDP(A) from September 2007 to December 2010.


OVERSEAS PRACTICES

Australia is one of only a few OECD member countries for which independent quarterly measures of GDP(E) and GDP(I) are produced, and is one of only three countries that publish these and an independent estimate of GDP(P) (the other two being Canada and the United Kingdom). The United States Bureau of Economic Analysis, for example, produces only two independent measures, that is, GDP(E) and GDP(I), while New Zealand produces independent estimates of GDP(E) and GDP(P).


ANALYSING THE GDP MEASURES

The short-term fluctuations in a seasonally adjusted time series are commonly referred to as its irregular component. The irregular components of the GDP measures can be considered to have two parts: measurement error and an actual irregular. To give the clearest possible picture of what is really happening in the economy, it is desirable to minimise measurement error.

A way of filtering out the measurement error is to apply a moving average to seasonally adjusted estimates to obtain a trend estimate (Chart 2). However, the moving averages used to derive trend estimates cannot distinguish between measurement error and actual irregulars, and both types of movement are smoothed out. For example, the downturn resulting from the global financial crisis, where the main impact on GDP occurred in the December quarter 2008, is spread over a number of neighbouring quarters in the trend estimates of GDP. Another problem that leads to revisions occurs at the end of a trend series. Trend estimates are updated as more observations become available for the moving average. This is because weights of the observations at the end of series change as more observations are added.

GRAPH 2: PERCENTAGE CHANGES IN GDP CHAIN VOLUMES, Trend
Graph: This graph shows the trend percentage change movements of GDP(E), GDP(I), GDP(P) and GDP(A) from September 2007 to December 2010.



Another way of obtaining a smoother measure of GDP is to average the three seasonally adjusted chain volume measures of GDP to obtain GDP(A). Positive and negative measurement errors in each of the independent measures of GDP are cancelled out to some extent in GDP(A). If each independent measure of GDP had the same degree of measurement error and the measurement errors were uncorrelated with each other, then the variance of the measurement error of the average would be a third of that of each individual measure. Unlike a trend estimate, only the measurement error is smoothed out in GDP(A). Also, unlike trend estimates, GDP(A) is subject to revision only when the seasonally adjusted estimates of the independent GDP measures are revised. Thus, there is a prima facie case for considering whether GDP(A) is a useful additional measure of GDP.


QUALITY ASSESSMENT AND RESULTS

There are three principal criteria available to assess the short-term movements of the seasonally adjusted estimates of the three measures of GDP and their average. These are timeliness, accuracy and reliability. They are defined as follows:
  • timeliness is measured by the length of time between the end of the reference period and the time of publication;
  • accuracy is determined by the proximity of an estimate to a notional true value; and
  • reliability is determined by the proximity of successive estimates for a particular period to the 'final' estimate for that period, where the 'final' estimate is not necessarily an accurate estimate.

Timeliness

There is often a trade-off between timeliness versus accuracy and detail. All four measures of GDP are released approximately two months after the reference period for Australia. Around half of the 34 OECD countries release their national accounts with a similar lag to Australia. The International Monetary Fund (IMF) Special Data Dissemination Standard (SDDS) requires that countries release data for GDP and major expenditure aggregates/or by productive sectors no later than 90 days after the reference quarter.

Accuracy

The accuracy of the GDP estimates cannot be assessed definitively since the true value of GDP is unknown. However, two quantitative comparisons of accuracy are available. The first is to determine which measures of GDP are least variable in terms of quarter-to-quarter movements and the second is to determine which measure of GDP is a leading indicator in economic growth and recession cycles.

Variability
    A comparison of the variability of quarter-to-quarter movements is a useful quantitative comparison since low variability indicates less erratic data. If one measure of GDP is less variable than another, it does not always mean that it is better. The lower variability could have been achieved due to source data limitations, where some components may have been imputed using relatively smooth indicators. Nevertheless with all else being equal, a smooth measure of GDP implies that it is subject to less measurement error than a more erratic measure. Table 1 presents results of a variability test which show the average of quarterly movements without regard to sign from September quarter 1974 to December quarter 2010.

    TABLE 1. MEASURES OF VARIABILITY, September quarter 1974 to December quarter 2010 (a)


    Average absolute
    percentage movement in

    seasonally adjusted estimate
    Average absolute
    percentage change in trend (b)
    Irregular as a proportion
    of previous period trend

    GDP(E)
    1.10
    0.86
    0.41
    GDP(I)
    1.06
    0.86
    0.39
    GDP(P)
    1.01
    0.89
    0.28
    GDP(A)
    1.00
    0.86
    0.27

    (a) includes average percentages without regard to sign.
    (b) Trend is calculated using a 7-term Henderson moving average


    In line with the results obtained by Aspden(1990), Table 1 shows GDP(A) is the smoothest measure of GDP. It has the lowest average variation in the seasonally adjusted series and equal lowest variation in the trend series. GDP(A) also has the lowest value of variation in the irregular component being slightly ahead of GDP(P). This indicates that GDP(A) and GDP(P) quarter to quarter movements remain better indicators of underlying trend than are the movements in GDP(I) and GDP(E). Aspden (1990) also examined the distribution of the magnitude of quarterly movements without regard to sign as they were in March quarter 1990.

    The earlier analysis discussed why GDP(P) should be less variable than either GDP(I) or GDP(E). The main factor noted was related to timing issues. Timing issues arise when transactions occurring in a particular quarter are inadvertently allocated to a neighbouring quarter. Timing problems are likely to be more acute for GDP(I) and GDP(E) than for GDP(P).

    Timing discrepancies can occur in the compilation of GDP(E) through lags in tracing transactions through the economy. On the arrival of an imported good into Australia it will be debited to imports and credited to wholesale stocks. Then, when it is sold to a retailer, it will be debited to wholesale stocks and credited to retail stocks. When it is sold to the final purchaser, it will be debited to retail stocks and credited to one of the expenditure aggregates (either final consumption expenditure or gross fixed capital formation). For each transaction, the credit and debit entries should have the same value and be recorded in the same quarter, otherwise a measurement error will occur. The good itself makes no contribution to GDP, but the chain of services provided in delivering it to its final purchaser does.

    Similar potential timing problems occur with the compilation of GDP(P) as it is defined to be gross output less intermediate input. Timing inconsistencies in recording output and intermediate input would lead to measurement error in value added. However, the method of double deflation (that is, deriving volume estimates of value added for an industry by subtracting a volume estimate of intermediate input from a volume estimate of gross output) is used only to derive quarterly volume estimates of Gross farm value added. For all the other industries, quarterly volume estimates are derived by extrapolating value added estimates by a volume indicator. It is assumed that the gross value added grows at the same rate as the volume measure of output (gross output method), but changes in the ratio of value added to output can cause a discrepancy. Apart from agriculture, the potential short-term timing issues are avoided, although it can be argued that the use of extrapolative methods in non-farm production can be inferior when examining longer term movements.

    The original article noted that GDP(I) had an advantage over GDP(E) as an indicator of recent short-term movements since it was less reliant on the ABS business register. There is a continuous flow of business births, deaths and re-organisations and there are inevitable lags between these events and their notification to the ABS to enable the register to be updated. While, as a proportion of GDP the effect of this may be small, it can be significant in terms of GDP growth, particularly when the economy is in an expansion phase. However, more recent estimates of GDP(I) are more heavily dependent on the business register. Private sector compensation of employees, gross operating surplus and gross mixed income, which comprise more than 70% of GDP(I), now use indicators derived from the Quarterly Business Indicator Survey (QBIS). This survey is based on the business register, while the proportion of GDP(E) using the business register remains at about 40%. This relates mainly to estimates of capital formation and the retail component of household final consumption. The continuing advantage of GDP(I) over GDP(E) could lie in improvements to the updating of the business register since 1990.

    Sampling and non-sampling errors can occur within the source data used to compile the various measures of GDP. Sampling error exists because a sample and not the entire population is being observed, while non-sampling errors relate to other issues such as question framing and accuracy of reporting by respondents.

    Aspden (1990) noted that the quality of the volume indicators for a number of service industries in gross value added and GDP(P) were weak. Hours worked was used as a proxy for output for some service industries. Recently there has been much greater use made of output measures from QBIS for estimating volume estimates for service industries. This has potentially improved the quality of the GDP(P) volume measure.

    Leads and lags

    A variable is said to lead another if it consistently changes before the other variable. Leading indicators provide information that may help predict the turning points of GDP at either the commencement or the end of a business cycle. Similarly, a lag indicator would consistently show changes in its cycle after other indicators have changed.

    Aspden (1990) used two methods to examine the possibility of leads and lags:
    • phase shift analysis of the GDP(E), GDP(I) and GDP(P) seasonally adjusted time series; and
    • growth cycle turning point analysis of movements in seasonally adjusted GDP(E), GDP(I) and GDP(P) compared with GDP(A).

    In phase shift analysis the attributes of component waves were correlated across the three independent measures of GDP in terms of their duration of wavelength, height of amplitude and their timing or phase. The results indicated that the attributes of the three independent measures coincided and there was no significant difference in most cycles from September quarter 1974 to June quarter 1990.

    From the phase shift and growth cycle turning point comparisons, it was found that the three independent measures of GDP are correlated most of the time. However, GDP(P) showed less irregular movement in the growth cycle than the other measures as also indicated by the tests of variability.

    Turning point analysis on three recent economic cycles is presented below in Table 2. The results show that GDP(P) led GDP(A) by one quarter in the cycle peak of March quarter 2007, but GDP(A) led GDP(E) and GDP(P) by one quarter at the troughs in March quarter 1991 and December quarter 2008, respectively.

    TABLE 2. GROWTH CYCLE TURNING POINTS


    Leads(+) and lags(-) in quarters compared to GDP(A)

    GDP(A) turning points
    GDP(E)
    GDP(I)
    GDP(P)

    PeakJun 1989
    -
    -
    -
    TroughMar 1991
    -1
    -
    -
    PeakDec 1999
    -
    -
    -
    TroughSep 2000
    -
    -
    -
    PeakMar 2007
    -
    -
    1
    TroughDec 2008
    -
    -
    -1


    Reliability

    A desirable property of a statistical measure is reliability, that is, a low susceptibility to revision. However, the desire for reliability needs to be qualified by the need for accuracy as well. The ABS policy has been to revise published estimates as better source data become available. However, there have been some changes to revisions policy since the introduction of Supply and Use Tables.

    Aspden (1990) focussed upon the revision characteristics of GDP(E), GDP(I) and an average of these two measures. (footnote 2) The mean revision and mean absolute revision of quarterly percentage movements from initial to latest observations were examined for each series. The results showed that GDP(E) had both a greater mean revision and mean absolute revision than GDP(I). This was attributed to GDP(E)'s greater reliance on the business register.

    An updated revisions analysis was published in Information Paper: Quality Dimensions of the Australian National Accounts, 2007 (cat. no. 5216.0.55.002). This utilised quarterly data from March quarter 1988 to June quarter 2002. Table 3 below shows mean revision and mean absolute revision for the percentage movement from initial release to three years later for all three measures of GDP and their average. The results for the mean revision support the original findings with GDP(I) being slightly better than GDP(E). However, a notable result was that GDP(A) had the lowest mean absolute revision of all measures.

    TABLE 3. GDP, Initial estimates versus estimates 3 years later, March quarter 1988 to June quarter 2002

    Mean revision (% points)

    Mean absolute revision (% points)

    Full:
    Jun 1988
    to
    Dec 2002
    Period 1:
    Jun 1988
    to
    Dec 1992
    Period 2:
    Mar 1993
    to
    Dec 1997
    Period 3:
    Mar 1998
    to
    Dec 2002
    Full:
    Jun 1988
    to
    Dec 2002
    Period 1:
    Jun 1988
    to
    Dec 1992
    Period 2:
    Mar 1993
    to
    Dec 1997
    Period 3:
    Mar 1998
    to
    Dec 2002
    Number
    of
    sign
    changes

    GDP(E)
    0.09
    0.05
    0.10
    0.10
    0.60
    0.89
    0.44
    0.50
    8
    GDP(I)
    0.07
    0.10
    0.10
    0.03
    0.51
    0.44
    0.45
    0.64
    6
    GDP(P)
    0.16
    0.26
    0.13
    0.10
    0.44
    0.55
    0.40
    0.37
    6
    GDP(A)
    0.10
    0.14
    0.10
    0.05
    0.37
    0.45
    0.35
    0.31
    3



    The information paper compared the correlation of the three independent measures against GDP(A) over the 75 quarters. GDP(P) was found to be the most highly correlated measure against GDP(A) with a coefficient of 0.87, while GDP(E) and GDP(I) had coefficients of 0.77 and 0.78, respectively. The GDP(E), GDP(I) and GDP(P) measures moved in the same direction in 80% of quarters. In 75% of quarters the disparity in growth rates between the strongest and weakest components was greater than 0.5 percentage points and in 90% of quarters it was greater than 0.3 percentage points. If only the two most consistent components at each point in time are taken into account, the coherence improves as only 30% of quarters had a difference greater than 0.3 percentage points.


    CONCLUSION

    On the basis of variability, GDP(A) is the best performing measure. GDP(A) is marginally less irregular than GDP(P) and is appreciably less irregular than GDP(I) or GDP(E). GDP(A) encompasses all the data measures, which appear to display partially offsetting measurement errors at various times. GDP(P) is less subject to timing problems than either GDP(E) or GDP(I).

    A comparison of growth-cycle turning points and phase-shift analysis reveals that there is little difference between the different measures of GDP, as growth cycles coincide in most cases and no one measure of GDP consistently leads the others.

    In examining mean revisions, GDP(I) is slightly more reliable than the other measures. However, GDP(A) is superior in terms of the mean absolute revision.

    The available data indicates that GDP(A) is the better indicator of short term movements, given its low variability and high reliability. This analysis supports the use of GDP(A) as the official measure of GDP.


    FOOTNOTES

    1. Aspden, C., "Which is the best short-term measure of Gross Domestic Product?" originally published in Australian National Accounts: National Income and Expenditure, June Quarter 1990, Australian Bureau of Statistics, cat. no. 5206.0, pp. 57-65, <https://www.ausstats.abs.gov.au/ausstats/free.nsf/0/A883157CBF2E2E21CA2574DC0016C234/$File/52060_1990_JUN.pdf>. (back)

    2. GDP(P) and GDP(A) could not be included in this analysis because of the short time that the GDP(P) series had been available. (back)


    REFERENCES

    Aspden, C. 1990, "Which is the best short-term measure of Gross Domestic Product?" originally published in Australian National Accounts: National Income and Expenditure, June Quarter 1990, Australian Bureau of Statistics, cat. no. 5206.0, pp. 57-65, <https://www.ausstats.abs.gov.au/ausstats/free.nsf/0/A883157CBF2E2E21CA2574DC0016C234/$File/52060_1990_JUN.pdf>.

    Australian Bureau of Statistics, Information Paper: Quality Dimensions of The Australian National Accounts, 2007, cat. no. 5216.0.55.002, Chapter 5 Accuracy of the national Accounts, pp 21-31, <https://www.abs.gov.au/AUSSTATS/abs@.nsf/DetailsPage/5216.0.55.0022007?OpenDocument>