Scanner Data in CPI Research and Compilation - PowerPoint PPT Presentation

1 / 21

About This Presentation

Title:

Scanner Data in CPI Research and Compilation

Description:

Improving current CPI methods and practices continuing to collect price data ... Data substitution ceasing direct price collection for some segments of the CPI, ... – PowerPoint PPT presentation

Number of Views:31

Avg rating:3.0/5.0

Slides: 22

Provided by: ct8

Category:

more less

Transcript and Presenter's Notes

Title: Scanner Data in CPI Research and Compilation

1
Scanner Data in CPI Research and Compilation

Guia Alcausin
Michael Anderson
Jonathan Khoo
Ken Tallis

2
2. Opportunities available through Scanner Data
Sets

Improving current CPI methods and practices
continuing to collect price data directly, but
using scanner-based research to tune the design
of the collection, index construction methods and
data treatments (such as editing and imputation)
Data substitution ceasing direct price
collection for some segments of the CPI, and
using scanner data instead
Data augmentation using both directly collected
and scanner data to compile segments of the CPI

3
3. It Is Not Cheap! Costs Include

Acquiring the data either drawing data directly
from individual stores or chains or purchasing
data from a commercial clearing house
Redesigning compilation practices
Retraining price statisticians in the new
compilation practices
Reworking the mathematics of CPI construction
such
as the microindex formulae and the aggregation
tree and
aggregation formulae
Redeveloping computer systems
Understanding the effects of all these changes on
the published CPI and explaining them to users

4
4. Nine Research Themes in ABS Work

Guiding current CPI design and practice
Theme 1 Aggregation tree and index formulae
Theme 2 Sample allocation
Theme 3 Treatment of discontinuities

Data substitution and data augmentation
Theme 4 Frequency and treatment of
discontinuities and quality changes
Theme 5 Volatility
Theme 6 An altered economics of CPI compilation?
Theme 7 Subsampling the scanner data?
Theme 8 What is the true cardinality of a scanner
dataset?
Theme 9 A different theoretical foundation needed?

6
5. Some Features of the Australian CPI

The Australian CPI is built up from
around 1,000 elementary aggregates for each of
the eight capital cities, which are combined to
form
around 90 expenditure classes, which are combined
to form
around a dozen expenditure groups, which are
combined to form
the all-groups CPI

7
6. Some Features of the Australian CPI

Data to construct weights collected at
five-yearly intervals
Quarterly series
Price data collected quarterly or more often
using handheld devices or directly from outlets
purposive not probability sample
10,000 price observations
Geometric means used at lowest level of
aggregation
Laspeyres index at higher levels

8
7. Some Key Properties of Scanner Data

Provide a (quasi) census of purchase transactions
Include data on both prices and quantities
Show observations in almost continuous time
Provide weighting data at the same frequency as
used for the price data

9
8. Properties of the Experimental Scanner Data Set

Obtained from AC Nielsen
65 week period (first 13 weeks used as base
period)
19 grocery commodities
Variables (quantities, prices, commodity-brand,
size and packaging)
Four supermarket chains (over 80 of grocery
sales)

10
9. Theme 1 Aggregation Tree and Index Formulae

Three key questions being asked
(i) Can a better understanding of substitution
between items guide our drawing of commodity
boundaries at the lower levels of the CPI
aggregation tree?
(ii) What index formula should be used at each
level of aggregation?
(iii) Under what conditions can a unit value
index be validly used in CPI compilation?

11
10. Theme 1 Aggregation Tree and Index Formulae

Three key questions being asked
(i) Can a better understanding of substitution
between items guide our drawing of commodity
boundaries at the lower levels of the CPI
aggregation tree?
(Work in progress)

12
Theme 1 contd

(ii) What index formula should be used at each
level of aggregation?
(Confirms geometric mean is best performing
microindex)
(iii) Under what conditions can a unit value
index be validly used in CPI compilation?
(Unit values calculated over items tend to cause
appreciable bias OK to compute over outlets for
same item)

13
11. Theme 2 Sample Allocation

Conclusion greater gains in index quality from
increasing the number of items in sample than
from increasing the number of stores

14
12. Theme 3 Treatment of Discontinuities in
Traditional Data Sources

Question what is the best way of dealing with
missing price observations (e.g. gaps, quality
changes)?
Possibilities using matching observations only,
impute missing data, move forward last
observation, hedonics
(Work in progress)

15
13. Theme 4 Frequency and Treatment of
Discontinuities and Quality Changes in Scanner
Data

(Work in progress)

16
14. Theme 5 Volatility

Finding Scanner data has increased volatility
of indexes (when introduced straight into our
existing compilation practices)
Does this reflect real-world volatility?
Or is it a flaw in conceptual framework for
dealing with high-frequency data?
Change to scanner based index compilation
procedures?

17
15. Theme 6 an Altered Economics of CPI
Compilation

Cost structure would change dramatically would
need to
rethink design
Much larger data set
Higher computer transaction costs
Purchase costs (some field costs)
Reworking of mathematics of CPI construction
Change in compilation practices
Retraining of price statisticians
Redevelopment of computer systems
Objective of theme to understand change in cost
components

18
16. Theme 7 Subsampling the Scanner Data

Objective to reduce costs of using the scanner
data
(Work in progress)

19
17. Theme 8 What is the True Cardinality of the
Data Set?

Some evidence of supermarket chains using common
price schedules for significant parts of the CPI
basket
Exploring the possibility of using price
schedules for parts of the CPI compilation

20
18. Theme 9 a Different Theoretical Foundation
Needed?

Our theoretical tools (mainly the existing
corpus of Cost of Living index theory) are
not fully adequate for the economic behaviours
search, shopping and inventory behaviour that
are incorporated in high-frequency data
(Triplett 2000)

21
19. Conclusions

Scanner data has considerable potential to assist
with CPI compilation
Consensus conclusions are emerging for some
research questions
Costs are not trivial
Research is expensive
Meta analysis would be useful to generalise
research findings. Need a home for relevant
research documents and links. ABS is prepared to
do this through its Ottawa Group secretariat
activities