Professional Documents
Culture Documents
Marketing Science
Publication details, including instructions for authors and subscription information:
http://pubsonline.informs.org
This article may be used only for the purposes of research, teaching, and/or private study. Commercial use
or systematic downloading (by robots or other automatic processes) is prohibited without explicit Publisher
approval, unless otherwise noted. For more information, contact permissions@informs.org.
The Publisher does not warrant or guarantee the article’s accuracy, completeness, merchantability, fitness
for a particular purpose, or non-infringement. Descriptions of, or references to, products or publications, or
inclusion of an advertisement in this article, neither constitutes nor implies a guarantee, endorsement, or
support of claims made of that product, publication, or service.
© 2004 INFORMS
INFORMS is the largest professional society in the world for professionals in the fields of operations research, management
science, and analytics.
For more information on INFORMS, its publications, membership, or meetings visit http://www.informs.org
informs ®
Vol. 23, No. 3, Summer 2004, pp. 317–334
issn 0732-2399 eissn 1526-548X 04 2303 0317 doi 10.1287/mksc.1040.0061
© 2004 INFORMS
Peter S. H. Leeflang
Faculty of Economics, Department of Marketing and Marketing Research, University of Groningen, P.O. Box 800,
9700 AV, Groningen, The Netherlands, p.s.h.leeflang@eco.rug.nl
Dick R. Wittink
Yale School of Management, Yale University, P.O. Box 208200, New Haven, Connecticut 06520-8200, and
Faculty of Economics, University of Groningen, Groningen, The Netherlands, dick.wittink@yale.edu
S ales promotions generate substantial short-term sales increases. To determine whether the sales promotion
bump is truly beneficial from a managerial perspective, we propose a system of store-level regression models
that decomposes the sales promotion bump into three parts: cross-brand effects (secondary demand), cross-
period effects (primary demand borrowed from other time periods), and category-expansion effects (remaining
primary demand). Across four store-level scanner datasets, we find that each of these three parts contribute
about one third on average. One extension we propose is the separation of the category-expansion effect into
cross-store and market-expansion effects. Another one is to split the cross-item effect (total across all other
items) into cannibalization and between-brand effects. We also allow for a flexible decomposition by allowing
all effects to depend on the feature/display support condition and on the magnitude of the price discount. The
latter dependence is achieved by local polynomial regression. We find that feature-supported price discounts
are strongly associated with cross-period effects while display-only supported price discounts have especially
strong category-expansion effects. While the role of the category-expansion effect tends to increase with higher
price discounts, the roles of cross-brand and cross-period effects both tend to decrease.
Key words: econometric models; market response models; sales promotion; regression and other statistical
techniques
History: This paper was received December 22, 2000, and was with the authors 21 months for 3 revisions;
processed by Scott Neslin.
run results (Pauwels et al. 2002). However, none of posed into 74% brand choice elasticity, 11% purchase
the current decomposition models answers the key incidence elasticity, and 15% quantity elasticity. The
question: If the promoted brand gains 100 units, how brand choice elasticity percentage is the part of the
many units do other brands loose, how many units bump representing consumers switching away from
come from other periods, and how many units rep- other brands keeping everything else constant. Thus,
resent category expansion? The answer is crucial for if a promoted brand gains 100 units, the other brands
a determination of the profitability of promotions as together would loose 74 units if the category does
Downloaded from informs.org by [132.239.1.231] on 28 August 2015, at 04:26 . For personal use only, all rights reserved.
et al. 1990, Gönül and Srinivasan 1996, van Heerde et al. 2001). To illustrate, it seems plausible for dis-
et al. 2000, Macé and Neslin 2004) and lagged effects counts that are only communicated within a store
(e.g., van Heerde et al. 2000).3 Category expansion (e.g., by a display) to have strong cross-brand sales
is the part of the primary demand effect that is not effects. By contrast, discounted items with out-of-store
due to temporal shifts, and it may include increased support (i.e., feature) have greater potential to create
consumption (Assunçao and Meyer 1993, Ailawadi cross-store sales effects. To obtain a separate decom-
and Neslin 1998, Chandon and Wansink 2002), deal- position of the price-discount effect for different sup-
Downloaded from informs.org by [132.239.1.231] on 28 August 2015, at 04:26 . For personal use only, all rights reserved.
to-deal purchasing (Krishna 1994), category switch- port conditions, we create four separate predictor
ing (Walters 1991), and store switching (Bucklin and variables for price discounts: (1) without support,
Lattin 1992). (2) with feature-only support, (3) with display-only
support, and (4) with feature-and-display support. We
2.3. Additional Decomposition Effects do not provide a decomposition of the effects of non-
One additional decomposition effect of interest is the price promotions (such as features without price cuts),
separation of the cross-store effect from the category- because these occur very rarely in our data.
expansion effect. Cross-store effects are related to The decomposition may also depend on the mag-
store switching, which has two variants (Bucklin and nitude of a price discount. For example, Gupta
Lattin 1992). The first is “direct” store switching that and Cooper (1992) show that incremental stockpil-
pertains if households use outside-store cues (e.g., fea- ing approaches zero when the price-discount level
tured price cuts) to decide where to purchase specific exceeds some value. Sethuraman (1996), Sethuraman
items. Direct store switching leads to net decreases in et al. (1999), and van Heerde et al. (2001) show that
item sales in stores competing with the store promot- cross-brand effects also strongly depend on the mag-
ing the item. The second is “indirect” store switching, nitude of the price discount. Consumers could have
which applies if a household visits multiple stores different reservation prices for the separate decisions
in a given week, and inside-store cues (e.g., displays underlying the decomposition. By using a nonpara-
with price promotions) influence which items are pur- metric method (local polynomial regression; see §3.4)
chased in the different stores (Bucklin and Lattin we obtain discount-dependent decomposition effects.
1992, Neslin 2002). Indirect store switching does not Importantly, the nature of nonlinearity in the discount
imply that store choice is driven by in-store activi- effect is allowed to differ between the four support
ties. Instead, it requires autonomous cross-shopping, conditions.
which is common for many households (Keng and
Ehrenberg 1984). Kumar and Leone (1988) find evi- 2.5. Store-Level Data
dence of indirect store-switching effects, because price To decompose the promotional sales spike, we use
cuts and displays in one store decrease sales in an store-level scanner data instead of household data.
adjacent store. We show how the category-expansion This choice is driven by the goal of this paper, which
effect within a store can be split into a cross-store is to propose a simple yet managerially relevant
effect and a market-expansion effect (the category- decomposition method that is applicable to widely
expansion effect net of the cross-store effect) with data available store data. In addition, store data are in
on multiple chains in a metropolitan area. general more representative than household data (cf.,
With SKU-level data, we can split the cross-item Gupta et al. 1996, Bucklin and Gupta 1999). Further,
effect (effect on other SKUs) into cannibalization and we want the method to be applicable for all cat-
cross-brand effects (net sales losses of nonpromoted egories, including low-incidence product categories,
items belonging to other brands). This distinction is where store data offer better coverage than household
especially relevant for manufacturers. data. Finally, firms typically have access to and anal-
ysis capabilities only for aggregate data.
2.4. Decomposition as a Function of Promotion One important benefit of household data is that
Characteristics one can obtain insight into the underlying consumer
To evaluate price promotions we create a model responses (incidence, choice, and quantity) due to pro-
that accomplishes the decomposition separately for motions (see, e.g., Zhang and Krishnamurthi 2004).
four alternative support conditions. This is important This decomposition is not available from store data,
because the price-discount effect is moderated by the because it is impossible to recover individual house-
type of support (Lemon and Nowlis 2002, van Heerde hold decisions. In addition, it may be useful to under-
stand to what extent households show responses
3
that cancel each other. For example, a nonpromot-
The cross-period effects are based on retail sales to consumers. In
sales from manufacturers to retailers there may also be promotion-
ing brand may gain 100 units due to an increase of
induced intertemporal shifts due to forward-buying by retailers customers in the category, induced by a promotion
(Drèze and Bell 2003). for another brand, but lose 100 units due to brand
van Heerde, Leeflang, and Wittink: Decomposing the Sales Promotion Bump with Store Data
320 Marketing Science 23(3), pp. 317–334, © 2004 INFORMS
Data
1 −100 400 — — 0
2 −100 400 1000 −1500 0
Downloaded from informs.org by [132.239.1.231] on 28 August 2015, at 04:26 . For personal use only, all rights reserved.
The promotion variable is a 0 −1 dummy in the variable. Excluding them would lead to misspecified
example, representing promotion at time t (Promt ). models, with biased effect estimates for the decompo-
Because pre- and post-promotion effects influence sition as we show in the bottom panel of Table 1. All
own-brand and cross-brand sales in other periods, we intercept and slope parameter estimates now differ
also need a lead promotion dummy (Promt+1 ) and a from the correct values, and the mathematical consis-
lagged promotion dummy (Promt−1 ) as covariates. In tency property is also violated.
addition, because PPCSt and TCSt span the period For real-world applications with noisy data, we
t − 1 t + 1, they may also be affected by a two- employ more complex models to obtain the desired
week lead (Promt+2 ) and by a two-week lagged pro- decomposition. However, the principle illustrated
motion dummy (Promt−2 ). Although these two extra in Table 1 remains the same. In general, there
variables are redundant in the equations for OBS and are J brands, the time window for which we
CBS (confirmed by the zero coefficients), we include define cross-period and category-expansion effects is
the same predictors in all equations to ensure a math- [t − T ∗ , t + T ], and we estimate the models pooled
ematically consistent decomposition: the sum of the across stores i = 1
I. Because stores differ in
estimated decomposition effects necessarily equals
size, and it is inappropriate to assume equal absolute
the own-brand effect.
unit sales effects across stores, we transform all cri-
We show the OLS results for the four multiple
terion variables so as to obtain equal effects across
regressions in the middle panel of Table 1. The coeffi-
stores proportional to category volume.5 Thus, we
cients for Promt in the four equations give the desired
divide all criterion variables by the average category
decomposition: the own-brand effect is 100 units,
the cross-brand effect is 35 units, the cross-period sales per store (CSi ).6 If Sijt is unit sales of brand j in
effect is 40 units, and the category-expansion effect
is 25 units. Thus, we obtain a consistent decomposi- 5
Such a proportionality assumption is implicit in the multiplicative
tion by estimating four regressions, and using only models commonly applied to pooled store data.
one coefficient from each regression. Importantly, the 6
The CSi variable includes both regular and promotional sales.
covariates (lead and lagged variables) are required Thus, this variable might depend on the promotional intensity in
to obtain unbiased effects of the current promotion store i. In §5.1, we show that this issue is of limited concern.
van Heerde, Leeflang, and Wittink: Decomposing the Sales Promotion Bump with Store Data
322 Marketing Science 23(3), pp. 317–334, © 2004 INFORMS
s=−T ∗ k=1
CSi s=−T ∗ k=1
CSi
=1 =1
s=0 (2)
These definitions allow us to write
4
4
3
OBSijt = CBSijt + PPCSit + TCSit
(1) CBSijt = j + cb lj PIijlt +
1 lj CPIijlt +
2 mj Dijmt
l=1 l=1 m=1
We use a price index variable to separate promo-
3
tional price from regular price effects. This variable is +
3
mj CDijmt + 4j RPijt + 5j CRPijt
defined as the actual price for an item divided by its m=1
regular price. It equals one in nonpromotional weeks Tmax −T −T ∗ +T ∗
T
and is less than one if the actual (shelf) price is below +
6 j Wt +
7 j PIijlt+
the regular price. If the regular price changes, then the =T +T ∗ +1 =1
price index is defined relative to the new regular price +T ∗
T +T ∗
T
(and it remains one if only the regular price changes). +
8 j PIijlt− +
9 j CPIijlt+
Thus, the price index captures temporary price dis- =1 =1
counts only. This variable is also used in ACNielsen’s +T ∗
T
Scan∗ Pro model (Wittink et al. 1988). + 10 j CPIijlt− + uijt (3)
Typical sales promotions consist of temporary price =1
4
4 +T ∗
T +T ∗
T
OBSijt = j + ob lj PIijlt + 1 lj CPIijlt + 8 j PIijlt− + 9 j CPIijlt+
l=1 l=1 =1 =1
3
3 +T ∗
T
+ 2 mj Dijmt + 3 mj CDijmt + 4j RPijt + 10 j CPIijlt− + uijt (5)
m=1 m=1 =1
van Heerde, Leeflang, and Wittink: Decomposing the Sales Promotion Bump with Store Data
Marketing Science 23(3), pp. 317–334, © 2004 INFORMS 323
for i = 1
I (stores), j = 1
J (brands), and Our approach expands on this framework by writing
t = T + T ∗ + 1
Tmax − T − T ∗ (weeks), where current category sales as −PPCSit − TCSit , which
PIijlt = price index for brand j in store i in week t; allows us to separate the primary demand effect into
l = 1 indicates “no support,” l = 2 feature-only
support, l = 3 display-only support, and l = 4 feature- ce lj
fraction category-expansion effect =
and-display support; PIijlt equals 1 − d/100 if there is ob lj
a d percent discount for brand j with support l in
and
Downloaded from informs.org by [132.239.1.231] on 28 August 2015, at 04:26 . For personal use only, all rights reserved.
+ cross-store effect
To estimate these effects, we regress respectively
CBwijmt and CBbijt on the same set of predictors.
To estimate the cross-store and market-expansion
effects, we regress respectively SCSijt and ETCSijt on 3.4. Estimation Methods for Constant and Flexible
exactly the same set of predictors as in previous equa- Decompositions
tions (see (2)–(5)). We first discuss the estimation method to obtain a
decomposition that is constant across price discount
3.3. Extended Decomposition of the Cross-Item depths. We then relax this constraint and allow all
Effect effects to depend on the magnitude of the price dis-
Extant decomposition research does not distinguish count. We use linear regression models for the con-
between cross-item effects within brands (cannibal- stant decomposition in (2)–(5) and for the extended
ization) and cross-item effects between brands. With decompositions in (7) and (8). We note that it is
SKU-level data it is possible to separate these effects. impossible to obtain a logically consistent unit-sales-
Define SKU m belonging to brand j m = 1
Mj , based decomposition for the semilog or log-log mod-
to distinguish cross-item effects that involve SKUs of els that are common for store data (e.g., Wittink et al.
the same brand from those of other brands. We define 1988, Blattberg and Wisniewski 1989, van Heerde
the sales of SKU m analogous to (1), i.e., also relative et al. 2000). However, a multiplicative model does
to average category store sales CSi : provide a mathematically consistent elasticity decom-
position.8 Note that we obtain one critical property of
J Mk
Sijmt Siknt
OBSijmt = − CBbijt = 8
In an earlier version of this paper we decomposed elasticities and
CSi k=1 n=1
CSi
k=j found the average cross-brand elasticity fraction to be almost 80%
van Heerde, Leeflang, and Wittink: Decomposing the Sales Promotion Bump with Store Data
Marketing Science 23(3), pp. 317–334, © 2004 INFORMS 325
across stores, by scaling all criterion variables by CSi . + 8 j PIijlt− + 9 j CPIijlt+
=1 =1
We relax the linearity assumption later by nonpara-
+T ∗
T
metric estimation of the price-discount effects in a
+ 10 j CPIijlt− + uijt
manner that still yields a mathematically consistent =1
decomposition.
For the constant decomposition of the own-brand where mob lj PIijlt is a nonparametric function. We
Downloaded from informs.org by [132.239.1.231] on 28 August 2015, at 04:26 . For personal use only, all rights reserved.
effect into cross-brand, cross-period, and category- replace (3)–(5) by analogous specifications (same
functional form, same set of predictors) to obtain
expansion effects, we estimate Equations (2)–(4)
mcb lj PIijlt , mcp lj PIijlt , and mce lj PIijlt . We use local
simultaneously across all items in a category. Equa-
linear regression to estimate the nonparametric func-
tion (5) is redundant, because the parameter estimates
tions (Fan 1992). The key benefit of local linear regres-
can be derived from (2)–(4). We use iterative SUR-GLS sion is that it allows for any degree of nonlinearity
to allow for contemporaneous correlation between while maintaining the mathematical consistency of
the errors for all decomposition components for all the decomposition for each level of the price index
brands (the SUR-part). In addition, we allow for first- variable (see the appendix):
order autocorrelation for all error terms (the GLS-
part). For the criterion variable in Equation (4), this mob lj PIijlt = mcb lj PIijlt + mcp lj PIijlt + mce lj PIijlt
brands. Macé and Neslin (2004) analyzed more than 28 stores of one supermarket chain in a metropolitan
30,000 SKUs in 83 stores and obtained “similar esti- area. Due to the inclusion of 12 lead and 12 lagged
mates of stockpiling or deceleration elasticity whether effects the effective sample size for each item is
they were calculated based on 4, 5, or 6 period lags.” 28 ∗ 104 − 2 ∗ 12 = 2240. In this sample, brands 1–4
Nijs et al. (2001) studied dynamic promotion effects used price promotions (see Table 3) while brand 5 did
at the category level for 560 product categories. They not. Therefore, we can only estimate price-promotion
had comparable elasticity estimates for four, six, and effects for brands 1–4. We do, however, include the
Downloaded from informs.org by [132.239.1.231] on 28 August 2015, at 04:26 . For personal use only, all rights reserved.
eight lags, and noted that the impulse response func- sales data for brand 5 in the criterion variables rele-
tions for these three alternative lag specifications are vant to the decomposition: CBSijt , PPCSit , and TCSit .
highly correlated. Finally, Pauwels et al. (2002) report Thus, brand 5 may experience a change in sales when
90% duration intervals for post-promotion effects other brands are promoted, and this change is part
on incidence, choice, and quantity that range from of the decomposition. We note that the tuna dataset
zero to eight weeks. The average duration interval is the only one for which we can use the extended
is approximately two weeks. Thus, it appears that decomposition of the category-expansion effect into
a time window choice of t − 6 t + 6 is justified. cross-store and market-expansion effects, because we
Nevertheless, we also estimate models with a larger have data for stores of other chains located in the
time window t − 8 t + 8 as well as smaller win- same geographic area. For the estimation of cross-
dows, and we report on the sensitivity of results store effects, we assume that within-chain cross-store
below (§5.2). effects are zero.
The second dataset (52 weeks) pertains to the six
4. Data largest national brands in the tissue product category
We apply the models to two American (tuna, tis- in the United States. The data (also from ACNielsen)
sue) and two Dutch (shampoo, peanut butter) weekly, are from 24 widely dispersed stores located in the
store-level, scanner datasets. The tuna, tissue, and eastern United States. For some brands the number
shampoo datasets are at the brand level, obtained of price promotions is very small (see Table 3). For
via aggregation across SKUs with homogeneous mar- example, brand 6 has only one price promotion sup-
keting activities. We provide descriptive statistics in ported with display-only. As a result, we anticipate
Tables 2 and 3. The first American dataset, from that the estimation of some decomposition effects for
ACNielsen, contains five brands in the 6.5 oz. canned tissue will be difficult, and we report results only for
tuna fish product category. For the standard decom- promotion variables with at least 24 price promotion
position, we have 104 weeks of data for each of the observations (there are 24 stores).
Price promotions Price promotions with Price promotions with Price promotions with
without support feature-only support display-only support feature-and-display support
Avg Min Max Avg Min Max Avg Min Max Avg Min Max
Category Item # % %a % # % % % # % % % # % % %
The first Dutch dataset (from ACNielsen) consists types. We use a pooling test to determine whether it is
of 11 shampoo brands, of which the five largest appropriate to postulate one common effect across the
engage in promotional activities. For this product cat- support types. The tests show that for all brands this
egory we have 109 weekly observations for almost homogeneity assumption is rejected p < 0
01. Hence,
all 48 stores in a national sample from one large we allow for separate effect sizes for the different sup-
supermarket chain. This provides a net sample size port conditions.
of 4,043 observations for each of the five brands. As We show the standard decomposition of the own-
Downloaded from informs.org by [132.239.1.231] on 28 August 2015, at 04:26 . For personal use only, all rights reserved.
before, the six other brands are included in relevant brand sales effect into cross-brand, cross-period, and
criterion variables for the decomposition. category-expansion effects in Table 4. The fit of all
The second Dutch dataset (from ACNielsen) con- standard decomposition models is excellent.9 The
tains five items of peanut butter. In contrast to the R-square for the whole SUR-system (Judge et al. 1985)
other three datasets, this dataset contains multiple ranges from 0.90 to 0.96. Across all brands in the
SKUs per brand, which enables us to separate four categories, 98% of all own-brand effects, 70% of
the secondary demand effect into within-brand and all cross-brand effects, 46% of the category-expansion
between-brand effects, as in Equation (8). For this effects, and 33% of the cross-period effects are statisti-
product category we have 144 weekly observations cally significant (two-tailed, p < 0
05), as indicated in
for almost all 49 stores in a national sample from the last row of the top panel in Table 4. The reduced
one large supermarket chain. The net sample size is frequency of significant primary demand effects is
5,519 observations. Three SKUs experienced price dis- consistent with Neslin et al. (1985, Figure 2), who find
counts. Data on the two remaining SKUs are included reduced power in models of purchase quantity and
in the appropriate decomposition criterion variables. interpurchase time. Similarly, Bell et al. (1999) report
We note that the number of SKUs is modest so a low signal-to-noise ratio for the quantity portion of
that we use a separate intercept for each SKU. With primary demand effects in the elasticity decomposi-
a larger number of SKUs we would use attribute- tion based on household data. We note that although
specific intercepts (see Fader and Hardie 1996). the promotional frequency for the tuna data is high
Table 3 shows that there is a lot of variation in (see Table 1), we still find that 63% of the cross-period
the price discounts offered by the individual items. effects are significant for this category.
For example, tuna item 1 was price promoted with- The results shown here are the simple aver-
out support in 520 store-weeks. The discount was 23% ages of brand-specific estimates (independent of the
on average with a range of 5% to 51%. With feature- p-values). In the first column of Table 4 we show the
only support the discount range is 6% to 47%, with average result for the own-brand sales Equation (2).
display-only 6% to 44%, and with feature-and-display The next columns show the percentage cross-brand
support 5% to 63%. Thus, the amount of variation effect from Equation (3), the percentage cross-period
in the price discount offered is substantial for each effect from Equation (4), and the percentage category-
support condition. This is critical for the reliable esti- expansion effect from Equation (5). The top panel
mation of nonparametric, price-discount-dependent shows average results across the four product cate-
demand curves and decomposition effects, separately gories. Because all effects are relative to store-specific
for the four price index variables. A detailed exami- average category sales, the parameter estimates are
nation of Table 3 suggests that both the frequency and also comparable across product categories. All aver-
range of discounts are poor for the tissue brands and age parameter estimates have the expected signs.
one peanut butter brand. The parameter estimates represent the effect of the
price index on the criterion variable (relative to aver-
age category sales in a store).10 To illustrate, a 20%
5. Results deal means that the price index variable goes from 1
We obtain results through a series of model estima-
tion and validation steps. We present results for the 9
The average OLS-based autocorrelation coefficient (across brands
“standard decomposition” from Equation (6), and, of all categories) is 0.23 for the OBS equation, 0.18 for CBS, and 0.90
where appropriate, the “extended decomposition” for PPCS. The large value for PPCS is due to the moving window
from Equations (7) and (8) in §5.1. We validate the summations. The SUR-GLS estimation procedure accounts for these
time window choice in §5.2. In §5.3, we present autocorrelations.
RESET test results and, given the rejection of constant 10
Differences in promotional intensity between stores affect the CSi
effects, we show flexible decomposition results based variable used for obtaining proportional effects across stores (see
Footnote 6). To contemplate the magnitude of this issue we assume
on local polynomial regression.
a 20% discount and the presence of display and feature. The aver-
age effect on own-brand sales is 1.82 times average category sales,
5.1. Standard Decomposition of which 35% represents category expansion (Table 4). Consider an
We obtain a separate decomposition of own-brand extreme case in which store 1 always promotes one of the brands in
sales effects for each of the four different support a category, and store 2 never promotes any brand. Even in that case,
van Heerde, Leeflang, and Wittink: Decomposing the Sales Promotion Bump with Store Data
328 Marketing Science 23(3), pp. 317–334, © 2004 INFORMS
Price index
Without support 044a 35% 44% 20% — —
With feature-only 124 32% 39% 29% — —
With display-only 116 38% 15% 44% — —
With feature and display 182 28% 36% 36% — —
Average 116 33% 32% 35% — —
Percentage significant 98% 70% 33% 46% — —
(two sided, p < 005)
Tuna; R2 = 090
Price index
Without support 046 003b 37% 9% 57% 25% 6% 21% 0% 55% 6% 71%
With feature-only 103 005 30% 7% 58% 14% 12% 11% 9% 46% 3% 51%
With display-only 098 004 36% 6% 10% 23% 54% 20% 43% 35% 11% 53%
With feature and display 174 005 29% 4% 36% 8% 35% 6% 31% 25% 4% 28%
Average 105 31% 38% 31% 25% 6%
Tissue; R2 = 096
Price index
Without support 036 006 17% 61% 73% 122% 9% 90% — —
With feature-only 124 009 18% 9% 37% 33% 45% 29% — —
With display-only 107 004 29% 14% 14% 37% 57% 31% — —
With feature and display 153 003 19% 5% 41% 15% 41% 12% — —
Average 105 21% 35% 43% — —
Shampoo; R2 = 090
Price index
Without support 046 003 24% 12% 26% 31% 50% 25% — —
With feature-only 114 005 30% 6% 58% 21% 13% 17% — —
With display-only 134 004 44% 6% 30% 20% 19% 16 — —
With feature and display 189 004 24% 4% 25% 12% 51% 11 — —
Average 121 31% 34% 33% — —
to 0.80. The parameter estimate of 0.46 for the own- own-brand effect. This differs strongly from the elas-
brand effect of an unsupported price cut for tuna ticity decomposition results in Gupta (1988), Chiang
(second panel of Table 4) implies that a 20% price cut (1991), Chintagunta (1993), Bucklin et al. (1998), Bell
leads to an estimated increase in unit sales of 0
20 ∗ et al. (1999), and Pauwels et al. (2002). However, our
0
46 ∗ 100% = 9
2% of average weekly category sales result is consistent with van Heerde et al. (2003), who
in a store. This own-brand effect is decomposed into show that an elasticity component of 3/4 translates to
37% cross-brand, 57% cross-period, and 6% category- a net cross-brand effect of about 1/3.
Downloaded from informs.org by [132.239.1.231] on 28 August 2015, at 04:26 . For personal use only, all rights reserved.
5.2. Validation of the Time Window allow for flexible estimation and decomposition of the
The cross-period and category-expansion effects are own-brand sales effect so that the decomposition per-
based on a time window of t − 6 t + 6 surround- centages may depend on the magnitude of a price
ing week t. To determine whether this time window discount.
is adequate, we also used t − 8 t + 8. This entails We estimate the effects, separately for each support
three changes in the model: (a) the criterion variables type, and flexibly by local polynomial regression.13
for the cross-period and category-expansion effects The amount of flexibility in local polynomial regres-
Downloaded from informs.org by [132.239.1.231] on 28 August 2015, at 04:26 . For personal use only, all rights reserved.
are obtained by summing across this larger time win- sion is determined by the bandwidth parameter,
dow; (b) the predictor variable set includes lead and which reflects a trade-off between bias and variance.
lagged variables from the window t − 16, t + 16; With a very large bandwidth parameter, local
and (c) the number of observations available for polynomial regression is equivalent to estimating a
estimation is reduced due to the additional lead and regular global polynomial regression model (high
lagged variables. We report the results in the last bias, low variance). If the bandwidth parameter is
two panels of Table 4. Except for tissue, the aver- small, there can be much variation in the response
age parameter estimates for the own-brand effects estimate and the decomposition (low bias, high vari-
and the three decomposition percentages are not ance). To obtain an acceptable trade-off, we apply
very sensitive to this alternative time window. For the following set of rules for the bandwidth choice.
three product categories the correlations across the First, we want a bandwidth that is small, to mini-
brand-specific parameter estimates are high, ranging mize the bias, but large enough to avoid inconceiv-
from 0.62 to 0.99. Thus, the time window choice of able patterns such as effects with implausible signs
t − 6, t + 6 seems appropriate for tuna, shampoo, or nonmonotonic patterns. Second, we want one com-
and peanut butter. mon bandwidth across all criterion variables, across
For tissue the comparison is much less satisfac- all instruments, across all brands, and across all cat-
tory. The average decomposition effects are dissimilar egories. This ensures that the sum of the component
across the two window specifications, and the correla- effects always equals the effect on own-brand sales,
tions between the parameter estimates, except for the as is the case for the constant decomposition. The use
own-brand effects, are low as well. The tissue data of a single bandwidth value also allows the results
suffer from a small number of observations relative across instruments and categories to be maximally
to the number of parameters, in part because there comparable. We choose h = 0
3 after considering other
are just 24 stores and 52 weeks. Most critically, for values such as 0.2 and 0.4. We note that h = 0
3 leads
the time window t − 6 t + 6, the effective number of to curves that are less flexible than we could have
observations per store is 28, and this reduces to 20 for justified, with a smaller bandwidth value, for the
the time window t − 8 t + 8.12 own- and cross-brand effects (see also van Heerde
et al. 2001). However, a lower value provides an unac-
5.3. Flexible Decomposition ceptable number of nonmonotonic relationships, espe-
To determine the suitability of nonparametric analy- cially for cross-period effects. Thus, we ignore some
ses, we use the RESET test for the assumption of con- potentially relevant nonlinearities to maintain consis-
stant decomposition effects (Stewart 1991, pp. 71–72). tency in the results relevant to the decomposition, and
It is a general procedure designed to detect an inap- to maintain comparability across product categories.
propriate functional form. To maximize the compa- We report the average own-brand effects and
rability across categories we focus on the standard decomposition percentages across categories for
decomposition effects from Equation (1). We compute selected price discount values in Table 5. We observe
RESET test-based p-values for each brand and sepa- similar phenomena across the four support types
rately for each of the Equations (2)–(4). The results (except for display-only): the percentage attributable
show that all but one p-value is below 0.01 (the to cross-brand effects tends to decrease for higher dis-
exception being the cross-period component for one counts, as does the percentage attributable to cross-
of the shampoo brands). Thus, it is meaningful to period effects (consistent with Gupta and Cooper
1992). By implication the percentage attributable to
12
We also analyzed what happens with shorter time windows. We category expansion tends to increase with higher
started with the window t − 0 t + 0 (i.e., no dynamic effects), next discounts. This dependence is especially strong
t − 1 t + 1, up to t − 6 t + 6, but in all cases using the same set of for feature-only supported price discounts: category
predictor variables and the same data. We observe that both the size expansion accounts for 21% at a 5% discount but
of the own-brand effect and the percentage cross-brand effects are
very stable. The percentage cross-period effect increases initially, as
13
one would expect, but starts to stabilize at t − 5 t + 5. The per- Van Heerde et al. (2001) find that nonparametric regression for
centage category-expansion effect decreases initially and also starts a sales response model predicts better in hold-out samples than
to stabilize at t − 5 t + 5. parametric regression.
van Heerde, Leeflang, and Wittink: Decomposing the Sales Promotion Bump with Store Data
Marketing Science 23(3), pp. 317–334, © 2004 INFORMS 331
Table 5 Decomposition of Flexible Price Effects: Averages across substantively relevant decomposition effects. On aver-
Brandsa age across four categories, we find that if a price pro-
Standard decomposition moted brand gains 100 units, the net loss for other
brands is 33 units, so the secondary demand effect is
Price Own-brand Percent Percent Percent about 1/3. This is very consistent with the net cross-
discount sales cross-brand cross-period category-expansion
(%) effect effect (%) effect (%) effect (%)
brand sales effect derived from the elasticity decom-
position for household data (van Heerde et al. 2003).
Downloaded from informs.org by [132.239.1.231] on 28 August 2015, at 04:26 . For personal use only, all rights reserved.
We find average decomposition percentages that are these could evoke severe competitive reactions after
similar to those obtained when household-level elas- some time that nullify the initial benefit (Leeflang and
ticity results are transformed into net sales effects. Wittink 1996). To further enhance this research, one
Second, we find statistical support for the dependence could extend the models to include competitive reac-
of effect sizes on support types, based on pooling test tions as a function of the decomposition.
results. Third, all average parameter estimates have Our flexible decomposition results might under-
the expected signs. Fourth, we validate our choice of state the true nonlinearity in effects. To maintain
Downloaded from informs.org by [132.239.1.231] on 28 August 2015, at 04:26 . For personal use only, all rights reserved.
the time window by estimating models for a wider consistency in the estimates across the criterion vari-
time window. We obtain similar results, except for tis- ables, we forced strong smoothness in the estimated
sue where the sample size is small. Fifth, the RESET curves based on requirements for the “weakest” cri-
test results indicate that the assumption of constant terion variable. With more data we can accommodate
decomposition effects is untenable. Thus, it is attrac- greater flexibility in all estimated effects. In general,
tive and appropriate to use the flexible approach the decomposition models require extensive datasets,
of local polynomial regression so as to obtain more both due to the large number of control variables and
refined insights. the loss of observations due to the lead and lag covari-
What are the managerial implications? Our results ates. The relatively poor results for tissue suggest that
can be used to infer net sales effects of price promo- one should have at least two years of weekly data for
tions. From a retailer perspective, a 100-unit increase at least 25 stores, leading to a minimal sample size of
for the promoted brand causes on average a net 2,600 observations =2 ∗ 52 ∗ 25. A high promotional
33-unit decrease for other brands in the category and frequency such as for tuna seems to lead to more reli-
a net 32-unit decrease in pre- and post-promotion able results. Multicollinearity is not a problem, due to
category sales (Table 4). This leaves a potentially ben- the mutually exclusive definitions of the price index
eficial 35-unit increase for the retailer due to category- variables.
expansion effects. For the manufacturer, cross-period A further limitation is that our approach is appli-
effects are also not beneficial unless those effects min- cable for the decomposition of sales bumps in stable
imize sales opportunities for other brands. There- environments. Our model is not intended to suggest
fore, a conservative estimate of the net effect for the what will happen if promotional intensities change.
manufacturer is the sum of cross-brand and category- The method needs to be extended for evolving
expansion effects: 68 units on average. Because cross- environments with persistent effects (Dekimpe and
brand effects may cause competitive reactions, the Hanssens 1999), which may result from free sample
net benefit for manufacturers may also represent cat- promotions for new products (Bawa and Shoemaker
2004) or from promotions directed to new customers
egory expansion, to the extent that this component
(Anderson and Simester 2004). For mature categories,
excludes cross-store effects. Overall, from a category
persistent effects of promotions seem to be rare (Nijs
management perspective jointly pursued by manu-
et al. 2001, Pauwels et al. 2002).
facturers and retailers, category-expansion effects are
In our approach, accelerated switches are counted
potentially the most desirable among the three stan-
as cross-period effects. Sun et al. (2003) show that
dard decomposition effects.
in household data it depends strongly on the model
Table 4 shows that on average the largest category-
specification how these switches are categorized. To
expansion effects occur for price cuts supported
shed further light on this issue, it would be of inter-
with feature and display (35% of 1
82 = 0
66), fol-
est to extend our model to separate cross-period
lowed by display-only supported discounts (0.52),
effects into within-brand and between-brand compo-
feature-only-supported discounts (0.36), and finally,
nents. This should be of interest to a brand man-
unsupported price discounts (0.09). Thus, although
ager whereas a retailer may be indifferent between
feature-only support has a higher own-brand effect these two types of stockpiling. In our model we could
than display-only support (1.24 versus 1.16), the split the criterion variable for cross-period effects
category-expansion effect favors display-only. This PPCSijt into (1) own-brand sales before and after
is because features generate much stronger cross- period t, and (2) cross-brand sales before and after
period effects. Such insights require a decomposition period t. Similarly, the category-expansion effect is
approach. We note that a limitation is that we cannot further decomposable with respect to cross-category
generate profit implications because we lack data on effects. For example, it would be useful to determine
the costs of features and displays, and we do not have to what extent sales increases for a tuna brand may be
access to profit margins of the products. attributable to sales decreases in other food categories.
The net benefit of promotions is also affected by
competitive reactions, which is currently not included Acknowledgments
in our model. For example, for manufacturers strong The authors thank ACNielsen (USA and The Netherlands)
cross-brand effects are beneficial in the short run, but for providing the data, and the editor, the area editor, the
van Heerde, Leeflang, and Wittink: Decomposing the Sales Promotion Bump with Store Data
Marketing Science 23(3), pp. 317–334, © 2004 INFORMS 333
reviewers, and seminar participants at Carnegie Mellon Now use this estimator to compute residuals, which are
University, University of Chicago, Dartmouth College, used in Step 1 again, etc., until convergence. This iterated
Duke University, Erasmus University (Rotterdam), Harvard SUR-GLS estimation procedure gives the maximum likeli-
Business School, INSEAD, University of Leuven, London hood estimates (Greene 2000).
Business School, NYU’s Stern School of Business, Univer- We next postulate that each criterion variable is a non-
sity of Texas, Tilburg University, UCLA, the Yale School of parametric function of the price discount level, while we
Management, and the Wharton School for comments. Pro- control for constant effects of other covariates. We use
fessor van Heerde thanks the Netherlands Organization for the transformed predictor and criterion variables from the
Downloaded from informs.org by [132.239.1.231] on 28 August 2015, at 04:26 . For personal use only, all rights reserved.
Scientific Research for research support. final iteration of the GLS-SUR estimation to account for
contemporaneous correlation and autocorrelation. Define
Appendix ˆ in (A.1) as X ∗ X ∗ −1 X ∗ y ∗ , where y ∗ and X ∗ are ỹ
The system of Equations (2)–(5) can be summarized as and X pre-multiplied with the Cholesky decomposition
yjkit = Xjkit jk + "jkit , where j is the index for brand of the inverse of the contemporaneous covariance matrix.
j = 1
J , k is the index for the decomposition compo- This choice ensures that if we smooth the nonparamet-
nent (k = 1 for yj1it = OBSijt , k = 2 for yj2it = CBSijt , k = 3 ric function to a large extent, we obtain exactly the
for yj3it = PPCSijt , and k = 4 for yj4it = TCSijt ), i is the GLS-SUR estimates. We can now write the semipara-
index for store i = 1
I, and t is the index for week metric version of the linear regression model as yjkit ∗
=
t = 1
Tmax . The error is distributed as ∗ ∗
mk lj PIijlt + Xjkitl jkl + "jkit , where Xjkitl is the vector of
all predictors excluding PIijlt . For price index level PIijlt = P0 ,
"jkit = #jk "jkit−1 + ujkit k = 1
K autocorrelation we postulate a locally weighted polynomial regression (Fan
Covujkit uj lit = %jkj l contemporaneous correlation
and Gijbels):
I T
2
We delete one equation (e.g., for yj4it = TCSijt ) because it max
∗
d
∗
yjkit − k ljq PIijlt − P0 q − Xjkitl jkl
is redundant. With iterative GLS, it does not matter which i=1 t=1 q=0
equation we drop (Greene 2000). We estimate the model in
three consecutive steps (Greene 2000): · Kh Pijlt − P0
Step 1. Use OLS to obtain ˆ ols −1
jk = Xjk Xjk Xjk yjk . Com-
pute ejk = yjk − Xjk ˆ ols
jk and where Kh · denotes a Kernel function and h is a band-
I Tmax width. We use ˆ jkl from the GLS-SUR regression and next
i=1 t=2 ejkit ejkit−1 minimize this equation by using weighted least squares,
#̂jk = I Tmax 2
i=1 t=1 ejkit and thus obtain the set )ˆ k ljq q = 0
d*. The esti-
mate for mk lj PIijlt at PIijlt = P0 equals ˆ k lj0 . Kernel K·
(Greene 2000). determines the weight of neighboring observations. We
Step 2. Apply the Praise-Winsten transformation to the 2 use
the quartic kernel: Kh u = 15/16 1 − u/h2 I u/h ≤ 1
ˆ =
Assunçao, Joao, Robert J. Meyer. 1993. The rational effect of price
promotions on sales and consumption. Management Sci. 39(5)
X
% J 311 X X
% J 3J 3 X 517–535.
J 3 11 J3 J3
J Bawa, Kapil, Robert Shoemaker. 2004. The effects of free sample
3
11j l X ỹj l
j =1 l=1 % 11 promotions on incremental brand sales. Marketing Sci. 23(3)
345–363.
·
(A.1) Bell, David R., Jeongweng Chiang, V. Padmanabhan. 1999. The
J 3
ỹj l decomposition of promotional response: An empirical general-
j =1 l=1 % J 3j l XJ3 ization. Marketing Sci. 18(4) 504–526.
van Heerde, Leeflang, and Wittink: Decomposing the Sales Promotion Bump with Store Data
334 Marketing Science 23(3), pp. 317–334, © 2004 INFORMS
Bell, David R., Ganesh Iyer, V. Padmanabhan. 2002. Price competi- Krishna, Aradhna. 1994. The impact of dealing patterns on pur-
tion under stockpiling and flexible consumption. J. Marketing chase behavior. Marketing Sci. 13(4) 351–373.
Res. 39(August) 292–303. Kumar, V., Robert P. Leone. 1988. Measuring the effect of retail store
Blattberg, Robert C., Kenneth J. Wisniewski. 1989. Price-induced promotion on brand and store substitution. J. Marketing Res.
patterns of competition. Marketing Sci. 8(4) 291–309. 25(May) 178–185.
Bucklin, Randolph E., Sunil Gupta. 1999. Commercial use of UPC Leeflang, Peter S. H., Dick R. Wittink. 1996. Competitive reaction
scanner data: Industry and academic perspectives. Marketing versus consumer response: Do managers overreact? Internat. J.
Sci. 18(3) 247–273. Res. Marketing 13 103–120.
Downloaded from informs.org by [132.239.1.231] on 28 August 2015, at 04:26 . For personal use only, all rights reserved.
Bucklin, Randolph E., James M. Lattin. 1992. A model of product Lemon, Katherine N., Stephen M. Nowlis. 2002. Developing syner-
category competition among grocery retailers. J. Retailing 68(3) gies between promotions and brands in different price-quality
271–293. tiers. J. Marketing Res. 39(May) 171–185.
Bucklin, Randolph E., Sunil Gupta, S. Siddarth. 1998. Determin- Macé, Sandrine, Scott A. Neslin. 2004. The determinants of pre- and
ing segmentation in sales response across consumer purchase post-promotion dips in sales of frequently purchased goods.
behaviors. J. Marketing Res. 35(May) 189–197. J. Marketing Res. 41(August). Forthcoming.
Chandon, Pierre, Brian Wansink. 2002. When are stockpiled prod- Neslin, Scott A. 2002. Sales Promotion. Relevant Knowledge Series,
ucts consumed faster? A convenience-salience framework of Marketing Science Institute, Cambridge, MA, XI, 28, 62–63.
postpurchase consumption incidence and quantity. J. Marketing Neslin, Scott A., Linda G. Schneider Stone. 1996. Consumer inven-
Res. 39(August) 321–335. tory sensitivity and the postpromotion dip. Marketing Lett. 7(1)
Chiang, Jeongwen. 1991. A simultaneous approach to the whether, 77–94.
what and how much to buy questions. Marketing Sci. 10(4) Neslin, Scott A., Caroline Henderson, John Quelch. 1985. Consumer
297–315. promotions and the acceleration of product purchases.
Chintagunta, Pradeep K. 1993. Investigating purchase incidence, Marketing Sci. 4(2) 147–165.
brand choice, and purchase quantity decisions of households. Nijs, Vincent, Marnik G. Dekimpe, Jan Benedict, E. M. Steenkamp,
Marketing Sci. 12(2) 184–208. Dominique M. Hanssens. 2001. The category demand effects of
Christen, Markus, Sachin Gupta, John C. Porter, Richard Staelin, price promotions. Marketing Sci. 20(1) 1–22.
Dick R. Wittink. 1997. Using market-level data to understand Pauwels, Koen, Dominique M. Hanssens, S. Siddarth. 2002.
the promotion effects in a nonlinear model. J. Marketing Res. The long-term effects of price promotions on category inci-
34(August) 322–334. dence, brand choice, and purchase quantity. J. Marketing Res.
Dekimpe, Marnik G., Dominique M. Hanssens. 1999. Sustained 39(November) 421–439.
spending and persistent response: A new look at long-term mar- Sethuraman, Raj. 1996. A model of how discounting high-priced
keting profitability. J. Marketing Res. 36(November) 397–412. brands affects the sales of low-price brands. J. Marketing Res.
Doyle, P., J. Saunders. 1985. The lead effect of marketing decisions. 33(November) 399–409.
J. Marketing Res. 22(February) 54–65. Sethuraman, Raj, V. Srinivasan, Kim Doyle. 1999. Asymmetric and
Drèze, Xavier, David R. Bell. 2003. Creating win-win trade promo- neighborhood cross-price effects: Some empirical generaliza-
tions: Theory and empirical analysis of scan-back trade deals. tions. Marketing Sci. 18(1) 23–41.
Marketing Sci. 22(1) 16–39.
Srinivasan, Shuba, Koen Pauwels, Dominique M. Hanssens, Marnik
Fader, Peter S., Bruce G. S. Hardie. 1996. Modeling consumer choice G. Dekimpe. 2004. Do promotions benefit manufacturers,
among SKUs. J. Marketing Res. 33(November) 442–452. retailers, or both? Management Sci. 50(5) 617–629.
Fan, Jianqing. 1992. Design-adaptive nonparametric regression. Stewart, Jon. 1991. Econometrics. Philip Allan, Hemel Hemstead,
J. Amer. Statist. Assoc. 87(420) 998–1004. 71–72.
Fan, Jianqing, Irène Gijbels. 1996. Local Polynomial Modeling and Its Rao, Ram C., Ramesh V. Arjunji, B. P. S. Murthi. 1995. Game theory
Applications. Chapman & Hall, Suffolk, U.K., 274. and empirical generalizations and managerial uses. Marketing
Gönül, Füsün, Kannan Srinivasan. 1996. Estimating the impact Sci. 14(3, part 2 of 2) G89–G108.
of consumer expectations of coupons on purchase behavior: Sun, Baohong, Scott Neslin, Kannan Srinivasan. 2003. Measur-
A dynamic structural model. Marketing Sci. 15(3) 262–279. ing the impact of promotions on brand switching when con-
Greene, William H. 2000. Econometric Analysis, 4th ed. Prentice Hall, sumers are forward-looking. J. Marketing Res. 40(November)
Upper Saddle River, NJ, 546, 604, 615, 617–618, 634–635, 638. 389–405.
Gupta, Sachin, Pradeep K. Chintagunta, Anil Kaul, Dick R. Wittink. van Heerde, Harald J., Sachin Gupta, Dick R. Wittink. 2003. Is 75%
1996. Do household scanner data provide representative infer- of the sales promotion bump due to brand switching? No, only
ences from brand choices: A comparison with store data. 33% is. J. Marketing Res. 40(November) 481–491.
J. Marketing Res. 33(November) 383–398. van Heerde, Harald J., Peter S. H. Leeflang, Dick R. Wittink. 2000.
Gupta, Sunil. 1988. Impact of sales promotion on when, what and The estimation of pre- and postpromotion dips with store-level
how much to buy. J. Marketing Res. 25(November) 342–355. scanner data. J. Marketing Res. 37(August) 383–395.
Gupta, Sunil, Lee G. Cooper. 1992. The discounting of discounts Van Heerde, Harald J., Peter S. H. Leeflang, Dick R. Wittink.
and promotion thresholds. J. Consumer Res. 19 401–411. 2001. Semiparametric analysis to estimate the deal effect curve.
Judge, George G., W. E. Griffiths, R. Carter Hill, Helmuth J. Marketing Res. 38(May) 197–215.
Lütkepohl, Tsoung Chao Lee. 1985. The Theory and Practice of Walters, Rockney G. 1991. Assessing the impact of retail price
Econometrics. John Wiley and Sons, Singapore, 32. promotions on product substitution, complementary pur-
Kalwani, Manohar U., Chi Kin Yim, Heikke J. Rinne, Yoshi Sugita. chase, and interstore sales displacement. J. Marketing 55(April)
1990. A price expectation model of customer brand choice. 17–28.
J. Marketing Res. 27(August) 251–262. Wittink, Dick R., Michael J. Addona, William J. Hawkes, John C.
Kalyanam, Kirthi, Daniel S. Putler. 1997. Incorporating demo- Porter. 1988. The estimation, validation, and use of promo-
graphic variables in brand choice models: An indivisible alter- tional effects based on scanner data. Working paper, Cornell
natives framework. Marketing Sci. 16(2) 166–181. University, Ithaca, NY.
Keng, Kau Ah, A. S. C. Ehrenberg. 1984. Patterns of store choice. J. Zhang, Jie, Lakshman Krishnamurthi. 2004. Customizing promo-
Marketing Res. 21(November) 399–409. tions in online stores. Marketing Sci. Forthcoming.