This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

An Impulse–Regime Switching Game Model
of Vertical Competition

René Aïd  Luciano Campi  Liangchen Li  Mike Ludkovski
Abstract

We study a new kind of non-zero-sum stochastic differential game with mixed impulse/switching controls, motivated by strategic competition in commodity markets. A representative upstream firm produces a commodity that is used by a representative downstream firm to produce a final consumption good. Both firms can influence the price of the commodity. By shutting down or increasing generation capacities, the upstream firm influences the price with impulses. By switching (or not) to a substitute, the downstream firm influences the drift of the commodity price process. We study the resulting impulse–regime switching game between the two firms, focusing on explicit threshold-type equilibria. Remarkably, this class of games naturally gives rise to multiple Nash equilibria, which we obtain via a verification based approach. We exhibit three types of equilibria depending on the ultimate number of switches by the downstream firm (zero, one or an infinite number of switches). We illustrate the diversification effect provided by vertical integration in the specific case of the crude oil market. Our analysis shows that the diversification gains strongly depend on the pass-through from the crude price to the gasoline price.

1 Introduction

Since Hotelling’s (1931) [18] seminal study of commodity prices, considerable efforts have been undertaken to understand the dynamics of the equilibrium price of commodities and in particular, its long–run properties. The cyclical nature of price dynamics is driven by the substitution effect, whereby consumers will switch to a different commodity if prices rise too high. In a deterministic setting the switching time to the substitute is simple to analyze, but with the stochastic economic cycle consumers face a huge challenge in determining when is the appropriate moment to switch. The succession of booms and busts of commodity prices complicates the switching timing. In the long–run, production capacities adapt to demand and make the price oscillate around a long–term equilibrium. Indeed, the long–run behaviour of commodity prices exhibits super–cycle patterns. The econometric studies in Leon and Soto (1997) [23], Erten and Ocampo (2012) [12], Jacks (2013) [19] and more recently Stuemer (2018) [30], all find the presence of super–cycles of several decades in the price of commodities. This phenomenon makes one wonder whether it is even necessary for the consumers to ever switch and whether it is not preferable to just wait for the prices to crash again.

In this paper we design a dynamic model of competition between production and consumption of a commodity used as an intermediate good, allowing to draw conclusions on the long–run dynamics of the commodity price. In our model, two factors drive the price of the commodity: on the one hand, short–term but persistent shocks of demand and/or production, and on the other hand, strategic decisions of the (representative) upstream production firm and of the (representative) downstream consumer firm. The upstream producer extracts the commodity at cost cpc_{p} and sells it for a price XX. The downstream industry buys the commodity and converts it into a final good that has a price PP, non–decreasing in XX. This framework covers a wide range of industries. One might think for example, of the agricultural sector where soy enters as an input for the food industry to produce a large range of consumer goods. In the aluminum industry, upstream smelters produce aluminum to be used by the automotive and transportation industries. In the oil industry, the crude is extracted by production firms, then transformed into gasoline and kerosene by downstream refineries, and then consumed in the retail market. For the sake of simplicity, we identify the downstream firm that transforms the commodity with the final consumer and this downstream firm’s profit with the consumer’s surplus.

We focus on the role of the commodity price XX that intrinsically creates competition between the representative agents of producers and consumers. In a nutshell, producers prefer high price XX, while consumers prefer low price XX. This competition is dynamic and manifests itself through strategic price effects actuated by the two industries. Therefore, XX is (partially) jointly controlled by the producers/consumers, leading to game–theoretic impacts.

On the upstream production side, the producer needs the commodity price XX to be high enough to make a profit margin. We suppose that the dynamics of investment and disinvestment in upstream production is driven by production capacity shocks that cause jumps in the price XX. This assumption is consistent with the theory of real options that predicts the existence of threshold prices triggering the decision of entry and the exit from the market (see MacDonald and Siegel (1986) [25] and Dixit and Pindyck (1994) [11]). It is also consistent with the observations of quick swings in investment and disinvestement in production, see e.g. the boom and bust of commodity prices in 2008–10.

On the downstream consumer side, consumers induce a long–term effect on the commodity price only if they switch to a substitute, and they switch to a substitute only if they anticipate that XX will remain high enough for a long time. The downstream side faces slower dynamics because it involves the transformation of many local installations using the commodity. To have an example in mind, one may think of the thousands of adjustments required to change heating systems in buildings, or of the slow effect of the energy saving programs launched by OECD after the 1970s oil shock. Thus, in our model, the downstream market for the final good can be in contraction or expansion regime. The contraction regime corresponds to a decreasing demand for the primary commodity, i.e. the market is abandoning the use of the commodity for a substitute, while the expansion mode corresponds to an increasing demand for the commodity. Depending on the state of the downstream retail market, the drift of the commodity price takes either a constant positive value in the expansion mode or a negative value in the contraction mode. Because such consumer shifts are slow and expensive, the state is persistent (i.e. piecewise constant in time) and changing the state of the final good market incurs heavy switching costs. This toggling of the price trend can be interpreted as endogenous regime-switching, a common way of modeling commodity prices through the business cycle. Beyond the impact of producers and consumers decisions, the commodity price is subject to exogenous short–term stochastic shocks, captured through a Brownian motion driving risk factor.

Our aim is to construct and characterize the dynamic equilibrium in the commodity market due to this vertical competition. Our major contribution is to provide an endogenous, game-theoretic basis for two key stylized features of commodity markets: (i) super–cycles that manifest as long–term mean-reversion; (ii) fundamental impact of supply and demand that maintains the price in a range of values rather than a single equilibrium value. Furthermore, our model allows for three types of equilibria depending on the number of demand switches undertaken by the consumer at equilibrium: zero, one, or an infinite number of switches. All equilibria exhibit the latter qualitative properties. Besides, the higher the consumer’s switching cost, the more she is compelled to endure an unfavorable range of prices.

Along the way, we also make mathematical contributions to the literature on non-zero-sum stochastic games (see Martyr and Moriarty (2017) [24], Atard (2018) [4], De Angelis et al. (2018) [13], Aïd et al. (2020) [1]). To our knowledge ours is the first paper that: (i) considers a mixed impulse-control/switching-control stochastic game; (ii) explicitly constructs impulse-switching threshold-type equilibria in non-zero-sum games; provides new verification theorems regarding best-response strategies for (iii) an impulsing agent in a regime-switching setting and (iv) switching agent with an impulsed state process. While our solution is non exhaustive in the sense that we a priori focus on a special class of equilibria (leaving open the question of existence of other equilibrium families), it is highly tractable. Namely, we are able to provide closed-form description of the dynamic equilibrium, offering precise quantitative insights regarding the producer and consumer roles and their equilibrium behavior.

To emphasize the latter point, beyond several synthetic examples that illustrate and visualize our model features, we also present a detailed case-study of the diversification effect provided by vertical integration in the crude oil market circa 2019, viewed as a competition between crude oil producers and oil refiners that convert crude into gasoline and other consumer goods. Indeed, the industrial organization of upstream and downstream segments is an important concern both for the anti-trust regulators and for the firms themselves. It is reflected in the extensive economic literature on the subject and in its persistent presence within the political debate (see Lafontaine and Slade (2007) [20] for a review of the topic). From a firm’s perspective, vertical integration brings multiple virtues, including the potential to reduce the long–term exposure to commodity price fluctuations. See for example Helfat and Teece (1987) [16] for an empirical estimation of the hedge procured by vertical integration in the oil business. In our case study, we consider the generic type of equilibrium and a small downstream firm asking herself whether she has an interest in getting more vertically integrated. We show that the gains from integration are directly linked to the pass-through parameter that links the crude oil price to the retail gasoline price. The higher this pass-through, the higher production activity dominates the retail activity both in terms of expected rate of profit and the standard deviation of rate of profit.

The rest of the paper is organized as follows. Section 2 sets up the competitive producer-consumer commodity market. Section 3 then constructs the respective threshold-type impulse-switching equilibria by considering the producer and consumer best-response strategies. Section 4 illustrates and discusses the different types of emergent equilibria using toy examples. Section 5 presents the above vertical integration case study and Section 6 concludes. All the proofs, as well as additional comparative statics, are delegated to Section 7.

2 The model

2.1 Description

We use (Xt)(X_{t}) to denote the (pre-equilibrium) commodity price, modeled as a continuous-time stochastic process. The two players are denoted as pproducer and cconsumer. In what follows sub-index pp (resp. cc) in the notation will always refer to the producer (resp. consumer). The market involves the original raw commodity that is being produced and the goods market (e.g. gasoline). The producer extracts the commodity at cost cpc_{p} and sells it for price xx. The consumer buys it for price xx, converts it into a final good, and sells it for price PP.

Profit rates:

The price xx of the commodity influences the volume of trade, captured by the demand function Dp(x)D_{p}(x). A similar phenomenon plays out in the final-good market: the goods price PP leads to sales volume Dc(P)D_{c}(P). Since the consumer is in effect the intermediary between the commodity and the goods market, she will pass some of her input price shocks to the output price PP(x)P\equiv P(x).

We ignore the players’ fixed costs because they can be considered to be integrated in the investment costs, and concentrate on the variable costs and revenues that are driven by the respective input/output prices. Based on the above discussion, the instantaneous profit rate of the producer is

πp(x):=(xcp)Dp(x).\displaystyle\pi_{p}(x):=(x-c_{p})D_{p}(x). (1)

Let ccc_{c} be the processing/conversion cost from input commodity to final good and α\alpha be the respective conversion factor, so that one unit of commodity becomes α\alpha units of the final good (e.g. barrels of crude oil, converted into barrels of gasoline). Then the instantaneous profit rate of the consumer is

πc(x):=Dc(P)PDc(P)α(x+cc).\displaystyle\pi_{c}(x):=D_{c}(P)P-\frac{D_{c}(P)}{\alpha}(x+c_{c}). (2)

We note that while the consumer has market power, he is not the only user of the commodity (e.g. crude oil is also used by the petrochemical industry), so there is no direct link between production volume and consumption volume. Thus, while there is a physical link between the consumer input volume Dc(P)/αD_{c}(P)/\alpha and her output volume Dc(P)D_{c}(P), there is no direct link between Dc(P)/αD_{c}(P)/\alpha and aggregate commodity demand Dp(x)D_{p}(x).

We shall consider linear inverse demand

Dp(x)=d0d1x.D_{p}(x)=d_{0}-d_{1}x.

If we further assume that P(x)=p0+p1xP(x)=p_{0}+p_{1}x (the price of the final good is linearly proportional to the commodity price), and Dc(P)=d0d1PD_{c}(P)=d^{\prime}_{0}-d^{\prime}_{1}P (final good demand is linearly decreasing in its price PP), the profit rate of the consumer becomes:

πc(x)\displaystyle\pi_{c}(x) =Dc(P(x))(P(x)(x+cc)α)\displaystyle=D_{c}(P(x))\cdot\left(P(x)-\frac{(x+c_{c})}{\alpha}\right)
=(d0d1p0)(p0ccα)+((d0d1p0)(p11α)d1p1(p0ccα))x+d1p1(1αp1)x2\displaystyle=\left(d^{\prime}_{0}-d^{\prime}_{1}p_{0}\right)\left(p_{0}-\frac{c_{c}}{\alpha}\right)+\left(\left(d^{\prime}_{0}-d^{\prime}_{1}p_{0}\right)\left(p_{1}-\frac{1}{\alpha}\right)-d^{\prime}_{1}p_{1}\left(p_{0}-\frac{c_{c}}{\alpha}\right)\right)x+d^{\prime}_{1}p_{1}\left(\frac{1}{\alpha}-p_{1}\right)x^{2}
=:γ0+γ1x+γ2x2.\displaystyle=:\gamma_{0}+\gamma_{1}x+\gamma_{2}x^{2}. (3)

The consumer profit is concave in the commodity price xx, γ2<0\gamma_{2}<0, if and only if the pass–through coefficient p1p_{1} is higher than the conversion factor 1/α1/\alpha. It means that the final good price increases faster than the need of the downstream industry to produce one more good, which is a sound economic condition for having a sustainable downstream industry. To sum up, the profit rates of both producer and consumer are concave and quadratic in xx.

Market conditions:

We model the commodity price process (Xt)(X_{t}) as a controlled Itô diffusion of the form

dXt=μtdt+σdWtdNt.\displaystyle dX_{t}=\mu_{t}dt+\sigma dW_{t}-dN_{t}. (4)

The Brownian motion (Wt)(W_{t}) captures exogenous price shocks due to random demand or production fluctuations, or pertinent economic shocks for the industry. In this sense, the model is agnostic in the reasons why the commodity price fluctuates around its mean trend. The point process Nt:=i1ξi𝟏{τit}N_{t}:=\sum_{i\geq 1}\xi_{i}\mathbf{1}_{\{\tau_{i}\leq t\}} captures the producer interventions at times (τi)i1(\tau_{i})_{i\geq 1} and impulses (ξi)i1(\xi_{i})_{i\geq 1}. A positive impulse is triggered by an investment phase, and has a negative impact on the price. A negative impulse is induced by a disinvestment phase and has a positive impact on the price.

The drift process (μt)(\mu_{t}) represents the state of the retail market for the final good. It is either in expansion or in contraction state. When in expansion, demand is growing faster than the available production capacity, hence prices tend to rise: μt=μ+>0\mu_{t}=\mu_{+}>0. When in contraction, the demand is shrinking faster than the production capacity, thus the price tends to decrease, and thus μt=μ<0\mu_{t}=\mu_{-}<0. This modeling corresponds to an imperfect adjustment of the market as in a sticky price model in macroeconomics. The drift is fully controlled by the consumer,

μt=μ+i=0𝟏{σ2it<σ2i+1}+μi=1𝟏{σ2i1t<σ2i},t0,\mu_{t}=\mu_{+}\sum_{i=0}^{\infty}\mathbf{1}_{\{\sigma_{2i}\leq t<\sigma_{2i+1}\}}+\mu_{-}\sum_{i=1}^{\infty}\mathbf{1}_{\{\sigma_{2i-1}\leq t<\sigma_{2i}\}},\qquad t\geq 0,

where σi\sigma_{i} is the ii-th switching instance taken by the consumer in the case μ0=μ+\mu_{0-}=\mu_{+} (with the convention σ0=0\sigma_{0}=0, so that σ1\sigma_{1} is the first switching time) and analogously when μ0=μ\mu_{0-}=\mu_{-} by interchanging odd and even switching times. Thus, both players influence (Xt)(X_{t}), although their actions are of distinct types, namely impulse control (Nt)(N_{t}) by the producer and switching-drift control (μt)(\mu_{t}) by the consumer. The resulting controlled price dynamics are denoted as X(μ,N)X^{(\mu,N)}.

The quadratic nature of the upstream and downstream profit rate functions πp()\pi_{p}(\cdot) and πc()\pi_{c}(\cdot) implies that each player has their own natural habitat given by the intervals (xc1,xc2)(x_{c}^{1},x_{c}^{2}) and (xp1,xp2)(x_{p}^{1},x_{p}^{2}) for commodity price levels with:

xp1\displaystyle x^{1}_{p} :=min{cp,d0d1},\displaystyle:=\min\Big{\{}c_{p},\frac{d_{0}}{d_{1}}\Big{\}},\qquad xp2:=max{cp,d0d1},\displaystyle x^{2}_{p}:=\max\Big{\{}c_{p},\frac{d_{0}}{d_{1}}\Big{\}}, (5)
xc1\displaystyle x^{1}_{c} :=min{p0cc/α1/αp1,d0d1p0p1d1},\displaystyle:=\min\Big{\{}\frac{p_{0}-c_{c}/\alpha}{1/\alpha-p_{1}},\frac{d^{\prime}_{0}-d^{\prime}_{1}p_{0}}{p_{1}d^{\prime}_{1}}\Big{\}},\qquad xc2:=max{p0cc/α1/αp1,d0d1p0p1d1}.\displaystyle x^{2}_{c}:=\max\Big{\{}\frac{p_{0}-c_{c}/\alpha}{1/\alpha-p_{1}},\frac{d^{\prime}_{0}-d^{\prime}_{1}p_{0}}{p_{1}d^{\prime}_{1}}\Big{\}}. (6)

Players make a positive profit only if the price stays in the interval (xi1,xi2)(x^{1}_{i},x^{2}_{i}), i{c,p}i\in\{c,p\}. The concavity of the profit functions implies that players have preferred commodity levels X¯p,X¯c\bar{X}_{p},\bar{X}_{c} that maximize their profit rates, namely:

X¯p:=d0+cpd12d1,X¯c:=γ12γ2.\displaystyle\bar{X}_{p}:=\frac{d_{0}+c_{p}d_{1}}{2d_{1}},\qquad\bar{X}_{c}:=-\frac{\gamma_{1}}{2\gamma_{2}}. (7)

Typically, we expect that X¯c<X¯p\bar{X}_{c}<\bar{X}_{p}, so that the preferred commodity price of the consumer is lower than that of the producer. The stochastic fluctuations coming from (Wt)(W_{t}) can generate three different market conditions:

Xt<X¯c\displaystyle X_{t}<\bar{X}_{c} I: abnormally low prices;\displaystyle\quad\text{I: abnormally low prices};
X¯cXtX¯p\displaystyle\bar{X}_{c}\leq X_{t}\leq\bar{X}_{p} II: vertical competition;\displaystyle\quad\text{II: vertical competition};
X¯p<Xt\displaystyle\bar{X}_{p}<X_{t} III: abnormally high prices.\displaystyle\quad\text{III: abnormally high prices}.

In the first and last cases, both players have the same preferences to raise or decrease XtX_{t}; in the intermediate case, they compete against each other. Because both players can in principle push (Xt)(X_{t}) in either direction, the market organization is influenced by their relative gain of doing so, as well as their action costs. In cases I and III, the players are in waiting mode because of the second–mover advantage, hoping that the other will act first which allows the second to benefit from the price effect without paying the cost of (dis-)investment or of switching. In case II they are in preemption mode, with the player who moves first being able to increase her profits at the expense of the other. These dynamic shifts between waiting and preemption is an important feature of vertical competition.

Objective functions and admissible strategies:

The objective functionals of the players consist of integrated profit rates π(x)\pi_{\cdot}(x), discounted at constant rate β>0\beta>0 and subtracting the control costs that are paid at respective intervention epochs. We take the investment cost function of the producer to be some convex function Kp:K_{p}:\mathbb{R}\to\mathbb{R}, and of the consumer as H:{μ,μ+}+H:\{\mu_{-},\mu_{+}\}\to\mathbb{R}_{+}. We denote the latter as H(μ)=h,H(μ+)=h+H(\mu_{-})=h_{-},H(\mu_{+})=h_{+}. Depending on the initial drift μ0\mu_{0} being positive/negative the producer’s objective function is given by:

Jp±(x;N,μ):=𝔼[0eβt(Xtcp)Dp(Xt)𝑑tieβτiKp(ξi)|μ0=μ±,X0=x],\displaystyle J_{p}^{\pm}(x;N,\mu):=\mathbb{E}\Big{[}\int_{0}^{\infty}e^{-\beta\,t}\big{(}X_{t}-c_{p})D_{p}(X_{t})dt-\sum_{i}e^{-\beta\,\tau_{i}}K_{p}(\xi_{i})\Big{|}\,\mu_{0}=\mu^{\pm},X_{0}=x\Big{]}, (8)

and, similarly, the representative consumer’s objective function is:

Jc±(x;N,μ):=𝔼[0eβt(γ0+γ1Xt+γ2Xt2)𝑑tjeβσjH(μσj)|μ0=μ±,X0=x].\displaystyle J_{c}^{\pm}(x;N,\mu):=\mathbb{E}\Big{[}\int_{0}^{\infty}e^{-\beta\,t}\big{(}\gamma_{0}+\gamma_{1}X_{t}+\gamma_{2}X_{t}^{2})dt-\sum_{j}e^{-\beta\,\sigma_{j}}H(\mu_{\sigma_{j}})\Big{|}\,\mu_{0}=\mu^{\pm},X_{0}=x\Big{]}. (9)

In order for the state variable dynamics and players’ expected payoffs to be well-defined we give the following definition of admissible strategies. To this end, let (Ω,,(t)t0,)(\Omega,\mathcal{F},(\mathcal{F}_{t})_{t\geq 0},\mathbb{P}) be a probability space with a filtration satisfying the usual conditions and supporting an (t)t0(\mathcal{F}_{t})_{t\geq 0}-Brownian motion (Wt)(W_{t}).

Definition 1 (Admissible strategies).

We say that (τi,ξi)i1(\tau_{i},\xi_{i})_{i\geq 1} is an admissible strategy for the producer if the following properties hold:

  1. 1.

    (τi)i1(\tau_{i})_{i\geq 1} is a sequence of [0,][0,\infty]-valued stopping times such that 0τ1<τ2<0\leq\tau_{1}<\tau_{2}<\cdots and limiτi=\lim_{i\to\infty}\tau_{i}=\infty a.s., with the convention that τi=\tau_{i}=\infty for some i1i\geq 1 implies τk=\tau_{k}=\infty for all kik\geq i;

  2. 2.

    (ξi)i1(\xi_{i})_{i\geq 1} is a sequence of real-valued τi\mathcal{F}_{\tau_{i}}-measurable random variables;

  3. 3.

    the sequence (τi,ξi)i1(\tau_{i},\xi_{i})_{i\geq 1} satisfies i1eβτiξiL2()\sum_{i\geq 1}e^{-\beta\tau_{i}}\xi_{i}\in L^{2}(\mathbb{P}).

Similarly, we say that the sequence (σj)j1(\sigma_{j})_{j\geq 1} is an admissible strategy for the consumer if

  1. 4.

    each σj\sigma_{j} is a [0,][0,\infty]-valued stopping time, 0σ1<σ2<0\leq\sigma_{1}<\sigma_{2}<\cdots, with the convention that σj=\sigma_{j}=\infty for some j1j\geq 1 implies σk=\sigma_{k}=\infty for all kjk\geq j;

  2. 5.

    j1eβσjL2()\sum_{j\geq 1}e^{-\beta\sigma_{j}}\in L^{2}(\mathbb{P}).

The set of all producer’s (resp. consumer’s) admissible strategies is denoted by 𝒜p\mathcal{A}_{p} (resp. 𝒜c\mathcal{A}_{c}).

Remark 1.

Observe that the property 1 above implies that the producer intervention times do not accumulate in finite time, so that for all t>0t>0 the process Nt=i1ξi𝟏{τit}N_{t}=\sum_{i\geq 1}\xi_{i}\mathbf{1}_{\{\tau_{i}\leq t\}}, t0t\geq 0, is well-defined, adapted and finite-valued. Moreover, the integrability condition in 5 gives that σj\sigma_{j}\to\infty (as jj\to\infty), i.e. the switching times of the consumer do not accumulate in finite time either, so that the dynamics of the controlled state variable (4) is well-defined too. Regarding the expected profits of the players, they are both finite due to integrability properties in 3 and 5 above.

Remark 2.

According to the definition of admissibility above, neither player can intervene more than once at a time. However, simultaneous interventions coming from both of them are not excluded. As discussed, the dynamics of the intervention in upstream production is much faster than the switching of the consumption regime for final good. Thus, in case both players try to act simultaneously, we assume that the producer has priority. This avoids unnecessary technicalities and allows for a consistent modeling of the vertical competition.

2.2 Equilibrium

Using this notion of admissible strategies, we give the definition of Nash equilibrium.

Definition 2 (Nash equilibrium).

A Nash equilibrium is any pair ((ξi,τi)i1,(σj)j1)𝒜p×𝒜c((\xi_{i},\tau_{i})_{i\geq 1},(\sigma_{j})_{j\geq 1})\in\mathcal{A}_{p}\times\mathcal{A}_{c} satisfying the following property:

Jp±(x;N,μ)Jp±(x;N,μ),Jc±(x;N,μ)Jc±(x;N,μ),x,J_{p}^{\pm}(x;N^{\prime},\mu)\leq J_{p}^{\pm}(x;N,\mu),\qquad J_{c}^{\pm}(x;N,\mu^{\prime})\leq J_{c}^{\pm}(x;N,\mu),\qquad\forall x\in\mathbb{R},

for any other pair of strategies ((ξi,τi)i1,(σj)j1)𝒜p×𝒜c((\xi_{i}^{\prime},\tau^{\prime}_{i})_{i\geq 1},(\sigma^{\prime}_{j})_{j\geq 1})\in\mathcal{A}_{p}\times\mathcal{A}_{c}, where in the payoffs Jr(x;)J^{-}_{r}(x;\cdot), r{c,p}r\in\{c,p\}, above we have Nt=i1ξi𝟏{τit}N^{\prime}_{t}=\sum_{i\geq 1}\xi^{\prime}_{i}\mathbf{1}_{\{\tau^{\prime}_{i}\leq t\}} and μt=μ+i=01{σ2it<σ2i+1}+μi=1𝟏{σ2i1t<σ2i}\mu^{\prime}_{t}=\mu_{+}\sum_{i=0}^{\infty}1_{\{\sigma^{\prime}_{2i}\leq t<\sigma^{\prime}_{2i+1}\}}+\mu_{-}\sum_{i=1}^{\infty}\mathbf{1}_{\{\sigma^{\prime}_{2i-1}\leq t<\sigma^{\prime}_{2i}\}} for t0t\geq 0, μ0=μ+\mu^{\prime}_{0-}=\mu_{+} and the convention σ0=0\sigma^{\prime}_{0}=0 (analogously in the other case μ0=μ\mu^{\prime}_{0-}=\mu_{-} by interchanging odd and even switching times).

In line with the envisioned Markovian structure and in order to maximize tractability, we concentrate on a specific class of dynamic equilibria. Namely, we aim to construct threshold-type Feedback Nash Equilibria which are of the form

τ0=0,τi=inf{t>τi1:XtΓp(t)},i1,ξi=δ(Xτi,μτi),\displaystyle\tau_{0}=0,\quad\tau_{i}=\inf\{t>\tau_{i-1}:X_{t}\in\Gamma_{p}(t-)\},\quad i\geq 1,\qquad\xi_{i}=\delta(X_{\tau_{i}},\mu_{\tau_{i}-}), (10)

and

σ0=0,σj=inf{t>σj1:XtΓc(t)},j1,\displaystyle\sigma_{0}=0,\quad\sigma_{j}=\inf\{t>\sigma_{j-1}:X_{t}\in\Gamma_{c}(t)\},\quad j\geq 1, (11)

where

Γr(t)=Γr+𝟏{μt=μ+}+Γr𝟏{μt=μ},r{c,p},\Gamma_{r}(t)=\Gamma^{+}_{r}\mathbf{1}_{\{\mu_{t}=\mu_{+}\}}+\Gamma^{-}_{r}\mathbf{1}_{\{\mu_{t}=\mu_{-}\}},\quad r\in\{c,p\},

for some measurable function δ:\delta:\mathbb{R}\to\mathbb{R} and some suitable Borel sets Γp±,Γc±\Gamma_{p}^{\pm},\Gamma_{c}^{\pm}\subset\mathbb{R}. Thus, (10)-(11) imply that players act based solely on the current price (Xt)(X_{t}) and demand regime (μt)(\mu_{t}), ruling out history-dependent strategies, and moreover the strategies are characterized through fixed action regions Γp±,Γc±\Gamma_{p}^{\pm},\Gamma_{c}^{\pm} and impulse maps δ()\delta(\cdot). We will denote by τk\tau^{\prime}_{k} the aggregated intervention times coming jointly from the two players. The fact that in (10) producer’s intervention times τi\tau_{i} are defined via Γp(t)\Gamma_{p}(t-) translates the assumption that in case of simultaneous interventions, the producer plays first and so her thresholds naturally depend on the drift μt\mu_{t-} just before her and consumer’s actions (compare to Remark 2).

The action regions Γp±,Γc±\Gamma_{p}^{\pm},\Gamma_{c}^{\pm} are expected to be as follows. The impulse intervention region of the upstream production Γp±=(x±,xh±)\Gamma^{\pm}_{p}=(x^{\pm}_{\ell},x^{\pm}_{h}) is two-sided: the producer will act whenever XtX_{t} reaches xh±x^{\pm}_{h} from below or drops to x±x^{\pm}_{\ell} from above. Note that these thresholds x±,xh±x_{\ell}^{\pm},x_{h}^{\pm} are μ\mu-dependent. On the consumption side, when μt=μ+\mu_{t}=\mu_{+} (expansion regime), the consumer will switch to μ\mu_{-} if XtX_{t} gets too high: Γc+=(yh,)\Gamma^{+}_{c}=(y_{h},\infty). Similarly when μt=μ\mu_{t}=\mu_{-} (contraction regime), she will switch to μ+\mu_{+} if XtX_{t} gets too low Γc=(,y)\Gamma^{-}_{c}=(-\infty,y_{\ell}). Finally, when the producer intervenes, he will bring XtX_{t} to her impulse level xr±x^{\pm\ast}_{r} so that the impulse amount is ξr±=xr±xr±\xi^{\pm}_{r}=x^{\pm}_{r}-x^{\pm\ast}_{r}. The natural ordering we expect is the producer impulses towards X¯p\bar{X}_{p}

x±<x±andxh±<xh±,\displaystyle x^{\pm}_{\ell}<x^{\pm\ast}_{\ell}\quad\textrm{and}\quad x^{\pm\ast}_{h}<x^{\pm}_{h}, (12)

and the consumer switches towards X¯c\bar{X}_{c},

y<X¯c<yh,\displaystyle y_{\ell}<\bar{X}_{c}<y_{h}, (13)

so that when acting both players try to move XX towards their preferred levels. However, the precise ordering between the impulse thresholds xr±x^{\pm}_{r} and the switching thresholds yy’s is not clear a priori and will emerge as part of the overall equilibrium construction.

2.3 Illustration of Competitive Dynamics

To further understand the market evolution under competition of the producer and consumer, we focus on the case where both players are active. The producer’s strategy is summarized via a 2×42\times 4 matrix 𝒞p\mathcal{C}_{p} which lists the thresholds x±,xh±x^{\pm}_{\ell},x^{\pm}_{h} and the target levels x±,xh±x^{\pm\ast}_{\ell},x^{\pm\ast}_{h}. Thus, the no-intervention regions are [x±,xh±][x^{\pm}_{\ell},x^{\pm}_{h}] and impulse amounts are xh±xh±,x±x±x^{\pm}_{h}-x^{\pm\ast}_{h},x^{\pm\ast}_{\ell}-x^{\pm}_{\ell}:

𝒞p=[x+,x+,xh+,xh+x,x,xh,xh].\displaystyle\mathcal{C}_{p}=\begin{bmatrix}x^{+}_{\ell},&x^{+\ast}_{\ell},&x^{+\ast}_{h},&x^{+}_{h}\\ x^{-}_{\ell},&x^{-\ast}_{\ell},&x^{-\ast}_{h},&x^{-}_{h}\\ \end{bmatrix}. (14)

The consumer has two switching thresholds y,yhy_{\ell},y_{h}; in a typical setup we expect them to satisfy the following ordering

x<y<yh<xh+.\displaystyle x^{-}_{\ell}<y_{\ell}<y_{h}<x_{h}^{+}. (15)

Note that in the expansion regime (drift μ+\mu_{+}), we assume that yh<xh+y_{h}<x^{+}_{h}. Therefore, coming from below, XtX^{\ast}_{t} hits yhy_{h} first, causing the consumer to switch into the contraction regime with drift μ\mu_{-}. As a result, the impulse threshold xh+x^{+}_{h} is not effective, i.e. it will never get triggered along an equilibrium path of (Xt)(X_{t}). Similar argument implies that xx^{-}_{\ell} is not effective either if x<yx^{-}_{\ell}<y_{\ell}. In the left panel of Fig. 1 we illustrate such threshold-based vertical competition among the two players.

Refer to caption
Refer to caption
Figure 1: Left panel: Dynamic competition between producer and consumer. The blue arrows represent drift-switching controls exercised by the consumer at levels yy_{\ell} and yhy_{h}, while the red curved arrows represent impulse controls exercised by the producer at levels x+,xhx_{\ell}^{+},x^{-}_{h} that instantaneously push XtX_{t} to x+x_{\ell}^{+\ast} and xhx^{-\ast}_{h} respectively. Right: A sample path of the controlled commodity price (Xt)(X^{\ast}_{t}) under competitive equilibrium. Observe that Xt[1,2]X^{\ast}_{t}\in[1,2] for all tt.

To illustrate competitive dynamics, the right panel of Fig. 1 shows a sample trajectory of (Xt)(X^{\ast}_{t}) (the superscript emphasizing the fact that we are now looking at equilibrium) with producer and consumer strategies

𝒞p=[1.0,1.3,1.7,2.01.0,1.3,1.7,2.0],(y,yh)=(1.2,1.8).\mathcal{C}_{p}=\begin{bmatrix}1.0,&1.3,&1.7,&2.0\\ 1.0,&1.3,&1.7,&2.0\end{bmatrix},\quad\quad(y_{\ell},y_{h})=(1.2,1.8).

According to the above discussion, the effective thresholds are (x+,yh)(x^{+}_{\ell},y_{h}) when μt=μ+\mu_{t}=\mu_{+}, or (y,xh)(y_{\ell},x^{-}_{h}) when μt=μ\mu_{t}=\mu_{-}. In other words, in the expansion regime, (Xt)(X_{t}) will be between [1.0,1.8][1.0,1.8] and in the contraction regime it will be between [1.2,2.0][1.2,2.0]. In Fig. 1 (Right), we start in the contraction regime with X0=1.5X_{0}=1.5 and μ0=μ\mu_{0}=\mu_{-}. On this trajectory, (Xt)(X^{\ast}_{t}) moves down until it touches the consumer’s threshold yy_{\ell}, where the consumer switches to a positive drift to draw the price up. Nevertheless, the price keeps decreasing and hits x+=1.0x^{+}_{\ell}=1.0, whereby the producer intervenes and pushes it to x+=1.3x^{+\ast}_{\ell}=1.3. Prices then continue to rise up to yh=1.8y_{h}=1.8 at which point the consumer switches again and starts pushing them back down (supposedly she wishes to keep them somewhere around 1.5). This cyclic behavior continues ad infinitum, yielding a stationary distribution for the pair (Xt,μt)(X^{\ast}_{t},\mu^{\ast}_{t}). Note that the consumer uses her switching control to keep XtX^{\ast}_{t} from going too high or too low, essentially cycling between yy_{\ell} and yhy_{h}. Indeed, starting at Xt=yX^{\ast}_{t}=y_{\ell}, the consumer switches to expansion which causes prices to trend up; once they hit yhy_{h} the consumer switches to contraction, causing prices to trend down. As a result, μt\mu_{t} alternates between μ+,μ\mu_{+},\mu_{-} generating a mean-reverting behavior. Throughout, the producer acts as a “back-up”, explicitly forcing prices from becoming extreme (namely from falling in the expansion regime, or rising in the contraction regime). These additional interventions by the producer make the domain of (Xt)(X^{\ast}_{t}) bounded.

It is also possible that, say, xh+<yhx^{+}_{h}<y_{h} so that in the expansion regime the producer will act first both when (Xt)(X^{\ast}_{t}) falls (impulse threshold x+x^{+}_{\ell}) and when (Xt)(X^{\ast}_{t}) rises (xh+x^{+}_{h}), making the consumer inactive. In that case it is plain to see that the drift μtμ+\mu_{t}\equiv\mu_{+} will stay positive forever; (Xt)(X_{t}) will be forced to a bounded domain but will not have mean-reverting dynamics since the drift is constant. Instead, it will experience repeated impulses downward to counteract the upward trend due to ongoing consumption growth.

3 Best–response functions

To obtain a threshold-type Feedback Nash Equilibrium we view it as a fixed point of the producer and consumer best–response maps. Therefore, our overall strategy is to (i) characterize threshold–type switching strategies for the consumer given a pre-specified, threshold–type behavior by the producer; (ii) characterize threshold–type impulse strategies for the producer who faces a pre-specified regime–switching behavior of (Xt)(X_{t}); (iii) employ tâtonnement, i.e. iteratively apply the best–response controls alternating between the two players to construct an interior, non-preemptive equilibrium satisfying the ordering (15).

To analyze best–response strategies, we utilize stochastic control theory, rephrasing the related dynamic optimization objectives through variational inequalities (VI) for the jump–diffusion dynamics (4). The competitor thresholds then act as boundary conditions in the VIs. To establish the desired equilibrium we need to verify that the best response is also of threshold-type and solves the expected systems of equations. We note that all three pieces above are new and we have not been able to find precise analogues of the needed verification theorems in the extant literature. Nevertheless, they do build upon similar single–agent control formulations, so the overall technique is conceptually clear.

3.1 Consumer Best–Response

Fixing impulse thresholds xr±x^{\pm}_{r} (r=h,lr=h,l), the consumer faces a two–state switching control problem on the bounded domain (x±,xh±)(x^{\pm}_{\ell},x^{\pm}_{h}). Namely, given a producer’s impulse strategy (τi,ξi)i1(\tau_{i},\xi_{i})_{i\geq 1} with τ=inf{t:Xt[x±,xh±]}\tau=\inf\{t:X_{t}\notin[x^{\pm}_{\ell},x^{\pm}_{h}]\}, we expect the following stochastic representation for her value functions w±(x)w^{\pm}(x) with x[x±,xh±]x\in[x^{\pm}_{\ell},x^{\pm}_{h}]

w±(x)\displaystyle w^{\pm}(x) =supσ𝒯𝔼x,±[0τσeβtπc(Xt)𝑑t+eβτ¯𝟙{τ<σ}(w±(Xτξ))+eβτ¯𝟙{τ>σ}(w(Xσ)h±)],\displaystyle=\sup_{\sigma\in\mathcal{T}}\mathbb{E}_{x,\pm}\Bigg{[}\int_{0}^{{\tau}\wedge\sigma}e^{-\beta t}\pi_{c}(X_{t})dt+e^{-\beta\underline{\tau}}\mathds{1}_{\{\tau<\sigma\}}\Big{(}w^{\pm}(X_{\tau}-\xi)\Big{)}+e^{-\beta\underline{\tau}}\mathds{1}_{\{\tau>\sigma\}}\Big{(}w^{\mp}(X_{\sigma})-h_{\pm}\Big{)}\Bigg{]}, (16)

where 𝔼x,±\mathbb{E}_{x,\pm} denotes expectation with respect to μt{μ,μ+}\mu_{t}\in\{\mu_{-},\mu_{+}\} and h±h_{\pm} are the fixed intervention costs of the consumer. The above is a system of two coupled equations, which locally resembles an optimal stopping problem with running payoff πc()\pi_{c}(\cdot), reward w()w^{\mp}(\cdot) (last term), and stop–loss payoff (middle term) w()w^{\mp}(\cdot) due to the producer impulse at τ\tau. This is almost the formulation as considered in [3] except with two modifications:

  • The domain is bounded on both sides (previously there was a one–sided stop–loss region).

  • The boundary condition w+(x)=w+(x+)w^{+}(x_{\ell})=w^{+}(x^{+\ast}_{\ell}) is autonomous but nonlocal. Therefore, the two stopping–type VIs for the consumer are coupled only through the free boundaries, not through the stop–loss thresholds as in [3].

Now, given a producer strategy 𝒞p\mathcal{C}_{p}, if the consumer’s response is such that y<xy_{\ell}<x^{-}_{\ell} and xh+<yhx^{+}_{h}<y_{h}, the consumer will be stuck forever in the initial regime because the price touches xx^{-}_{\ell} before yy_{\ell} in the contraction regime and xh+x^{+}_{h} before yhy_{h} in the expansion regime. In this case, the price will oscillate between xx^{-}_{\ell} and xhx^{-}_{h} if the initial market is in the contraction regime, and between x+x^{+}_{\ell} and xh+x^{+}_{h} in the expansion regime.

In the case where the consumer’s response satisfies y<xy_{\ell}<x^{-}_{\ell} and yh<xh+y_{h}<x^{+}_{h}, depending on the initial state, the consumer will switch once to the expansion regime or will be stuck in the initial expansion regime. If the initial regime is μ+\mu_{+}, the price will touch yhy_{h}, the regime will switch to contraction, the price will never touch yy_{\ell} and will oscillate between xx^{-}_{\ell} and xhx^{-}_{h}. If the initial state is already μ+\mu_{+}, no switch of regime will ever occur. The same reasoning applies for the symmetric case where x<yx^{-}_{\ell}<y_{\ell} and xh+<yhx^{+}_{h}<y_{h}.

Finally, if the consumer’s response satisfies x<yx^{-}_{\ell}<y_{\ell} and yh<xh+y_{h}<x^{+}_{h}, then whatever the initial regime, the state (μt)(\mu_{t}) will switch many times between the two regimes.

The best–response of the consumer consists in picking the best response amongst the three possible ones above. Thus, we distinguish three cases:

(a) No-Switch:

The consumer is completely inactive and simply collects her payoff based on the strategy (x,h±)(x^{\pm}_{\ell,h}).

(b) Single-Switch:

The consumer always prefers one regime to the other. Then she is inactive (like in case (a) above) in the preferred regime and faces an optimal stopping (since there is only a single switch to consider) problem in the other regime.

(c) Multiple-Switch:

The consumer switches back and forth between both regimes: the continuation region is (y,yh)(y_{\ell},y_{h}).

Proposition 1 provides the value function of the consumer in case (a). The system (24) characterizes the game payoff in case (b), and Proposition 2 provides the value function of the consumer in case (c).

3.1.1 No–switch

Regardless of the consumer strategy, in the continuation region, a direct application of the Feynman–Kac formula on (16) shows that her value function solves the following ordinary differential equation (ODE)

βw+μ±wx+12σ2wxx+πc(x)=0.-\beta w+\mu_{\pm}w_{x}+\frac{1}{2}\sigma^{2}w_{xx}+\pi_{c}(x)=0. (17)

Solving this inhomogeneous second-order ODE, we obtain w±(x)=ω^±(x)+u±(x)w^{\pm}(x)=\widehat{\omega}^{\pm}(x)+u^{\pm}(x), where letting θ2±<0<θ1±\theta_{2}^{\pm}<0<\theta_{1}^{\pm} be the two real roots of the quadratic equation β+μ±z+12σ2z2=0-\beta+\mu_{\pm}z+\frac{1}{2}\sigma^{2}z^{2}=0,

  • u±(x)=λ1±eθ1±x+λ2±eθ2±xu^{\pm}(x)=\lambda^{\pm}_{1}e^{\theta^{\pm}_{1}x}+\lambda^{\pm}_{2}e^{\theta^{\pm}_{2}x} solves the homogeneous ODE βu+μ±ux+12σ2uxx=0-\beta u+\mu_{\pm}u_{x}+\frac{1}{2}\sigma^{2}u_{xx}=0 and λi,0±\lambda^{\pm}_{i,0}, i=1,2i=1,2 are to be determined from appropriate boundary conditions;

  • ω^±(x)\widehat{\omega}^{\pm}(x) is a particular solution to (17), given by

    ω^±(x)=Ex2+F±x+G± where \widehat{\omega}^{\pm}(x)=Ex^{2}+F_{\pm}x+G_{\pm}\qquad\text{ where }
    E\displaystyle E =γ2β,F±=1β(γ1+2μ±γ2β),G±=1β(γ0+σ2γ2β+μ±F±).\displaystyle=\frac{\gamma_{2}}{\beta},\quad F_{\pm}=\frac{1}{\beta}\Big{(}\gamma_{1}+2\mu_{\pm}\frac{\gamma_{2}}{\beta}\Big{)},\quad G_{\pm}=\frac{1}{\beta}\big{(}\gamma_{0}+\sigma^{2}\frac{\gamma_{2}}{\beta}+\mu_{\pm}F_{\pm}\big{)}. (18)

When the consumer is inactive (denoted by w0±w^{\pm}_{0}), the continuation region is [x±,xh±][x_{\ell}^{\pm},x_{h}^{\pm}] with the boundary conditions at the impulse levels

w0±(xr±)=w0±(xr±),r{,h}.\displaystyle w^{\pm}_{0}(x^{\pm}_{r})=w^{\pm}_{0}(x^{\pm\ast}_{r}),\qquad r\in\{\ell,h\}. (19)

From (19) the respective coefficients λ1,0±,λ2,0±\lambda^{\pm}_{1,0},\lambda^{\pm}_{2,0} are solved from the following uncoupled linear system:

λ1,0±[eθ1±x±eθ1±x±]+λ2,0±[eθ2±x±eθ2±x±]=ω^±(x±)ω^±(x±),\displaystyle\lambda^{\pm}_{1,0}\cdot\big{[}e^{\theta^{\pm}_{1}x^{\pm}_{\ell}}-e^{\theta^{\pm}_{1}x^{\pm\ast}_{\ell}}\big{]}+\lambda^{\pm}_{2,0}\cdot\big{[}e^{\theta^{\pm}_{2}x^{\pm}_{\ell}}-e^{\theta^{\pm}_{2}x^{\pm\ast}_{\ell}}\big{]}=\widehat{\omega}^{\pm}(x^{\pm\ast}_{\ell})-\widehat{\omega}^{\pm}(x^{\pm}_{\ell}), (20)
λ1,0±[eθ1±xh±eθ1±xh±]+λ2,0±[eθ2±xh±eθ2±xh±]=ω^±(xh±)ω^±(xh±).\displaystyle\lambda^{\pm}_{1,0}\cdot\big{[}e^{\theta^{\pm}_{1}x^{\pm}_{h}}-e^{\theta^{\pm}_{1}x^{\pm\ast}_{h}}\big{]}+\lambda^{\pm}_{2,0}\cdot\big{[}e^{\theta^{\pm}_{2}x^{\pm}_{h}}-e^{\theta^{\pm}_{2}x^{\pm\ast}_{h}}\big{]}=\widehat{\omega}^{\pm}(x^{\pm\ast}_{h})-\widehat{\omega}^{\pm}(x^{\pm}_{h}). (21)

For x>xh±x>x^{\pm}_{h} we take w0±(x)=w0±(xh±)w^{\pm}_{0}(x)=w^{\pm}_{0}(x^{\pm*}_{h}) and similarly in the contraction regime, we take w0±(x)=w0±(x±)w^{\pm}_{0}(x)=w^{\pm}_{0}(x^{\pm*}_{\ell}) for x<x±x<x^{\pm}_{\ell}.

Proposition 1.

Let (λ1,0±,λ2,0±)4(\lambda^{\pm}_{1,0},\lambda^{\pm}_{2,0})\in\mathbb{R}^{4} be the solution to the system (20)-(21). Then the functions w0±(x)w^{\pm}_{0}(x), x[x±,xh±]x\in[x_{\ell}^{\pm},x_{h}^{\pm}], are the value functions for an inactive consumer, i.e. w0±(x)=Jc±(x;N,μ±)w_{0}^{\pm}(x)=J_{c}^{\pm}(x;N,\mu^{\pm}), where NN is the producer impulse strategy associated with the thresholds (x±,x±;xh±,xh±)(x_{\ell}^{\pm},x_{\ell}^{\pm\ast};x_{h}^{\pm},x_{h}^{\pm\ast}) with x±<xh±x_{\ell}^{\pm}<x_{h}^{\pm}.

The role of w0±()w^{\pm}_{0}(\cdot) is important for judging the other two cases, and moreover for deciding whether the best–response ought to be of threshold–type.

3.1.2 Single–switch

We next consider the situation where the payoff in the expansion regime is higher than the contraction one for any price xx, so that the consumer is never incentivized to switch to the contraction regime. We then expect the consumer’s corresponding best–response to be either a single–switch strategy (to the preferred regime) or no–switch (if already there). Economically, this corresponds to yh>xh+y_{h}>x_{h}^{+} so that as the price rises, the producer impulses (Xt)(X_{t}) down, and the consumer is not intervening to decrease her demand. As a result, the consumer never switches (except perhaps the first time from negative to positive drift) and limtμt=μ+\lim_{t\to\infty}\mu_{t}=\mu_{+}. This can be observed when demand switching is very expensive, so that the producer has full market power and is able to keep prices consistently low. The consumer is forced to be in the expansion regime forever and she is not able to influence (Xt)(X_{t}).

Suppose that the consumer prefers expansion regime (μt=μ+\mu_{t}=\mu_{+}) and adopts threshold-type strategies. Given 𝒞p\mathcal{C}_{p}, her strategy is summarized by

y>x,yh=+,\displaystyle y_{\ell}>x^{-}_{\ell},\qquad y_{h}=+\infty,

and the resulting contraction–regime value function ww^{-} should be a solution to the variational inequality

sup{βw+μwx+12σ2wxx+πc;w0+hw}=0,\displaystyle\sup\big{\{}-\beta w^{-}+\mu_{-}w^{-}_{x}+\frac{1}{2}\sigma^{2}w^{-}_{xx}+\pi_{c};\,w^{+}_{0}-h_{-}-w^{-}\big{\}}=0, (22)

where w0+w^{+}_{0} is from Proposition 1 and the continuation region is [y,xh][y_{\ell},x_{h}^{-}]. This is a standard optimal stopping problem. Note that while the above equation for ww^{-} depends on w0+w^{+}_{0}, the equation for w0+w^{+}_{0} is autonomous—the system of equations becomes decoupled because the two regimes of (μt)(\mu_{t}) no longer communicate.

To solve (22) we posit that her best–response is of the form

w(x)\displaystyle w^{-}(x) ={w0+(x)h,xy,ω^(x)+λ1eθ1x+λ2eθ2x,y<x<xh,w(xh),xhx,\displaystyle=\begin{cases}w^{+}_{0}(x)-h_{-},&x\leq y_{\ell},\\ \widehat{\omega}^{-}(x)+\lambda^{-}_{1}e^{\theta^{-}_{1}x}+\lambda^{-}_{2}e^{\theta^{-}_{2}x},&y_{\ell}<x<x^{-}_{h},\\ w^{-}(x^{-\ast}_{h}),&x^{-}_{h}\leq x,\end{cases} (23)

with the smooth pasting and boundary conditions:

{ω^(y)+λ1eθ1y+λ2eθ2y=ω^+(y)+λ1,0+eθ1+y+λ2,0+eθ2+yh,(𝒞0 at y)ω^(xh)+λ1eθ1xh+λ2eθ2xh=ω^(xh)+λ1eθ1xh+λ2eθ2xh,(𝒞0 at xh)ω^x(y)+λ1θ1eθ1y+λ2θ2eθ2y=ω^x+(y)+λ1,0+θ1+eθ1+y+λ2,0+θ2+eθ2+y.(𝒞1 at y)\displaystyle\begin{cases}\widehat{\omega}^{-}(y_{\ell})+\lambda^{-}_{1}e^{\theta^{-}_{1}y_{\ell}}+\lambda^{-}_{2}e^{\theta^{-}_{2}y_{\ell}}=\widehat{\omega}^{+}(y_{\ell})+\lambda^{+}_{1,0}e^{\theta^{+}_{1}y_{\ell}}+\lambda^{+}_{2,0}e^{\theta^{+}_{2}y_{\ell}}-h_{-},&(\mathcal{C}^{0}\text{ at }y_{\ell})\\ \widehat{\omega}^{-}(x^{-}_{h})+\lambda^{-}_{1}e^{\theta^{-}_{1}x^{-}_{h}}+\lambda^{-}_{2}e^{\theta^{-}_{2}x^{-}_{h}}=\widehat{\omega}^{-}(x^{-\ast}_{h})+\lambda^{-}_{1}e^{\theta^{-}_{1}x^{-\ast}_{h}}+\lambda^{-}_{2}e^{\theta^{-}_{2}x^{-\ast}_{h}},&(\mathcal{C}^{0}\text{ at }x^{-}_{h})\\ \widehat{\omega}^{-}_{x}(y_{\ell})+\lambda^{-}_{1}\theta^{-}_{1}e^{\theta^{-}_{1}y_{\ell}}+\lambda^{-}_{2}\theta^{-}_{2}e^{\theta^{-}_{2}y_{\ell}}=\widehat{\omega}^{+}_{x}(y_{\ell})+\lambda^{+}_{1,0}\theta^{+}_{1}e^{\theta^{+}_{1}y_{\ell}}+\lambda^{+}_{2,0}\theta^{+}_{2}e^{\theta^{+}_{2}y_{\ell}}.&(\mathcal{C}^{1}\text{ at }y_{\ell})\end{cases} (24)

The system (24) is to be solved for the three unknowns y,λ1,λ2y_{\ell},\lambda^{-}_{1},\lambda^{-}_{2}, while λ1,0+,λ2,0+\lambda^{+}_{1,0},\lambda^{+}_{2,0} are the coefficients of the consumer’s payoff associated to the no-switch strategy in the μ+\mu_{+} regime, see previous subsection. We can re-write it as first solving for λ1,2\lambda^{-}_{1,2} from the linear system

[eθ1yeθ2yeθ1xheθ1xheθ2xheθ2xh][λ1λ2]=[w0+(y)ω^(y)hω^(xh)ω^(xh)]\displaystyle\begin{bmatrix}e^{\theta^{-}_{1}y_{\ell}}&e^{\theta^{-}_{2}y_{\ell}}\\ e^{\theta^{-}_{1}x^{-}_{h}}-e^{\theta^{-}_{1}x^{-\ast}_{h}}&e^{\theta^{-}_{2}x^{-}_{h}}-e^{\theta^{-}_{2}x^{-\ast}_{h}}\end{bmatrix}\cdot\begin{bmatrix}\lambda^{-}_{1}\\ \lambda^{-}_{2}\end{bmatrix}=\begin{bmatrix}w^{+}_{0}(y_{\ell})-\widehat{\omega}^{-}(y_{\ell})-h_{-}\\ \widehat{\omega}^{-}(x^{-\ast}_{h})-\widehat{\omega}^{-}(x^{-}_{h})\end{bmatrix} (25)

and then determining yy_{\ell} from the smooth pasting 𝒞1{\mathcal{C}}^{1}-regularity

wx(y)=w0,x+(y).\displaystyle w^{-}_{x}(y_{\ell})=w^{+}_{0,x}(y_{\ell}). (26)

The case of a single–switch from expansion to contraction regime can be treated analogously in a symmetric way.

3.1.3 Double–switch

Finally, we consider the main case where the consumer adopts threshold–type switches, i.e. the ordering in (15) holds. Given 𝒞p\mathcal{C}_{p}, the w±w^{\pm} are then supposed to be a solution to the coupled variational inequalities

sup{βw++μ+wx++12σ2wxx++πc;max{wh+,w+}w+}\displaystyle\sup\big{\{}-\beta w^{+}+\mu_{+}w^{+}_{x}+\frac{1}{2}\sigma^{2}w^{+}_{xx}+\pi_{c};\,\max\{w^{-}-h_{+},w^{+}\}-w^{+}\big{\}} =0,\displaystyle=0, (27)
sup{βw+μwx+12σ2wxx+πc;max{w+h,w}w}\displaystyle\sup\big{\{}-\beta w^{-}+\mu_{-}w^{-}_{x}+\frac{1}{2}\sigma^{2}w^{-}_{xx}+\pi_{c};\,\max\{w^{+}-h_{-},w^{-}\}-w^{-}\big{\}} =0,\displaystyle=0, (28)

where we expect continuation regions of the form (x+,yh)(x_{\ell}^{+},y_{h}) and (y,xh)(y_{\ell},x_{h}^{-}). To set up a verification argument for the consumer’s best–response we make the ansatz

w+(x)\displaystyle w^{+}(x) ={w+(x+),xx+,ω^+(x)+λ1+eθ1+x+λ2+eθ2+x,x+<x<yh,w(x)h+,xyh,\displaystyle=\begin{cases}w^{+}(x^{+\ast}_{\ell}),&x\leq x^{+}_{\ell},\\ \widehat{\omega}^{+}(x)+\lambda^{+}_{1}e^{\theta^{+}_{1}x}+\lambda^{+}_{2}e^{\theta^{+}_{2}x},&x^{+}_{\ell}<x<y_{h},\\ w^{-}(x)-h_{+},&x\geq y_{h},\end{cases} (29a)
w(x)\displaystyle w^{-}(x) ={w+(x)h,xy,ω^(x)+λ1eθ1x+λ2eθ2x,y<x<xh,w(xh),xxh.\displaystyle=\begin{cases}w^{+}(x)-h_{-},&x\leq y_{\ell},\\ \widehat{\omega}^{-}(x)+\lambda^{-}_{1}e^{\theta^{-}_{1}x}+\lambda^{-}_{2}e^{\theta^{-}_{2}x},&y_{\ell}<x<x^{-}_{h},\\ w^{-}(x^{-\ast}_{h}),&x\geq x^{-}_{h}.\end{cases} (29b)

This yields 6 equations:

{ω^+(y)+λ1+eθ1+y+λ2+eθ2+yh=ω^(y)+λ1eθ1y+λ2eθ2y,(𝒞0 at y)ω^+(x+)+λ1+eθ1+x++λ2+eθ2+x+=ω^+(x+)+λ1+eθ1+x++λ2+eθ2+x+,(𝒞0 at x+)ω^(yh)+λ1eθ1yh+λ2eθ2yhh+=ω^+(yh)+λ1+eθ1+yh+λ2+eθ2+yh,(𝒞0 at yh)ω^(xh)+λ1eθ1xh+λ2eθ2xh=ω^(xh)+λ1eθ1xh+λ2eθ2xh,(𝒞0 at xh)ω^x+(y)+λ1+θ1+eθ1+y+λ2+θ2+eθ2+y=ω^x(y)+λ1θ1eθ1y+λ2θ2eθ2y,(𝒞1 at y)ω^x(yh)+λ1θ1eθ1yh+λ2θ2eθ2yh=ω^x+(yh)+λ1+θ1+eθ1+yh+λ2+θ2+eθ2+yh.(𝒞1 at yh)\displaystyle\begin{cases}\widehat{\omega}^{+}(y_{\ell})+\lambda^{+}_{1}e^{\theta^{+}_{1}y_{\ell}}+\lambda^{+}_{2}e^{\theta^{+}_{2}y_{\ell}}-h_{-}=\widehat{\omega}^{-}(y_{\ell})+\lambda^{-}_{1}e^{\theta^{-}_{1}y_{\ell}}+\lambda^{-}_{2}e^{\theta^{-}_{2}y_{\ell}},&(\mathcal{C}^{0}\text{ at }y_{\ell})\\ \widehat{\omega}^{+}(x^{+}_{\ell})+\lambda^{+}_{1}e^{\theta^{+}_{1}x^{+}_{\ell}}+\lambda^{+}_{2}e^{\theta^{+}_{2}x^{+}_{\ell}}=\widehat{\omega}^{+}(x^{+\ast}_{\ell})+\lambda^{+}_{1}e^{\theta^{+}_{1}x^{+\ast}_{\ell}}+\lambda^{+}_{2}e^{\theta^{+}_{2}x^{+\ast}_{\ell}},&(\mathcal{C}^{0}\text{ at }x^{+}_{\ell})\\ \widehat{\omega}^{-}(y_{h})+\lambda^{-}_{1}e^{\theta^{-}_{1}y_{h}}+\lambda^{-}_{2}e^{\theta^{-}_{2}y_{h}}-h_{+}=\widehat{\omega}^{+}(y_{h})+\lambda^{+}_{1}e^{\theta^{+}_{1}y_{h}}+\lambda^{+}_{2}e^{\theta^{+}_{2}y_{h}},&(\mathcal{C}^{0}\text{ at }y_{h})\\ \widehat{\omega}^{-}(x^{-}_{h})+\lambda^{-}_{1}e^{\theta^{-}_{1}x^{-}_{h}}+\lambda^{-}_{2}e^{\theta^{-}_{2}x^{-}_{h}}=\widehat{\omega}^{-}(x^{-\ast}_{h})+\lambda^{-}_{1}e^{\theta^{-}_{1}x^{-\ast}_{h}}+\lambda^{-}_{2}e^{\theta^{-}_{2}x^{-\ast}_{h}},&(\mathcal{C}^{0}\text{ at }x^{-}_{h})\\ \widehat{\omega}^{+}_{x}(y_{\ell})+\lambda^{+}_{1}\theta^{+}_{1}e^{\theta^{+}_{1}y_{\ell}}+\lambda^{+}_{2}\theta^{+}_{2}e^{\theta^{+}_{2}y_{\ell}}=\widehat{\omega}^{-}_{x}(y_{\ell})+\lambda^{-}_{1}\theta^{-}_{1}e^{\theta^{-}_{1}y_{\ell}}+\lambda^{-}_{2}\theta^{-}_{2}e^{\theta^{-}_{2}y_{\ell}},&(\mathcal{C}^{1}\text{ at }y_{\ell})\\ \widehat{\omega}^{-}_{x}(y_{h})+\lambda^{-}_{1}\theta^{-}_{1}e^{\theta^{-}_{1}y_{h}}+\lambda^{-}_{2}\theta^{-}_{2}e^{\theta^{-}_{2}y_{h}}=\widehat{\omega}^{+}_{x}(y_{h})+\lambda^{+}_{1}\theta^{+}_{1}e^{\theta^{+}_{1}y_{h}}+\lambda^{+}_{2}\theta^{+}_{2}e^{\theta^{+}_{2}y_{h}}.&(\mathcal{C}^{1}\text{ at }y_{h})\\ \end{cases} (30)

The six equations can be split into a linear system for the four coefficients λ1,2±\lambda^{\pm}_{1,2}’s

[eθ1+yeθ2+yeθ1yeθ2yeθ1+x+eθ1+x+eθ2+x+eθ2+x+00eθ1+yheθ2+yheθ1yheθ2yh00eθ1xheθ1xheθ2xheθ2xh][λ1+λ2+λ1λ2]=[ω^(y)ω^+(y)h+ω^+(x+)ω^+(x+)ω^+(yh)ω^(yh)hω^(xh)ω^(xh)]\displaystyle\begin{bmatrix}e^{\theta^{+}_{1}y_{\ell}}&e^{\theta^{+}_{2}y_{\ell}}&-e^{\theta^{-}_{1}y_{\ell}}&-e^{\theta^{-}_{2}y_{\ell}}\\ e^{\theta^{+}_{1}x^{+}_{\ell}}-e^{\theta^{+}_{1}x^{+\ast}_{\ell}}&e^{\theta^{+}_{2}x^{+}_{\ell}}-e^{\theta^{+}_{2}x^{+\ast}_{\ell}}&0&0\\ -e^{\theta^{+}_{1}y_{h}}&-e^{\theta^{+}_{2}y_{h}}&e^{\theta^{-}_{1}y_{h}}&e^{\theta^{-}_{2}y_{h}}\\ 0&0&e^{\theta^{-}_{1}x^{-}_{h}}-e^{\theta^{-}_{1}x^{-\ast}_{h}}&e^{\theta^{-}_{2}x^{-}_{h}}-e^{\theta^{-}_{2}x^{-\ast}_{h}}\end{bmatrix}\cdot\begin{bmatrix}\lambda^{+}_{1}\\ \lambda^{+}_{2}\\ \lambda^{-}_{1}\\ \lambda^{-}_{2}\end{bmatrix}=\begin{bmatrix}\widehat{\omega}^{-}(y_{\ell})-\widehat{\omega}^{+}(y_{\ell})-h_{+}\\ \widehat{\omega}^{+}(x^{+\ast}_{\ell})-\widehat{\omega}^{+}(x^{+}_{\ell})\\ \widehat{\omega}^{+}(y_{h})-\widehat{\omega}^{-}(y_{h})-h_{-}\\ \widehat{\omega}^{-}(x^{-\ast}_{h})-\widehat{\omega}^{-}(x^{-}_{h})\end{bmatrix} (31)

and the smooth-pasting conditions determining the two switching thresholds y,hy_{\ell,h} (viewed as free boundaries)

wx+(yr)=wx(yr),r{,h}.\displaystyle w^{+}_{x}(y_{r})=w^{-}_{x}(y_{r}),\qquad r\in\{\ell,h\}. (32)
Proposition 2.

Let the 66-tuple (λ1±,λ2±,yh,y)(\lambda^{\pm}_{1},\lambda^{\pm}_{2},y_{h},y_{\ell}) be a solution to the system (31)-(32) such that the order in (15) is fulfilled. Then, the functions defined in (29) give the best–response payoffs of consumer, and a best–response strategy is given by (σ^i)i1(\hat{\sigma}_{i})_{i\geq 1}, where

σ^0=0,σ^i=inf{t>σ^i1:XtΓc(t)},i1,\hat{\sigma}_{0}=0,\quad\hat{\sigma}_{i}=\inf\left\{t>\hat{\sigma}_{i-1}:X_{t}\in\Gamma_{c}(t)\right\},\quad i\geq 1,

with Γc+=[y,+)\Gamma_{c}^{+}=[y_{\ell},+\infty) and Γc=(,yh]\Gamma_{c}^{-}=(-\infty,y_{h}].

Figure 2 illustrates the shapes of the consumer’s value function in the different case of best–response. For the strategy given, we have a dominant function in the contraction regime (w0w^{-}_{0}) and a dominant function in the expansion regime (w0+w_{0}^{+}).

Refer to caption
Refer to caption
(a) 𝒞p=[1.2,2.5,2.6,4.21.5,3.0,2.8,4.5]\mathcal{C}_{p}=\begin{bmatrix}1.2,&2.5,&2.6,&4.2\\ 1.5,&3.0,&2.8,&4.5\end{bmatrix}
Figure 2: Value functions w0±w^{\pm}_{0}, w1±w^{\pm}_{1} and w2±w^{\pm}_{2} of the consumer given the producer’s strategy (a).

Remark: For comparison purposes, it is also useful to know the continuation region of the consumer when she alone controls the market price (Xt)(X_{t}). As usual, this region is (,yh)(-\infty,y_{h}) in the expansion regime and (y,+)(y_{\ell},+\infty) in the contraction regime, with the natural ordering y<yhy_{\ell}<y_{h}. The value functions w±w^{\pm} satisfy:

sup{βw++μ+wx++12σ2wxx++πc;wh+}\displaystyle\sup\big{\{}-\beta w^{+}+\mu_{+}w^{+}_{x}+\frac{1}{2}\sigma^{2}w^{+}_{xx}+\pi_{c};\,w^{-}-h_{+}\big{\}} =0,\displaystyle=0, (33)
sup{βw+μwx+12σ2wxx+πc;w+h}\displaystyle\sup\big{\{}-\beta w^{-}+\mu_{-}w^{-}_{x}+\frac{1}{2}\sigma^{2}w^{-}_{xx}+\pi_{c};\,w^{+}-h_{-}\big{\}} =0.\displaystyle=0. (34)

To set up a verification argument for the consumer’s best–response we make the ansatz

w+(x)\displaystyle w^{+}(x) ={w(yh)h+,xyh,ω^+(x)+λ1,0+eθ1+x+λ2,0+eθ2+x,x<yh,\displaystyle=\begin{cases}w^{-}(y_{h})-h_{+},&x\geq y_{h},\\ \widehat{\omega}^{+}(x)+\lambda_{1,0}^{+}e^{\theta_{1}^{+}x}+\lambda_{2,0}^{+}e^{\theta_{2}^{+}x},&x<y_{h},\end{cases} (35a)
w(x)\displaystyle w^{-}(x) ={ω^(x)+λ1,0eθ1x+λ2,0eθ2x,x>y,w+(y)h,xy.\displaystyle=\begin{cases}\widehat{\omega}^{-}(x)+\lambda_{1,0}^{-}e^{\theta_{1}^{-}x}+\lambda_{2,0}^{-}e^{\theta_{2}^{-}x},&x>y_{\ell},\\ w^{+}(y_{\ell})-h_{-},&x\leq y_{\ell}.\end{cases} (35b)

Furthermore, in the expansion regime, to keep w+(x)w^{+}(x) bounded as xx\to-\infty we must have λ2,0+=0\lambda_{2,0}^{+}=0 because θ2+<0\theta_{2}^{+}<0. In the contraction regime, a similar argument gives λ1,0=0\lambda_{1,0}^{-}=0. We are left with the four unknowns y,yhy_{\ell},y_{h} and λ1,0p\lambda_{1,0}^{p} and λ2,0\lambda_{2,0}^{-} determined from the following smooth pasting conditions:

{ω^(y)+λ2,0eθ2y=ω^+(y)+λ1,0+eθ1+yh,(𝒞0 at y)ω^+(yh)+λ1,0+eθ1+yh=ω^(yh)+λ2,0eθ2yhh+,(𝒞0 at yh)ω^x(y)+λ2,0θ2eθ2y=ω^x+(y)+λ1,0+θ1+eθ1+y,(𝒞1 at y)ω^x+(yh)+λ1,0+θ1+eθ1+yh=ω^x(yh)+λ2,0θ2eθ2yh.(𝒞1 at yh)\displaystyle\begin{cases}\widehat{\omega}^{-}(y_{\ell})+\lambda^{-}_{2,0}e^{\theta^{-}_{2}y_{\ell}}=\widehat{\omega}^{+}(y_{\ell})+\lambda_{1,0}^{+}e^{\theta_{1}^{+}y_{\ell}}-h_{-},&(\mathcal{C}^{0}\text{ at }y_{\ell})\\ \widehat{\omega}^{+}(y_{h})+\lambda^{+}_{1,0}e^{\theta^{+}_{1}y_{h}}=\widehat{\omega}^{-}(y_{h})+\lambda_{2,0}^{-}e^{\theta_{2}^{-}y_{h}}-h_{+},&(\mathcal{C}^{0}\text{ at }y_{h})\\ \widehat{\omega}^{-}_{x}(y_{\ell})+\lambda^{-}_{2,0}\theta^{-}_{2}e^{\theta^{-}_{2}y_{\ell}}=\widehat{\omega}^{+}_{x}(y_{\ell})+\lambda^{+}_{1,0}\theta^{+}_{1}e^{\theta^{+}_{1}y_{\ell}},&(\mathcal{C}^{1}\text{ at }y_{\ell})\\ \widehat{\omega}^{+}_{x}(y_{h})+\lambda^{+}_{1,0}\theta^{+}_{1}e^{\theta^{+}_{1}y_{h}}=\widehat{\omega}^{-}_{x}(y_{h})+\lambda^{-}_{2,0}\theta^{-}_{2}e^{\theta^{-}_{2}y_{h}}.&(\mathcal{C}^{1}\text{ at }y_{h})\\ \end{cases} (36)

\hfill\Box

3.2 Producer Best Response

We now consider the best–response of the producer, given the consumer’s switching strategy denoted by 𝒞c:=[y,yh]\mathcal{C}_{c}:=[y_{\ell},y_{h}]. Once again, we face three Cases:

  1. 1.

    The producer is a monopolist, i.e. the consumer is completely inactive;

  2. 2.

    The consumer adopts a single–switch strategy;

  3. 3.

    The consumer adopts a double–switch strategy.

3.2.1 Producer as Sole Optimizer

To begin with, we determine the monopoly-like strategy of the producer assuming the consumer adopts a no–switch strategy. In that case μt\mu_{t} is constant throughout and the functions v±v^{\pm} of the producer satisfy the variational inequality (VI):

sup{βv±+μ±vx±+12σ2vxx±+πp,supξ{v±(+ξ)v±()Kp(ξ)}}\displaystyle\sup\Big{\{}-\beta v^{\pm}+\mu_{\pm}v^{\pm}_{x}+\frac{1}{2}\sigma^{2}v^{\pm}_{xx}+\pi_{p}\;,\sup_{\xi}\big{\{}v^{\pm}(\cdot+\xi)-v^{\pm}(\cdot)-K_{p}(\xi)\big{\}}\Big{\}} =0.\displaystyle=0. (37)

Note that the two VIs for v+v^{+} and vv^{-} are autonomous, hence uncoupled from each other. In the continuation region, the general solution of the ODE

βv+μ±vx+12σ2vxx+πp(x)=0-\beta v+\mu_{\pm}v_{x}+\frac{1}{2}\sigma^{2}v_{xx}+\pi_{p}(x)=0

is of the form v±(x)v^{\pm}(x) == v^±(x)+u±(x)\widehat{v}^{\pm}(x)+u^{\pm}(x), where u±=ν1±eθ1±x+ν2±eθ2±xu^{\pm}=\nu^{\pm}_{1}e^{\theta^{\pm}_{1}x}+\nu^{\pm}_{2}e^{\theta^{\pm}_{2}x}, with θ1±,θ2±\theta^{\pm}_{1},\theta_{2}^{\pm} as before, satisfies the homogenous ODE βu+μ±ux+12σ2uxx=0-\beta u+\mu_{\pm}u_{x}+\frac{1}{2}\sigma^{2}u_{xx}=0, and v^±(x)\widehat{v}^{\pm}(x) is a particular solution given by

v^±(x)=Ax2+B±x+C±,\widehat{v}^{\pm}(x)=Ax^{2}+B_{\pm}x+C_{\pm}, (38)

where the coefficients A,B±,C±A,B_{\pm},C_{\pm} are identified as:

A\displaystyle A =d1β,B±=1β(d02μ±d1β+cpd1),C±=1β(μ±B±+Aσ2cpd0).\displaystyle=-\frac{d_{1}}{\beta},\quad B_{\pm}=\frac{1}{\beta}\Big{(}d_{0}-\frac{2\,\mu_{\pm}\,d_{1}}{\beta}+c_{p}\,d_{1}\Big{)},\quad C_{\pm}=\frac{1}{\beta}\big{(}\mu_{\pm}B_{\pm}+A\sigma^{2}-c_{p}d_{0}\big{)}.

Assuming the producer adopts threshold–type impulse strategies defined by ξ(x)\xi^{\ast}(x) in the intervention region, her expected payoff is of the form:

v±(x)\displaystyle v^{\pm}(x) ={v±(xh±)Kp(ξ(x))xxh±,v^±(x)+ν1±eθ1±x+ν2±eθ2±x,x±<x<xh±,v±(x±)Kp(ξ(x))xx±.\displaystyle=\begin{cases}v^{\pm}(x^{\pm\ast}_{h})-K_{p}(\xi^{\ast}(x))&x\geq x^{\pm}_{h},\\ \widehat{v}^{\pm}(x)+\nu^{\pm}_{1}e^{\theta^{\pm}_{1}x}+\nu^{\pm}_{2}e^{\theta^{\pm}_{2}x},&x^{\pm}_{\ell}<x<x^{\pm}_{h},\\ v^{\pm}(x^{\pm\ast}_{\ell})-K_{p}(\xi^{\ast}(x))&x\leq x^{\pm}_{\ell}.\end{cases} (39)

When applying the optimal impulse ξ±(x)\xi^{\pm\ast}(x) at the threshold xr±,r=,hx_{r}^{\pm},r=\ell,h, the producer brings XtX_{t} back to the price level xr±x^{\pm\ast}_{r} :=:= xr±ξ±(xr±)x^{\pm}_{r}-\xi^{\pm\ast}(x^{\pm}_{r}).For optimality, the respective impulse amounts satisfy the first order conditions

vx±(xh±)\displaystyle v_{x}^{\pm}(x^{\pm\ast}_{h}) =ξKp(ξ(xh±)),vx±(x±)=ξKp(ξ(x±)).\displaystyle=-\partial_{\xi}K_{p}(\xi^{\ast}(x^{\pm}_{h})),\qquad v_{x}^{\pm}(x^{\pm\ast}_{\ell})=-\partial_{\xi}K_{p}(\xi^{\ast}(x^{\pm}_{\ell})). (40)

We reinterpret the above as the equation to be satisfied by ξ(xr±)\xi^{\ast}(x_{r}^{\pm}) which are treated temporarily as unknowns and plugged into further equations. To ensure that the value function is continuous at xr±x^{\pm}_{r} we further need

v±(xr±)\displaystyle v^{\pm}(x^{\pm}_{r}) =v±(xr±)Kp(ξr±).\displaystyle=v^{\pm}(x^{\pm\ast}_{r})-K_{p}(\xi^{\pm\ast}_{r}). (41)

Finally, making the hypothesis that the value function is differentiable at the borders of the intervention region, we have:

vx±(x±)\displaystyle v^{\pm}_{x}(x^{\pm}_{\ell}) =vx±(x±)ξKp(ξ(x±)),\displaystyle=v^{\pm}_{x}(x^{\pm\ast}_{\ell})-\partial_{\xi}K_{p}(\xi^{\ast}(x^{\pm}_{\ell})), (42a)
vx±(xh±)\displaystyle v^{\pm}_{x}(x^{\pm}_{h}) =vx±(xh±)ξKp(ξ(xh±)).\displaystyle=v^{\pm}_{x}(x^{\pm\ast}_{h})-\partial_{\xi}K_{p}(\xi^{\ast}(x^{\pm}_{h})). (42b)

We consider two cases of impulse costs: (i) constant Kp(ξ)=κ0K_{p}(\xi)=\kappa_{0} and (ii) linear Kp(ξ)=κ0+κ1|ξ|K_{p}(\xi)=\kappa_{0}+\kappa_{1}|\xi|. In case (i), because the impulse cost is independent of the intervention amount there will be an optimal impulse level xr±x^{\pm\ast}_{r} so that for any xx in the intervention region the strategy is to impulse back to xr±x^{\pm\ast}_{r} which is the same at the two thresholds. In case (ii), ξKp=±κ1\partial_{\xi}K_{p}=\pm\kappa_{1} and all the smooth pasting and boundary conditions can be gathered in the following system:

{v^±(xh±)+ν1±eθ1±xh±+ν2±eθ2±xh±=v^±(xh±)+ν1±eθ1±xh±+ν2±eθ2±xh±κ0κ1(xh±xh±),(𝒞0 at xh±)v^±(x±)+ν1±eθ1±x±+ν2±eθ2±x±=v^±(x±)+ν1±eθ1±x±+ν2±eθ2±x±κ0κ1(x±x±),(𝒞0 at x±)v^x±(xh±)+ν1±θ1±eθ1±xh±+ν2±θ2±eθ2±xh±=κ1(𝒞1 at xh±)v^x±(x±)+ν1±θ1±eθ1±x±+ν2±θ2±eθ2±x±=κ1,(𝒞1 at x±)v^x±(xh±)+ν1±θ1±eθ1±xh±+ν2±θ2±eθ2±xh±=v^x±(xh±)+ν1±θ1±eθ1±xh±+ν2±θ2±eθ2±xh±κ1,(𝒞1 at xh±)v^x±(x±)+ν1±θ1±eθ1±x±+ν2±θ2±eθ2±x±=v^x±(x±)+ν1±θ1±eθ1±x±+ν2±θ2±eθ2±x±+κ1.(𝒞1 at x±)\displaystyle\begin{cases}\widehat{v}^{\pm}(x^{\pm}_{h})+\nu^{\pm}_{1}e^{\theta^{\pm}_{1}x^{\pm}_{h}}+\nu^{\pm}_{2}e^{\theta^{\pm}_{2}x^{\pm}_{h}}=\widehat{v}^{\pm}(x^{\pm\ast}_{h})+\nu^{\pm}_{1}e^{\theta^{\pm}_{1}x^{\pm\ast}_{h}}+\nu^{\pm}_{2}e^{\theta^{\pm}_{2}x^{\pm\ast}_{h}}-\kappa_{0}-\kappa_{1}(x^{\pm}_{h}-x^{\pm\ast}_{h}),&(\mathcal{C}^{0}\text{ at }x^{\pm}_{h})\\ \widehat{v}^{\pm}(x^{\pm}_{\ell})+\nu^{\pm}_{1}e^{\theta^{\pm}_{1}x^{\pm}_{\ell}}+\nu^{\pm}_{2}e^{\theta^{\pm}_{2}x^{\pm}_{\ell}}=\widehat{v}^{\pm}(x^{\pm\ast}_{\ell})+\nu^{\pm}_{1}e^{\theta^{\pm}_{1}x^{\pm\ast}_{\ell}}+\nu^{\pm}_{2}e^{\theta^{\pm}_{2}x^{\pm\ast}_{\ell}}-\kappa_{0}-\kappa_{1}(x^{\pm\ast}_{\ell}-x^{\pm}_{\ell}),&(\mathcal{C}^{0}\text{ at }x^{\pm}_{\ell})\\ \widehat{v}^{\pm}_{x}(x^{\pm\ast}_{h})+\nu^{\pm}_{1}\theta^{\pm}_{1}e^{\theta^{\pm}_{1}x^{\pm\ast}_{h}}+\nu^{\pm}_{2}\theta^{\pm}_{2}e^{\theta^{\pm}_{2}x^{\pm\ast}_{h}}=-\kappa_{1}&(\mathcal{C}^{1}\text{ at }x^{\pm\ast}_{h})\\ \widehat{v}^{\pm}_{x}(x^{\pm\ast}_{\ell})+\nu^{\pm}_{1}\theta^{\pm}_{1}e^{\theta^{\pm}_{1}x^{\pm\ast}_{\ell}}+\nu^{\pm}_{2}\theta^{\pm}_{2}e^{\theta^{\pm}_{2}x^{\pm\ast}_{\ell}}=\kappa_{1},&(\mathcal{C}^{1}\text{ at }x^{\pm\ast}_{\ell})\\ \widehat{v}^{\pm}_{x}(x^{\pm}_{h})+\nu^{\pm}_{1}\theta^{\pm}_{1}e^{\theta^{\pm}_{1}x^{\pm}_{h}}+\nu^{\pm}_{2}\theta^{\pm}_{2}e^{\theta^{\pm}_{2}x^{\pm}_{h}}=\widehat{v}^{\pm}_{x}(x^{\pm\ast}_{h})+\nu^{\pm}_{1}\theta^{\pm}_{1}e^{\theta^{\pm}_{1}x^{\pm\ast}_{h}}+\nu^{\pm}_{2}\theta^{\pm}_{2}e^{\theta^{\pm}_{2}x^{\pm\ast}_{h}}-\kappa_{1},&(\mathcal{C}^{1}\text{ at }x^{\pm}_{h})\\ \widehat{v}^{\pm}_{x}(x^{\pm}_{\ell})+\nu^{\pm}_{1}\theta^{\pm}_{1}e^{\theta^{\pm}_{1}x^{\pm}_{\ell}}+\nu^{\pm}_{2}\theta^{\pm}_{2}e^{\theta^{\pm}_{2}x^{\pm}_{\ell}}=\widehat{v}^{\pm}_{x}(x^{\pm\ast}_{\ell})+\nu^{\pm}_{1}\theta^{\pm}_{1}e^{\theta^{\pm}_{1}x^{\pm\ast}_{\ell}}+\nu^{\pm}_{2}\theta^{\pm}_{2}e^{\theta^{\pm}_{2}x^{\pm\ast}_{\ell}}+\kappa_{1}.&(\mathcal{C}^{1}\text{ at }x^{\pm}_{\ell})\end{cases} (43)

Note that there are two uncoupled linear systems for v+v^{+} and vv^{-}. The 𝒞0{\mathcal{C}}^{0} conditions are from (41), the first two 𝒞1{\mathcal{C}}^{1} conditions are from (40) which determines the optimal impulse destination, and the last two 𝒞1{\mathcal{C}}^{1} conditions are from (42).

By a standard verification argument, one can show that if both systems above admit solutions ν1,2±\nu^{\pm}_{1,2} and x,h±x^{\pm}_{\ell,h}, where the latter satisfy the order condition x±<xh±x_{\ell}^{\pm}<x_{h}^{\pm}, then the functions v±(x)v^{\pm}(x) as in (39) are the value functions of the producer and his optimal strategies are given by the thresholds x,h±x_{\ell,h}^{\pm} and impulse amounts ξ(x,h±)\xi^{\ast}(x_{\ell,h}^{\pm\ast}). This can be done by following exactly the arguments in, e.g., [7] (see also their Remark 2.1), which are very standard in the literature of impulse control problems. Therefore, details are omitted.

3.2.2 Non-preemptive Response

Suppose the following ordering, which is similar to (15), holds:

x±<yl<yh<xh±.x_{\ell}^{\pm}<y_{l}<y_{h}<x_{h}^{\pm}. (44)

We then expect v±v^{\pm} to solve the VIs

{sup{βv++μ+vx++12σ2vxx++πp;supξ(v+(ξ)v+Kp(ξ))}=0,sup{βv+μvx+12σ2vxx+πp;supξ(v(ξ)vKp(ξ))}=0.\displaystyle\begin{cases}\sup\big{\{}-\beta v^{+}+\mu_{+}v^{+}_{x}+\frac{1}{2}\sigma^{2}v^{+}_{xx}+\pi_{p}\;;\;\sup_{\xi}(v^{+}(\cdot-\xi)-v^{+}-K_{p}(\xi))\big{\}}=0,\\ \sup\big{\{}-\beta v^{-}+\mu_{-}v^{-}_{x}+\frac{1}{2}\sigma^{2}v^{-}_{xx}+\pi_{p}\;;\;\sup_{\xi}(v^{-}(\cdot-\xi)-v^{-}-K_{p}(\xi))\big{\}}=0.\\ \end{cases} (45)

To obtain the producer best–response it suffices to identify the two active impulse thresholds x+,xhx^{+}_{\ell},x^{-}_{h} and the respective target levels x+,xhx^{+\ast}_{\ell},x^{-\ast}_{h}. The other two boundary conditions take place at the consumer thresholds y,yhy_{\ell},y_{h}, so that the strategy (see (47) below) is 𝒞p=[x+,x+,,+,,xh,xh].\mathcal{C}_{p}=\begin{bmatrix}x^{+}_{\ell},&x^{+\ast}_{\ell},&-,&+\infty\\ -\infty,&-,&x^{-\ast}_{h},&x^{-}_{h}\end{bmatrix}. The game coupling shows up in the additional boundary condition that when the consumer switches, the producer’s value is unaffected:

v+(y)=v(y),y(,y][yh,+).v^{+}(y)=v^{-}(y),\qquad y\in(-\infty,y_{\ell}]\cup[y_{h},+\infty). (46)

Accordingly, our ansatz is

v(x)\displaystyle v^{-}(x) ={v(xh)Kp(ξ(x)),xxh,v^(x)+ν1eθ1x+ν2eθ2x,y<x<xh,v+(x),xy,\displaystyle=\begin{cases}v^{-}(x^{-\ast}_{h})-K_{p}(\xi^{\ast}(x)),&x\geq x^{-}_{h},\\ \widehat{v}^{-}(x)+\nu^{-}_{1}e^{\theta^{-}_{1}x}+\nu^{-}_{2}e^{\theta^{-}_{2}x},&y_{\ell}<x<x^{-}_{h},\\ v^{+}(x),&x\leq y_{\ell},\end{cases} (47a)
v+(x)\displaystyle v^{+}(x) ={v(x),xyh,v^+(x)+ν1+eθ1+x+ν2+eθ2+x,x+<x<yh,v+(x+)Kp(ξ(x)),xx+.\displaystyle=\begin{cases}v^{-}(x),&x\geq y_{h},\\ \widehat{v}^{+}(x)+\nu^{+}_{1}e^{\theta^{+}_{1}x}+\nu^{+}_{2}e^{\theta^{+}_{2}x},&x^{+}_{\ell}<x<y_{h},\\ v^{+}(x^{+\ast}_{\ell})-K_{p}(\xi^{\ast}(x)),&x\leq x^{+}_{\ell}.\end{cases} (47b)

To simplify the presentation, let us concentrate on the proportional impulse costs Kp(ξ)=κ0+κ1|ξ|K_{p}(\xi)=\kappa_{0}+\kappa_{1}|\xi|. We have the smooth pasting 𝒞1\mathcal{C}^{1} and boundary conditions:

{v^+(y)+ν1+eθ1+y+ν2+eθ2+y=v^(y)+ν1eθ1y+ν2eθ2y,(𝒞0 at y)v^(yh)+ν1eθ1yh+ν2eθ2yh=v^+(yh)+ν1+eθ1+yh+ν2+eθ2+yh,(𝒞0 at yh)v^+(x+)+ν1+eθ1+x++ν2+eθ2+x+=v^+(x+)+ν1+eθ1+x++ν2+eθ2+x+Kp(ξ(x+)),(𝒞0 at x+)v^(xh)+ν1eθ1xh+ν2eθ2xh=v^(xh)+ν1eθ1xh+ν2eθ2xhKp(ξ(xh)),(𝒞0 at xh)v^x+(x+)+ν1+θ1+eθ1+x++ν2+θ2+eθ2+x+=v^x+(x+)+ν1+θ1+eθ1+x++ν2+θ2+eθ2+x+κ1,(𝒞1 at x+)v^x(xh)+ν1θ1eθ1xh+ν2θ2eθ2xh=v^x(xh)+ν1θ1eθ1xh+ν2θ2eθ2xh+κ1.(𝒞1 at xh)v^x+(x+)+ν1+θ1+eθ1+x++ν2+θ2+eθ2+x+=κ1(𝒞1 at x+)v^x(xh)+ν1θ1eθ1xh+ν2θ2eθ2xh=κ1,(𝒞1 at xh)\displaystyle\begin{cases}\widehat{v}^{+}(y_{\ell})+\nu^{+}_{1}e^{\theta^{+}_{1}y_{\ell}}+\nu^{+}_{2}e^{\theta^{+}_{2}y_{\ell}}=\widehat{v}^{-}(y_{\ell})+\nu^{-}_{1}e^{\theta^{-}_{1}y_{\ell}}+\nu^{-}_{2}e^{\theta^{-}_{2}y_{\ell}},&(\mathcal{C}^{0}\text{ at }y_{\ell})\\ \widehat{v}^{-}(y_{h})+\nu^{-}_{1}e^{\theta^{-}_{1}y_{h}}+\nu^{-}_{2}e^{\theta^{-}_{2}y_{h}}=\widehat{v}^{+}(y_{h})+\nu^{+}_{1}e^{\theta^{+}_{1}y_{h}}+\nu^{+}_{2}e^{\theta^{+}_{2}y_{h}},&(\mathcal{C}^{0}\text{ at }y_{h})\\ \widehat{v}^{+}(x^{+}_{\ell})+\nu^{+}_{1}e^{\theta^{+}_{1}x^{+}_{\ell}}+\nu^{+}_{2}e^{\theta^{+}_{2}x^{+}_{\ell}}=\widehat{v}^{+}(x^{+\ast}_{\ell})+\nu^{+}_{1}e^{\theta^{+}_{1}x^{+\ast}_{\ell}}+\nu^{+}_{2}e^{\theta^{+}_{2}x^{+\ast}_{\ell}}-K_{p}(\xi^{\ast}(x^{+}_{\ell})),&(\mathcal{C}^{0}\text{ at }x^{+}_{\ell})\\ \widehat{v}^{-}(x^{-}_{h})+\nu^{-}_{1}e^{\theta^{-}_{1}x^{-}_{h}}+\nu^{-}_{2}e^{\theta^{-}_{2}x^{-}_{h}}=\widehat{v}^{-}(x^{-\ast}_{h})+\nu^{-}_{1}e^{\theta^{-}_{1}x^{-\ast}_{h}}+\nu^{-}_{2}e^{\theta^{-}_{2}x^{-\ast}_{h}}-K_{p}(\xi^{\ast}(x^{-}_{h})),&(\mathcal{C}^{0}\text{ at }x^{-}_{h})\\ \widehat{v}^{+}_{x}(x^{+}_{\ell})+\nu^{+}_{1}\theta^{+}_{1}e^{\theta^{+}_{1}x^{+}_{\ell}}+\nu^{+}_{2}\theta^{+}_{2}e^{\theta^{+}_{2}x^{+}_{\ell}}=\widehat{v}^{+}_{x}(x^{+\ast}_{\ell})+\nu^{+}_{1}\theta^{+}_{1}e^{\theta^{+}_{1}x^{+\ast}_{\ell}}+\nu^{+}_{2}\theta^{+}_{2}e^{\theta^{+}_{2}x^{+\ast}_{\ell}}{-\kappa_{1}},&(\mathcal{C}^{1}\text{ at }x^{+}_{\ell})\\ \widehat{v}^{-}_{x}(x^{-}_{h})+\nu^{-}_{1}\theta^{-}_{1}e^{\theta^{-}_{1}x^{-}_{h}}+\nu^{-}_{2}\theta^{-}_{2}e^{\theta^{-}_{2}x^{-}_{h}}=\widehat{v}^{-}_{x}(x^{-\ast}_{h})+\nu^{-}_{1}\theta^{-}_{1}e^{\theta^{-}_{1}x^{-\ast}_{h}}+\nu^{-}_{2}\theta^{-}_{2}e^{\theta^{-}_{2}x^{-\ast}_{h}}{+\kappa_{1}}.&(\mathcal{C}^{1}\text{ at }x^{-}_{h})\\ \widehat{v}^{+}_{x}(x_{\ell}^{+\ast})+\nu^{+}_{1}\theta^{+}_{1}e^{\theta^{+}_{1}x_{\ell}^{+\ast}}+\nu^{+}_{2}\theta^{+}_{2}e^{\theta^{+}_{2}x_{\ell}^{+\ast}}=-\kappa_{1}&{(\mathcal{C}^{1}\text{ at }x_{\ell}^{+\ast})}\\ \widehat{v}^{-}_{x}(x_{h}^{-\ast})+\nu^{-}_{1}\theta^{-}_{1}e^{\theta^{-}_{1}x_{h}^{-\ast}}+\nu^{-}_{2}\theta^{-}_{2}e^{\theta^{-}_{2}x_{h}^{-\ast}}=\kappa_{1},&{(\mathcal{C}^{1}\text{ at }x_{h}^{-\ast})}\end{cases} (48)

Unlike the single–agent setting (43), the equations (48) are coupled. The coefficients ν1,2±\nu^{\pm}_{1,2} are the solution to the linear system

[eθ1+yeθ2+yeθ1yeθ2yeθ1+x+eθ1+x+eθ2+x+eθ2+x+00eθ1+yheθ2+yheθ1yheθ2yh00eθ1xheθ1xheθ2xheθ2xh][ν1+ν2+ν1ν2]=[v^(y)v^+(y)v^+(x+)v^+(x+)Kpv^+(yh)v^(yh)v^(xh)v^(xh)Kp]\displaystyle\begin{bmatrix}e^{\theta^{+}_{1}y_{\ell}}&e^{\theta^{+}_{2}y_{\ell}}&-e^{\theta^{-}_{1}y_{\ell}}&-e^{\theta^{-}_{2}y_{\ell}}\\ e^{\theta^{+}_{1}x^{+}_{\ell}}-e^{\theta^{+}_{1}x^{+\ast}_{\ell}}&e^{\theta^{+}_{2}x^{+}_{\ell}}-e^{\theta^{+}_{2}x^{+\ast}_{\ell}}&0&0\\ -e^{\theta^{+}_{1}y_{h}}&-e^{\theta^{+}_{2}y_{h}}&e^{\theta^{-}_{1}y_{h}}&e^{\theta^{-}_{2}y_{h}}\\ 0&0&e^{\theta^{-}_{1}x^{-}_{h}}-e^{\theta^{-}_{1}x^{-\ast}_{h}}&e^{\theta^{-}_{2}x^{-}_{h}}-e^{\theta^{-}_{2}x^{-\ast}_{h}}\end{bmatrix}\cdot\begin{bmatrix}\nu^{+}_{1}\\ \nu^{+}_{2}\\ \nu^{-}_{1}\\ \nu^{-}_{2}\end{bmatrix}=\begin{bmatrix}\widehat{v}^{-}(y_{\ell})-\widehat{v}^{+}(y_{\ell})\\ \widehat{v}^{+}(x^{+\ast}_{\ell})-\widehat{v}^{+}(x^{+}_{\ell})-K_{p}\\ \widehat{v}^{+}(y_{h})-\widehat{v}^{-}(y_{h})\\ \widehat{v}^{-}(x^{-\ast}_{h})-\widehat{v}^{-}(x^{-}_{h})-K_{p}\end{bmatrix} (49)

and the thresholds xh+,xx_{h}^{+},x_{\ell}^{-} are determined by the 𝒞1{\mathcal{C}}^{1} smooth–pasting (recall that xh=xhξ(xh)x^{-\ast}_{h}=x^{-}_{h}-\xi^{\ast}(x^{-}_{h}), x+=x+ξ(x+)x^{+\ast}_{\ell}=x^{+}_{\ell}-\xi^{\ast}(x^{+}_{\ell})):

{vx(xh)=vx(xh),vx+(x+)=vx+(x+),\displaystyle\begin{cases}v^{-}_{x}(x^{-}_{h})=v^{-}_{x}(x^{-\ast}_{h}),\\ v^{+}_{x}(x^{+}_{\ell})=v^{+}_{x}(x^{+\ast}_{\ell}),\end{cases} (50)

and the first order conditions (FOCs) giving the optimal impulses:

vx(xh)=ξKp(ξ(xh))vx+(x+)\displaystyle v_{x}^{-}(x_{h}^{-\ast})=-\partial_{\xi}K_{p}(\xi^{\ast}(x^{-}_{h}))\qquad v_{x}^{+}(x_{\ell}^{+\ast}) =ξKp(ξ(x+)).\displaystyle=-\partial_{\xi}K_{p}(\xi^{\ast}(x^{+}_{\ell})). (51)
Proposition 3.

Let the 88-tuple (ν1±,ν2±,xh+,x,xh+,x)(\nu^{\pm}_{1},\nu^{\pm}_{2},x^{+}_{h},x^{-}_{\ell},x^{+*}_{h},x^{-*}_{\ell}) be a solution to the system (48), such that the order in (44) is fulfilled and x+<x+,xh<xhx^{+}_{\ell}<x_{\ell}^{+*},x_{h}^{-*}<x_{h}^{-}. Let v±v^{\pm} be defined in (47) and assume

vxx+(x+)<0,vxx(xh)<0.v_{xx}^{+}(x_{\ell}^{+*})<0,\qquad v_{xx}^{-}(x_{h}^{-*})<0. (52)

Then the functions v±v^{\pm} are the best–response payoffs of the producer, and a best–response strategy is given by

τ0=0,\displaystyle\tau^{*}_{0}=0, τi=inf{t>τi1:XtΓp(t)},\displaystyle\quad\tau^{*}_{i}=\inf\left\{t>\tau^{*}_{i-1}:X^{\ast}_{t}\in\Gamma_{p}(t-)\right\}, (53)
ξi(x+)\displaystyle\xi^{*}_{i}(x_{\ell}^{+}) =x+x+,ξi(xh)=xhxh,i1,\displaystyle=x_{\ell}^{+*}-x_{\ell}^{+},\qquad\xi^{*}_{i}(x_{h}^{-})=x_{h}^{-}-x_{h}^{-*},\qquad i\geq 1, (54)

with Γp(t)=Γp+𝟏{μt=μ+}+Γp𝟏{μt=μ}\Gamma_{p}(t)=\Gamma^{+}_{p}\mathbf{1}_{\{\mu_{t}=\mu_{+}\}}+\Gamma^{-}_{p}\mathbf{1}_{\{\mu_{t}=\mu_{-}\}}, where Γp+=(,x+]\Gamma_{p}^{+}=(-\infty,x^{+}_{\ell}] and Γp=[xh,+)\Gamma_{p}^{-}=[x^{-}_{h},+\infty), while (Xt)(X^{*}_{t}) follows the dynamics corresponding to the consumer’s strategy (σi)i1(\sigma_{i})_{i\geq 1} and the producer’s impulse strategy (τi,ξi)i1(\tau^{*}_{i},\xi_{i}^{*})_{i\geq 1}.

3.2.3 Preemptive Response

It is possible that the static discounted future profit of the producer satisfies, say, v+(x)v(x)v^{+}(x)\geq v^{-}(x) for any xx, so that he always prefers expansion regime to contraction regime.

In that case, the consumer switching at yhy_{h} from expansion to contraction hurts the producer and one possible strategy for him is to preempt in order to prevent the consumer from switching the drift to μ\mu_{-}. This situation could be viewed as looking for best xh+<yhx^{+}_{h}<y_{h}, given yhy_{h}. In the latter case the constrained solution could be xh+=yhx^{+}_{h}=y_{h}-, whereby the system (43) does not hold and the best–response is to impulse (Xt)(X_{t}) right before it hits yhy_{h}, xh+=yhx^{+}_{h}=y_{h}-. This strategy is not well-defined (i.e. the supremum is not achieved on the open interval (x+,yh)(x_{\ell}^{+},y_{h})), but the resulting preemptive best–response value in the μ+\mu_{+} regime can be obtained by using the ansatz (where we slightly abuse the notation to write xh+=yhξ(yh)x_{h}^{+\ast}=y_{h}-\xi^{\ast}(y_{h}) for the target impulse level at yhy_{h})

v+(x)\displaystyle v^{+}(x) ={v+(xh+)Kp(ξ(x)),xyh,v^+(x)+ν1+eθ1+x+ν2+eθ2+x,x+<x<yh,v+(x+)Kp(ξ(x)),xx+,\displaystyle=\begin{cases}v^{+}(x^{+\ast}_{h})-K_{p}(\xi^{\ast}(x)),&x\geq y_{h},\\ \widehat{v}^{+}(x)+\nu^{+}_{1}e^{\theta^{+}_{1}x}+\nu^{+}_{2}e^{\theta^{+}_{2}x},&x^{+}_{\ell}<x<y_{h},\\ v^{+}(x^{+\ast}_{\ell})-K_{p}(\xi^{\ast}(x)),&x\leq x^{+}_{\ell},\end{cases} (55)

and the boundary conditions for determining the target impulse levels

vx+(xh+)\displaystyle v_{x}^{+}(x^{+\ast}_{h}) =κ1,vx+(x+)=+κ1.\displaystyle=-\kappa_{1},\quad v_{x}^{+}(x_{\ell}^{+\ast})=+\kappa_{1}. (56)

Note that we now have 5 unknowns, ν1,2+,x+,x+,xh+\nu^{+}_{1,2},x^{+}_{\ell},x^{+\ast}_{\ell},x^{+\ast}_{h} rather than six as we “fixed” xh+=yhx^{+}_{h}=y_{h}. This yields the following system

{v^+(yh)+ν1+eθ1+yh+ν2+eθ2+yh=v^+(xh+)+ν1+eθ1+(xh+)+ν2+eθ2+(xh+)Kp(ξ(yh))(𝒞0at yh)v^+(x+)+ν1+eθ1+x++ν2+eθ2+x+=v^+(x+)+ν1+eθ1+x++ν2+eθ2+x+Kp(ξ(x+))(𝒞0 at x+)v^x+(x+)+ν1+θ1+eθ1+x++ν2+θ2+eθ2+x+=v^x+(x+)+ν1+θ1+eθ1+x++ν2+θ2+eθ2+x++κ1(𝒞1 at x+)v^x+(xh+)+ν1+θ1+eθ1+xh++ν2+θ2+eθ2+xh+=κ1(𝒞1 at xh+)v^x+(x+)+ν1+θ1+eθ1+x++ν2+θ2+eθ2+x+=κ1.(𝒞1 at x+)\displaystyle\begin{cases}\widehat{v}^{+}(y_{h})+\nu^{+}_{1}e^{\theta^{+}_{1}y_{h}}+\nu^{+}_{2}e^{\theta^{+}_{2}y_{h}}=\widehat{v}^{+}(x^{+\ast}_{h})+\nu^{+}_{1}e^{\theta^{+}_{1}(x^{+\ast}_{h})}+\nu^{+}_{2}e^{\theta^{+}_{2}(x^{+\ast}_{h})}-K_{p}(\xi^{*}(y_{h}))&(\mathcal{C}^{0}\text{at }y_{h})\\ \widehat{v}^{+}(x^{+}_{\ell})+\nu^{+}_{1}e^{\theta^{+}_{1}x^{+}_{\ell}}+\nu^{+}_{2}e^{\theta^{+}_{2}x^{+}_{\ell}}=\widehat{v}^{+}(x^{+\ast}_{\ell})+\nu^{+}_{1}e^{\theta^{+}_{1}x^{+\ast}_{\ell}}+\nu^{+}_{2}e^{\theta^{+}_{2}x^{+\ast}_{\ell}}-K_{p}(\xi^{*}(x_{\ell}^{+}))&(\mathcal{C}^{0}\text{ at }x^{+}_{\ell})\\ \widehat{v}^{+}_{x}(x^{+}_{\ell})+\nu^{+}_{1}\theta^{+}_{1}e^{\theta^{+}_{1}x^{+}_{\ell}}+\nu^{+}_{2}\theta^{+}_{2}e^{\theta^{+}_{2}x^{+}_{\ell}}=\widehat{v}^{+}_{x}(x^{+\ast}_{\ell})+\nu^{+}_{1}\theta^{+}_{1}e^{\theta^{+}_{1}x^{+\ast}_{\ell}}+\nu^{+}_{2}\theta^{+}_{2}e^{\theta^{+}_{2}x^{+\ast}_{\ell}}+\kappa_{1}&(\mathcal{C}^{1}\text{ at }x^{+}_{\ell})\\ \widehat{v}_{x}^{+}(x^{+\ast}_{h})+\nu^{+}_{1}\theta^{+}_{1}e^{\theta^{+}_{1}x_{h}^{+\ast}}+\nu^{+}_{2}\theta^{+}_{2}e^{\theta^{+}_{2}x_{h}^{+\ast}}=-\kappa_{1}&(\mathcal{C}^{1}\text{ at }x^{+\ast}_{h})\\ \widehat{v}^{+}_{x}(x^{+\ast}_{\ell})+\nu^{+}_{1}\theta^{+}_{1}e^{\theta^{+}_{1}x^{+\ast}_{\ell}}+\nu^{+}_{2}\theta^{+}_{2}e^{\theta^{+}_{2}x^{+\ast}_{\ell}}=\kappa_{1}.&(\mathcal{C}^{1}\text{ at }x^{+\ast}_{\ell})\\ \end{cases} (57)

Preemption in the contraction regime writes in a symmetric way.

In general, we need to manually verify whether xh+>yhx_{h}^{+}>y_{h} (the “normal” case) or xh+=yhx^{+}_{h}=y_{h} (the preemptive case) whenever we consider the producer best–response. The two situations lead to different boundary conditions at the upper threshold, and hence cannot be directly compared. Considering the optimization problem for xh+x_{h}^{+}, we expect his value function to increase in xh+x_{h}^{+} on (x+,yh)(x_{\ell}^{+},y_{h}) and experience a positive jump at yhy_{h}, i.e. conditional on someone acting, the producer prefers the consumer’s switch to applying his impulse. However, if this is not the case, the consumer action hurts the producer and assuming the impulse costs are low, the best-response is xh+=yhx_{h}^{+}=y_{h}. This corner solution arises due to the underlying discontinuity: on (x+,yh)(x_{\ell}^{+},y_{h}) the producer compares the value of waiting to the value of doing an optimal impulse, but at yhy_{h} he compares the value of switching to that of doing an optimal impulse. So it could be that “waiting” >> impulsing >> switching at yhy_{h}, leading to pre-emptive impulse to prevent the worst (for the producer) outcome.

Proposition 4.

Assume μ0=μ+\mu_{0-}=\mu_{+}. Let the 55-tuple (ν1+,ν2+,x+,x+,xh+)(\nu^{+}_{1},\nu^{+}_{2},x^{+}_{\ell},x^{+*}_{\ell},x^{+*}_{h}) be a solution to the system (57), such that the order in (44) is fulfilled and x+<x+,xh+<yhx^{+}_{\ell}<x_{\ell}^{+*},x_{h}^{+*}<y_{h}. Let v+v^{+} be defined as in (55) and assume

vxx+(x+)<0,vxx+(xh+)<0.v_{xx}^{+}(x_{\ell}^{+*})<0,\qquad v_{xx}^{+}(x_{h}^{+*})<0. (58)

Then, the function v+v^{+} is the best–response payoff of the producer in the expansion regime, and a best–response strategy is given by

τ0=0,\displaystyle\tau^{*}_{0}=0, τi=inf{t>τi1:XtΓp(t)},\displaystyle\quad\tau^{*}_{i}=\inf\{t>\tau^{*}_{i-1}:X_{t}\in\Gamma_{p}(t-)\}, (59)
ξi(x+)\displaystyle\xi^{*}_{i}(x_{\ell}^{+}) =x+x+,ξi(yh)=yhxh+,i1,\displaystyle=x_{\ell}^{+*}-x_{\ell}^{+},\quad\xi^{*}_{i}(y_{h})=y_{h}-x_{h}^{+*},\quad i\geq 1, (60)

with Γp(t)=Γp+(t)=(,x+][yh,+)\Gamma_{p}(t)=\Gamma^{+}_{p}(t)=(-\infty,x^{+}_{\ell}]\cup[y_{h},+\infty), while XX^{\ast} follows the dynamics corresponding to the producer’s impulse strategy (τi,ξi)i1(\tau^{*}_{i},\xi_{i}^{*})_{i\geq 1}.

Refer to caption
Refer to caption
Figure 3: Value functions v0±v^{\pm}_{0}, v1±v^{\pm}_{1}, and vpre±v^{\pm}_{\rm pre} of the producer given the consumer’s strategy 𝒞c=(3,4).\mathcal{C}_{c}=(3,4).

Figure 3 illustrates the shapes of the producer’s value function in the different cases of best–response. For the given consumer strategy, we have a dominant function in the contraction regime (v1v^{-}_{1}) and a dominant function in the expansion regime (v1+v_{1}^{+}).

4 Equilibria

The best–response functions defined in Section 3 lead to three types of potential market equilibria, depending on the equilibrium behaviour of the consumer and characterized by the relative positions of the consumerand producer thresholds:

  • Type I – generic: yxhy_{\ell}\leq x^{-}_{h} and x+yhx^{+}_{\ell}\leq y_{h}.

  • Type II – transitory: =yx-\infty=y_{\ell}\leq x^{-}_{\ell} and x+yhx^{+}_{\ell}\leq y_{h}; or yxhy_{\ell}\leq x^{-}_{h} and yh=+y_{h}=+\infty.

  • Type III – preemptive: yxhy_{\ell}\leq x^{-}_{h} and x+=yhx^{+}_{\ell}=y_{h}; or y=xhy_{\ell}=x^{-}_{h} and x+yhx^{+}_{\ell}\leq y_{h}.

In equilibrium Type I, the consumer switches back and forth forever between the two expansion and contraction regimes. The optimal policy of the consumer is given by the threshold yy_{\ell} in the contraction regime and yhy_{h} in the expansion regime, while the optimal policy of the producer is formed by the pair (x+,x+,)(x_{\ell}^{+},x_{\ell}^{+,\ast}) in the expansion regime and symmetrically by the pair (xh+,,xh)(x_{h}^{+,\ast,}x_{h}^{-}) in the contraction regime. We anticipate this to be the most common equilibrium type; it was precisely described and illustrated in Section 4.1.

In equilibrium Type II, the consumer and the producer both prefer a given regime and thus, the consumer switches at most once when the market is initialized in the opposite regime. Afterwards, only the producer acts to maintain the price between (x,xh)(x_{\ell},x_{h}). Consider the case of a single switch from expansion to contraction; the consumer’s optimal policy consists then in only one threshold, yhy_{h}. The optimal policy of the producer is more complicated: in the expansion regime, it consists of the pair (x+,x+,)(x_{\ell}^{+},x_{\ell}^{+,\ast}) and in the contraction regime, it consists of a quadruplet (x,x,,xh,,xh)(x^{-}_{\ell},x^{-,\ast}_{\ell},x^{-,\ast}_{h},x^{-}_{h}). The same reasoning applies in the other single switch case. This equilibrium is described in Section 4.2.

The last type of equilibrium, named Type III, resembles the preceding one in the sense that at most one switch can be observed. But it differs because here the consumer is stuck forever in a state she wishes to leave. In that case described in Section 4.3, only the producer acts. Starting in the expansion regime, for instance, the consumer would like to switch to the contraction regime when the price reaches a threshold yhy_{h}. But the producer, who prefers perpetual expansion, preempts the switch by acting at the threshold yhy_{h}^{-}, just before the action of the consumer.

Threshold–type equilibria offer analytical tractability to describe the long–run market behavior. The latter can be summarized by the stationary distribution of the commodity price (Xt)(X^{\ast}_{t}) and the consumer regimes (μt)(\mu^{\ast}_{t}) as induced by the equilibrium strategies (Nt,μt)(N^{\ast}_{t},\mu^{\ast}_{t}). To quantify these effects, we define an auxiliary discrete–time jump chain (Mn)n=0(M^{\ast}_{n})_{n=0}^{\infty} which takes values in the state space

E:={S+,S,I,Ih,I+,Ih+}.\displaystyle E:=\left\{S_{+},S_{-},I^{-}_{\ell},I^{-}_{h},I^{+}_{\ell},I^{+}_{h}\right\}. (61)

The chain MM^{\ast} keeps track of the sequential actions of the players, where S±S_{\pm} represents the switches of the consumer (“S+S_{+}” stands for the switch μμ+\mu_{-}\rightarrow\mu_{+} and “SS_{-}” for μ+μ\mu_{+}\rightarrow\mu_{-}) and Ir±I^{\pm}_{r} the impulses (up/down at the two impulse boundaries) of the producer. Thus, MM^{\ast} summarizes the sequence of market interventions stored within τi,σi\tau_{i},\sigma_{i} stopping times. Note that states Mn{S+,Ih+}M^{\ast}_{n}\in\{S_{+},I^{+}_{\ell h}\} imply a positive drift μ+\mu_{+} of XX^{\ast}, while the rest imply a negative drift μ\mu_{-}. Moreover, if the consumer adopts a double–switch strategy and the producer adopts a non-preemptive strategy as discussed in Section 4.1, then the thresholds xh+x^{+}_{h} and xx^{-}_{\ell} will be hit at most once by XX^{\ast} and therefore the corresponding states Ih+I^{+}_{h} and II^{-}_{\ell} of MnM^{\ast}_{n} are transient.

Because the dynamics of XX^{\ast} between interventions are always Brownian motion (BM) with drift, the transition probabilities of MM^{\ast} can be described in terms of hitting probabilities of a BM. This offers closed-form expressions for the the transition probability matrix 𝐏\mathbf{P} of MM^{\ast}, and its invariant distribution denoted by Π\vec{\Pi}. Moreover, the sojourn times of MM^{\ast} correspond to (Xt)(X^{\ast}_{t}) hitting the various thresholds (in terms of the original continuous-time “tt”) and are similarly linked to BM first passage times. Combining the above ideas, we can then derive a complete description of (μt)(\mu^{\ast}_{t}), namely the long-run proportion of time that the commodity demand is in expansion/contraction regimes and the respective expected switching time, see (71).

In the following section otherwise stated, we use the parameter values in Table 5, such that πi(x)=ai(xxi1)(xi2x)\pi_{i}(x)=a_{i}(x-x^{1}_{i})(x^{2}_{i}-x), i{c,p}i\in\{c,p\}. This yields consumer and producer preferred price levels of X¯c=3,X¯p=4\bar{X}_{c}=3,\bar{X}_{p}=4. The same set of parameters yields an equilibrium of each type, showing the non-uniqueness of equilibria in this model.

Refer to caption
Figure 4: Producer’s and consumer’s profit rates as a function of xx.
Market Consumer Producer
β\beta 0.10.1 xc1x^{1}_{c} 1 xp1x^{1}_{p} 2
σ\sigma 0.250.25 xc2x^{2}_{c} 5 xp2x^{2}_{p} 6
μ+\mu_{+} 0.10.1 aca_{c} 0.75 apa_{p} 0.25
μ\mu_{-} 0.1-0.1 h±h_{\pm} 1010 κ0\kappa_{0} 3
κ1\kappa_{1} 0
Figure 5: Model parameters for Section 4.1.

4.1 Type I – Generic

We look for an interior, non-preemptive equilibrium satisfying the ordering (15), i.e. a pair of consumer and producer strategies of the form (y,yh)(y_{\ell}^{\ast},y_{h}^{\ast}) and (x+,x+,xh,xh)(x^{+}_{\ell},x^{+\ast}_{\ell},x^{-\ast}_{h},x^{-}_{h}). To construct this equilibrium, we employ tâtonnement, i.e. iteratively apply the best–response controls alternating between the two players. This corresponds to the interpretation of Nash equilibrium as a fixed point of best–response maps BRBR. The equilibrium is obtained using two different fixed–point algorithms. Given strategies 𝒞p0\mathcal{C}_{p}^{0} and 𝒞c0\mathcal{C}_{c}^{0}, we have either an asynchronous or synchronous algorithm, namely

𝒞pk+1=BR(𝒞ck),\displaystyle\mathcal{C}_{p}^{k+1}=BR(\mathcal{C}_{c}^{k}),\quad 𝒞pk+1=BR(𝒞ck),\displaystyle\mathcal{C}_{p}^{k+1}=BR(\mathcal{C}_{c}^{k}),
𝒞ck+1=BR(𝒞pk+1),\displaystyle\mathcal{C}_{c}^{k+1}=BR(\mathcal{C}_{p}^{k+1}),\quad 𝒞ck+1=BR(𝒞pk),\displaystyle\mathcal{C}_{c}^{k+1}=BR(\mathcal{C}_{p}^{k}),
asynchronous synchronous.\displaystyle\text{synchronous}.

The resulting equilibrium found using both algorithms is the same and is

𝒞pI,=[2.0,3.6,,+,,4.5,6.1],𝒞cI,=[2.2,4.4].\displaystyle\mathcal{C}^{\rm{I},\ast}_{p}=\begin{bmatrix}2.0,&3.6,&-,&+\infty\\ -\infty,&-,&4.5,&6.1\\ \end{bmatrix},\qquad\mathcal{C}^{\rm{I},\ast}_{c}=[2.2,4.4]. (62)
Refer to caption
Refer to caption
Refer to caption
Figure 6: A sample path of the controlled market price (Xt)(X^{\ast}_{t}) under a Type I equilibrium (Left), Type II (Center) and Type III (Right) together with 𝔼[Xt]\mathbb{E}\big{[}X^{\ast}_{t}\big{]} (black solid curve). The colors along the xx-axis indicate the corresponding μt\mu^{\ast}_{t}. We start with μ0=μ+\mu^{\ast}_{0}=\mu_{+}.

The dynamic equilibrium of the commodity price XX^{\ast} is illustrated in Figure 6 (Left). The market starts in the expansion regime, μ0=μ+\mu^{\ast}_{0}=\mu_{+}. We observe that xhx^{-\ast}_{h} is close to yhy_{h}^{\ast}, implying that once the price has reached the switching level yhy_{h}^{\ast}, it is likely to touch soon thereafter the threshold xhx^{-\ast}_{h}, making the price drop to xhx^{-\ast}_{h}. The producer “backs up” this mean-reversion by impulsing down if prices rise too much and impulsing down if they drop too much. Otherwise, she lets the consumer be in charge via switching control that benefits him as well.

At equilibrium, the price XX^{\ast} fluctuates in a range of values where neither the producer nor the consumer have negative profit rate. If alone in the market, the optimal monopolistic strategies of the producer and the consumer are

𝒞pm:=[1.9,3.5,3.5,5.62.4,4.5,4.5,6.1],𝒞cm:=[1.7,4.3].\displaystyle\mathcal{C}^{m}_{p}:=\begin{bmatrix}1.9,&3.5,&3.5,&5.6\\ 2.4,&4.5,&4.5,&6.1\\ \end{bmatrix},\qquad\mathcal{C}^{m}_{c}:=[1.7,4.3]. (63)

We see that the equilibrium strategy of the producer 𝒞pI,\mathcal{C}^{\rm{I},\ast}_{p} is quite close to what he would have done if alone in the market. On his side, the consumer-induced equilibrium price range is wider than he would prefer (2.62.6 against 2.22.2 if alone). In equilibrium, it is as if the producer lets the consumer do the job of bringing back the price to his preferred level X¯p\bar{X}_{p}. The producer intervenes only if XtX^{\ast}_{t} drops too low or gets too high, after the regime switching has occurred. But, in the long–run, the average price limt𝔼[Xt]\lim_{t\to\infty}\mathbb{E}[X^{\ast}_{t}] is close to 3.53.5, which is the mid–value between X¯p\bar{X}_{p} and X¯c\bar{X}_{c}.

The players’ equilibrium strategy profile yields a stationary distribution for the pair (Xt,μt)(X^{\ast}_{t},\mu^{\ast}_{t}). The macro market μ\mu^{\ast} switches between the expansion and the contraction regimes back and forth, while the jointly controlled price (Xt)(X^{\ast}_{t}) is bounded in the range [x+,xh][x^{+}_{\ell},x^{-}_{h}] and fluctuates in a mean-reverting pattern due to alternating signs of its drift. These stylized features can be broadly traced in the world commodity markets which undergo cyclical Expansion/Contraction patterns.

Dynamics of (Xt)(X^{\ast}_{t}) in the equilibrium: The dynamics of the commodity price (Xt)(X^{\ast}_{t}) are less tractable due to the impulses applied by the producer. Let ϕ()\phi^{\ast}(\cdot) denote the long-run (i.e. stationary) distribution of (Xt)(X^{\ast}_{t}). In Figure 7, we show ϕ\phi^{\ast} obtained from an empirical density based on a long trajectory of (Xt)(X^{\ast}_{t}), relying on Monte Carlo simulations and the ergodicity of the recurrent, bounded process (Xt)(X^{\ast}_{t}). For additional interpretability, we also plot the invariant distributions ϕ±\phi^{\ast}_{\pm} conditional on μt=μ±\mu_{t}=\mu_{\pm}.

Refer to caption
Refer to caption
Figure 7: Estimated long-run densities ϕ\phi^{\ast} of XX^{\ast} in a double–switch and one–sided impulse equilibrium. Panel (a): overall kernel smoothed ϕ()\phi^{\ast}(\cdot). (b): long–run distributions ϕ±\phi^{\ast}_{\pm} of XX^{\ast} conditional on μ(t)=μ+\mu(t)=\mu_{+} (resp. μ(t)=μ\mu(t)=\mu_{-}). Recall that the domain of XtX^{\ast}_{t} depends on the current regime, so the support of the two densities differs.

4.2 Type II – Transitory

In another type of equilibrium the consumer switches only in one regime, with the other being absorbing. For this reason, we name it transitory. To fix ideas, suppose that the consumer only switches from expansion regime to contraction regime. In that case, the producer effectively acts like a profit maximizing monopoly in the contraction regime with two–sided impulses; in the expansion regime she will apply a one–sided impulse as in the equilibrium type I.

To solve for such equilibrium, we first compute the producer strategy in the contraction regime which is a decoupled VI as in (39) leading to the 6 equations in (43) but only for vv^{-}, xr,xrx_{r}^{-},x_{r}^{-*}, r{,h}r\in\{\ell,h\}. This solution induces the corresponding no–switch solution ω0\omega_{0}^{-} of the consumer as in (20)-(21). Both v,ω0v^{-},\omega_{0}^{-} are then fixed and act as source terms to solve for the equilibrium in the expansion regime. For the latter, we need to compute v+()v^{+}(\cdot) and the associated thresholds x+,x+x_{\ell}^{+},x_{\ell}^{+*} (only one threshold), as well as ω+(x)\omega^{+}(x) and the switching threshold yhy_{h} (note that there is no yy_{\ell}). The boundary conditions are v+(yh)=v(yh),ω+(yh)=ω(yh)h0v^{+}(y_{h})=v^{-}(y_{h}),\omega^{+}(y_{h})=\omega^{-}(y_{h})-h_{0}.

This reasoning leads to the following algorithm. If x+x^{+}_{\ell} and x+,x^{+,\ast}_{\ell} are fixed, we compute the best–response of the consumer by solving the variational problem for the consumer value function ω+\omega^{+} such that:

w+(x)\displaystyle w^{+}(x) ={w0(x)h0,yhx,ω^+(x)+λ1+eθ1+x+λ2+eθ2+x,x+<x<yh,w+(x+,),xx+.\displaystyle=\begin{cases}w_{0}^{-}(x)-h_{0},&y_{h}\leq x,\\ \widehat{\omega}^{+}(x)+\lambda^{+}_{1}e^{\theta^{+}_{1}x}+\lambda^{+}_{2}e^{\theta^{+}_{2}x},&x^{+}_{\ell}<x<y_{h},\\ w^{+}(x^{+,\ast}_{\ell}),&x\leq x^{+}_{\ell}.\end{cases}

This is exactly the best–response in the single–switch case with the solution given by the system (24) and which provides the consumer’s threshold yhy_{h}. Now, if we consider that yhy_{h} is fixed, we can compute the best–response of the producer by solving a VI for the value function v+v^{+} that satisfies

v+(x)\displaystyle v^{+}(x) ={v(x),yhx,v^+(x)+ν1+eθ1+x+ν2+eθ2+x,x+<x<yh,v+(x+)κ0κ1(x+,x),xx+.\displaystyle=\begin{cases}v^{-}(x),&y_{h}\leq x,\\ \widehat{v}^{+}(x)+\nu^{+}_{1}e^{\theta^{+}_{1}x}+\nu^{+}_{2}e^{\theta^{+}_{2}x},&x^{+}_{\ell}<x<y_{h},\\ v^{+}(x^{+\ast}_{\ell})-\kappa_{0}-\kappa_{1}(x^{+,\ast}_{\ell}-x),&x\leq x^{+}_{\ell}.\end{cases}

The boundary conditions giving the four unknowns (x+,x+,,ν1+,ν2+)(x^{+}_{\ell},x^{+,\ast}_{\ell},\nu^{+}_{1},\nu^{+}_{2}) are:

{v(yh)=v^+(yh)+ν1+eθ1+yh+ν2+eθ2+yh,(𝒞0 at yh)v^+(x+)+ν1+eθ1+x++ν2+eθ2+x+=v^+(x+)+ν1+eθ1+x++ν2+eθ2+x+κ0κ1(x+,x+),(𝒞0 at x+)v^x+(x+)+ν1+θ1+eθ1+x++ν2+θ2+eθ2+x+=v^x+(x+)+ν1+θ1+eθ1+x++ν2+θ2+eθ2+x++κ1,(𝒞1 at x+)v^x+(x+)+ν1+θ1+eθ1+x++ν2+θ2+eθ2+x+=κ1.(𝒞1 at x+)\displaystyle\begin{cases}v^{-}(y_{h})=\widehat{v}^{+}(y_{h})+\nu^{+}_{1}e^{\theta^{+}_{1}y_{h}}+\nu^{+}_{2}e^{\theta^{+}_{2}y_{h}},&(\mathcal{C}^{0}\text{ at }y_{h})\\ \widehat{v}^{+}(x^{+}_{\ell})+\nu^{+}_{1}e^{\theta^{+}_{1}x^{+}_{\ell}}+\nu^{+}_{2}e^{\theta^{+}_{2}x^{+}_{\ell}}=\widehat{v}^{+}(x^{+\ast}_{\ell})+\nu^{+}_{1}e^{\theta^{+}_{1}x^{+\ast}_{\ell}}+\nu^{+}_{2}e^{\theta^{+}_{2}x^{+\ast}_{\ell}}\!\!-\kappa_{0}-\kappa_{1}(x^{+,\ast}_{\ell}-x^{+}_{\ell}),&(\mathcal{C}^{0}\text{ at }x^{+}_{\ell})\\ \widehat{v}^{+}_{x}(x^{+}_{\ell})+\nu^{+}_{1}\theta^{+}_{1}e^{\theta^{+}_{1}x^{+}_{\ell}}+\nu^{+}_{2}\theta^{+}_{2}e^{\theta^{+}_{2}x^{+}_{\ell}}=\widehat{v}^{+}_{x}(x^{+\ast}_{\ell})+\nu^{+}_{1}\theta^{+}_{1}e^{\theta^{+}_{1}x^{+\ast}_{\ell}}+\nu^{+}_{2}\theta^{+}_{2}e^{\theta^{+}_{2}x^{+\ast}_{\ell}}+\kappa_{1},&(\mathcal{C}^{1}\text{ at }x^{+}_{\ell})\\ \widehat{v}^{+}_{x}(x_{\ell}^{+\ast})+\nu^{+}_{1}\theta^{+}_{1}e^{\theta^{+}_{1}x_{\ell}^{+\ast}}+\nu^{+}_{2}\theta^{+}_{2}e^{\theta^{+}_{2}x_{\ell}^{+\ast}}=\kappa_{1}.&(\mathcal{C}^{1}\text{ at }x_{\ell}^{+\ast})\end{cases} (64)

Now, we can perform the iterations yh0(x+,(0),x+,(0))yh1(x+,(1),x+,(1))y_{h}^{0}\to(x^{+,(0)}_{\ell},x^{+,\ast(0)}_{\ell})\to y_{h}^{1}\to(x^{+,(1)}_{\ell},x^{+,\ast(1)}_{\ell})\ldots.

We find the following fixed–point of the best–response functions of the producer and the consumer:

𝒞pII,=[1.9,3.6,,+2.4,4.5,4.5,6.1],𝒞cII,=[,4.3],\displaystyle\mathcal{C}^{\rm{II},\ast}_{p}=\begin{bmatrix}1.9,&3.6,&-,&+\infty\\ 2.4,&4.5,&4.5,&6.1\\ \end{bmatrix},\qquad\mathcal{C}^{\rm{II},\ast}_{c}=[-\infty,4.3], (65)

The system starts in the expansion regime, and once the price reaches level Xt=4.3X^{\ast}_{t}=4.3, the consumer switches to contraction and the systems remains in that state for ever. After that, she relies on the producer to impulse (Xt)(X^{\ast}_{t}) up/down when prices get too low/too high but never reverts to the Expansion regime. Thus, in the long-run (Xt)(X^{\ast}_{t}) is simply a Brownian motion with negative drift μ\mu_{-} that has two impulse boundaries x=2.4,xh=6.1x^{-}_{\ell}=2.4,x^{-}_{h}=6.1.

Compared to the double–switch equilibrium of the previous section, the above market equilibrium in (65) has two important differences. First, as tt\to\infty we have that μtμ\mu^{\ast}_{t}\to\mu_{-} so that in the long–run the market will be in the contraction regime and the consumer becomes inactive. Second, because the producer eventually “takes over”, she will intervene much more frequently (see center panel of Figure 6), benefiting himself and reducing consumer value.

4.3 Type III – Preemptive

The producer may have an interest to preempt the switch, say, from the expansion regime to the contraction regime to avoid decline in the consumption of the commodity he produces. In this case, the equilibrium is a fixed point of the best–response function of the producer described in Section 3.2.3 and the best–response function of the consumer described in Section 3.1.2. We look for an equilibrium where the consumer would like to switch at yhy_{h} in the expansion regime, but where the producer makes yhy_{h} his own intervention threshold to impulse the price down.

Using the same protocol as in Type I and Type II equilibrium research, we find the following threshold strategy for the producer and the consumer:

𝒞pIII,:=[1.7,3.1,3.1,4.3,,,],𝒞cIII,:=[,4.3].\displaystyle\mathcal{C}^{\rm{III},\ast}_{p}:=\begin{bmatrix}1.7,&3.1,&3.1,&4.3\\ -,&-,&-,&-\\ \end{bmatrix},\qquad\mathcal{C}^{\rm{III},\ast}_{c}:=[-,4.3]. (66)

In the preemptive equilibrium, the price fluctuates in a narrower range than the other two equilibria. Here, (Xt)(X^{\ast}_{t}) oscillates between x+=1.7x^{+}_{\ell}=1.7 and xh+=yh=4.3x^{+}_{h}=y_{h}=4.3.

4.4 Equilibrium Non-Uniqueness

There are at least three potential equilibria. A natural question is thus whether one of them is preferable to the others. Figure 8 shows the value functions of the producer and of the consumer in the two market regimes (expansion and contraction) and for the different equilibria from type I to type III. We observe that the producer would prefer in both regimes to live in a type I equilibrium. The function v1±v_{1}^{\pm} dominates all the other ones (note that there is no v3v_{3}^{-} because in equilibrium type III contraction never happens). However, the consumer would rather be in the preemptive equilibrium type III: her value function w3+w^{+}_{3} dominates the other two. Intuitively, we may think that the switching costs she saves by letting the producer do all the work of maintaining the price around its long term average value compensate for the inconvenience of having prices that are higher than preferred.

Refer to caption
Figure 8: Producer and consumer value functions in the different equilibria.

4.5 Impact of Market Volatility

As one example of comparative statistics that are possible in our model, we investigate the impact of volatility parameter σ\sigma of XX on the equilibrium profits and behavior of XX^{\ast}. In Table 1 we list statistics of ϕ\phi^{\ast} for a range of market volatilities σ\sigma. We also quantify the profitability of the two players through their average percentage of optimality (APOO)

APOO:=𝒟πr(x)ϕ(dx)πr(X¯r),\displaystyle\mathrm{APOO}:=\frac{\int_{\mathcal{D}}\pi_{r}(x)\phi^{\ast}(dx)}{\pi_{r}(\bar{X}_{r})},

which is the ratio between average profit rate πr(Xt)\pi_{r}(X^{\ast}_{t}) in equilibrium and the maximum profit that could be hypothetically obtained at the first–best level πr(X¯r)\pi_{r}(\bar{X}_{r}), r{c,p}r\in\{c,p\}.

In all types of equilibria, both players are worse off in terms of expected profit rate as σ\sigma increase. This occurs even though in type I and in type II equilibria the average price 𝔼[X]\mathbb{E}[X^{*}] increases. However, that gain is dominated by the losses due to higher Var(X)\mathrm{Var}(X^{\ast}) which implies that prices tend to be further from their preferred levels X¯r\bar{X}_{r} decreasing 𝔼ϕ[πr]\mathbb{E}_{\phi^{\ast}}\left[\pi_{r}\right].

σ\sigma 𝔼ϕ[X]\mathbb{E}_{\phi^{\ast}}[X^{\ast}] Varϕ(X)\mathrm{Var}_{\phi^{\ast}}(X^{\ast}) 𝔼ϕ[πp]\mathbb{E}_{\phi^{\ast}}\big{[}\pi_{p}\big{]} 𝔼ϕ[πc]\mathbb{E}_{\phi^{\ast}}\big{[}\pi_{c}\big{]} Switch (per yr)
Type I 0.25 3.52 0.73 0.81 (80%) 2.4 (80%) 0.021
0.3 3.62 0.80 0.80 (79%) 2.2 (73%) 0.021
0.4 3.77 0.94 0.76 (75%) 1.90 (62%) 0.020
Type II 0.25 3.73 0.68 0.87 (85%) 2.3 (74%) 0.020
0.3 3.76 0.74 0.85 (84%) 2.2 (71%) 0.020
0.4 3.81 0.85 0.81 (80%) 1.95 (64%) 0.020
Type III 0.25 3.41 0.45 0.86 (85%) 2.7 (90%) 0.0
0.3 3.35 0.51 0.83 (82%) 2.7 (90%) 0.0
0.4 3.28 0.61 0.78 (77%) 2.7 (88%) 0.0
Table 1: Long–run mean and variance of XX^{\ast}, long–run profit rates, and frequency of regime switches as market volatility σ\sigma changes (APOO in parentheses).

4.6 Effect of consumer’s switching cost

A key parameter that controls which equilibrium type we face is the consumer’s switching cost h0h_{0}. Starting from the double–switch situation, as h0h_{0} increases (>0.6>0.6), the consumer is less incentivised to switch from μ+\mu^{+} to μ\mu^{-} and we enter the single–switch scenario of Section 3.1.2. Consequently, she receives the No–Switch payoff ω0+(x)\omega_{0}^{+}(x) when μt=μ+\mu_{t}=\mu^{+} and solving for her best–response boils down to solve for yhy_{h} only. Once h0h_{0} gets very large, her best–response is simply the No–Switch response ω0±\omega_{0}^{\pm}. Conversely, as h00h_{0}\downarrow 0 her actions become free. In that situation, we can reduce the producer problem to a single, piecewise VI with a free boundary X~c\tilde{X}_{c}:

sup{βv(x)+μvx+12σ2vxx+πp(x);supξ{v(x+ξ)v(x)Kp(ξ)}}\displaystyle\sup\Big{\{}-\beta v(x)+\mu_{-}v_{x}+\frac{1}{2}\sigma^{2}v_{xx}+\pi_{p}(x)\;;\;\sup_{\xi}\big{\{}v(x+\xi)-v(x)-K_{p}(\xi)\big{\}}\Big{\}} =0x>X~c,\displaystyle=0\qquad x>\tilde{X}_{c}, (67)
sup{βv(x)+μ+vx+12σ2vxx+πp(x);supξ{v(x+ξ)v(x)Kp(ξ)}}\displaystyle\sup\Big{\{}-\beta v(x)+\mu_{+}v_{x}+\frac{1}{2}\sigma^{2}v_{xx}+\pi_{p}(x)\;;\;\sup_{\xi}\big{\{}v(x+\xi)-v(x)-K_{p}(\xi)\big{\}}\Big{\}} =0x<X~c,\displaystyle=0\qquad x<\tilde{X}_{c}, (68)

with the 𝒞0{\cal C}^{0} regularity at X~c\tilde{X}_{c}: limxX~cv(x)=limxX~cv(x)\lim_{x\uparrow\tilde{X}_{c}}v(x)=\lim_{x\downarrow\tilde{X}_{c}}v(x).

Fig. 9 shows that for low h0h_{0}, xh+<yx^{+\ast}_{h}<y_{\ell} and xh+x^{+}_{h} is greater but close to yhy_{h}. Thus, when consumption switches from expansion to contraction, it is very likely that the price touches xh+x_{h}^{+} soon thereafter and is impulsed back to xh+x^{+\ast}_{h} and thus, the regime rapidly switches back to expansion again. When the switching cost increases, this solution disappears.

Refer to caption
Figure 9: Equilibrium thresholds as a function of consumer switching cost h0h_{0} given Table 5 parameter values. We show the consumer thresholds y,yhy_{\ell},y_{h}, the producer thresholds x+,xhx_{\ell}^{+},x_{h}^{-} and respective impulse target levels x,+,xh,x_{\ell}^{\ast,+},x_{h}^{\ast,-}.
Remark 3.

It is possible for the impulse amounts to be so large as to lead to a double simultaneous control: producer’s impulse instantaneously followed by the consumer switching. In this setting, the producer effectively forces the consumer to switch the regime by impulsing XX^{\ast} hard enough. This situation corresponds to xh<yx^{-\ast}_{h}<y_{\ell}, so that the impulse in the contraction regime moves XX^{\ast} into the respective switching region (,y)(-\infty,y_{\ell}), and as a result the consumer immediately switches to the expansion regime. This situation occurs if, for instance, the drifts are μ=0.01,μ+=0.1\mu^{-}=0.01,\mu^{+}=0.1, so that the consumer is not able to ever efficiently lower prices. Consequently, the producer is forced to fully control price reduction. We observe in the above situation the equilibrium thresholds of xh=3.04<y=3.69x^{-\ast}_{h}=3.04<y_{\ell}=3.69. \Box

5 Case study: diversification effect of vertical integration

The industrial organization of upstream and downstream segments is linked to anti-trust regulations. From a regulatory perspective, vertical integration could be used to increase market power and foreclose competitors. For example, see De Fontenay and Gans (2005) [10] who develop a game theoretic model for the foreclosure effect of vertical concentration and Hasting and Gilbert (2005) [15] for the related empirical facts in the context of the US retail gasoline market. At the same time, consumers can benefit from vertical integration of commodity producers; we refer to related analysis of electricity markets from a market equilibrium point of view (Aïd et al. (2011) [2]) and an empirical point of view (Mansur (2007) [26]). Another virtue of vertical integration is its potential to reduce the long-term exposure of the firm to commodity price fluctuations. See Helfat and Teece (1987) [16] for an empirical estimation of the hedge procured by vertical integration in the oil business.

In this section, we accordingly study whether or not downstream or upstream firms have an interest in being vertically integrated. To this end we consider a small firm that has no market power regarding the commodity price XX and focus on the case of the market equilibrium type I (generic case). We then investigate whether the firm can benefit from a diversification effect by having activity both in the downstream consumer side and the upstream conversion side.

To make the case study concrete, we consider a simplified version of the crude oil and gasoline markets, the latter a shorthand for refined products, calibrated to the ballpark of the 2019 state of the world. Currently, world oil consumption is about 100 Mb/d (millions of barrels per day), normalized to 1 "barrel" per day. We take as a nominal initial price X0X_{0} == 5050 USD/b and a nominal volatility of crude σ\sigma == 10 USD/b. To calibrate our model, we consider that crude oil producers have a preferred range of prices that goes from xp1=30x^{1}_{p}=30 USD/b to xp2=100x^{2}_{p}=100 USD/b and that the average cost of oil extraction is cp=30c_{p}=30 USD/b. This leads to a demand function Dp(x)=10.01xD_{p}(x)=1-0.01x, which captures the low sensitivity of the demand of crude to prices. The crude is transformed into gasoline with a small amount of losses 5%, so that the conversion factor is α=0.95\alpha=0.95.

We set the transfer function of crude oil price to average price of gasoline to P(x)=10+1.1xP(x)=10+1.1x, where P(x)P(x) is also expressed in USD/b. There is evidence that the (pre-tax) price of gasoline is a linear function of the crude. For instance, using monthly data of the Energy Information Agency of the US Department of Energy on refined products prices from January 1983 to November 2019 111Data available at http://www.eia.gov/dnav/pet/pet_pri_refoth_dcu_nus_m.htm., we regressed the US Total gasoline Retail sales by refineries P^\hat{P} to the monthly crude oil price X^\hat{X} and found a linear relation

P^m=1.2X^m+14+ϵm\hat{P}_{m}=1.2\hat{X}_{m}+14+\epsilon_{m}

with a regression R2=95%R^{2}=95\%. Considering that the basket of refined products includes not just gasoline (even if it accounts for the largest share), we simplified the relation. Note that the condition p1αp_{1}\geq\alpha for having a downstream convex profit function holds. Furthermore, refinery costs ccc_{c} are highly variable between 4 to 10 USD/b. We take the higher value of cc=10c_{c}=10. Finally, we consider that the demand function for refined products, Dc(P)=d0d1PD_{c}(P)=d^{\prime}_{0}-d^{\prime}_{1}P is such that d0=5d^{\prime}_{0}=5 b/d of crude equivalent refined products and d1=0.05d^{\prime}_{1}=0.05. With these parameters, the preferred range of crude prices for the consumer is between xc1=11x^{1}_{c}=11 and xc2=82x^{2}_{c}=82 USD/b. We consider fixed action costs both for the production firm and the downstream firm. We consider that the producer and the downstream firm lose two years of profit at optimal price to change state making κ0=2πp(X¯p)\kappa_{0}=2\pi_{p}(\bar{X}_{p}) and h0=2πc(X¯c)h_{0}=2\pi_{c}(\bar{X}_{c}). Finally, we take μ±=±0.15\mu_{\pm}=\pm 0.15 per year, which implies that it takes 10 years for the crude price to increase by 1.5 USD.

Value Interpretation Units
β\beta 0.10.1 Discount rate %/year
X0X_{0} 5050 Initial oil price USD/b
d0d_{0} 11 Demand function for oil: intercept Mb/d
d1d_{1} 0.010.01 Demand function for oil: slope Mb/d/(USD/b)
d0d^{\prime}_{0} 55 Demand function for gasoline: intercept Mb/d
d1d^{\prime}_{1} 0.050.05 Demand function for gasoline: slope Mb/d/(USD/b)
α\alpha 0.950.95 Transformation rate dimensionless
p0p_{0} 1010 Crude – gasoline price function: intercept USD/b
p1p_{1} 1.11.1 Crude – gasoline transfer price function: slope USD/b/(USD/b)
cpc_{p} 3030 Oil production cost USD/b
ccc_{c} 55 Refining cost USD/b
μ±\mu_{\pm} ± 0.15\pm\,0.15 Annualized crude drift parameters USD/b
σ\sigma 1010 Annualized crude volatility USD/b
h0h_{0} 2πc(X¯c)=292\pi_{c}(\bar{X}_{c})=29 Consumption switching cost USD
κ0\kappa_{0} 2πp(X¯p)=24.52\pi_{p}(\bar{X}_{p})=24.5 Production switching cost: fixed USD
κ1\kappa_{1} 0 Production switching cost: proportional USD/b
Table 2: Nominal values for model parameters for the crude oil case study.

The resulting equilibrium type I producer impulse strategy 𝒞pI,\mathcal{C}^{\rm I,\ast}_{p} and consumer’s switching strategy 𝒞cI,\mathcal{C}^{\rm I,\ast}_{c} associated to the calibration summarized in Table 2 are given by

𝒞pI,=[26,62,,+,,69,104],𝒞cI,=[22.5,87].\displaystyle\mathcal{C}^{\rm I,\ast}_{p}=\begin{bmatrix}26,&62,&-,&+\infty\\ -\infty,&-,&69,&104\\ \end{bmatrix},\qquad\mathcal{C}^{\rm I,\ast}_{c}=\begin{bmatrix}22.5,&87\\ \end{bmatrix}. (69)

Thus, in equilibrium, the crude price XX^{\ast} fluctuates between 22.522.5 and 8787 USD/b, with potential excursions up to 104104 or down to 2626 USD/b at which point producers intervene.

Refer to caption
Refer to caption
Refer to caption
Figure 10: Left: Profit rate function of the consumer πc(x)\pi_{c}(x) (red) as the pass-through parameter p1p_{1} is varied, as well as the fixed producer profit rate πp(x)\pi_{p}(x) (blue). Middle: Respective equilibrium strategy thresholds y,yhy_{\ell},y_{h} (red) and x+,xhx_{\ell}^{+},x_{h}^{-} (blue) as a function of p1p_{1}. We also plot X¯c\bar{X}_{c} and 𝔼ϕ[X]\mathbb{E}_{\phi^{\ast}}[X^{\ast}], shading the typical commodity price range [𝔼ϕ[X]±σϕ(X)][\mathbb{E}_{\phi^{\ast}}[X^{\ast}]\pm\sigma_{\phi^{\ast}}(X^{\ast})]. Right: Risk-minimizing integration level λ\lambda^{*} as a function of p1p_{1}.
Refer to caption
Figure 11: The curve λ(σ(πλ),𝔼[πλ])\lambda\mapsto(\sigma(\pi_{\lambda}),\mathbb{E}\big{[}\pi_{\lambda}\big{]}) as a function of the pass-through parameter p1p_{1}.

Now let us consider a small firm engaging in a fraction λ(0,1)\lambda\in(0,1) of activity in the downstream sector and 1λ1-\lambda in the upstream sector. Her profit rate is thus πλ:=λπc+(1λ)πp\pi_{\lambda}:=\lambda\pi_{c}+\big{(}1-\lambda\big{)}\pi_{p}. The firm is vertically integrated when 0<λ<10<\lambda<1. Denote by σ(πλ)\sigma(\pi_{\lambda}) the standard deviation of her profit rate πλ()\pi_{\lambda}(\cdot) integrated against the stationary distribution ϕ\phi^{\ast} of XX^{\ast}, and by 𝔼[πλ]=πλ(x)ϕ(dx)\mathbb{E}\big{[}\pi_{\lambda}\big{]}=\int\pi_{\lambda}(x)\phi^{\ast}(dx) the respective expected profit rate. To fix ideas and because the analysis is symmetric, we are interested in situations where a pure downstream firm (λ=0\lambda=0) would be better off having part of her activity in the upstream sector. This will take place when the upstream activity provides a higher expected profit rate and/or a lower risk as measured by σ(πλ)\sigma(\pi_{\lambda}). Figure 11 presents the risk–return curves λ(σ(πλ),𝔼[πλ])\lambda\mapsto(\sigma(\pi_{\lambda}),\mathbb{E}\big{[}\pi_{\lambda}\big{]}) as the pass-through parameter p1p_{1} increases from the nominal value of 1.11.1 to 1.181.18. We observe that for low values of p1p_{1} diversification gains are limited: expected profit rate goes up but risk also increases. For moderate p1p_{1} a pure downstream firm unambiguously benefits from some upstream activity: she can achieve the same level of risk with a higher expected profit. For high p1p_{1} the upstream sector dominates completely with lower risk and higher average profit. Figure 10 (Right) shows the critical integration level λ\lambda^{*} that minimizes the risk σ(πλ)\sigma(\pi_{\lambda}) and captures the “variance–minimal” business model.

We observe that for high enough pass-through values, being a producer (λ=1)(\lambda=1) dominates any other combination of activity. This phenomenon happens even though the maximum profit rate of the downstream firm πc(X¯c)\pi_{c}(\bar{X}_{c}) increases and gets higher than the producer’s maximum profit rate function πp(X¯p)\pi_{p}(\bar{X}_{p}) as shown in the left panel of Figure 10. As shown by the evolution of equilibrium price range in Figure 10 (Middle), as p1p_{1} increases, the equilibrium is getting more and more detrimental to the downstream firm. The shaded salmon area represents the interval [𝔼ϕ[X]σϕ(X),𝔼ϕ[X]+σϕ(X)][\mathbb{E}_{\phi^{\ast}}[X^{\ast}]-\sigma_{\phi^{\ast}}(X^{\ast}),\mathbb{E}_{\phi^{\ast}}[X^{\ast}]+\sigma_{\phi^{\ast}}(X^{\ast})] where commodity prices tend to reside. The average commodity price remains stable around 6565 USD/b, and its the standard deviation is not affected much by p1p_{1} either, while X¯c\bar{X}_{c} is steadily decreasing. Thus, since the expected profit rate of the integrated firm is a function of the expected price and its standard deviation, it does not change much. But its variance grows as a function of p1p_{1} and thus increases significantly. To conclude, in our model we do observe a diversification effect obtained by mixing upstream and downstream activities, however the integration gains depend closely on the pass-through parameter p1p_{1} which serves as a transmission channel of the volatility of the commodity price to the retail price.

6 Conclusion

We showed how a simple model of competition between upstream and downstream representative firms having different pace of intervention can lead to a rich variety of equilibria, potentially non–unique. The fact that the upstream firm can impact the price more rapidly than the downstream firm gives the producer a significant advantage, enabling him to lock the consumer in the producer’s preferred range of prices. Further, in the case of the crude oil market and its refinery products, we stressed how the pass-through parameter p1p_{1} plays a key role for the diversification effect induced by vertical integration. Vertical integration is beneficial for low values of p1p_{1} while for higher values, production dominates downstream activity both in terms of expected profit rate and profit standard deviation.

7 Proofs

7.1 Proof of Proposition 1

Proof.

The proof is standard, nonetheless we give some detail for the reader’s convenience. To ease the notation, let us consider only the case μ=μ+\mu=\mu_{+}, the other case being identical. Let w0+(x)=ω^+(x)+u+(x)w^{+}_{0}(x)=\widehat{\omega}^{+}(x)+u^{+}(x), where the parameters (λ1,0+,λ2,0+)(\lambda^{+}_{1,0},\lambda^{+}_{2,0})\in\mathbb{R} solve the system (20)-(21). By construction the function w0+w_{0}^{+} is of class 𝒞2\mathcal{C}^{2} everywhere. Hence we can apply Itô’s formula to eβsw0+(Xs)e^{-\beta s}w_{0}^{+}(X_{s}) on the time interval [0,tζn)[0,t\wedge\zeta_{n}), yielding

eβ(tζn)w0+(Xtζn)\displaystyle e^{-\beta(t\wedge\zeta_{n})}w_{0}^{+}(X_{t\wedge\zeta_{n}}) =w0+(x)+0+tζneβs[w0+(Xs)(μ+ds+σdWsdNs)βw0+(Xs)ds]\displaystyle=w_{0}^{+}(x)+\int_{0+}^{t\wedge\zeta_{n}}e^{-\beta s}\left[w_{0}^{+\prime}(X_{s-})(\mu_{+}ds+\sigma dW_{s}-dN_{s})-\beta w_{0}^{+}(X_{s-})ds\right]
+σ220+tζneβsw0+′′(Xs)𝑑s+0<stζneβs[Δw0+(Xs)+w0+(Xs)ΔNs],\displaystyle\quad+\frac{\sigma^{2}}{2}\int_{0+}^{t\wedge\zeta_{n}}e^{-\beta s}w_{0}^{+\prime\prime}(X_{s-})ds+\sum_{0<s\leq t\wedge\zeta_{n}}e^{-\beta s}\left[\Delta w_{0}^{+}(X_{s})+w_{0}^{+\prime}(X_{s-})\Delta N_{s}\right],

where (ζn)n1(\zeta_{n})_{n\geq 1} is a localizing sequence of stopping times along which the local martingale part above is in fact a true martingale. We use the notation and ′′ for, respectively, the first and second derivative in xx. Taking expectations on both sides, using the fact that w0+w_{0}^{+} solves the ordinary differential equation (17) and letting nn\to\infty, we obtain

𝔼[eβtw0+(Xt)]=w0+(x)𝔼[0+teβsπc(Xs)𝑑s+0<stΔw0+(Xs)].\mathbb{E}\left[e^{-\beta t}w_{0}^{+}(X_{t})\right]=w_{0}^{+}(x)-\mathbb{E}\left[\int_{0+}^{t}e^{-\beta s}\pi_{c}(X_{s})ds+\sum_{0<s\leq t}\Delta w_{0}^{+}(X_{s})\right].

Now, notice that on the jumps of XX we have Δw0+(Xs)=w0+(xr+)w0+(xr+)\Delta w_{0}^{+}(X_{s})=w_{0}^{+}(x^{+\ast}_{r})-w_{0}^{+}(x^{+}_{r}), which is zero by the boundary conditions (19), hence the jump part in the equation above vanishes. Moreover, being Xt[x+,xh+]X_{t}\in[x_{\ell}^{+},x_{h}^{+}] for all t0t\geq 0, we have by dominated convergence that 𝔼[eβtw0+(Xt)]0\mathbb{E}\left[e^{-\beta t}w_{0}^{+}(X_{t})\right]\to 0 as tt\to\infty. Therefore, letting tt\to\infty we can conclude that w0+(x)=Jc+(x;N,μ+)w_{0}^{+}(x)=J_{c}^{+}(x;N,\mu^{+}) for all x[x+,xh+]x\in[x_{\ell}^{+},x_{h}^{+}]. ∎

Fig. 12(a) illustrates the fact that a threshold switching strategy might not be optimal in all potential situations by considering the shape of w0±(x)w^{\pm}_{0}(x). In the right panel, we have comonotonicity between w+w^{+} and ww^{-}: the consumer is incentivised to switch to μ+\mu^{+} when XtX_{t} is low and to μ\mu^{-} when XtX_{t} is high. In that situation, we expect that a threshold–type strategy is a best response. In contrast, on the left panel two other cases are illustrated. First, we see that it is possible that w+()w()w^{+}(\cdot)\ll w^{-}(\cdot), in other words the consumer has a strong preference to one regime over the other. In that case, the expansion regime could be absorbing, i.e. it is optimal to never switch to μ\mu_{-}. In the plot this would happen if h0h_{0} is low (dashed line), whereby w(x)>w+(x)h0w^{-}(x)>w^{+}(x)-h_{0} and it is optimal to switch to μ\mu_{-} at any xx (therefore μ+\mu_{+} would never be observed in the resulting game evolution). At the same time, we see that if h0h_{0} is moderate (the solid line), then the region where w0(x)>w0+(x)h0w^{-}_{0}(x)>w^{+}_{0}(x)-h_{0} is disconnected, so it is likely that a two–threshold switching strategy is an optimal response.

Refer to caption
(a) 𝒞p=[1.53.02.84.01.22.52.64.2]\mathcal{C}_{p}=\begin{bmatrix}1.5&3.0&2.8&4.0\\ 1.2&2.5&2.6&4.2\end{bmatrix}
Refer to caption
(b) 𝒞p=[1.23.02.83.51.02.82.43.2]\mathcal{C}_{p}=\begin{bmatrix}1.2&3.0&2.8&3.5\\ 1.0&2.8&2.4&3.2\end{bmatrix}
Figure 12: No–Switch payoffs ω0±(x)\omega^{\pm}_{0}(x) of the consumer given the producer’s strategy 𝒞p\mathcal{C}_{p}.

7.2 Proof of Proposition 2

Proof of Proposition 2.

By construction, the functions w±(x)w^{\pm}(x) in (29) solve the system of VIs in (27)-(28) and satisfy w+𝒞2((x+,xh+){y})𝒞1((x+,xh+))𝒞0()w^{+}\in\mathcal{C}^{2}((x_{\ell}^{+},x_{h}^{+})\setminus\{y_{\ell}\})\cap\mathcal{C}^{1}((x_{\ell}^{+},x_{h}^{+}))\cap\mathcal{C}^{0}(\mathbb{R}) and w𝒞2((x,xh){yh})𝒞1((x,xh))𝒞0()w^{-}\in\mathcal{C}^{2}((x_{\ell}^{-},x_{h}^{-})\setminus\{y_{h}\})\cap\mathcal{C}^{1}((x_{\ell}^{-},x_{h}^{-}))\cap\mathcal{C}^{0}(\mathbb{R}). Let NN denote the pure jump component in XX’s dynamics associated to the producer’s strategy with thresholds (x±,x±;xh±,xh±)(x_{\ell}^{\pm},x_{\ell}^{\pm\ast};x_{h}^{\pm},x_{h}^{\pm\ast}). The proof is structured in two steps.

– Step 1: optimality. The following verification argument proves that such functions coincide with the best–response payoffs of the consumer and that the switching times σ^i\hat{\sigma}_{i} as in the statement are optimal provided they are admissible. First, by an approximation procedure as in the first part of the proof in [1, Theorem 3.3], we can assume without loss of generality that w+𝒞2((x+,xh+))𝒞0()w^{+}\in\mathcal{C}^{2}((x_{\ell}^{+},x_{h}^{+}))\cap\mathcal{C}^{0}(\mathbb{R}). Let μ0=μ+\mu_{0-}=\mu_{+}. Consider two consecutive switching times of any consumer admissible strategy, say σ2i\sigma_{2i} and σ2i+1\sigma_{2i+1}, for i0i\geq 0 with the convention σ0=0\sigma_{0}=0, and recall that over [σ2i,σ2i+1)[\sigma_{2i},\sigma_{2i+1}) the state process XX has drift μ+\mu_{+}. Applying Itô’s formula to eβtw+(Xt)e^{-\beta t}w^{+}(X_{t}) over the interval [σ2iT,σ2i+1T)[\sigma_{2i}\wedge T,\sigma_{2i+1}\wedge T), for some finite T>0T>0, we obtain

eβ(σ2i+1T)w+(Xσ2i+1T)\displaystyle e^{-\beta(\sigma_{2i+1}\wedge T)}w^{+}(X_{\sigma_{2i+1}\wedge T}) =eβ(σ2iT)w+(Xσ2iT)\displaystyle=e^{-\beta(\sigma_{2i}\wedge T)}w^{+}(X_{\sigma_{2i}\wedge T})
+σ2iTσ2i+1Teβs{wx+(Xs)dXs+σ22ωxx+(Xs)dsβXsds}\displaystyle\quad+\int_{\sigma_{2i}\wedge T}^{\sigma_{2i+1}\wedge T}e^{-\beta s}\left\{w^{+}_{x}(X_{s})dX_{s}+\frac{\sigma^{2}}{2}\omega_{xx}^{+}(X_{s})ds-\beta X_{s}ds\right\}
+σ2iT<uσ2i+1Teβs{Δw+(Xs)+wx+(Xs)ΔNs}.\displaystyle\quad+\sum_{\sigma_{2i}\wedge T<u\leq\sigma_{2i+1}\wedge T}e^{-\beta s}\left\{\Delta w^{+}(X_{s})+w_{x}^{+}(X_{s})\Delta N_{s}\right\}.

Using the dynamics dXs=μ+ds+σdWsdNsdX_{s}=\mu_{+}ds+\sigma dW_{s}-dN_{s} between the two switching times above, localizing the martingale part through a suitable sequence of stopping times ζn\zeta_{n} and taking expectation on both sides, we obtain

𝔼\displaystyle\mathbb{E} [eβζn,T2i+1w+(Xζn,T2i+1)]=𝔼[eβζn,T2iw+(Xζn,T2i)]\displaystyle\left[e^{-\beta\zeta^{2i+1}_{n,T}}w^{+}(X_{\zeta^{2i+1}_{n,T}})\right]=\mathbb{E}\left[e^{-\beta\zeta^{2i}_{n,T}}w^{+}(X_{\zeta^{2i}_{n,T}})\right]
+𝔼[ζn,T2iζn,T2i+1eβs{wx+(Xs)μ++σ22wxx+(Xs)βXs}𝑑s]+𝔼[ζn,T2is<ζn,T2i+1eβsΔw+(Xs)],\displaystyle+\mathbb{E}\left[\int_{\zeta^{2i}_{n,T}}^{\zeta^{2i+1}_{n,T}}e^{-\beta s}\left\{w_{x}^{+}(X_{s})\mu_{+}+\frac{\sigma^{2}}{2}w_{xx}^{+}(X_{s})-\beta X_{s}\right\}ds\right]+\mathbb{E}\left[\sum_{\zeta^{2i}_{n,T}\leq s<\zeta^{2i+1}_{n,T}}e^{-\beta s}\Delta w^{+}(X_{s})\right],

where we set ζn,Tk:=σkζnT\zeta_{n,T}^{k}:=\sigma_{k}\wedge\zeta_{n}\wedge T. Notice first that the third summand above vanishes since between σ2i\sigma_{2i} and σ2i+1\sigma_{2i+1}, the state process XX can jump only due to the impulses of the producer, hence at any of such jumps the 𝒞0\mathcal{C}^{0}-pasting condition at xh+x^{+}_{h} yields

Δw+(Xs)=(w+(Xs)w+(Xs))𝟏(ΔXs0)=(w+(xh)w+(xh+))𝟏(ΔXs0)=0.\Delta w^{+}(X_{s})=(w^{+}(X_{s})-w^{+}(X_{s-}))\mathbf{1}_{(\Delta X_{s}\neq 0)}=(w^{+}(x_{h}^{\ast})-w^{+}(x_{h}^{+}))\mathbf{1}_{(\Delta X_{s}\neq 0)}=0.

Regarding the second summand, we use the variational inequality (27) so that we can write

𝔼[eβζn,T2i+1w+(Xζn,T2i+1)]𝔼[eβζn,T2iw+(Xζn,T2i)]𝔼[ζn,T2iζn,T2i+1eβsπc(Xs)𝑑s].\displaystyle\mathbb{E}\left[e^{-\beta\zeta^{2i+1}_{n,T}}w^{+}(X_{\zeta^{2i+1}_{n,T}})\right]\leq\mathbb{E}\left[e^{-\beta\zeta^{2i}_{n,T}}w^{+}(X_{\zeta^{2i}_{n,T}})\right]-\mathbb{E}\left[\int_{\zeta^{2i}_{n,T}}^{\zeta^{2i+1}_{n,T}}e^{-\beta s}\pi_{c}(X_{s})ds\right].

Now, letting nn\to\infty we obtain by dominated convergence that

𝔼[eβσ2i+1Tw+(Xσ2i+1T)]𝔼[eβσ2iTw+(Xσ2iT)]𝔼[σ2iσ2i+1eβsπc(XsT)𝑑s],i0.\mathbb{E}\left[e^{-\beta\sigma_{2i+1}\wedge T}w^{+}(X_{\sigma_{2i+1}\wedge T})\right]\leq\mathbb{E}\left[e^{-\beta\sigma_{2i}\wedge T}w^{+}(X_{\sigma_{2i}\wedge T})\right]-\mathbb{E}\left[\int_{\sigma_{2i}}^{\sigma_{2i+1}}e^{-\beta s}\pi_{c}(X_{s\wedge T})ds\right],\quad i\geq 0.

Analogously, we can get the same inequality between the switching times σ2i1\sigma_{2i-1} and σ2i\sigma_{2i} for i1i\geq 1 with ww^{-} replacing w+w^{+}, so summing them all up we have

𝔼[0(supiσi)Tπc(Xs)𝑑s]\displaystyle-\mathbb{E}\left[\int_{0}^{(\sup_{i}\sigma_{i})\wedge T}\pi_{c}(X_{s})ds\right] i0𝔼[eβσ2i+1Tw+(Xσ2i+1T)eβσ2iTw+(Xσ2iT)]\displaystyle\geq\sum_{i\geq 0}\mathbb{E}\left[e^{-\beta\sigma_{2i+1}\wedge T}w^{+}(X_{\sigma_{2i+1}\wedge T})-e^{-\beta\sigma_{2i}\wedge T}w^{+}(X_{\sigma_{2i}\wedge T})\right]
+i1𝔼[eβσ2iTw(Xσ2iT)eβσ2i1Tw(Xσ2i1T)].\displaystyle\quad+\sum_{i\geq 1}\mathbb{E}\left[e^{-\beta\sigma_{2i}\wedge T}w^{-}(X_{\sigma_{2i}\wedge T})-e^{-\beta\sigma_{2i-1}\wedge T}w^{-}(X_{\sigma_{2i-1}\wedge T})\right].

Note that by admissibility i1eβσiL2()\sum_{i\geq 1}e^{-\beta\sigma_{i}}\in L^{2}(\mathbb{P}), which implies supi1σi=+\sup_{i\geq 1}\sigma_{i}=+\infty almost surely. Then, using the 𝒞0\mathcal{C}^{0}-pasting conditions in (30) and letting TT\to\infty, we finally obtain

𝔼[0+πc(Xs)𝑑s]+i1𝔼[eβσih0]w+(x),\mathbb{E}\left[\int_{0}^{+\infty}\pi_{c}(X_{s})ds\right]+\sum_{i\geq 1}\mathbb{E}\left[e^{-\beta\sigma_{i}}h_{0}\right]\leq w^{+}(x),

for any admissible consumer strategy (σi)(\sigma_{i}). Applying the same arguments to the sequence (σ^i)(\hat{\sigma}_{i}) we would get equalities instead of inequalities everywhere. The proof for the case μ0=μ\mu_{0-}=\mu_{-} is analogous and therefore is omitted.

– Step 2: admissibility. To conclude we show that the switching times σ^i\hat{\sigma}_{i} are admissible, i.e. they belong to the set 𝒜c\mathcal{A}_{c}. To do so, notice first that (σ^i)(\hat{\sigma}_{i}) is a sequence of [0,)[0,\infty)-valued stopping times. Hence, it remains to show that a.s. σ^i<σ^i+1\hat{\sigma}_{i}<\hat{\sigma}_{i+1} for all i0i\geq 0, and i1eβσ^iL2()\sum_{i\geq 1}e^{-\beta\hat{\sigma}_{i}}\in L^{2}(\mathbb{P}). The former follows from y<yhy_{\ell}<y_{h}. For the latter, we can proceed as in the proof of [1, Prop. 4.7], whose main idea is to write each σi\sigma_{i} as a sum of independent exit times for some (scaled) Brownian motion with possibly different drifts and initial conditions. First, let us denote (τk)k1(\tau^{\prime}_{k})_{k\geq 1} the increasing sequence of stopping times exhausting the intervention times of both players. Therefore we have

𝔼[(i1eβσi)2]\displaystyle\mathbb{E}\left[\left(\sum_{i\geq 1}e^{-\beta\sigma_{i}}\right)^{2}\right] 𝔼[(k1eβτk)2]limm2𝔼[1krmeβ(τr+τk)]\displaystyle\leq\mathbb{E}\left[\left(\sum_{k\geq 1}e^{-\beta\tau^{\prime}_{k}}\right)^{2}\right]\leq\lim_{m\to\infty}2\mathbb{E}\left[\sum_{1\leq k\leq r\leq m}e^{-\beta(\tau^{\prime}_{r}+\tau^{\prime}_{k})}\right]
limm2𝔼[1kme2βτk]=2𝔼[k1e2βτk],\displaystyle\leq\lim_{m\to\infty}2\mathbb{E}\left[\sum_{1\leq k\leq m}e^{-2\beta\tau^{\prime}_{k}}\right]=2\mathbb{E}\left[\sum_{k\geq 1}e^{-2\beta\tau^{\prime}_{k}}\right],

hence it suffices to prove that k1e2βτkL1()\sum_{k\geq 1}e^{-2\beta\tau^{\prime}_{k}}\in L^{1}(\mathbb{P}). Now, notice that τk\tau^{\prime}_{k}, k1k\geq 1, can be represented as r=1kζr\sum_{r=1}^{k}\zeta_{r}, where ζr\zeta_{r} is a sequence of independent random variables distributed as the exit time, say ζz,μ\zeta^{z,\mu}, of one of the processes z+μt+σWtz+\mu t+\sigma W_{t} with

(z,μ)𝒵±:={(yh,μ+),(y,μ),(xh+,μ+),(x,μ)},(z,\mu)\in\mathcal{Z}^{\pm}:=\left\{(y_{h},\mu_{+}),(y_{\ell},\mu_{-}),(x_{h}^{+\ast},\mu_{+}),(x_{\ell}^{-\ast},\mu_{-})\right\},

from the respective intervals

(,yh),(y,+),(,xh+),(x,+).(-\infty,y_{h}),\quad(y_{\ell},+\infty),\quad(-\infty,x_{h}^{+}),\quad(x_{\ell}^{-},+\infty).

Due to the independence of the sequence ζr\zeta_{r} we have

𝔼[k1e2βτk]=k1r=1k𝔼[e2βζr]k1(𝔼[e2βmin(z,μ)𝒵±ζz,μ])k,\displaystyle\mathbb{E}\left[\sum_{k\geq 1}e^{-2\beta\tau^{\prime}_{k}}\right]=\sum_{k\geq 1}\prod_{r=1}^{k}\mathbb{E}\left[e^{-2\beta\zeta_{r}}\right]\leq\sum_{k\geq 1}\left(\mathbb{E}\left[e^{-2\beta\min_{(z,\mu)\in\mathcal{Z}^{\pm}}\zeta^{z,\mu}}\right]\right)^{k},

which is a convergent geometric series, due to β>0\beta>0 and the fact that ζz,μ>0\zeta^{z,\mu}>0 almost surely for all (z,μ)𝒵±(z,\mu)\in\mathcal{Z}^{\pm}. This shows that sequence of switching times σ^i\hat{\sigma}_{i} is an admissible consumer’s strategy and concludes the proof. ∎

7.3 Proofs of Propositions 3 and 4

Proof of Proposition 3.

Let v:{μ,μ+}×v:\{\mu_{-},\mu_{+}\}\times\mathbb{R}\to\mathbb{R} be the function defined as v(μ±,x)=v±(x)v(\mu_{\pm},x)=v^{\pm}(x), with v±v^{\pm} as in (47). By construction, the functions (v+,v)(v^{+},v^{-}) solve the system of VIs in (45) and moreover v±𝒞2((x+,xh){y,yh})𝒞0()v^{\pm}\in\mathcal{C}^{2}((x_{\ell}^{+},x_{h}^{-})\setminus\{y_{\ell},y_{h}\})\cap\mathcal{C}^{0}(\mathbb{R}), hence not necessarily 𝒞1\mathcal{C}^{1} at the points y,yhy_{\ell},y_{h}. Recall that μt=μ+i=0𝟏{σ2it<σ2i+1}+μi=1𝟏{σ2i1t<σ2i}\mu_{t}=\mu_{+}\sum_{i=0}^{\infty}\mathbf{1}_{\{\sigma_{2i}\leq t<\sigma_{2i+1}\}}+\mu_{-}\sum_{i=1}^{\infty}\mathbf{1}_{\{\sigma_{2i-1}\leq t<\sigma_{2i}\}}, t0t\geq 0, where without loss of generality we can assume σi\sigma_{i} is the ii-th switching instance taken by the consumer in the case μ0=μ+\mu_{0-}=\mu_{+} (remember the convention σ0=0\sigma_{0}=0). The other case μ0=μ\mu_{0-}=\mu_{-} can be treated in a similar way, it is therefore omitted. We split the rest of the proof in two steps.

– Step 1: optimality. The following verification argument proves that such functions coincide with the best–response payoffs of the producer and that the impulse strategy as in the statement is optimal provided it is admissible. First, by an approximation procedure as in the first part of the proof in [1, Theorem 3.3], we can assume without loss of generality that v±𝒞2((x+,xh))𝒞0()v^{\pm}\in\mathcal{C}^{2}((x_{\ell}^{+},x_{h}^{-}))\cap\mathcal{C}^{0}(\mathbb{R}). Consider any producer admissible strategy (τi,ξi)i1(\tau_{i},\xi_{i})_{i\geq 1} as in the first part of Definition 1. Applying Itô’s formula to eβtv(μt,Xt)e^{-\beta t}v(\mu_{t},X_{t}) over the interval [σ2iT,σ2i+1T)[\sigma_{2i}\wedge T,\sigma_{2i+1}\wedge T), for some finite T>0T>0, we obtain

eβ(σ2i+1T)v(μσ2i+1T,Xσ2i+1T)\displaystyle e^{-\beta(\sigma_{2i+1}\wedge T)}v(\mu_{\sigma_{2i+1}\wedge T},X_{\sigma_{2i+1}\wedge T}) =eβ(σ2i+1T)v+(Xσ2i+1T)\displaystyle=e^{-\beta(\sigma_{2i+1}\wedge T)}v^{+}(X_{\sigma_{2i+1}\wedge T})
=eβ(σ2iT)v+(Xσ2iT)\displaystyle=e^{-\beta(\sigma_{2i}\wedge T)}v^{+}(X_{\sigma_{2i}\wedge T})
+σ2iTσ2i+1Teβs{vx+(Xs)dXs+σ22vxx+(Xs)dsβv+(Xs)ds}\displaystyle\quad+\int_{\sigma_{2i}\wedge T}^{\sigma_{2i+1}\wedge T}e^{-\beta s}\left\{v^{+}_{x}(X_{s})dX_{s}+\frac{\sigma^{2}}{2}v^{+}_{xx}(X_{s})ds-\beta v^{+}(X_{s})ds\right\}
+σ2iT<sσ2i+1Teβs{Δv+(Xs)+vx+(Xs)ΔNs},\displaystyle\quad+\sum_{\sigma_{2i}\wedge T<s\leq\sigma_{2i+1}\wedge T}e^{-\beta s}\left\{\Delta v^{+}(X_{s})+v_{x}^{+}(X_{s})\Delta N_{s}\right\},

where the first equality comes from the fact that over [σ2iT,σ2i+1T)[\sigma_{2i}\wedge T,\sigma_{2i+1}\wedge T), the drift equals μ+\mu_{+} (remember that μ0=μ+\mu_{0-}=\mu_{+}). Using the dynamics dXs=μ+ds+σdWsdNsdX_{s}=\mu_{+}ds+\sigma dW_{s}-dN_{s}, with Nt:=i1ξi𝟏{τit}N_{t}:=\sum_{i\geq 1}\xi_{i}\mathbf{1}_{\{\tau_{i}\leq t\}}, between the two switching times above, localizing the martingale part through a suitable sequence of stopping times ζn\zeta_{n} and taking expectation on both sides, we obtain

𝔼\displaystyle\mathbb{E} [eβζn,T2i+1v+(Xζn,T2i+1)]=𝔼[eβζn,T2iv+(Xζn,T2i)]\displaystyle\left[e^{-\beta\zeta^{2i+1}_{n,T}}v^{+}(X_{\zeta^{2i+1}_{n,T}})\right]=\mathbb{E}\left[e^{-\beta\zeta^{2i}_{n,T}}v^{+}(X_{\zeta^{2i}_{n,T}})\right]
+𝔼[ζn,T2iζn,T2i+1eβs{vx+(Xs)μ++σ22vxx+(Xs)βv+(Xs)}𝑑s]+𝔼[ζn,T2is<ζn,T2i+1eβsΔv+(Xs)],\displaystyle+\mathbb{E}\left[\int_{\zeta^{2i}_{n,T}}^{\zeta^{2i+1}_{n,T}}e^{-\beta s}\left\{v_{x}^{+}(X_{s})\mu_{+}+\frac{\sigma^{2}}{2}v_{xx}^{+}(X_{s})-\beta v^{+}(X_{s})\right\}ds\right]+\mathbb{E}\left[\sum_{\zeta^{2i}_{n,T}\leq s<\zeta^{2i+1}_{n,T}}e^{-\beta s}\Delta v^{+}(X_{s})\right],

where we set ζn,Tk:=σkζnT\zeta_{n,T}^{k}:=\sigma_{k}\wedge\zeta_{n}\wedge T. For the third summand above, notice that between σ2i\sigma_{2i} and σ2i+1\sigma_{2i+1}, due to the non-local term in the variational inequality (45), the state process XX can jump only due to the impulses of the producer and at any of such jumps we have

Δv+(Xτi)\displaystyle\Delta v^{+}(X_{\tau_{i}}) Kp(ξi),i0,\displaystyle\leq-K_{p}(\xi_{i}),\quad i\geq 0,

implying

𝔼[ζn,T2is<ζn,T2i+1eβsΔv+(Xs)]𝔼[j:ζn,T2iτj<ζn,T2i+1eβsKp(ξj)].\mathbb{E}\left[\sum_{\zeta^{2i}_{n,T}\leq s<\zeta^{2i+1}_{n,T}}e^{-\beta s}\Delta v^{+}(X_{s})\right]\leq\mathbb{E}\left[\sum_{j:\zeta^{2i}_{n,T}\leq\tau_{j}<\zeta^{2i+1}_{n,T}}e^{-\beta s}K_{p}(\xi_{j})\right].

Regarding the second summand, we use the variational inequality (45) so that we can write

𝔼[eβζn,T2i+1v+(Xζn,T2i+1)]\displaystyle\mathbb{E}\left[e^{-\beta\zeta^{2i+1}_{n,T}}v^{+}(X_{\zeta^{2i+1}_{n,T}})\right]\leq 𝔼[eβζn,T2iv+(Xζn,T2i)]𝔼[ζn,T2iζn,T2i+1eβsπp(Xs)𝑑s]\displaystyle\,\mathbb{E}\left[e^{-\beta\zeta^{2i}_{n,T}}v^{+}(X_{\zeta^{2i}_{n,T}})\right]-\mathbb{E}\left[\int_{\zeta^{2i}_{n,T}}^{\zeta^{2i+1}_{n,T}}e^{-\beta s}\pi_{p}(X_{s})ds\right]
+𝔼[j:ζn,T2iτj<ζn,T2i+1eβsKp(ξj)].\displaystyle+\mathbb{E}\left[\sum_{j:\zeta^{2i}_{n,T}\leq\tau_{j}<\zeta^{2i+1}_{n,T}}e^{-\beta s}K_{p}(\xi_{j})\right].

Now, due to Xt[x+,xh]X_{t}\in[x_{\ell}^{+},x_{h}^{-}] for all t0t\geq 0, letting nn\to\infty we obtain by dominated convergence that

𝔼[eβσ2i+1Tv+(Xσ2i+1T)]\displaystyle\mathbb{E}\left[e^{-\beta\sigma_{2i+1}\wedge T}v^{+}(X_{\sigma_{2i+1}\wedge T})\right]\leq\, 𝔼[eβσ2iTv+(Xσ2iT)]𝔼[σ2iσ2i+1eβsπp(XsT)𝑑s]\displaystyle\mathbb{E}\left[e^{-\beta\sigma_{2i}\wedge T}v^{+}(X_{\sigma_{2i}\wedge T})\right]-\mathbb{E}\left[\int_{\sigma_{2i}}^{\sigma_{2i+1}}e^{-\beta s}\pi_{p}(X_{s\wedge T})ds\right]
+𝔼[j:σ2iτj<σ2i+1eβsKp(ξj)],i0.\displaystyle+\mathbb{E}\left[\sum_{j:\sigma_{2i}\leq\tau_{j}<\sigma_{2i+1}}e^{-\beta s}K_{p}(\xi_{j})\right],\quad i\geq 0.

Analogously, we can get the same inequality between the switching times σ2i1\sigma_{2i-1} and σ2i\sigma_{2i} for i1i\geq 1 with vv^{-} replacing v+v^{+}, so summing them all up we have

𝔼[0(supiσi)Tπp(Xs)𝑑s]\displaystyle-\mathbb{E}\left[\int_{0}^{(\sup_{i}\sigma_{i})\wedge T}\pi_{p}(X_{s})ds\right] i0𝔼[eβσ2i+1Tv+(Xσ2i+1T)eβσ2iTv+(Xσ2iT)]\displaystyle\geq\sum_{i\geq 0}\mathbb{E}\left[e^{-\beta\sigma_{2i+1}\wedge T}v^{+}(X_{\sigma_{2i+1}\wedge T})-e^{-\beta\sigma_{2i}\wedge T}v^{+}(X_{\sigma_{2i}\wedge T})\right]
+i1𝔼[eβσ2iTv(Xσ2iT)eβσ2i1Tv(Xσ2i1T)].\displaystyle\quad+\sum_{i\geq 1}\mathbb{E}\left[e^{-\beta\sigma_{2i}\wedge T}v^{-}(X_{\sigma_{2i}\wedge T})-e^{-\beta\sigma_{2i-1}\wedge T}v^{-}(X_{\sigma_{2i-1}\wedge T})\right]. (70)

Note that by admissibility i1eβσiL2()\sum_{i\geq 1}e^{-\beta\sigma_{i}}\in L^{2}(\mathbb{P}), which implies supi0σi=+\sup_{i\geq 0}\sigma_{i}=+\infty almost surely. Then, using the 𝒞0\mathcal{C}^{0}-pasting conditions in (48) and letting TT\to\infty, we finally obtain

𝔼[0+πp(Xs)𝑑s]+i1𝔼[eβτiKp(ξi)]v+(x),\mathbb{E}\left[\int_{0}^{+\infty}\pi_{p}(X_{s})ds\right]+\sum_{i\geq 1}\mathbb{E}\left[e^{-\beta\tau_{i}}K_{p}(\xi_{i})\right]\leq v^{+}(x),

for any admissible producer’s strategy (τi,ξi)i1(\tau_{i},\xi_{i})_{i\geq 1}. Applying the same arguments to the impulse strategy (τi,ξi)i1(\tau^{*}_{i},\xi^{*}_{i})_{i\geq 1} as in the statement we would get equalities instead of inequalities everywhere. Notice that the second order conditions (52) guarantee the optimality of the impulses ξi\xi_{i}^{*}.

– Step 2: admissibility. To conclude the proof, we need to show that the impulse strategy (τi,ξi)i1(\tau_{i}^{*},\xi_{i}^{*})_{i\geq 1} is admissible as in the first part of Definition 1. Property 1 is granted by the dynamics of the state variable XX and the fact that producer’s thresholds satisfy x+<x+,xh<xhx_{\ell}^{+}<x_{\ell}^{+*},x_{h}^{-*}<x_{h}^{-}.

Property 2 is obviously satisfied by definition of the optimal impulses ξi\xi_{i}^{*} as in the statement. Hence, we are left with showing property 3, i.e. i1eβτiξiL2()\sum_{i\geq 1}e^{-\beta\tau^{*}_{i}}\xi^{*}_{i}\in L^{2}(\mathbb{P}). We can proceed once more as in the proof of [1, Prop. 4.7] and in the second part of Proposition 2’s proof. We provide all details for reader’s convenience. First, let us denote (τk)k1(\tau^{\prime}_{k})_{k\geq 1} the increasing sequence of stopping times exhausting the intervention times of both players. Since the optimal impulses (ξi)i1(\xi_{i}^{*})_{i\geq 1} are uniformly bounded by some positive constant, say κ\kappa, we have

𝔼[(i1ξieβτi)2]\displaystyle\mathbb{E}\left[\left(\sum_{i\geq 1}\xi^{*}_{i}e^{-\beta\tau_{i}}\right)^{2}\right] κ2𝔼[(k1eβτk)2]limm2𝔼[1krmeβ(τr+τk)]\displaystyle\leq\kappa^{2}\mathbb{E}\left[\left(\sum_{k\geq 1}e^{-\beta\tau^{\prime}_{k}}\right)^{2}\right]\leq\lim_{m\to\infty}2\mathbb{E}\left[\sum_{1\leq k\leq r\leq m}e^{-\beta(\tau^{\prime}_{r}+\tau^{\prime}_{k})}\right]
limm2κ2𝔼[1kme2βτk]=2κ2𝔼[k1e2βτk],\displaystyle\leq\lim_{m\to\infty}2\kappa^{2}\mathbb{E}\left[\sum_{1\leq k\leq m}e^{-2\beta\tau^{\prime}_{k}}\right]=2\kappa^{2}\mathbb{E}\left[\sum_{k\geq 1}e^{-2\beta\tau^{\prime}_{k}}\right],

hence it suffices to prove that k1e2βτkL1()\sum_{k\geq 1}e^{-2\beta\tau^{\prime}_{k}}\in L^{1}(\mathbb{P}). Now, notice that τk\tau^{\prime}_{k}, k1k\geq 1, can be represented as r=1kζr\sum_{r=1}^{k}\zeta_{r}, where ζr\zeta_{r} is a sequence of independent random variables distributed as the exit time, say ζz,μ\zeta^{z,\mu}, of one of the processes z+μt+σWtz+\mu t+\sigma W_{t} with

(z,μ)𝒵±:={(yh,μ+),(y,μ),(xh,μ),(x+,μ+)},(z,\mu)\in\mathcal{Z}^{\pm}:=\{(y_{h},\mu_{+}),(y_{\ell},\mu_{-}),(x_{h}^{-\ast},\mu_{-}),(x_{\ell}^{+\ast},\mu_{+})\},

from the respective intervals

(,yh),(y,+),(,xh),(x+,+).(-\infty,y_{h}),\quad(y_{\ell},+\infty),\quad(-\infty,x_{h}^{-}),\quad(x_{\ell}^{+},+\infty).

Due to the independence of the sequence ζr\zeta_{r} we have

𝔼[k1e2βτk]=k1r=1k𝔼[e2βζr]k1(𝔼[e2βmin(z,μ)𝒵±ζz,μ])k,\displaystyle\mathbb{E}\left[\sum_{k\geq 1}e^{-2\beta\tau^{\prime}_{k}}\right]=\sum_{k\geq 1}\prod_{r=1}^{k}\mathbb{E}\left[e^{-2\beta\zeta_{r}}\right]\leq\sum_{k\geq 1}\left(\mathbb{E}\left[e^{-2\beta\min_{(z,\mu)\in\mathcal{Z}^{\pm}}\zeta^{z,\mu}}\right]\right)^{k},

which is a convergent geometric series, due to β>0\beta>0 and the fact that ζz,μ>0\zeta^{z,\mu}>0 almost surely for all (z,μ)𝒵±(z,\mu)\in\mathcal{Z}^{\pm}. This shows that (τi,ξi)i1(\tau^{*}_{i},\xi_{i}^{*})_{i\geq 1} is an admissible producer’s impulse strategy and concludes the proof. ∎

Proof of Proposition 4.

Here, notice that xh+=yhx_{h}^{+}=y_{h} so that, given producer’s priority in case of simultaneous interventions (cf. Remark 2), the drift is always equal to μ+\mu_{+} (recall that we are in the case μ0=μ+\mu_{0-}=\mu_{+}). Hence, this proof can be performed as the one of Proposition 3, by ignoring the intervals where the drift is μ\mu_{-} so that the second half in the RHS of inequality (70) is zero. The admissibility is proved in the same way. The details are therefore omitted. ∎

7.4 Equilibrium Dynamics Computation

Let (a,b)(a,b) be an arbitrary interval and x(a,b)x\in(a,b) be an interior starting location. We define δ+(x;a,b)\delta_{+}(x;a,b) to be the first passage time associated to the interval (a,b)(a,b) of a Brownian Motion with drift μ+\mu_{+} starting from xx and P+(x;a,b)P_{+}(x;a,b) to be the probability that this BM hits aa before bb (similarly for δ(x;a,b)\delta_{-}(x;a,b) and P(x;a,b)P_{-}(x;a,b) associated to drift μ\mu_{-}). These quantities admit explicit expressions, see [6].

The expected time τ:=inf{t:μt=μ}\tau_{-}:=\inf\{t:\mu_{t}=\mu_{-}\} for μt\mu_{t} to switch from μ+\mu_{+} to μ\mu_{-} within a double–switch and one–sided impulse equilibrium is then

𝔼[τ]=𝔼[δ+(x0;x+,yh)]+P+(x0;x+,yh)𝐏Ih+,S𝔼+[δ(x+;x+,yh)],\displaystyle\mathbb{E}\big{[}\tau_{-}\big{]}=\mathbb{E}\big{[}\delta_{+}(x_{0};x^{+}_{\ell},y_{h})\big{]}+\frac{P_{+}(x_{0};x^{+}_{\ell},y_{h})}{\mathbf{P}_{I^{+}_{h},S_{-}}}\mathbb{E}_{+}\big{[}\delta(x^{+\ast}_{\ell};x^{+}_{\ell},y_{h})\big{]}, (71)

where 𝐏\mathbf{P} is the transition matrix of MnM^{\ast}_{n}. Above, the first term denotes the time to either reach x+x_{\ell}^{+} (producer impulses up) or yhy_{h} (switch to contraction); the second term counts the additional time if x+x_{\ell}^{+} is reached first multiplied by the respective probability P+(x0;x+,yh)P_{+}(x_{0};x^{+}_{\ell},y_{h}). Let ζ\vec{\zeta} be the resulting vector of expected sojourn times. Then the long-run proportion of time that XX^{\ast} carries a positive drift (μ+\mu_{+}) is

ρ+=ΠS+ζS++ΠI+ζI++ΠIh+ζIh+Πζ,\displaystyle\rho_{+}=\frac{\Pi_{S_{+}}\zeta_{S_{+}}+\Pi_{I^{+}_{\ell}}\zeta_{I^{+}_{\ell}}+\Pi_{I^{+}_{h}}\zeta_{I^{+}_{h}}}{\vec{\Pi}\cdot\vec{\zeta}^{\dagger}}, (72)

and similarly the long-run proportion associated to a negative drift (μ\mu_{-}) is ρ=1ρ+\rho_{-}=1-\rho_{+}.

References

  • [1] Aïd, R., Basei, M., Callegaro, G., Campi, L., and Vargiolu, T. (2020). Nonzero-sum stochastic differential games with impulse controls: a verification theorem with applications. Mathematics of Operations Research, 45(1), 205–232.
  • [2] Aïd, R., Chemla, G., Porchet, A., Touzi, N. (2011). Hedging and vertical integration in electricity markets. Management Science, 57(8), 1438–1452.
  • [3] Aïd, R., Li, L., and Ludkovski, M. (2017). Capacity expansion games with application to competition in power generation investments. Journal of Economic Dynamics and Control, 84, 1–31.
  • [4] Attard, N. (2018). Nonzero-sum games of optimal stopping for Markov processes. Applied Mathematics & Optimization, 77, 567–597.
  • [5] Barsky, R. B., Kilian, L. (2004). Oil and the macroeconomy since the 1970s. Journal of Economic Perspectives,18(4), 115–134.
  • [6] Borodin, A. N., Salminen, P. (2012). Handbook of Brownian motion-facts and formulae. Birkhäuser.
  • [7] Cadenillas, A., Lakner, P., and Pinedo, M. (2010). Optimal control of a mean-reverting inventory. Operations Research, 58, 1697–1710.
  • [8] Casassus, J., Collin-Dufresne, P., Routledge, B. R., (2018). Equilibrium commodity prices with irreversible investment and non-linear technologies. Journal of Banking and Finance, 95, 128–147.
  • [9] Deaton, A. , Laroque, G., (1992). On the behaviour of commodity prices. Rev. Econ. Stud. 59 (1), 1–23.
  • [10] De Fontenay, C., Gans, J. (2005). Vertical integration in the presence of upstream competition. The RAND Journal of Economics, 36(3), 544–572.
  • [11] Dixit, A., Pindyck, R. (1994). Investment Under Uncertainty. Princeton University Press.
  • [12] Erten, B., Ocampo, J. A. (2012). Super cycles of commodity prices since the mid-nineteenth century. World Development, 44, 14-3-0.
  • [13] De Angelis, T., Ferrari, G., Moriarty, J. (2018). Nash equilibria of threshold type for two-player nonzero-sum games of stopping, The Annals of Applied Probability, 28, 12–147.
  • [14] Giraud, P.-N.(1995). The equilibrium price range of oil: economics, politics and uncertainty in the formation of oil prices. Energy Policy, 23(1), 35–49.
  • [15] Hastings, J., Gilbert, R. (2005). Market power, vertical integration and the wholesale price of gasoline. The Journal of Industrial Economics, 53(4), 469-492.
  • [16] Helfat, C., Teece, D. (1987). Vertical integration and risk reduction. Journal of Law, Economics, & Organization, 3(1), 47–67.
  • [17] Hössinger, R., Link, C., Sonntag, A., & Stark, J. (2017). Estimating the price elasticity of fuel demand with stated preferences derived from a situational approach. Transportation Research Part A: Policy and Practice, 103, 154–171.
  • [18] Hotelling, H. (1931). The economics of exhaustible resources. Journal of Political Economy 39 (2), 137–175.
  • [19] Jacks, D. S. (2006). From boom to bust: a typology of real commodity prices in the long run. NBER Working Paper, n. 18714.
  • [20] Lafontaine, F., and Slade, M. (2007). Vertical Integration, Journal of Economic Literature, 45, 629–685.
  • [21] Levin, R. C (1981). Vertical integration and profitability in the oil industry. Journal of Economic Behavior & Organization, 2(3), 215–235.
  • [22] Levin, L., Lewis, M. S., & Wolak, F. A. (2017). High frequency evidence on the demand for gasoline. American Economic Journal: Economic Policy, 9(3), 314–47.
  • [23] Leon, J., Soto, R. (1997). Structural breaks and long–run trends in commodity prices. Journal of International Development, 9(3), 347–366.
  • [24] Martyr, R., Moriarty, J. (2017). Nonzero-sum games of optimal stopping and generalised Nash equilibrium. arXiv:1709.01905.
  • [25] McDonald, R., Siegel, D.(1986). The value of waiting to invest. The Quarterly Journal of Economics, 101(4), 707–727.
  • [26] Mansur, E.T. (2007). Upstream competition and vertical integration in electricity markets. The Journal of Law & Economics, 50(1), 125–156.
  • [27] Mitchell, E. J. (1976). Vertical integration in the oil industry. National Energy Project.
  • [28] Routledge, B.R. , Seppi, D.J., Spatt, C.S. (2000). Equilibrium forward curves for commodities. J. Finance 55 (3), 1297–1338.
  • [29] Scheinkman, J.A. , Schechtman, J. (1983). A simple competitive model with production and storage. Review of Economic Studies 50, 427–441.
  • [30] Stuemer, M. (2018). 150 years of boom and bust —What drives mineral commodity prices?. Macroeconomic Dynamics, 22(3), 702–717.