This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

thanks: Corresponding author: Weidong Tian, Belk College of Business, University of North Carolina at Charlotte. Email: [email protected]. Zimu Zhu, Department of Mathematics, University of Southern California. Email:[email protected]. We thank Jianfeng Zhang and Tao Pang for stimulating discussions on this paper. We also thank Dr.Xiaojing Xing for her assistance in numerical implementation. The authors would like to thank the editor and anonymous referee for their constructive comments and suggestions.

A Portfolio Choice Problem Under Risk Capacity Constraint

Weidong Tian
University of North Carolina at Charlotte
   Zimu Zhu
University of Southern California
A Portfolio Choice Problem Under Risk Capacity Constraint \\ Abstract

This paper studies the asset allocation problem for a retiree facing longevity risk and living standard risk. We introduce a risk capacity constraint to reduce the living standard risk in the retirement period. Whether the retiree focuses on intertemporal consumption or inheritance wealth, we demonstrate a unique number to measure the expected lump sum of the spending post-retirement. The optimal portfolio is nearly neutral to the stock market movement if the portfolio’s value is higher than this finite critical value; otherwise, the retiree actively invests in the stock market. As a comparison, we consider a dynamic leverage constraint and show that the corresponding optimal portfolio would lose significantly in stressed markets.

Keywords: Risk Capacity, Retirement Portfolio, Longevity Risk, Leverage Constraint

JEL Classification Codes: G11, G12, G13, D52, and D90


1 Introduction

Investment after retirement is significantly different from investment for (before) retirement in several respects. Retirees invest in an unknown while finite length of time because of longevity (mortality) risk. They also worry about the balance between spending and leaving wealth as an inheritance. More importantly, because of no labor income, these individuals will face living standard risk if a market downturn occurs, leading to a substantial decline in their living standards.111As Kenneth French presented at the Annual Conference for Dimensional Funds Advisors, 2016, “It is living standard risk you should know about the risk. It is what your exposure is to a major change in your standard of living during the entirely uncertain numbers of years you remain alive.”

Motivating by the asset allocation problem after retirement, this paper studies a portfolio choice problem with two distinguishing features. One is an uncertain investment time horizon; another is that the investment dollar amount in the risky asset is always bound from above by a fixed constant (capacity). We name this constraint a “risk capacity constraint”. We present the asset allocation problem after retirement as an optimal portfolio choice problem. Precisely, the retiree’s mortality risk is formulated by an uncertain investment time horizon. The length of each individual’s retirement may differ from the statistical life expectancy, and the mortality risk is virtually independent of the market risk in the financial market. Since the absence of labor income results in severe concerns about living standards, we use the risk capacity constraint to address the retiree’s concern about the living standard risk after retirement.222Since we focus on the risk capacity, we ignore other factors such as health care risk and real estate assets in retirement portfolios. See, for instance, Yogo (2016) about the discussion of other factors.

We first characterize the value function of the optimal portfolio choice problem. Then we characterize the region of the wealth in which the risk capacity constraint is binding. Specifically, assuming the value function is C2C^{2} smooth, and under some technical conditions about the value function, then there exists a threshold (a positive number WW^{*}) such that the retiree invests the capacity amount in the risky asset if and only if the portfolio wealth is greater than the threshold. Otherwise, the investment amount is strictly smaller than the capacity.

To derive the consumption-investment policy explicitly, we next investigate two particular yet critical situations in which these technical conditions of the value function can be verified. In the first situation, Problem (A), the investor (retiree) focuses on intertemporal consumption, whereas the investor concentrates on the inheritance wealth in the second situation, Problem (B). For a tractable purpose, we study the CRRA utility functions. In Problem (A) and Problem (B), if the value function is C2C^{2} smooth, we demonstrate the existence of one number WW^{*} such that the risk capacity constraint is binding or not depends on whether the portfolio wealth is greater or smaller than this number. Moreover, we derive the explicit consumption-investment policy in terms of this threshold.

For Problem (A), the C2C^{2} smooth property of the value function follows from the classical study in Zariphopoulou (1994). In general, when the utility function satisfies a global Lipschitz condition, and the invested dollar amount is bounded from below and above by two positive numbers, the value function is C2C^{2} smooth by a recent remarkable theorem of Strulovic and Szydlowski (2015). However, the CRRA utility does not satisfy the global Lipschitz condition, and the risk capacity constraint implies that the dollar amount can be any sufficiently small positive number. A technical component in this paper is to investigate the smooth property of the value function and thus the explicit expression of the value function in Problem (B). We reduce the C2C^{2} smooth property of the value function to solve a nonlinear equation of one variable. Briefly speaking, there exists a positive real number solution (and unique upon its existence) of the nonlinear equation if and only if the value function is C2C^{2} smooth. Moreover, this real number solution separates the unconstrained region and constrained region of the risk capacity constraint. We also show that this method to determine the threshold WW^{*} and the value function explicitly can be used in Problem (A).

Our solution to the optimal portfolio choice problem provides new theoretical insights for retirement investment. First, the optimal investment strategy displays a wealth-cycle property, in contrast to the life-cycle feature which is suggested in Modigliani (1986). That is, individual’s investment and consumption decision has a life-cycle feature. See Bodie, Merton, and Samuelson (1992), Cocco, Gomes, and Maenhout (2005), Gomes and Michaelides (2005), Benzoni, Dufresne, and Goldstein (2007), and Bodie, Detemple, and Rindisbacher (2009) for life-cycle theoretical and empirical studies.333The life-cycle hypothesis is also used for preparing the retirement portfolio and in the retirement portfolio. For instance, a conventional rule for an agent of age tt is to invest (100 - t)/100 percent of the wealth in the stock market. See Malkiel (1999). Even after retirement, a time-dependent (life-cycle) investment strategy is also popular among financial advisors. Specifically, when the portfolio value is unsustainable for the entire retirement period, the retiree should invest in the market because the dollar investment in the stock will increase the expected portfolio value. Nevertheless, the percentage of wealth in the market declines in the wealth. The declining percentage of the wealth invested in risky assets is due to the retiree’ living standard concern to protect the portfolio value. This decreasing feature of the percentage becomes significant when the portfolio’s worth is sufficiently high.

Second, since the dollar invested in the stock is always a constant LL when the portfolio wealth is higher than a threshold WW^{*}, this threshold WW^{*} measures the expected lump sum of the spending in the retirement period. Intuitively, when the portfolio is worth more than this threshold, the retiree aims to protect the portfolio by investing only a fixed amount of LL in the stock market without losing the living standard. This contingent constant-dollar strategy is thus a buffer-invest strategy: when the wealth is below the target, the retired invests; and if the wealth is above the target, the retire will dis-invest more on the stock market. In a classical constant-dollar strategy, the dollar invested in the risky asset is always fixed. In contrast, by a contingent constant-dollar strategy in this paper we mean a fixed dollar is invested in the risky asset only when the portfolio value is higher than a threshold. It is also different from the constant proportion portfolio insurance strategy in Black and Perold (1992) and El Karoui and Jeanblanc-Picque (1998), which is interpreting by a put option on the Merton-type portfolio and the underlying Merton-type portfolio. See Carroll (1997) for a buffer-stock saving theory under the permanent income hypothesis. Our result is consistent with the retirement portfolio’s decumulation process. In contrast, investing for (before) retirement is an accumulating asset process.

Third, the portfolio is nearly independent of the stock market when the retiree’s portfolio is worth sufficiently embracing the living standard. Moreover, the portfolio risk is also nearly zero if the retiree’s portfolio wealth is high enough. Therefore, the retiree’s living standard risk is reduced substantially under the risk capacity constraint.

Fourth, we demonstrate that the sub-optimality of the annuity, a popular insurance product for a retiree. Annuities are the perfect financial vehicle to counter only the mortality risk. However, due to unexpected costs or shock, non-annuitized wealth could be needed to cover the bill. From a portfolio choice perspective, we demonstrate that the annuity is not optimal since the optimal consumption should depend on wealth by incorporating a risk capacity constraint. The relationship between the consumption rate and wealth is highly non-linear but implementable.

Last but not least, the risk capacity constraint is significantly different from the leverage constraint. At first glance, it seems to be a particular case of leverage constraint or collateral constraint, Xtf(Wt)X_{t}\leq f(W_{t}), where XtX_{t} represents the dollar invested in the risky asset and WtW_{t} the wealth at time tt. Earlier studies on the leverage constraint include Grossman and Vila (1992), Zariphopoulou (1994), Vila and Zariphopoulou (1997). Studies on portfolio choice and asset pricing under other dynamic constraints on the control variable ctc_{t} or the state variable, WtW_{t}, include Black and Perold (1992), Dybvig (1995), El Karoui and Jeanblanc-Picque (1998), Elie and Touzi (2008), Dybvig and Liu (2010), Chen and Tian (2016), Ahn, Choi and Lim (2019), and reference therein. However, most studies on the leverage constraint do not study the situation that f(Wt)f(W_{t}) is independent on WtW_{t}, and there are subtle differences as shown in this paper. For a comparative purpose, we solve the relevant optimal portfolio choice problem in which a dynamic leverage constraint replaces the risk capacity constraint. We demonstrate that the optimal portfolio under a leverage constraint moves precisely in the stock movement direction, which is a severe concern of the living standard risk in a stressed market period. Therefore, the risk capacity constraint is substantially different from classical leverage constraints, and it can be used to reduce the retiree’s living standard risk.

The structure of the paper is organized as follows. Section 2 introduces the model and presents a general optimal investment problem with risk capacity constraint in an infinite time horizon. In Section 3, we show that the constrained region is (W,)(W^{*},\infty) under certain conditions. The explicit solution of the value function and the consumption-saving strategy are presented in Section 4 (Problem (A)) and Section 5 (Problem (B)), respectively. We present the applications to the retiree’s asset allocation problem in Section 6. The conclusion is given in Section 7, and technical proofs are given in Appendix A - Appendix B.

2 An Optimal Portfolio Choice Problem

This section introduces a risk capacity constraint and then presents an optimal portfolio choice problem under the risk capacity constraint. Finally, we characterize the value function of this optimal portfolio choice problem in terms of the HJB equation.

2.1 Investment Opportunities

There are two assets in a continuous-time economy. Let (Ω,(t),P)(\Omega,({\cal F}_{t}),P) be a filtered probability space in which the information flow is generated by a standard one-dimension Brownian motion (Zt)(Z_{t}). The risk-free asset (“the bond”) grows at a continuously compounded, constant rr. We treat the risk-free asset as a numeaire, so we assume that r=0r=0. {\cal F}_{\infty} is the σ\sigma-algebra generated by all t,t[0,){\cal F}_{t},\forall t\in[0,\infty).

The other asset (”the stock index”) is a risky asset, and its price process SS follows

dSt=μStdt+σStdZtdS_{t}=\mu S_{t}dt+\sigma S_{t}dZ_{t} (1)

where μ\mu and σ\sigma are the expected return and the volatility of the stock index.

2.2 Investor

The investor’s initial wealth is W0W_{0} at time 0. We simply name “he” for the investor. The investor is risk-averse and his utility function is denoted by a strictly increasing and concave function u():(0,)Ru(\cdot):(0,\infty)\rightarrow R and u()u(\cdot) satisfies the Inada’s condition: limWu(W)=0\lim_{W\uparrow\infty}u^{\prime}(W)=0, and limW0u(W)=0\lim_{W\downarrow 0}u^{\prime}(W)=0.

2.3 Risk capacity constraint

Let XtX_{t} be the dollar amount invested in the risky asset at time tt. Consider a pension portfolio and time t=0t=0 represents the beginning of the retirement, and W0W_{0} is the wealth at time t=0t=0. For a highly risk averse investor (retiree), we introduce the following constraint

0XtL,t0.0\leq X_{t}\leq L,\ \ \ \ \ t\geq 0. (2)

Here L=lW0L=lW_{0} for a positive number ll. It means that the dollar amount in the market is non-negative (no short-selling) and bounded from above by a percentage of the initial wealth. For example, let l=30%,W0=1,000,000l=30\%,W_{0}=1,000,000, then at most $300,000 is invested in the stock market during the entire time period. Since this constraint highlights the dollar amount, we call it a risk capacity constraint and LL a capacity. Ottaviani and Sorensen (2015) use this exogenous constraint to study price reaction to information with heterogeneous beliefs.

Among portfolio constraints in numerous optimal portfolio choice literature, the leverage constraint is mostly related to the risk capacity constraint. That is, Xtk(Wt+L)X_{t}\leq k(W_{t}+L), where a positive number kk denotes the leverage upper bound of the investment. See classical studies in Grossman and Vila (1992), Vila and Zariphopoulou (1997), Zariphopoulou (1994), and a more recent study in Ahn et al. (2019). Our main insight in this paper (shown below) is that the optimal portfolio under risk capacity constraint behaves significantly differently from the leverage constraint, yielding different implications to retirement portfolio management.

2.4 An optimal portfolio choice problem

In this subsection, we present a portfolio choice problem in which an investor’s preference is on the consumption path and wealth process. Specifically, the investor’s expected utility is given by

E[0eδt{αu(ct)+βu(Wt)}𝑑t].\ E\left[\int_{0}^{\infty}e^{-\delta t}\left\{\alpha u(c_{t})+\beta u(W_{t})\right\}dt\right]. (3)

with two nonnegative constants α\alpha and β\beta, and α+β>0\alpha+\beta>0.

The optimal portfolio choice problem is to find the optimal trading strategy (Xt)(X_{t}) and the consumption rule (ct)(c_{t}) in

V¯(W0,L)sup(ct,Xt)𝒜(W0,L)𝔼[α0eδtu(ct)𝑑t+β0eδtu(Wt)𝑑t],\bar{V}(W_{0},L)\equiv\sup_{(c_{t},X_{t})\in{\cal A}(W_{0},L)}\mathbb{E}\left[\alpha\int_{0}^{\infty}e^{-\delta t}u(c_{t})dt+\beta\int_{0}^{\infty}e^{-\delta t}u(W_{t})dt\right], (4)

where 𝒜(W0,L){\cal A}(W_{0},L) be the set of admissible (c,X)(c,X) such that (1) ctc_{t} is t{\cal F}_{t}-progressively measurable process, ct0a.s.,t0c_{t}\geq 0\ a.s.,\forall t\geq 0 and 0tcsds<a.s.,t0\int_{0}^{t}c_{s}ds<\infty\ a.s.,\forall t\geq 0; (2) XtX_{t} is t{\cal F}_{t}-progressively measurable process, and square-integral, 0tXs2𝑑s<a.s.t0;\int_{0}^{t}X_{s}^{2}ds<\infty\ a.s.\forall t\geq 0; (3) 0XtL,t00\leq X_{t}\leq L,\forall t\geq 0; and (4) the wealth budget constraint,

dWt=Xt(μdt+σdZt)ctdt,\displaystyle dW_{t}=X_{t}(\mu dt+\sigma dZ_{t})-c_{t}dt,

and Wt0a.s.,t0W_{t}\geq 0\ a.s.,\forall t\geq 0.

In stochastic control literature, there are extent studies on the following general stochastic control problem,

maxc,X𝔼[0Tf(t,ct,Wt,Xt)𝑑t],\displaystyle\max_{c,X}\mathbb{E}\left[\int_{0}^{T}f(t,c_{t},W_{t},X_{t})dt\right],

without constraint.444For instance, Bismut (1973) studies the above general stochastic control problem in a general diffusion process framework and shows duality theorems for a general concave function f()f(\cdot). However, the characterization of the value function often relies on technical assumptions on the model parameters and the control variables. See Fleming and Soner (2006). While we consider the additive specification of the expected utility in (3), a multiplicative specification such as f(t,ct,Wt,Xt)=ctaWtb1Rf(t,c_{t},W_{t},X_{t})=\frac{c_{t}^{a}W_{t}^{b}}{1-R} is considered in Bakshi and Chen (1996) and Smith (2001).

2.5 The characterization of the value function

We characterize the value function in (4) by the following proposition.

Proposition 1

The value function V¯(W)\bar{V}(W) is the unique viscosity solution in the class of concave functions of the following HJB equation:

δV¯(W)=max0XL[μXV¯(W)+12σ2X2V¯′′(W)]+maxc0{αu(c)cV¯(W)}+βu(W),(W>0)\delta\bar{V}(W)=\max_{0\leq X\leq L}\left[\mu X\bar{V}^{\prime}(W)+\frac{1}{2}\sigma^{2}X^{2}\bar{V}^{\prime\prime}(W)\right]\\ +\max_{c\geq 0}\{\alpha u(c)-c\bar{V}^{\prime}(W)\}+\beta u(W),(W>0) (5)

with V¯(0)=α+βδu(0)\bar{V}(0)=\frac{\alpha+\beta}{\delta}u(0).

Proof:  See Appendix A. \Box

The central point in Proposition 1 is that the value function is uniquely characterized in the framework of viscosity solution of the HJB equation, regardless of the smooth property (“ex-ante”) of the value function of a portfolio choice problem or not. If the utility function u()u(\cdot) satisfies a global Lipschitz condition, and the control variable XtX_{t} takes values in [l,L][l,L] for 0<l<L0<l<L, Assumptions 1 - 3 in Strulovici and Szydlowski (2015, Theorem 1) are satisfied; hence, the value function is C2C^{2} smooth. It remains open whether the value function is C2C^{2} smooth under the risk capacity constraint, or the utility function u()u(\cdot) does not satisfy a global Lipschitz condition.

From now on, we consider a widely class of utility function in economic and finance but does not satisfy the global Lipschitz condition. That is,

u(W)=W1R1R,R>0,R1,(W>0).\displaystyle u(W)=\frac{W^{1-R}}{1-R},R>0,R\neq 1,(\forall W>0).

Assumption A.

δ>ρ(1R)κR,\displaystyle\delta>\rho\equiv\frac{(1-R)\kappa}{R}, (6)

where κ=μ22σ2\kappa=\frac{\mu^{2}}{2\sigma^{2}}. Let

λ=1R[δ(1R)(μ22Rσ2)].\displaystyle\lambda^{\infty}=\frac{1}{R}\left[\delta-(1-R)\left(\frac{\mu^{2}}{2R\sigma^{2}}\right)\right].

To guarantee the existence of the optimal solution in standard Merton’s model, we impose Assumption A from the next section, λ>0\lambda^{\infty}>0.

3 Characterization of the constrained region

In this section, we assume the C2C^{2} smooth of the value function to characterize the constrained region under certain conditions.

Specifically, if the value function is C2C^{2} smooth, by Proposition 1, the optimal investment strategy is

X=min{μσ2V¯(W)V¯′′(W),L}.X^{*}=\min\left\{-\frac{\mu}{\sigma^{2}}\frac{\bar{V}^{\prime}(W)}{\bar{V}^{\prime\prime}(W)},L\right\}. (7)

Following standard convention in Zariphopoulou (1994), we divide the state space Ω=[0,)\Omega=[0,\infty) into two regions. On one hand, in the region

𝒰={W>0:μσ2V¯V¯′′L},\displaystyle{\cal U}=\left\{W>0:-\frac{\mu}{\sigma^{2}}\frac{\bar{V}^{\prime}}{\bar{V}^{\prime\prime}}\leq L\right\},

X<LX^{*}<L, we call 𝒰{\cal U} the unconstrained region following Vila and Zariphopoulou (1997).555In the region 𝒰{\cal U}, the constrain XLX^{*}\leq L in the HJB equation (10) becomes irrelevant. Hence, it is often called the region unconstrained. On the other hand, in the region

={W>0:μσ2V¯V¯′′>L},\displaystyle{\cal B}=\left\{W>0:-\frac{\mu}{\sigma^{2}}\frac{\bar{V}^{\prime}}{\bar{V}^{\prime\prime}}>L\right\},

the risk capacity constraint is binding and then X=LX^{*}=L. We call {\cal B} a constrained region.

3.1 Portfolio choice without risk constraint

As a benchmark, we first solve the optimal portfolio choice problem (4) without the risk capacity constraint; alternatively, L=L=\infty.

Proposition 2

In the absence of the risk capacity constraint, the value function in (4) is

V¯(W)=A1RW1R\displaystyle\bar{V}(W)={A\over 1-R}W^{1-R}

where AA is a positive constant which will be specified later. The risky asset investment amount is

Xt=μRσ2Wt.\displaystyle X_{t}=\frac{\mu}{R\sigma^{2}}W_{t}.
  1. (a)

    If α>0,β>0\alpha>0,\beta>0, then AA is the unique positive root of the following equation:

    (δ1RκR)A=α1RR1RA11R+β1R,\displaystyle({\delta\over 1-R}-{\kappa\over R})A={\alpha}^{1\over R}{R\over 1-R}A^{1-{1\over R}}+{\beta\over 1-R},

    Moreover, the optimal consumption rate is

    ct=(αA)1RWt.\displaystyle c_{t}^{*}=({\alpha\over A})^{{1\over R}}W_{t}.
  2. (b)

    If α>0,β=0\alpha>0,\beta=0, then A=α(λ)RA=\alpha(\lambda^{\infty})^{-R}, the optimal consumption rate is ct=λWt.c_{t}^{*}=\lambda^{\infty}W_{t}.

  3. (c)

    If α=0\alpha=0, β>0\beta>0, then A=βδρA={\beta\over\delta-\rho}, the optimal consumption rate is ct=0.c_{t}^{*}=0.

Proof:  See Appendix A. \Box

According to this Proposition 2, without the risk constraint constraint, the optimal strategy is a constant proportion of wealth invested in the risky asset and a constant consumption-wealth ratio. As a result, the wealth process is a geometric Brownian motion. Since WtW_{t} has a lognormal distribution, the risk capacity constraint fails with a positive positive probability for any time t>0t>0.

3.2 The unconstrained and constrained region

Assuming the value function V¯(W)\bar{V}(W) is C2C^{2} smooth, we characterize the unconstrained region explicitly under certain conditions in the following result.

Proposition 3

Assume the value function V¯(W)\bar{V}(W) is C2C^{2} smooth, and the following three conditions hold.

  1. (1)

    (“Nontrival unconstrained region”) There exists a positive number W0>0W_{0}>0 such that (0,W0)𝒰(0,W_{0})\subseteq{\cal U}.

  2. (2)

    (“Order of value function”) There exists two positive numbers C0,C1C_{0},C_{1} such that C0WRV(W)C1WRC_{0}W^{-R}\leq V^{\prime}(W)\leq C_{1}W^{-R} for all W(0,)W\in(0,\infty).

  3. (3)

    (“Single crossing”) The function g(W)βWR1(μWσ2LR)α1/Rμ2σ2LR(V¯(W))11Rg(W)\equiv-\beta W^{-R-1}(\mu W-\sigma^{2}LR)-\alpha^{1/R}{\mu^{2}\over\sigma^{2}LR}(\bar{V}(W)^{\prime})^{1-{1\over R}} changes the sign at most one time in the region (0,)(0,\infty),

Then there exists a positive number WW^{*} such that 𝒰=(0,W]{\cal U}=(0,W^{*}], and =(W,){\cal B}=(W^{*},\infty).

Proof:  See Appendix A. \Box

Proposition 3 is crucial to derive an explicit solution of the general portfolio choice problem (4). It states that both the unconstrained and constrained region are simply determined by a finite positive number WW^{*} under certain conditions about the value function. Assuming the value function is solved, the optimal investment strategy is X=μσ2V¯(W)V¯′′(W)X^{*}=-\frac{\mu}{\sigma^{2}}\frac{\bar{V}^{\prime}(W)}{\bar{V}^{\prime\prime}(W)} if WWW\leq W^{*}, and otherwise, X=LX^{*}=L. Furthermore, if α0\alpha\neq 0, then the optimal consumption rate c=α1/R(V¯(W))1/Rc^{*}=\alpha^{1/R}(\bar{V}^{\prime}(W))^{-1/R}. As will be shown below, the number WW^{*} is also essential to derive the value function explicitly.

In Proposition 3, the “nontrivial unconstrained region” condition (1) states that the risk constraint condition is satisfied when the wealth is sufficiently small. Its intuition is simple. If the wealth is reasonably small, the investor’s investment dollar amount in the risky asset is small as well, then Xt<LX_{t}<L. The “order of value function” condition (2) follows from the assumption of the CRRA utility function with order 1R1-R. Both condition (1) and (2) are straightforward (See their proofs in some important cases in Appendix A). Nevertheless, the “single crossing” condition (3) is more dedicated and essential in characterizing the unconstrained region precisely.

For general values of α\alpha and β\beta, it is hard to check the single crossing condition due to two components in g(W)g(W). To see it, we notice that its second component is strictly monotonic, as the value function increases and concave. Precisely, (V¯(W))11R(\bar{V}(W)^{\prime})^{1-{1\over R}} is increasing if R<1R<1 and decreasing otherwise. In contrast, its first component, WR1(μWσ2LR)W^{-R-1}(\mu W-\sigma^{2}LR), increases over the region Wσ2Lμ(R+1)W\leq\frac{\sigma^{2}L}{\mu}(R+1) and decreases in other region. In total, the function g(W)g(W) displays a complicated shape, and the single crossing condition itself depends on some properties of the value function, which is to be determined.

There are two special cases though, α=0\alpha=0 or β=0\beta=0, in which the single crossing condition is satisfied naturally. First, α=0\alpha=0, then g(W)=βWR1(μWσ2LR)g(W)=-\beta W^{-R-1}(\mu W-\sigma^{2}LR) only changes the sign at W=σ2LRμW=\frac{\sigma^{2}LR}{\mu}. Second, β=0\beta=0, then g(W)=α1/Rμ2σ2LR(V¯)11R<0g(W)=-\alpha^{1/R}{\mu^{2}\over\sigma^{2}LR}(\bar{V}^{\prime})^{1-{1\over R}}<0.

Our objective is to explicitly investigate the risk constraint’s implications by giving an analytical expression of the optimal consumption-investment policy. Therefore, we next focus on these two situations, α=0\alpha=0 or β=0\beta=0. Specifically, we explicitly solve two portfolio choice problems. The first one is given by

Problem (A): U(W0,L)sup(ct,Xt)𝒜(W0,L)𝔼[0eδtu(ct)𝑑t],\text{Problem (A): }U(W_{0},L)\equiv\sup_{(c_{t},X_{t})\in{\cal A}(W_{0},L)}\mathbb{E}\left[\int_{0}^{\infty}e^{-\delta t}u(c_{t})dt\right], (8)

in which the value function is written by U(W0,L)U(W_{0},L) to highlight the risk constraint level LL. For any 0<L1<L20<L_{1}<L_{2}, it is evident that U(W,0)U(W,L1)U(W,L2)U(W,)U(W,0)\leq U(W,L_{1})\leq U(W,L_{2})\leq U(W,\infty), where U(W,)U(W,\infty) is the value function in Merton’s model without the constraint on (Xt)(X_{t}). If there is no confusion, we will write U(W)U(W) to represent U(W,L)U(W,L) in this paper. When Xtf(Wt)X_{t}\leq f(W_{t}), in particular, Xtk(Wt+L),X_{t}\leq k(W_{t}+L), Problem (A) is studied in Vila and Zariphopoulou (1997), Zariphopoulou (1994).

The second one is given by

Problem (B): V(W0,L)=max(X)𝔼[0eδtu(Wt)𝑑t].\text{Problem (B): }V(W_{0},L)=\max_{(X)}\mathbb{E}\left[\int_{0}^{\infty}e^{-\delta t}u(W_{t})dt\right]. (9)

Here, we use V(W,L)V(W,L) in Problem (B) to denote the value function to distinguish from U()U(\cdot). Similarly, we do not distinguish V(W,L)V(W,L) with V(W)V(W) if it is evident.

In the following sections, we derive the explicit solution of Problem (A) and Problem (B), respectively.

4 Solution to Problem (A)

For Problem (A), the smooth property of the value function follows from a difficult theorem of Zariphopoulou (1994).666Precisely, Zariphopoulou (1994) investigates the constraint that Xtf(Wt)X_{t}\leq f(W_{t}) almost surely, where f(x):[0,)[0,)f(x):[0,\infty)\rightarrow[0,\infty) is increasing, concave and satisfies the global Lipschitz condition. Her dedicate argument goes through if f()f(\cdot) is a positive constant. Zariphopoulou (1994, Theorem 1.1) shows that U(W)U(W) is the unique C2C^{2} smooth function of the following HJB equation:

δU(W)=max0XL[μXU(W)+12σ2X2U′′(W)]+maxc0{u(c)cU(W)},(W>0)\delta U(W)=\max_{0\leq X\leq L}\left[\mu XU^{\prime}(W)+\frac{1}{2}\sigma^{2}X^{2}U^{\prime\prime}(W)\right]\\ +\max_{c\geq 0}\{u(c)-cU^{\prime}(W)\},(W>0) (10)

with U(0)=u(0)δU(0)=\frac{u(0)}{\delta}.

Since the value function U(W)U(W) is C2C^{2} smooth, both the unconstrained and constrained region are well-defined. On one hand, in the unconstrained region, the HJB equation (10) reduces to the following ordinary differential equation

δU=R1R(U)(1R)/Rκ(U)2U′′.\delta U=\frac{R}{1-R}(U^{\prime})^{-(1-R)/R}-\kappa\frac{(U^{\prime})^{2}}{U^{\prime\prime}}. (11)

On the other hand, in the constrained region {\cal B}, the HJB equation reduces to another ordinary differential equation

δU=R1R(U)(1R)/R+μLU+12σ2L2U′′.\delta U=\frac{R}{1-R}(U^{\prime})^{-(1-R)/R}+\mu LU^{\prime}+\frac{1}{2}\sigma^{2}L^{2}U^{\prime\prime}. (12)
Proposition 4

There is a positive number WW^{*} such that 𝒰=(0,W]{\cal U}=(0,W^{*}] and =(W,){\cal B}=(W^{*},\infty).

Proof:  See Appendix A. \Box

Proposition 4 is crucial to derive an explicit solution to Problem (A). It not only characterizes the constrained region but also reduces Problem (A) to determine the threshold number WW^{*} next.

Define

ω+=Rκ(κ+δ),\displaystyle\omega^{+}={R\over\kappa}(\kappa+\delta),

then, for any R>0,R1R>0,R\neq 1, Assumption A implies that ω+>max(R,1).\omega^{+}>\max(R,1).

Given a real number mm, define two auxiliary function H(C)H(C) and J(C)J(C) as follows.

H(C)=1λ(C+mCω+),H(C)=\frac{1}{\lambda^{\infty}}\left(C+mC^{\omega^{+}}\right), (13)
J(C)=1λ{C1R1R+mω+ω+RCω+R},J(C)=\frac{1}{\lambda^{\infty}}\left\{\frac{C^{1-R}}{1-R}+m\frac{\omega^{+}}{\omega^{+}-R}C^{\omega^{+}-R}\right\}, (14)
Proposition 5

There exists unique real numbers {C,m}\{C^{*},m\} which satisfies the following two equations

CH(C)=σ2RLμ,C^{*}H^{\prime}(C^{*})=\frac{\sigma^{2}RL}{\mu}, (15)

and

J(C)=K(W),J(C^{*})=K(W^{*}), (16)

where K(x)K(x) satisfies the following second-order ordinary differential equation

12σ2L2K′′(x)=δK(x)R1R(K(x))(1R)/RμLK(x),\displaystyle\frac{1}{2}\sigma^{2}L^{2}K^{\prime\prime}(x)=\delta K(x)-\frac{R}{1-R}(K^{\prime}(x))^{-(1-R)/R}-\mu LK^{\prime}(x), (17)

with K(W)=(C)RK^{\prime}(W^{*})=(C^{*})^{-R}, and K(W)K(W) has the same order as W1RW^{1-R} when WW\rightarrow\infty. Moreover, W=H(C)W^{*}=H(C^{*}).

Proof:  See Appendix A. \Box

Proposition 5 is understood as follows. In the unconstrained region (0,W](0,W^{*}], the relationship between wealth and optimal consumption rate is characterized by W=H(C)W=H(C), a strictly increasing function.777It is a well known fact the function H()H(\cdot) is strictly increasing, since U()U(\cdot) is concave and UWW=RCR1CW=RCR1/H(C)U_{WW}=-RC^{-R-1}\frac{\partial C}{\partial W}=-RC^{-R-1}/H^{\prime}(C). Furthermore, if the optimal consumption rate is a concave function of the wealth (Carroll and Kimball (1996)), then 2CW2=H′′(C)H(C)2CW<0\frac{\partial^{2}C}{\partial W^{2}}=-\frac{H^{\prime\prime}(C)}{H^{\prime}(C)^{2}}\frac{\partial C}{\partial W}<0 implies that H()H(\cdot) is a convex function. The value function U(W)U(W) is determined by J(C)J(C). Therefore, the value function is characterized with the auxiliary parameter CC. Moreover, the optimal investment policy is

X(W)=μRσ2CH(C).X^{*}(W)=\frac{\mu}{R\sigma^{2}}CH^{\prime}(C). (18)

Since CH(C)CH^{\prime}(C) is increasing with respect to CC, X(W)X^{*}(W) is a well-defined increasing function of the wealth in the unconstrained region. Therefore, X(W)<LX^{*}(W)<L holds for any wealth W<WW<W^{*}.

In the constrained region (W,)(W^{*},\infty), the optimal investment policy is straightforward, X=LX^{*}=L, the investor places LL dollars in the risky asset as long as the portfolio value is greater than WW^{*}. The value function U(W)=K(W)U(W)=K(W) satisfies the following ordinary differential equation

12σ2L2K′′(x)=δK(x)R1R(K(x))(1R)/RμLK(x).\displaystyle\frac{1}{2}\sigma^{2}L^{2}K^{\prime\prime}(x)=\delta K(x)-\frac{R}{1-R}(K^{\prime}(x))^{-(1-R)/R}-\mu LK^{\prime}(x). (19)

with the boundary conditions K(W)=J(C)K(W^{*})=J(C^{*}) and K(W)=(C)RK^{\prime}(W^{*})=(C^{*})^{-R}. These two boundary conditions of the function K(x)K(x) are exactly the value-matching and smooth-fit condition at WW^{*}. However, these two conditions alone are not sufficient yet to characterize uniquely the function K(x)K(x) as WW^{*} is to be determined. Therefore, we need a boundary condition at WW\rightarrow\infty, that is, the order of K(x)K(x), according to standard theory of ordinary differential equation. Finally, the optimal consumption rate is c=K(W)1Rc^{*}=K^{\prime}(W)^{-\frac{1}{R}}.

5 Solution to Problem (B)

In contrast to Problem (A), the value function of Problem (B) is not known to be C2C^{2} smooth under the risk constraint. Therefore, in solving Problem (B), we need to simultaneously investigate the smooth property of the value function. We follow three steps. First, we explicitly characterize the constrained and unconstrained region of the risk capacity constraint, assuming the value function is C2C^{2} smooth. This characterization of the constrained region follows from Proposition 6. Second, we provide an explicit expression of the value function, still assuming the C2C^{2} smooth property of the value function. Third, we present the sufficient and necessary condition of the C2C^{2} smoothness of the value function, given the expression of the value function in the second step.

Assuming the value function V(W)V(W) is C2C^{2} smooth, the value function on the unconstrained region satisfies

δV(W)=u(W)+κ(V(W))2V′′(W).\delta V(W)=u(W)+\kappa\frac{(V^{\prime}(W))^{2}}{-V^{\prime\prime}(W)}. (20)

Similarly, on the constrained region, V()V(\cdot) satisfies a second-order linear ODE

δV(W)=u(W)+μLV(W)+12σ2L2V′′(W).\delta V(W)=u(W)+\mu LV^{\prime}(W)+\frac{1}{2}\sigma^{2}L^{2}V^{\prime\prime}(W). (21)

In the above definition of the region 𝒰{\cal U} and {\cal B}, we assume that C2C^{2} smoothness of the value function ex ante. Without knowing the C2C^{2} smooth property of the value function, we can still directly investigate the ordinary differential equation (20) - (21). Later, we study these two ordinary differential equations and verify the C2C^{2} smooth property of the value function under certain conditions.

Proposition 6

Assume V(W)V(W) is C2C^{2} smooth, then there exists a positive number WW^{*} such that 𝒰=(0,W]{\cal U}=(0,W^{*}] and =(W,){\cal B}=(W^{*},\infty).

Proof:  See Appendix A. \Box

By Proposition 2, the number WW^{*} is a finite number. Similar to Proposition 4 for Problem (A), the characterization of 𝒰{\cal U} and {\cal B} in Proposition 6 reduces the solution to a number WW^{*}.

We next derive the sufficient and necessary condition of the C2C^{2} smoothness of the value function and simultaneously characterize the threshold WW^{*}. Define two real numbers

β1=μ+μ2+2δσ2σ2L,β2=μμ2+2δσ2σ2L,\beta_{1}=\frac{-\mu+\sqrt{\mu^{2}+2\delta\sigma^{2}}}{\sigma^{2}L},\beta_{2}=\frac{-\mu-\sqrt{\mu^{2}+2\delta\sigma^{2}}}{\sigma^{2}L}, (22)

where β1\beta_{1} and β2\beta_{2} are two roots of the following quadratic equation

12σ2L2β2+Lμβδ=0,\displaystyle\frac{1}{2}\sigma^{2}L^{2}\beta^{2}+L\mu\beta-\delta=0,

and β1>0>β2\beta_{1}>0>\beta_{2}.

Let

V0(W)\displaystyle V_{0}(W) =\displaystyle= 2(β1β2)(1R)σ2L2\displaystyle\frac{2}{(\beta_{1}-\beta_{2})(1-R)\sigma^{2}L^{2}}
×{eβ2W0Wx1Reβ2x𝑑xeβ1W0Wx1Reβ1x𝑑x}.\displaystyle\times\left\{e^{\beta_{2}W}\int_{0}^{W}x^{1-R}e^{-\beta_{2}x}dx-e^{\beta_{1}W}\int_{0}^{W}x^{1-R}e^{-\beta_{1}x}dx\right\}.

The function V0(W)V_{0}(W) is a well-defined smooth function for W>0W>0. We recall the expression of lower incomplete Gamma function,

γ(s,x)=0xts1et𝑑t,Re(s)>0,\displaystyle\gamma(s,x)=\int_{0}^{x}t^{s-1}e^{-t}dt,Re(s)>0,

which is well-defined for all real number x>0x>0. Therefore, V0(W)V_{0}(W) is well-defined for any 0<R<20<R<2.888We refer to Appendix B for basic properties of the incomplete Gamma function.

Given V0()V_{0}(\cdot), the following two functions C(W)C(W) and D(W)D(W) are well-defined.

C(W)=2(β1β2)(1R)σ2L2β1R1Γ(2R)eβ1W[μ+Lσ2β1]μV0(W)σ2LV0′′(W)σ2L(β2)2eβ2W+μβ2eβ2W,C(W)={-{2\over(\beta_{1}-\beta_{2})(1-R)\sigma^{2}L^{2}}\beta_{1}^{R-1}\Gamma(2-R)e^{\beta_{1}W}[\mu+L\sigma^{2}\beta_{1}]-\mu V_{0}^{\prime}(W)-\sigma^{2}LV_{0}^{\prime\prime}(W)\over\sigma^{2}L(\beta_{2})^{2}e^{\beta_{2}W}+\mu\beta_{2}e^{\beta_{2}W}}, (23)

and

D(W)={2(β1β2)(1R)σ2L2(β1)R1Γ(2R)eβ1W+C(W)β2eβ2W+V0(W)}1R.D(W)=\left\{\frac{2}{(\beta_{1}-\beta_{2})(1-R)\sigma^{2}L^{2}}(\beta_{1})^{R-1}\Gamma(2-R)e^{\beta_{1}W}+C(W)\beta_{2}e^{\beta_{2}W}+V_{0}^{\prime}(W)\right\}^{-\frac{1}{R}}. (24)

Given any W>0W>0, we define a function G(g),0gD(W),G(g),0\leq g\leq D(W), by the following second-order ordinary differential equation,

G′′(g)=Rκg1G(g)[δρgR(G(g))R]G^{\prime\prime}(g)={R\over\kappa}g^{-1}G^{\prime}(g)\left[\delta-\rho-g^{R}(G(g))^{-R}\right] (25)

with boundary condition G(0)=0,G(D(W))=WG(0)=0,G(D(W))=W and G(D(W))=LRσ2μ(D(W))1G^{\prime}(D(W))={LR\sigma^{2}\over\mu}(D(W))^{-1} (See King, Billingham, and Otto (2003) for the properties of second-order ordinary differential equation).

Finally, we define one equation of the variable WW as follows.

u(0)δ+0D(W)gRG(g)𝑑g=2(β1β2)(1R)σ2L2(β1)R2Γ(2R)eβ1W+C(W)eβ2W+V0(W).\frac{u(0)}{\delta}+\int_{0}^{D(W)}g^{-R}G^{\prime}(g)dg={2\over(\beta_{1}-\beta_{2})(1-R)\sigma^{2}L^{2}}(\beta_{1})^{R-2}\Gamma(2-R)e^{\beta_{1}W}+C(W)e^{\beta_{2}W}+V_{0}(W). (26)

In what follows, we are interested in one particular function G()G(\cdot) that is defined by one specific number WW^{*}.

Proposition 7

Assume 0<R<20<R<2.

(“Necessary”) If V(W,L)V(W,L) is C2C^{2}, then there exists a unique positive solution WW^{*} of Equation (26), and the value function is given by

V~(W,L)={u(0)δ+0G1(W)gRG(g)𝑑g,WW2(β1β2)(1R)σ2L2(β1)R2Γ(2R)eβ1W+C(W)eβ2W+V0(W),W>W.\displaystyle\tilde{V}(W,L)=\begin{cases}\frac{u(0)}{\delta}+\int_{0}^{G^{-1}(W)}g^{-R}G^{\prime}(g)dg,&W\leq W^{*}\cr\frac{2}{(\beta_{1}-\beta_{2})(1-R)\sigma^{2}L^{2}}(\beta_{1})^{R-2}\Gamma(2-R)e^{\beta_{1}W}+C(W^{*})e^{\beta_{2}W}+V_{0}(W),&W>W^{*}.\end{cases} (27)

Here, the function G(g)G(g) is defined by the real number WW^{*}.

(“Verification”) Assume the existence of a positive solution WW^{*} of Equation (26). Moreover, in the region 0gg=D(W)0\leq g\leq g^{*}=D(W^{*}), G(g)G(g) is increasing,

G(g)(δρ+κR)1Rg,\displaystyle G(g)\geq\left(\delta-\rho+\frac{\kappa}{R}\right)^{-\frac{1}{R}}g,

and μV~W(W,L)+L2V~WW(W,L)0\mu\tilde{V}_{W}(W,L)+L^{2}\tilde{V}_{WW}(W,L)\geq 0 for all WWW\geq W^{*}. Then the number WW^{*}, as a positive solution of Equation (26) is unique, the value function V(W,L)V(W,L) is C2C^{2} smooth and V(W,L)=V~(W,L)V(W,L)=\tilde{V}(W,L) in (27).

Proof:  See Appendix A. \Box

Proposition 7 presents a sufficient and necessary condition of the C2C^{2} smoothness of the value function and solves Problem (B) explicitly. It also presents a closed-form expression of the value function and the optimal strategy in terms of WW^{*} and the auxiliary function G()G(\cdot). By its construction, if there is one positive solution of Equation (26), then the solution WW^{*} is unique, and the unconstrained region and the constrained region are separated by WW^{*}. Moreover, the value function V(W,L)V(W,L) is a C2C^{2} smooth function of the HJB equation and given by the expression (27) in Proposition 7. Conversely, if V(W,L)V(W,L) is C2C^{2} smooth, then there exists a unique solution of Equation (26). The C2C^{2} smoothness of the value function is essentially the existence of a positive solution of the nonlinear equation (26) of one variable.

If L=L=\infty, it reduces 𝒰=(0,){\cal U}=(0,\infty) and an empty region {\cal B}. By Proposition 2, the function G()G(\cdot) is a linear function. For any L<L<\infty, the function G()G(\cdot) is highly non-linear, and its non-linearity is equivalent to the non-myopic property of the optimal strategy, as will be explained in the next section.

6 Applications to optimal retirement portfolio

In this section, we present applications to the optimal portfolio for retirees. Our objective is to explain the implications of the risk capacity constraint and demonstrate its substantial difference from the leverage constraint.

There are several distinct features in the retiree’s portfolio choice problem compared with a standard investor before retirement. (1) The retiree has a fixed cash flow from his social security account post-retirement.999 See www.ssa.gov for the social security system in the U.S.A. There are similar social security systems in Europe and Canada. (2) He has no labor income. (3) He has a mortality risk, and (4) he becomes more risk-averse than before retirement because he has concerns about the market downturn and has no sufficient time to wait for the market return, leading to a substantial decline in his living standards. We formally incorporate these features into the optimal portfolio choice problems as follows.

The retirement date is zero. The retiree’s initial wealth is W0W_{0} at the retirement date. Since the retiree faces his mortality risk, the investment time-horizon is uncertain, neither a fixed finite time nor infinity. We assume that the investor’s death time τ\tau has an exponential distribution with mean λ\lambda, that is, the probability of the retiree surviving in the next tt years is eλte^{-\lambda t}. The investor’s average lifetime is 1λ\frac{1}{\lambda}, and the variance of his lifetime is 1λ2\frac{1}{\lambda^{2}}. For example, if λ=0.05\lambda=0.05, an ordinary retiree who retires at 65 is likely to die at 85 years old. τ\tau is independent of the information set {\cal F}_{\infty}.

We first consider an optimal portfolio choice problem with random maturity (see Blanchet-Scaillet, et al. 2008; Chen, Fu, and Zhou, 2020) as follows,

maxX,c𝔼[0τeδtu(ct)𝑑t].\max_{X,c}\mathbb{E}\left[\int_{0}^{\tau}e^{-\delta t}u(c_{t})dt\right]. (28)

Here the wealth process WtW_{t} satisfies dWt=Xt(μdt+σdZt)ctdt,W0=WdW_{t}=X_{t}(\mu dt+\sigma dZ_{t})-c_{t}dt,W_{0}=W, Wt0a.s.,t0W_{t}\geq 0\ a.s.,\forall t\geq 0, and the risk capacity constraint 0XtL,tτ0\leq X_{t}\leq L,\forall t\leq\tau.101010When the retiree receives a constant social security stream, the wealth equation becomes dWt=Xt(μdt+σdZt)(cty0)dtdW_{t}=X_{t}(\mu dt+\sigma dZ_{t})-(c_{t}-y_{0})dt, y0y_{0} represents the social security with continuously compounding, the discussion in Problem (A) can be applied if we consider u(cty0)u(c_{t}-y_{0}) in Problem (A). The reason is as follows. If the social security is sufficient, there is no need to withdraw from the pension portfolio. Therefore, the retiree focuses on the difference cty0c_{t}-y_{0} in the withdraw decision. To sharpen our intuition of the risk capacity constraint, we omit the social security or other fixed cash-flow income. We also ignore medical costs or other costs in the portfolio choice decision. In this problem, the investor finds the best withdraw (consumption) rate whereas the inheritance wealth is not a concern. Because the stopping time τ\tau is independent from the equity market information,

𝔼[0τeδtu(ct)𝑑t]\displaystyle\mathbb{E}\left[\int_{0}^{\tau}e^{-\delta t}u(c_{t})dt\right] =\displaystyle= 𝔼[0eδt1τtu(ct)𝑑t]\displaystyle\mathbb{E}\left[\int_{0}^{\infty}e^{-\delta t}1_{\tau\geq t}u(c_{t})dt\right]
=\displaystyle= 𝔼[0𝔼[eδt1τtu(ct)|]𝑑t]\displaystyle\mathbb{E}\left[\int_{0}^{\infty}\mathbb{E}[e^{-\delta t}1_{\tau\geq t}u(c_{t})|{\cal F}_{\infty}]dt\right]
=\displaystyle= 𝔼[0eδtu(ct)eλt𝑑t]\displaystyle\mathbb{E}\left[\int_{0}^{\infty}e^{-\delta t}u(c_{t})e^{-\lambda t}dt\right]

where we make use of the fact that P(τt)=eλtP(\tau\geq t)=e^{-\lambda t}. Therefore, this optimal retirement portfolio reduces to Problem (A) in which the the subjective discount factor is replaced by δ+λ\delta+\lambda. It is thus natural to assume that δ+λ<1\delta+\lambda<1 and then δ+λ>ρ\delta+\lambda>\rho.

Alternatively, the second optimal retirement portfolio problem for the retiree at time zero is as follows (see Liu and Loewenstein, 2002),

J(W,L)=max(X)𝔼[eδτu((1α)Wτ)]J(W,L)=\max_{(X)}\mathbb{E}\left[e^{-\delta\tau}u((1-\alpha)W_{\tau})\right] (29)

where δ\delta is the retiree’s subjective discount factor, α\alpha is the inheritance tax rate of the wealth. XtX_{t} satisfies the risk capacity constraint. Given the distribution of τ\tau, and the independent assumption between τ\tau and {\cal F}_{\infty}, using the same derivation as in the first problem, the retiree’s optimal retirement portfolio problem (29) is reduced to

J(W,L)=max(X)𝔼[0e(λ+δ)tu((1α)Wt)𝑑t].J(W,L)=\max_{(X)}\mathbb{E}\left[\int_{0}^{\infty}e^{-(\lambda+\delta)t}u((1-\alpha)W_{t})dt\right]. (30)

Assuming u(W)=W1R1R,R>0,R1u(W)=\frac{W^{1-R}}{1-R},R>0,R\neq 1, and using its scaling property,

u((1α)Wt)=(1α)1Ru(Wt),u((1-\alpha)W_{t})=(1-\alpha)^{1-R}u(W_{t}),

we have

J(W,L)=(1α)1Rmax(X)𝔼[0e(λ+δ)tu(Wt)𝑑t].J(W,L)=(1-\alpha)^{1-R}\max_{(X)}\mathbb{E}\left[\int_{0}^{\infty}e^{-(\lambda+\delta)t}u(W_{t})dt\right]. (31)

A general expression of the retirement portfolio problem elaborates these two cases. If both the consumption/withdraw and the inheritance wealth are considered together, 111111In practice, a constant consumption rate is often fixed. For instance, a standard withdrawal rate is between 4% to 5%. See Bengen (1994). Then the problem reduces to Problem (B).

𝔼[0τeδtu(ct)𝑑t+eδτu((1α)Wτ)]=𝔼[0e(λ+δ)t{u(ct)+(1α)1Ru(Wt)}𝑑t].\displaystyle\mathbb{E}\left[\int_{0}^{\tau}e^{-\delta t}u(c_{t})dt+e^{-\delta\tau}u((1-\alpha)W_{\tau})\right]=\mathbb{E}\left[\int_{0}^{\infty}e^{-(\lambda+\delta)t}\left\{u(c_{t})+(1-\alpha)^{1-R}u(W_{t})\right\}dt\right].

It reduces to the optimization problem (4) studied in Section 2 and Section 3.

6.1 Alternative portfolio choice under a leverage constraint

We first present the solution to a relevant portfolio choice problem by replacing the risk capacity constraint with a leverage constraint for a comparison purpose. Specifically, we assume XtbWt,tX_{t}\leq bW_{t},\forall t. We use a predetermined number of bb to represent the highest possible percentage of wealth invested in the risky asset. For instance, b=0.7b=0.7 means at most 70 percent of the portfolio is invested in the risky asset. Define

Vb(W)=sup(X)𝔼[α0eδtu(ct)𝑑t+β0eδtu(Wt)𝑑t],V^{b}(W)=\sup_{(X)}\mathbb{E}\left[\alpha\int_{0}^{\infty}e^{-\delta t}u(c_{t})dt+\beta\int_{0}^{\infty}e^{-\delta t}u(W_{t})dt\right], (32)

where the risk capacity constraint is replaced by XtbWt,tX_{t}\leq bW_{t},\forall t. By Proposition 2, we assume that b<μRσ2b<\frac{\mu}{R\sigma^{2}}. Otherwise, Vb(W)V^{b}(W) is solved by Proposition 2 for all bμRσ2b\geq\frac{\mu}{R\sigma^{2}}.

Proposition 8

Under the constraint that XtbWtX_{t}\leq bW_{t} and b<μRσ2b<\frac{\mu}{R\sigma^{2}}, then

Vb(W)=Bu(W),V^{b}(W)=Bu(W), (33)

the optimal strategy is Xt=bWtX_{t}=bW_{t}, and the optimal consumption rate is ct=(αB)1/RWtc_{t}=\left(\frac{\alpha}{B}\right)^{1/R}W_{t}. Here, BB is a unique positive number satisfying

[δ+(1R)(12σ2b2Rμb)]B=β+α1/RRB11R.\displaystyle\left[\delta+(1-R)(\frac{1}{2}\sigma^{2}b^{2}R-\mu b)\right]B=\beta+\alpha^{1/R}RB^{1-\frac{1}{R}}.

Proof:  See Appendix A. \Box

Proposition 8 states that a constant percentage strategy Xt=bWtX_{t}=bW_{t} is an optimal policy under a leverage constraint. For α=1\alpha=1 and β=0\beta=0, Proposition 8 reduces to Vila and Zariphopoulou (1997, Proposition 4.2). For α=0,β=1\alpha=0,\beta=1, B=1δ+(1R)(12σ2b2Rμb)B={1\over\delta+(1-R)({1\over 2}\sigma^{2}b^{2}R-\mu b)}. Under the leverage constraint XtbWtX_{t}\leq bW_{t}, the general portfolio choice problem in (4) has a similar optimal strategy and a constant consumption-wealth ratio (for α0)\alpha\neq 0). The wealth process is a geometric Brownian motion. As a consequence, the portfolio wealth at any time has a lognormal distribution.

6.2 Optimal strategy

We start with the optimal investing strategy in Problem (A) - (B). Its explicit expression is given by the next result.

Corollary 1

The optimal investment strategy in Problem (B) is

X(W)={μRσ2gG(g),WW,L,W>W.\displaystyle X(W)=\begin{cases}\frac{\mu}{R\sigma^{2}}gG^{\prime}(g),&W\leq W^{*},\cr L,&W>W^{*}.\end{cases} (34)

Th expression of the optimal investment strategy for Problem (A) is the same if gG(g)gG^{\prime}(g) is replaced by CH(C)CH^{\prime}(C). The optimal portfolio strategies in both Problem (A) and Problem (B) are not the myopic.

Since the auxiliary parameter, CC in Proposition 5 and gg in Proposition 7, represents the optimal consumption rate cc^{*} in the unconstrained region {WW}\{W\leq W^{*}\}, the optimal investment strategy shares the same expression in Corollary 1. If the optimal strategy were the myopic strategy in the sense that Xt=min{μRσ2Wt,L},X_{t}=\min\left\{\frac{\mu}{R\sigma^{2}}W_{t},L\right\}, then the function G()G(\cdot) or H()H(\cdot) would be a linear function, and W=LRσ2μW^{*}=\frac{LR\sigma^{2}}{\mu}. For Problem (B), it is impossible by the definition of the function G()G(\cdot), and Equation (26) fails for W=LRσ2μW^{*}=\frac{LR\sigma^{2}}{\mu} because the left side is a polynomial function while the right hand is virtually an incomplete Gamma function. Intuitively, the risk capacity constraint in the future affects the investment decision even though the constraint is not binding instantly. By the same reason, in Problem (A), the parameter m0m\neq 0 in function H()H(\cdot) since the future risk capacity constraint affects the decision even for W<WW<W^{*}. In other words, while the risk capacity constraint is not binding, the investment strategy is affected by the fact that the constraint may be binding in the future. The major difference between Problem (A) and Problem (B) is the characterization of the threshold WW^{*} in terms of different non-linear equation.

As a numerical illustration, we plot the auxiliary function G()G(\cdot) and the investing strategy X(W)X(W) in Problem (B). We choose the risk premium μ=0.10\mu=0.10 to consistent with the market data of S & P 500 between 1948 and 2018. We choose λ=0.07\lambda=0.07 to consistent to approximately 15 years of life after retirement. Assuming the initial retirement portfolio worth 1 million, we choose 700,000700,000 as the maximum dollar amount in the stock market. Let σ=30%\sigma=30\%. The number σ\sigma is slightly higher than the calibration of the market index since our purpose is to highlight the high likelihood of the market downturn, which is a big concern for the retiree. We choose R=0.5R=0.5. By calculation, the expected value for retirement level is W=490,235W^{*}=490,235 in Equation (26).

As shown in Figure 1 and Figure 2, since G()G(\cdot) is not a linear function, the strategy is not a myopic one. By the same reason, X(W)X(W) as a function of the wealth is not C1C^{1} since

X(W)W|W=W=μRσ2(gG(g))G(g)=μRσ2(1+gG′′(g)G(g))0.\displaystyle\frac{\partial X(W)}{\partial W}|_{W=W^{*}-}=\frac{\mu}{R\sigma^{2}}\frac{(gG^{\prime}(g))^{\prime}}{G^{\prime}(g)}=\frac{\mu}{R\sigma^{2}}\left(1+\frac{gG^{\prime\prime}(g)}{G^{\prime}(g)}\right)\neq 0.

The percentage of wealth in the risky asset, X(W)W\frac{X(W)}{W}, can be analyzed similarly. In the constrained region, WWW\geq W^{*}, the percentage of wealth is LW\frac{L}{W}. The larger the wealth, the smaller percentage of wealth is invested in the stock market. On the other hand, in the unconstrained region, X(W)W=μRσ2gG(g)G(g)\frac{X(W)}{W}=\frac{\mu}{R\sigma^{2}}\frac{gG^{\prime}(g)}{G(g)}. This function also decreases with respect to the wealth as shown in Figure 3.

A decreasing percentage of wealth invested in stocks is different from the surveys of the household before retirement. Wachter and Yogo (2010) explain the increasing portfolio shares in the wealth. In contrast, we show the decreasing effect of the risk capacity constraint on the portfolio share X(W)W\frac{X(W)}{W}. In the retirement period, the more wealth, the less portfolio share if the retirees are concerned about the living standard risk.121212However, we admit that our model mainly applies to a median household, not for the wealthy. Even though the wealthiest household still saves more in the portfolio, they are heavily skewed toward risky assets such as their own privately-held business and different preference with the median household. See Carroll (2002). Both Figure 2 and Figure 3 show that the optimal portfolio strategy displays a strong risk-averse feature by comparing with the benchmark model without the risk capacity constraint. Figure 4 displays a similar comparison when the risk aversion parameter R=1.5R=1.5.

Similarly, we illustrate the the effect of the risk capacity constraint and its comparison with the leverage constrain for Problem (A) in Figure 5. The Benchmark represents Merton’s classical model in which a constant proportion invested in the equity market. “VZ” denotes the optimal portfolio strategy solved in Vila and Zariphopoulou (1997) under a leverage constraint Xt12μRσ2WtX_{t}\leq\frac{1}{2}\frac{\mu}{R\sigma^{2}}W_{t}. At first glance, It seems that the optimal investment stratery XX is C1C^{1} at WW^{*}, which is mainly because the function HH (see (13)) in Problem (A) has better smoothness than GG (see (25)) in Problem (B). However, we point out that this is not true. Indeed, by Lemma C.4 in Vila and Zariphopoulou (1997), in the unconstrained region 𝒰{\cal U}, we have

dXdW=μRσ2C+m(ω+)2Cω+C+mω+Cω+\displaystyle{dX\over dW}={\mu\over R\sigma^{2}}{C+m(\omega^{+})^{2}C^{\omega^{+}}\over C+m\omega^{+}C^{{\omega}^{+}}}

If XX is C1C^{1} at WW^{*}, then we must have

C+m(ω+)2(C)ω+=0\displaystyle C^{*}+m(\omega^{+})^{2}(C^{*})^{\omega^{+}}=0

which contradicts what we derived in Proposition 5.

6.3 Wealth process

Given the optimal strategy characterized in Corollary 1, the optimal wealth process in Problem (B) is uniquely determined by (for W=G(g)W=G(g))

dWt=min{μRσ2gG(g),L}(μdt+σdZt),W0=W>0.dW_{t}=\min\left\{\frac{\mu}{R\sigma^{2}}gG^{\prime}(g),L\right\}(\mu dt+\sigma dZ_{t}),W_{0}=W>0. (35)

It can be shown that the stochastic differential equation (35) has a strong solution. Therefore, we can directly analyze the portfolio by the stochastic differential equation (SDE) (35). We obtain a similar SDE of the wealth process in Problem (A).

The portfolio dynamic is as follows. Assuming wealth Wt=WW_{t}=W^{*} at a time tt from below, then in the instant time period, [t,t+δt][t,t+\delta t], Wt,t+δt=Wt+L(μδt+σδtζ),W_{t,t+\delta t}=W_{t}+L(\mu\delta t+\sigma\sqrt{\delta t}\zeta), and St+δt=St+St(μδt+δtζ),S_{t+\delta t}=S_{t}+S_{t}\left(\mu\delta t+\sqrt{\delta t}\zeta\right), where ζ\zeta is a standard normal variable. In a good scenario of the stock market, St+δtStS_{t+\delta t}\geq S_{t}, that is, μδt+σδtζ>0\mu\delta t+\sigma\sqrt{\delta t}\zeta>0, then Wt+δtWtW_{t+\delta t}\geq W_{t}, so the same dollar amount LL is still invested in the stock market. If the market drops in the period [t,t+δt][t,t+\delta t], St+δt<StS_{t+\delta t}<S_{t}, then Wt+δt<WW_{t+\delta t}<W^{*}, the portfolio value reduces and is smaller than the threshold WW^{*}, then a new dollar amount, μRσ2V(Wt+δt)V′′(Wt+δt)\frac{\mu}{-R\sigma^{2}}\frac{V^{\prime}(W_{t+\delta t})}{V^{\prime\prime}(W_{t+\delta t})}, is invested in the stock market. This process continuous between the unconstrained region and the constrained region.

The retirement portfolio’s return process is

dWtWt=min{μRσ2gG(g),L}Wt(μdt+σdZt).\displaystyle\frac{dW_{t}}{W_{t}}=\frac{\min\left\{\frac{\mu}{R\sigma^{2}}gG^{\prime}(g),L\right\}}{W_{t}}\left(\mu dt+\sigma dZ_{t}\right).

Therefore, the instantaneous variance, Var[dWtWt]Var\left[\frac{dW_{t}}{W_{t}}\right], converges to zero as WW\rightarrow\infty. When the wealth is sufficiently high, the risk of the portfolio is very small so the retiree is able to resolve the living standard risk, regardless of possible market downturn. More importantly, the instantaneous covariance between dWtWt\frac{dW_{t}}{W_{t}} and dStSt\frac{dS_{t}}{S_{t}} is

Cov(dWtWt,dStSt)=X(Wt)Wtσ20, as Wt.Cov\left(\frac{dW_{t}}{W_{t}},\frac{dS_{t}}{S_{t}}\right)=\frac{X(W_{t})}{W_{t}}\sigma^{2}\rightarrow 0,\text{ as }W_{t}\rightarrow\infty. (36)

Hence, the portfolio is virtually independent from the stock market if the portfolio value is large enough. The same result holds for Problem (A) by replacing gG(g)gG^{\prime}(g) by CH(C)CH^{\prime}(C) and W=H(C)W=H(C).

The next result summarizes our discussion.

Corollary 2

Under the risk capacity constraint in Problem (A) and Problem (B), the optimal portfolio is virtually independent of the stock market if the retirement portfolio value is large enough.

6.4 Implications

In this section, we explain several implications to the retirement portfolio from our results.

First, the retiree needs to invest in the stock market since the all-safe strategy is too conservative to sustain the spending given longevity risk. Indeed, Vanguard (2018), among others, suggests that investing after retirement is both necessary and vital. Second, we demonstrate that the risk capacity constraint captures the retiree’s living standard risk, and the optimal portfolio under the risk capacity constraint is a reasonable retirement strategy. Specifically, if the retirement portfolio value is not high enough, the retiree should invest some money in the stock market to increase the growth rate. However, when the portfolio value is high enough, the retiree implements a “contingent constant-dollar amount strategy” by only placing LL dollar of the portfolio in the stock market as long as the portfolio value is higher than WW^{*}. Third, under the risk capacity constraint, the higher the portfolio value, the smaller the wealth in the stock market. As a result, the portfolio can reduce the living standard risk because its return is asymptotically independent of the stock market for a high level of portfolio value. Fourth, the risk capacity constraint and the leverage constraint yield different investment strategies. The generating retirement portfolio is perfectly correlated to the stock market by implementing a leverage constraint, so the retiree faces a substantial market risk. Finally, these features are robust regardless of the retiree focuses on consumption (Problem (A)) or the inheritance wealth to her heirs (Problem (B)). In interpreting this optimal strategy and saving policy, the number WW^{*} is crucial. Given its unique feature in the investment strategy, this number measures the expected lump sum of the spending in the retirement period.

It is interesting to see the effect of the capacity LL to the threshold WW^{*}, and we write it as W(L)W^{*}(L). By its definition, we write

X(W(L),L)=L,X(W^{*}(L),L)=L, (37)

where X(,L)X(\cdot,L) denotes the dollar amount function of wealth in the unconstrained region. By the chain rule in Calculus, we obtain

XWW(L)L+XL=1.\frac{\partial X}{\partial W}\frac{\partial W^{*}(L)}{\partial L}+\frac{\partial X}{\partial L}=1. (38)

Therefore, W(L)L>0\frac{\partial W^{*}(L)}{\partial L}>0 if (a) XW>0\frac{\partial X}{\partial W}>0, and (b) XL<1\frac{\partial X}{\partial L}<1. Here, the condition (a) means the monotonic property of the investment in the risky asset in the unconstrained region.131313In Problem (A), if the function CH(C)CH^{\prime}(C) is increasing, then X(W)X^{*}(W) is increasing to the wealth WW. Similarly, if the function gG(g)gG^{\prime}(g) increases, we obtain the monotonic property (a) in Problem B. Both follow from the concavity of the consumption rate in Carroll and Kimball (1996). The result still holds if the decreasing rate of H()H^{\prime}(\cdot) or G()G^{\prime}(\cdot) is bounded from above by the constant 1, even though the concavity of the consumption rate fails. The condition (b) states that the marginal effect of the capacity to the dollar investment is less than one. While it seems difficult to prove condition (a) and condition (b) rigorously, their intuitions are appealing. For instance, the condition (b) roughly means that

X(W,L+ϵ)<X(W,L)+ϵ.\displaystyle X(W,L+\epsilon)<X(W,L)+\epsilon.

To see it, let X(W,L)X(W,L) be the optimal investment for the capacity LL, and we now increase the capacity by ϵ\epsilon. Since the capacity is the maximum possible dollar amount invested in the market, the dollar invested in the equity with the new capacity level L+ϵL+\epsilon should be bounded from above by the sum of X(W,L)X(W,L) and ϵ\epsilon. Figure 6 also numerically demonstrates this property of W(L)W^{*}(L) to the capacity LL.

Choosing the parameter LL or l=L/W0l=L/W_{0} is practically interesting to implement the risk capacity constraint. If L1<L2L_{1}<L_{2}, the invested dollar amount in the stock market under the constraint XtL1X_{t}\leq L_{1} is bounded by the corresponding money invested in the stock market for the level L2L_{2}. While an increasing level of LL invests in the portfolio’s expected return, the portfolio becomes riskier. Therefore, a suitable level of LL depends on its counter-effect to the expected return and risk.

Given the relationship between WW^{*} and LL in Proposition 5 and Proposition 7, a plausible method to set the capacity LL is to first estimate the number WW^{*} and solve the capacity LL conversely. For example, by estimating all expected costs in the retirement period, the retiree might be able to estimate WW^{*}, say, 1 million. The Equation (26) reduces one equation of the variable - the capacity LL, which can be solved numerically. In this way, Proposition 5 and Proposition 7 provide the optimal strategy when the wealth does not meet the threshold WW^{*} yet.

Finally, we demonstrate that (in Proposition 5) the optimal consumption rate is not a simple linear function of the wealth. Whether the wealth is greater than the threshold WW^{*} or not, the optimal consumption rate is a highly nonlinear function of the wealth. Therefore, standard annuities are not optimal from an optimal portfolio choice perspective, if the risk of living standard is a concern.

7 Conclusion

This paper solves an optimal portfolio choice problem under risk capacity constraint in an infinite horizon framework. We present an explicit consumption-saving policy for two critical situations. Then we apply our results to the asset allocation problem for a retiree with longevity risk and living standard risk when the retiree has a preference on a stream of consumption rates or inheritance wealth, respectively. We demonstrate that the risk capacity constraint implies a buffer-stock saving strategy and reduces the living standard risk. By contrast, the leverage constraint generates substantial living standard risk. Our discussions rely on the explicit characterization of the regions on which the risk capacity constraint is binding and a detailed analysis of the smooth property of the value function.

Appendix A. Proofs

Proof of Proposition 1.

Zariphopoulou (1994) demonstrates the result for β=0\beta=0. We assume that β>0\beta>0. It is standard to show that V¯(0)=α+βδu(0)\bar{V}(0)=\frac{\alpha+\beta}{\delta}u(0) and V¯(W)\bar{V}(W) is (strictly) continuous, increasing and concave. We show that V¯(x)\bar{V}(x) is the viscosity solution of (5) and such a solution is unique.

The existence part is standard in the theory of viscosity solution. See Fleming and Soner (2006, Chapter 3). To prove the uniqueness part it suffices to prove the following comparison principle: if V1(W)V_{1}(W) is the viscosity supersolution and V2(W)V_{2}(W) is the viscosity subsolution and satisfies V1(0)V2(0)V_{1}(0)\geq V_{2}(0), then V1(W)V2(W)V_{1}(W)\geq V_{2}(W) for all W(0,)W\in(0,\infty).

Since the function u(W)u(W) is not Lipschitz, we cannot apply the standard comparison principle directly in our situation. For this purpose, we separate (0,)(0,\infty) into two parts: (0,δ)(0,\delta) and (δ,)(\delta,\infty) for a proper positive number δ\delta, then show that ϵ>0\forall\epsilon>0,V1(W)+ϵV2(W),W>0.V_{1}(W)+\epsilon\geq V_{2}(W),\ \ \forall W>0. Since V1(0)V2(0)V_{1}(0)\geq V_{2}(0), there exists δ>0\delta>0, such that

V1(W)+ϵV2(W),W(0,δ].V_{1}(W)+\epsilon\geq V_{2}(W),\ \ \forall W\in(0,\delta]. (A-1)

On the region W(δ,)W\in(\delta,\infty), u(W)u(W) is Lipchitz. Since ψ(W)+ϵ\psi(W)+\epsilon is the test function for V1(W)+ϵV_{1}(W)+\epsilon, V1(W)V_{1}(W) is also a supersolution of (5), then we utilize the standard comparison principle in Fleming and Soner (2006, Chapter 5) to obtain

V1(W)+ϵV2(W),W(δ,)V_{1}(W)+\epsilon\geq V_{2}(W),\ \ \forall W\in(\delta,\infty) (A-2)

Now, combine (A-1) and (A-2), we have

V1(W)+ϵV2(W),W>0.\displaystyle V_{1}(W)+\epsilon\geq V_{2}(W),\ \ \forall W>0.

Since ϵ\epsilon is arbitrary, the comparison principle holds and the proof is now complete. \Box

Proof of Proposition 2.

We prove Case (a), namely, α>0\alpha>0 and β>0\beta>0. Case (b) and Case (c) can be proved similarly.

We assume the solution is in the form of

V¯(W)=A1RW1R\displaystyle\bar{V}(W)={A\over 1-R}W^{1-R}

and plug it into the HJB equation:

δV¯(W)=maxX0[μXV¯(W)+12σ2X2V¯′′(W)]+maxc0{αu(c)cV¯(W)}+βu(W),(W>0).\delta\bar{V}(W)=\max_{X\geq 0}\left[\mu X\bar{V}^{\prime}(W)+\frac{1}{2}\sigma^{2}X^{2}\bar{V}^{\prime\prime}(W)\right]\\ +\max_{c\geq 0}\{\alpha u(c)-c\bar{V}^{\prime}(W)\}+\beta u(W),(W>0). (A-3)

We obtain c=(αA)1RWc^{*}=({\alpha\over A})^{1\over R}W, and Equation (A-3) becomes:

δA1RW1R=κARW1R+α1R(αA)1R1W1R(αA)1RAW1R+β1RW1R\displaystyle{\delta A\over 1-R}W^{1-R}={\kappa A\over R}W^{1-R}+{\alpha\over 1-R}({\alpha\over A})^{{1\over R}-1}W^{1-R}-({\alpha\over A})^{1\over R}AW^{1-R}+{\beta\over 1-R}W^{1-R}

By simplifying the above equation, we obtain

(δ1RκR)A=α1RR1RA11R+β1R({\delta\over 1-R}-{\kappa\over R})A={\alpha}^{1\over R}{R\over 1-R}A^{1-{1\over R}}+{\beta\over 1-R} (A-4)

It suffices to show that there is only one positive number AA which satisfy equation (A-4). Let

h(t)=(δ1RκR)tα1RR1Rt11Rβ1R.\displaystyle h(t)=({\delta\over 1-R}-{\kappa\over R})t-{\alpha}^{1\over R}{R\over 1-R}t^{1-{1\over R}}-{\beta\over 1-R}.

Then

h(t)=(δ1RκR)+α1Rt1R,\displaystyle h^{\prime}(t)=({\delta\over 1-R}-{\kappa\over R})+{\alpha}^{1\over R}t^{-{1\over R}},

and

h′′(t)=1Rα1Rt11R\displaystyle h^{\prime\prime}(t)=-{1\over R}\alpha^{1\over R}t^{-1-{1\over R}}

Case 1: If R<1R<1, by Assumption A, h(t)>0,t0h^{\prime}(t)>0,\forall t\geq 0. Moreover, h(0)=h(0)=-\infty and h(+)=+h(+\infty)=+\infty. Since h()h(\cdot) is a increasing function, there exists a unique positive number AA such that h(A)=0h(A)=0, which satisfies (A-4).

Case 2: If R>1R>1, then h(0)=β1R>0h(0)=-{\beta\over 1-R}>0, h(+)=h(+\infty)=-\infty. Moreover, h′′(t)<0,t>0h^{\prime\prime}(t)<0,\forall t>0 and h(0)=+h^{\prime}(0)=+\infty, h(+)=δ1RκR<0h^{\prime}(+\infty)={\delta\over 1-R}-{\kappa\over R}<0. Then h()h(\cdot) is concave, increase first, and then decrease on (0,)(0,\infty). Using the sign of h(0)h(0) and h()h(\infty), there is a unique positive number AA such that h(A)=0h(A)=0. The proof is complete. \Box

Proof of Proposition 3.

The ordinary differential equation for V¯(W)\bar{V}(W) in the unconstrained and constrained region becomes,

δV¯(W)=α1/RR1R(V¯)(1R)/R+κ(V¯(W)2V¯′′(W)+βu(W),\delta\bar{V}(W)=\alpha^{1/R}\frac{R}{1-R}(\bar{V}^{\prime})^{-(1-R)/R}+\kappa\frac{(\bar{V}^{\prime}(W)^{2}}{-\bar{V}^{\prime\prime}(W)}+\beta u(W), (A-5)

and

δV¯(W)=α1/RR1R(V¯)(1R)/R+μLV¯(W)+12σ2L2V¯′′(W)+βu(W),\delta\bar{V}(W)=\alpha^{1/R}\frac{R}{1-R}(\bar{V}^{\prime})^{-(1-R)/R}+\mu L\bar{V}^{\prime}(W)+\frac{1}{2}\sigma^{2}L^{2}\bar{V}^{\prime\prime}(W)+\beta u(W), (A-6)

respectively. Define a function

Y(W)=μV¯(W)+σ2LV¯′′(W),W>0.Y(W)=\mu\bar{V}^{\prime}(W)+\sigma^{2}L\bar{V}^{\prime\prime}(W),W>0. (A-7)

Then, Y(W)<0,W𝒰Y(W)<0,\forall W\in{\cal U}, and Y(W)>0Y(W)>0 for any WW\in{\cal B}.

Step 1. In the unconstrained region, the value function V¯()\bar{V}(\cdot) satisfies the ODE (A-5). By differentiating the ODE equation once and twice, we obtain

δV¯=α1/R(V¯)1RV¯′′2κV¯+κ(V¯)2V¯′′′(V¯′′)2+βu(W)\displaystyle\delta\bar{V}^{\prime}=-\alpha^{1/R}(\bar{V}^{\prime})^{-{1\over R}}\bar{V}^{\prime\prime}-2\kappa\bar{V}^{\prime}+\frac{\kappa(\bar{V}^{\prime})^{2}\bar{V}^{\prime\prime\prime}}{(\bar{V}^{\prime\prime})^{2}}+\beta u^{\prime}(W)

and

δV¯′′\displaystyle\delta\bar{V}^{\prime\prime} =\displaystyle= α1/R1R(V¯)1R1(V¯′′)2α1/R(V¯)1RV¯′′′\displaystyle\alpha^{1/R}{1\over R}(\bar{V}^{\prime})^{-{1\over R}-1}(\bar{V}^{\prime\prime})^{2}-\alpha^{1/R}(\bar{V}^{\prime})^{-{1\over R}}\bar{V}^{\prime\prime\prime}
2κV¯′′+κ(V¯)2V¯′′′′(V¯′′)2+2κV¯V¯′′′(V¯′′)3{(V¯′′)2V¯V¯′′′}+βu′′(W)\displaystyle-2\kappa\bar{V}^{\prime\prime}+\frac{\kappa(\bar{V}^{\prime})^{2}\bar{V}^{\prime\prime\prime\prime}}{(\bar{V}^{\prime\prime})^{2}}+\frac{2\kappa\bar{V}^{\prime}\bar{V}^{\prime\prime\prime}}{(\bar{V}^{\prime\prime})^{3}}\left\{(\bar{V}^{\prime\prime})^{2}-\bar{V}^{\prime}\bar{V}^{\prime\prime\prime}\right\}+\beta u^{\prime\prime}(W)

By the definition of Y(W)Y(W), the last two equations imply

δY\displaystyle\delta Y =\displaystyle= α1/R(V¯)1R[Y2μσ2LRY+1σ2LRY2V¯+μ2σ2LRV¯]2κY\displaystyle\alpha^{1/R}(\bar{V}^{\prime})^{-{1\over R}}\left[-Y^{\prime}-{2\mu\over\sigma^{2}LR}Y+{1\over\sigma^{2}LR}{Y^{2}\over\bar{V}^{\prime}}+{\mu^{2}\over\sigma^{2}LR}\bar{V}^{\prime}\right]-2\kappa Y
+κ(V¯)2(V¯′′)2Y′′+2κV¯V¯′′′(V¯′′)3{V¯′′σ2LYV¯σ2LY}\displaystyle+\frac{\kappa(\bar{V}^{\prime})^{2}}{(\bar{V}^{\prime\prime})^{2}}Y^{\prime\prime}+\frac{2\kappa\bar{V}^{\prime}\bar{V}^{\prime\prime\prime}}{(\bar{V}^{\prime\prime})^{3}}\left\{\frac{\bar{V}^{\prime\prime}}{\sigma^{2}L}Y-\frac{\bar{V}^{\prime}}{\sigma^{2}L}Y^{\prime}\right\}
+β[μu(W)+Lσ2u′′(W)]\displaystyle+\beta\left[\mu u^{\prime}(W)+L\sigma^{2}u^{\prime\prime}(W)\right]

We then define an elliptic operator on the unconstrained region by

𝒰[y]\displaystyle{\cal L}^{{\cal U}}[y] \displaystyle\equiv κ(V¯)2(V¯′′)2y′′2κV¯V¯′′′(V¯′′)3{V¯′′σ2LyV¯σ2Ly}+(δ+2κ)y\displaystyle-\frac{\kappa(\bar{V}^{\prime})^{2}}{(\bar{V}^{\prime\prime})^{2}}y^{\prime\prime}-\frac{2\kappa\bar{V}^{\prime}\bar{V}^{\prime\prime\prime}}{(\bar{V}^{\prime\prime})^{3}}\left\{\frac{\bar{V}^{\prime\prime}}{\sigma^{2}L}y-\frac{\bar{V}^{\prime}}{\sigma^{2}L}y^{\prime}\right\}+(\delta+2\kappa)y
α1/R(V¯)1R[y2μσ2LRy+1σ2LRy2V¯+μ2σ2LRV¯]\displaystyle-\alpha^{1/R}(\bar{V}^{\prime})^{-{1\over R}}\left[-y^{\prime}-{2\mu\over\sigma^{2}LR}y+{1\over\sigma^{2}LR}{y^{2}\over\bar{V}^{\prime}}+{\mu^{2}\over\sigma^{2}LR}\bar{V}^{\prime}\right]
β[μu(W)+Lσ2u′′(W)].\displaystyle-\beta\left[\mu u^{\prime}(W)+L\sigma^{2}u^{\prime\prime}(W)\right].

Therefore, 𝒰[Y]=0{\cal L}^{{\cal U}}[Y]=0 in 𝒰{\cal U}.

Step 2. In the constrained region {\cal B}, by differentiating the ODE (A-6) of V(W)V(W) once and twice, we have

δV¯=α1/R(V¯)1RV¯′′+μLV¯′′+12σ2L2V¯′′′+βu(W)\displaystyle\delta\bar{V}^{\prime}=-\alpha^{1/R}(\bar{V}^{\prime})^{-{1\over R}}\bar{V}^{\prime\prime}+\mu L\bar{V}^{\prime\prime}+\frac{1}{2}\sigma^{2}L^{2}\bar{V}^{\prime\prime\prime}+\beta u^{\prime}(W)

and

δV¯′′=α1/R1R(V¯)1R1(V¯′′)2α1/R(V¯)1RV¯′′′+μLV¯′′′+12σ2L2V¯′′′′+βu′′(W).\displaystyle\delta\bar{V}^{\prime\prime}=\alpha^{1/R}{1\over R}(\bar{V}^{\prime})^{-{1\over R}-1}(\bar{V}^{\prime\prime})^{2}-\alpha^{1/R}(\bar{V}^{\prime})^{-{1\over R}}\bar{V}^{\prime\prime\prime}+\mu L\bar{V}^{\prime\prime\prime}+\frac{1}{2}\sigma^{2}L^{2}\bar{V}^{\prime\prime\prime\prime}+\beta u^{\prime\prime}(W).

Then,

δY\displaystyle\delta Y =\displaystyle= α1/R(V¯)1R[Y2μσ2LRY+1σ2LRY2V¯+μ2σ2LRV¯]+μLY+12σ2L2Y′′\displaystyle\alpha^{1/R}(\bar{V}^{\prime})^{-{1\over R}}\left[-Y^{\prime}-{2\mu\over\sigma^{2}LR}Y+{1\over\sigma^{2}LR}{Y^{2}\over\bar{V}^{\prime}}+{\mu^{2}\over\sigma^{2}LR}\bar{V}^{\prime}\right]+\mu LY^{\prime}+\frac{1}{2}\sigma^{2}L^{2}Y^{\prime\prime}
+β[μu(W)+σ2Lu′′(W)].\displaystyle+\beta\left[\mu u^{\prime}(W)+\sigma^{2}Lu^{\prime\prime}(W)\right].

Similarly, we define an elliptic operator

[y]\displaystyle{\cal L}^{{\cal B}}[y] =\displaystyle= 12σ2L2y′′μLy+δyβμu(W)βσ2Lu′′(W)\displaystyle-\frac{1}{2}\sigma^{2}L^{2}y^{\prime\prime}-\mu Ly^{\prime}+\delta y-\beta\mu u^{\prime}(W)-\beta\sigma^{2}Lu^{\prime\prime}(W)
\displaystyle- α1/r(V¯)1R[y2μσ2LRy+1σ2LRy2V¯+μ2σ2LRV¯]\displaystyle\alpha^{1/r}(\bar{V}^{\prime})^{-{1\over R}}\left[-y^{\prime}-{2\mu\over\sigma^{2}LR}y+{1\over\sigma^{2}LR}{y^{2}\over\bar{V}^{\prime}}+{\mu^{2}\over\sigma^{2}LR}\bar{V}^{\prime}\right]

Then [Y]=0{\cal L}^{{\cal B}}[Y]=0 in .{\cal B}.

Step 3. By straightforward calculation, we obtain

[0]=𝒰[0]=βWR1(μWσ2LR)α1/Rμ2σ2LR(V¯)11R.{\cal L}^{{\cal B}}[0]={\cal L}^{{\cal U}}[0]=-\beta W^{-R-1}(\mu W-\sigma^{2}LR)-\alpha^{1/R}{\mu^{2}\over\sigma^{2}LR}(\bar{V}^{\prime})^{1-{1\over R}}. (A-8)

For simplicity, let

g(W)βWR1(μWσ2LR)α1/Rμ2σ2LR(V¯)11R.\displaystyle g(W)\equiv-\beta W^{-R-1}(\mu W-\sigma^{2}LR)-\alpha^{1/R}{\mu^{2}\over\sigma^{2}LR}(\bar{V}^{\prime})^{1-{1\over R}}.

Step 4. By the nontrivial unconstrained region condition (1), there exists a real number W1>0W_{1}>0 such that (0,W1)𝒰(0,W_{1})\subseteq{\cal U} and Y(W1)=0Y(W_{1})=0. The existence of W1W_{1} follows from Proposition 2 that 𝒰(0,){\cal U}\neq(0,\infty). We show that (W1,)(W_{1},\infty)\subseteq{\cal B} by a contradiction argument.

Assume that, there exists W2>W1W_{2}>W_{1} such that (W1,W2)(W_{1},W_{2})\subseteq{\cal B} and Y(W2)=0Y(W_{2})=0. Moreover, there exists W3>W2W_{3}>W_{2} such that (W2,W3)𝒰(W_{2},W_{3})\subseteq{\cal U}. We show this is impossible and thus finish the proof.

We first show that the constant function y=0y=0 is not the supersolution for [y]=0{\cal L}^{{\cal B}}[y]=0 in the region (W1,W2)(W_{1},W_{2}). The reason is as follows. Otherwise, since [Y]=0{\cal L}^{{\cal B}}[Y]=0 in the region (W1,W2)(W_{1},W_{2})\subseteq{\cal B} and Y(W1)=Y(W2)=0Y(W_{1})=Y(W_{2})=0, then by the comparison principle, Y(W)y=0Y(W)\leq y=0 for W(W1,W2)W\in(W_{1},W_{2}). However, by its definition of ,Y(W)>0{\cal B},Y(W)>0 for all W(W1,W2)W\in(W_{1},W_{2}). This contradiction show that the constant funciton y=0y=0 is not a supersolution of [y]=0{\cal L}^{{\cal B}}[y]=0 in the region (W1,W2)(W_{1},W_{2}). Therefore, there exists some W0(W1,W2)W_{0}\in(W_{1},W_{2}) such that, at W=W0W=W_{0},

[0]=g(W0)<0.\displaystyle{\cal L}^{{\cal B}}[0]=g(W_{0})<0.

We divide the proof into two situations because of the single crossing condition (3).

Case 1. The function g(W)g(W) does not change sign at all in (0,)(0,\infty).

In this case, g(W)<0g(W)<0 for all (0,)(0,\infty). We now consider the region (W2,W3)𝒰(W_{2},W_{3})\in{\cal U} and the operator 𝒰{\cal L}^{{\cal U}}. Since 𝒰[0]0{\cal L}^{{\cal U}}[0]\leq 0 in this small region, the constant function y=0y=0 is the subsolution for 𝒰[0]=0{\cal L}^{{\cal U}}[0]=0. Since Y(W2)=Y(W3)=0Y(W_{2})=Y(W_{3})=0, by the comparison principle, we obtain Y(W)0,W(W2,W3)Y(W)\geq 0,\forall W\in(W_{2},W_{3}), which is impossible since Y(W)Y(W) is strictly negative over the region (W2,W3)𝒰(W_{2},W_{3})\subseteq{\cal U}, the unconstrained region.

Case 2. The function g(W)g(W) change the sign in exactly one time.

In this case, β\beta must be positive. Otherwise, g(W)=α1/Rμ2σ2LR(V¯)11R<0g(W)=-\alpha^{1/R}{\mu^{2}\over\sigma^{2}LR}(\bar{V}^{\prime})^{1-{1\over R}}<0 never change the sign. By using the order of value function condition (2), V¯(W)\bar{V}^{\prime}(W) has the same order of WRW^{-R}, then (V¯)11R(\bar{V}^{\prime})^{1-{1\over R}} has the same order of W1RW^{1-R}. By comparing the order of WW of each term in in the function g(W)g(W), the term σ2LRβW1R\sigma^{2}LR\beta W^{-1-R} dominates other terms for small value of WW. Then limW0g(W)>0\lim_{W\downarrow 0}g(W)>0 (including positive infinite). Therefore, the function g(W)g(W) must be negative for all W>W0W>W_{0}. In particular, g(W)<0,W(W2,W3)g(W)<0,\forall W\in(W_{2},W_{3}). Following the same proof as in Case 1, the constant y=0y=0 is the subsolution for 𝒰[0]=0{\cal L}^{{\cal U}}[0]=0. It implies that Y(W)0,W(W2,W3)Y(W)\geq 0,\forall W\in(W_{2},W_{3}). This leads a contradiction again by the definition of 𝒰{\cal U}.

By the above proof, we have shown that (W1,)=(W_{1},\infty)={\cal B} by a contradiction argument. \Box

Lemma 7.1

In Problem (A), there exists W>0W^{*}>0 such that the open interval (0,W)(0,W^{*}) is included in the unconstrained 𝒰~\tilde{\cal U} and X(W)=LX^{*}(W^{*})=L

Proof:  Assume not, then there exists a sequence of Wn0W_{n}\rightarrow 0, such that X(Wn)=LX^{*}(W_{n})=L and (from the definition of the constrained domain and its corresponding HJB equation):

δU(Wn)R1R(U(Wn))11R+12μLU(Wn).\displaystyle\delta U(W_{n})\geq{R\over 1-R}(U^{\prime}(W_{n}))^{1-{1\over R}}+{1\over 2}\mu LU^{\prime}(W_{n}).

Set f(t)=R1Rt11R+12μLtf(t)={R\over 1-R}t^{1-{1\over R}}+{1\over 2}\mu Lt. Then the function f(t)f(t) attains its minimum at t=(12Lμ)Rt^{*}=({1\over 2}L\mu)^{-R} and f(t)=11R[12Lμ]1Rf(t^{*})={1\over 1-R}[{1\over 2}L\mu]^{1-R}. We divide the proof in two cases.

Case 1. If R<1R<1, let nn\rightarrow\infty, then δU(Wn)\delta U(W_{n}) converges to u(0)=0u(0)=0. On the other hand, the right side of the last inequality is bounded from below by a positive number 11R[12Lμ]1R{1\over 1-R}[{1\over 2}L\mu]^{1-R}, which is contradiction.

Case 2. If R>1R>1, let nn\rightarrow\infty, the δU(Wn)\delta U(W_{n}) tends to -\infty but right side is always bounded below by a finite number, 11R[12Lμ]1R{1\over 1-R}[{1\over 2}L\mu]^{1-R}, another contradiction. We have thus finished the proof. \Box

Proof of Proposition 4.

We prove Proposition 4 by verifying condition (1) and (2) in Proposition 6, since the condition (3) holds naturally. Lemma 7.1 implies condition (1) for the unconstrained region. As for condition (2), we first notice that U(W)U(W) is bounded by the value function (Merton, 1971) without the risk constrain region. Therefore, there exists a positive number C1C_{1} such that U(W)C1W1RU(W)\leq C_{1}W^{1-R}. Moreover, by following a similar argument in Vila and Zariphoulou (1997, Lemma C.1, (ii)), it can be shown that U(W)C0W1R,W(0,)U(W)\geq C_{0}W^{1-R},\forall W\in(0,\infty) for a positive number C0C_{0}. Then, U(W)U^{\prime}(W) has the same order of WRW^{-R}. Alternatively, a similar argument in the following Lemma 7.2 also demonstrates the order of value function condition (2) in this case. The proof is finished. \Box

Remark 7.1

Vila and Zariphopoulou (1997, Proposition 4.4) shows a similar result under the constraint that Xtk(Wt+L)X_{t}\leq k(W_{t}+L). We can modify some arguments in Vila and Zariphopoulou (1997) to prove Proposition 4 under the risk capacity constraint (the details are available upon request). However, this method cannot be used in Problem (B) and the general problem (4)).

Proof of Proposition 5.

By Karatzas and Shreve (1998), the value function in the unconstrained region can be written as J(m,n)(W)J^{(m,n)}(W) for parameters m,nm,n as follows. For any real number m,nm,n, and W0W^{*}\geq 0, we define the a class of strictly increasing and concave function, W(0,W]J(m,n)(W)+W\in(0,W^{*}]\rightarrow J^{(m,n)}(W)\in\mathbb{R}^{+} such that

W=W(C)1λ(C+mCω++nCω),\displaystyle W=W(C)\equiv\frac{1}{\lambda^{\infty}}\left(C+mC^{\omega^{+}}+nC^{\omega^{-}}\right),

which is strictly increasing, and

J(C)1λ{C1R1R+mω+ω+RCω+R+nωωRCωR}.\displaystyle J(C)\equiv\frac{1}{\lambda^{\infty}}\left\{\frac{C^{1-R}}{1-R}+m\frac{\omega^{+}}{\omega^{+}-R}C^{\omega^{+}-R}+n\frac{\omega^{-}}{\omega^{-}-R}C^{\omega^{-}-R}\right\}.

These two functions W(C),J(C)W(C),J(C) with a variable CC clearly introduce a well-defined increasing and C2C^{2} smooth function, and U(W)=J(C)U(W)=J(C). It can be shown that when W=0W=0, the optimal consumption rate is zero. It implies that n=0n=0. Then W=W(C)=H(C)W=W(C)=H(C), and W=H(C)W^{*}=H(C^{*}) for a unique finite number CC^{*}. It remains to solve {C,m}\{C^{*},m\}, by using both the value-matching and smooth-fit condition below.

We start with the smooth fit condition at WW^{*}. The value function U(W)U(W) satisfies

L=μσ2UWUWW|W=WL=-\frac{\mu}{\sigma^{2}}\frac{U_{W}}{U_{WW}}|_{W=W^{*}} (A-9)

In the unconstrained region, noting that UW=CRU_{W}=C^{-R}, then UW(W)=(C)RU_{W}(W^{*})=(C^{*})^{-R}, and

UWW=RCR11dW/dC=R1CR+1H(C),\displaystyle U_{WW}=-RC^{-R-1}\frac{1}{dW/dC}=-R\frac{1}{C^{R+1}H^{\prime}(C)},

then, we obtain

L=μσ2RCH(C).L=\frac{\mu}{\sigma^{2}R}C^{*}H^{\prime}(C^{*}). (A-10)

We next consider the value function in the constrained region (W,)(W^{*},\infty) which satisfies

R1RUW11R+μLUW+12σ2L2UWWδU=0.\frac{R}{1-R}U_{W}^{1-\frac{1}{R}}+\mu LU_{W}+\frac{1}{2}\sigma^{2}L^{2}U_{WW}-\delta U=0. (A-11)

We define a function K(x):[W,)(0,)K(x):[W^{*},\infty)\rightarrow(0,\infty) by the ordinary differential equation

12σ2L2K′′(x)=δK(x)R1RK(x)11RμLK(x).\frac{1}{2}\sigma^{2}L^{2}K^{\prime\prime}(x)=\delta K(x)-\frac{R}{1-R}K^{\prime}(x)^{1-\frac{1}{R}}-\mu LK^{\prime}(x). (A-12)

The value-matching condition is

K(W)=J(C),K(W^{*})=J(C^{*}), (A-13)

and the smooth-fit condition at WW^{*} is

K(W)=(C)R.K^{\prime}(W^{*})=(C^{*})^{-R}. (A-14)

It is not sufficient to characterize the function K(x)K(x) yet as WW^{*} is to be solved. For the end, we notice that U(W,0)U(W)U(W)U(W,0)\leq U(W)\leq U^{\infty}(W), K(W)K(W) has the order as W1RW^{1-R} when WW\rightarrow\infty.

Case 1. If R>1R>1, then limWU(W)=0\lim_{W\rightarrow\infty}U(W)=0. Then we derive another boundary condition K()=0K(\infty)=0.

If this case, the ordinary differential equation theory (King, Billingham and Otto, 2003) characterizes the function K(x)K(x) and WW^{*}. Specifically, we define a function h:[0,1W](0,),h(W)=K(1W)h:[0,\frac{1}{W^{*}}]\rightarrow(0,\infty),h(W)=K(\frac{1}{W}). Then limW0h(W)=0,h(1W)=J(C)\lim_{W\rightarrow 0}h(W)=0,h(\frac{1}{W^{*}})=J(C^{*}), and it is straightforward to see that h(x)h(x) satisfies a second-order differential equation h′′(x)=H(x,h(x),h(x))h^{\prime\prime}(x)=H(x,h(x),h^{\prime}(x)). Finally, WW^{*} satisfies the following equation (since K(W)=(C)RK^{\prime}(W^{*})=(C^{*})^{-R})

(C)R=h(1W)1(W)2,\displaystyle(C^{*})^{-R}=h^{\prime}(\frac{1}{W^{*}})\frac{-1}{(W^{*})^{2}}, (A-15)

implying

h(1W)=(W)2(C)R=H(C)2(C)R.h^{\prime}(\frac{1}{W^{*}})=-(W^{*})^{2}(C^{*})^{-R}=-H(C^{*})^{2}(C^{*})^{-R}. (A-16)

Case 2. If R<1R<1, then limWU(W)=\lim_{W\rightarrow\infty}U(W)=\infty.

We define another function k(x):[0,1W](0,)k(x):[0,\frac{1}{W}]\rightarrow(0,\infty) by k(x)=1K(1/x)k(x)=\frac{1}{K(1/x)}. Then limW0k(W)=0,k(1W)=1J(C)\lim_{W\rightarrow 0}k(W)=0,k(\frac{1}{W^{*}})=\frac{1}{J(C^{*})}. It is straightforward to verify that k(x)k(x) satisfies a second-order ordinary differential equation. Moreover,

k(x)=1K(1/x)2K(1/x)1x2,\displaystyle k^{\prime}(x)=\frac{1}{K(1/x)^{2}}K^{\prime}(1/x)\frac{1}{x^{2}},

implying

k(1W)=(C)RJ(C)2H(C)2.k^{\prime}(\frac{1}{W^{*}})=\frac{(C^{*})^{-R}}{J(C^{*})^{2}}H(C^{*})^{2}. (A-17)

By using the characterization of the value function in Proposition 1 and the value function is C2C^{2} smooth, the existence and the uniqueness of these two parameters C,m{C^{*},m} is guaranteed by two equations (A-10) and (A-14). \Box

To simplify the notations, we use V0(W),V(W)V^{0}(W),V^{\infty}(W) to represent V(W,0),V(W,)V(W,0),V(W,\infty).

We prove two lemmas in proving Proposition 6.

Lemma 7.2

Assume V(W)V(W) is C2C^{2} smooth, then there exists two positive numbers C0,C1C_{0},C_{1} such that

C0WRV(W)C1WR,W>0.\displaystyle C_{0}W^{-R}\leq V^{\prime}(W)\leq C_{1}W^{-R},\forall W>0.

In particular, limW0V(W)=\lim_{W\rightarrow 0}V^{\prime}(W)=\infty and limWV(W)=0\lim_{W\rightarrow\infty}V^{\prime}(W)=0.

Proof:  By a direct calculation, V0(W)=u(W)δV^{0}(W)=\frac{u(W)}{\delta} and Proposition 2 states that V(W)=u(W)δρV^{\infty}(W)=\frac{u(W)}{\delta-\rho}. Then, by using the concave property of the function V()V(\cdot), for any positive number W>0W>0 and E>0E>0, we have

V(W)\displaystyle V^{\prime}(W) \displaystyle\geq 1E[V(W+E)V(W)]\displaystyle{1\over E}[V(W+E)-V(W)]
\displaystyle\geq 1E[V0(W+E)V(W)]\displaystyle{1\over E}[V^{0}(W+E)-V^{\infty}(W)]
=\displaystyle= 1E[11R1δ(W+E)1R11R1δρW1R].\displaystyle\frac{1}{E}\left[\frac{1}{1-R}\frac{1}{\delta}(W+E)^{1-R}-\frac{1}{1-R}\frac{1}{\delta-\rho}W^{1-R}\right].

Choosing E=kWE=kW, we have

V(W)1k[11R1δ(k+1)1R11R1δρ]WR\displaystyle V^{\prime}(W)\geq{1\over k}\left[\frac{1}{1-R}\frac{1}{\delta}(k+1)^{1-R}-\frac{1}{1-R}\frac{1}{\delta-\rho}\right]W^{-R}

Let

C0=supk>01k(1R)[1δ(k+1)1R1δρ]C_{0}=\sup_{k>0}\frac{1}{k(1-R)}\left[\frac{1}{\delta}(k+1)^{1-R}-\frac{1}{\delta-\rho}\right] (A-18)

where x+=max(x,0)x^{+}=\max(x,0). It is easy to see C0C_{0} is positive no matter R>1R>1 or R<1R<1.

By the same reason, for any E=βW,β(0,1)E=\beta W,\beta\in(0,1), we have

V(W)\displaystyle V^{\prime}(W) \displaystyle\leq 1βW[V(W)V(WβW)]\displaystyle\frac{1}{\beta W}\left[V(W)-V(W-\beta W)\right]
\displaystyle\leq 1βW[V(W)V0(WβW)]\displaystyle\frac{1}{\beta W}\left[V^{\infty}(W)-V^{0}(W-\beta W)\right]
\displaystyle\leq 1β(1R){1δρ1δ(1β)1R}WR.\displaystyle\frac{1}{\beta(1-R)}\left\{\frac{1}{\delta-\rho}-\frac{1}{\delta}(1-\beta)^{1-R}\right\}W^{-R}.

Let

C1=inf0<β<11β(1R)[1δρ1δ(1β)1R]C_{1}=\inf_{0<\beta<1}\frac{1}{\beta(1-R)}\left[\frac{1}{\delta-\rho}-\frac{1}{\delta}(1-\beta)^{1-R}\right] (A-19)

Clearly, C1C_{1} is positive no matter R>1R>1 or R<1R<1. The proof is finished. \Box

Remark 7.2

The proof of Lemma 7.2 is similar to a method in Xu and Yi (2016) for an optimal portfolio choice problem on a consumption constraint.

Lemma 7.3

Assume V()V(\cdot) is C2C^{2} smooth, then there exists W~\tilde{W} such that the open interval (0,W~)(0,\tilde{W}) is included in 𝒰{\cal U}, and X(W~)=L.X^{*}(\tilde{W})=L.

Proof:  Assume not, then there exists a sequence Wn0W_{n}\rightarrow 0 such that X(Wn)=LX^{*}(W_{n})=L. By using equation (21) and Lemma 7.2, we have

δV(Wn)\displaystyle\delta V(W_{n}) \displaystyle\geq 11RWn1R+12μLV(Wn)\displaystyle{1\over 1-R}W_{n}^{1-R}+{1\over 2}\mu LV^{\prime}(W_{n})
\displaystyle\geq 11RWn1R+12μLC0WnR\displaystyle{1\over 1-R}W_{n}^{1-R}+{1\over 2}\mu LC_{0}{W_{n}}^{-R}
=\displaystyle= WnR(11RWn+12μLC0)\displaystyle W_{n}^{-R}({1\over 1-R}W_{n}+{1\over 2}\mu LC_{0})

Case 1. R<1R<1. Since V(W)V(W) is continuous , as nn\rightarrow\infty, the left hand side of the last inequality approaches to δV(0)=0\delta V(0)=0.

Case 2. R>1R>1. The left hand side of the last inequality approaches to δV(0)=\delta V(0)=-\infty. However, the term WnR(11RWn+12μLC0)W_{n}^{-R}({1\over 1-R}W_{n}+{1\over 2}\mu LC_{0}) tends to ++\infty on the right hand side of the last equality, which is a contradiction. \Box

Proof of Proposition 6.

Since the single crossing condition holds for the function g(W)=[μu(W)+σ2Lu′′(W)]=WR1(μWσ2LR)g(W)=-[\mu u^{\prime}(W)+\sigma^{2}Lu^{\prime\prime}(W)]=-W^{-R-1}(\mu W-\sigma^{2}LR) in this case, this proposition follows from Proposition 3, Lemma 7.2 and Lemma 7.3. \Box

Lemma 7.4

Let F:(0,)×××F:(0,\infty)\times\mathbb{R}\times\mathbb{R}\times\mathbb{R}\rightarrow\mathbb{R} be a continuous and elliptic operator, that is, F(x,r,p,X)F(x,r,p,Y),XYF(x,r,p,X)\leq F(x,r,p,Y),\forall X\geq Y. Assume V(x)V(x) is a continuous viscosity solution of a second-order (HJB) equation F(x,u,ux,uxx)=0F(x,u,u_{x},u_{xx})=0 and the region of xx is 𝒟=(0,){\cal D}=(0,\infty). Moreover, there exists xx^{*} such that V(x)V(x) is smooth in both (0,x)(0,x^{*}) and (x,)(x^{*},\infty), then V(x)V(x) must satisfies the smooth-fit condition at xx^{*}, that is, V(x)=V(x+)V^{\prime}(x^{*}-)=V^{\prime}(x^{*}+).

Proof:  Without lost of generality, we assume that V(x)<0<V(x+)V^{\prime}(x^{*}-)<0<V^{\prime}(x^{*}+) and derive a contradiction. Since there is no available test function, the subsolution holds automatically. We next check the supersolution. Let the test function in the form of

ψ(x)V(x)+12[V(x)+V(x+)](xx)+α(xx)2\displaystyle\psi(x)\equiv V(x^{*})+\frac{1}{2}\left[V^{\prime}(x^{*}-)+V^{\prime}(x^{*}+)\right](x-x^{*})+\alpha(x-x^{*})^{2}

We claim that α\alpha can take any real value: To make ψ(x)\psi(x) the valid test function, we need to guarantee that ψ(x)V(x)\psi(x)\leq V(x) when xx is in a small neighborhood of xx^{*}. However, when xxx\rightarrow x^{*}, the linear term 12[V(x)+V(x+)](xx)\frac{1}{2}\left[V^{\prime}(x^{*}-)+V^{\prime}(x^{*}+)\right](x-x^{*}) will dominate the quadratic term α(xx)2\alpha(x-x^{*})^{2}. Therefore, when xx and xx^{*} are close enough, we could choose sufficiently large α\alpha such that ψ(x)V(x)\psi(x)\leq V(x). It is now clear that α\alpha can take any value.
Now, apply the viscosity property at xx^{*}, we have

F(x,V(x),12[V(x)+V(x+)],2α)0,\displaystyle F\left(x^{*},V(x^{*}),\frac{1}{2}\left[V^{\prime}(x^{*}-)+V^{\prime}(x^{*}+)\right],2\alpha\right)\geq 0,

which is impossible by the free choice of the parameter α\alpha. \Box

Remark 7.3

Lemma 7.4 can be viewed as a converse statement of Proposition 6. If the value function is smooth in each region (0,W),(W,)(0,W^{*}),(W^{*},\infty), then the value function must be smooth as long as the value function is continuous and a viscosity solution of a HJB equation.

Proof of Proposition 7.

We divide the proof into several steps.

Step 1. Assuming WW^{*} is known, we derive candidate solution of Equation (A-6) in the constrained region. To simplify notation we still use V(W)V(W) to represent the feasible solution of the value function, being a solution of a corresponding ODE.

The solution of the homogeneous ODE, 12σ2L2VWW+LμVWδV(W)=0\frac{1}{2}\sigma^{2}L^{2}V_{WW}+L\mu V_{W}-\delta V(W)=0, is written as C1eβ1W+C2eβ2W.C_{1}e^{\beta_{1}W}+C_{2}e^{\beta_{2}W}. By the method of partial integral, one particular solution for the non-linear ODE (A-6) is

V0(W)=0W2σ2L2u(x){eβ1xeβ2Weβ1Weβ2xW(eβ1x,eβ2x)}𝑑xV_{0}(W)=-\int_{0}^{W}\frac{2}{\sigma^{2}L^{2}}u(x)\left\{\frac{e^{\beta_{1}x}e^{\beta_{2}W}-e^{\beta_{1}W}e^{\beta_{2}x}}{W(e^{\beta_{1}x},e^{\beta_{2}x})}\right\}dx (A-20)

where W(f,g)=fgfgW(f,g)=fg^{\prime}-f^{\prime}g is the Wronskian determinants of two solutions {f,g}\{f,g\} of a homogeneous second-order ODE. By a straightforward calculation,

V0(W)\displaystyle V_{0}(W) =\displaystyle= 2(β1β2)(1R)σ2L2\displaystyle\frac{2}{(\beta_{1}-\beta_{2})(1-R)\sigma^{2}L^{2}}
×{eβ2W0Wx1Reβ2x𝑑xeβ1W0Wx1Reβ1x𝑑x}.\displaystyle\times\left\{e^{\beta_{2}W}\int_{0}^{W}x^{1-R}e^{-\beta_{2}x}dx-e^{\beta_{1}W}\int_{0}^{W}x^{1-R}e^{-\beta_{1}x}dx\right\}.

Therefore, the function V0(W)V_{0}(W) is well-defined and it can be expressed in terms of the incomplete gamma function. A general solution of the ODE (A-6) is

V(W)=C1eβ1W+C2eβ2W+V0(W).V(W)=C_{1}e^{\beta_{1}W}+C_{2}e^{\beta_{2}W}+V_{0}(W). (A-21)

Step 2. Assuming WW^{*} is known, we show that C1=2(β1β2)(1R)σ2L2(β1)R2Γ(2R)C_{1}={2\over(\beta_{1}-\beta_{2})(1-R)\sigma^{2}L^{2}}(\beta_{1})^{R-2}\Gamma(2-R) in Equation (A-21).

By Proposition 2, V(W)W1R\frac{V(W)}{W^{1-R}} is bounded above by a constant. Therefore, V(W)/eβ1W0V(W)/e^{\beta_{1}W}\rightarrow 0 as WW\rightarrow\infty in the constrained region. On the other hand, by (A-21), as WW\rightarrow\infty

C1+C2e(β2β1)W+V0(W)eβ1W0C_{1}+C_{2}e^{(\beta_{2}-\beta_{1})W}+{V_{0}(W)\over e^{\beta_{1}W}}\rightarrow 0 (A-22)

Note that

V0(W)eβ1W=2(β1β2)(1R)σ2L2×{e(β2β1)W0Wx1Reβ2x𝑑x0Wx1Reβ1x𝑑x}.\frac{V_{0}(W)}{e^{\beta_{1}W}}=\frac{2}{(\beta_{1}-\beta_{2})(1-R)\sigma^{2}L^{2}}\times\left\{e^{(\beta_{2}-\beta_{1})W}\int_{0}^{W}x^{1-R}e^{-\beta_{2}x}dx-\int_{0}^{W}x^{1-R}e^{-\beta_{1}x}dx\right\}. (A-23)

For the the first term in the bracket of (A-23), since β2<0\beta_{2}<0, we have

e(β2β1)W0Wx1Reβ2x𝑑x\displaystyle e^{(\beta_{2}-\beta_{1})W}\int_{0}^{W}x^{1-R}e^{-\beta_{2}x}dx =\displaystyle= eβ1W0Wx1Reβ2(Wx)𝑑x\displaystyle e^{-\beta_{1}W}\int_{0}^{W}x^{1-R}e^{\beta_{2}(W-x)}dx
\displaystyle\leq eβ1W0Wx1R𝑑x\displaystyle e^{-\beta_{1}W}\int_{0}^{W}x^{1-R}dx
=\displaystyle= eβ1WW2R2R\displaystyle e^{-\beta_{1}W}{W^{2-R}\over 2-R}

which tends to 0 as WW\rightarrow\infty.
For the second term in the bracket of (A-23), change of variable y=β1xy=\beta_{1}x leads to

0Wx1Reβ1x𝑑x=(β1)R20β1Wy1Rey𝑑y.\displaystyle\int_{0}^{W}x^{1-R}e^{-\beta_{1}x}dx=(\beta_{1})^{R-2}\int_{0}^{\beta_{1}W}y^{1-R}e^{-y}dy.

By the property of incomplete Gamma function (B-3) in Appendix B,

(β1)R20β1Wy1Rey𝑑y(β1)R2Γ(2R).\displaystyle(\beta_{1})^{R-2}\int_{0}^{\beta_{1}W}y^{1-R}e^{-y}dy\rightarrow(\beta_{1})^{R-2}\Gamma(2-R).

Then, we obtain

C1=2(β1β2)(1R)σ2L2(β1)R2Γ(2R).\displaystyle C_{1}={2\over(\beta_{1}-\beta_{2})(1-R)\sigma^{2}L^{2}}(\beta_{1})^{R-2}\Gamma(2-R).

In Step 5 below, we show that C2=C(W)C_{2}=C(W^{*}) in Equation (23).

Step 3. Assuming WW^{*} is known, we characterize the feasible solution in the unconstrained region.

We introduce a new variable gg by V(W)=gRV^{\prime}(W)=g^{-R}. Since V()V(\cdot) is concave by a standard argument, V(W)V^{\prime}(W) is a decreasing function. Then, W=G(g)W=G(g) for an increasing function G()G(\cdot). Similarly, we can write gg as a well-defined increasing function of WW, g=g(W)g=g(W). We characterize the function G()G(\cdot) and derive the feasible function in terms of the auxiliary function G()G(\cdot).

Since W=G(g(W))W=G(g(W)), then 1=G(g)g(W)1=G^{\prime}(g)g^{\prime}(W), yielding G(g)=1/g(W)G^{\prime}(g)=1/g^{\prime}(W). By using V′′(W)=RgR1G(g),V^{\prime\prime}(W)={-Rg^{-R-1}\over G^{\prime}(g)}, the HJB equation becomes

δV(G(g))=11R[G(g)]1R+κRgR+1G(g).\displaystyle\delta V(G(g))={1\over 1-R}[G(g)]^{1-R}+{\kappa\over R}g^{-R+1}G^{\prime}(g).

We differentiate both sides of the above equation again with respect to WW, obtaining

G′′(g)=Rκg1G(g)[δρgR(G(g))R]G^{\prime\prime}(g)={R\over\kappa}g^{-1}G^{\prime}(g)\left[\delta-\rho-g^{R}(G(g))^{-R}\right] (A-24)

Since G()G(\cdot) is strictly increasing, the unconstrained region of WW, WWW\leq W^{*}, corresponds one-one to a region of gg, gg=g(W)g\leq g^{*}=g(W^{*}). Moreover, for any WWW\leq W^{*},

V(W)\displaystyle V(W) =\displaystyle= u(0)δ+0WVW𝑑W\displaystyle\frac{u(0)}{\delta}+\int_{0}^{W}V_{W}dW
=\displaystyle= u(0)δ+0G1(W)gRG(g)𝑑g.\displaystyle\frac{u(0)}{\delta}+\int_{0}^{G^{-1}(W)}g^{-R}G^{\prime}(g)dg.

Therefore, the feasible value function in the unconstrained region is uniquely determined by the auxiliary function G()G(\cdot). The number gg^{*} is shown to be D(W)D(W^{*}) in Step 5 below.

Step 4. Assuming WW^{*} is known, we derive the boundary condition for ordinary differential equation (A-24).

Since V(0)=+V^{\prime}(0)=+\infty (Lemma 7.2), we have G(0)=0G(0)=0. Second, at W=WW=W^{*}, G(g)=WG(g^{*})=W^{*}. Moreover, the constraint μσ2V(W)V′′(W)=L-{\mu\over\sigma^{2}}{V^{\prime}(W^{*}-)\over V^{\prime\prime}(W^{*}-)}=L implies that

G(g)=LRσ2μ(g)1.\displaystyle G^{\prime}(g^{*})={LR\sigma^{2}\over\mu}(g^{*})^{-1}.

By the characterization of the feasible value function in Step 3, the required smooth-fit condition is

(g)R=2(β1β2)(1R)σ2L2(β1)R1Γ(2R)eβ1W+C2β2eβ2W+V0(W).(g^{*})^{-R}={2\over(\beta_{1}-\beta_{2})(1-R)\sigma^{2}L^{2}}(\beta_{1})^{R-1}\Gamma(2-R)e^{\beta_{1}W}+C_{2}\beta_{2}e^{\beta_{2}W^{*}}+V_{0}^{\prime}(W^{*}). (A-25)

Therefore, the boundary condition of the ODE (A-24) are G(0)=0G(0)=0, G(g)=WG(g^{*})=W^{*} and G(g)=LRσ2μ(g)1G^{\prime}(g^{*})={LR\sigma^{2}\over\mu}(g^{*})^{-1}. It remans to determine gg^{*} and CC in the next step.

Step 5. Assuming the smooth-fit condition of the value function V(W)V(W), we show that C2=C(W)C_{2}=C(W^{*}) and g=D(W)g^{*}=D(W^{*}).

The smooth-fit condition can be written as μσ2V(W+)V′′(W+)=L-{\mu\over\sigma^{2}}{V^{\prime}(W^{*}+)\over V^{\prime\prime}(W^{*}+)}=L. Then, the feasible function in Step 2 implies that

μ[2(β1β2)(1R)σ2L2(β1)R1Γ(2R)eβ1W+C2β2eβ2W+V0(W)]\displaystyle-\mu\left[{2\over(\beta_{1}-\beta_{2})(1-R)\sigma^{2}L^{2}}(\beta_{1})^{R-1}\Gamma(2-R)e^{\beta_{1}W}+C_{2}\beta_{2}e^{\beta_{2}W^{*}}+V_{0}^{\prime}(W^{*})\right]
=σ2L[2(β1β2)(1R)σ2L2(β1)RΓ(2R)eβ1W+C2β22eβ2W+V0′′(W)]\displaystyle=\sigma^{2}L\left[{2\over(\beta_{1}-\beta_{2})(1-R)\sigma^{2}L^{2}}(\beta_{1})^{R}\Gamma(2-R)e^{\beta_{1}W}+C_{2}\beta_{2}^{2}e^{\beta_{2}W^{*}}+V_{0}^{\prime\prime}(W^{*})\right]

Solving this equation, we obtain C2=C(W)C_{2}=C(W^{*}) as in (23). By Equation (A-25), we have g=D(W)g^{*}=D(W^{*}).

Step 6. Assuming the existence of WW^{*} (which is guaranteed under assumption of C2C^{2} smooth condition of the value function), we derive the equation of the parameter WW^{*}.

In fact, the value-matching equation, V(W)=V(W+)V(W^{*}-)=V(W^{*}+), can be written as

u(0)δ+0ggRG(g)𝑑g=2(β1β2)(1R)σ2L2(β1)R2Γ(2R)eβ1W+C(W)eβ2W+V0(W).\frac{u(0)}{\delta}+\int_{0}^{g^{*}}g^{-R}G^{\prime}(g)dg={2\over(\beta_{1}-\beta_{2})(1-R)\sigma^{2}L^{2}}(\beta_{1})^{R-2}\Gamma(2-R)e^{\beta_{1}W^{*}}+C(W^{*})e^{\beta_{2}W^{*}}+V_{0}(W^{*}). (A-26)

This is a one-variable equation of the variable WW^{*}, as g=D(W)g^{*}=D(W^{*}). This equation is the same as Equation (26) proposed in the statement of this proposition.

Step 7. (Necessary condition) If the value function is C2C^{2} smooth, then by Proposition 6, there is a positive number WW^{*} such that the unconstrained region and the constrained region are separated by this number WW^{*}. By Step 1 - Step 6 this number WW^{*} must satisfy Equation (A-26) to ensure the smooth-fit condition of the value function. Moreover, the positive solution of Equation (A-26) must be unique by Proposition 6 again. By its construction, V(W,L)=V~(W,L)V(W,L)=\tilde{V}(W,L) by (27). The necessary part is proved.

Step 8. (Verification). We assume the existence of a positive solution WW^{*} of Equation (A-26) and show that the value function is C2C^{2} smooth.

In fact, by Step 1 - Step 6, the function V~(W,L)\tilde{V}(W,L) is C2C^{2} smooth, and a smooth solution of the HJB equation in each region (0,W)(0,W^{*}) and (W,)(W^{*},\infty). It remains to show that V~(W,L)\tilde{V}(W,L) is a viscosity solution of the HJB equation. We show that, μV~W(W,L)σ2V~WW(W,L)L,WW\frac{\mu\tilde{V}_{W}(W,L)}{-\sigma^{2}\tilde{V}_{WW}(W,L)}\leq L,\forall W\leq W^{*}. By its definition, it suffices to show that the function gG(g)gG^{\prime}(g) is increasing since μV~W(W,L)σ2V~WW(W,L)=μRσ2gG(g)\frac{\mu\tilde{V}_{W}(W,L)}{-\sigma^{2}\tilde{V}_{WW}(W,L)}=\frac{\mu}{R\sigma^{2}}gG^{\prime}(g) and μRσ2gG(g)=L\frac{\mu}{R\sigma^{2}}g^{*}G^{\prime}(g^{*})=L. Here, the function G()G(\cdot) is defined by WW^{*}. For this purpose, we notice that

(gG(g))\displaystyle(gG^{\prime}(g))^{\prime} =\displaystyle= gG′′(g)+G(g)\displaystyle gG^{\prime\prime}(g)+G^{\prime}(g)
=\displaystyle= G(g)[Rκ(δρgRGR)]+G(g)\displaystyle G^{\prime}(g)\left[\frac{R}{\kappa}(\delta-\rho-g^{R}G^{-R})\right]+G^{\prime}(g)

in which Equation (25) is used. Since G(g)>0G^{\prime}(g)>0 follows from the concavity of the function V(W)V(W) in the region WLW\leq L, it reduces to show that

[Rκ(δρgRGR)]+10,\displaystyle\left[\frac{R}{\kappa}(\delta-\rho-g^{R}G^{-R})\right]+1\geq 0,

Or equivalently, gRGR<δρ+κRg^{R}G^{-R}<\delta-\rho+\frac{\kappa}{R}, as proposed in the proposition. Therefore, we have proved that

{μσ2V~W(W,L)V~WW(W,L)L}=(0,W],\displaystyle\left\{\frac{\mu}{-\sigma^{2}}\frac{\tilde{V}_{W}(W,L)}{\tilde{V}_{WW}(W,L)}\leq L\right\}=(0,W^{*}],

and

{μσ2V~W(W,L)V~WW(W,L)L}=[W,).\displaystyle\left\{\frac{\mu}{-\sigma^{2}}\frac{\tilde{V}_{W}(W,L)}{\tilde{V}_{WW}(W,L)}\geq L\right\}=[W^{*},\infty).

By its construction, V(W,L)=V~(W,L)V(W,L)=\tilde{V}(W,L) is C2C^{2} smooth, and it is the value function by Proposition 1. Moreover, by Step 7 and Proposition 6, the positive number WW^{*} must be the unique positive solution of Equation (A-26). We have thus proved the sufficient part (the verification theorem). \Box

Proof of Proposition 8.

By using the same argument in proving Proposition 1, we can prove that the value function is the unique viscosity solution of the HJB equation (for V(W)=Vb(W)V(W)=V^{b}(W))

δV(W)=max0XbW[12σ2X2V′′+μXV]+maxc0{αu(c)cV(W)}+βu(W)\displaystyle\delta V(W)=\max_{0\leq X\leq bW}\left[{1\over 2}\sigma^{2}X^{2}V^{\prime\prime}+\mu XV^{\prime}\right]+\max_{c\geq 0}\left\{\alpha u(c)-cV(W)\right\}+\beta u(W)

with initial value V(0)=0V(0)=0. Similar to Proposition 2, we find a C2C^{2} solution of the form V(W)=BW1R1RV(W)=B\frac{W^{1-R}}{1-R} to the above HJB equation for a positive number BB.

By plugging V(W)=BW1R1RV(W)=B\frac{W^{1-R}}{1-R} into the HJB equation with X=bWX^{*}=bW, a straightforward computation implies that

δBW1R1R\displaystyle\delta B\frac{W^{1-R}}{1-R} =\displaystyle= 12σ2b2W2B(R)WR1+μbWBWR+β11RW1R+α1/RR1R(BWR)11/R)\displaystyle{1\over 2}\sigma^{2}b^{2}W^{2}B(-R)W^{-R-1}+\mu bWBW^{-R}+\beta{1\over 1-R}W^{1-R}+\alpha^{1/R}\frac{R}{1-R}(BW^{-R)^{1-1/R}})

yielding an equation of BB as follows

[δ+(1R)(12σ2b2Rμb)]B=β+α1/RRB11/R.\left[\delta+(1-R)(\frac{1}{2}\sigma^{2}b^{2}R-\mu b)\right]B=\beta+\alpha^{1/R}RB^{1-1/R}. (A-27)

Since b<μRσ2b<\frac{\mu}{R\sigma^{2}}, then X=bWX^{*}=bW is the solution in max0XbW[12σ2X2V′′+μXV]\max_{0\leq X\leq bW}\left[\frac{1}{2}\sigma^{2}X^{2}V^{\prime\prime}+\mu XV^{\prime}\right]. Moreover, the optimal consumption rate satisfies that αu(c)=V(W)=BWR\alpha u^{\prime}(c)=V^{\prime}(W)=BW^{-R}. It remains to show the unique positive solution BB of Equation (A-27) for α>0,β>0\alpha>0,\beta>0. We notice that, since 0bμRσ20\leq b\leq\frac{\mu}{R\sigma^{2}}, we have

κR12σ2b2Rμb0.-\frac{\kappa}{R}\leq\frac{1}{2}\sigma^{2}b^{2}R-\mu b\leq 0. (A-28)

Case 1. If R<1R<1, then by Assumption A,

δ+(1R)(12σ2b2Rμb)δ(1R)κR>0.\displaystyle\delta+(1-R)(\frac{1}{2}\sigma^{2}b^{2}R-\mu b)\geq\delta-(1-R)\frac{\kappa}{R}>0.

Moreover, the left side of Equation (A-27) increases while the right side decreases with BB, the existence and uniqueness of a positive solution BB is evident.

Case 2. If R>1R>1, the

δ+(1R)(12σ2b2Rμb)δ>0.\displaystyle\delta+(1-R)(\frac{1}{2}\sigma^{2}b^{2}R-\mu b)\geq\delta>0.

In this situation, the right side is increasing and concave. Moreover, as WW\rightarrow\infty, the right side is dominated by the left side of equation, a linear function of BB. Therefore, it is straightforward to see the existence and uniqueness of the number BB. The proof is completed. \Box

Proof of Corollary 1.

In the unconstrained region, VW=gRV_{W}=g^{-R}. Since V′′(W)=RgR1G(g)V^{\prime\prime}(W)=\frac{-Rg^{-R-1}}{G^{\prime}(g)}, the optimal strategy is X(W)=μRσ2gG(g)X(W)=\frac{\mu}{R\sigma^{2}}gG^{\prime}(g). G()G(\cdot) is not a linear function in general. Otherwise, W=Rσ2μLW^{*}=\frac{R\sigma^{2}}{\mu}L. Equation (26) is viewed as an equation of of LL, in which both sides are analytical function of the variable LL. By the analytical function property, it cannot hold for a general choice of the capacity level LL. \Box

Appendix B: Incomplete Gamma function

The lower incomplete gamma function and the upper incomplete gamma function are defined by by

Γ(s,x)=xts1et𝑑t;γ(s,x)=0xts1et𝑑t.\Gamma(s,x)=\int_{x}^{\infty}t^{s-1}e^{-t}dt;\gamma(s,x)=\int_{0}^{x}t^{s-1}e^{-t}dt. (B-1)

For any Re(s)>0Re(s)>0, the functions Γ(s,x)\Gamma(s,x) and γ(s,x)\gamma(s,x) can be defined easily. Each of them can be developed into a holomorphic function. In fact, the incomplete Gamma function is well-defined for all complex ss and xx, by using the power series expansion

γ(s,x)=xsΓ(s)exk=0xkΓ(s+k+1).\gamma(s,x)=x^{s}\Gamma(s)e^{-x}\sum_{k=0}^{\infty}\frac{x^{k}}{\Gamma(s+k+1)}. (B-2)

The following asymptotic behavior for the incomplete gamma function are used in the proof of Proposition 7.

limxγ(s,x)=Γ(s),\lim_{x\rightarrow\infty}\gamma(s,x)=\Gamma(s), (B-3)

and

limx0γ(s,x)xs=1s.\lim_{x\rightarrow 0}\frac{\gamma(s,x)}{x^{s}}=\frac{1}{s}. (B-4)

See N.M. Temme, “The asymptotic expansion of the incomplete gamma functions” , SIAM J. Math. Anal. 10 (1979), pp. 757 - 766.

It can also be connected with Kummer’s Confluent Hypergeometric Function, when Re(z)>0Re(z)>0,

γ(s,z)=s1zsezM(1,s+1,z)\gamma(s,z)=s^{-1}z^{s}e^{-z}M(1,s+1,z) (B-5)

where

M(1,s+1,z)=1+z(s+1)+z2(s+1)(s+2)+M(1,s+1,z)=1+\frac{z}{(s+1)}+\frac{z^{2}}{(s+1)(s+2)}+... (B-6)

Therefore, the incomplete Gamma functions can be computed effectively.

References

  • [1] Ahn, S., K. Choi., and B. Lim., 2019. Optimal Consumption and Investment under Time-Varying Liquidity Constraints. Journal of Financial Quantitative Analysis, 54, 1643-1681.
  • [2] Bakshi, G., and Z. Chen., 1996. The Spirit of Capitalism and Stock Market Prices. American Economic Review, 86, 133 - 157.
  • [3] Bengen, W., 1994. Determining Withdrawal Rates Using Historical Data. Journal of Financial Planning, October, 14-24.
  • [4] Black, F., and A. Perold., 1992. Theory of Constant Proportion Portfolio Insurance. Journal of Economic Dynamics and Control, 16, 403-426.
  • [5] Blanchet-Scaillet, C., N. El Karouri, N., M. Jeanblanc., and L. Martellini., 2008. Optimal Investment Decisions when Time-Horizon is Uncertain. Journal of Mathematical Economics, 44, 1100-1113.
  • [6] Benzoni, L., P. Collin-Dufresne and R. Goldstein., 2007. Portfolio Choice over the Life-Cycle when the Stock and Labor Markets are Cointegrated. Journal of Finance, 62, 2123-2167,
  • [7] Bismut, J-M., 1975. Growth and Optimal Intertemporal Allocation of Risks. Journal of Economic Theory, 10, 239-257.
  • [8] Bodie, Z., R.Merton., and W. Samuelson., 1992. Labor Supply Flexibility and Portfolio Choice in a Life Cycle Model. Journal of Economic Dynamics and Control, 16, 427-449.
  • [9] Bodie, Z., J. Detemple., and R. Rindisbacher., 2009. Life Cycle Finance and the Design of Pension Plans. Annual Review of Financial Economics, 1, 249-285.
  • [10] Carroll, C., 1997. Buffer-Stock Saving and the Life Cycle/Permanent Income Hypothesis. Quarterly Journal of Economics, 112, 1-55.
  • [11] Carroll, C., 2002. Why Do the Rich Saves So Much? Does Atlas Shrug? The Economic Consequences of Taxing the Rich, ed. by J. B. Slemrod. Harvard University Press.
  • [12] Carroll, C., and M. Kimball., 1996. On the Concavity of the Consumption Function. Econometrica, 64, 981-992.
  • [13] Chen, S., R. Fu., and Z. Zhou., 2020. Consumption and Portfolio Decisions with Uncertain Lifetimes. Mathematics and Financial Economics, 14, 507-545.
  • [14] Chen, X., and W. Tian., 2014. Optimal Portfolio Choice and Consistent Performance. Decisions in Economics and Finance, 37 (2), 454-474.
  • [15] Cocco, J., F. Gomes and P.J. Maenhout., 2005. Consumption and Portfolio Choices over the Life-Cycle. Review of Financial Studies,18, 491-533.
  • [16] Dybvig, P., 1995. Duesenberry’s Ratcheting of Consumption: Optimal Dynamic Consumption and Investment Given Intolerance for Any Decline in Standard of Living. Review of Economic Studies, 62, 287-313.
  • [17] Dybvig, P., and H. Liu., 2010. Lifetime Consumption and Investment: Retirement and Constrained Borrowing. Journal of Economic Theory, 145, 885-907.
  • [18] Elie, R., and N. Touzi., 2008. Optimal Lifetime Consumption and Investment under a Drawdown Constraint. Finance and Stochastics, 12, 299-330.
  • [19] El Karoui, N., and M. Jeanblanc-Picque., 1998. Optimization of Consumption with Labor Income. Finance and Stochastics, 2, 409-440.
  • [20] Fleming, W.H. and Soner, H.M., 2006. Controlled Markov Processes and Viscosity Solutions (second edition), Stochastic Modeling and Applied Probability, (25), Springer Verlag.
  • [21] Gomes, F.J and A. Michaelides., 2005. Optimal Life-Cycle Asset Allocation: Understanding the Empirical Evidence. Journal of Finance, 60, 869-904.
  • [22] Grossman, S., and J. Vila., 1992. Optimal Dynamic Trading Stretegies wuth Leverage Constraints,Journal of Financial Quantitative Analysis, 27, 151-168.
  • [23] Karatzas, I., and S. Shreve., 1998. Methods of Mathematical Finance, Applications of Mathematics (39), Springer.
  • [24] King, A., J. Billingham., and S. Otto., 2003. Differential Equations: Linear, Nonlinear, Ordinary, Partial, Cambridge University Press.
  • [25] Liu, H., and M. Loewenstein., 2002. Optimal Portfolio Selection with Transaction Costs and Finite Horizons. Review of Financial Studies, 15, 805-835.
  • [26] Malkiel, B. 1999. A Random Walk Down Wall Street, Norton & Company, Inc.
  • [27] Merton, R. C., 1971. Optimum Consumption and Portfolio Rules in a Continuous Time Model. Journal of Economic Theory, 3, 373-413.
  • [28] Modigliani, F., 1986. Life Cycle, Individual Thrift, and the Wealth of Nations. American Economic Review, 76, 297-313.
  • [29] Ottaviani, M., and P. Sotensen., 2015. Price Reaction to Information with Heterogeneous Beliefs and Wealth Effects: Underreaction, Momentum, and Reversal. American Economic Review, 105, 1-34.
  • [30] Smith, W., 2001. How Does the Spirit of Capitalism Affect Stock Market Prices? Review of Financial Studies, 14 (4), 1215-1232.
  • [31] Strulovici, B., and M. Szydlowski., 2015. On the Smoothness of Value Functions and the Existence of Optimal Strategies in Diffusion Models. Journal of Economic Theory, 159, 1016-1055.
  • [32] Vanguard Group., 2018. How America Saves.
  • [33] Vila, J., and T. Zariphopoulou., 1997. Optimal Consumption and Portfolio Choice with Borrowing Constraints. Journal of Economic Theory, 77, 402-431.
  • [34] Wachter, J., and M. Yogo., 2010. Why do Household Portfolio Shares Rise in Wealth? Review of Financial Studies, 23, 3929-65.
  • [35] Xu, Z., and F. Yi., 2016. An Optimal Consumption Investment Model with Constraint on Consumption. Mathematical Cpntrol & Related Fields, 6(3), 517-534.
  • [36] Yogo, M., 2016. Portfolio Choice in Retirement: Health Risk and the Demand of Annuities, Housing, and Risky Assets. Journal of Monetary Economics, 80, 17-34.
  • [37] Zariphopoulou, T., 1994. Consumption and Investment Models with Constraints, SIAM Journal on Control and Optimization, 32, 59-85.
Refer to caption
Figure 1: This figure displays the auxiliary function G(g)G(g) in the unconstrained region in Proposition 7 and Corollary 1. The model parameters are μ=0.1,σ=0.3,R=0.5,l=0.7,\mu=0.1,\sigma=0.3,R=0.5,l=0.7, and W0=1,000,000W_{0}=1,000,000. The x-axis represents the parameter gg and the y-axis represents G(g)G(g) (in the unit of 100,000). As shown, this function is NOT a linear function, thus, the optimal strategy is not a myopic one as shown in Corollary 1.
Refer to caption
Figure 2: This figure displays the optimal portfolio strategy under three different strategies for 0<R<10<R<1. “Model” denotes the model in Problem (B) under a risk capacity constraint XtL=0.7W0X_{t}\leq L=0.7W_{0}. Parameters are μ=0.1,σ=0.3,R=0.5,l=0.7.\mu=0.1,\sigma=0.3,R=0.5,l=0.7. By calculation, the wealth threshold W=490,235W^{*}=490,235 above which the retiree invests 700,000 in the stock market. When the wealth portfolio is smaller than WW^{*}, the optimal strategy is μRσ2gG(g)\frac{\mu}{R\sigma^{2}}gG^{\prime}(g) where the auxiliary function G()G(\cdot) is illustrated in Figure 1. “Benchmark” denotes the optimal dollar amount in Proposition 2 in the absence of the constraint on the risky asset investment. “BPC” denotes the optimal strategy (in Proposition 8) under a leverage constraint that Xt12μRσ2WtX_{t}\leq\frac{1}{2}\frac{\mu}{R\sigma^{2}}W_{t}.
Refer to caption
Figure 3: This figure displays the optimal percentage of wealth, X(W)W\frac{X(W)}{W}, invested in the stock market in Problem (B). The parameters are the same as in Figure 2. “BPC” denotes the optimal strategy (in Proposition 8) under a leverage constraint that XtbWtX_{t}\leq bW_{t} where b=12μRσ2b=\frac{1}{2}\frac{\mu}{R\sigma^{2}}. As shown, the percentage is decreasing in the entire region of WW. We also notice that the percentage curve is steeper in the beginning of the retirement time when the wealth is closes to initial wealth than that when the wealth closes to the threshold WW^{*}. As a function of WW, X(W)W\frac{X(W)}{W} is not C1C^{1} smooth under the risk capacity constraint, in contrast to the standard model (Proposition 2) or the model under leverage constraint (Proposition 8).
Refer to caption
Figure 4: This figure displays the optimal portfolio strategy under three different strategies for R>1R>1. “Model” denotes the model in Problem (B) under a risk capacity constraint XtL=0.73W0X_{t}\leq L=\frac{0.7}{3}W_{0}. Parameters are μ=0.1,σ=0.3,R=1.5,l=0.7/3.\mu=0.1,\sigma=0.3,R=1.5,l=0.7/3. By calculation, the threshold level of the wealth is W=339,168W^{*}=339,168. “Benchmark” denotes the optimal dollar amount in Proposition 2 in the absence of the constraint on the risky asset investment. Finally, “BPC” denotes the optimal strategy in Proposition 8 under a leverage constraint that Xt12μRσ2WtX_{t}\leq\frac{1}{2}\frac{\mu}{R\sigma^{2}}W_{t}.
Refer to caption
Figure 5: This figure displays the optimal portfolio strategy in Problem (A) under three different strategies. “Model” denotes the model in Problem (A) under a risk capacity constraint XtL=0.3W0X_{t}\leq L=0.3W_{0}. Parameters are μ=0.1,σ=0.3,R=1.5,l=0.3.\mu=0.1,\sigma=0.3,R=1.5,l=0.3. “Benchmark” denotes the optimal dollar amount in Merton’s model. Finally, “VZ” denotes the optimal strategy under a leverage constraint that Xt12μRσ2WtX_{t}\leq\frac{1}{2}\frac{\mu}{R\sigma^{2}}W_{t}, which is solved in Vila and Zariphopoulou (1997).
Refer to caption
Figure 6: This figure displays the effect of the risk capacity level, LL, on the investing strategy in Problem (B). The parameters are the same as in Figure 2. As shown, the higher the capacity level LL, the higher the dollar amount in the risky asset. The figure also demonstrates that the threshold, WW^{*}, positively depends on LL. The risk capacity level LL affects both the expected level of spending and the investing strategy even when the portfolio value is smaller than this threshold.