The McCormick martingale optimal transport
Abstract
Martingale optimal transport (MOT) often yields broad price bounds for options, constraining their practical applicability. In this study, we extend MOT by incorporating causality constraints among assets, inspired by the nonanticipativity condition of stochastic processes. However, this introduces a computationally challenging bilinear program. To tackle this issue, we propose McCormick relaxations to ease the bicausal formulation and refer to it as McCormick MOT. The primal attainment and strong duality of McCormick MOT are established under standard assumptions. Empirically, using the lower and upper bounds derived from marginal constraints, the McCormick relaxations reduce the price gap by an average of 1% for stocks with liquid option markets and 4% for those with moderately liquid markets. When tighter bounds on probability masses are applied, the average reduction increases to 12.66%.
Keywords: Martingale optimal transport, causal optimal transport, bilinear constraints, McCormick relaxations, strong duality
Mathematics Subject Classification: 91G20, 60G42, 90C46.
1 Introduction
In the realm of option pricing, a crucial task is to establish model-independent arbitrage-free price bounds for derivatives. These bounds serve a direct application in identifying potential arbitrage opportunities in traded options. Hobson and Neuberger, (2012) pioneered the study on robust bounds for forward start options, while Martingale Optimal Transport (MOT), introduced by Beiglböck et al., (2013), presented a comprehensive framework utilizing tools from Monge–Kantorovich mass transport. The strong duality result has a financial interpretation, yielding sub/super hedging portfolios. Furthering the MOT research line, Beiglböck et al., (2017, 2019) considered dual attainment problems. Beiglböck and Juillet, (2016) identified a martingale coupling reminiscent of the classic monotone quantile coupling. Several numerical schemes for MOT were proposed by Guo and Obłój, (2019); Eckstein et al., (2021), employing techniques such as linear programming (LP) or neural networks. Investigations into MOT stability were carried out by Backhoff-Veraguas and Pammer, (2022); Brückerhoff and Juillet, (2022); Wiesel, (2023).
A significant hurdle in MOT arises from the wide price bounds, thereby constraining their practical applicability. This claim is empirically substantiated by Beiglböck et al., (2013, Figure 1) and Eckstein et al., (2021, Tables 4.2 and 4.3). Achieving more precise bounds typically requires the incorporation of additional information into the modeling framework. For example, Lütkebohmert and Sester, (2019) argued that tighter bounds can be derived when more information about the variance of returns is available, although at the cost of sacrificing model independence. Fahim and Huang, (2016) enhanced precision of price bounds by integrating liquid exotic options into the hedging portfolio. Furthermore, Eckstein et al., (2021, Section 4) refined the bounds by assuming knowledge of the marginals at earlier maturities.
In this study, we introduce another methodology inspired by recent strides in causal optimal transport (COT) (Lassalle,, 2013; Backhoff-Veraguas et al.,, 2017). COT posits a crucial condition: given the past of one process , the past of another process should be independent of the future of under the transport plan. Essentially, COT imposes the nonanticipativity condition. The versatility of COT is evidenced by its applications in various domains such as mathematical finance (Backhoff-Veraguas et al.,, 2020), mean field games (Acciaio et al.,, 2021; Backhoff-Veraguas and Zhang,, 2023), and stochastic optimization (Pflug and Pichler,, 2012, 2014; Acciaio et al.,, 2020).
In Section 3, we present the causality constraint in a two-asset setting. An essential challenge arises as the MOT incorporating causality transforms into a bilinear program that is non-convex. This inherent difficulty stems from the fact that only marginals at each maturity are known in MOT. However, causality requires conditional kernels that rely on the joint distribution across different maturities, detailed in Proposition 3.2. Consequently, to overcome this difficulty, we relax the original problem and consider the McCormick envelopes (McCormick,, 1976) for the bilinear equality constraints. We refer to it as the McCormick MOT. McCormick relaxations offer a straightforward implementation and are widely employed in optimization software, while we note that various alternative algorithms for bilinear problems exist; see Audet et al., (2000); Sherali and Fraticelli, (2002); Anstreicher, (2009).
In discrete scenarios, McCormick MOT is a finite-dimensional LP problem and thus straightforward. Our main technical contributions are for the continuous case with absolutely continuous densities. To derive McCormick envelopes, we assume that lower and upper bounds for densities are available, which are referred to as capacity constraints in Korman and McCann, (2015). The primal attainment is established in Theorem 4.5 after selecting a suitable weak topology. Analogously to the classic MOT, a natural question is the strong duality of the McCormick MOT. Lemma 4.6 and Theorem 4.7 prove the relevant result under standard assumptions in the MOT literature. The main idea underlying these proofs remains rooted in the Fenchel-Rockafellar duality theorem.
In our numerical study, we introduce a calibration method for risk-neutral densities that ensures convex order while addressing bid-ask spreads and multiple tenors. Given the discrete nature of empirical data, our approach formulates a finite-dimensional LP problem, offering a straightforward optimization process. We observe that the improvement in price bounds depends on the specific option payoffs and the liquidity of European options on the underlying assets. McCormick MOT proves to be effective for path-dependent payoffs, where the temporal structure plays a more pronounced role. First, using the lower and upper bounds that naturally arise from marginal constraints, the McCormick relaxations reduce the price gap by an average of 1% for stocks with liquid option markets and 4% for those with moderately liquid markets. When tighter bounds on probability masses are applied, the McCormick MOT achieves an average reduction of 12.66%. Importantly, our methodology is universal since it integrates seamlessly with other knowledge available to the agent. The source code is publicly available at https://github.com/hanbingyan/McCormick.
The rest of the paper is organized as follows. Section 2 introduces the notation and formulation of MOT. Section 3 presents the motivation and several implications of the causality constraint. In Section 4, we demonstrate the McCormick relaxations in both discrete and continuous cases. Section 5 shows the improvements of the price bounds with an illustrative example and empirical data. The appendix gives proofs of the main results.
2 Problem formulation
We consider a financial market comprising two risky assets, denoted and , while the extension to include more assets is straightforward. Suppose that the market has European options on each of the two stocks for the maturities , where and the maturities are not necessarily evenly spaced. Let and represent the prices of risky assets at maturity . The range of the first risky asset at time is denoted as , assumed to be a closed subset of . Consequently, forms a closed subset of . The set of all Borel probability measures on is denoted as . Similarly, for the second risky asset , we introduce another closed set , with the corresponding notations defined analogously.
In this study, we assume that individual risk-neutral probabilities denoted and , are known. Specifically, if the prices of European options are available for all strikes, Breeden and Litzenberger, (1978) provided a formula for the risk-neutral probability density. In cases involving a finite number of strikes, a linear interpolation method is described in Davis and Hobson, (2007). Additionally, an alternative cubic-spline method has been proposed by Fengler, (2009) for finitely many options prices.
However, the joint distribution among different maturities or assets is typically unknown. Let represent the set of couplings with as marginals for each . Similarly, is defined. Consider as the set of couplings with and as marginals. The model-independent and arbitrage-free framework in Beiglböck et al., (2013) involves examining a measure that satisfies the martingale condition:
(2.1) |
For the sake of simplicity, we assume zero interest rates and no dividends in (2.1), while our empirical results utilize forward prices to incorporate these factors. Denote as the set of all martingale measures satisfying marginal constraints. Suppose possess identical finite first moments and increase in convex order in , that is, for all convex functions and ,
Denote the convex order condition as . Similarly, assume have the same finite moments and . By Strassen, (1965), the assumptions above are necessary and sufficient conditions for .
In the subsequent discussion, we examine an exotic option with the payoff function . The MOT framework (Beiglböck et al.,, 2013; Eckstein et al.,, 2021) investigates model-free bounds for this exotic option:
(2.2) |
(2.2) is known as the primal formulation. The dual problem unveils semi-static hedging strategies as outlined in Beiglböck et al., (2013, Theorem 1.1).
In the absence of modeling assumptions, the gap between lower and upper bounds in MOT can be substantial, thereby limiting its practical applicability. A common approach to address this challenge involves incorporating additional information. For instance, Fahim and Huang, (2016) augmented the model with market prices of specific exchange-listed exotic options, such as standardized digital and barrier options. Another strategy entails including information related to volatility. Joint calibration on SPX and VIX options leverages the structural link between the S&P 500 and VIX indices; see Guyon et al., (2017); De Marco and Henry-Labordere, (2015) for more details. Additionally, Bayraktar et al., (2021) examined MOT examples with volatility constraints, while Lütkebohmert and Sester, (2019) considered supplementary information regarding the variance of returns.
The conventional optimal transport framework fails to account for the temporal structure and information flow inherent in time series data. To address this limitation, recent research has introduced COT (Lassalle,, 2013; Backhoff-Veraguas et al.,, 2017). When dealing with options on multiple assets, we demonstrate that the nonanticipativity (causality) condition provides an alternative means of tightening bounds, typically without requiring new data or estimation. Section 3 presents the causality constraint and its implications for MOT.
3 Motivation and implication of causality
3.1 Nonanticipativity condition
Under a risk-neutral measure with discrete-time processes, the underlying assets and are typically represented as:
(3.1) |
Here, and denote random returns for and , respectively, which are the only sources of randomness in the system (3.1). A standard assumption posits that are independent and identically distributed, as commonly imposed in binomial tree models. also satisfy the same assumption. However, it should be noted that for each time , and are allowed to be correlated. For example, a parsimonious assumption is that the correlation between and is a constant. Under these conditions, is independent of , conditional on . This equivalently implies that , since knowledge of does not aid in predicting once is known. It is crucial to emphasize that and are not independent if is unknown.
Aligned with the nonanticipativity condition that originates from stochastic difference (or differential) equations, Lassalle, (2013) imposes that a transport plan should satisfy
(3.2) |
which is equivalent to
(3.3) |
see Eckstein and Pammer, (2024, Remark 2.2). The property (3.2) (or (3.3)) is known as the causality condition. A transport plan that satisfies (3.2) (or (3.3)) is termed causal. If we interchange the positions of and the following condition holds:
(3.4) |
then the transport plan is anticausal. Couplings that are both causal and anticausal are referred to as bicausal. For later use, denote as the set of all bicausal and martingale transport plans, with and as marginals for each .
Motivated by the nonanticipativity condition, we propose to further restrict the MOT problem (2.2) by considering bicausal plans:
(3.5) |
Before delving into the main results, we discuss some properties for the bicausal MOT (3.5). The first consequence of the bicausal condition is the equivalence of the martingale property under individual filtrations and the joint filtration. The proof of Proposition 3.1 directly applies the causality condition (3.2) and is therefore omitted.
3.2 Non-convexity
When the joint distributions and, consequently, conditional kernels are provided, the causality condition (3.2) (or (3.3)) imposes a linear constraint on ; see Backhoff-Veraguas et al., (2017, Proposition 2.4 (3)). However, this is no longer applicable when the conditional kernels are not predetermined. The corresponding equivalent condition is outlined in Proposition 3.2. The proof follows a rationale similar to that of Backhoff-Veraguas et al., (2017, Proposition 2.4 (3)), and we include it in the Appendix for the sake of completeness.
Proposition 3.2.
is causal if and only if
(3.7) |
for all , , and .
To observe that the causality constraint is no longer linear in , one can think about the discrete case. The conditional mass function , where the denominator is not uniquely determined. In the subsequent section, we demonstrate that this constraint is bilinear in and non-convex. This characteristic poses a primary challenge in the bicausal MOT and makes the program difficult to solve. Consequently, we propose adopting the McCormick relaxation (McCormick,, 1976) for the bilinear constraint. The resulting relaxed problem is convex, offering a computationally more tractable approach while maintaining the capacity to enhance bounds in various scenarios.
4 McCormick relaxation
4.1 The discrete case
To elucidate the main concept, we examine the case with discrete marginals, interpreting as probability mass functions. The causality constraint (3.2) (or (3.3)) can be expressed equivalently as
(4.1) |
We stress that (4.1) holds pointwise, while (3.2) (or (3.3)) holds -a.s. Indeed, if , then . Both conditional probabilities in (3.3) are well-defined. Then (4.1) holds. If , then is not defined. (3.3) is not required to hold. However, we note that since . Therefore, (4.1) is valid as .
(4.1) essentially constitutes a bilinear equality constraint on . Consequently, it is computationally challenging to find exact upper/lower bounds in the bicausal MOT problem (3.5). In the optimization literature, bilinear programming falls under quadratically constrained quadratic programming (QCQP). Several algorithms, including branch and cut with reformulation-linearization techniques (Audet et al.,, 2000), and semi-definite programming relaxations (Anstreicher,, 2009; Sherali and Fraticelli,, 2002), have been proposed. In this study, we opt for the classic McCormick relaxation in McCormick, (1976) for several reasons. It is widely employed as a built-in routine in optimization software and is also straightforward to implement manually. Moreover, the dual formulation remains tractable, allowing for the demonstration of the impact of causality constraints on hedging strategies.
McCormick relaxations construct the convex and concave envelopes as follows. The bilinear term on the left-hand side of (4.1) is substituted with . For probability , we suppose the upper bound and lower bound are known. The corresponding upper and lower bounds for are denoted as and , respectively. Here, the subscripts in and , , indicate that different functions are used when the variables have different indices. However, for the sake of brevity, we denote and , since the indices in imply the functions used. Then we simply write
Since , expanding the expression gives
(4.2) |
Similarly, the same procedure applied to products involving and yields three additional inequalities. We also replace the right-hand side of (4.1) with , and four corresponding inequalities are derived similarly. The causality constraint becomes .
Together with the relaxation on anticausality and the martingale condition, we introduce McCormick martingale couplings as follows:
Definition 4.1.
Given the upper bound and lower bound on probability masses, is called a McCormick martingale coupling if it satisfies
-
(1)
the capacity constraint: ;
-
(2)
the martingale condition:
(4.3) -
(3)
the McCormick relaxation of the causality condition: for all and , there exist variables satisfying
(4.4) -
(4)
the McCormick relaxation of the anticausality condition, by interchanging and in (4.4).
We denote the set of all McCormick martingale couplings as .
All of the inequalities in (4.4) are derived in the same way as (4.2). Note that (4.4) holds for all , instead of merely -a.s. In condition (3), it is possible to simplify . The term (or and ) also serves as a variable in the optimization process. However, in (4.4), we can eliminate the dependence on by imposing that the maximum of the lower bounds does not exceed the minimum of the upper bounds for . Therefore, we only write to signify that is a McCormick martingale coupling.
A simple choice of bounds is and , which are derived from the marginal conditions and do not require additional modeling assumptions. Under this choice, is also not empty. Indeed, we can first construct (or rather arbitrarily pick) the marginal laws, and , such that and are martingales under and , respectively. Then the independent coupling serves as a McCormick martingale coupling. In the general finite and discrete cases, many LP solvers can check whether is non-empty.
Remark 4.2.
Remark 4.3.
The total mass of or may not be equal to one. Both and are functions only, rather than probability measures or signed measures. Even when is a measure, the product is not a measure. Specifically, for a set where and are disjoint, . Furthermore, it is crucial to interpret (4.4) pointwise in the discrete case, given that the McCormick relaxation is derived pointwise in (4.2).
We focus on addressing the minimization problem associated with the McCormick MOT in (4.5), as the maximization problem can be treated similarly:
(4.5) |
In finite and discrete scenarios, the infimum in the primal problem is attained when the set is not empty. The dual problem is derived through classic finite-dimensional LP and is omitted here.
4.2 The absolutely continuous case with capacity constraints
In the continuous scenario similar to Korman and McCann, (2015); Bogachev, (2022), we make the assumption that the domains and the marginals and are absolutely continuous with respect to the Lebesgue measure on . Similarly, we consider couplings that are absolutely continuous with respect to the Lebesgue measure on . When the context is evident, we denote by and omit the arguments. Let represent the density of a coupling . Although probability measures with densities are absolutely continuous with respect to the Lebesgue measure, we restrict them to the Borel -algebra on the Euclidean space and interpret them as Borel probability measures. Additionally, nonnegative stock prices result in zero densities in negative domains.
Due to absolute continuity, the causality constraint (3.2) (or (3.3)) transforms into the following density equality:
(4.6) |
Crucially, (4.6) holds -a.e., instead of merely -a.s. It can be verified in the same spirit of (4.1), by noting that both sides of (4.6) are zero when regular conditional kernels are not defined.
With a slight abuse of notation, let still be the auxiliary variable. With the same convention on omitting the subscripts as in the discrete case, we suppose that the upper bound and lower bound for the density are known, such that
which are referred to as capacity/density constraints (Korman and McCann,, 2015; Bogachev,, 2022). Similarly, the inequality (4.2) becomes
(4.7) |
Other inequalities can be derived in a similar fashion. It is important to emphasize that and are not necessarily density functions.
Denote as the density of , and as the individual density vectors. Similarly, let be the density of , and be the individual density vectors. We define the continuous version of McCormick martingale couplings similarly. Note that (4.9) holds -a.e.
Definition 4.4.
Given the upper bound and lower bound on densities, a coupling with density is called a McCormick martingale coupling if it satisfies
-
(1)
the capacity constraint: , -a.e.;
-
(2)
the martingale condition:
(4.8) -
(3)
the McCormick relaxation of the causality condition on densities: for all , there exist such that the following inequality holds -almost everywhere:
(4.9) -
(4)
the McCormick relaxation of the anticausality condition on densities, by interchanging and in (4.9).
We use to denote that corresponds to a McCormick coupling with capacity constraints.
With continuous densities, we denote the risk-neutral price of the exotic option as
(4.10) |
The primal problem, incorporating McCormick relaxation and capacity constraints, can be expressed as
(4.11) |
This formulation is referred to as the continuous version of McCormick MOT.
Theorem 4.5.
Suppose the following conditions hold:
-
1.
is not empty;
-
2.
Probability densities and have finite first moments. The bounds and ;
-
3.
The cost (payoff) function is measurable and for some constant .
Then the infimum in (4.11) is attained.
4.2.1 The strong duality
It is well known that the dual formulation of the classic MOT corresponds to a semi-static hedging strategy, comprising the sum of static vanilla portfolios and dynamic delta positions (Beiglböck et al.,, 2013). A natural question arises regarding the impact of the McCormick relaxation on the hedging portfolio.
In an informal manner, one can derive the dual problem by considering the Lagrange multipliers method and interchanging the infimum with the supremum. We introduce the multipliers as follows, having reduced :
-
(1)
and , denote potential functions testing the marginal constraints for and , respectively. In financial terms, they can be interpreted as a portfolio of static vanilla options;
-
(2)
and , test the martingale condition for and , respectively. They represent self-financing trading strategies in the risky assets;
-
(3)
, , , are multipliers for the McCormick relaxation of the causality condition;
-
(4)
, , are multipliers for the McCormick relaxation of the anticausality condition;
-
(5)
and are the multipliers for the upper and lower bounds, respectively.
Constraints in the dual problem consist of the following components:
-
(1)
The subhedging condition in the sense that
(4.12) where
(4.13) -
(2)
The dual inequality for the McCormick relaxation of the causality condition:
(4.14) -
(3)
The dual inequality for the McCormick relaxation of the anticausality condition:
(4.15)
Define the set of feasible multipliers as
(4.16) |
The objective function in the dual problem is
(4.17) | ||||
Finally, we can state the dual problem as
(4.18) |
where we omit the time subscript in , etc., for simplicity.
Loosely speaking, McCormick relaxations modify the subhedging portfolio in (4.12) by considering
(4.19) |
in lieu of . Additionally, the objective function (4.17) is augmented with terms originating from capacity constraints and McCormick relaxations. However, elucidating these additional terms poses challenges due to the imposition of numerous inequalities in the McCormick envelopes.
We establish strong duality , as defined in (4.11) and (4.18), through a two-step proof. First, we extend the capacity-constrained case presented in Korman et al., (2015, Theorem 1) to encompass non-compact supports, where neither martingale conditions nor McCormick relaxations are imposed. Although our Lemma 4.6 is acknowledged in the literature (Korman et al.,, 2015), a precise proof has not been provided in previous work. For later use, let represent the set of couplings with density and marginals and on . While the proof focuses on one-dimensional cases for simplicity, the extension to multi-dimensional cases is analogous. The main idea involves a modification of Villani, (2003, Proposition 1.22), specifically tailored to address the capacity constraint. Unlike Theorem 4.5, Lemma 4.6 assumes lower semicontinuous (l.s.c.) costs. This condition is needed since the proof employs continuous and bounded functions for cost (payoff) approximation.
Lemma 4.6.
Suppose the following conditions hold:
-
1.
The set of couplings is not empty;
-
2.
The upper bound satisfies . Marginal probability densities and have finite first moments;
-
3.
The cost is l.s.c. and for some constant .
Then strong duality holds:
(4.20) |
where is defined in (4.10),
and
The infimum is also attained. Moreover, it does not change the value of the supremum on the right-hand side if we restrict to be continuous and bounded.
To establish the strong duality result using Lemma 4.6, we additionally assume that the upper and lower bounds are continuous and bounded in Theorem 4.7.
Theorem 4.7.
Suppose the following conditions hold:
-
1.
is not empty;
-
2.
The upper bound satisfies . The lower bound satisfies . The probability densities have finite first moments;
-
3.
The cost (payoff) function is l.s.c. and for some constant .
Then the strong duality holds: .
5 Numerical study
In this section, we investigate the effectiveness of McCormick MOT using both synthetic and empirical data. Given the typical unavailability of risk-neutral densities, our empirical study includes the development of a calibration procedure capable of accommodating bid-ask spreads. We focus on basket options with Asian-style payoffs. Initially, using the lower and upper bounds that naturally arise from marginal constraints, the McCormick relaxations reduce the price gap by an average of 1% for stocks with liquid option markets and 4% for those with moderately liquid markets. When tighter bounds on probability masses are applied, the McCormick MOT achieves an average reduction of 12.66%.
5.1 An illustrating example
Assuming a zero risk-free rate for simplicity, we explore a scenario involving two stocks, and , covering two periods. An exotic option has a payoff given by .
Setting the initial stock prices as and for illustrative purposes, we posit that each stock can have only three different prices at each maturity. Specifically:
-
(1)
At time , the stock price with probabilities , respectively. with probabilities , respectively;
-
(2)
At time , the stock price with probabilities , respectively. with probabilities , respectively.
It is straightforward to solve this problem with optimization programs such as Gurobi. The classic MOT yields option price bounds of . To isolate the impact of McCormick relaxations, we utilize the lower and upper bounds implied by the marginal conditions: and . Employing the McCormick MOT refines the bounds to . Despite the non-convex nature of bicausal MOT, (due to the small size of the problem) the program successfully addresses the problem by brute-force, providing bounds of . This simple example highlights the potential of McCormick MOT, prompting further exploration in more general cases.
5.2 A calibration method of risk-neutral densities
In the empirical case, McCormick MOT also needs risk-neutral distributions as input parameters. When calibrating and independently, there is no guarantee of maintaining convex order. To address this issue, we propose a methodology for calibrating the risk-neutral densities of a risky asset while maintaining the convex order between marginals. An alternative approach involves the separate calibration of and , followed by convexification using the method outlined in Alfonsi et al., (2017, Equation 3.1). However, this approach often results in the modification of the support of , raising uncertainties regarding the consistency of the modified with the options data.
Our construction is outlined as follows. The current time is denoted as . At present, a finite number of European call options with known bid and ask prices are available. Recall that the set of maturities is represented as . For each maturity , there are options available, where is not required to be the same across all maturities. Strike prices at a given maturity are sorted as . In the context of considering the risk-neutral density of a single asset, we denote the underlying stock price at as . Furthermore,
-
(1)
represents the forward price of the stock, where the forward contract is initiated at time and delivered at time ;
-
(2)
denotes the discount factor for time , i.e., the price of a zero-coupon bond at time with maturity at time ;
-
(3)
is the ask price of the call with the strike and maturity . Similarly, denotes the bid price.
We refer to as a feasible price if . To streamline the framework, we assume that the interest rate is deterministic and that any dividends paid by the underlying assets are also deterministic if applicable. Following the approach of Davis and Hobson, (2007), we introduce scaled prices denoted as
(5.1) |
We write as a joint risk-neutral measure encompassing all maturities. For simplicity, we denote as the marginal of on . Similarly, refers to the marginal of on . Suppose that the support of the stock price at time , denoted as , is given by the set of possible strikes at time , i.e., . Consequently, is an array.
To find the risk-neutral measure , we consider the following optimization problem:
(5.2) | |||
(5.3) | |||
(5.4) | |||
(5.5) | |||
(5.6) |
The objective function (5.2) aims to place the feasible price . The constraint (5.3) represents the risk-neutral pricing formula, while (5.4) enforces the martingale condition. (5.5) stems from the fact that the expected value of stock prices under a risk-neutral measure equals the forward price. The final condition ensures that , the marginal of at time , acts as a probability mass function. A notable advantage of our framework is that the program (5.2) is still an LP problem, although the solution is not necessarily unique. When there is a risk-neutral measure that guarantees , the objective value is the lowest and equals . When the optimized objective value exceeds , it indicates the existence of a risk-neutral measure , but certain options lack a feasible price under . Although only marginals are required, our program in (5.2) actually obtains a joint martingale probability distribution. Consequently, it ensures the convex order of marginals automatically. Besides, it addresses bid-ask spreads without solely relying on the mid-price, i.e., the average of bid and ask prices.
5.3 Basket options
The effectiveness of McCormick relaxations depends on the characteristics of the option payoff. Since causality imposes conditional independence, we find that McCormick relaxations are more effective when the payoff function is path-dependent for both assets. Consider a basket call option with an Asian-style payoff, averaged across two stocks and two periods:
(5.7) |
First, we employ probability bounds implied by marginal constraints. Although these bounds are typically loose, the McCormick MOT can still yield tighter price intervals compared to the classic MOT framework.
We obtained the zero-coupon yield curve, forward prices, and European option prices from OptionMetrics via Wharton Research Data Services (WRDS). In the first example, consider an investor interested in stocks from two pharmaceutical companies, namely Gilead Sciences (Ticker: GILD) and GSK plc (Ticker: GSK). According to OptionMetrics data, the liquidity of European options for these stocks is considered moderate. For example, on 28 February 2023, GSK options show an open interest of 4458 and a volume of 167401, while GILD options have an open interest of 4989 and a volume of 233003. We utilize Gurobi to solve the linear program (5.2) and derive the risk-neutral marginals and . To assess the reduction in basket option price bounds, we introduce the following ratio:
(5.8) |
For the tenors, we select as the maturity date closest to the current time . The second maturity, , is approximately one month after . In the context of the payoff (5.7), the strike is determined as the integer rounded off from the average of forward prices with delivery dates and . Consequently, the basket option is close to the at-the-money level. We employ Gurobi to solve both the classic and McCormick MOT, treating them as LP problems. For the computational complexity, our problems typically have around 10,000 variables and 1,000 linear inequality constraints, which can be efficiently solved in less than a second on a laptop. Table 1 provides examples of price limits for the basket option and the ratios defined in (5.8). Considering ranging from February 28, 2022, to February 28, 2023 (the most recent one-year horizon in WRDS), we derive ratio values and present a histogram in Figure 1. The average ratio is approximately 96%, which signifies a 4% reduction in price bounds. In the best-case scenario, the bounds are reduced by 20%. Our code and Excel files detailing the ratios are accessible at https://github.com/hanbingyan/McCormick.
When considering stocks with a more liquid option market, the reduction in the price gap is less pronounced. As an illustration, on February 28, 2023, European options for JPMorgan Chase (JPM) reported an open interest of 54,034 and a volume of 1,189,048, while European options for Morgan Stanley (MS) exhibited an open interest of 22,885 and a volume of 669,638. The corresponding results are presented in Table 2, and Figure 2 provides the histogram of the ratios. On average, the application of the McCormick MOT results in a 1% reduction in the bounds. This outcome can be attributed to larger supports of risk-neutral densities for JPM and MS.
Strike | MOT Max | MOT Min | McCormick Max | McCormick Min | Ratio | |||
---|---|---|---|---|---|---|---|---|
2022-02-28 | 2022-03-04 | 2022-04-01 | 50 | 1.88583 | 1.35447 | 1.88414 | 1.35823 | 0.98974 |
2022-04-26 | 2022-04-29 | 2022-05-27 | 52 | 2.03708 | 1.44786 | 2.03535 | 1.44919 | 0.99482 |
2022-06-23 | 2022-06-24 | 2022-07-22 | 52 | 1.30432 | 0.87923 | 1.26458 | 0.88730 | 0.88752 |
2022-08-26 | 2022-09-02 | 2022-09-30 | 47 | 1.18972 | 0.87830 | 1.17176 | 0.89783 | 0.87965 |
2022-10-24 | 2022-10-28 | 2022-11-25 | 50 | 1.48902 | 0.66430 | 1.48362 | 0.66841 | 0.98848 |
2022-12-20 | 2022-12-23 | 2023-01-20 | 59 | 1.66154 | 1.08860 | 1.64943 | 1.09253 | 0.97201 |
2023-02-17 | 2023-02-24 | 2023-03-24 | 60 | 1.19966 | 0.68178 | 1.19715 | 0.68867 | 0.98185 |
Strike | MOT Max | MOT Min | McCormick Max | McCormick Min | Ratio | |||
---|---|---|---|---|---|---|---|---|
2022-02-28 | 2022-03-04 | 2022-04-01 | 116 | 3.51744 | 1.60546 | 3.51189 | 1.61620 | 0.99148 |
2022-04-26 | 2022-04-29 | 2022-05-27 | 102 | 3.31127 | 1.35499 | 3.30457 | 1.37144 | 0.98817 |
2022-06-23 | 2022-06-24 | 2022-07-22 | 93 | 2.91588 | 1.22635 | 2.90865 | 1.23232 | 0.99219 |
2022-08-19 | 2022-08-26 | 2022-09-23 | 104 | 2.55403 | 0.72075 | 2.53698 | 0.73453 | 0.98318 |
2022-10-17 | 2022-10-21 | 2022-11-18 | 96 | 3.07063 | 0.81261 | 3.06671 | 0.83110 | 0.99007 |
2022-12-13 | 2022-12-16 | 2023-01-13 | 113 | 2.89605 | 0.69713 | 2.89492 | 0.71601 | 0.99090 |
2023-02-10 | 2023-02-17 | 2023-03-17 | 120 | 2.57027 | 0.38493 | 2.56400 | 0.39660 | 0.99179 |


5.4 Tighter bounds on probability masses
In previous instances, the natural lower and upper bounds and on probability masses may have been relatively loose, necessitating tighter constraints using additional information. Our methodology integrates seamlessly with other knowledge available to the agent. In this subsection, we focus on the basket option (5.7) on GILD and GSK. Suppose the agent imposes the following constraints on the coupling :
(5.9) | ||||
(5.10) |
Here, serves as a shrinking parameter, and is a subset of paths where (5.10) holds. Empirically, there are typically more than 2,000 paths available for . The upper bound of 0.01 in (5.9) ensures that the coupling does not concentrate excessively on too few paths; this value is chosen arbitrarily for demonstration purposes. For the lower bound (5.10), the agent specifies that certain paths of interest should have a sufficiently large probability. To make (5.9) and (5.10) feasible, we can set . Lacking prior knowledge of the underlying assets GILD and GSK, the set is chosen arbitrarily for demonstration. For each support , , , and , we select one-third of the possible stock prices in an equally spaced manner, resulting in the set containing at most of the paths. The set of McCormick martingale couplings can be empty after imposing (5.9) and (5.10). However, the existence of a McCormick martingale coupling can be numerically checked using LP solvers such as Gurobi.
Table 3 presents the statistics of the ratios obtained under different shrinking parameters and maturities . We consider for simplicity. The second maturity date can be 1, 2, 3 or 4 weeks after the first maturity date , where is the maturity date closest to the current time . Overall, the McCormick MOT yields much better price intervals when tighter bounds and are available. On average, the eight cases in Table 3 achieve a reduction of 12.66%. The best instance, with a minimum ratio of 0.1908, indicates a reduction of more than 80%. Notably, improved constraints and have already led to tighter price intervals with the MOT method. The reductions in Table 3 demonstrate that McCormick relaxations can further enhance these intervals, primarily because the inequalities like (4.4) become more effective. Besides, the performance of McCormick relaxations is better when the maturity is shorter since the number of paths is smaller, and the upper bound (5.9) is more likely to be binding. Generally, a larger parameter results in greater improvements in price bounds. However, this is not always the case, as the set is more likely to be empty with larger .
Index for | Ratios Max | Ratios Mean | Ratios Min | |
---|---|---|---|---|
0.005 | 1 | 0.9827 | 0.8767 | 0.5749 |
0.005 | 2 | 0.9642 | 0.8398 | 0.3441 |
0.005 | 3 | 0.9929 | 0.8985 | 0.5748 |
0.005 | 4 | 0.9892 | 0.9165 | 0.4718 |
0.01 | 1 | 0.9802 | 0.8476 | 0.1908 |
0.01 | 2 | 0.9559 | 0.8452 | 0.3796 |
0.01 | 3 | 0.9895 | 0.8604 | 0.3247 |
0.01 | 4 | 0.9852 | 0.9025 | 0.4542 |
Although Eckstein et al., (2021) used a different approach by incorporating additional maturities, a comparative analysis of reductions with their findings provides information on the significance of our results. In Eckstein et al., (2021, Table 4.3), focusing on uniform marginals and spread options, the reported price intervals are for a single period, for two periods and for four periods. Consequently, in their four-period scenario, the calculated ratio is , falling within the range observed in our Table 3.
Acknowledgments
This research began when Bingyan Han was a postdoctoral researcher in the Department of Mathematics at the University of Michigan. He expresses gratitude to the University of Michigan for providing support and an atmosphere conducive to this work.
References
- Acciaio et al., (2021) Acciaio, B., Backhoff-Veraguas, J., and Jia, J. (2021). Cournot–Nash equilibrium and optimal transport in a dynamic setting. SIAM Journal on Control and Optimization, 59(3):2273–2300.
- Acciaio et al., (2020) Acciaio, B., Backhoff-Veraguas, J., and Zalashko, A. (2020). Causal optimal transport and its links to enlargement of filtrations and continuous-time stochastic optimization. Stochastic Processes and their Applications, 130(5):2918–2953.
- Alfonsi et al., (2017) Alfonsi, A., Corbetta, J., and Jourdain, B. (2017). Sampling of probability measures in the convex order and approximation of martingale optimal transport problems. Available at SSRN 3072356.
- Anstreicher, (2009) Anstreicher, K. M. (2009). Semidefinite programming versus the reformulation-linearization technique for nonconvex quadratically constrained quadratic programming. Journal of Global Optimization, 43:471–484.
- Audet et al., (2000) Audet, C., Hansen, P., Jaumard, B., and Savard, G. (2000). A branch and cut algorithm for nonconvex quadratically constrained quadratic programming. Mathematical Programming, 87:131–152.
- Backhoff-Veraguas et al., (2020) Backhoff-Veraguas, J., Bartl, D., Beiglböck, M., and Eder, M. (2020). Adapted Wasserstein distances and stability in mathematical finance. Finance and Stochastics, 24(3):601–632.
- Backhoff-Veraguas et al., (2017) Backhoff-Veraguas, J., Beiglböck, M., Lin, Y., and Zalashko, A. (2017). Causal transport in discrete time and applications. SIAM Journal on Optimization, 27(4):2528–2562.
- Backhoff-Veraguas and Pammer, (2022) Backhoff-Veraguas, J. and Pammer, G. (2022). Stability of martingale optimal transport and weak optimal transport. The Annals of Applied Probability, 32(1):721–752.
- Backhoff-Veraguas and Zhang, (2023) Backhoff-Veraguas, J. and Zhang, X. (2023). Dynamic Cournot-Nash equilibrium: the non-potential case. Mathematics and Financial Economics, 17(2):153–174.
- Bayraktar et al., (2021) Bayraktar, E., Zhang, X., and Zhou, Z. (2021). Transport plans with domain constraints. Applied Mathematics & Optimization, 84(1):1131–1158.
- Beiglböck et al., (2013) Beiglböck, M., Henry-Labordere, P., and Penkner, F. (2013). Model-independent bounds for option prices – a mass transport approach. Finance and Stochastics, 17:477–501.
- Beiglböck and Juillet, (2016) Beiglböck, M. and Juillet, N. (2016). On a problem of optimal transport under marginal martingale constraints. Annals of Probability, 44(1):42–106.
- Beiglböck et al., (2019) Beiglböck, M., Lim, T., and Obłój, J. (2019). Dual attainment for the martingale transport problem. Bernoulli, 25(3):1640–1658.
- Beiglböck et al., (2017) Beiglböck, M., Nutz, M., and Touzi, N. (2017). Complete duality for martingale optimal transport on the line. The Annals of Probability, 45(5):3038–3074.
- Bertsekas and Shreve, (1978) Bertsekas, D. and Shreve, S. E. (1978). Stochastic optimal control: The discrete-time case. Academic Press.
- Bogachev, (2007) Bogachev, V. I. (2007). Measure Theory, volume II. Springer Science & Business Media.
- Bogachev, (2022) Bogachev, V. I. (2022). Kantorovich problems with a parameter and density constraints. Siberian Mathematical Journal, 63(1):34–47.
- Breeden and Litzenberger, (1978) Breeden, D. T. and Litzenberger, R. H. (1978). Prices of state-contingent claims implicit in option prices. Journal of Business, pages 621–651.
- Brückerhoff and Juillet, (2022) Brückerhoff, M. and Juillet, N. (2022). Instability of martingale optimal transport in dimension . Electronic Communications in Probability, 27(none):1 – 10.
- Davis and Hobson, (2007) Davis, M. H. and Hobson, D. G. (2007). The range of traded option prices. Mathematical Finance, 17(1):1–14.
- De Marco and Henry-Labordere, (2015) De Marco, S. and Henry-Labordere, P. (2015). Linking vanillas and VIX options: A constrained martingale optimal transport problem. SIAM Journal on Financial Mathematics, 6(1):1171–1194.
- Eckstein et al., (2021) Eckstein, S., Guo, G., Lim, T., and Obłój, J. (2021). Robust pricing and hedging of options on multiple assets and its numerics. SIAM Journal on Financial Mathematics, 12(1):158–188.
- Eckstein and Pammer, (2024) Eckstein, S. and Pammer, G. (2024). Computational methods for adapted optimal transport. The Annals of Applied Probability, 34(1A):675–713.
- Fahim and Huang, (2016) Fahim, A. and Huang, Y.-J. (2016). Model-independent superhedging under portfolio constraints. Finance and Stochastics, 20:51–81.
- Fengler, (2009) Fengler, M. R. (2009). Arbitrage-free smoothing of the implied volatility surface. Quantitative Finance, 9(4):417–428.
- Guo and Obłój, (2019) Guo, G. and Obłój, J. (2019). Computational methods for martingale optimal transport problems. The Annals of Applied Probability, 29(6):3311–3347.
- Guyon et al., (2017) Guyon, J., Menegaux, R., and Nutz, M. (2017). Bounds for VIX futures given S&P 500 smiles. Finance and Stochastics, 21:593–630.
- Hobson and Neuberger, (2012) Hobson, D. and Neuberger, A. (2012). Robust bounds for forward start options. Mathematical Finance, 22(1):31–56.
- Korman and McCann, (2015) Korman, J. and McCann, R. (2015). Optimal transportation with capacity constraints. Transactions of the American Mathematical Society, 367(3):1501–1521.
- Korman et al., (2015) Korman, J., McCann, R. J., and Seis, C. (2015). An elementary approach to linear programming duality with application to capacity constrained transport. Journal of Convex Analysis, 22(3):797–808.
- Lassalle, (2013) Lassalle, R. (2013). Causal transference plans and their Monge-Kantorovich problems. arXiv preprint arXiv:1303.6925.
- Lütkebohmert and Sester, (2019) Lütkebohmert, E. and Sester, J. (2019). Tightening robust price bounds for exotic derivatives. Quantitative Finance, 19(11):1797–1815.
- McCormick, (1976) McCormick, G. P. (1976). Computability of global solutions to factorable nonconvex programs: Part I – Convex underestimating problems. Mathematical Programming, 10(1):147–175.
- Pflug and Pichler, (2012) Pflug, G. C. and Pichler, A. (2012). A distance for multistage stochastic optimization models. SIAM Journal on Optimization, 22(1):1–23.
- Pflug and Pichler, (2014) Pflug, G. C. and Pichler, A. (2014). Multistage stochastic optimization, volume 1104. Springer.
- Rogers and Williams, (1993) Rogers, L. and Williams, D. (1993). Diffusions, Markov Processes, and Martingales: Volume 1, Foundations. Cambridge University Press.
- Sherali and Fraticelli, (2002) Sherali, H. D. and Fraticelli, B. M. (2002). Enhancing RLT relaxations via a new class of semidefinite cuts. Journal of Global Optimization, 22:233–261.
- Strassen, (1965) Strassen, V. (1965). The existence of probability measures with given marginals. The Annals of Mathematical Statistics, 36(2):423–439.
- Strasser, (1985) Strasser, H. (1985). Mathematical theory of statistics: Statistical experiments and asymptotic decision theory, volume 7. Walter de Gruyter.
- Villani, (2003) Villani, C. (2003). Topics in Optimal Transportation. Number 58. American Mathematical Society.
- Wiesel, (2023) Wiesel, J. (2023). Continuity of the martingale optimal transport problem on the real line. The Annals of Applied Probability, 33(6A):4645–4692.
Appendix A Proofs of results
Proof of Proposition 3.2.
Define with . It is worth noting that is defined -almost surely. A coupling is causal if and only if
(A.1) |
for all and all defined with . We emphasize that (A.1) holds -almost surely, and the conditional kernel depends on the choice of since the joint distribution of is not fixed.
Proof of Theorem 4.5.
If we substitute the cost with , it only introduces a finite constant to the original problem, thanks to the finite first moments and marginal conditions. Consequently, we can assume henceforth.
We endow with the weak topology, where converges to in the weak topology if and only if
Since every function in is bounded by the same integrable function , Bogachev, (2007, Theorem 4.7.20 (v)) shows that has a compact closure under the weak topology of . If we can demonstrate that is closed, then it is also compact.
Consider a sequence of functions in that converges to in the weak topology of . This convergence is expressed as:
First, select with a measurable set in . Since it holds that
we can obtain that -a.e. Similarly, we can show , -a.e. Next, consider . Then, has the marginal on . Other marginal constraints can also be verified. To establish that satisfies the martingale constraint, we observe that
where is a compact set in , and . With a given compact set , the first term converges to zero since and are bounded on . The second term is bounded by since and have marginals with finite first moments. When the side width , we have . Consequently, also satisfies the martingale constraint. In summary, the limit , and the compactness follows.
The proof of the lower semicontinuity for the objective is standard. When is bounded, the functional
is continuous, following from the definition of weak topology. In the case where is nonnegative but unbounded, there exists a sequence of bounded measurable functions converging increasingly to . Using the monotone convergence theorem, we have
demonstrating that is a supremum of continuous functionals and, consequently, l.s.c. The claim then follows as the infimum is attained for an l.s.c. functional on a compact set. ∎
Proof of Lemma 4.6.
Similarly to the proof of Villani, (2003, Proposition 1.22), we can substitute the cost for for an arbitrarily large . With the finite first moments and marginal conditions, this modification adds the same finite constant to both sides of the strong duality. The result for the modified cost implies the result for the original cost function . Hence, we proceed with the assumption in the subsequent analysis.
Define the functionals and as follows:
(A.5) | ||||
and
(A.6) | ||||
is well-defined. Specifically, if for all and , then and with a constant ; see Villani, (2003, Section 1.1, p.27) for the same argument. Analogously, is also well-defined.
To verify the assumptions in Fenchel-Rockafellar duality theorem (Villani,, 2003, Theorem 1.9), we observe that and are finite at . Moreover, under conditions and , it follows that and . Consequently, , and therefore is continuous at .
A direct calculation yields
where in the last equality, we substitute with .
A challenge arises due to the fact that the topological dual of , denoted as , is greater than , which represents the set of all finite signed measures. To calculate the Legendre-Fenchel transforms of and , we take advantage of Villani, (2003, Lemmas 1.24 and 1.25) to deal with continuous linear functionals on . In addition, we note that the topological dual of the product space is the product of dual spaces, namely .
With linear forms , the Legendre-Fenchel transform of is defined as follows:
where denotes the duality bracket (Villani,, 2003, p. xiv).
Next, we identify the conditions for and compute . If there exists satisfying , then we can choose and send . Similarly, if there exists and , then we can consider and with . Therefore, only when is nonnegative and is nonpositive.
Furthermore, the Legendre-Fenchel transform of is given by
To avoid , we first need
(A.7) |
It is noteworthy that should be nonnegative, as argued above. Moreover, also acts continuously on the subset of , where denotes the set of all continuous functions approaching 0 at infinity. Since the topological dual of is , there exists a unique measure such that
We can then decompose , where is a continuous linear functional with
Under the condition (A.7), Villani, (2003, Lemma 1.25) shows that , and is a nonnegative measure with Borel marginals and .
Another requirement to avoid is
(A.8) |
Since is nonpositive and is a measure, then
(A.9) |
Together with (A.8), setting yields
(A.10) |
This implies that is absolutely continuous with respect to the Lebesgue measure on the Borel -algebra of . Otherwise, suppose that the opposite is true. Then, there exists a Borel set such that and . Since is regular (Bertsekas and Shreve,, 1978, Proposition 7.17), there exists a closed subset and for a small . Similar to the proof of Bertsekas and Shreve, (1978, Proposition 7.18), let with a sufficiently large . By Urysohn’s lemma, there exist continuous functions such that on and on . Then
which implies
leading to a contradiction with . By the Radon-Nikodým theorem (Rogers and Williams,, 1993, Theorem 9.3, p.98), the Radon-Nikodým density exists:
Furthermore, (A.10) implies Otherwise, suppose , where . is a Borel set. Since is regular, there exists a closed subset and . Introducing continuous and bounded functions similarly as before, (A.10) and the dominated convergence theorem lead to
which contradicts the fact that on and .
In summary, requires that is a probability measure in with density and is nonpositive. Clearly, the reverse direction is also true: If is a probability measure in with density and is nonpositive, then .
Moreover, when is a probability measure in with density and is nonpositive, it follows that and
because the l.s.c. cost can be approximated pointwise by a monotonically increasing sequence of continuous and bounded functions. Then the equality above follows from the monotone convergence theorem. Therefore, the right-hand side of the Fenchel-Rockafellar duality reduces to
We have proved the strong duality (4.20), albeit with continuous and bounded . In the case, the claim follows from
where the last inequality is a result of weak duality, similar to Villani, (2003, Proposition 1.5). In addition, in the first equality, the notation indicates that are also continuous and bounded. ∎
Proof of Theorem 4.7.
Similarly to the proof of Lemma 4.6, we can assume the cost .
Consider a continuous and bounded cost . Maximization in the dual problem can be achieved in two steps. First, we maximize over when other multipliers are given. This reduces to a dual problem with the upper capacity constraint in Lemma 4.6, but with a new cost given by
(A.11) | ||||
On the basis of our assumptions, we have . Hence, is continuous, and . An application of Lemma 4.6 leads to
(A.12) | ||||
(A.13) | ||||
(A.14) | ||||
(A.15) |
(A.14) follows from the minimax theorem, given in Strasser, (1985, Theorem 45.8, p. 239) or Beiglböck et al., (2013, Theorem 3.1). In fact, is compact in the weak topology of . Since has a linear growth rate and the marginals have finite first moments, we can also prove that the objective is continuous in using the weak topology of , similar to the approach in Beiglböck et al., (2013, Lemma 2.2).
The final equality (A.15) is derived from the observation that any violation of the martingale condition, the McCormick relaxations, or the lower bound results in an infinite value.
In the case of a nonnegative and l.s.c. cost function , the strong duality can be established through a standard approximation argument. For example, see the last part of the proof presented in Beiglböck et al., (2013, Theorem 1.1). ∎