This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.


The Limits of Price Discrimination Under Privacy Constraints

Alireza Fallah University of California, Berkeley. [email protected]    Michael I. Jordan University of California, Berkeley. [email protected]    Ali Makhdoumi Fuqua School of Business, Duke University. [email protected]    Azarakhsh Malekian Rotman School of Management, University of Toronto. [email protected]
(June 2024)
Abstract

We consider a producer’s problem of selling a product to a continuum of privacy-conscious consumers, where the producer can implement third-degree price discrimination, offering different prices to different market segments. Our privacy mechanism provides a degree of protection by probabilistically masking each market segment. We establish that the resultant set of all consumer-producer utilities forms a convex polygon, characterized explicitly as a linear mapping of a certain high-dimensional convex polytope into 2\mathbb{R}^{2}. This characterization enables us to investigate the impact of the privacy mechanism on both producer and consumer utilities. In particular, we establish that the privacy constraint always hurts the producer by reducing both the maximum and minimum utility achievable. From the consumer’s perspective, although the privacy mechanism ensures an increase in the minimum utility compared to the non-private scenario, interestingly, it may reduce the maximum utility. Finally, we demonstrate that increasing the privacy level does not necessarily intensify these effects. For instance, the maximum utility for the producer or the minimum utility for the consumer may exhibit nonmonotonic behavior in response to an increase of the privacy level.

1 Introduction

Price discrimination has long been a topic of considerable interest in economics and computer science. At its core, price discrimination involves a seller’s ability to charge different prices to different consumers for the same product or service. This strategy, underpinned by the seller’s understanding of consumer preferences, can lead to maximized profits and a more efficient allocation of resources. While economically rational from a business perspective, price discrimination raises complex questions regarding fairness, market power, and consumer welfare.

Simultaneously, consumer privacy has become a growing concern in the digital age. With the increasing ability of companies to collect, analyze, and utilize vast amounts of personal data, the protection of consumer privacy has become a major issue for consumers, companies, and regulators. Privacy concerns are not just limited to the safeguarding of personal information but also encompass how this information is used in market practices, including pricing strategies.

Thus, the intersection of price discrimination and consumer privacy is a timely and intriguing topic. One might initially assume that these two concepts are at odds. Price discrimination, by its very nature, relies on detailed information about consumers to tailor prices effectively. In contrast, an emphasis on consumer privacy could limit the ability of sellers to implement price discrimination strategies. However, consumers can benefit from price discrimination, and thus, this intersection warrants a more nuanced examination. Understanding how privacy constraints affect the ability of firms to engage in price discrimination helps in understanding existing market dynamics and also aids in shaping policies that protect consumer interests.

Our study of the intersection of price discrimination and privacy builds on prior work of Bergemann et al. [2015]. Let us first describe their model and then explain how things change if we consider consumers’ privacy concerns. Consider a monopolist who is selling a product to a continuum of consumers. If the producer knows the values of consumers, then they could engage in first-degree price discrimination to maximize their utility, while the utility of consumers would be reduced to zero. On the other hand, if the producer has no information besides a prior distribution about the value of consumers, then they can only enact uniform pricing. This leads to two pairs of producer utilities and consumer utilities. Bergemann et al. [2015] delineate the set of all possible pairs by using different segmentations. Let us clarify this approach with an example. Imagine a producer aiming to set a product’s price for its customers across the United States. In this context, the aggregated market is defined as the entire US market, meaning the distribution of consumers’ valuations nationwide. Each zip code within the US represents its own market, which is basically the distribution of the consumers’ values in that zip code. Segmentation, in this case, refers to the division of the aggregated US market into these specific markets. Generally speaking, a segmentation involves breaking down the aggregated market into smaller markets, under the condition that these smaller markets must add up to the aggregated market. Bergemann et al. [2015] establish that the set of all possible pairs of consumer and producer utilities is a triangle as depicted in Figure 1. The point QQ corresponds to the first-degree price discrimination, and the line TRTR represents all possible consumer utilities when the producer utility is at its minimum, equating to the utility derived from the uniform pricing.

Refer to caption
Figure 1: The surplus triangle of Bergemann et al. [2015]

For a given market, knowing the underlying distribution of consumers’ values reveals information about the consumers in that market. Take, for instance, the consumers residing in a certain zip code. Knowing the distribution of the consumers in that zip code induces privacy costs for a multitude of reasons, including potential profiling, harmful targeting, potential manipulation, and unfair business practices. In this paper, we aim to explore how limiting information disclosure about the market affects the potential combinations of consumer and seller utilities. We conceptualize privacy leakage as the extent to which revealing a message about the value distribution within a market segment reveals the actual distribution of values of consumers in that segment. This measure quantifies the change in our understanding of a segment’s distribution following the receipt of a specific message. More precisely, we require that the distribution of consumer values in each market must go through a privacy mechanism that masks the true market and outputs a uniformly sampled market with some probability β\beta and outputs the true market with the complementary probability 1β1-\beta. Our privacy mechanism is motivated by a market-level randomized response mechanism—where randomized response is a well-known technique in the privacy literature, commonly utilized for the privatization of discrete-valued data [see, e.g., Warner, 1965, Greenberg et al., 1969]. Additionally, as we will highlight in the next section when presenting our model, our privacy mechanism is methodology used in practice in technology companies and government agencies when aggregate statistics such as histograms are published in a manner that provides privacy guarantees Desfontaines [2021]. These methods ensure that a function reported over users’ data does not significantly depend on any individual user’s data, thereby safeguarding personal privacy Dwork et al. [2014].

In this paper, we operate within a Bayesian setting where we assume that the producer holds a prior for every market and updates it after observing the output of the privacy mechanism. To avoid any unaccounted privacy loss, it is important to choose the least informative prior. Intuitively, under the principle of indifference, a reasonable choice of prior belief for the producer would be the uniform distribution over the entire space of market distributions. Indeed, the question of selecting an uninformative prior has been the subject of extensive literature in Bayesian analysis, where reference priors are a leading framework Bernardo [1979]. As we will demonstrate, the uniform distribution is the reference prior in our setting. Given this Bayesian framework, we ask the following question:

Given the privacy constraint, what set of consumer and producer utility pairs can be attained through all possible market segmentations?

Notice that by increasing β\beta, we strengthen the privacy guarantees. The case of β=1\beta=1 corresponds to a fully private setting in which the producer has no information about each segment or the aggregated market. It is worth emphasizing that in our setting, unlike in Bergemann et al. [2015], the producer does not know the aggregated market. This assumption is crucial when studying privacy loss, as having additional knowledge about the aggregated market would constitute unaccounted information leakage. Consider, for example, a setup with two possible values where each consumer’s value is either low or high. If the aggregated market indicates that everyone’s value is high with a probability of one, then the producer would know every segment’s market, even if β=1\beta=1 (which is supposed to deliver a full privacy guarantee). Lastly, the case of β=0\beta=0 reduces our setting to the non-private case, where the producer can see all segments without any perturbation, and hence can deduce the aggregated market as well.

Both our analysis and our results are different in significant ways from those of the non-private case. In particular, since the aggregated market is no longer known to the producer, there is no guarantee that the producer’s surplus is at least as high as profits under the uniform monopoly price (i.e., at or above the line TRTR in Figure 1). Moreover, our results allow us to resolve the question of whether the privacy constraint always benefits the consumer. In more detail, our main results and insights are as follows.

First, in the setting of two consumer values, we are able to fully characterize the space of all consumer and producer utilities. Moreover, we are able to identify nuances in this setting that we later explore in a more general setting. In particular, we establish that the set of all consumer and producer utilities is a triangle, similar to the non-private case, but the triangle takes different shapes depending on the consumer values, the aggregate market distribution, and the privacy parameter. Figure 2 illustrates our characterization for two cases that display qualitative differences compared to the setting without privacy constraints. In this figure, the set of possible producer and consumer utilities under the privacy mechanism is depicted by the blue triangle ABCABC. The triangle TQRTQR, marked by red dashed lines, represents the non-private case for the corresponding set of values and the aggregated market. We see that the privacy mechanism impacts the limits of price discrimination through a direct effect and an indirect effect. Let us describe these effects and explain how they shape the set of all possible consumer and producer utilities.

The direct effect arises because the producer observes a random market instead of the true market, with probability β\beta, as a consequence of the privacy mechanism. This means that with probability β\beta, the consumer and producer utilities are independent of the segment’s markets. The direct effect implies a shift in the space of consumer and producer utilities. Considering Figure 2, the direct effect implies that the minimum consumer utility, as opposed to the non-private case, is no longer zero; instead, there is a non-zero lower bound on the consumer utility (line AB versus line QT). It also implies that the sum of consumer and producer utilities is smaller than that of the non-private case (line BC versus line QR).

To understand the indirect effect, notice that with probability 1β1-\beta, the producer observes the true market. However, the producer does not know whether or not this observed market is the truth and, therefore, adopts a different pricing strategy compared to the non-private case. The indirect effect refers to this change in the producer pricing strategy. Notably, the producer’s pricing strategy is optimal from their perspective, taking into account the privacy mechanism, but it does not correspond to the optimal pricing strategy in the non-private case. To better illustrate the impact of this indirect effect, let us revisit the non-private case (as depicted by the triangle TQRTQR in Figure 2). Imagine a segmentation for which the producer is indifferent between several prices for certain segments. In such a case, although different price choices do not alter the producer’s utility, they can lead to different consumer utility. For example, every point along the line TRTR represents the same level of utility for the producer. However, selecting various prices can minimize or maximize the consumer utilities (corresponding to points TT and RR, respectively). In the private setting, however, various pricing options that make the producer indifferent ex-ante (i.e., yielding the same expected utility when considering the privacy mechanism) might actually result in different ex-post producer utilities. Consequently, opting for different prices alters both the consumer and producer utilities. This is the reason why, with privacy constraints, the line ACAC is no longer parallel to the x-axis. This difference is more pronounced as we consider K>2K>2 values (see Figure 3) where the lower limit of the set of all possible consumer and producer utilities, in sharp contrast to the non-private case, is not a straight line.

The above effects and the corresponding set of all possible consumer and producer utilities are visualized in Figure 2: the triangle ABDABD represents the non-private triangle TQRTQR after undergoing the scaling and shifting attributable to the direct impact of privacy. The discrepancy between triangles ABCABC and ABDABD thus precisely indicates the indirect effect that arises from the change in the producer’s pricing policy due to the privacy mechanism.

Refer to caption
Refer to caption
Figure 2: Illustration of our characterization of the set of all consumer and producer utilities under a privacy mechanism (the triangle ABCABC) with a comparison to the setting without privacy (the triangle TQRTQR). Both panels correspond to a setting with two consumer values: high and low. The left panel corresponds to a setting where high and low consumer values are far apart. The right panel corresponds to a setting where high and low consumer values are close, and the privacy parameter is small.

Second, our characterization yields the following insights.

Insight 1: Imposing the privacy constraint hurts the producer by decreasing both its minimum and maximum utility across all segmentations. This is intuitive because after adopting the privacy mechanism, the producer knows less about the consumer’s values and, therefore, cannot extract as much surplus as in the non-private case.

Insight 2: Imposing the privacy constraint helps the consumers by increasing their minimum utility. However, interestingly, it can hurt consumers by decreasing their maximum utility (compare, for example, the consumer utility in points CC and RR in the two panels of Figure 2). This first phenomenon aligns with the direct impact of the privacy mechanism, as previously elaborated. Because the producer does not always observe the true market, they may sometimes choose a suboptimal price, resulting in a positive expected utility for consumers. The second phenomenon is more nuanced. We observe that the introduction of a privacy mechanism can influence the producer’s market pricing strategy, leading to more conservative or riskier decisions depending on the consumers’ values. This effect arises because the privacy mechanism reduces the informativeness of the market observed by the producer. On one hand, in cases where there is a significant gap between high and low consumer values, the producer finds the risk of setting higher prices justified. This shift towards riskier pricing strategies is the indirect impact of privacy we discussed earlier. Under such circumstances, if the actual market includes many low-value consumers, selecting higher prices could detrimentally affect consumer utility. On the other hand, when consumer values are relatively similar, the producer might opt for a lower, safer pricing strategy, avoiding “unnecessary” risks. Now, if the true market is composed largely of high-value consumers, this conservative approach can lead to increased utility for consumers compared to the non-private case, thereby increasing maximum consumer utility.

Insight 3: As we discussed above, imposing a privacy constraint decreases both the minimum and maximum producer utility while it increases the minimum consumer utility. One may conjecture that these changes amplify as the privacy parameter increases. However, interestingly, we prove that an increase in the privacy parameter β\beta does not necessarily decrease the producer’s maximum utility and, similarly, does not necessarily increase the minimum consumer utility. Put differently, while imposing the privacy constraint always reduces the maximum producer utility, more privacy does not necessarily lead to a more reduction in this utility. Similarly, although the privacy mechanism ensures a non-zero minimum utility for the consumer, this minimum does not always increase with a higher privacy parameter. The intuition behind these nuanced non-monotonicities is similar to Insight 2, discussed above.

Third, we establish that our main insights carry over to the general setting with K>2K>2 consumer values. Notably, the set of all consumer and producer utilities pairs is no longer a triangle, in sharp contrast to that of the non-private case. Instead, it is a convex polygon (see Figure 3 illustration for one example and how it differs from the non-private case). Similar to the case of K=2K=2, we identify that the privacy mechanism introduces both direct and indirect effects on the set of possible consumer and producer utilities. The direct impact implies a combination of scaling and shifting, transforming the triangle TQRTQR into triangle ABDABD as depicted in Figure 3. The indirect effect of privacy, however, changes the shape of the utility set from a triangle to a convex polygon. More precisely, we illustrate that this set can be represented as a linear mapping of a convex polytope in K2\mathbb{R}^{K^{2}} into a 2-dimensional space. This representation enables us to extend all the insights above from the case with K=2K=2 to the general case with K2K\geq 2. Our analysis also provides the construction of a segmentation to achieve any feasible point for consumer and producer utilities.

Refer to caption
Figure 3: Illustration of the set of possible consumer and producer utilities (the solid blue area) and its comparison with the non-private case (the triangle TQRTQR) for an example with K=5K=5 values.

Furthermore, our characterization allows us to identify several key properties of the polygon that delineate the limits of price discrimination under privacy constraints. One that we highlight here is regarding the impact of privacy on first-degree price discrimination. In the non-private scenario, point QQ in Figure 3 represents perfect discrimination utilities, where each segment contains consumers of identical value. Now, let us consider the same segmentation and see how the privacy mechanism impacts the producer’s utility (i.e., point B in Figure 3). First, notice that because of the direct impact of the privacy mechanism (that masks segments with probability β\beta), the producer’s utility decreases to another point. The question arises: does the indirect effect of privacy further decrease the producer’s utility? In other words, the question is whether the privacy mechanism can influence the producer’s pricing strategy to such an extent that, even when they observe a market comprising consumers with similar values, they set a different price for that market. Interestingly, we show that this scenario is indeed possible. We specifically identify a threshold for the privacy parameter β\beta. Once this threshold is surpassed, it prompts the producer to avoid using certain values for market pricing, as other values yield higher expected utility. Below this threshold, however, the indirect effect of privacy does not further decrease the producer’s utility.

From a technical point of view, to characterize the set of all possible consumer and producer utilities under privacy constraints, we need novel techniques and analysis compared to the existing ones in the literature. Notably, the analysis in Bergemann et al. [2015] relies heavily on the premise that producer utility is bounded below by optimal uniform pricing. This assumption, however, does not hold under privacy constraints since the aggregated market remains undisclosed to the producer. Furthermore, their argument is built on characterizing extremal markets—markets wherein the producer is indifferent between choosing any value within the support of the distribution of the consumer values. In contrast, our argument employs a merging technique, effectively simplifying the problem to an examination of segmentations where, for each value, no more than one segment is priced at that value. This approach not only makes our analysis applicable to the private case but also builds a framework that easily extends to the general setting with multiple consumer values.

1.1 Related work

Our paper relates to the literature on price discrimination. Earlier works on this topic include Robinson [1969], Schmalensee [1981], Varian [1985], Aguirre et al. [2010], Cowan [2016]. More recent papers on this topic include Roesler and Szentes [2017] who characterize the buyer-optimal signaling, Bergemann et al. [2015] who prove that it is possible to achieve any total surplus and its division between consumers and the producer through segmentation, Haghpapanah and Siegel [2019] who characterize the consumer optimal segmentation, and Elliott et al. [2021] who study how a platform’s information design can enable the division of surplus between the consumer and the seller.

The closest paper to ours is Bergemann et al. [2015], which characterizes the set of all possible consumer and producer utilities achievable by segmentations. We build on the setting of this paper by asking what happens if we impose privacy constraints. As discussed above, this fundamentally changes the results and also requires different analyses. This work has motivated several follow-up works that consider either extensions or variations of it. For instance, a similar problem in multiproduct settings has been studied in Haghpanah and Hartline [2021], Haghpanah and Siegel [2022], and Haghpanah and Siegel [2023]. Consumer profiling via information design has been studied in Fainmesser et al. [2023]. Robust price discrimination has been studied in Arieli et al. [2024]. A unified approach to second and third degree price discrimination has been studied in Bergemann et al. [2024]. Extension to randomized auctions (possibly over a menu of large size) has been studied in Ko and Munagala [2022], and price discrimination with fairness considerations has been studied in Banerjee et al. [2024]. To the best of our knowledge, the limits of price discrimination under privacy constraints have not been studied in the literature.111Segmentation can be viewed as a Bayesian persuasion problem as introduced by Kamenica and Gentzkow [2011]. We refer to Bergemann et al. [2015] for a detailed description of this connection.

Our paper also relates to the literature on privacy in markets and platforms. Acquisti et al. [2016] survey the empirical and theoretical works that study the economic value and consequences of protecting and disclosing personal information on consumers and producers. More recently, Posner and Weyl [2018] explore the consequences of consumers sharing data with online platforms, and Jones and Tonetti [2020] study the consequences of data non-rivalry (i.e., consumer data can be used by many producers simultaneously) for the consumers and the producers. Acemoglu et al. [2022], Bergemann et al. [2020], and Ichihashi [2020] study the privacy consequences of data externality, whereby a user’s data reveals information about others, Acemoglu et al. [2023a] study the platform’s optimal architecture that achieves the Pareto frontier of user privacy level and platform estimation error, and Acemoglu et al. [2023b] develop an experimentation game to study the harms of the platform’s information advantage in product offering. Finally, privacy and personal data in financial markets have been studied in Farboodi and Veldkamp [2023].

More closely related to ours are the works at the intersection of privacy and price discrimination. In this regard, Ali et al. [2020] study the implications of giving consumers control over their data and the ability to choose how producers access their data. They investigate whether such regulations improve consumer utility and find that consumer control can guarantee gains for every consumer type relative to both perfect price discrimination and no personalized pricing. Hidir and Vellodi [2021] characterize the consumer optimal market segmentation compatible with the consumer’s incentives to reveal their preferences voluntarily. They further investigate the implication of their results for consumer privacy and price discrimination in online retail. We depart from this literature by characterizing the set of all possible consumer and producer utilities achievable by market segmentations under privacy constraints.222Our paper is also related to the literature on differential privacy introduced in Dwork et al. [2006a] and Dwork et al. [2006b] (see Dwork et al. [2014] for a survey). In particular, our privacy mechanism can be viewed as a variation of the randomized response that is used to guarantee differential privacy for discrete value data points (see, e.g., Erlingsson et al. [2014]).

The rest of the paper proceeds as follows. In Section 2, we present our model and introduce the problem formulation. In Section 3, we focus on a setting with K=2K=2 consumer values and characterize the set of all possible consumer and producer utilities that can be achieved by segmentation. We then discuss the insights we obtain from our analysis and, in particular, whether imposing privacy constraints helps or hurts the producer and the consumer. In Section 4, we show how our main results and insights continue to hold for the general case of K>2K>2 consumer values. Section 5 concludes while the Appendix presents the omitted proofs from the text.

2 Model

We build our model upon the one proposed by Bergemann et al. [2015]. In particular, we consider a producer’s problem of pricing a product for a continuum of consumers represented by the interval Ω:=[0,1]\Omega:=[0,1]. We normalize the cost of production to zero and assume that each consumer’s value for the product belongs to the set 𝒱:={v1,v2,,vK}\mathcal{V}:=\{v_{1},v_{2},\cdots,v_{K}\} for some KK\in\mathbb{N}, where vK>>v2>v1>0v_{K}>\cdots>v_{2}>v_{1}>0. The value of a consumer is denoted by a random variable V:Ω𝒱V:\Omega\to\mathcal{V}. We assume that a consumer purchases the product if the offered price is lower than or equal to their value.

Markets:

A market xx is a distribution over 𝒱\mathcal{V}, and hence, the set of all markets is the simplex over 𝒱\mathcal{V}, given by

𝒳:=Δ(𝒱)={x:𝒱0|k=1Kx(vk)=1}.\mathcal{X}:=\Delta(\mathcal{V})=\left\{x:\mathcal{V}\to\mathbb{R}_{\geq 0}~{}\big{|}~{}\sum_{k=1}^{K}x(v_{k})=1\right\}. (1)

We denote the aggregated market, which corresponds to the distribution of VV, by xx^{*}.

Segmentation:

A segmentation is a partitioning of the aggregated market xx^{*} into different markets. We illustrate a segmentation using a distribution σ\sigma over the space of all markets, 𝒳=Δ(𝒱)\mathcal{X}=\Delta(\mathcal{V}), as defined in equation (1). For any x𝒳x\in\mathcal{X} within the support of σ\sigma, we have a segment in the partitioning of the aggregated market which its corresponding market is xx and its population size is σ(x)\sigma(x). The set of all segmentations is given by333As our proofs show, we can limit our attention to partitions comprising finitely many segments, i.e., distributions σ\sigma with finite support, without loss of generality.

Σ:={σΔ(𝒳)|xsupp(σ)σ(x).x=x,|supp(σ)|<}.\Sigma:=\left\{\sigma\in\Delta(\mathcal{X})~{}\big{|}~{}\sum_{x\in\text{supp}(\sigma)}\sigma(x).x=x^{*},|\text{supp}(\sigma)|<\infty\right\}. (2)

Privacy mechanism:

A privacy mechanism \mathcal{M} is a mapping from the set of all markets to itself, i.e., :𝒳𝒳\mathcal{M}:\mathcal{X}\to\mathcal{X}. More specifically, let xsupp(σ)x\in\text{supp}(\sigma) for some segmentation σΣ\sigma\in\Sigma. To take the privacy considerations into account, we assume the producer observes (x)\mathcal{M}(x) (instead of the true xx), which is a private version of the market xx. In particular, we focus on the following class of mechanisms, parameterized by β[0,1]\beta\in[0,1]:

β(x):={x with probability 1βUnif(𝒳) with probability β,\mathcal{M}_{\beta}(x):=\begin{cases}x&\text{ with probability }1-\beta\\ \text{Unif}(\mathcal{X})&\text{ with probability }\beta,\end{cases} (3)

where Unif(𝒳)\text{Unif}(\mathcal{X}) represents drawing a market from 𝒳\mathcal{X} uniformly at random. One can interpret β()\mathcal{M}_{\beta}(\cdot) as a mechanism that returns the true market with probability 1β1-\beta but masks it with the complimentary probability β\beta (and outputs a uniform noise instead). Notice that for β=0\beta=0, our problem becomes identical to that of Bergemann et al. [2015], which does not consider privacy. Moreover, as β\beta increases, one can learn less about the distribution of consumer values in a market.

Our privacy mechanism can be linked to the randomized response mechanism, albeit applied at the market level. The randomized response is a well-known technique in the privacy literature, commonly utilized for making a variable with discrete values private [Warner, 1965, Greenberg et al., 1969]. This mechanism maintains the true value of the variable with a certain probability while substituting it with a uniformly chosen random variable from the variable’s finite range with the complementary probability. Our approach is similar, with the distinction being that the market space is infinite. Moreover, applying definitions from the privacy literature, we can quantify the privacy-leakage associated with the mechanism β()\mathcal{M}_{\beta}(\cdot). Let us formalize this connection next.

Definition 1.

[Rassouli and Gündüz [2019]] The privacy-leakage about random variable XX by revealing YY is

𝔼Y[TV(pX|Y(|Y),pX())],\displaystyle\mathbb{E}_{Y}\left[\mathrm{TV}\left(p_{X|Y}(\cdot|Y),p_{X}(\cdot)\right)\right],

where pX()p_{X}(\cdot) and pX|Y(|Y)p_{X|Y}(\cdot|Y) are the probability density function and the conditional probability density function of XX, respectively, and TV(,)\mathrm{TV}(\cdot,\cdot) denotes the total variation distance between two random variables.

Proposition 1.

The privacy-leakage (defined in Definition 1) of our masking mechanism β()\mathcal{M}_{\beta}(\cdot) is equal to 1β1-\beta.

This privacy-leakage of our masking mechanism is maximized when β=0\beta=0 and is reduced to zero when β=1\beta=1, where the mechanism returns pure noise (see Rassouli and Gündüz [2019] for further discussion on the relationship between this measure and other definitions of privacy loss).

Our privacy mechanism privatizes the distribution of values, i.e., the market, and is therefore closely related to the literature on differentially private histograms, with applications ranging from limiting inferences about the prevalence of a disease in a group of people to protecting password frequency lists (see for instance Xu et al. [2013], Suresh [2019]), and with fielded applications in companies and agencies such as Apple, Google, and the United States Census Bureau Apple , Abowd [2018]. Briefly, a randomized algorithm, as a function of users’ data, is considered differentially private if the distribution of the algorithm’s output does not change significantly by altering the data of one user. The motivation behind this definition is that if the output is not overly sensitive to a user’s data, it implies that we cannot learn much about a user by observing the output [Dwork et al., 2006a, b]. In Section A.2, we further discuss differential privacy and its connection to the privacy mechanism in (3).

Utility functions:

If we set the product’s price for a market xx equal to some p[0,)p\in[0,\infty), the producer’s utility and the consumers’ utility are given by

𝒰p(p,x):=pk=1Kx(vk)𝟙(pvk),𝒰c(p,x):=k=1Kx(vk)(vkp)𝟙(pvk).\displaystyle\begin{split}\mathcal{U}_{p}(p,x)&:=p\sum_{k=1}^{K}x(v_{k})\mathbbm{1}(p\leq v_{k}),\\ \mathcal{U}_{c}(p,x)&:=\sum_{k=1}^{K}x(v_{k})(v_{k}-p)\mathbbm{1}(p\leq v_{k}).\end{split} (4)

Producer’s belief update:

As we discussed in the introduction, the producer does not know the aggregated market or any of the segment markets. Instead, they hold a prior regarding each segment’s market and then update it using Bayes’ theorem after observing the market through the privacy mechanism. We denote the prior by π()\pi(\cdot) and the posterior after observing x^=β(x)\hat{x}=\mathcal{M}_{\beta}(x) by πβ(|x^)\pi_{\beta}(\cdot|\hat{x}). Note that both distributions are defined over the space of all markets 𝒳\mathcal{X}.

Here, we need to make a modeling assumption about the choice of prior. In particular, we want the prior to carry no information about the underlying market, as otherwise, it would mean a privacy leakage that is not accounted for. Consider, for instance, a setting with K=2K=2 values where all consumers have the higher value (v2v_{2}), i.e., x(v2)=1x^{*}(v_{2})=1. If the prior assigns a high probability to the markets where the probability of the high value is close to one, i.e., xx such that x(v2)1x(v_{2})\approx 1, then even with β=1\beta=1, the posterior would still place a high probability on markets with x(v2)x(v_{2}) close to one. This implies that the seller almost knows that most of the market comprises high-value consumers. In other words, even β=1\beta=1 fails to provide strong privacy protection. In the remainder of this paper we make the following assumption concerning the prior distribution π()\pi(\cdot).

Assumption 1.

The prior π()\pi(\cdot) is equal to Unif(𝒳)\text{Unif}(\mathcal{X}) where Unif(𝒳)\text{Unif}(\mathcal{X}) denotes the uniform distribution over the space of all markets 𝒳\mathcal{X}. In other words, π(x)=1Vol(𝒳)\pi(x)=\tfrac{1}{\text{Vol}(\mathcal{X})} for any market x𝒳x\in\mathcal{X}.

This assumption ensures that we select a least informative prior, which can be formalized variationally as the prior that maximizes the expected Kullback-Leibler (KL) divergence of posterior from the prior [Bernardo, 1979].

Definition 2.

The reference prior is the prior that maximizes the KL divergence in expectation:

argmaxπ()𝒳μβ(x^)(𝒳πβ(x|x^)log(πβ(x|x^)π(x))𝑑x)𝑑x^,\operatorname*{arg\,max}_{\pi(\cdot)}\int_{\mathcal{X}}\mu_{\beta}(\hat{x})\left(\int_{\mathcal{X}}\pi_{\beta}(x|\hat{x})\log(\frac{\pi_{\beta}(x|\hat{x})}{\pi(x)})~{}dx\right)d\hat{x}, (5)

where μβ()\mu_{\beta}(\cdot) denotes the distribution of the observed market x^\hat{x}

The reference prior in our setting is the uniform distribution over the space of markets 𝒳\mathcal{X}.

Proposition 2.

Unif(𝒳)\text{Unif}(\mathcal{X}) is the solution to (5).

In fact, if we want to choose a prior that conveys no information about the underlying market, the prior distribution is an intuitive choice, as all different markets are equally likely under such a prior. Therefore, the producer starts with Unif(𝒳)\text{Unif}(\mathcal{X}) as their prior belief, and after observing x^=β(x)\hat{x}=\mathcal{M}_{\beta}(x), they form the posterior. By applying Bayes’ theorem, it can be demonstrated that this posterior distribution has a mass of size 1β1-\beta over x^\hat{x}, and the remaining β\beta mass is uniformly distributed over 𝒳\mathcal{X}.

Optimal pricing rules:

The producer’s expected revenue from a market conditional on observing x^\hat{x} and under some price p0p\in\mathbb{R}_{\geq 0} is given by

𝒰p(p|x^):=𝔼xπβ(x|x^)[𝒰p(p,x)].\mathcal{U}_{p}(p|\hat{x}):=\mathbb{E}_{~{}x\sim\pi_{\beta}(x|\hat{x})}\left[\mathcal{U}_{p}(p,x)\right]. (6)

A price pp is called optimal for a market given observation x^\hat{x} if its corresponding expected revenue is at least as great as that of any other price:

𝒰p(p|x^)𝒰p(p|x^) for any p0.\mathcal{U}_{p}(p|\hat{x})\geq\mathcal{U}_{p}(p^{\prime}|\hat{x})\text{ for any }p^{\prime}\geq 0. (7)

It is evident that the optimal price always belongs to the set 𝒱\mathcal{V}; therefore, we will limit our search for the optimal prices to this set.

A pricing rule ϕ:𝒳Δ(𝒱)\phi:\mathcal{X}\to\Delta(\mathcal{V}) is a function where, for any observation x^\hat{x}, ϕ(x^)\phi(\hat{x}) specifies a distribution over prices. Specifically, a pricing rule ϕ\phi is considered optimal if, for each x^\hat{x}, the support of ϕ(x^)\phi(\hat{x}) consists solely of prices that are optimal given the observation x^\hat{x}.

Formulating our goal:

We now formalize our main question. Given an optimal pricing rule ϕ()\phi(\cdot), we denote the expected utility of the producer and consumer from a market x𝒳x\in\mathcal{X} by Upϕ(x)U_{p}^{\phi}(x) and Ucϕ(x)U_{c}^{\phi}(x), respectively. This expectation accounts for the randomness arising from the producer observing the private version β(x)\mathcal{M}_{\beta}(x) instead of xx and applying the (potentially random) optimal pricing rule ϕ()\phi(\cdot) based on this observation. More formally, we have

Upϕ(x):=𝔼x^β(x)[𝔼pϕ(x^)[𝒰p(p,x)]] and Ucϕ(x):=𝔼x^β(x)[𝔼pϕ(x^)[𝒰c(p,x)]].\displaystyle U_{p}^{\phi}(x):=\mathbb{E}_{\hat{x}\sim\mathcal{M}_{\beta}(x)}\left[\mathbb{E}_{p\sim\phi(\hat{x})}[~{}\mathcal{U}_{p}(p,x)]\right]\text{ and }\quad U_{c}^{\phi}(x):=\mathbb{E}_{\hat{x}\sim\mathcal{M}_{\beta}(x)}\left[\mathbb{E}_{p\sim\phi(\hat{x})}[~{}\mathcal{U}_{c}(p,x)]\right]. (8)

It is worth highlighting that the above utilities are computed with respect to the true market xx (and not as an expectation with respect to the prior Unif(𝒳)\text{Unif}(\mathcal{X})). In other words, there is a fixed market xx, and the producer has a uniform prior Unif(𝒳)\text{Unif}(\mathcal{X}) on it. This prior is updated to the posterior πβ(x|x^)\pi_{\beta}(x|\hat{x}) after observing x^β(x)\hat{x}\sim\mathcal{M}_{\beta}(x). Based on this posterior, the producer selects the optimal price pϕ(x^)p\sim\phi(\hat{x}). Now, the functions Upϕ(x)U_{p}^{\phi}(x) and Ucϕ(x)U_{c}^{\phi}(x) calculate the utilities for the producer and consumer of the original market xx when priced at pp.

Formally, our goal is to characterize the set of all possible pairs of expected utilities for both the producer and consumer across all potential segmentations and optimal pricing rules. In other words, we would like to characterize the following set:

𝒮:={[xsupp(σ)σ(x)Ucϕ(x),xsupp(σ)σ(x)Upϕ(x)]|σΣ and ϕ is an optimal pricing rule}.\mathcal{S}:=\left\{\Big{[}\sum_{x\in\text{supp}(\sigma)}\sigma(x)U_{c}^{\phi}(x),\sum_{x\in\text{supp}(\sigma)}\sigma(x)U_{p}^{\phi}(x)\Big{]}^{\top}~{}\Big{|}~{}\sigma\in\Sigma\text{ and }\phi\text{ is an optimal pricing rule}\right\}. (9)

3 Price discrimination limits under privacy: the case of K=2K=2

We begin by examining the simpler scenario where the consumer’s valuation is limited to two values, i.e., K=2K=2, to gain insights into the limits of price discrimination under privacy constraints. In this scenario, the set of all markets, 𝒳\mathcal{X}, is, in fact, the set of Bernoulli distributions, which can be parameterized by the probability of the high value, v2v_{2}, denoted by α[0,1]\alpha\in[0,1]. Throughout this section, we use xx and α\alpha interchangeably to represent the market. In particular, α\alpha^{*} denotes the probability of the high value for the aggregated market xx^{*}. We also define η\eta as the ratio of the lower value to the higher value, i.e.,

η:=v1v21.\eta:=\frac{v_{1}}{v_{2}}\leq 1. (10)

We first characterize the optimal price conditioning on the observed market x^\hat{x}.

Lemma 1.

Suppose the producer observes x^=(1α^,α^)\hat{x}=(1-\hat{\alpha},\hat{\alpha}), where α^\hat{\alpha} denotes the observed probability of the high value v2v_{2}. Define the threshold tt^{*} as:

t:=ηβ/21β.t^{*}:=\frac{\eta-\beta/2}{1-\beta}. (11)

Then, the producer should set the price to v1v_{1} (i.e., it is the optimal price) if and only if α^t\hat{\alpha}\leq t^{*}, and v2v_{2} is the optimal price otherwise.

Note that when 2ηβ2\eta\leq\beta, this lemma implies that the optimal price becomes v2v_{2}, regardless of the observation x^\hat{x}. Similarly, when β2(1η)\beta\geq 2(1-\eta), the optimal price is v1v_{1}. Note that when the producer sets the price to the high value v2v_{2}, only consumers with the high value will purchase the product, resulting in their expected utility being zero due to the absence of consumer surplus. Conversely, when the producer opts for the low value v1v_{1} as the price, all consumers—regardless of their valuation—will purchase the product. This pricing strategy allows consumers with the high value to achieve a positive expected surplus, as they gain the difference between their valuation and the lower price. Therefore, if β2η\beta\geq 2\eta, we have

𝒮={[0,v2α]T}\displaystyle\mathcal{S}=\left\{\left[0,v_{2}\alpha^{*}\right]^{T}\right\}

and if β2(1η)\beta\geq 2(1-\eta), then we have

𝒮={[(v2v1)α,v1]T}.\displaystyle\mathcal{S}=\left\{\left[\left(v_{2}-v_{1}\right)\alpha^{*},v_{1}\right]^{T}\right\}.

We next focus on the more interesting scenario in which the following assumption holds.

Assumption 2.

We suppose βmin{2η,2(1η)}\beta\leq\min\{2\eta,2(1-\eta)\}.

As established in Lemma 1, in this case, the optimal pricing rule sets the price to v1v_{1} for observations α^<t\hat{\alpha}<t^{*} and v2v_{2} for α^>t\hat{\alpha}>t^{*}. The following proposition analyzes the effects of privacy mechanism on the set 𝒮\mathcal{S}, distinguishing between the direct and the indirect effects of privacy.

Proposition 3.

Suppose Assumption 2 holds. Then, 𝒮\mathcal{S} is given by

𝒮={β[tα(v2v1),tv1+(1t)αv2]+(1β)s|s𝒮},\mathcal{S}=\left\{\beta\Big{[}t^{*}\alpha^{*}(v_{2}-v_{1}),~{}t^{*}v_{1}+(1-t^{*})\alpha^{*}v_{2}\Big{]}^{\top}+(1-\beta)s^{\prime}~{}\Big{|}~{}s^{\prime}\in\mathcal{S}^{\prime}\right\}, (12)

where 𝒮\mathcal{S}^{\prime} is defined as

𝒮\displaystyle\mathcal{S^{\prime}} :={[γ1α1(v2v1),γ1v1+γ2α2v2]|\displaystyle:=\biggl{\{}\Big{[}\gamma_{1}\alpha_{1}(v_{2}-v_{1}),\gamma_{1}v_{1}+\gamma_{2}\alpha_{2}v_{2}\Big{]}^{\top}~{}\Big{|} (13)
α1[0,min{t,α}],α2[max{t,α},1],γ1 and γ2[0,1],γ1+γ2=1,γ1α1+γ2α2=α}.\displaystyle\qquad\alpha_{1}\in[0,\min\{t^{*},\alpha^{*}\}],\alpha_{2}\in[\max\{t^{*},\alpha^{*}\},1],\gamma_{1}\text{ and }\gamma_{2}\in[0,1],\gamma_{1}+\gamma_{2}=1,\gamma_{1}\alpha_{1}+\gamma_{2}\alpha_{2}=\alpha^{*}\biggr{\}}.

This result shows that the set 𝒮\mathcal{S} undergoes three distinct changes as β\beta varies. First, the set 𝒮\mathcal{S}^{\prime} is scaled by a factor 1β1-\beta. Second, its location in the plane shifts due to the vector β[tα(v2v1),tv1+(1t)αv2]\beta\Big{[}t^{*}\alpha^{*}(v_{2}-v_{1}),~{}t^{*}v_{1}+(1-t^{*})\alpha^{*}v_{2}\Big{]}^{\top} added to it. These two changes are because the producer observes a completely random market with probability β\beta and sets the price based on this random market, resulting in a constant expected utility that is independent of the true market. Lastly, the shape of 𝒮\mathcal{S} evolves due to the corresponding changes in the shape of 𝒮\mathcal{S}^{\prime}. Notice that the shape of 𝒮\mathcal{S}^{\prime} is influenced by tt^{*}, which represents the producer’s threshold for the proportion of high-value customers in order to choose the higher price. In fact, substituting tt^{*} in the definition of 𝒮\mathcal{S}^{\prime} with η\eta (the optimal threshold in the non-private case) aligns 𝒮\mathcal{S}^{\prime} precisely with the set 𝒮\mathcal{S} in the absence of privacy constraints. This last change can be interpreted as the indirect impact of the privacy mechanism, which is rooted in the producer adjusting its pricing strategy due to the noisy observation.

Also, notice that Proposition 3 suggests that it suffices to focus only on segmentations with two markets (1α1,α1)(1-\alpha_{1},\alpha_{1}) and (1α2,α2)(1-\alpha_{2},\alpha_{2}) with probabilities γ1\gamma_{1} and γ2\gamma_{2}, respectively. Moreover, the first segment’s fraction of high values, i.e., α1\alpha_{1}, is less than or equal to tt^{*}, and the other’s (i.e., α2)\alpha_{2}) is greater than or equal to tt^{*}.

In the appendix, we characterize the set 𝒮\mathcal{S}^{\prime} and show that it takes the form of a triangle, with its shape changing as β\beta varies. That said, we next formally characterize the set 𝒮\mathcal{S}, taking into account all these three factors.

Theorem 1.

Suppose Assumption 2 holds. Then, the set 𝒮\mathcal{S} is the triangle ABCABC, given by

A=β𝒄+(1β)A with A:=[0,αv2+v1(tα)+t],B=β𝒄+(1β)B with B:=[0,αv2+(1α)v1],C=β𝒄+(1β)C with C:=[α(v2v1)(v2v1)(αt)+1t,v1+(v2v1)(αt)+1t],\displaystyle\begin{split}A&=\beta\bm{c}+(1-\beta)A^{\prime}\text{ with }A^{\prime}:=\biggl{[}0,~{}\alpha^{*}v_{2}+v_{1}\frac{(t^{*}-\alpha^{*})_{+}}{t^{*}}\biggr{]}^{\top},\\ B&=\beta\bm{c}+(1-\beta)B^{\prime}\text{ with }B^{\prime}:=\biggl{[}0,~{}\alpha^{*}v_{2}+(1-\alpha^{*})v_{1}\biggr{]}^{\top},\\ C&=\beta\bm{c}+(1-\beta)C^{\prime}\text{ with }C^{\prime}:=\biggl{[}\alpha^{*}(v_{2}-v_{1})-(v_{2}-v_{1})\frac{(\alpha^{*}-t^{*})_{+}}{1-t^{*}},~{}v_{1}+(v_{2}-v_{1})\frac{(\alpha^{*}-t^{*})_{+}}{1-t^{*}}\biggr{]}^{\top},\end{split} (14)

where

𝒄=[tα(v2v1),tv1+(1t)αv2].\bm{c}=\Big{[}t^{*}\alpha^{*}(v_{2}-v_{1}),~{}t^{*}v_{1}+(1-t^{*})\alpha^{*}v_{2}\Big{]}^{\top}. (15)

To better understand this characterization, recall that, as we discussed earlier, the effect of the privacy mechanism can be decomposed into two components: a direct one and an indirect one. The direct effect is due to the market being masked with probability β\beta and the producer choosing the price based on pure noise observation. This results in a constant term in utilities that is independent of the true market, captured by the term β𝒄\beta\bm{c} in Equation (14)\eqref{eqn:ABC}.

The indirect effect of privacy arises from changes in the producer’s pricing strategy and is represented by the difference between triangle ABCA’B’C’ and the non-private case TQRTQR in Figure 1. Specifically, the lines ABA’B’ and BCB’C’ are analogous to the corresponding lines QTQT and QRQR in the non-private case, where the first represents the condition that consumer utility is non-negative, and the second shows that the sum of consumer and producer utilities is less than the maximum possible surplus. However, line ACA’C’ is where the indirect effect of privacy is observed. In the non-private case, the line TRTR is parallel to the x-axis, since the producer utility is lower bounded by the utility obtained from uniform pricing. In the presence of a privacy constraint, this line is no longer parallel to the x-axis. Moreover, this indirect effect of privacy becomes more pronounced when considering K>2K>2 values, as this lower boundary of the set 𝒮\mathcal{S} is no longer a straight line but becomes piece-wise linear.

3.1 Illustration of the set 𝒮\mathcal{S}

As equation (14) in Theorem 1 suggests, there are two main regimes in characterizing the triangle 𝒮\mathcal{S}: tαt^{*}\geq\alpha^{*} and tαt^{*}\leq\alpha^{*}, where tt^{*} itself depends on η\eta and β\beta. We next discuss how the set 𝒮\mathcal{S} evolves with varying β\beta, for the two cases of αη\alpha^{*}\geq\eta and αη\alpha^{*}\leq\eta.

Figure 4 depicts 𝒮\mathcal{S} for the case αη\alpha^{*}\geq\eta. In particular, Figure 4(d) illustrates the non-private case where the triangle ABCABC coincides with the triangle TQRTQR that we introduced in the Introduction (see Figure 1). In the presence of privacy mechanism, triangle TQRTQR maps to triangle T~BR~\tilde{T}B\tilde{R} after going through the shifting and scaling operators, meaning that the difference between triangles ABCABC and T~BR~\tilde{T}B\tilde{R} is purely attributed to the indirect effect of privacy.

Figure 4(a) illustrates the case η1/2\eta\leq 1/2 which implies αηt\alpha^{*}\geq\eta\geq t^{*}. In this case, as β\beta decreases from 2η2\eta to 0, tt^{*} increases from 0 to η\eta which implies CC moving down from BB to R~\tilde{R}.

Figures 4(b) and 4(c) correspond to the case αη1/2\alpha^{*}\geq\eta\geq 1/2. Notice that η1/2\eta\geq 1/2 implies tηt^{*}\geq\eta, and hence in this case, we could have either αt\alpha^{*}\geq t^{*} or αt\alpha^{*}\leq t^{*}, depending on the value of β\beta. Let β~\tilde{\beta} be the value of β\beta for which t=αt^{*}=\alpha^{*}, i.e.,

ηβ~/21β~=α.\frac{\eta-\tilde{\beta}/2}{1-\tilde{\beta}}=\alpha^{*}. (16)

Figure 4(b) illustrates the case ββ~\beta\geq\tilde{\beta} which implies tαηt^{*}\geq\alpha^{*}\geq\eta. As β\beta decreases from 2(1η)2(1-\eta) to β~\tilde{\beta}, tt^{*} decreases from 11 to α\alpha^{*} which implies AA moving from BB down towards T~\tilde{T}. Finally, Figure 4(d) illustrates the case ββ~\beta\leq\tilde{\beta} which implies αtη\alpha^{*}\geq t^{*}\geq\eta. In this case, as β\beta decreases from β~\tilde{\beta} to 0, tt^{*} decreases from α\alpha^{*} to η\eta which implies CC moving up towards R~\tilde{R}.

Consumer utilityProducer utilityAA or T~\tilde{T}BBCCR~\tilde{R}
(a) η1/2\eta\leq 1/2
Consumer utilityProducer utilityAAT~\tilde{T}BBCCR~\tilde{R}
(b) η1/2\eta\geq 1/2 and ββ~\beta\geq\tilde{\beta}
Consumer utilityProducer utilityAA or T~\tilde{T}BBCCR~\tilde{R}
(c) η1/2\eta\geq 1/2 and ββ~\beta\leq\tilde{\beta}
Consumer utilityProducer utilityAA or TTBB or QQCC or RR
(d) Non-private case β=0\beta=0
Figure 4: Illustration of 𝒮\mathcal{S} for the case αη\alpha^{*}\geq\eta.

Similarly, Figure 5 illustrates the triangle ABCABC for the case αη\alpha^{*}\leq\eta and for different values of η\eta and β\beta. The primary distinction in this scenario is that, in the non-private case, the optimal uniform price is v1v_{1} instead of v2v_{2}.

In Figure 5(a), as β\beta decreases from 2η2\eta to β~\tilde{\beta}, tt^{*} increases from 0 to α\alpha^{*} which implies CC moving down towards R~\tilde{R}. In Figure 5(b), as β\beta decreases from β~\tilde{\beta} to 0, tt^{*} increases from α\alpha^{*} to η\eta which implies AA moving up towards T~\tilde{T}. In Figure 5(c), as β\beta decreases from 2(1η)2(1-\eta) to 0, tt^{*} decreases from 11 to η\eta and hence AA moves down from BB to T~\tilde{T}.

Consumer utilityProducer utilityAABBCCT~\tilde{T}R~\tilde{R}
(a) η1/2\eta\leq 1/2 and ββ~\beta\geq\tilde{\beta}
Consumer utilityProducer utilityAABBCC or R~\tilde{R}T~\tilde{T}
(b) η1/2\eta\leq 1/2 and ββ~\beta\leq\tilde{\beta}
Consumer utilityProducer utilityAABBC or R~\tilde{R}T~\tilde{T}
(c) η1/2\eta\geq 1/2
Consumer utilityProducer utilityAA or TTBB or QQCC or RR
(d) Non-private case β=0\beta=0
Figure 5: Illustration of 𝒮\mathcal{S} for the case αη\alpha^{*}\leq\eta.

3.2 Does imposing privacy hurt or help the producer and the consumers?

We next discuss that imposing privacy always hurts the producer but, interestingly, may hurt or help consumers. We do so by comparing the set 𝒮\mathcal{S} with the non-private case in Figures 4(d) and 5(d), studied in Bergemann et al. [2015].

Imposing privacy helps consumers to increase their minimum utility.

In the non-private case, the consumer’s utility can be as low as zero. However, in the private case, a minimum utility of βtα(v2v1)\beta t^{*}\alpha^{*}(v_{2}-v_{1}) is ensured for the consumer.

Imposing privacy hurts the producer by decreasing its minimum utility.

In the non-private case, the line TRTR represents the optimal uniform pricing, which constitutes the minimum utility for the producer. However, in the private case, the producer’s utility falls below this value for two reasons. One reason is that the triangle ABCABC always intersects with the line T~R~\tilde{T}\tilde{R}, and the producer’s utility at line T~R~\tilde{T}\tilde{R} is already (weakly) smaller than the utility corresponding to line TRTR. To see the latter, observe that the producer’s utility corresponding to line T~R~\tilde{T}\tilde{R} is given by

β(tv1+(1t)αv2)+(1β)( producer utility at line TR).\beta\Big{(}t^{*}v_{1}+(1-t^{*})\alpha^{*}v_{2}\Big{)}+(1-\beta)\Big{(}\text{ producer utility at line }TR\Big{)}. (17)

Notice that this term is a convex combination of two components: the first component is an average of αv2\alpha^{*}v_{2} and v1v_{1}, and the second component is the maximum of these two terms. Hence, the utility expressed in the equation above is less than the maximum of αv2\alpha^{*}v_{2} and v1v_{1}, i.e., the producer utility at line TRTR.

Moreover, as demonstrated by Figures 4 and 5, when αη12\alpha^{*}\geq\eta\geq\frac{1}{2} or αη12\alpha^{*}\leq\eta\leq\frac{1}{2}, certain segmentations may yield a producer’s utility even lower than that at the line T~R~\tilde{T}\tilde{R}. In the next section, when we explore the case with KK values, we precisely characterize when this phenomenon occurs in the general case.

Imposing privacy hurts the producer by decreasing its maximum utility.

In the non-private case, the maximum utility corresponds to first-degree price discrimination (i.e., point QQ) is given by αv2+(1α)v1\alpha^{*}v_{2}+(1-\alpha^{*})v_{1}. In the private case, the maximum producer utility aligns with point BB in all cases and is equal to

β(tv1+(1t)αv2)+(1β)(αv2+(1α)v1),\beta\Big{(}t^{*}v_{1}+(1-t^{*})\alpha^{*}v_{2}\Big{)}+(1-\beta)\Big{(}\alpha^{*}v_{2}+(1-\alpha^{*})v_{1}\Big{)}, (18)

which is lower than in the non-private case.

Imposing privacy can hurt consumers by decreasing their maximum utility.

For consumers, the maximum utility could either increase or decrease. In the non-private case, the highest utility the consumer can achieve is at point RR. This could approach zero as α\alpha^{*} nears one. However, as mentioned earlier, the consumers’ utility in the private case has a minimum of βtα(v2v1)\beta t^{*}\alpha^{*}(v_{2}-v_{1}), which does not diminish to zero as α\alpha^{*} increases to one. Thus, the maximum possible utility for consumers could be higher in the private case. To illustrate a potential decrease, consider η1/2α\eta\leq 1/2\leq\alpha^{*}, where consumers’ maximum utility corresponds to point CC in Figure 4(a). As β\beta increases to 2η2\eta, CC converges to BB, and BB itself moves towards a point on the y-axis, leading to consumers’ utility approaching zero.

3.3 The indirect effect of privacy

Next, we shift our focus to the shape of 𝒮\mathcal{S} compared to the triangle T~BR~\tilde{T}B\tilde{R}. As we mentioned earlier, this comparison captures the indirect effect due to the change in the producer’s pricing strategy and illustrates how various segmentations can lead to different utilities for the producer and consumer. For example, as discussed before Assumption 2, when β\beta is sufficiently large, 𝒮\mathcal{S} reduces to a single point, indicating no advantage in price discrimination.

The case αη\alpha^{*}\geq\eta :

First, when η1/2\eta\leq 1/2, indicating that the difference between the high and low values is significant, the set 𝒮\mathcal{S} loses the ACR~AC\tilde{R} portion due to the indirect effect of privacy. In other words, the change in pricing strategy primarily eliminates segmentations beneficial to the consumer while retaining those favorable to the producer. To see why this occurs, notice that the consumer gains utility when their value for the product is high (i.e., v2v_{2}), but the market is priced low (i.e., v1v_{1}). However, with a small η\eta, the threshold tt^{*} decreases, implying that the high value’s gain is substantial enough for the producer to risk setting the market price at v2v_{2}. This shift adversely affects the consumers’ utility.

Second, when η1/2\eta\geq 1/2, as discussed earlier, the threshold tt^{*} becomes larger than η\eta. In other words, the difference between the high and low values is not significant, and hence, the producer is more inclined to use the lower price v1v_{1}. This effect intensifies when β\beta is very large, as tt^{*} then exceeds both α\alpha^{*} and η\eta. It implies that even the aggregated market, priced at v2v_{2} in the non-private case, is now priced at v1v_{1} with a probability of (1β)+βt(1-\beta)+\beta t^{*} (illustrated by point CC in Figure 4(b)). Roughly speaking, in this scenario, a segment is more likely to be priced at v1v_{1}, unless it has a very high proportion of consumers with value v2v_{2}, which corresponds to point BB and nearby points where first-degree price discrimination occurs with two completely separate segments. This is why, in this case, 𝒮\mathcal{S} predominantly encompasses the area near the line BCBC and loses out on the area around AA, which corresponds to segments of a more mixed nature priced at v2v_{2}.

Finally, when η1/2\eta\geq 1/2 but β\beta is small, the previously described situation mitigates, and hence the whole triangle ABR~AB\tilde{R} is recovered. However, the producer still remains slightly inclined to price markets at v1v_{1}, which is why points like CC fall below the uniform pricing line AR~A\tilde{R}. However, as β\beta decreases, CC moves towards R~\tilde{R}, indicating that Figure 4(c) gradually aligns with Figure 4(d).

The case αη\alpha^{*}\leq\eta :

With η1/2\eta\geq 1/2, as illustrated in Figure 5(c), we observe that the triangle ACT~AC\tilde{T} is eliminated due to the privacy mechanism. This leads to two intuitions: first, the privacy mechanism benefits the consumer by eliminating segmentations that yield low utility for them. Second, it does not significantly affect segmentations that distinctly separate high and low value consumers. The reason for these phenomena lies in the fact that η1/2\eta\geq 1/2 suggests a small difference between high and low values, prompting the producer to favor pricing the market at v1v_{1} over v2v_{2}. This inclination minimally impacts segmentations that effectively distinguish between high and low value consumers (represented by point BB and its neighboring points). Additionally, incorrectly setting the price to v1v_{1} increases the consumer’s utility.

With η1/2\eta\leq 1/2, the producer is more inclined to set the market price at v2v_{2}. However, the assumption αη\alpha^{*}\leq\eta indicates a scarcity of high-value customers, making v1v_{1} the optimal uniform price. This is why in Figures 5(a) and 5(b), we observe points below the uniform pricing line T~R~\tilde{T}\tilde{R}. This effect is particularly intensified when β\beta is high, i.e., Figure 5(a), as the threshold for setting the price high (i.e., tt^{*}) approaches zero. This adversely impacts consumers since consumers gain utility only when they have a high value and the marker is priced at the low value v1v_{1}.

3.4 Varying the privacy parameter

So far, we have shown that implementing the privacy mechanism reduces both the minimum and maximum possible utility of the producer, in contrast to the non-private case, while increasing the minimum utility for the consumer. A natural question arises: do these changes amplify as the privacy parameter increases? For instance, does an increase in the privacy parameter β\beta lead to a further decrease in the producer’s maximum utility and an increase in the minimum consumer utility? Interestingly, this is not always the case! In other words, while the privacy constraint always reduces the maximum producer utility, more privacy does not necessarily equate to a more significant reduction in this utility. Similarly, although the privacy mechanism ensures a non-zero minimum utility for the consumer, this minimum does not always increase with a higher privacy parameter. We next formalize this observation.

The impact of increasing the privacy parameter on the producer utility:

Let us start with the maximum possible producer utility. Notice that the maximum producer utility corresponds with the point BB in Figures 4 and 5.

Lemma 2.

The maximum producer utility across all segmentations is a decreasing function of β\beta over the interval [0,min{2η,2(1η)}][0,\min\{2\eta,2(1-\eta)\}] if and only if α,η1/2\alpha^{*},\eta\geq 1/2 or α,η1/2\alpha^{*},\eta\leq 1/2.

We defer the proof to the appendix. This result suggests that when α1/2η\alpha^{*}\geq 1/2\geq\eta or when α1/2η\alpha^{*}\leq 1/2\leq\eta, we could see an increase in the maximum producer utility by increasing the privacy parameter. Next, we provide an intuition for this result and explain why increasing the privacy parameter might actually lead to an increase in the producer’s utility in these cases.

The maximum utility for a producer aligns with first-degree price discrimination, where consumers with identical values are grouped into a single segment. With privacy implemented, each segment is subject to potential alteration with a certain probability, possibly resulting in suboptimal pricing by the producer. As β\beta increases, the likelihood of such occurrences also rises.

Consider the scenario where η\eta is close to one, i.e., the high and low values are relatively close. In this case, the cost of mistakenly pricing a segment of high-value customers at a lower price is not that severe. However, the suboptimal pricing of a segment of low-value consumers at a higher price can be considerably costly, as the revenue will be completely lost. This especially intensifies when the aggregated market is predominantly composed of low-value consumers, i.e., when α\alpha^{*} is small.

This scenario, where η\eta is large and α\alpha^{*} is small, is one of the two cases where the nonmonotonicity occurs. In this context, mistakingly offering a low price to high-value consumers is not detrimental. However, mistakingly offering a high price to low-value consumers can significantly reduce the producer’s utility. Why might increasing privacy potentially enhance the producer’s maximum utility in this scenario? Because higher β\beta induces a more conservative pricing strategy, making the producer more inclined to opt for the lower price. Specifically, for η>1/2\eta>1/2, the pricing threshold tt^{*} shifts from η\eta towards 11 as β\beta increases, implying that producers are less likely to select the higher price as the privacy parameter grows. Consequently, the risk of mistakingly offering a low price to high-value consumers diminishes, potentially boosting the producer’s maximum utility.

The other instance of nonmonotonicity arises when η\eta is small and α\alpha^{*} is large. In this scenario, there is a substantial difference between high and low values. In such cases, mistakenly offering a low price to high-value consumers can be particularly detrimental, leading to a significant reduction in maximum producer utility as β\beta increases. Consequently, with the increase of β\beta, and as the observed market becomes less informative, the producer becomes more inclined to take the risk of selecting the higher price. In such circumstances, and when the market is mainly composed of high-value customers, this riskier pricing strategy might lead to an increase in producer utility.

The impact of increasing the privacy parameter on the consumer utility:

Lemma 3.

The minimum consumer utility across all segmentations is an increasing function of β\beta over the interval the interval [0,min{2η,2(1η)}][0,\min\{2\eta,2(1-\eta)\}] if and only if η1/2\eta\geq 1/2.

This result suggests that, for any η<1/2\eta<1/2, the minimum consumer utility exhibits nonmonotonic behavior as β\beta varies within the interval [0,2η][0,2\eta]. For producer utility, as we noted, an increase in β\beta when η\eta is small leads to a preference for the higher price. Now, it is important to note that the consumer utility is non-zero only when their value is higher than the chosen price, an occurrence that becomes less frequent if the producer is more willing to select the higher price. Consequently, an increase in the privacy parameter could lead to a decrease in the minimum consumer utility in such scenarios.

4 Price discrimination limits under privacy: the general case

Here, we turn our focus to the general case with K>2K>2 values. As we establish, once we consider the general case, the fundamental difference between our characterization and that of Bergemann et al. [2015] without privacy becomes more apparent. In particular, the set of possible consumer and producer utilities is no longer a triangle and instead is a more nuanced polytope that we will characterize. However, we establish that many of the insights that we derived in the special case of K=2K=2 continue to hold in the general setting as well.

We make use of the following notation that enables driving the analogue of Lemma 1 in the general case. For any i[K]i\in[K], let 𝒳iβ𝒳\mathcal{X}_{i}^{\beta}\subseteq\mathcal{X} be the set of (observed) markets for which viv_{i} is an optimal price, i.e.,

𝒳iβ={x^|𝒰p(vi|x^)𝒰p(vj|x^) for any j[K]},\mathcal{X}_{i}^{\beta}=\Big{\{}\hat{x}~{}\Big{|}~{}\mathcal{U}_{p}(v_{i}|\hat{x})\geq\mathcal{U}_{p}(v_{j}|\hat{x})\text{ for any }j\in[K]\Big{\}}, (19)

where 𝒰p(|x^)\mathcal{U}_{p}(\cdot|\hat{x}) is defined in (6). Going back to the case K=2K=2, 𝒳1β\mathcal{X}_{1}^{\beta} and 𝒳2β\mathcal{X}_{2}^{\beta} corresponds to α[0,t]\alpha\in[0,t^{*}] and α[t,1]\alpha\in[t^{*},1], respectively. The following result establishes the necessary and sufficient condition for 𝒳kβ\mathcal{X}_{k}^{\beta} to be non-empty.

Lemma 4.

For any i[K]i\in[K], the set 𝒳iβ\mathcal{X}_{i}^{\beta} is non-empty if and only if ββ¯i\beta\leq\bar{\beta}_{i}, where β¯i\bar{\beta}_{i} is the unique solution of the following equation 444[z]+[z]_{+} denotes max{z,0}\max\{z,0\}.:

β¯i1β¯i:=maxjK(vi𝟙(j<i)vj)[(K+1j)vj(K+1i)vi]+.\frac{\bar{\beta}_{i}}{1-\bar{\beta}_{i}}:=\max_{j}\frac{K\Big{(}v_{i}-\mathbbm{1}(j<i)v_{j}\Big{)}}{\Big{[}(K+1-j)v_{j}-(K+1-i)v_{i}\Big{]}_{+}}. (20)

Note that ββ¯i\beta\geq\bar{\beta}_{i} implies that price viv_{i} is not optimal for any observed market, thereby precluding the producer from choosing viv_{i}. To better understand this, recall that the expected utility of the producer, given an observed market x^\hat{x} and upon choosing price viv_{i}, denoted as 𝒰p(vi|x^)\mathcal{U}_{p}(v_{i}|\hat{x}), is given by

𝒰p(vi|x^)=(1β)𝒰p(vi,x^)+β𝔼xUnif(𝒳)[𝒰p(vi,x)].\mathcal{U}_{p}(v_{i}|\hat{x})=(1-\beta)~{}\mathcal{U}_{p}(v_{i},\hat{x})+\beta~{}\mathbb{E}_{x\sim\text{Unif}(\mathcal{X})}\left[\mathcal{U}_{p}(v_{i},x)\right]. (21)

Essentially, the expected utility of the producer can be viewed as a convex combination of their utility in the fully observed market scenario (analogous to the non-private case) and their utility under complete uncertainty, where they rely on a uniform prior over markets. As β\beta increases, indicating a noisier observed market, the weight the producer places on their utility from a uniform prior grows. Consequently, they may opt not to select certain prices that would result in lower utility when the market is drawn from a uniform distribution.

Specifically, for the case when K=2K=2, one can verify β¯1=2(1η)\bar{\beta}_{1}=2(1-\eta) and β¯2=2η\bar{\beta}_{2}=2\eta. This is the rationale behind imposing Assumption 2 in the previous section, which guarantees that in the scenario where K=2K=2, both prices are viable options. If this were not the case, only one pricing option would remain, causing the set 𝒮\mathcal{S} to reduce to a single point.

Our next result extends Proposition 3 to this general case.

Proposition 4.

The set 𝒮\mathcal{S} can be represented as

S=β𝒄+(1β)𝒮,S=\beta\bm{c}+(1-\beta)\mathcal{S}^{\prime}, (22)

where 𝐜2\bm{c}\in\mathbb{R}^{2} is a constant vector, given by

𝒄=[k=1K(𝒳kβ)𝒰c(vk,x),k=1K(𝒳kβ)𝒰p(vk,x)],\bm{c}=\left[\sum_{k=1}^{K}\mathbb{P}(\mathcal{X}_{k}^{\beta})~{}\mathcal{U}_{c}(v_{k},x^{*}),\sum_{k=1}^{K}\mathbb{P}(\mathcal{X}_{k}^{\beta})~{}\mathcal{U}_{p}(v_{k},x^{*})\right]^{\top}, (23)

where ()\mathbb{P}(\cdot) denotes the uniform distribution over 𝒳\mathcal{X}, and 𝒰c(,)\mathcal{U}_{c}(\cdot,\cdot) and 𝒰p(,)\mathcal{U}_{p}(\cdot,\cdot) are as defined in (4). Moreover, the set 𝒮\mathcal{S}^{\prime} is given by

𝒮={[k=1Kγk𝒰c(vk,xk),k=1Kγk𝒰p(vk,xk)]|xk𝒳kβ,γk[0,1],k=1Kγkxk=x,k=1Kγk=1}.\displaystyle\mathcal{S}^{\prime}=\left\{\Big{[}\sum_{k=1}^{K}\gamma_{k}~{}\mathcal{U}_{c}(v_{k},x_{k}),\sum_{k=1}^{K}\gamma_{k}~{}\mathcal{U}_{p}(v_{k},x_{k})\Big{]}^{\top}~{}\Big{|}~{}x_{k}\in\mathcal{X}_{k}^{\beta},\gamma_{k}\in[0,1],\sum_{k=1}^{K}\gamma_{k}x_{k}=x^{*},\sum_{k=1}^{K}\gamma_{k}=1\right\}. (24)

This result shows that, in the general case, and similar to the case when K=2K=2, the set 𝒮\mathcal{S}, which depicts the limits of price discrimination under privacy mechanisms, is influenced by three factors: a constant shift β𝒄\beta\bm{c}, a scaling factor 1β1-\beta, and the set 𝒮\mathcal{S}^{\prime} that determines its shape. Consequently, the insights derived for K=2K=2 based on this characterization extend to the general case. In particular, the shift β𝒄\beta\bm{c} indicates that, in contrast to the non-private case, the consumers’ minimum utility is not zero. Moreover, the scaling factor 1β1-\beta suggests that market segmentation generally becomes less impactful as the privacy factor β\beta increases.

Our primary goal now is to specify the set 𝒮\mathcal{S}^{\prime}. As we discussed in the case of K=2K=2, the set 𝒮\mathcal{S}^{\prime} captures the indirect impact of privacy, as it illustrates the effects of the change in the producer’s pricing strategy, given that they know the privacy mechanism is applied. More specifically, 𝒮\mathcal{S}^{\prime} is the set of potential pairs of utilities for the consumer and producer when the producer accurately perceives the segmented markets (as in the non-private case) but adopts the pricing rule from the private case (associated with sets {𝒳kβ}k\{\mathcal{X}_{k}^{\beta}\}_{k}) rather than the optimal pricing rule for the non-private scenario. In fact, in the non-private case with β=0\beta=0, this set aligns with 𝒮\mathcal{S}.

Proposition 4 also suggests that our analysis can be limited to segmentations where, for any value vkv_{k}, there is at most one segment whose optimal corresponding price is vkv_{k}. This effectively broadens the insights of Proposition 3 from the case of K=2K=2 and simplifies the characterization of 𝒮\mathcal{S}^{\prime}. As we stated in the previous section, the set 𝒮\mathcal{S}^{\prime} is a triangle for K=2K=2. In general, we next establish that it is a convex polygon.

Theorem 2.

Let 𝒫\mathcal{P} denote a convex polytope in K2\mathbb{R}^{K^{2}}, represented by the variables {z(i,j)}i,j=1K\{z(i,j)\}_{i,j=1}^{K} and the following equations:

z(i,1)z(i,K)0 for all i[K],\displaystyle z(i,1)\geq\cdots\geq z(i,K)\geq 0\text{ for all }i\in[K], (25a)
z(i,i)viz(i,j)vjβK(1β)z(i,1)((K+1j)vj(K+1i)vi) for all i,j[K],\displaystyle z(i,i)v_{i}-z(i,j)v_{j}\geq\frac{\beta}{K(1-\beta)}z(i,1)\Big{(}(K+1-j)v_{j}-(K+1-i)v_{i}\Big{)}\text{ for all }i,j\in[K], (25b)
i=1Kz(i,j)=k=jKx(vk) for all j[K].\displaystyle\sum_{i=1}^{K}z(i,j)=\sum_{k=j}^{K}x^{*}(v_{k})\text{ for all }j\in[K]. (25c)

Then, the set 𝒮\mathcal{S}^{\prime}, given in (24), can be cast as a linear transformation of 𝒫\mathcal{P} from K2\mathbb{R}^{K^{2}} to 2\mathbb{R}^{2}, given by

𝒮={i=1K1j=i+1K(vjvj1)z(i,j),i=1Kviz(i,i)}.\mathcal{S}^{\prime}=\left\{\sum_{i=1}^{K-1}\sum_{j=i+1}^{K}(v_{j}-v_{j-1})z(i,j),~{}\sum_{i=1}^{K}v_{i}z(i,i)\right\}. (26)

Proof sketch: To understand how this result is established, recall the definition of 𝒮\mathcal{S}^{\prime}, which involves KK markets x1,,xKx_{1},\cdots,x_{K}, with each market xi𝒳iβx_{i}\in\mathcal{X}_{i}^{\beta}. This implies that viv_{i} is an optimal price for xix_{i}. Now, let y(i,)y(i,\cdot) represent the complementary cumulative distribution function of market xix_{i}, defined as:

y(i,j):=k=jKxi(vk).y(i,j):=\sum_{k=j}^{K}x_{i}(v_{k}).

It is evident that:

1=y(i,1)y(i,K)0 for all i[K].1=y(i,1)\geq\cdots\geq y(i,K)\geq 0\text{ for all }i\in[K].

Furthermore, we verify that viv_{i} being the optimal price for xix_{i} necessitates:

y(i,i)viy(i,j)vjβK(1β)((K+1j)vj(K+1i)vi) for all i,j[K].y(i,i)v_{i}-y(i,j)v_{j}\geq\frac{\beta}{K(1-\beta)}\Big{(}(K+1-j)v_{j}-(K+1-i)v_{i}\Big{)}\text{ for all }i,j\in[K]. (27)

Therefore, conditions (25a) and (25b) are satisfied if we substitute z(,)z(\cdot,\cdot) with y(,)y(\cdot,\cdot). Additionally, the aggregated market imposes the condition k=1Kγkxk=x\sum_{k=1}^{K}\gamma_{k}x_{k}=x^{*}. Using y(,)y(\cdot,\cdot), this condition can be represented as:

i=1Kγiy(i,j)=k=jKx(vk) for all j[K].\sum_{i=1}^{K}\gamma_{i}y(i,j)=\sum_{k=j}^{K}x^{*}(v_{k})\text{ for all }j\in[K]. (28)

By defining z(i,j)=γiy(i,j)z(i,j)=\gamma_{i}y(i,j), we immediately see that (25c) is valid. The other two conditions, (25a) and (25b), are also satisfied since all y(i,)y(i,\cdot) terms are multiplied by the same variable γi\gamma_{i}. Therefore (25) provide a representation for the space of segmentations in the definition 𝒮\mathcal{S}^{\prime}, i.e., (γk)k=1K(\gamma_{k})_{k=1}^{K} and (xk)k=1K(x_{k})_{k=1}^{K}, in form of a convex polytope. It remains to show that the linear mapping (26) is equal to the consumer and producer utilities. We defer this part to the appendix. \square

We should highlight that the proof outlined above provides a construction for a segmentation to achieve any feasible point for the consumer and producer utilities.

Before proceeding, let us highlight that in the special of β=0\beta=0, the polytope characterized in Theorem 2 simplifies to a triangle as characterized in Bergemann et al. [2015]. However, in our setting, with privacy, it takes a more nuanced form. Let us present an example illustrating some potential shapes of 𝒮\mathcal{S}^{\prime}. This example will serve as a basis to introduce and motivate several results concerning 𝒮\mathcal{S}^{\prime} and, consequently, the set 𝒮\mathcal{S}.

Example 1.

Consider the case K=5K=5 with values [vk]k=15=[0.8,2,3,4.2,5][v_{k}]_{k=1}^{5}=[0.8,2,3,4.2,5] and β=0.3\beta=0.3. Figure 6 illustrates 𝒮\mathcal{S^{\prime}} for three distinct aggregated market values xx^{*}. Specifically, the aggregated markets associated with Figures 6(a), 6(b), and 6(c) are (0.2,0.1,0.4,0.2,0.1)(0.2,0.1,0.4,0.2,0.1), (0.2,0.3,0.2,0.2,0.1)(0.2,0.3,0.2,0.2,0.1), and (0.2,0.1,0.1,0.05,0.55)(0.2,0.1,0.1,0.05,0.55), respectively. The dashed triangle TQRTQR in the figures represents the non-private case, in which, as previously stated, the set 𝒮\mathcal{S}^{\prime} coincides with the set 𝒮\mathcal{S}.

Refer to caption
(a) x=(0.2,0.1,0.4,0.2,0.1)x^{*}=(0.2,0.1,0.4,0.2,0.1)
Refer to caption
(b) x=(0.2,0.3,0.2,0.2,0.1)x^{*}=(0.2,0.3,0.2,0.2,0.1)
Refer to caption
(c) x=(0.2,0.1,0.1,0.05,0.55)x^{*}=(0.2,0.1,0.1,0.05,0.55)
Figure 6: Illustration of 𝒮\mathcal{S^{\prime}} with values [vk]k=15=[0.8,2,3,4.2,5][v_{k}]_{k=1}^{5}=[0.8,2,3,4.2,5], β=0.3\beta=0.3, and three different aggregated markets.

Notice that the point QQ in Figure 6 corresponds to the first-degree price discrimination and is given by

(0,k=1Kx(vk)vk).\left(0,\sum_{k=1}^{K}x^{*}(v_{k})v_{k}\right).

We next list a number of results regarding 𝒮\mathcal{S} and 𝒮\mathcal{S}^{\prime}. The first one is a straightforward consequence of Theorem 2:

Corollary 1.

The set 𝒮\mathcal{S}^{\prime} forms a polygon and is situated to the right of the y-axis and to the left of the line passing through points QQ and RR, which represents the total surplus equal to the maximum possible surplus, i.e., k=1Kx(vk)vk\sum_{k=1}^{K}x^{*}(v_{k})v_{k}.

The figures in Figure 6 suggest that the set 𝒮\mathcal{S}^{\prime} includes the point QQ. This observation holds true when β\beta is sufficiently small, ensuring that all values within the support of xx^{*} remain viable as optimal prices. In such scenarios, a segmentation that groups all consumers with the same value into the same segment can attain point QQ. However, if β\beta is so large that, for some kk, the value vkv_{k} is never selected by the producer while x(vk)>0x^{*}(v_{k})>0, the producer cannot achieve the maximum utility as the consumers with value vkv_{k} will never be priced at vkv_{k}. We formalize this observation in the following corollary:

Corollary 2.

The set 𝒮\mathcal{S}^{\prime} includes the point QQ if and only if

βmink:x(vk)>0β¯k.\beta\leq\min_{k:x^{*}(v_{k})>0}\bar{\beta}_{k}.

Returning to Example 1, we calculate that (β¯k)k=15(\bar{\beta}_{k})_{k=1}^{5} equals (0.44,0.91,1,0.91,0.54)(0.44,0.91,1,0.91,0.54). Therefore, with β=0.5\beta=0.5, for instance, one of the values (v1v_{1}) becomes infeasible as optimal market prices. Figure 7 displays 𝒮\mathcal{S}^{\prime} using the parameters from Example 1, but with β=0.5\beta=0.5 instead of β=0.3\beta=0.3. As anticipated, the point QQ is no longer included in the set 𝒮\mathcal{S}^{\prime}.

Refer to caption
(a) x=(0.2,0.1,0.4,0.2,0.1)x^{*}=(0.2,0.1,0.4,0.2,0.1)
Refer to caption
(b) x=(0.2,0.3,0.2,0.2,0.1)x^{*}=(0.2,0.3,0.2,0.2,0.1)
Refer to caption
(c) x=(0.2,0.1,0.1,0.05,0.55)x^{*}=(0.2,0.1,0.1,0.05,0.55)
Figure 7: Illustration of 𝒮\mathcal{S^{\prime}} with values [vk]k=15=[0.8,2,3,4.2,5][v_{k}]_{k=1}^{5}=[0.8,2,3,4.2,5], β=0.5\beta=0.5, and three different aggregated markets.

Now, let us consider what these two facts imply for the set 𝒮\mathcal{S}, i.e., the constraints on price discrimination under privacy considerations.

Fact 1.

The maximum possible utility of the producer from market segmentation is always weakly smaller when a privacy mechanism is applied compared to the non-private case.

To understand this, note that the maximum utility of the producer in the non-private case corresponds to point QQ, which is expressed by

k=1Kx(vk)vk.\sum_{k=1}^{K}x^{*}(v_{k})v_{k}. (29)

However, in the private case, as Proposition 4 demonstrates, the utility is bounded by

β(maxvk𝒰p(vk,x))+(1β)(maximum producer utility in 𝒮).\beta\left(\max_{v_{k}}~{}\mathcal{U}_{p}(v_{k},x^{*})\right)+(1-\beta)\left(\text{maximum producer utility in }\mathcal{S}^{\prime}\right). (30)

This term is a convex combination of two components, where the second component is at most equal to (29) (when QQ is included in 𝒮\mathcal{S}^{\prime}), and the first component corresponds to the optimal uniform pricing (line TRTR) and is weakly smaller than (29). This confirms the reduction in the maximum producer utility under privacy. Furthermore, when β\beta is sufficiently large such that QQ is not included in 𝒮\mathcal{S}^{\prime}, the second component in (30) also becomes strictly smaller than (29).

Now, let us focus on the impact of the privacy mechanism on the minimum producer utility. In the non-private case, the minimum producer’s utility is represented by the line TRTR. To compare this with the private case, we need to examine the lower boundary of 𝒮\mathcal{S}^{\prime}. Observing Figures 6 and 7, it appears that 𝒮\mathcal{S}^{\prime} consistently intersects with line TRTR and sometimes even crosses this line. We formalize this observation in the following result:

Proposition 5.

Suppose β>0\beta>0 and x(vk)>0x^{*}(v_{k})>0 for all k[K]k\in[K]. The lowest point of the set 𝒮\mathcal{S}^{\prime} lies on or beneath the line that represents optimal uniform pricing in the non-private case, i.e., line TRTR. Moreover, it is below this line if and only if the optimal pricing for the non-private case and the fully private case differ, i.e., there exists ii such that

(K+1i)Kvi>(K+1j)Kvj for all jargmaxk[K]𝒰p(vk,x).\frac{(K+1-i)}{K}v_{i}>\frac{(K+1-j)}{K}v_{j}\quad\text{ for all }\,j\in\operatorname*{arg\,max}_{k\in[K]}\mathcal{U}_{p}(v_{k},x^{*}). (31)

We defer the proof of this proposition to the appendix. Revisiting Example 1, we find that while the condition (31) is not met for xx^{*} corresponding to Figure 6(a), it is fulfilled for the aggregated markets associated with Figures 6(b) and 6(c). Consequently, the set 𝒮\mathcal{S}^{\prime} intersects with line TRTR in the former case without crossing it, whereas in the latter two cases, it extends below this line.

For the case K=2K=2, the condition (31) translates into either αη1/2\alpha^{*}\geq\eta\geq 1/2 or αη1/2\alpha^{*}\leq\eta\leq 1/2. The former condition, αη1/2\alpha^{*}\geq\eta\geq 1/2, corresponds to Figures 4(b) and 4(c). It is important to note that these figures depict the set 𝒮\mathcal{S}, which is derived from the set 𝒮\mathcal{S}^{\prime} after applying the scaling and shift described in Proposition 4. Specifically, line TRTR in 𝒮\mathcal{S}^{\prime} corresponds to line T~R~\tilde{T}\tilde{R} in these figures. Consequently, the lowest point of 𝒮\mathcal{S}^{\prime} maps to point CC that is located below the line T~R~\tilde{T}\tilde{R} in both cases. In contrast, for Figure 4(a), where the condition (31) is not met, the lowest point of 𝒮\mathcal{S}^{\prime} maps to point AA, which lies precisely on the line T~R~\tilde{T}\tilde{R}. Similarly, for the case αη1/2\alpha^{*}\leq\eta\leq 1/2, as illustrated in Figures 5(a) and 5(b), the lowest point of 𝒮\mathcal{S}^{\prime} maps to point AA which is situated below the line T~R~\tilde{T}\tilde{R}.

It is worth emphasizing that condition (31) can indicate the ambiguity that a privacy mechanism introduces into the producer’s pricing strategy. Note that when this condition is not met, it implies that the optimal pricing strategy remains constant, regardless of the privacy level. This makes the privacy mechanism less detrimental to the producer’s utility. Conversely, when this condition is satisfied, it indicates that the optimal strategy varies with the level of privacy, thereby diminishing the producer’s utility due to suboptimal pricing decisions.

Corollary 3.

The minimum utility of the producer across all segmentations is always weakly smaller when a privacy mechanism is applied compared to the non-private case.

To see why this result holds, notice that, according to Proposition 4, the minimum utility of the producer is given by

β(k=1K(𝒳kβ)𝒰p(vk,x))+(1β)(minimum producer utility in 𝒮),\beta\left(\sum_{k=1}^{K}\mathbb{P}(\mathcal{X}_{k}^{\beta})~{}\mathcal{U}_{p}(v_{k},x^{*})\right)+(1-\beta)\left(\text{minimum producer utility in }\mathcal{S}^{\prime}\right), (32)

which is a convex combination of two terms. The first one is upper bounded by maxvk𝒰p(vk,x)\max_{v_{k}}~{}\mathcal{U}_{p}(v_{k},x^{*}), i.e., the minimum utility of the producer in the non-private case (corresponding to line TRTR). Proposition 5 implies that the minimum producer utility in 𝒮\mathcal{S}^{\prime} is (weakly) smaller than this term which establishes Corollary 3.

We now shift our attention to the consumer’s utility. As previously discussed following Proposition 4, the privacy mechanism guarantees a non-zero minimum utility for the consumer, a stark contrast to the non-private case where the consumer’s minimum utility can be zero. In the following result, we explore another phenomenon that contributes to consumer’s minimum utility.

Fact 2.

The set 𝒮\mathcal{S}^{\prime} is distanced from the y-axis by a minimum of x(vK)(vKvK1)x^{*}(v_{K})(v_{K}-v_{K-1}) when β>β¯K\beta>\bar{\beta}_{K}.

The main point to note is that for sufficiently large values of β\beta, the set 𝒮\mathcal{S}^{\prime} has a positive distance from the y-axis. This results in a guaranteed additional positive minimum utility term for the consumer within the set 𝒮\mathcal{S}. An illustration of this phenomenon is provided in Figure 8, which displays the set 𝒮\mathcal{S}^{\prime} for the values specified in Example 1, but with β=0.6\beta=0.6. As stated earlier, in Example 1, β¯5\bar{\beta}_{5} equals 0.540.54.

Refer to caption
(a) x=(0.2,0.1,0.4,0.2,0.1)x^{*}=(0.2,0.1,0.4,0.2,0.1)
Refer to caption
(b) x=(0.2,0.3,0.2,0.2,0.1)x^{*}=(0.2,0.3,0.2,0.2,0.1)
Refer to caption
(c) x=(0.2,0.1,0.1,0.05,0.55)x^{*}=(0.2,0.1,0.1,0.05,0.55)
Figure 8: Illustration of 𝒮\mathcal{S^{\prime}} with values [vk]k=15=[0.8,2,3,4.2,5][v_{k}]_{k=1}^{5}=[0.8,2,3,4.2,5], β=0.6\beta=0.6, and three different aggregated markets.

Finally, as we discussed following Theorem 1 through examples with K=2K=2, the maximum utility for consumers, in general, could either increase or decrease under the privacy mechanism. The following result better highlights why both cases are possible.

Proposition 6.

Suppose β<minkβ¯k\beta<\min_{k}\bar{\beta}_{k}.

  1. (i)

    Assume v1v_{1} is the optimal uniform price in the non-private case, and the values satisfy the condition:

    vi+1KiK+1ivi for all i[K1].v_{i+1}\geq\frac{K-i}{K+1-i}v_{i}\text{ for all }i\in[K-1]. (33)

    Then, the maximum utility for the consumer is a decreasing function of β\beta.

  2. (ii)

    Assume vKv_{K} is the optimal uniform price in the non-private case, and the values satisfy the condition:

    vi+1KiK+1ivi for all i[K1].v_{i+1}\leq\frac{K-i}{K+1-i}v_{i}\text{ for all }i\in[K-1]. (34)

    Then, there exists α~\tilde{\alpha} and β~>0\tilde{\beta}>0 such that the maximum utility for the consumer is an increasing function over the interval [0,β~][0,\tilde{\beta}] when x(vK)α~x^{*}(v_{K})\geq\tilde{\alpha}.

The proof is provided in the appendix. In particular, for the case when K=2K=2, the conditions outlined in part (i), which lead to a decrease in maximum consumer utility under privacy, translate to αη1/2\alpha^{*}\leq\eta\leq 1/2. On the other hand, the conditions of part (ii), which implies an increase in maximum consumer utility in the presence of the privacy mechanism, correspond to αη1/2\alpha^{*}\geq\eta\geq 1/2 (also, as established in the proof, we can set α~=12η\tilde{\alpha}=\frac{1}{2-\eta} in this case).

But what differentiates these two cases? Why does privacy help consumer utility in one scenario while hurting it in another? The rationale here is similar to the explanations provided in Section 3.4. The condition (33) indicates that under a uniform market prior, the highest price vKv_{K} leads to the maximum expected utility for the producer, while the lowest price v1v_{1} yields the minimum. Consequently, when privacy introduces ambiguity, the producer has more incentive to prefer higher prices. However, the assumption that v1v_{1} is the optimal uniform price suggests that the market is mainly composed of low-value customers. Therefore, a tendency to set higher prices due to privacy considerations adversely affects consumer utility. In other words, privacy prompts the producer to risk setting higher prices, resulting in diminished consumer utility.

Conversely, the condition (34) means that under a uniform market prior, the lowest price v1v_{1} leads to the highest expected utility for the producer, while the highest price vKv_{K} offers the least. Thus, with privacy in play, the producer is likely to adopt a more conservative stance and favor lower prices. That said, given that vKv_{K} is the optimal uniform price and x(vK)α~x^{*}(v_{K})\geq\tilde{\alpha}, we have a market with a majority of high-value customers who benefit more when the market price is set lower. In summary, applying the privacy mechanism makes the producer to take a more conservative pricing approach, resulting in more consumers purchasing the product at a lower price than their value.

5 Conclusion

This paper explores the intersection of price discrimination and consumer privacy. Building upon the framework of Bergemann et al. [2015], we first introduce a privacy mechanism that adds a layer of uncertainty regarding consumer values, thereby directly impacting both consumer and producer utilities in a market setting. We use a novel analysis based on a merging technique to characterize the set of all pairs of consumer and producer utilities and establish that, unlike the existing results, it is not necessarily a triangle. Instead, it can be viewed as the linear mapping of a high-dimensional polytope into 2\mathbb{R}^{2}. We used our characterization to derive several insights. In particular, we prove that imposing a privacy constraint does not necessarily help the consumers and, in fact, can reduce their utility.

We view our paper as a first attempt to understand how user privacy impacts the limits of price discrimination. There are several exciting directions to explore in this vein. For instance, an interesting future direction is to extend our results to a setting with a multiproduct seller (similar to that of Haghpanah and Siegel [2022]).

6 Acknowledgments

Alireza Fallah acknowledges support from the European Research Council Synergy Program, the National Science Foundation under grant number DMS-1928930, and the Alfred P. Sloan Foundation under grant G-2021-16778. The latter two grants correspond to his residency at the Simons Laufer Mathematical Sciences Institute (formerly known as MSRI) in Berkeley, California, during the Fall 2023 semester. Michael Jordan acknowledges support from the Mathematical Data Science program of the Office of Naval Research under grant number N00014-21-1-2840 and the European Research Council Synergy Program.

References

  • Abowd [2018] J. M. Abowd. The US Census Bureau adopts differential privacy. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 2867–2867, 2018.
  • Acemoglu et al. [2022] D. Acemoglu, A. Makhdoumi, A. Malekian, and A. Ozdaglar. Too much data: Prices and inefficiencies in data markets. American Economic Journal: Microeconomics:Micro, 2022.
  • Acemoglu et al. [2023a] D. Acemoglu, A. Fallah, A. Makhdoumi, A. Malekian, and A. Ozdaglar. How good are privacy guarantees? platform architecture and violation of user privacy. Technical report, National Bureau of Economic Research, 2023a.
  • Acemoglu et al. [2023b] D. Acemoglu, A. Makhdoumi, A. Malekian, and A. Ozdaglar. A model of behavioral manipulation. Technical report, National Bureau of Economic Research, 2023b.
  • Acquisti et al. [2016] A. Acquisti, C. Taylor, and L. Wagman. The economics of privacy. Journal of Economic Literature, 54(2):442–492, 2016.
  • Aguirre et al. [2010] I. Aguirre, S. Cowan, and J. Vickers. Monopoly price discrimination and demand curvature. American Economic Review, 100(4):1601–1615, 2010.
  • Ali et al. [2020] S. N. Ali, G. Lewis, and S. Vasserman. Voluntary disclosure and personalized pricing. In Proceedings of the 21st ACM Conference on Economics and Computation, pages 537–538, 2020.
  • [8] Apple. Differential privacy overview - apple. https://www.apple.com/privacy/docs/Differential_Privacy_Overview.pdf. Retrieved May 4, 2023.
  • Arieli et al. [2024] I. Arieli, Y. Babichenko, O. Madmon, and M. Tennenholtz. Robust price discrimination. arXiv preprint arXiv:2401.16942, 2024.
  • Banerjee et al. [2024] S. Banerjee, K. Munagala, Y. Shen, and K. Wang. Fair price discrimination. In Proceedings of the 2024 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 2679–2703. SIAM, 2024.
  • Bergemann et al. [2015] D. Bergemann, B. Brooks, and S. Morris. The limits of price discrimination. American Economic Review, 105(3):921–957, 2015.
  • Bergemann et al. [2020] D. Bergemann, A. Bonatti, and T. Gan. The economics of social data. arXiv preprint arXiv:2004.03107, 2020.
  • Bergemann et al. [2024] D. Bergemann, T. Heumann, and M. C. Wang. A unified approach to second and third degree price discrimination. arXiv preprint arXiv:2401.12366, 2024.
  • Bernardo [1979] J. M. Bernardo. Reference posterior distributions for Bayesian inference. Journal of the Royal Statistical Society Series B: Statistical Methodology, 41(2):113–128, 1979.
  • Bertsekas [1997] D. P. Bertsekas. Nonlinear programming. Journal of the Operational Research Society, 48(3):334–334, 1997.
  • Cowan [2016] S. Cowan. Welfare-increasing third-degree price discrimination. The RAND Journal of Economics, 47(2):326–340, 2016.
  • Desfontaines [2021] D. Desfontaines. A list of real-world uses of differential privacy. Ted is Writing Things, 2021.
  • Dwork et al. [2006a] C. Dwork, K. Kenthapadi, F. McSherry, I. Mironov, and M. Naor. Our data, ourselves: Privacy via distributed noise generation. In Advances in Cryptology-EUROCRYPT 2006: 24th Annual International Conference on the Theory and Applications of Cryptographic Techniques, St. Petersburg, Russia, May 28-June 1, 2006. Proceedings 25, pages 486–503. Springer, 2006a.
  • Dwork et al. [2006b] C. Dwork, F. McSherry, K. Nissim, and A. Smith. Calibrating noise to sensitivity in private data analysis. In Theory of Cryptography: Third Theory of Cryptography Conference, TCC 2006, New York, NY, USA, March 4-7, 2006. Proceedings 3, pages 265–284. Springer, 2006b.
  • Dwork et al. [2014] C. Dwork, A. Roth, et al. The algorithmic foundations of differential privacy. Foundations and Trends® in Theoretical Computer Science, 9(3–4):211–407, 2014.
  • Elliott et al. [2021] M. Elliott, A. Galeotti, A. Koh, and W. Li. Market segmentation through information. Available at SSRN 3432315, 2021.
  • Erlingsson et al. [2014] Ú. Erlingsson, V. Pihur, and A. Korolova. Rappor: Randomized aggregatable privacy-preserving ordinal response. In Proceedings of the 2014 ACM Conference on Computer and Communications Security, pages 1054–1067, 2014.
  • Fainmesser et al. [2023] I. P. Fainmesser, A. Galeotti, and R. Momot. Consumer profiling via information design. Available at SSRN, 2023.
  • Farboodi and Veldkamp [2023] M. Farboodi and L. Veldkamp. Data and markets. Annual Review of Economics, 15:23–40, 2023.
  • Greenberg et al. [1969] B. G. Greenberg, A.-L. A. Abul-Ela, W. R. Simmons, and D. G. Horvitz. The unrelated question randomized response model: Theoretical framework. Journal of the American Statistical Association, 64(326):520–539, 1969.
  • Haghpanah and Hartline [2021] N. Haghpanah and J. Hartline. When is pure bundling optimal? The Review of Economic Studies, 88(3):1127–1156, 2021.
  • Haghpanah and Siegel [2022] N. Haghpanah and R. Siegel. The limits of multiproduct price discrimination. American Economic Review: Insights, 4(4):443–458, 2022.
  • Haghpanah and Siegel [2023] N. Haghpanah and R. Siegel. Pareto-improving segmentation of multiproduct markets. Journal of Political Economy, 131(6):000–000, 2023.
  • Haghpapanah and Siegel [2019] N. Haghpapanah and R. Siegel. Consumer-optimal market segmentation. In Proceedings of the 2019 ACM Conference on Economics and Computation, pages 241–242, 2019.
  • Hidir and Vellodi [2021] S. Hidir and N. Vellodi. Privacy, personalization, and price discrimination. Journal of the European Economic Association, 19(2):1342–1363, 2021.
  • Ichihashi [2020] S. Ichihashi. Online privacy and information disclosure by consumers. American Economic Review, 110(2):569–595, 2020.
  • Jones and Tonetti [2020] C. I. Jones and C. Tonetti. Nonrivalry and the economics of data. American Economic Review, 110(9):2819–2858, 2020.
  • Kamenica and Gentzkow [2011] E. Kamenica and M. Gentzkow. Bayesian persuasion. American Economic Review, 101(6):2590–2615, 2011.
  • Ko and Munagala [2022] S.-H. Ko and K. Munagala. Optimal price discrimination for randomized mechanisms. In Proceedings of the 23rd ACM Conference on Economics and Computation, pages 477–496, 2022.
  • Lei et al. [2023] Y. Lei, S. Miao, and R. Momot. Privacy-preserving personalized revenue management. Management Science, 2023.
  • Milgrom and Segal [2002] P. Milgrom and I. Segal. Envelope theorems for arbitrary choice sets. Econometrica, 70(2):583–601, 2002.
  • Posner and Weyl [2018] E. Posner and E. Weyl. Radical Markets: Uprooting Capitalism and Democracy for a Just Society. Princeton University Press, 2018.
  • Rassouli and Gündüz [2019] B. Rassouli and D. Gündüz. Optimal utility-privacy trade-off with total variation distance as a privacy measure. IEEE Transactions on Information Forensics and Security, 15:594–603, 2019.
  • Robinson [1969] J. Robinson. The Economics of Imperfect Competition. Springer, 1969.
  • Roesler and Szentes [2017] A.-K. Roesler and B. Szentes. Buyer-optimal learning and monopoly pricing. American Economic Review, 107(7):2072–2080, 2017.
  • Schmalensee [1981] R. Schmalensee. Output and welfare implications of monopolistic third-degree price discrimination. The American Economic Review, 71(1):242–247, 1981.
  • Suresh [2019] A. T. Suresh. Differentially private anonymized histograms. Advances in Neural Information Processing Systems, 32, 2019.
  • Varian [1985] H. R. Varian. Price discrimination and social welfare. The American Economic Review, 75(4):870–875, 1985.
  • Warner [1965] S. L. Warner. Randomized response: A survey technique for eliminating evasive answer bias. Journal of the American Statistical Association, 60(309):63–69, 1965.
  • Xu et al. [2013] J. Xu, Z. Zhang, X. Xiao, Y. Yang, G. Yu, and M. Winslett. Differentially private histogram publication. The VLDB journal, 22:797–822, 2013.

Appendix A Deferred Proofs

A.1 Proof of Proposition 1

In our case, the prior distribution is uniform, and the posterior distribution after observing the output of the privacy mechanism is uniform only with probability β\beta. Therefore, for Y=β(X)Y=\mathcal{M}_{\beta}(X), we have

TV(pX|Y(|Y),pX())=1β\displaystyle\mathrm{TV}(p_{X|Y}(\cdot|Y),p_{X}(\cdot))=1-\beta

and therefore the privacy-leakage defined in Definition 1 becomes 1β1-\beta.

A.2 Connections with differential privacy

In the context of pricing based on histograms or distribution of values, the dataset is the set of consumers’ values, and the algorithm’s output is the selected market price. Thus, the goal of this privacy framework is to ensure that one cannot infer too much about the consumers’ values by observing the optimal price. Note that this addresses a different aspect of privacy compared to that which we discussed previously. In our setting, the main concern is not the producer itself but rather what an adversary can infer about the consumers’ values by observing the price set for them. For instance, if one observes that the insurance premium is set high for a zip code, one may infer that residents of that zip code often require medical assistance. The application of differential privacy to pricing has also been studied in the context of revenue management [see, e.g., Lei et al., 2023]. A caveat is that differential privacy is typically applicable to settings with a finite number of users, as opposed to our setting with a continuum of consumers. We discuss this matter in Section A.2, provide an adjusted definition of differential privacy, and demonstrate how our privacy mechanism (3) is, in fact, differentially private.

Let us first recall the definition of differential privacy for an algorithm 𝒜\mathcal{A} over a set of nn users’ data sets, denoted by 𝒛={z1,,zn}\bm{z}=\{z_{1},\cdots,z_{n}\} [see Dwork et al., 2014, for a detailed discussion of the definition and its applications].

Definition 3.

A randomized algorithm 𝒜:𝒵n𝒴\mathcal{A}:\mathcal{Z}^{n}\to\mathcal{Y} is called ε\varepsilon-differentially private if, for any measurable set 𝒪\mathcal{O}, we have

(𝒜(𝒛)𝒪)eε(𝒜(𝒛)𝒪),{\mathbb{P}(\mathcal{A}(\bm{z})\in\mathcal{{O}})}\leq e^{\varepsilon}~{}{\mathbb{P}(\mathcal{A}(\bm{z^{\prime}})\in\mathcal{{O}})}, (35)

for any two neighboring datasets 𝐳\bm{z} and 𝐳\bm{z^{\prime}} that only differ in one coordinate.

Here, ε\varepsilon quantifies the privacy loss. The higher ε\varepsilon is, it means the output is more sensitive to the user’s data, meaning that it provides a weaker privacy guarantee.

One challenge here is that, in our case, we work with a continuum of consumers, and hence, changing one user’s data is not meaningful. A natural proposal is to consider two markets that are close enough as a surrogate for neighboring distributions. In particular, we can use a distribution distance function d:𝒳×𝒳0d:\mathcal{X}\times\mathcal{X}\to\mathbb{R}^{\geq 0} as a measure of the closeness of markets and say two markets xx and xx^{\prime} are neighboring markets if d(x,x)Δd(x,x^{\prime})\leq\Delta for some choice of Δ>0\Delta>0. We, therefore, use the following adjusted definition. Recall that ϕ()\phi(\cdot) denotes the optimal pricing rule and 𝒳\mathcal{X} represents the set of markets.

Definition 4.

A randomized mechanism :𝒳𝒳\mathcal{M}:\mathcal{X}\to\mathcal{X} is called ε\varepsilon-differentially private if, for any value vi𝒱v_{i}\in\mathcal{V}, we have

(ϕ((x))=vi)eε(ϕ((x))=vi),\mathbb{P}(\phi(\mathcal{M}(x))=v_{i})\leq e^{\varepsilon}~{}\mathbb{P}(\phi(\mathcal{M}(x^{\prime}))=v_{i}), (36)

for two markets d(x,x)Δd(x,x^{\prime})\leq\Delta.

Now, let us see how our mechanism fits into this definition.

Proposition 7.

The mechanism β()\mathcal{M}_{\beta}(\cdot) is ε\varepsilon-differentially private with

ε=maxi:Vol(Xiβ)01β+βVol(Xiβ)βVol(Xiβ),\varepsilon=\max_{i:\text{Vol}(X_{i}^{\beta})\neq 0}\frac{1-\beta+\beta\text{Vol}(X_{i}^{\beta})}{\beta\text{Vol}(X_{i}^{\beta})}, (37)

where XiβX_{i}^{\beta} is defined in (19).

Proof.

To simplify the notation, let us denote β(x)\mathcal{M}_{\beta}(x) by x^\hat{x}. Then, we have

(ϕ(x^)=vi)=(x^Xiβ)=(1β)𝟙(xXiβ)+βVol(Xiβ).\mathbb{P}(\phi(\hat{x})=v_{i})=\mathbb{P}(\hat{x}\in X_{i}^{\beta})=(1-\beta)\mathbbm{1}(x\in X_{i}^{\beta})+\beta\text{Vol}(X_{i}^{\beta}).

By changing xx, the term 𝟙(xXiβ)\mathbbm{1}(x\in X_{i}^{\beta}) could change from zero to one, but the second term βVol(Xiβ)\beta\text{Vol}(X_{i}^{\beta}) remains unchanged. This, consequently, implies the result. ∎

A.3 Proof of Proposition 2

Let νβ(x,x^)\nu_{\beta}(x,\hat{x}) denote the joint distribution of xx and x^\hat{x}. With a slight abuse of notation, we denote the distribution of x^\hat{x} condition on xx by νβ(x^|x)\nu_{\beta}(\hat{x}|x). Note that the term in (5) that we wish to maximize is equal to

𝒳μβ(x^)\displaystyle\int_{\mathcal{X}}\mu_{\beta}(\hat{x}) (𝒳πβ(x|x^)log(πβ(x|x^)π(x))𝑑x)dx^=𝒳𝒳νβ(x,x^)log(νβ(x,x^)π(x)μβ(x^))𝑑x𝑑x^\displaystyle\left(\int_{\mathcal{X}}\pi_{\beta}(x|\hat{x})\log(\frac{\pi_{\beta}(x|\hat{x})}{\pi(x)})~{}dx\right)d\hat{x}=\int_{\mathcal{X}}\int_{\mathcal{X}}\nu_{\beta}(x,\hat{x})\log(\frac{\nu_{\beta}(x,\hat{x})}{\pi(x)\mu_{\beta}(\hat{x})})~{}dxd\hat{x} (38)
=𝒳π(x)𝒳νβ(x^|x)log(νβ(x^|x)μβ(x^))𝑑x^𝑑x\displaystyle=\int_{\mathcal{X}}\pi(x)\int_{\mathcal{X}}\nu_{\beta}(\hat{x}|x)\log(\frac{\nu_{\beta}(\hat{x}|x)}{\mu_{\beta}(\hat{x})})~{}d\hat{x}dx (39)
=𝒳π(x)(𝒳νβ(x^|x)log(νβ(x^|x)))𝑑x𝒳𝒳π(x)νβ(x^|x)log(μβ(x^))\displaystyle=\int_{\mathcal{X}}\pi(x)\left(\int_{\mathcal{X}}\nu_{\beta}(\hat{x}|x)\log(\nu_{\beta}(\hat{x}|x))\right)dx-\int_{\mathcal{X}}\int_{\mathcal{X}}\pi(x)\nu_{\beta}(\hat{x}|x)\log(\mu_{\beta}(\hat{x})) (40)
=𝒳π(x)(𝒳νβ(x^|x)log(νβ(x^|x)))𝑑x𝒳μβ(x^)log(μβ(x^)).\displaystyle=\int_{\mathcal{X}}\pi(x)\left(\int_{\mathcal{X}}\nu_{\beta}(\hat{x}|x)\log(\nu_{\beta}(\hat{x}|x))\right)dx-\int_{\mathcal{X}}\mu_{\beta}(\hat{x})\log(\mu_{\beta}(\hat{x})). (41)

Note that, given the privacy mechanism β()\mathcal{M}_{\beta}(\cdot), 𝒳νβ(x^|x)log(νβ(x^|x))\int_{\mathcal{X}}\nu_{\beta}(\hat{x}|x)\log(\nu_{\beta}(\hat{x}|x)) is independent of xx, and therefore, the first term in (41) does not depend on the choice of π()\pi(\cdot). The second term is the entropy of μβ(x^)\mu_{\beta}(\hat{x}) and we know that it is maximized when μβ()\mu_{\beta}(\cdot) is the uniform distribution, which is the case when π()\pi(\cdot) is the uniform distribution. This completes the proof.

A.4 Proof of Lemma 1

Notice that the producer’s utility by setting the price equal to v1v_{1} is given by 𝒰p(v1|x^)=v1\mathcal{U}_{p}(v_{1}|\hat{x})=v_{1} as all the consumers will buy the product at this price. For price v2v_{2}, the expected utility is given by

𝒰p(v2|x^)\displaystyle\mathcal{U}_{p}(v_{2}|\hat{x}) =v2((1β)α^+β01α𝑑α)=v2((1β)α^+β2).\displaystyle=v_{2}\left((1-\beta)\hat{\alpha}+\beta\int_{0}^{1}\alpha~{}d\alpha\right)=v_{2}\left((1-\beta)\hat{\alpha}+\frac{\beta}{2}\right). (42)

Comparing this with 𝒰p(v1|x^)=v1\mathcal{U}_{p}(v_{1}|\hat{x})=v_{1} completes the proof.

A.5 Proof of Proposition 3

First, notice that different optimal pricing rules only differ in how they price a market upon observing α^=t\hat{\alpha}=t^{*}, since in this situation both prices v2v_{2} and v1v_{1} are optimal. Therefore, we can parameterize the optimal pricing rules by the probability of setting the price to v2v_{2} given α^=t\hat{\alpha}=t^{*}. This probability is denoted by δ[0,1]\delta\in[0,1] and its corresponding optimal pricing rule is represented by ϕδ()\phi_{\delta}(\cdot).

Next, we characterize the expected utilities of consumer and the producer from a market α\alpha. Given the above discussion, and with a slight abuse of notation, we denote these expected utilities by Upδ()U_{p}^{\delta}(\cdot) and Ucδ()U_{c}^{\delta}(\cdot), respectively.

Lemma 5.

Consider a market x=(1α,α)x=(1-\alpha,\alpha) and a pricing rule ϕ()\phi(\cdot) that sets the price of a market v2v_{2} with probability δ\delta in the case of observing α^=t\hat{\alpha}=t^{*}.

  1. (i)

    If α<t\alpha<t^{*}, then Upδ(x)=f1(α)U_{p}^{\delta}(x)=f_{1}(\alpha) and Ucδ(x)=g1(α)U_{c}^{\delta}(x)=g_{1}(\alpha), where f1()f_{1}(\cdot) and g1()g_{1}(\cdot) are given by

    f1(α)=(1β+βt)v1+β(1t)αv2,g1(α)=(1β+βt)α(v2v1).\displaystyle f_{1}(\alpha)=(1-\beta+\beta t^{*})v_{1}+\beta(1-t^{*})\alpha v_{2},\quad g_{1}(\alpha)=(1-\beta+\beta t^{*})\alpha(v_{2}-v_{1}).
  2. (ii)

    If α>t\alpha>t^{*}, then Upδ(x)=f2(α)U_{p}^{\delta}(x)=f_{2}(\alpha) and Ucδ(x)=g2(α)U_{c}^{\delta}(x)=g_{2}(\alpha), where f2()f_{2}(\cdot) and g2()g_{2}(\cdot) are given by

    f2(α)=βtv1+(1β+β(1t))αv2,g2(α)=βtα(v2v1).\displaystyle f_{2}(\alpha)=\beta t^{*}v_{1}+(1-\beta+\beta(1-t^{*}))\alpha v_{2},\quad g_{2}(\alpha)=\beta t^{*}\alpha(v_{2}-v_{1}).
  3. (iii)

    If α=t\alpha=t^{*}, then Upδ(x)U_{p}^{\delta}(x) and Ucδ(x)U_{c}^{\delta}(x) are given by

    Upδ(x)=(1δ)f1(t)+δf2(t),Ucδ(x)=(1δ)g1(t)+δg2(t).\displaystyle U_{p}^{\delta}(x)=(1-\delta)f_{1}(t^{*})+\delta f_{2}(t^{*}),\quad U_{c}^{\delta}(x)=(1-\delta)g_{1}(t^{*})+\delta g_{2}(t^{*}).
Proof.

Suppose α<t\alpha<t^{*}. Then, with probability 1β+βt1-\beta+\beta t^{*}, the realized α^\hat{\alpha} falls within the interval [0,t)[0,t^{*}), and hence the optimal price is v1v_{1} as per Lemma 1. In this scenario, the producer’s utility would be v1v_{1} and the consumer’ utility would be α(v2v1)\alpha(v_{2}-v_{1}). Also, with probability β(1t)\beta(1-t^{*}), α^\hat{\alpha} falls into the interval (t,1](t^{*},1], and the market would be priced at v2v_{2}. Here, the producer’s utility is αv2\alpha v_{2} and the consumer’ utility is zero. Summing these scenarios gives the expected utilities. It is also worth noting that the probability of α^=t\hat{\alpha}=t^{*} is zero in this case.

The case α>t\alpha>t^{*} can be argued similarly. Finally, for the case α=t\alpha=t^{*}, with probability 1β1-\beta we have α^=t\hat{\alpha}=t^{*} which leads to choosing price v2v_{2} with probability δ\delta and price v1v_{1} with probability 1δ1-\delta. This observation establishes the final result. ∎

Now, we turn back to our goal of characterizing the set of utility pairs (9). In general, searching over the entire set of segmentations and pricing rules can be challenging. However, the following lemma enables us to narrow our search without loss of generality.

Lemma 6.

Suppose Assumption 2 holds. Then,

𝒮\displaystyle\mathcal{S} ={[γ1g1(α1)+γ2g2(α2),γ1f1(α1)+γ2f2(α2)]|\displaystyle=\biggl{\{}\Big{[}\gamma_{1}g_{1}(\alpha_{1})+\gamma_{2}g_{2}(\alpha_{2}),\gamma_{1}f_{1}(\alpha_{1})+\gamma_{2}f_{2}(\alpha_{2})\Big{]}^{\top}~{}\Big{|} (43)
α1[0,min{t,α}],α2[max{t,α},1],γ1 and γ2[0,1],γ1+γ2=1,γ1α1+γ2α2=α}\displaystyle\qquad\alpha_{1}\in[0,\min\{t^{*},\alpha^{*}\}],\alpha_{2}\in[\max\{t^{*},\alpha^{*}\},1],\gamma_{1}\text{ and }\gamma_{2}\in[0,1],\gamma_{1}+\gamma_{2}=1,\gamma_{1}\alpha_{1}+\gamma_{2}\alpha_{2}=\alpha^{*}\biggr{\}}
Proof.

We first prove that for any segmentation σΣ\sigma\in\Sigma and optimal pricing rule ϕδ()\phi_{\delta}(\cdot), there is a representation in the form of right hand side of (43) such that

xsupp(σ)σ(x)Upδ(x)=γ1f1(α1)+γ2f2(α2) and xsupp(σ)σ(x)Ucδ(x)=γ1g1(α1)+γ2g2(α2).\sum_{x\in\text{supp}(\sigma)}\sigma(x)U_{p}^{\delta}(x)=\gamma_{1}f_{1}(\alpha_{1})+\gamma_{2}f_{2}(\alpha_{2})\quad\text{ and }\sum_{x\in\text{supp}(\sigma)}\sigma(x)U_{c}^{\delta}(x)=\gamma_{1}g_{1}(\alpha_{1})+\gamma_{2}g_{2}(\alpha_{2}). (44)

To simplify the notation, we use σ(α)\sigma(\alpha) and Upδ(α)U_{p}^{\delta}(\alpha) to denote σ(x)\sigma(x) and Upδ(x)U_{p}^{\delta}(x) when x=(1α,α)x=(1-\alpha,\alpha). Notice that

α:σ(α)>0\displaystyle\sum_{\alpha:\sigma(\alpha)>0} σ(α)Upδ(α)=α<t&σ(α)>0σ(α)Upδ(α)+α>t&σ(α)>0σ(α)Upδ(α)+σ(t)Upδ(t)\displaystyle\sigma(\alpha)U_{p}^{\delta}(\alpha)=\sum_{\alpha<t^{*}\&\sigma(\alpha)>0}\sigma(\alpha)U_{p}^{\delta}(\alpha)+\sum_{\alpha>t^{*}\&\sigma(\alpha)>0}\sigma(\alpha)U_{p}^{\delta}(\alpha)+\sigma(t^{*})U_{p}^{\delta}(t^{*})
=α<t&σ(α)>0σ(α)f1(α)+α>t&σ(α)>0σ(α)f2(α)+σ(t)((1δ)f1(t)+δf2(t)),\displaystyle=\sum_{\alpha<t^{*}\&\sigma(\alpha)>0}\sigma(\alpha)f_{1}(\alpha)+\sum_{\alpha>t^{*}\&\sigma(\alpha)>0}\sigma(\alpha)f_{2}(\alpha)+\sigma(t^{*})\left((1-\delta)f_{1}(t^{*})+\delta f_{2}(t^{*})\right), (45)

where the last equation uses Lemma 5. Now, notice that f1()f_{1}(\cdot) and f2()f_{2}(\cdot) are linear functions. Therefore, we can cast (45) as

γ1f1(α1)+γ2f2(α2)\gamma_{1}f_{1}(\alpha_{1})+\gamma_{2}f_{2}(\alpha_{2})

with

γ1\displaystyle\gamma_{1} =α<t&σ(α)>0σ(α)+(1δ)σ(t),\displaystyle=\sum_{\alpha<t^{*}\&\sigma(\alpha)>0}\sigma(\alpha)+(1-\delta)\sigma(t^{*}),
α1\displaystyle\alpha_{1} =1γ1(α<t&σ(α)>0σ(α)α+(1δ)σ(t)t),\displaystyle=\frac{1}{\gamma_{1}}\left(\sum_{\alpha<t^{*}\&\sigma(\alpha)>0}\sigma(\alpha)\alpha+(1-\delta)\sigma(t^{*})t^{*}\right),
γ2\displaystyle\gamma_{2} =α>t&σ(α)>0σ(α)+δσ(t),\displaystyle=\sum_{\alpha>t^{*}\&\sigma(\alpha)>0}\sigma(\alpha)+\delta\sigma(t^{*}),
α2\displaystyle\alpha_{2} =1γ2(α>t&σ(α)>0σ(α)α+δσ(t)t).\displaystyle=\frac{1}{\gamma_{2}}\left(\sum_{\alpha>t^{*}\&\sigma(\alpha)>0}\sigma(\alpha)\alpha+\delta\sigma(t^{*})t^{*}\right).

Notice that a similar representation can also be shown for the consumer’ utility. We next verify that the conditions in (43) are met. Firstly, as σ()\sigma(\cdot) is a segmentation within Σ\Sigma, it can be verified that γ1\gamma_{1} and γ2\gamma_{2} are nonnegative and together sum up to one. Furthermore, the condition α:σ(α)>0σ(α)α=α\sum_{\alpha:\sigma(\alpha)>0}\sigma(\alpha)\alpha=\alpha^{*} ensures that γ1α1+γ2α2=α\gamma_{1}\alpha_{1}+\gamma_{2}\alpha_{2}=\alpha^{*}.

Given that α1\alpha_{1} is effectively a weighted average of tt^{*} and several values of α<t\alpha<t^{*}, it follows that α1t\alpha_{1}\leq t^{*}. Similarly, it can be established that α2t\alpha_{2}\geq t^{*}. Moreover, a weighted average of α1\alpha_{1} and α2\alpha_{2} equates to α\alpha^{*}. Consequently, we should have α1αα2\alpha_{1}\leq\alpha^{*}\leq\alpha_{2}. All these considerations imply that α1[0,min{t,α}]\alpha_{1}\in[0,\min\{t^{*},\alpha^{*}\}] and α2[max{t,α},1]\alpha_{2}\in[\max\{t^{*},\alpha^{*}\},1].

Now we establish the converse. Suppose α1,α2,γ1\alpha_{1},\alpha_{2},\gamma_{1}, and γ2\gamma_{2} are given, satisfying the conditions on the right-hand side of (43). We will show that there exists a segmentation σΣ\sigma\in\Sigma and an optimal pricing rule ϕδ()\phi_{\delta}(\cdot) such that (44) is satisfied. If α1<t\alpha_{1}<t^{*} and α2>t\alpha_{2}>t^{*}, we can consider a segmentation with two markets (1α1,α1)(1-\alpha_{1},\alpha_{1}) and (1α2,α2)(1-\alpha_{2},\alpha_{2}), assigned probabilities γ1\gamma_{1} and γ2\gamma_{2} respectively, and apply any optimal pricing rule to satisfy (44). If α1=t\alpha_{1}=t^{*} and α2>t\alpha_{2}>t^{*}, then ϕ0()\phi_{0}(\cdot) should be taken as the optimal pricing rule. Conversely, if α1<t\alpha_{1}<t^{*} and α2=t\alpha_{2}=t^{*}, ϕ1()\phi_{1}(\cdot) becomes the optimal pricing rule. The only remaining scenario is α1=α2=t\alpha_{1}=\alpha_{2}=t^{*}*, where we take ϕδ()\phi_{\delta}(\cdot) with δ=γ2\delta=\gamma_{2} as the optimal pricing rule. This completes the proof. ∎

Next, note that we can cast γ1f1(α1)+γ2f2(α2)\gamma_{1}f_{1}(\alpha_{1})+\gamma_{2}f_{2}(\alpha_{2}) in the right hand side of (43) in the following way:

γ1f1(α1)+γ2f2(α2)\displaystyle\gamma_{1}f_{1}(\alpha_{1})+\gamma_{2}f_{2}(\alpha_{2}) =β(tv1+(1t)αv2)+(1β)(γ1v1+γ2α2v2),\displaystyle=\beta\left(t^{*}v_{1}+(1-t^{*})\alpha^{*}v_{2}\right)+(1-\beta)\left(\gamma_{1}v_{1}+\gamma_{2}\alpha_{2}v_{2}\right), (46)

where we used the fact that γ1α1+γ2α2=α\gamma_{1}\alpha_{1}+\gamma_{2}\alpha_{2}=\alpha^{*} and γ1+γ2=1\gamma_{1}+\gamma_{2}=1. Similarly, we have

γ1g1(α1)+γ2g2(α2)=βtα(v2v1)+(1β)γ1α1(v2v1).\displaystyle\gamma_{1}g_{1}(\alpha_{1})+\gamma_{2}g_{2}(\alpha_{2})=\beta t^{*}\alpha^{*}(v_{2}-v_{1})+(1-\beta)\gamma_{1}\alpha_{1}(v_{2}-v_{1}). (47)

Notice that the terms β(tv1+(1t)αv2)\beta\left(t^{*}v_{1}+(1-t^{*})\alpha^{*}v_{2}\right) and βtα(v2v1)\beta t^{*}\alpha^{*}(v_{2}-v_{1}) in (46) and (47), respectively, are constants. Thus, it suffices to characterize the set 𝒮\mathcal{S}^{\prime}.

A.6 Proof of Theorem 1

We first identify the set 𝒮\mathcal{S}^{\prime}, doing so by separately considering the cases αt\alpha^{*}\geq t^{*} and αt\alpha^{*}\leq t^{*}.

Case (I): αt\alpha^{*}\geq t^{*}

Lemma 7.

Suppose βmin{2η,2(1η)}\beta\leq\min\{2\eta,2(1-\eta)\} and αt\alpha^{*}\geq t^{*}. Then 𝒮\mathcal{S}^{\prime} is in the form of a triangle with three vertices E=[0,αv2+(1α)v1]E=[0,\alpha^{*}v_{2}+(1-\alpha^{*})v_{1}]^{\top}, F=[0,αv2]F=[0,\alpha^{*}v_{2}]^{\top}, and G=E+[ξ,ξ]G=E+[\xi,-\xi]^{\top} with

ξ:=(1α)t1t(v2v1).\xi:=(1-\alpha^{*})\frac{t^{*}}{1-t^{*}}(v_{2}-v_{1}).
Proof.

First, note that, the two conditions γ1+γ2=1\gamma_{1}+\gamma_{2}=1 and γ1α1+γ2α2=α\gamma_{1}\alpha_{1}+\gamma_{2}\alpha_{2}=\alpha^{*} together imply that

γ1=α2αα2α1,γ2=αα1α2α1.\gamma_{1}=\frac{\alpha_{2}-\alpha^{*}}{\alpha_{2}-\alpha_{1}},\quad\gamma_{2}=\frac{\alpha^{*}-\alpha_{1}}{\alpha_{2}-\alpha_{1}}. (48)

Next, let us define z1:=γ1(1α1)z_{1}:=\gamma_{1}(1-\alpha_{1}). Using α2[α,1]\alpha_{2}\in[\alpha^{*},1] along with (48) implies that, for any α1\alpha_{1}, z1z_{1} spans the interval [0,1α][0,1-\alpha^{*}] as α2\alpha_{2} varies over [α,1][\alpha^{*},1]. Additionally, let us define z2:=α11α1z_{2}:=\frac{\alpha_{1}}{1-\alpha_{1}}. Note that as α1\alpha_{1} spans the interval [0,t][0,t^{*}], z2z_{2} spans the interval [0,t1t][0,\frac{t^{*}}{1-t^{*}}].

Now, notice that we can rewrite γ1v1+γ2α2v2\gamma_{1}v_{1}+\gamma_{2}\alpha_{2}v_{2} as

γ1v1+γ2α2v2\displaystyle\gamma_{1}v_{1}+\gamma_{2}\alpha_{2}v_{2} =γ1α1v2γ1α1(v2v1)+γ1(1α1)v1+γ2α2v2\displaystyle=\gamma_{1}\alpha_{1}v_{2}-\gamma_{1}\alpha_{1}(v_{2}-v_{1})+\gamma_{1}(1-\alpha_{1})v_{1}+\gamma_{2}\alpha_{2}v_{2}
=αv2+γ1(1α1)v1γ1α1(v2v1)\displaystyle=\alpha^{*}v_{2}+\gamma_{1}(1-\alpha_{1})v_{1}-\gamma_{1}\alpha_{1}(v_{2}-v_{1})
=αv2+z1v1z1z2(v2v1),\displaystyle=\alpha^{*}v_{2}+z_{1}v_{1}-z_{1}z_{2}(v_{2}-v_{1}), (49)

where the second equality used the fact that γ1α1+γ2α2=α\gamma_{1}\alpha_{1}+\gamma_{2}\alpha_{2}=\alpha^{*}. Therefore, we can cast 𝒮\mathcal{S}^{\prime} as

𝒮={[z1z2(v2v1),αv2+z1v1z1z2(v2v1)]|z1[0,1α],z2[0,t1t]}.\displaystyle\mathcal{S^{\prime}}=\biggl{\{}\Big{[}z_{1}z_{2}(v_{2}-v_{1}),\alpha^{*}v_{2}+z_{1}v_{1}-z_{1}z_{2}(v_{2}-v_{1})\Big{]}^{\top}~{}\Big{|}z_{1}\in[0,1-\alpha^{*}],z_{2}\in[0,\frac{t^{*}}{1-t^{*}}]\biggr{\}}. (50)

It is straightforward to verify that this set is indeed the triangle EFGEFG. ∎

Figure 9 illustrates this triangle for the two cases of tηt^{*}\leq\eta and tηt^{*}\geq\eta. Notice that the triangle formed by points AA, BB, and DD corresponds with the non-private case where β=0\beta=0, implying t=ηt^{*}=\eta and 𝒮=𝒮\mathcal{S}^{\prime}=\mathcal{S}.

αv2+(1α)v1\alpha^{*}v_{2}+(1-\alpha^{*})v_{1}Eαv2\alpha^{*}v_{2}FGH(1α)v1(1-\alpha^{*})v_{1}
(a) tηt^{*}\leq\eta
αv2+(1α)v1\alpha^{*}v_{2}+(1-\alpha^{*})v_{1}Eαv2\alpha^{*}v_{2}FGH(1α)v1(1-\alpha^{*})v_{1}
(b) tηt^{*}\geq\eta
Figure 9: Illustration of 𝒮\mathcal{S}^{\prime} for the case αt\alpha^{*}\geq t^{*}

Case (II): αt\alpha^{*}\leq t^{*}

Lemma 8.

Suppose βmin{2η,2(1η)}\beta\leq\min\{2\eta,2(1-\eta)\} and αt\alpha^{*}\leq t^{*}. Then 𝒮\mathcal{S}^{\prime} is in the form of a triangle with three vertices E=[0,αv2+(1α)v1]E=[0,\alpha^{*}v_{2}+(1-\alpha^{*})v_{1}]^{\top}, F=[α(v2v1),v1]F=[\alpha^{*}(v_{2}-v_{1}),v_{1}]^{\top}, and G=[0,κ]G=[0,\kappa]^{\top} with

κ:=v1+αv2(1ηt).\kappa:=v_{1}+\alpha^{*}v_{2}(1-\frac{\eta}{t^{*}}).
Proof.

The proof technique is similar to the ones used in the proof of Lemma 7. Here, we define w1:=γ2α2w_{1}:=\gamma_{2}\alpha_{2} and w2:=1α2w_{2}:=\frac{1}{\alpha_{2}}. Using (48), we can see that α1\alpha_{1} and α2\alpha_{2} sweeping over [0,α][0,\alpha^{*}] and [t,1][t^{*},1], respectively, imply w1w_{1} and w2w_{2} spanning over [0,α][0,\alpha^{*}] and [1,1t][1,\frac{1}{t^{*}}], respectively.

We next represent 𝒮\mathcal{S}^{\prime} using w1w_{1} and w2w_{2}. Particularly, we can rewrite 𝒮\mathcal{S}^{\prime} as

𝒮={[(αw1)(v2v1),v1+w1v2w1w2v1]|w1[0,α],w2[1,1t]}.\displaystyle\mathcal{S^{\prime}}=\biggl{\{}\Big{[}(\alpha^{*}-w_{1})(v_{2}-v_{1}),v_{1}+w_{1}v_{2}-w_{1}w_{2}v_{1}\Big{]}^{\top}~{}\Big{|}w_{1}\in[0,\alpha^{*}],w_{2}\in[1,\frac{1}{t^{*}}]\biggr{\}}. (51)

We can see that this is the triangle given in the lemma’s statement. ∎

Figure 10 illustrates this triangle for the two cases of tηt^{*}\leq\eta and tηt^{*}\geq\eta. Here, the triangle formed by points AA, EE, and GG corresponds with the non-private case where β=0\beta=0, implying t=ηt^{*}=\eta and 𝒮=𝒮\mathcal{S}^{\prime}=\mathcal{S}.

αv2+(1α)v1\alpha^{*}v_{2}+(1-\alpha^{*})v_{1}Ev1v_{1}HGFα(v2v1)\alpha^{*}(v_{2}-v_{1})
(a) tηt^{*}\leq\eta
αv2+(1α)v1\alpha^{*}v_{2}+(1-\alpha^{*})v_{1}Ev1v_{1}HGFα(v2v1)\alpha^{*}(v_{2}-v_{1})
(b) tηt^{*}\geq\eta
Figure 10: Illustration of 𝒮\mathcal{S}^{\prime} for the case αt\alpha^{*}\leq t^{*}

Now, using Proposition 3 along with Lemmas 7 and 8 completes the proof.

A.7 Proof of Lemma 2

Recall by (18) that the maximum producer utility is given by

umax(β):=αv2+(1α)v1β(t(αv2v1)+(1α)v1),u_{\text{max}}(\beta):=\alpha^{*}v_{2}+(1-\alpha^{*})v_{1}-\beta\left(t^{*}(\alpha^{*}v_{2}-v_{1})+(1-\alpha^{*})v_{1}\right), (52)

where tt^{*} is given by Lemma 1. Taking the derivative with respect to β\beta, we have

ddβumax(β)\displaystyle\frac{d}{d\beta}u_{\text{max}}(\beta) =(η+α2αη)(β22β)+2η2η22η(1β)2\displaystyle=-\frac{(\eta+\alpha^{*}-2\alpha^{*}\eta)(\beta^{2}-2\beta)+2\eta-2\eta^{2}}{2\eta(1-\beta)^{2}}
=η+α2αη2η(1β)2((1β)2+(αη)(2η1)η+α2αη).\displaystyle=-\frac{\eta+\alpha^{*}-2\alpha^{*}\eta}{2\eta(1-\beta)^{2}}\left((1-\beta)^{2}+\frac{(\alpha^{*}-\eta)(2\eta-1)}{\eta+\alpha^{*}-2\alpha^{*}\eta}\right). (53)

Notice that α+η2αη0\alpha^{*}+\eta-2\alpha^{*}\eta\geq 0 as α,η1\alpha^{*},\eta\leq 1. Now, to see whether the maximum producer utility is monotone or not, we need to check whether the sign of

h(β):=(1β)2+(αη)(2η1)η+α2αηh(\beta):=(1-\beta)^{2}+\frac{(\alpha^{*}-\eta)(2\eta-1)}{\eta+\alpha^{*}-2\alpha^{*}\eta} (54)

changes as β\beta varies over [0,min{2η,2(1η)}][0,\min\{2\eta,2(1-\eta)\}]. Note that h(β)h(\beta) is decreasing over this interval. We also know that the maximum utility under privacy is (weakly) lower than the non-private case, meaning that (53) is nonpositive at β=0\beta=0 which implies h(0)0h(0)\geq 0 (we could also simplify verify this by setting β=0\beta=0 at (54)).

Consequently, the maximum producer utility is monotone in β\beta if and only if

h(min{2η,2(1η)})0.h\left(\min\{2\eta,2(1-\eta)\}\right)\geq 0.

Notice that

h(min{2η,2(1η)})\displaystyle h\left(\min\{2\eta,2(1-\eta)\}\right) =(12η)2+(αη)(2η1)η+α2αη=(12η)2η(1η)(12α)η+α2αη,\displaystyle=(1-2\eta)^{2}+\frac{(\alpha^{*}-\eta)(2\eta-1)}{\eta+\alpha^{*}-2\alpha^{*}\eta}=\frac{(1-2\eta)2\eta(1-\eta)(1-2\alpha^{*})}{\eta+\alpha^{*}-2\alpha^{*}\eta}, (55)

which is nonnegative if and only if (12η)(12α)0(1-2\eta)(1-2\alpha^{*})\geq 0. This completes the proof.

A.8 Proof of Lemma 3

Recall that, by Theorem 1, the minimum consumer utility is given by

βtα(v2v1).\beta t^{*}\alpha^{*}(v_{2}-v_{1}). (56)

To understand when this is a monotone function of β\beta, we need to look into the derivative of βt\beta t^{*} as a function of β\beta. This derivative is given by

2ηβ(2β)2(1β)2=(β1)2(12η)2(1β)2.\frac{2\eta-\beta(2-\beta)}{2(1-\beta)^{2}}=\frac{(\beta-1)^{2}-(1-2\eta)}{2(1-\beta)^{2}}. (57)

We know that this derivative is nonnegative at β=0\beta=0. Hence, the minimum consumer utility is monotone if and only if the above derivative is nonnegative at β=min{2η,2(1η)}\beta=\min\{2\eta,2(1-\eta)\}. Plugging this value for β\beta into (57), we have

(12η)2(12η)2(1β)2=2η(12η)2(1β)2,\frac{(1-2\eta)^{2}-(1-2\eta)}{2(1-\beta)^{2}}=\frac{-2\eta(1-2\eta)}{2(1-\beta)^{2}},

which is nonnegative if and only if η1/2\eta\geq 1/2. This completes the proof.

A.9 Proof of Lemma 4

Recall that 𝒰p(vi|x^)\mathcal{U}_{p}(v_{i}|\hat{x}), is given by

𝒰p(vi|x^)=(1β)𝒰p(vi,x^)+β𝔼xUnif(𝒳)[𝒰p(vi,x)].\mathcal{U}_{p}(v_{i}|\hat{x})=(1-\beta)~{}\mathcal{U}_{p}(v_{i},\hat{x})+\beta~{}\mathbb{E}_{x\sim\text{Unif}(\mathcal{X})}\left[\mathcal{U}_{p}(v_{i},x)\right]. (58)

Also, recall that

𝒰p(vi,x)=vik=iKx(vk),\mathcal{U}_{p}(v_{i},x)=v_{i}\sum_{k=i}^{K}x(v_{k}),

and therefore, we have

𝔼xUnif(𝒳)[𝒰p(vi,x)]=vik=iK𝔼xUnif(𝒳)[x(vk)]=vi(K+1i)1K,\mathbb{E}_{x\sim\text{Unif}(\mathcal{X})}\left[\mathcal{U}_{p}(v_{i},x)\right]=v_{i}\sum_{k=i}^{K}\mathbb{E}_{x\sim\text{Unif}(\mathcal{X})}\left[x(v_{k})\right]=v_{i}(K+1-i)\frac{1}{K}, (59)

where the last argument follows from the fact that the expectation of x(v1),,x(vK)x(v_{1}),\cdots,x(v_{K}) are all equal and equal to 1/K1/K when xx is drawn uniformly at random. Thus, we can rewrite 𝒰p(vi|x^)\mathcal{U}_{p}(v_{i}|\hat{x}) as

𝒰p(vi|x^)=vi((1β)k=iKx^(vk)+βK+1iK).\mathcal{U}_{p}(v_{i}|\hat{x})=v_{i}\left((1-\beta)~{}\sum_{k=i}^{K}\hat{x}(v_{k})+\beta~{}\frac{K+1-i}{K}\right). (60)

Now, if x^𝒳iβ\hat{x}\in\mathcal{X}_{i}^{\beta}, then we have

𝒰p(vi|x^)𝒰p(vj|x^) for all j[K].\mathcal{U}_{p}(v_{i}|\hat{x})\geq\mathcal{U}_{p}(v_{j}|\hat{x})\text{ for all }j\in[K]. (61)

Plugging (60) into this equation implies

vik=iKx^(vk)+β1βvi(K+1i)Kvjk=jKx^(vk)+β1βvj(K+1j)K.v_{i}\sum_{k=i}^{K}\hat{x}(v_{k})+\frac{\beta}{1-\beta}\cdot\frac{v_{i}(K+1-i)}{K}\geq v_{j}\sum_{k=j}^{K}\hat{x}(v_{k})+\frac{\beta}{1-\beta}\cdot\frac{v_{j}(K+1-j)}{K}. (62)

First, suppose j<ij<i. In this case, k=jKx^(vk)k=iKx^(vk)\sum_{k=j}^{K}\hat{x}(v_{k})\geq\sum_{k=i}^{K}\hat{x}(v_{k}), and hence, we have

(vivj)k=iKx^(vk)+β1βvi(K+1i)Kβ1βvj(K+1j)K.(v_{i}-v_{j})\sum_{k=i}^{K}\hat{x}(v_{k})+\frac{\beta}{1-\beta}\cdot\frac{v_{i}(K+1-i)}{K}\geq\frac{\beta}{1-\beta}\cdot\frac{v_{j}(K+1-j)}{K}. (63)

Furthermore, using 1k=iKx^(vk)1\geq\sum_{k=i}^{K}\hat{x}(v_{k}), we obtain

β1βK(vivj)[(K+1j)vj(K+1i)vi]+.\frac{\beta}{1-\beta}\geq\frac{K(v_{i}-v_{j})}{\Big{[}(K+1-j)v_{j}-(K+1-i)v_{i}\Big{]}_{+}}. (64)

Next, consider the case j>ij>i. In this case, we use the bounds 1k=iKx^(vk)1\geq\sum_{k=i}^{K}\hat{x}(v_{k}) and k=jKx^(vk)0\sum_{k=j}^{K}\hat{x}(v_{k})\geq 0 along with (62) to obtain

β1βKvi[(K+1j)vj(K+1i)vi]+.\frac{\beta}{1-\beta}\geq\frac{Kv_{i}}{\Big{[}(K+1-j)v_{j}-(K+1-i)v_{i}\Big{]}_{+}}. (65)

Notice that (64) and (65) together completes the proof of the fact that if 𝒳iβ\mathcal{X}_{i}^{\beta} is non-empty, then ββ¯i\beta\geq\bar{\beta}_{i}. To see why the reverse is also true, notice the all the inequalities that we used would change to equality for the case that the market only consists of consumers with value viv_{i}, i.e., x^(vi)=1\hat{x}(v_{i})=1. So this market is indeed in 𝒳iβ\mathcal{X}_{i}^{\beta} as long as ββ¯i\beta\geq\bar{\beta}_{i}.

A.10 Proof of Proposition 4

First, note that β(x)\mathcal{M}_{\beta}(x) is equal to xx itself with probability 1β1-\beta and is equal to a uniformly random market with probability β\beta. In the latter case, the market falls into 𝒳kβ\mathcal{X}_{k}^{\beta} with probability (𝒳kβ)\mathbb{P}(\mathcal{X}_{k}^{\beta}) and hence is priced at vkv_{k} (the boundary of 𝒳kβ\mathcal{X}_{k}^{\beta}’s is measure-zero). Therefore, we have

Upϕ(x)\displaystyle U_{p}^{\phi}(x) =βk=1K(𝒳kβ)𝒰p(vk,x)+(1β)𝔼pϕ(x)[𝒰p(p,x)]\displaystyle=\beta~{}\sum_{k=1}^{K}\mathbb{P}(\mathcal{X}_{k}^{\beta})~{}\mathcal{U}_{p}(v_{k},x)+(1-\beta)~{}\mathbb{E}_{p\sim\phi(x)}[~{}\mathcal{U}_{p}(p,x)]
Ucϕ(x)\displaystyle U_{c}^{\phi}(x) =βk=1K(𝒳kβ)𝒰c(vk,x)+(1β)𝔼pϕ(x)[𝒰c(p,x)].\displaystyle=\beta~{}\sum_{k=1}^{K}\mathbb{P}(\mathcal{X}_{k}^{\beta})~{}\mathcal{U}_{c}(v_{k},x)+(1-\beta)~{}\mathbb{E}_{p\sim\phi(x)}[~{}\mathcal{U}_{c}(p,x)].

Hence, 𝒮\mathcal{S} in (9) can be written as

β{[xsupp(σ)σ(x)k=1K(𝒳kβ)𝒰c(vk,x),xsupp(σ)σ(x)k=1K(𝒳kβ)𝒰p(vk,x)]|σΣ}+(1β)𝒮1,\displaystyle\beta\left\{\Big{[}\sum_{x\in\text{supp}(\sigma)}\sigma(x)\sum_{k=1}^{K}\mathbb{P}(\mathcal{X}_{k}^{\beta})~{}\mathcal{U}_{c}(v_{k},x),\sum_{x\in\text{supp}(\sigma)}\sigma(x)\sum_{k=1}^{K}\mathbb{P}(\mathcal{X}_{k}^{\beta})~{}\mathcal{U}_{p}(v_{k},x)\Big{]}^{\top}~{}\Big{|}~{}\sigma\in\Sigma\right\}+(1-\beta)\mathcal{S}_{1}, (66)

where 𝒮1\mathcal{S}_{1} is given by

{[xsupp(σ)σ(x)𝔼pϕ(x)[𝒰c(p,x)],xsupp(σ)σ(x)𝔼pϕ(x)[𝒰c(p,x)]]|σΣ and ϕ optimal pricing}\left\{\Big{[}\sum_{x\in\text{supp}(\sigma)}\sigma(x)\mathbb{E}_{p\sim\phi(x)}[~{}\mathcal{U}_{c}(p,x)],\sum_{x\in\text{supp}(\sigma)}\sigma(x)\mathbb{E}_{p\sim\phi(x)}[~{}\mathcal{U}_{c}(p,x)]\Big{]}^{\top}~{}\Big{|}~{}\sigma\in\Sigma\text{ and }\phi\text{ optimal pricing}\right\} (67)

Now, note that 𝒰c(vk,x)\mathcal{U}_{c}(v_{k},x) is linear in xx, and thus, we have

xsupp(σ)σ(x)k=1K(𝒳kβ)𝒰c(vk,x)=k=1K(𝒳kβ)xsupp(σ)σ(x)𝒰c(vk,x)=k=1K(𝒳kβ)𝒰c(vk,x).\sum_{x\in\text{supp}(\sigma)}\sigma(x)\sum_{k=1}^{K}\mathbb{P}(\mathcal{X}_{k}^{\beta})~{}\mathcal{U}_{c}(v_{k},x)=\sum_{k=1}^{K}\mathbb{P}(\mathcal{X}_{k}^{\beta})\sum_{x\in\text{supp}(\sigma)}~{}\sigma(x)\mathcal{U}_{c}(v_{k},x)=\sum_{k=1}^{K}\mathbb{P}(\mathcal{X}_{k}^{\beta})~{}\mathcal{U}_{c}(v_{k},x^{*}).

We can similarly simplify the other term corresponding the produce utility and hence the first term in (66) is β𝕔\beta\mathbbm{c}. Now, it remains to show that 𝒮1\mathcal{S}_{1} is same as the set 𝒮\mathcal{S}^{\prime}, defined in the statement of the proposition.

Next, we argue that each market segment in 𝒮1\mathcal{S}_{1} can be subdivided into smaller segments, each priced at a single value, by allowing multiple versions of the same market priced at different, yet optimal, values. Formally, considering a σΣ\sigma\in\Sigma and an optimal pricing rule ϕ()\phi(\cdot), for any xsupp(σ)x\in\text{supp}(\sigma), let 𝒱(x)𝒱\mathcal{V}(x)\subseteq\mathcal{V} be the support of ϕ(x)\phi(x). All values in 𝒱(x)\mathcal{V}(x) are optimal prices for xx.

For each xsupp(σ)x\in\text{supp}(\sigma), we create |𝒱(x)||\mathcal{V}(x)| replicas of xx. Specifically, for each v𝒱(x)v\in\mathcal{V}(x), we replicate market xx, corresponding to a segment of the consumer continuum σ(x)(ϕ(x)=v)\sigma(x)\mathbb{P}(\phi(x)=v), and price it at vv. Notice that this segmentation breakdown maintains identical consumer and producer utilities. We can formally express this as:

𝒮1={[k=1Ki=1kγki𝒰c(vk,xki),k=1Ki=1kγki𝒰p(vk,xki)]|γki0,i,kγki=1,xki𝒳kβ,i,kγkixki=x}.\displaystyle\mathcal{S}_{1}=\left\{\Big{[}\sum_{k=1}^{K}\sum_{i=1}^{\ell_{k}}\gamma_{k}^{i}~{}\mathcal{U}_{c}(v_{k},x_{k}^{i}),\sum_{k=1}^{K}\sum_{i=1}^{\ell_{k}}\gamma_{k}^{i}~{}\mathcal{U}_{p}(v_{k},x_{k}^{i})\Big{]}^{\top}\Big{|}\gamma_{k}^{i}\geq 0,\sum_{i,k}\gamma_{k}^{i}=1,x_{k}^{i}\in\mathcal{X}_{k}^{\beta},\sum_{i,k}\gamma_{k}^{i}x_{k}^{i}=x^{*}\right\}.

Finally, once again using the fact that 𝒰c(vk,x)\mathcal{U}_{c}(v_{k},x) and 𝒰p(vk,x)\mathcal{U}_{p}(v_{k},x) are linear in xx, we can simplify the above term to

{[k=1Kγk𝒰c(vk,xk),k=1Kγk𝒰p(vk,xk)]|γk0,kγk=1,xk𝒳kβ,kγkxk=x},\displaystyle\left\{\Big{[}\sum_{k=1}^{K}\gamma_{k}~{}\mathcal{U}_{c}(v_{k},x_{k}),\sum_{k=1}^{K}\gamma_{k}~{}\mathcal{U}_{p}(v_{k},x_{k})\Big{]}^{\top}~{}\Big{|}~{}\gamma_{k}\geq 0,\sum_{k}\gamma_{k}=1,x_{k}\in\mathcal{X}_{k}^{\beta},\sum_{k}\gamma_{k}x_{k}=x^{*}\right\}, (68)

with

γk=i=1kγki, and xk=i=1kγkixki.\gamma_{k}=\sum_{i=1}^{\ell_{k}}\gamma_{k}^{i},\text{ and }x_{k}=\sum_{i=1}^{\ell_{k}}\gamma_{k}^{i}x_{k}^{i}. (69)

Notice that (68) is the set 𝒮\mathcal{S}^{\prime} and hence the proof is complete.

A.11 Proof of Theorem 2

Here, we provide the details of the proof sketch provided in the main text. Before computing the utilities, it is worth noting that the proof of inequality (27) is similar to the derivation of (62) in the proof of Lemma 4.

We start with the producer utility corresponding to the set 𝒮\mathcal{S}^{\prime}. We have

i=1Kγi𝒰p(vi,xi)=i=1Kγivij=iKxi(vj)=i=1Kγiviy(i,i)=i=1Kviz(i,i).\displaystyle\sum_{i=1}^{K}\gamma_{i}~{}\mathcal{U}_{p}(v_{i},x_{i})=\sum_{i=1}^{K}\gamma_{i}v_{i}\sum_{j=i}^{K}x_{i}(v_{j})=\sum_{i=1}^{K}\gamma_{i}v_{i}y(i,i)=\sum_{i=1}^{K}v_{i}z(i,i). (70)

Next, regarding the consumer’s side, notice that we have

i=1Kγi𝒰c(vi,xi)\displaystyle\sum_{i=1}^{K}\gamma_{i}~{}\mathcal{U}_{c}(v_{i},x_{i}) =i=1Kγij=iK(vjvi)(y(i,j)y(i,j+1)),\displaystyle=\sum_{i=1}^{K}\gamma_{i}~{}\sum_{j=i}^{K}(v_{j}-v_{i})(y(i,j)-y(i,j+1)), (71)

with the convenience that y(i,K+1):=0y(i,K+1):=0. Notice that we could start the index jj from i+1i+1 and adjust the range of index ii to end at K1K-1. By making these changes, we obtain

i=1K1γij=i+1K(vjvi)(y(i,j)y(i,j+1)).\sum_{i=1}^{K-1}\gamma_{i}~{}\sum_{j=i+1}^{K}(v_{j}-v_{i})(y(i,j)-y(i,j+1)). (72)

Notice that we can rewrite this as

i=1K1j=i+1Kγiy(i,j)(vjvi(vj1vi))\sum_{i=1}^{K-1}\sum_{j=i+1}^{K}~{}\gamma_{i}~{}y(i,j)(v_{j}-v_{i}-(v_{j-1}-v_{i})) (73)

which is equal to

i=1K1j=i+1Kz(i,j)(vjvj1).\sum_{i=1}^{K-1}\sum_{j=i+1}^{K}~{}z(i,j)(v_{j}-v_{j-1}). (74)

This completes the proof.

A.12 Proof of Proposition 5

Let yy^{*} denote the complementary cumulative distribution function of market xx^{*}, i.e.,

y(i):=k=iKx(vk).y^{*}(i):=\sum_{k=i}^{K}x^{*}(v_{k}). (75)

First, we establish that the lowest point of the set 𝒮\mathcal{S}^{\prime} always lies on or beneath the line TRTR. To see this, note that there exists an \ell^{*} for which x𝒳βx^{*}\in\mathcal{X}_{\ell^{*}}^{\beta}. Consequently, there exists one point in 𝒮\mathcal{S}^{\prime} corresponding to uniformly pricing the market at vv_{\ell^{*}}. Since the line TRTR represents the optimal uniform pricing, this point either resides on TRTR (if \ell^{*} is the optimal uniform price for the non-private case) or falls beneath it in other cases. This substantiates the first claim.

Next, we show that if the condition (31) does not hold, the set 𝒮\mathcal{S}^{\prime} will not cross the line TRTR. To prove this, we use the polytope representation in Theorem 2. Summing up both sides of (25b) over ii (for a fixed jj), we obtain

i=1Kz(i,i)vii=1Kz(i,j)vj+βK(1β)i=1Kz(i,1)((K+1j)vj(K+1i)vi).\sum_{i=1}^{K}z(i,i)v_{i}\geq\sum_{i=1}^{K}z(i,j)v_{j}+\frac{\beta}{K(1-\beta)}\sum_{i=1}^{K}z(i,1)\Big{(}(K+1-j)v_{j}-(K+1-i)v_{i}\Big{)}. (76)

Notice that, by (25c), i=1Kz(i,j)=y(j)\sum_{i=1}^{K}z(i,j)=y^{*}(j). In particular, i=1Kz(i,1)=y(1)=1\sum_{i=1}^{K}z(i,1)=y^{*}(1)=1. Hence, we can rewrite (76) as

i=1Kz(i,i)viy(j)vj+βK(1β)((K+1j)vji=1Kz(i,1)(K+1i)vi).\sum_{i=1}^{K}z(i,i)v_{i}\geq y^{*}(j)v_{j}+\frac{\beta}{K(1-\beta)}\Big{(}(K+1-j)v_{j}-\sum_{i=1}^{K}z(i,1)(K+1-i)v_{i}\Big{)}. (77)

Now, notice that the left hand side is the producer utility in 𝒮\mathcal{S^{\prime}}. Choose jj such that vjv_{j} be the optimal uniform pricing (if there are multiple of them, choose the one with the highest (K+1j)vj(K+1-j)v_{j} value). Hence, y(j)vjy^{*}(j)v_{j} represents the producer utility corresponding to the line TRTR. Now, recall that i=1Kz(i,1)=y(1)=1\sum_{i=1}^{K}z(i,1)=y^{*}(1)=1, and therefore, the term

i=1Kz(i,1)(K+1i)vi\sum_{i=1}^{K}z(i,1)(K+1-i)v_{i} (78)

is a weighted average of ((K+1i)vi)i=1K((K+1-i)v_{i})_{i=1}^{K}. Since the condition (31) does not hold, each term (K+1i)vi(K+1-i)v_{i}, and hence the weighted average (78), are weakly smaller than (K+1j)vj(K+1-j)v_{j}. Plugging this into (77) shows that when the condition (31) does not hold, the term i=1Kz(i,i)vi\sum_{i=1}^{K}z(i,i)v_{i} is always weakly larger than the producer utility at the optimal uniform price, and hence the set 𝒮\mathcal{S}^{\prime} does not go below the line TRTR.

Now, suppose the condition (31) holds. Let β\mathcal{I}_{\beta} be the set of optimal uniform prices when we observe the market xx^{*} but use the pricing strategy corresponding to the privacy parameter β\beta, i.e.,

β:=argmaxk[y(k)vk+βK(1β)(K+1k)k].\mathcal{I}_{\beta}:=\operatorname*{arg\,max}_{k}\Big{[}y^{*}(k)v_{k}+\frac{\beta}{K(1-\beta)}(K+1-k)k\Big{]}.

In particular, 0\mathcal{I}_{0} represents the set of optimal uniform prices in the non-private case and the set 0\mathcal{I}_{0} includes those ii’s that maximize (K+1i)i(K+1-i)i.

Now, we only need to consider the case β0\mathcal{I}_{\beta}\subseteq\mathcal{I}_{0}, as otherwise the segmentation corresponding to that value that is in β\mathcal{I}_{\beta} but not in 0\mathcal{I}_{0} would be below the line TRTR. Also, the condition (31) implies that the sets 0\mathcal{I}_{0} and 1\mathcal{I}_{1} are disjoint, and hence, the sets β\mathcal{I}_{\beta} and 1\mathcal{I}_{1} are also disjoint.

We define the market x~\tilde{x} in the following way:

k=iKx~(vi)={Zvi if i1,Zviϵ if i1,\sum_{k=i}^{K}\tilde{x}(v_{i})=\begin{cases}\frac{Z}{v_{i}}&\text{ if }i\notin\mathcal{I}_{1},\\ \frac{Z}{v_{i}}-\epsilon&\text{ if }i\in\mathcal{I}_{1},\end{cases} (79)

where ZZ is chosen such that x~\tilde{x} becomes a distribution. It is clear that such x~\tilde{x} exists for sufficiently small ϵ\epsilon given that (vi)i=1K(v_{i})_{i=1}^{K} is a strictly ascending sequence.

Notice that, given the way that x~\tilde{x} is defined, 𝒰p(vi,x~)\mathcal{U}_{p}(v_{i},\tilde{x}) is equal to its maximum if and only if i1i\notin\mathcal{I}_{1}. In other words, a price viv_{i} maximizes the producer utility for this market if and only if i1i\notin\mathcal{I}_{1}. However, in the definition of 𝒮\mathcal{S}^{\prime}, we use the pricing strategy corresponding to the private case, i.e., we look at the sets 𝒳kβ\mathcal{X}_{k}^{\beta}’s. Now, we claim that, for sufficiently small ϵ\epsilon, x~𝒳iβ\tilde{x}\in\mathcal{X}_{i}^{\beta} for some i1i\in\mathcal{I}_{1}. In other words, for sufficiently small ϵ\epsilon, we end up choosing a price viv_{i} with i1i\in\mathcal{I}_{1} for the market x~\tilde{x}, although it would lead to a lower utility compared to any price vjv_{j} with j1j\notin\mathcal{I}_{1}.

To show this claim holds, it suffices to establish the following claim and then use the continuity of utilities in ϵ\epsilon.

Claim 1.

For ϵ=0\epsilon=0, any price viv_{i} with i1i\in\mathcal{I}_{1} is strictly preferred over any price vjv_{j} with j1j\notin\mathcal{I}_{1} if we use the pricing strategy corresponding to parameter β>0\beta>0.

Proof.

To prove this, we need to show that

(1β)k=iKx~(vk)vi+βK+1iKvi>(1β)k=jKx~(vk)vj+βK+1jKvj(1-\beta)\sum_{k=i}^{K}\tilde{x}(v_{k})v_{i}+\beta\frac{K+1-i}{K}v_{i}>(1-\beta)\sum_{k=j}^{K}\tilde{x}(v_{k})v_{j}+\beta\frac{K+1-j}{K}v_{j}

Note that the first term on the left and right hand side are equal when ϵ=0\epsilon=0. Now the second term on the left hand side is bigger than the second term on the right hand side given that i1i\in\mathcal{I}_{1} and j1j\notin\mathcal{I}_{1}. ∎

Now, consider a segmentation with two segments: the first segment’s market is x~\tilde{x} and it consists δ\delta fraction of consumers and the second segment’s market is xδx~1δ\frac{x^{*}-\delta\tilde{x}}{1-\delta} which consists the remaining 1δ1-\delta fraction of consumers. For sufficiently small δ\delta, the optimal price for the second segment would be sill in β\mathcal{I}_{\beta} given that the market is perturbed minimally. Let us denote that optimal price as vjv_{j^{*}} with jβj^{*}\in\mathcal{I}_{\beta}.

This implies j1j^{*}\notin\mathcal{I}_{1}. Therefore, as we elaborated above, this would mean that the price picked for the first segment would lead to a lower utility compared to the case that we would have picked vjv_{j^{*}}. Given that we pick vjv_{j^{*}} for the second segment, the overall producer utility corresponding to this segmentation would be strictly lower than 𝒰p(vj,x)\mathcal{U}_{p}(v_{j^{*}},x^{*}).

Recall that, as we discussed above, we can assume β0\mathcal{I}_{\beta}\subseteq\mathcal{I}_{0} and hence j0j^{*}\in\mathcal{I}_{0} as well. Thus, 𝒰p(vj,x)\mathcal{U}_{p}(v_{j^{*}},x^{*}) is in fact the utility corresponding to the line TRTR. Hence, the segmentation above corresponds to a point in the set 𝒮\mathcal{S}^{\prime} which falls below the line TRTR. This completes the proof.

A.13 Proof of Fact 2

Notice that β>β¯K\beta>\bar{\beta}_{K} implies that no segment’s price is going to be set equal to vKv_{K}. Hence, the consumers with value vkv_{k} will face a a price upper bounded by vK1v_{K-1}. As a result, the consumer utility is lower bounded by

x(vK)(vKvK1).x^{*}(v_{K})(v_{K}-v_{K-1}).

A.14 Proof of Proposition 6

For any β\beta, let ρ(β)\rho(\beta) denote the maximum consumer utility in the set 𝒮\mathcal{S}^{\prime}. Using Theorem 2, we have

ρ(β)\displaystyle\rho(\beta) =max𝒛i=1K1j=i+1K(vjvj1)z(i,j)\displaystyle=\max_{\bm{z}}\sum_{i=1}^{K-1}\sum_{j=i+1}^{K}(v_{j}-v_{j-1})z(i,j) (80a)
s.t. z(i,1)z(i,K)0 for all i[K],\displaystyle z(i,1)\geq\cdots\geq z(i,K)\geq 0\text{ for all }i\in[K], (80b)
z(i,i)viz(i,j)vjβK(1β)z(i,1)((K+1j)vj(K+1i)vi) for all i,j[K],\displaystyle z(i,i)v_{i}-z(i,j)v_{j}\geq\frac{\beta}{K(1-\beta)}z(i,1)\Big{(}(K+1-j)v_{j}-(K+1-i)v_{i}\Big{)}\text{ for all }i,j\in[K], (80c)
i=1Kz(i,j)=k=jKx(vk) for all j[K].\displaystyle\sum_{i=1}^{K}z(i,j)=\sum_{k=j}^{K}x^{*}(v_{k})\text{ for all }j\in[K]. (80d)

Notice that, as long as β<minkβ¯k\beta<\min_{k}\bar{\beta}_{k}, all the constraints (80c) are strict when the vector 𝒛\bm{z} is set equal to the segmentation in which all consumers with the same value go into one segment. Therefore, we can use Corollary 5 from Milgrom and Segal [2002] to deduce that the function ρ()\rho(\cdot) is absolutely continuous (and hence differentiable almost everywhere). Therefore, we have

ρ(β)=ρ(0)+0βρ(ζ)𝑑ζ,\rho(\beta)=\rho(0)+\int_{0}^{\beta}\rho^{\prime}(\zeta)d\zeta, (81)

where ρ()\rho^{\prime}(\cdot) denotes the derivative of ρ()\rho(\cdot) (when it exists). Next, we make the following claim:

Claim 2.

Suppose ρ()\rho(\cdot) is differentiable at some β\beta. Then, condition (33) implies ρ(β)0\rho^{\prime}(\beta)\leq 0 and condition (34) implies ρ(β)0\rho^{\prime}(\beta)\geq 0.

Proof.

We use the Envelope theorem to characterize the derivative of ρ()\rho(\cdot). Let λi,j\lambda_{i,j} denote the Lagrangian multiplier corresponding to the condition (80c). Then, we have

ρ(β)=ddββ1β[i,jλi,j((K+1j)vj(K+1i)vi)zβ(i,1)],\rho^{\prime}(\beta)=-\frac{d}{d\beta}\frac{\beta}{1-\beta}\Big{[}\sum_{i,j}\lambda_{i,j}\Big{(}(K+1-j)v_{j}-(K+1-i)v_{i}\Big{)}z^{*}_{\beta}(i,1)\Big{]}, (82)

where 𝒛β\bm{z}^{*}_{\beta} is the solution of (80). Next, using the Karush–Kuhn–Tucker conditions Bertsekas [1997], we know that (i) λi,j0\lambda_{i,j}\geq 0, and (ii) λi,j>0\lambda_{i,j}>0 if and only if the condition (80c) is active, i.e.,

zβ(i,i)vizβ(i,j)vj=βK(1β)zβ(i,1)((K+1j)vj(K+1i)vi).z^{*}_{\beta}(i,i)v_{i}-z^{*}_{\beta}(i,j)v_{j}=\frac{\beta}{K(1-\beta)}z^{*}_{\beta}(i,1)\Big{(}(K+1-j)v_{j}-(K+1-i)v_{i}\Big{)}. (83)

Next, recall from the proof of Theorem 2 that zβ(i,)z^{*}_{\beta}(i,\cdot) corresponds to a market that is priced viv_{i}. Now, if (83) holds for some ii and jj, it means that producer has been indifferent between viv_{i} or vjv_{j} as the market price, but viv_{i} has been chosen. Since 𝒛β\bm{z}^{*}_{\beta} is the solution to the consumer maximization problem, this would mean that vi<vjv_{i}<v_{j}, or equivalently, i<ji<j. Therefore, λi,j\lambda_{i,j} is either positive or zero, and we have λi,j>0\lambda_{i,j}>0 only if j>ij>i.

Notice that condition (33) implies (K+1j)vj(K+1i)vi(K+1-j)v_{j}\geq(K+1-i)v_{i} when jij\geq i. Using this result, along with (82) and the fact that β/(1β)\beta/(1-\beta) is an increasing function of β\beta, we obtain ρ(β)0\rho^{\prime}(\beta)\leq 0. Similarly, we can see that, under condition (34), we have ρ(β)0\rho^{\prime}(\beta)\geq 0. This completes the proof of the claim. ∎

Let us turn our focus to the maximum consumer utility. As Proposition 4 establishes, for a given β\beta, the maximum consumer utility is given by

β(k=1K(𝒳kβ)𝒰c(vk,x))+(1β)ρ(β).\beta\Big{(}\sum_{k=1}^{K}\mathbb{P}(\mathcal{X}_{k}^{\beta})~{}\mathcal{U}_{c}(v_{k},x^{*})\Big{)}+(1-\beta)\rho(\beta). (84)

In particular, for the non-private case, this is equal to ρ(0)\rho(0). Claim 2 along with (81) implies that, under condition (33) we have ρ(β)ρ(0)\rho(\beta)\leq\rho(0) and under condition (34) we have ρ(β)ρ(0)\rho(\beta)\geq\rho(0). Therefore, to establish Proposition 6, it suffices to compare k=1K(𝒳kβ)𝒰c(vk,x)\sum_{k=1}^{K}\mathbb{P}(\mathcal{X}_{k}^{\beta})~{}\mathcal{U}_{c}(v_{k},x^{*}) with ρ(0)\rho(0). The following claim does this, and therefore, completes the proof.

Claim 3.
  1. (i)

    Assume v1v_{1} is the optimal uniform price in the non-private case. Then, we have

    ρ(0)k=1K(𝒳kβ)𝒰c(vk,x).\rho(0)\geq\sum_{k=1}^{K}\mathbb{P}(\mathcal{X}_{k}^{\beta})~{}\mathcal{U}_{c}(v_{k},x^{*}).
  2. (ii)

    Assume vKv_{K} is the optimal uniform price in the non-private case. Then, there exists α~\tilde{\alpha} and β~>0\tilde{\beta}>0 such that, if x(vK)α~x^{*}(v_{K})\geq\tilde{\alpha} and ββ~\beta\leq\tilde{\beta}, then we have

    ρ(0)k=1K(𝒳kβ)𝒰c(vk,x).\rho(0)\leq\sum_{k=1}^{K}\mathbb{P}(\mathcal{X}_{k}^{\beta})~{}\mathcal{U}_{c}(v_{k},x^{*}).
Proof.

Let us denote the maximum producer utility (corresponding to the first-degree price discrimination) by MM. We know that the maximum consumer utility in the non-private case lies on the line that sets the sum of consumer and producer utilities equal to MM. Therefore, when v1v_{1} is the optimal uniform price, we have ρ(0)=Mv1\rho(0)=M-v_{1}. Also, notice that for any value vkv_{k}, we have

𝒰c(vk,x)=i=kKx(vi)(vivk)i=kKx(vi)(viv1)i=1Kx(vi)(viv1)=Mv1.\mathcal{U}_{c}(v_{k},x^{*})=\sum_{i=k}^{K}x^{*}(v_{i})(v_{i}-v_{k})\leq\sum_{i=k}^{K}x^{*}(v_{i})(v_{i}-v_{1})\leq\sum_{i=1}^{K}x^{*}(v_{i})(v_{i}-v_{1})=M-v_{1}. (85)

Thus, and given that k=1K(𝒳kβ)=1\sum_{k=1}^{K}\mathbb{P}(\mathcal{X}_{k}^{\beta})=1, we deduce that

k=1K(𝒳kβ)𝒰c(vk,x)Mv1\sum_{k=1}^{K}\mathbb{P}(\mathcal{X}_{k}^{\beta})~{}\mathcal{U}_{c}(v_{k},x^{*})\leq M-v_{1}

which completes the proof of part (i). Next we prove part (ii). Given that, in this part, vKv_{K} is the optimal uniform price in the non-private case, we have

ρ(0)=Mx(vK)vK(1x(vK))vK1.\rho(0)=M-x^{*}(v_{K})v_{K}\leq(1-x^{*}(v_{K}))v_{K-1}. (86)

Also, note that, for any kK1k\leq K-1 we have

𝒰c(vk,x)x(vK)(vKvk),\mathcal{U}_{c}(v_{k},x^{*})\geq x^{*}(v_{K})(v_{K}-v_{k}), (87)

which implies

k=1K(𝒳kβ)𝒰c(vk,x)x(vK)[k=1K1(𝒳kβ)(vKvk)].\displaystyle\sum_{k=1}^{K}\mathbb{P}(\mathcal{X}_{k}^{\beta})~{}\mathcal{U}_{c}(v_{k},x^{*})\geq x^{*}(v_{K})\Big{[}\sum_{k=1}^{K-1}\mathbb{P}(\mathcal{X}_{k}^{\beta})(v_{K}-v_{k})\Big{]}. (88)

The term k=1K1(𝒳kβ)(vKvk)\sum_{k=1}^{K-1}\mathbb{P}(\mathcal{X}_{k}^{\beta})(v_{K}-v_{k}) is positive for β=0\beta=0, and since it is continuous in β\beta, we can choose β~>0\tilde{\beta}>0 such that

infββ~[k=1K1(𝒳kβ)(vKvk)]ϵ,\inf_{\beta\leq\tilde{\beta}}\Big{[}\sum_{k=1}^{K-1}\mathbb{P}(\mathcal{X}_{k}^{\beta})(v_{K}-v_{k})\Big{]}\geq\epsilon, (89)

for some ϵ>0\epsilon>0. Therefore, for any ββ~\beta\leq\tilde{\beta}, we have

k=1K(𝒳kβ)𝒰c(vk,x)ϵx(vK).\displaystyle\sum_{k=1}^{K}\mathbb{P}(\mathcal{X}_{k}^{\beta})~{}\mathcal{U}_{c}(v_{k},x^{*})\geq\epsilon~{}x^{*}(v_{K}). (90)

Now, notice that the right-hand side of (86) goes to zero as x(vK)x^{*}(v_{K}) increases to one. On the other hand, the right-hand side of (90) is an increasing and positive function of x(vK)x^{*}(v_{K}). Therefore, there exists α~\tilde{\alpha} such that, if x(vK)α~x^{*}(v_{K})\geq\tilde{\alpha}, we have

k=1K(𝒳kβ)𝒰c(vk,x)ρ(0),\sum_{k=1}^{K}\mathbb{P}(\mathcal{X}_{k}^{\beta})~{}\mathcal{U}_{c}(v_{k},x^{*})\geq\rho(0), (91)

for any ββ~\beta\leq\tilde{\beta}. This completes the proof. ∎