subsecref \newrefsubsecname = \RSsectxt \RS@ifundefinedthmref \newrefthmname = theorem \RS@ifundefinedlemref \newreflemname = lemma \newrefsecname=Section \newrefsubsecname=Subsection \newrefaxmname=Axiom \newreflemname=Lemma \newrefdefname=Definition \newrefpropname=Proposition \newrefthmname=Theorem \newrefremarkname=Remark \newrefcorname=Corollary \newreffigname=Figure \newrefeqname=Equation \newrefappxname=Appendix \newrefoappxname=Online Appendix \newreffnname=Footnote \newreffactname=Fact
Ordered Reference Dependent Choice
Most recent public version: http://s.xzlim.com/ordc)
Abstract
This paper studies how violations of structural assumptions like expected
utility and exponential discounting can be connected to basic rationality
violations, even though these assumptions are typically regarded as
independent building blocks in decision theory. A reference-dependent
generalization of behavioral postulates captures preference shifts
in various choice domains. When reference points are fixed, canonical
models hold; otherwise, reference-dependent preference parameters
(e.g., CARA coefficients, discount factors) give rise to “non-standard”
behavior. The framework allows us to study risk, time, and social
preferences collectively, where seemingly independent anomalies are
interconnected through the lens of reference-dependent choice.
Keywords: Basic rationality, structural postulates, reference
dependence, context effects, risk preference, time preference, social
preference
JEL: D01, D11
1 Introduction
In various branches of economics, multiple assumptions come together to form the basis of an economic model, and interesting findings often emerge from the unforeseen interplay among these assumptions. The empirical failure of these models, however, need not lie in the substance of each individual assumption but is rooted in their indiscriminate applications.
In individual decision-making, the standard model of choice faces two distinct strands of empirical challenges. First, structural assumptions like the expected utility form and exponential discounting are violated in simple choice experiments, such as the Allais paradox and present bias behavior. Second, and separately, studies show that choices are often affected by reference points, resulting in behavior that violates basic rationality assumptions like the weak axiom of revealed preferences (WARP). With few exceptions, these two classes of departures have been studied separately, and independently for each domain of choice, leading to the development of models that attempt to explain one phenomenon in isolation of the others.111Risk domain: rank-dependent utility (Quiggin, 1982), quadratic utility (Machina, 1982), disappointment aversion (Gul, 1991), betweenness preferences (Chew, 1983; Fishburn, 1983; Dekel, 1986), and cautious expected utility (Cerreia-Vioglio, Dillenberger, and Ortoleva, 2015) maintain basic rationality. Time domain: various models of hyperbolic discounting (Loewenstein and Prelec, 1992; Frederick, Loewenstein, and O’donoghue, 2002), quasi-hyperbolic discounting (Phelps and Pollak, 1968; Laibson, 1997), and related generalizations (Chakraborty, 2021; Chambers, Echenique, and Miller, 2023) maintain basic rationality. Others: Kőszegi and Rabin (2007) and Ortoleva (2010) use reference dependency to explain structural violations. Hara, Ok, and Riella (2019) maintain structural assumption but relaxes basic rationality. In richer settings: Bordalo, Gennaioli, and Shleifer (2012); Lanzani (2022) relax (state-independent) Independence and (state-independent) Transitivity to study correlated lotteries and Noor and Takeoka (2015) relax Independence (for ex-ante preference) and WARP (for ex-post choices) to study two-stage self-control problems.
This paper introduces a unified framework that studies how the two types of violations may be in part related to one another, stemming from a common source. The central approach is motivated by a simple observation: Suppose preference parameters (e.g., utility functions capturing risk attitude and discount factors representing degree of patience) are influenced by reference alternatives, then even decision makers who typically adhere to normative postulates (e.g., maximize exponentially discounted expected utility) would every so often violate rationality assumptions and structural assumptions—when reference alternatives change. On the other hand, choices made under the same reference would fully align with both postulates. Thus, while the two types of assumptions are conventionally treated as separate building blocks of a choice model—introduced as independently motivated axioms—their deviations are intrinsically connected by systematic shifts in preferences.
To illustrate, a myriad of documented anomalies, including the Allais paradox, suggests that decision makers exhibit increased risk aversion when presented with safer options (Allais, 1990; Wakker and Deneffe, 1996; Herne, 1999; Bleichrodt and Schmidt, 2002; Andreoni and Sprenger, 2011). While this behavior contradicts the expected utility theory, it aligns with the expected utility framework when coupled with context-dependent utility functions that vary in concavity. This observation motivates the model in the risk domain.
(1.1) |
Standard expected utility applies when the safest alternative, which acts as the reference , is fixed; but when it changes, a safer reference leads to a more concave utility function , reflecting a systematic increase in risk aversion.
This observation is not exclusive to expected utility. In time preferences, present bias individuals who are less patient in short-term decisions violate exponential discounting (Laibson, 1997; Frederick, Loewenstein, and O’donoghue, 2002; Benhabib, Bisin, and Schotter, 2010; Chakraborty, 2021). However, their behavior could be consistent with the exponential discounting form when paired with context-dependent discount factors that capture changing time preferences.
(1.2) |
When a problem offers sooner payments, it alters the reference point , prompting the decision maker to use a lower discount factor that reflects increased impatience. Again, changes in preferences are systematic along a certain order, and behavior is otherwise standard.
For social preferences, it is well-documented in economics and psychology that the very same individuals display different degree of altruism in different choice settings, for example when a balanced split of reward is available than when it is not (Ainslie, 1992; Rabin, 1993; Nelson, 2002; Fehr and Schmidt, 2006; Sutter, 2007). These context-dependent preferences could be consistent with
(1.3) |
where increased altruism is captured by a utility from sharing, , that systematically increases when more-equitable splits become the reference.
This paper aims to examine these behaviors as one collective, addressing three key questions: (1) Under what conditions do non-standard behaviors across various choice domains permit such representations? (2) What do they have in common? and (3) How does their systematic departure from canonical models inform the relationship between rationality assumptions and structural assumptions?
It turns out that, although these behavioral anomalies are typically investigated in largely separate and domain-specific studies, the behavioral content of the proposed models is underpinned by a “meta” axiomatic foundation referred to as Reference Dependence (RD). RD is the key innovation of this paper, introducing a reference-dependent approach that can generalize a large class of behavioral postulates or axioms, be it “rational” or “structural”. When applied to the risk, time, and social domains, it yields three complete characterizations that resonate with one another.
To illustrate the idea, 2 applies RD only to rationality assumption by requiring that in every (finite) choice set, at least one alternative would preserve WARP among choice behavior from its subsets. Intuitively, if the failure of rationality is caused by reference dependence, then rationality should continue to hold at least for choice sets that share the same reference. It turns out that this postulate characterizes a two-step choice process: A reference order is maximized to identify the reference alternative of a choice problem . The reference alternative then determines a utility function that the decision maker maximizes. Intuitively, the context of a choice problem is captured by the alternative that ranks highest in the reference order and the underlying context-dependent preference is subsequently determined.
(1.4) |
It has not gone unnoticed that Equations 1.1, 1.2, and 1.3 are special cases of (LABEL:general), sharing two essentially components: (i) a reference order and (ii) reference-dependent preference parameters. It is also apparent that basic rationality—the assumption that one persistent utility function is being maximized—can fail, which makes the proposed explanations not particularly appealing, at least until the recent accumulation of theoretical interest and empirical evidence against basic rationality itself.
It turns out that, barring technical challenges, the axiomatic characterization of each of these behaviors requires little more than adapting RD to their domain-specific normative postulates. For risk preference, Risk-RD preserves both WARP (rationality axiom) and von Neumann-Morgenstern’s Independence (structural axiom) when the safest alternative is maintained. For time preference, Time-RD preserves normative postulates WARP (rationality axiom) and Stationarity (structural axiom) when the earliest available payment is fixed. For social preference, Social-RD calls for consistency with WARP (rationality axiom) and Quasi-linearity (structural axiom) when the most-balanced options coincide. The underlying intuition is universal: Upholding the reference point ensures the validity of all normative postulates, so that violations of structural assumptions—whatever they are and whatever the domain—are linked to reference dependent preferences manifested in basic rationality violations.222The generality of this exercise is demonstrated in B. A second axiom, which does not involve reference points, captures systematic changes in preferences by requiring that choices cannot become more risk loving / more patient / more selfish when a subset of the original choice set is considered.
Notwithstanding its intuitiveness, this approach does not fully align with the conventional wisdom in decision theory (and economics in general) where assumptions are weakened one at a time. Relaxing both rationality postulates and structural postulates leads to an instinctive concern about admitting too wide a range of behavior. However, the joint generalization introduced by RD exhibits greater discipline than an independent generalization, and it contributes to three interrelated insights that span all three domains, forming the core of this study.
First, the models predict how structural anomalies traditionally detected in binary comparisons (such as the Allais paradox and present bias behavior) will manifest as WARP violations when larger choice sets are considered, providing testable predictions that could bridge the two largely separate empirical literature. To illustrate, suppose Option 1 is a later payment and it is chosen over a sooner payment Option 2, but the opposite decision emerges when both options are symmetrically advanced, an anomaly known as present bias.333This behavior violates Stationarity, the axiom responsible for exponential discounting, which requires a consistent preference between two options even when the decision is revisited at later point in time. Then, it is predicted that adding a particular unchosen alternative Option 3 to the original comparison could switch the choice to Option 2, causing a WARP violation; similar observations link the common ratio effect in risk preferences to WARP violations. They resonate with the motivation of the present framework in suggesting that deviations from standard models may arise from changing preferences rather than being a mere failure of structural assumptions. This plausible connection, however, is obscured in studies that assume basic rationality, thereby missing the opportunity to draw insights from behavior in non-binary choice sets that could offer a fundamentally different perspective on traditional anomalies.
Second, the models suggest how basic rationality and structural postulates can be inextricably linked, even though they are typically regarded as independent building blocks of individual decision-making. 1 shows that introducing just WARP or just Independence to the risk model immediately implies standard expected utility behavior, even though these postulates must be jointly imposed in a general setting. This means a decision maker who has any utility representation will also have an expected utility representation; similar results are obtained for time and social domains. The proposed models thus capture distinct non-standard behavior when contrasted with a substantial body of the literature that only generalizes structural assumptions. For example, even though many models can explain the Allais paradox and present bias behavior, choice behavior from prominent models like rank-dependent utility, quadratic utility, disappointment aversion, betweenness, cautious expected utility, hyperbolic discounting, and quasi-hyperbolic discounting overlap with mine only in the special case where behavior is fully standard.444See 1 for references. That is, for non-standard decision makers, our models provide mutually exclusive predictions.
Third, the innovation in this exercise is due crucially to an underexplored generalization of structural assumptions forbidden by traditional adherence to rationality assumptions. To see this, consider the risk domain and suppose lottery is preferred to lottery . The von Neumann-Morgenstern’s Independence condition requires their common mixtures to have the same preference order, meaning that is preferred to .555Lottery refers to the (compound) lottery generated by mixing lottery with probability and lottery with probability . Independence says the preference between and should be the same as the preference between and , since they differ only by a common term. Although a generalization of Independence loosens this requirement, WARP makes it impossible to discuss how is preferred to in some choice sets while the opposite holds in others. Relaxing WARP immediately allows for this kind of generalization and paves the way for a context-dependent implementation of structural postulates. This paper presents one of many possible demonstrations and, perhaps counter-intuitively, shows that weakening both kinds of postulates could bring us “closer” to canonical models. Similar limitations are present when a (complete) preference relation serves as the primitive, which might explain why this approach has not received much attention.
While the three observations are formulated within the scope of ordered reference, they might make a case for the comprehensive examination of conceptually different behavior that complement foundational groundwork already established by isolated investigations. Perhaps the examination of individual decision-making rightly began with a theoretical decomposition of a complex behavior into key components—encompassing conceptually distinct notions like “rationality axioms” and “structural axioms”—to focus our studies and interpretations. But the natural progression now, knowing that each of these components fails to some extent, entails a joint investigation of these theoretical constructs.
Related literature is next. 2 introduces the basic framework and RD. Sections 3-5, the main parts of this paper, take the unified framework to risk, time, and social domains; they introduce axioms, provide representation theorems, study implications, and discuss evidence. 6 concludes. Key proofs are in A. Technical results and omitted proofs are relegated to B.
1.1 Related literature
The engagement of reference alternatives relates to the extensive literature on reference-dependent preferences, originating from the seminal work of Kahneman and Tversky (1979) on loss aversion and initially explored under the assumption that reference points are directly observed.666And relatedly, Tversky and Kahneman (1991); Kahneman, Knetsch, and Thaler (1991). Subsequently, the scope of mechanisms involving exogenous reference points has broadened beyond the realm of gain-loss utility, e.g., general status quo bias (Masatlioglu and Ok, 2005, 2014), ambiguity aversion (Ortoleva, 2010), wishful thinking (Kovach, 2020), and categorical thinking (Ellis and Masatlioglu, 2022).777Other work related to status quo bias includes Rubinstein and Zhou (1999); Sagi (2006); Apesteguia and Ballester (2009); Dean, Kıbrıs, and Masatlioglu (2017).
A new way to study reference points was popularized by Kőszegi and Rabin (2006) where endogenous reference points capture seemingly reference-dependent behavior even though reference points are not directly observed. These studies encompass both objective reference (Kivetz, Netzer, and Srinivasan, 2004; Orhun, 2009; Bordalo, Gennaioli, and Shleifer, 2013; Tserenjigmid, 2019) and subjective reference (Kőszegi and Rabin, 2006; Ok, Ortoleva, and Riella, 2015; Freeman, 2017). The present paper falls into this category and explores a novel use of endogenous reference to proxy for domain-specific contexts and govern domain-specific preference shifts.
Although the understudied link between rationality assumptions and structural assumptions forms the core of this paper, the framework can be applied to the generic choice domain where only rationality assumptions are considered. In this case, the model and its behavioral implications coincide with Rubinstein and Salant (2006)’s Triggered Rationality.888Ravid and Steverson (2021) studies the same behavior under a model of bad temptation. That same model is also studied in Kıbrıs, Masatlioglu, and Suleymanov (2023) and Giarlotta, Petralia, and Watson (2023) using a different axiom that essentially says “if dropping in the presence of causes a WARP violation, then dropping in the presence of cannot”. Their axiom is an appealing alternative when applied only to rationality violations, as it cannot be extended to structural assumptions like Independence and Stationarity.999Let be common mixtures of . Using notation to denote “ is chosen from the choice set ”, the behavior , , , , , , , , , , satisfies Kıbrıs, Masatlioglu, and Suleymanov (2023)’s Single Reversal even after modifying it to consider Independence violation as a reversal, but there is no reference alternative in because and violate WARP whereas and violate Independence. 3.2 rules out this behavior.
More broadly, theories that systematically apply to different domains of choice include, among others, loss aversion (Kahneman and Tversky, 1979; Kőszegi and Rabin, 2006), attraction effect (Huber, Payne, and Puto, 1982), compromise effect (Simonson, 1989), salience (Bordalo, Gennaioli, and Shleifer, 2012, 2013), and focusing (Kőszegi and Szeidl, 2013). These models consider evaluations of multi-attributes alternatives that are affected by the attributes of available alternatives, some of which later generalized as categorical thinking (Ellis and Masatlioglu, 2022). They provide valuable insights that help us understand how psychological and attention factors can influence economic decisions by affecting our perception of an alternative.
To this end, Ellis and Masatlioglu (2022)’s study may be the closest to mine, since they also consider reference points and explore applications in various choice settings. Their study focuses on the endogenous formation of categories due to exogeneously given reference points, and when two alternatives are assigned to different categories, they are evaluated differently and potentially result in a preference reversal. The key mechanism that connects reference and preference is thus categorization. In contrast, the present framework considers endogenous reference points, uses the functional forms of standard models to evaluate alternatives, and captures deviations using changes in preference parameters. Kovach (2020)’s wishful thinking lies in between the two approaches; a decision maker’s subjective belief depends on exogeneously given status quo (the reference point), but she is otherwise standard in maximizing subjective expected utility.
In terms of characterization, Reference Dependence (RD) offers new tools. First, it is known that the equivalence between canonical axioms and canonical models breaks down when data is limited or incomplete; this technical issue emerges as choice problems are partitioned into reference-dependent subsets.101010See Samuelson (1948) and Aumann (1962). For example, if does not contain all doubletons and tripletons, then a choice correspondence on that satisfies WARP (and Continuity) does not necessarily admit a utility representation. This challenge extends to richer domains; for example in the risk domain, if the underlying set of lotteries is not a convex subset of lotteries, then a choice correspondence that satisfies WARP and Independence does not necessarily admit an expected utility representation (even if it admits a utility representation). Instead of strengthening axioms (Houthakker, 1950; Echenique, Imai, and Saito, 2020; de Clippel and Rozen, 2021) or embracing more general models (Dubra, Maccheroni, and Ok, 2004; Manzini and Mariotti, 2008; Evren, 2014; Hara, Ok, and Riella, 2019), RD exploits reference formation to impart adequate structure to each subset of behavior so that a standard representation emerges. The method bears qualitative similarity to Ke and Chen (2022)’s weak local independence, which characterizes local expected utility using local compliance of canonical Independence. Second, systematic deviations from structural assumptions are imposed by relating small and large choice sets, achieving effects similar in spirit to Dillenberger (2010); Cerreia-Vioglio, Dillenberger, and Ortoleva (2015)’s negative certainty independence and Chakraborty (2021)’s weak present bias in more standard settings. These unexpected connections invite curiosity into the potential role of reference dependence in studies that do not explicitly consider them.
Finally, the paper aligns with a broader agenda regarding the comprehensive examination of behaviors conventionally studied in isolation, providing breadth to the already established depth. This agenda includes, among others, empirical studies of possible interrelations in behavioral traits (Falk, Becker, Dohmen, Enke, Huffman, and Sunde, 2018; Chapman, Dean, Ortoleva, Snowberg, and Camerer, 2023; Stango and Zinman, 2023),111111Less representative samples are used in Burks, Carpenter, Goette, and Rustichini (2009) (truck drivers) and Dean and Ortoleva (2019) (university students). methodological development that separates preference inconsistency and parametric misspecification (Halevy, 2015; Polisson, Quah, and Renou, 2020; Echenique, Imai, and Saito, 2020, 2023; de Clippel and Rozen, 2023), experiments that assess a broad spectrum of anomalies as potential mistakes (Nielsen and Rehbeck, 2022), theoretical investigation that links non-standard risk and time preferences (Chakraborty, Halevy, and Saito, 2020), and revealed preference analyses that highlight basic rationality postulates in rich/different domains (Dembo, Kariv, Polisson, and Quah, 2021; Halevy, Walker-Jones, and Zrill, 2023; Chen, Liu, Shan, Zhong, and Zhou, 2023).
2 Basic Framework
Let be a separable metric space, endowed with the standard Euclidean metric , that represents the set of all alternatives. Let be the set of all finite and nonempty subsets of , also called choice sets. The primitive of this paper is a choice correspondence where for all . I assume throughout the paper that is continuous:
Axiom (Continuity).
has a closed graph.121212That is, , , and for every imply , where refers to convergence in the Hausdorff distance, defined by .
The risk, time, and social preferences studied in this paper share a common starting point: For any given choice set, the decision maker is seemingly standard by maximizing a single utility function. But globally, behavior is non-standard because this function depends on possibly different reference alternatives across choice sets.131313This naturally bounds non-standard behavior: When is finite, there are at most distinct utility functions, but there are around choice sets, and this difference increases exponentially in .
Definition 1.
A choice correspondence admits an Ordered-Reference Dependent Utility (ORDU) representation if there exist a linear order and a set of utility functions such that
and for all , where has a closed graph.141414A linear order is a complete, reflexive, transitive, and antisymmetric binary relation on .
Existing theories that incorporate a reference order can be traced back to Rubinstein and Salant (2006)’s Triggered Rationality, which coincide with ORDU. Restricted to the generic choice domain, Kıbrıs, Masatlioglu, and Suleymanov (2023); Giarlotta, Petralia, and Watson (2023); Kibris, Masatlioglu, and Suleymanov (2024) expand on this trajectory by exploring different axiomatizations, stochastic choice, and connections to psychological constraints / limited consideration. The latter suggests how different kinds of rationality violations may be related, complementing the present framework that focuses on context-dependent preferences. They also capture interesting narratives in the generic choice domain. Kıbrıs, Masatlioglu, and Suleymanov (2023) suggest that the top results when consumers search for a product are conspicuous, serving as the reference and influencing their final decisions; Giarlotta, Petralia, and Watson (2023) propose that the frog’s legs dish in Luce and Raiffa’s Dinner is salient, becoming the reference and increasing a consumer’s confidence, thence preference, for steak; Kibris, Masatlioglu, and Suleymanov (2024) consider the case of marketing campaigns where a consumer is more likely to recall an advertised product and uses it as benchmark to make consumption decisions.
Despite its simplicity and intuitiveness, focusing on the generic choice domain is not without caveats. The formation of completely subjective reference points adds challenges to their identification. Compounding this issue is the lack of structure in each reference-dependent utility function and the absence of a systematic relationship between these utility functions.
Explored in Sections 3-5 (risk, time, and social preferences), a richer choice domain provides natural remedies to, and in fact benefit from, the flexibility of this model. First, it significantly expands the interpretation of a reference order, where it ranges from being partially subjective (ranking lotteries by riskiness, 3) to fully objective (Gini index, 5), so as to capture the relevant domain-specific context. In turn, the reference order serves as a natural anchor along which domain-specific preference shifts are manifested, such as increasing risk aversion or decreasing patience along the established order. The two components—reference order and reference effect—interact with each another, yielding a framework that captures a highly specific and tractable form of set-dependent preferences.
Moreover, the models in all three choice domains share a “meta” axiomatic framework that in its simplest form characterizes ORDU. To illustrate the basic idea, consider following definition that maintains the content of the weak axiom of revealed preferences (WARP) but allows for selective application.
Definition 2.
satisfies WARP over if for all , if and , then .
The classical rationality assumption on choice behavior entails imposing WARP over . The following postulate imposes WARP only locally, using a reference point as the anchor.
Axiom 2.1.
For every choice set , satisfies WARP over for some .
Theorem 1.
satisfies 2.1 and Continuity if and only if it admits an ORDU representation.
2.1 captures choice behavior that satisfies WARP in a reference-dependent manner and coincides with the reference point property in Rubinstein and Salant (2006). To understand this axiom, suppose choices from and its subset constitute a WARP violation. If this is caused by a change in reference point, specifically, that the reference alternative of was removed when we go to subset , then a natural limitation of WARP violations would arise: Had we not removed the reference alternative of , choices must satisfy WARP. To put it differently, suppose that by preserving some alternative in , choices from the subsets of would comply with WARP, then is a candidate reference alternative of . 2.1 demands that every choice set contains (at least) one candidate reference alternative, which makes is less demanding than the standard postulate that imposes WARP indiscriminately.151515Since standard WARP requires “ satisfies WARP over ”, which in turn implies “ satisfies WARP over ” for any , it is stronger than 2.1.
To illustrate further, consider the following choice correspondence on .
Note that this choice correspondence fails to satisfy WARP globally because is chosen from but is chosen from . To check whether it satisfies 2.1, we have to visit every choice set. Starting with , note that there is no WARP violations among subsets of that contain , i.e., is a reference alternative, so choice set passes the test. These subsets have been conveniently placed in the left column. Moreover, note that when we visit any of these subsets, continues to be a reference alternative, so they too pass the test. For the remaining choice sets, we begin with and note that is no WARP violations among subsets of that contain , so and these subsets pass the test; they are conveniently positioned in the middle column. The only choice set left is where WARP is trivial because the only non-singleton subset of is itself. The axiom is thus satisfied. It amounts to a structured partitioning of choice sets—the left, middle, and right columns—so that within each part there is no WARP violation.1616162.1 is falsifiable whenever when . Using notation to denote “ is chosen from the choice set ”, the choice correspondence have two instances of WARP violations, (i) between and and (ii) between and , so none of can be the reference alternative of . Relatedly, a cardinal measure of falsifiability is to count the minimum number of observations required for falsification. For standard WARP, that number is 2: for example, when WARP is violated between and . For 2.1, that number is 3: for example , , and , since the reference of is in and/or , but WARP is violated both between , and between , . Under this measure, reference dependence makes 2.1 harder to reject relative to WARP by one additional observation.
The highlight of this approach is not the rationality assumption WARP per se, but the way WARP as a behavioral postulate was generalized in an attempt to call for its compliance locally. More generally, it follows the template “for every choice set , the choice correspondence satisfies over for some ” where can be a behavioral postulate of interest and can be an objective range in which reference points lie. This general approach is referred to as Reference Dependence (RD), which is formally introduced and analyzed in B and used in Sections 3-5. Related studies like Kıbrıs, Masatlioglu, and Suleymanov (2023) propose alternative characterization designed for WARP and cannot be directly extended in this way (see 9).
3 Risk Preference
Consider a decision maker whose willingness to take risk is dynamic and dependent on how much of it is avoidable. The safest alternative in a choice set provides a natural measure for this context. Sometimes, we have the option to fully avoid risk by keeping our assets in cash or by buying an insurance policy, and so the safest option is quite safe. But in other situations, such as a carefully designed lab experiment in which all options involve risk, taking some risk becomes unavoidable. The premise of my model is a decision maker whose risk aversion systematically differs between different set-dependent contexts—greater risk aversion when risk is increasingly avoidable.
3.1 Preliminaries and axioms
Consider a finite set of prizes , where , with the largest and smallest prizes denoted by and respectively.171717If , either the only choice set is a singleton set or choice sets contain only lotteries related by first order stochastic dominance, and 3.1 full pins down choices. Let be the set of all probability measures over , called lotteries. Everything else follows 2. Per convention, denotes a degenerate lottery and denotes the degenerate lottery that gives prize . For and , denotes the convex combination . For , denotes the probability that lottery gives prize . I assume throughout that satisfies first order stochastic dominance (FOSD):
Axiom 3.1.
If first order stochastically dominates (where ) and , then .
Next, Reference Dependence (2) is applied to both WARP and the von Neumann-Morgenstern’s Independence condition, beginning with a definition that applies Independence selectively.
Definition 3.
satisfies Independence over if for all and , if , , , and , then and .
In standard expected utility, satisfies WARP and Independence over . I depart from standard expected utility by allowing for preferences to depend on the safest available alternatives—the reference—but demand compliance with WARP and Independence whenever a collection of choice sets share a reference. When is that? If () is a mean-preserving spread of (), it is clearly not the safest. Additionally, a second order partially compensates for the incomplete nature of MPS by also deeming lotteries with increased probabilities of the most extreme prizes (but keeping the relative probability of intermediate prizes the same) to be riskier. Formally, is an extreme spread of () if for some and .181818The two risk orders are non-contradictory and typically non-nested. Extreme spread is intuitively related to Aumann and Serrano (2008)’s risk index, where lotteries are deemed safer in the “economics sense”—under standard expected utility, the extreme spreads of are lotteries in that are preferred to by a more-risk-loving decision maker if a more-risk-averse decision maker does so. Non-contradictory: extreme spreads of live in , which does not contain any mean preserving contraction of . Non-nested: extreme spreads need not preserve mean, mean preserving spreads need not maintain relative probability of intermediate prizes; in the special case where , mean preserving spreads are nested in extreme spreads.
Definition 4.
Let be the set of least risky lotteries in .
Axiom 3.2 (Risk Reference Dependence).
For every , satisfies WARP and Independence over for some .
The next and last axiom captures changing risk aversion when more options become available. It is standard to say that a preference relation is more-risk-averse than another preference relation if, for any degenerate lottery and lottery , implies . This definition is often studied alongside expected utility, but it is, in fact, independent of it. 3.3 extends this definition to lotteries that differ by a degenerate component: where can be obtained from by reallocating probabilities from one prize to one or more prizes. Then, it posits that a decision maker cannot be more-risk-loving when a choice set expands. The underlying intuition is that additional alternatives should only be able to increase the extent to which risk is avoidable, and if the avoidability of risk (weakly) increases risk aversion, then the additions must not result in increased risk tolerance. We say the pair of lotteries is a common mixture of the pair of lotteries if there exist and such that and .
Axiom 3.3.
Suppose and are common mixtures of . If and , then implies .
3.2 Model
Definition 5.
admits an Avoidable Risk Expected Utility (AREU) representation if it admits an ORDU representation such that for some set of strictly increasing functions ,
-
•
,
-
•
and each implies ,
-
•
implies for some concave function :.
When choice behavior admits an AREU representation, it is as if the reference alternative is first determined by , which ranks safer alternatives higher, and then the decision maker maximizes expected utility using the associated context-dependent (Bernoulli) utility function . Moreover, a safer reference leads to a (weakly) more concave utility function. This generalizes the standard model where a decision maker maximizes expected utility using a single utility function throughout, but departure from expected utility is limited to systematic changes in risk attitude. It can be shown that (for a fixed ) each is unique up to positive affine transformation, except possibly when .191919Uniqueness is demonstrated in 2. When , it is possible that is only used to evaluate lotteries that first order stochastically dominates / dominated by , so that any strictly increasing transformation of is acceptable.
Allais in WARP violations
Perhaps because the Allais paradox is a direct failure of the structural assumption Independence, many models that seek to explain this anomaly weaken Independence but maintain basic rationality. AREU considers an arguably different approach by linking the Allais paradox to a completely different class of failures, WARP violations from non-binary choice sets.
To see the intuition, consider the common ratio effect in binary comparisons: the sure prize of () is preferred to a lottery that yields with chance (), but a lottery that yields with chance () is preferred to a lottery that yields with chance (). If treated as separate decisions, the former decision entails a (Bernoulli) utility function that is more concave than the latter’s under the expected utility functional.202020Let and . Suppose (resp. ) explains the choice from (resp. ) under expected utility. After normalization (for example and ), choice pattern arises if and only if and , which in turn implies is a concave transformation of . But the expected utility theory rules out the use of different utility functions for the same decision maker.212121More precisely, expected utility allows for different utility functions as long as they are related by positive affine transformations, but these utility functions make identical predictions. AREU builds on this observation. Given a reference order that deems safer than , the utility function for the first choice set is more concave, which in consequence allows for the observed pair of choices ( and ) but rules out the opposite pair ( and ).222222Continuing from 20, the opposite behavior requires and is ruled out. This observation resembles the Negative Certainty Independence postulate in Dillenberger (2010); Cerreia-Vioglio, Dillenberger, and Ortoleva (2015). The same prediction applies to the common consequence effect and the lotteries involved can be generalized.232323Consider a degenerate lottery and a lottery such that neither of them first order stochastically dominates another. Consider the lotteries and where is a lottery and , and suppose . If and , then for all such that explains the first choice and explains the second choice, it is straightforward to show that for some concave function . Moreover, these choices can always be explained by an AREU representation such that . Conversely, suppose the choices and admit an AREU representation such that , then whenever (and equivalently whenever ).
Because different utility functions are involved, AREU predicts a novel manifestation of the common ratio effect—typically formulated in binary comparisons—as WARP violations. Consider the lotteries , , , and , related by common mixture.242424 and where . A decision maker who chooses over , over , and over in binary comparisons commits the common ratio effect (between the first two choices), reconciled in AREU by a reference order that ranks highest. Now, consider the choice set , for which must be the reference. The decision maker treats this choice set as having the same context as and use the same utility function that ranks over , which, due to expected utility, requires her to choose from . However, the decision maker chose from , so she has committed a WARP violation. This simple observation introduces a direct link between structural violations and basic rationality violations.
Other evidence
While the Allais paradox takes center stage among anomalies in the risk domain, the evidence and intuition for increased risk aversion in the presence of safer options are also found in a wide range of studies. In a setting meant to test for the compromise effect, Herne (1999) found that the presence of a safer option results in WARP violations in the direction of greater risk aversion. Wakker and Deneffe (1996) introduces the tradeoff method to elicit risk aversion without using a sure prize and found that the estimated utility functions are less concave relative to standard methods that involve sure prizes. Andreoni and Sprenger (2011) found similar effects when the safest option is close enough to certainty. Restricted to binary comparisons, Bleichrodt and Schmidt (2002) studies a model of context-dependent gambling effect where a decision maker has two utility functions and uses the more concave one whenever the binary comparison involves a riskless option.
Linking structural properties to basic rationality
It turns out that compliance with WARP or Independence would independently bring us back to standard expected utility, stated in 1.
Proposition 1.
If admits an AREU representation, then the following are equivalent:
-
1.
satisfies WARP (over ).
-
2.
satisfies Independence (over ).
-
3.
admits an expected utility representation.
-
4.
admits a utility representation.
This also means that if admits any utility representation, then it must also have an expected utility representation.252525As is standard, we say admits a utility representation if there exists a real valued function such that for all . This observation provides a formal separation between AREU and non-expected utility models that uphold basic rationality and further suggests that violation of Independence in this model is a matter of changing preferences.
It can be shown that imposing transitivity achieves the same outcome. Moreover, if transitivity is only satisfied locally, that is, applying only to a region of lotteries, then the model gives rise to betweenness behavior in that region and further implies fanning out if behavior is risk averse and fanning in when it is risk loving. These in-depth analyses are relegated to Lim (2023a).
Model specification and identification
In applications, keeping track of so many utility functions can be challenging, an issue shared in Cerreia-Vioglio, Dillenberger, and Ortoleva (2015), Chakraborty (2021), and Ellis and Masatlioglu (2022).262626Relatedly, models of ambiguity aversion also use a collection of subjective priors (Gilboa and Schmeidler, 1989). AREU provides a middle ground: Knowing that utility functions are related by concave transformations, an analyst might reasonably assume that a decision maker’s utility functions come from a set of constant absolute risk aversion (CARA) utility functions given by a subjective range of Arrow-Pratt coefficients . More generally, it is also possible for risk attitude to progress from risk loving (convex utility functions) to risk averse (concave utility functions). The range of risk attitudes is ultimately subjective and could vary across individuals or demographics; one individual may be moderately but consistently risk averse, with a very small range of CARA coefficients, whereas another individual may be occasionally risk loving but sometimes very risk averse.
Partial subjectivity in the reference order allows for more individual differences but burdens identification. In the extreme case where behavior is consistent with standard expected utility, it is impossible to pin down , although analysis can proceed with standard expected utility. Fortunately, as long as two reference points index different utility functions, identification of between them is guaranteed. First, if two choice sets differ only by , and choices are inconsistent with expected utility maximization, then we identify that ranks higher in than the other alternatives in the choice set. It turns out that the converse is also true. As long as and index different utility functions, if , then we can find choice sets such that and where choices from and violate WARP, meaning we revealed .272727The proof of 1 contains this observation. Essentially, it relies on a less obvious property implied the model that guarantees existence of a full-dimensional subset of lotteries that rank below and in but are better than and when they act as the reference points.
4 Time Preference
The canonical model for time preference is Discounted Utility, where a decision maker evaluates each payment-time pair using exponential discounting, i.e., . But the Stationarity condition within this model is routinely challenged by lab and field subjects who switch their choices between two payments when the decision is made in advance, typically favoring the later option for long-term decisions, an actively studied behavioral phenomenon termed present bias (Laibson, 1997; Frederick, Loewenstein, and O’donoghue, 2002; Benhabib, Bisin, and Schotter, 2010; Halevy, 2015; Chakraborty, 2021; Chambers, Echenique, and Miller, 2023). This section studies how present bias is related WARP-violating preference changes. The original axioms in Fishburn and Rubinstein (1982) are imposed only among choice sets that share a reference point, which in this case is the soonest available payment, as it partially captures how early in advance a decision maker is making the decision.
4.1 Preliminaries and axioms
Let be a non-degenerate interval of payments and let be a non-degenerate interval of time points. Let be the set of all timed payments, where each option is a payment of that arrives at time . Everything else follows 2. To simplify analysis, I assume the upper bound of payments is large enough so that some payment at time is better than the worst payment at time , specifically . The first axiom is standard; greater payments and sooner payments are better.
Axiom 4.1.
-
1.
If , then .
-
2.
If , then .
The well-known Stationarity condition posits that a decision maker’s preference between two future payments is consistent regardless of when the decision is made. Consider the following definition that allows for selective application.
Definition 6.
satisfies Stationarity over if for all and , if , , , and , then .
Whereas global compliance with Stationarity is captured by , the next axiom demands local compliance. Specifically, it requires Stationarity to be satisfied between any two choice sets that share an earliest payment.
Definition 7.
Let be the set of earliest payments in .
Axiom 4.2 (Time Reference Dependence).
If , then satisfies WARP and Stationary over .
It turns out that 4.2 is an application of Reference Dependence (2), formalized by 1, which assures us that the proposed approach is related to demanding compliance between certain pairs of choice sets.
Lemma 1.
satisfies 4.2 if and only if for every and , satisfies WARP and Stationarity over .
The next postulate rules out increased patience when more options become available. The intuition is that additional options can only tempt the decision maker to become more impatient, so if an impatient decision is already made from , for example if is (strictly) chosen over where , then there is no superset such that the decision maker becomes more patient by choosing in the presence of .
Axiom 4.3.
Suppose , , and . If and , then implies .
However, this falls short of definitively capturing changes in patience. Even in a completely standard world where every individual maximizes exponentially discounted utility, behavioral differences in delay aversion (among individuals) cannot be definitively decomposed into differences in discounting and differences in consumption utility, an issue discussed in Ok and Benoît (2007). Meaning an individual who prefers the sooner alternative could have greater patience paired with lower marginal utility for money.
The last postulate addresses this issues by capturing fixed consumption utilities under varying discounting/patience: Suppose a decision maker is indifferent between all options in the choice set , where and . Then in the choice set where , since the delays between options have shortened, a standard exponential discounting decision maker would pick as the new choice. Yet, our decision maker will face competing forces. On one hand, the possibility of sooner consumption makes her more impatient; on the other hand, shorter delays between options make later payments more attractive. Allowing her the freedom to resolve these competing forces, the next postulate requires that if she ends up choosing both and —as if the competing forces are balanced—then she must also choose the intermediate option . The same requirement applies when a common delay (or advancement) is additionally imposed. Both 4.3 and 4.4 are trivially satisfied in exponential discounting.
Axiom 4.4.
Consider such that and such that and . If , then either , , or .
4.2 Model
Since we consider the standard environment where sooner is always better, discount factors are restricted to non-negative real numbers strictly less than 1, with the exception of for which is possible.
Definition 8.
admits a Present-Biased Exponentially Discounted Utility (PEDU) representation if it admits an ORDU representation such that for some strictly increasing function and set of discount factors ,
-
•
,
-
•
implies and ,
-
•
implies .
In this model, it is as if the decision maker maximizes exponentially discounted utility, but with discount factors that depend on the timing of the earliest available payment. When it is possible to choose an early payment, the decision maker uses a lower discount factor, resulting in behavior that reflects reduced patience. The model thus delivers present bias behavior using familiar technologies—since the exponential discounting form is preserved in every instance of decision-making, changes in patience are simply captured by set-dependent discount factors. Intuitively, with the entire set of possible payments progressively postponed, the decision maker begins to treat them more akin to long-term concerns than before, resulting in increased patience.
It can be shown that is unique given , except possibly when .282828Uniqueness is demonstrated in the proof of 3. When , is only used to evaluate alternatives that also arrive at time , so any paired with a strictly increasing can explain those choices. It could still be unique if , since a PEDU representation requires . In applications, since the reference order and the discount factors depend only on the timing of a payment, it is without loss to consider discount factors that are based on time rather than on alternatives. This is achieved by setting for all and then using the earliest available time of a payment as reference point.
Generalized single-switching
Changes in preferences are tractable due to a generalized single-switching property. In binary comparisons, a unique threshold captures the postponement beyond which the later payment will be chosen and before which the sooner payment will be chosen. In more general choice sets, this threshold no longer guarantees a choice between the two timed payments but continues to stipulate the point of postponement beyond which the sooner payment cannot be chosen (because the later payment is available) and before which the later payment cannot be chosen (because the sooner payment is available). This generalized single-switching property thus extends our understanding of present bias in binary comparisons to arbitrary choice sets—even in the absence of basic rationality assumptions—and it is closely tied to the unified framework in which references are ordered and preference shifts systematically along this established order.
Present bias in WARP violations
Although present bias is typically viewed as a structural violation, PEDU predicts a novel manifestation of present bias as WARP violations. Consider the present bias behavior where “$20 in 4 days” is chosen over “$18 in 3 days”, but “$18 today” is chosen over “$20 tomorrow”. In PEDU, this behavior is explained using a lower discount factor for the latter choice set. However, notice that under this lower discount factor, “$18 in 3 days” is preferred to “$20 in 4 days”, so the introduction of a third option that induces this discount factor but is not itself chosen, for example “$15 today”, will result in a reversal where “$18 in 3 days” is chosen over “$20 in 4 days”. This is now a WARP violation that shares the same underlying driver as present bias behavior, even though present bias is typically studied in binary comparisons. In fact, consistent with the spirit of present bias, WARP violations in PEDU are restricted to decreased patience, and only when sooner payments are added.
Linking structural properties to basic rationality
To further ascertain the aforementioned connection, 2 shows that relaxing just one of the two conditions would fully recover standard exponential discounting. Consequently, if a PEDU decision maker has any utility representation, then she must also have a standard exponential discounting utility representation. This adds to the suggestion that anomalies captured by PEDU are rooted in systematic changes in preferences.
Proposition 2.
If admits a PEDU representation, then the following are equivalent:
-
1.
satisfies WARP (over ).
-
2.
satisfies Stationarity (over ).
-
3.
admits an exponential discounting utility representation.
-
4.
admits a utility representation.
Hyperbolic discounting
2 separates PEDU from hyperbolic discounting, quasi-hyperbolic discounting, and related generalizations (Phelps and Pollak, 1968; Loewenstein and Prelec, 1992; Laibson, 1997; Frederick, Loewenstein, and O’donoghue, 2002; Chambers, Echenique, and Miller, 2023; Chakraborty, 2021) due to their adherence to basic rationality, but the empirically informed intuition that discount factors can vary is shared. In contrast, PEDU varies discount factors at the choice problem level whereas hyperbolic discounting does so at the alternative level. Binary comparisons hold similar behavioral implications: when two options are gradually advanced, there may be a point where the choice is switched from the sooner to the later.292929Chakraborty (2021) calls this Weak Present Bias and studies its implications. But for larger choice sets, unlike PEDU, hyperbolic discounting predicts that the preference ranking between any two options stays the same regardless of the presence of a third alternative.
WARP violations in other time preference settings
Beyond the conventional time preference setting, an active literature on menu preference applies Gul and Pesendorfer (2001)’s temptation model to decision makers who prefer a smaller menu in order to prevent their future selves from committing undesirable present bias behaviors (Noor, 2011; Lipman, Pesendorfer, et al., 2013; Ahn, Iijima, Le Yaouanq, and Sarver, 2019). In these models, past and future selves prefer to choose differently from the same set of alternatives, which could manifest as a reversal if played out, therefore PEDU and these models tackle dynamic inconsistency using related intuitions about long-term and short-term attitudes.
Freeman (2021)’s task completion study, which is related to the above literature and closer to PEDU’s setting, considers a time-inconsistent decision maker who exhibits choice reversals when additional opportunities for completions are introduced. In particular, a sophisticated decision maker ends up completing the task earlier, therefore choosing a sooner option when choice set expands is a common theme between our work. However, the manifestation of this behavior is different; a reversal in PEDU can only occur when an alternative earlier than any other is added, yet in Freeman (2021), adding this kind of alternatives either results in the addition chosen or the choice remains unchanged, therefore WARP will hold.
Consumption streams
Focusing on one time payment helps glean the intuition of this framework, but the approach already suggests how an extension to consumption streams can be conducted, where a decision maker maximizes (for discrete time). If is the consumption stream that offers the soonest payment, then the characterization amounts to adding Koopmans (1960)’s axioms alongside WARP and Stationarity using Reference Dependence (2). B clarifies what axioms can be accommodated, and it includes common versions of separability.
5 Social Preference
Consider a decision maker whose willingness to share is greater when the situation allows for greater equality. It departs from models of other-regarding preferences that capture a fixed inequality aversion (Fehr and Schmidt, 1999; Bolton and Ockenfels, 2000; Charness and Rabin, 2002). To illustrate, suppose a decision maker is endowed with and is asked to share it with another individual. However, instead of choosing any split of this , she was only given a few options. When asked to choose between giving and giving , giving may seem like a fair decision. However, when the choice is between giving , , or , she may opt for giving instead. The choices and violate WARP, and hence a fixed utility function, even if it captures other-regarding preferences and inequality aversion, is incapable of explaining this behavior.
5.1 Preliminaries and axioms
Let , where , be a set of income distributions. For each option , is the dollar amount received by the decision maker and is the dollar amount given to another individual. Everything else follows 2. The first axiom assumes that an income distribution is strictly preferred when it gives someone more and no one less.
Axiom 5.1.
If , and , then .
Reference Dependence (2) adapts to this domain and characterizes choices that conform with quasi-linear preferences when the underlying choice sets have the same level of attainable equality. Since the impending model involves reference-dependent preferences, using quasi-linear utilities as baseline (rather than using more general models of other-regarding preferences) provides meaningful restrictions.
Definition 9.
satisfies Quasi-linearity over if for all and , if , , , and , then .
The measure of attainable equality is based on the Gini coefficient,
which ranges from (most balanced) to (least balanced) for our 2-agents setting. Analogous to other domains, compliance with WARP and Quasi-linearity is called for when two choice sets share a Gini-minimizing income distribution.
Definition 10.
Let be the set of most-balanced income distributions in .
Axiom 5.2 (Equality Reference Dependence).
For any and any most-balanced income distribution , satisfies WARP and Quasi-linearity over .
The next and last postulate regulates changes in preferences. Suppose and a decision maker chooses to share more than to share less . I postulate that making more options available will not cause the decision maker to switch to sharing less, since the added options can only increase attainable equality.
Axiom 5.3.
Suppose and . If and , then .
5.2 Model
Definition 11.
admits a Fairness-based Social Preference Utility (FSPU) representation if it admits an ORDU representation such that for some set of strictly increasing functions ,
-
•
,
-
•
implies and for all ,
-
•
implies .
FSPU combines an objective measure of equality with a subjective interpretation of fairness. Every decision maker bases her choice on the Gini-minimizing option, , as it captures the amount of attainable equality in a choice set. When attainable equality is higher ( is lower), utility difference between sharing more and sharing less increases, reflecting increased willingness to share. The amount of increase depends on the decision maker’s subjective sense of fairness. A very large increase causes WARP violations, where the decision maker switches from an option that shares less to an option that shares more even though both options are always present. Like the other domains, preference parameters are unique.303030Uniqueness is demonstrated in the proof of 4.
For applications, it is without loss to further simplify FSPU by using Gini coefficient—rather than alternatives—to index context-dependent utility from sharing. To do so, for all , set where , and then use the lowest attainable Gini coefficients as reference points.
Menu-dependent altruism
As in the motivating example, the model explains context-dependent willingness to share when distributing a fixed pie with different splitting options. Suppose a decision maker is allocating between herself and another individual, and each choice set is characterized by a set of splitting fractions . That is, she can allocate to herself and to other party if and only if . Consider and . Since attainable equality is greater in (it contains an equal split), a decision maker who chooses from may exhibit increased willingness to share that results in choosing from , even if this violates WARP. But the model rules out the opposite behavior: A decision maker who chooses from cannot choose from , since it would imply decreased willingness to share. Also, a reversal cannot happen between and since they have the same level of attainable equality.
Equality over generosity
Willingness to share is maximized when a perfectly balanced income distribution is available. In particular, the model captures increased altruism not due to the opportunity to give more per se, but due to the opportunity to be equal. To illustrate the difference, consider the same example but with and . Even though contains alternatives that achieve greater equality, the decision maker’s ability to give is the same across the two choice sets. Yet, since the feasible allocations are always unfavorable to her, higher attainable equality results from her ability to take more. In this example, the decision maker may be interpreted as being less altruistic when the world is unfair to her, but becomes more altruistic when more greater equality becomes possible.
Fairness over efficiency
Consider one last application where FSPU allows for willingness to forgo a greater total surplus in favor of sharing. Suppose the decision maker must choose between and . The second option is appealing in that the total amount of money is greater, whereas the first option sacrifices both total surplus and payment to oneself in order to provide a share to the other individual. Suppose is chosen. In FSPU, adding as an option can cause the decision maker to switch from to due to increased altruism. While this behavior seems reasonable, it is inconsistent with any model that complies with WARP.
Empirical evidence
The vast literature on distributional preferences provides suggestive evidence for FSPU behavior. Moreover, unlike the case of risk and time domains, they do focus on basic rationality violations. In dictator games, List (2007); Bardsley (2008); Korenok, Millner, and Razzolini (2014) find that changes to a dictator’s choice set affect her willingness to give and result in WARP-violating choices. Dana, Cain, and Dawes (2006) investigate the underlying mechanism by making the dictator game an option and Dana, Weber, and Kuang (2007) do so by manipulating the visibility of the choice set. They find the audience effect, where fair behavior is the result of subjects’ desire to be perceived (by themselves and others) as fair. Rabin (1993) studies an intention-based explanation in game theoretic settings where kindness is reciprocated. Although existing studies motivate FSPU, the model does not distinguish between willingness to share that depends intrinsically on outcomes and that resulting from intentions.313131More on outcome-based vs intention-based inequality aversion can be found in Ainslie (1992), Nelson (2002), Fehr and Schmidt (2006), Sutter (2007), and Kagel and Roth (2016).
In a more recent study, Cox, List, Price, Sadiraj, and Samek (2016) conduct experiments that explicitly test for basic rationality violations in dictator games and, consistent with FSPU, find that shrinking a choice set results in WARP violations in the direction of keeping more for oneself. They propose a modification to basic rationality by introducing a testable prediction based on a definition of moral reference points, which depend on the framing of the problem (e.g., “Give” and “Take”) and features of the feasible distributions. When moral reference points are fixed, rationality postulates are satisfied; otherwise, violations favor the party who benefits from the new moral reference point. Their work provides empirical support for FSPU, which in turn offers a theory that complements their findings.
Observable contexts and menu preference
The intuitions contained in FSPU resonates with other studies that, unlike FSPU, exploit a richer setting. In settings that include multiple actors, Cox, Friedman, and Sadiraj (2008) study how the generosity of a first mover affects the altruism of a second mover. Cheung (2023) focuses on a second mover who, more generally, makes different decisions from the same choice set based on how the underlying choice set was chosen by a first mover. Relatedly, van Bruggen, Heufer, and Yang (2023) consider a decision maker whose social preference depends on exogeneous contexts like “selfish” and “generous”. In a menu preference setting, Dillenberger and Sadowski (2012) study a decision maker who has shame concern and prefers a smaller menu that excludes normatively better allocations that entail lower self-payoffs, since not choosing those options can induce shame.
Linking structural properties to basic rationality
Like before, 3 shows that WARP violation and failure of standard postulate (Quasi-linearity) are linked. In this setting, it also suggests that wealth effects may be in part contributed by reference dependent preferences.323232Quasi-linear utility in wealth is often interpreted as the absence of wealth effects. In this domain, it means an individual’s willingness to give does not depend on how much she would have left—her wealth—because if giving is better than giving with a base wealth , i.e., , then the same holds true at a different wealth level , i.e., .
Proposition 3.
If admits a FSPU representation, then the following are equivalent:
-
1.
satisfies WARP (over ).
-
2.
satisfies Quasi-linearity (over ).
-
3.
admits a quasi-linear utility representation.
-
4.
admits a utility representation.
6 Conclusion
This paper presents a single, unifying, framework for reference-based context-dependent preferences. The key innovation, Reference Dependence (RD), provides a way to jointly and systematically weaken multiple postulates even if they are conceptually distinct. The method is then applied to the risk, time, and social domains where basic rationality postulates and structural postulates are jointly relaxed, upholding the core principles of normative postulates by demanding their local compliance. In each setting, behavior can be understood as the result of canonical models when reference points are fixed, and deviations from these models are accounted for by systematic changes in reference-dependent preference parameters. Reference points in this framework are determined by the maximization of a reference order, which can be viewed as an instrument that captures the relevant context of a choice problem.
Building upon decades of domain-specific research on seemingly independent structural anomalies, including but not limited to the Allais paradox and present bias behavior, this paper studies a possible link that could relate them to WARP violations. This, in turn, informs more fundamentally on the relationship between rationality postulates and structural postulates. The exercise adds to our understanding of why normative postulates fail, offers new ways to introduce assumptions, and suggests new avenues for empirical research.
References
- (1)
- Ahn, Iijima, Le Yaouanq, and Sarver (2019) Ahn, D. S., R. Iijima, Y. Le Yaouanq, and T. Sarver (2019): “Behavioural Characterizations of Naivete for Time-inconsistent Preferences,” The Review of Economic Studies, 86(6), 2319–2355.
- Ainslie (1992) Ainslie, G. (1992): Picoeconomics: The Strategic Interaction of Successive Motivational States within the Person. Cambridge University Press.
- Allais (1990) Allais, M. (1990): “Allais Paradox,” in Utility and Probability, pp. 3–9. Springer.
- Andreoni and Sprenger (2011) Andreoni, J., and C. Sprenger (2011): “Uncertainty Equivalents: Testing the Limits of the Independence Axiom,” Working paper, National Bureau of Economic Research.
- Apesteguia and Ballester (2009) Apesteguia, J., and M. A. Ballester (2009): “A Theory of Reference-Dependent Behavior,” Economic Theory, 40, 427–455.
- Aumann (1962) Aumann, R. J. (1962): “Utility Theory without the Completeness Axiom,” Econometrica, pp. 445–462.
- Aumann and Serrano (2008) Aumann, R. J., and R. Serrano (2008): “An Economic Index of Riskiness,” Journal of Political Economy, 116(5), 810–836.
- Bardsley (2008) Bardsley, N. (2008): “Dictator Game Giving: Altruism or Artefact?,” Experimental Economics, 11(2), 122–133.
- Benhabib, Bisin, and Schotter (2010) Benhabib, J., A. Bisin, and A. Schotter (2010): “Present-Bias, Quasi-Hyperbolic Discounting, and Fixed Costs,” Games and Economic Behavior, 69(2), 205–223.
- Bleichrodt and Schmidt (2002) Bleichrodt, H., and U. Schmidt (2002): “A Context-Dependent Model of the Gambling Effect,” Management Science, 48(6), 802–812.
- Bolton and Ockenfels (2000) Bolton, G. E., and A. Ockenfels (2000): “ERC: A Theory of Equity, Reciprocity, and Competition,” American Economic Review, 90(1), 166–193.
- Bordalo, Gennaioli, and Shleifer (2012) Bordalo, P., N. Gennaioli, and A. Shleifer (2012): “Salience Theory of Choice under Risk,” Quarterly Journal of Economics, 127(3), 1243–1285.
- Bordalo, Gennaioli, and Shleifer (2013) (2013): “Salience and Consumer Choice,” Journal of Political Economy, 121(5), 803–843.
- Burks, Carpenter, Goette, and Rustichini (2009) Burks, S. V., J. P. Carpenter, L. Goette, and A. Rustichini (2009): “Cognitive Skills Affect Economic Preferences, Strategic Behavior, and Job Attachment,” Proceedings of the National Academy of Sciences, 106(19), 7745–7750.
- Cerreia-Vioglio, Dillenberger, and Ortoleva (2015) Cerreia-Vioglio, S., D. Dillenberger, and P. Ortoleva (2015): “Cautious Expected Utility and the Certainty Effect,” Econometrica, 83(2), 693–728.
- Chakraborty (2021) Chakraborty, A. (2021): “Present Bias,” Econometrica, 89(4), 1921–1961.
- Chakraborty, Halevy, and Saito (2020) Chakraborty, A., Y. Halevy, and K. Saito (2020): “The Relation between Behavior under Risk and over Time,” American Economic Review: Insights, 2(1), 1–16.
- Chambers, Echenique, and Miller (2023) Chambers, C. P., F. Echenique, and A. D. Miller (2023): “Decreasing Impatience,” American Economic Journal: Microeconomics, 15(3), 527–551.
- Chapman, Dean, Ortoleva, Snowberg, and Camerer (2023) Chapman, J., M. Dean, P. Ortoleva, E. Snowberg, and C. Camerer (2023): “Econographics,” Journal of Political Economy Microeconomics, 1(1), 115–161.
- Charness and Rabin (2002) Charness, G., and M. Rabin (2002): “Understanding Social Preferences with Simple Tests,” Quarterly Journal of Economics, 117(3), 817–869.
- Chen, Liu, Shan, Zhong, and Zhou (2023) Chen, M., T. X. Liu, Y. Shan, S. Zhong, and Y. Zhou (2023): “The Consistency of Rationality Measures,” Unpublished.
- Cheung (2023) Cheung, P. H. Y. (2023): “Revealed Reciprocity,” Unpublished.
- Chew (1983) Chew, S. H. (1983): “A Generalization of the Quasilinear Mean with Applications to the Measurement of Income Inequality and Decision Theory Resolving the Allais Paradox,” Econometrica, 51(4), 1065–1092.
- Cox, Friedman, and Sadiraj (2008) Cox, J. C., D. Friedman, and V. Sadiraj (2008): “Revealed Altruism,” Econometrica, 76(1), 31–69.
- Cox, List, Price, Sadiraj, and Samek (2016) Cox, J. C., J. A. List, M. Price, V. Sadiraj, and A. Samek (2016): “Moral Costs and Rational Choice: Theory and Experimental Evidence,” Working paper, National Bureau of Economic Research.
- Dana, Cain, and Dawes (2006) Dana, J., D. M. Cain, and R. M. Dawes (2006): “What You Don’t Know Won’t Hurt Me: Costly (But Quiet) Exit in Dictator Games,” Organizational Behavior and Human Decision Processes, 100(2), 193–201.
- Dana, Weber, and Kuang (2007) Dana, J., R. A. Weber, and J. X. Kuang (2007): “Exploiting Moral Wiggle Room: Experiments Demonstrating an Illusory Preference for Fairness,” Economic Theory, 33(1), 67–80.
- de Clippel and Rozen (2021) de Clippel, G., and K. Rozen (2021): “Bounded Rationality and Limited Data Sets,” Theoretical Economics, 16(2), 359–380.
- de Clippel and Rozen (2023) (2023): “Relaxed Optimization: How Close Is a Consumer to Satisfying First-Order Conditions?,” Review of Economics and Statistics, 105(4), 883–898.
- Dean, Kıbrıs, and Masatlioglu (2017) Dean, M., Ö. Kıbrıs, and Y. Masatlioglu (2017): “Limited Attention and Status Quo Bias,” Journal of Economic Theory, 169, 93–127.
- Dean and Ortoleva (2019) Dean, M., and P. Ortoleva (2019): “The Empirical Relationship between Nonstandard Economic Behaviors,” Proceedings of the National Academy of Sciences, 116(33), 16262–16267.
- Dekel (1986) Dekel, E. (1986): “An Axiomatic Characterization of Preferences under Uncertainty: Weakening the Independence Axiom,” Journal of Economic Theory, 40(2), 304–318.
- Dembo, Kariv, Polisson, and Quah (2021) Dembo, A., S. Kariv, M. Polisson, and J. Quah (2021): “Ever Since Allais,” Bristol Economics Discussion Papers, 21/745.
- Dillenberger (2010) Dillenberger, D. (2010): “Preferences for One-Shot Resolution of Uncertainty and Allais-Type Behavior,” Econometrica, 78(6), 1973–2004.
- Dillenberger and Sadowski (2012) Dillenberger, D., and P. Sadowski (2012): “Ashamed to be Selfish,” Theoretical Economics, 7(1), 99–124.
- Dubra, Maccheroni, and Ok (2004) Dubra, J., F. Maccheroni, and E. A. Ok (2004): “Expected Utility Theory Without the Completeness Axiom,” Journal of Economic Theory, 115(1), 118–133.
- Echenique, Imai, and Saito (2020) Echenique, F., T. Imai, and K. Saito (2020): “Testable Implications of Models of Intertemporal Choice: Exponential Discounting and its Generalizations,” American Economic Journal: Microeconomics, 12(4), 114–143.
- Echenique, Imai, and Saito (2023) (2023): “Approximate Expected Utility Rationalization,” Journal of the European Economic Association, p. jvad028.
- Ellis and Masatlioglu (2022) Ellis, A., and Y. Masatlioglu (2022): “Choice with Endogenous Categorization,” Review of Economic Studies, 89(1), 240–278.
- Evren (2014) Evren, Ö. (2014): “Scalarization Methods and Expected Multi-Utility Representations,” Journal of Economic Theory, 151, 30–63.
- Falk, Becker, Dohmen, Enke, Huffman, and Sunde (2018) Falk, A., A. Becker, T. Dohmen, B. Enke, D. Huffman, and U. Sunde (2018): “Global Evidence on Economic Preferences,” Quarterly Journal of Economics, 133(4), 1645–1692.
- Fehr and Schmidt (1999) Fehr, E., and K. M. Schmidt (1999): “A Theory of Fairness, Competition, and Cooperation,” Quarterly Journal of Economics, 114(3), 817–868.
- Fehr and Schmidt (2006) (2006): “The Economics of Fairness, Reciprocity and Altruism: Experimental Evidence and New Theories,” Handbook of the Economics of Giving, Altruism and Reciprocity, 1, 615–691.
- Fishburn (1983) Fishburn, P. C. (1983): “Transitive Measurable Utility,” Journal of Economic Theory, 31(2), 293–317.
- Fishburn and Rubinstein (1982) Fishburn, P. C., and A. Rubinstein (1982): “Time Preference,” International Economic Review, 23(3), 677–694.
- Frederick, Loewenstein, and O’donoghue (2002) Frederick, S., G. Loewenstein, and T. O’donoghue (2002): “Time Discounting and Time Preference: A Critical Review,” Journal of Economic Literature, 40(2), 351–401.
- Freeman (2017) Freeman, D. J. (2017): “Preferred Personal Equilibrium and Simple Choices,” Journal of Economic Behavior & Organization, 143, 165–172.
- Freeman (2021) Freeman, D. J. (2021): “Revealing Naïveté and Sophistication from Procrastination and Preproperation,” American Economic Journal: Microeconomics, 13(2), 402–38.
- Giarlotta, Petralia, and Watson (2023) Giarlotta, A., A. Petralia, and S. Watson (2023): “Context-Sensitive Rationality: Choice by Salience,” Journal of Mathematical Economics, 109, 102913.
- Gilboa and Schmeidler (1989) Gilboa, I., and D. Schmeidler (1989): “Maxmin Expected Utility with Non-Unique Prior,” Journal of Mathematical Economics, 18(2), 141–153.
- Gul (1991) Gul, F. (1991): “A Theory of Disappointment Aversion,” Econometrica, 59(3), 667–686.
- Gul and Pesendorfer (2001) Gul, F., and W. Pesendorfer (2001): “Temptation and Self-Control,” Econometrica, 69(6), 1403–1435.
- Halevy (2015) Halevy, Y. (2015): “Time Consistency: Stationarity and Time Invariance,” Econometrica, 83(1), 335–352.
- Halevy, Walker-Jones, and Zrill (2023) Halevy, Y., D. Walker-Jones, and L. Zrill (2023): “Difficult Decisions,” Working papers, University of Toronto, Department of Economics.
- Hara, Ok, and Riella (2019) Hara, K., E. A. Ok, and G. Riella (2019): “Coalitional Expected Multi-Utility Theory,” Econometrica, 87(3), 933–980.
- Herne (1999) Herne, K. (1999): “The Effects of Decoy Gambles on Individual Choice,” Experimental Economics, 2(1), 31–40.
- Houthakker (1950) Houthakker, H. S. (1950): “Revealed Preference and the Utility Function,” Economica, 17(66), 159–174.
- Huber, Payne, and Puto (1982) Huber, J., J. W. Payne, and C. Puto (1982): “Adding Asymmetrically Dominated Alternatives: Violations of Regularity and the Similarity Hypothesis,” Journal of Consumer Research, 9(1), 90–98.
- Kagel and Roth (2016) Kagel, J. H., and A. E. Roth (2016): The Handbook of Experimental Economics, vol. 1. Princeton University Press.
- Kahneman, Knetsch, and Thaler (1991) Kahneman, D., J. L. Knetsch, and R. H. Thaler (1991): “Anomalies: The Endowment Effect, Loss Aversion, and Status Quo Bias,” Journal of Economic Perspectives, 5(1), 193–206.
- Kahneman and Tversky (1979) Kahneman, D., and A. Tversky (1979): “Prospect Theory: An Analysis of Choice under Risk,” Econometrica, 47(2), 263–292.
- Ke and Chen (2022) Ke, S., and Z. Chen (2022): “From Local Utility to Neural Networks,” Unpublished.
- Kıbrıs, Masatlioglu, and Suleymanov (2023) Kıbrıs, Ö., Y. Masatlioglu, and E. Suleymanov (2023): “A Theory of Reference Point Formation,” Economic Theory, 75, 137–166.
- Kibris, Masatlioglu, and Suleymanov (2024) Kibris, Ã., Y. Masatlioglu, and E. Suleymanov (2024): “A Random Reference Model,” American Economic Journal: Microeconomics, 16(1), 155–209.
- Kivetz, Netzer, and Srinivasan (2004) Kivetz, R., O. Netzer, and V. Srinivasan (2004): “Alternative Models for Capturing the Compromise Effect,” Journal of Marketing Research, 41(3), 237–257.
- Koopmans (1960) Koopmans, T. C. (1960): “Stationary Ordinal Utility and Impatience,” Econometrica, 28(2), 287–309.
- Korenok, Millner, and Razzolini (2014) Korenok, O., E. L. Millner, and L. Razzolini (2014): “Taking, Giving, and Impure Altruism in Dictator Games,” Experimental Economics, 17(3), 488–500.
- Kőszegi and Rabin (2006) Kőszegi, B., and M. Rabin (2006): “A Model of Reference-Dependent Preferences,” Quarterly Journal of Economics, 121(4), 1133–1165.
- Kőszegi and Rabin (2007) (2007): “Reference-Dependent Risk Attitudes,” American Economic Review, 97(4), 1047–1073.
- Kőszegi and Szeidl (2013) Kőszegi, B., and A. Szeidl (2013): “A Model of Focusing in Economic Choice,” Quarterly Journal of Economics, 128(1), 53–104.
- Kovach (2020) Kovach, M. (2020): “Twisting the Truth: Foundations of Wishful Thinking,” Theoretical Economics, 15(3), 989–1022.
- Laibson (1997) Laibson, D. (1997): “Golden Eggs and Hyperbolic Discounting,” Quarterly Journal of Economics, 112(2), 443–478.
- Lanzani (2022) Lanzani, G. (2022): “Correlation Made Simple: Applications to Salience and Regret Theory,” The Quarterly Journal of Economics, 137(2), 959–987.
- Lim (2023a) Lim, X. Z. (2023a): “An Analysis of Avoidable Risk Expected Utility,” Unpublished.
- Lim (2023b) (2023b): “An Analysis of Ordered Reference Dependent Utility,” Unpublished.
- Lipman, Pesendorfer, et al. (2013) Lipman, B. L., W. Pesendorfer, et al. (2013): “Temptation,” in Advances in economics and econometrics: Tenth World Congress, vol. 1, pp. 243–288. Cambridge University Press.
- List (2007) List, J. A. (2007): “On the Interpretation of Giving in Dictator Games,” Journal of Political Economy, 115(3), 482–493.
- Loewenstein and Prelec (1992) Loewenstein, G., and D. Prelec (1992): “Anomalies in Intertemporal Choice: Evidence and an Interpretation,” Quarterly Journal of Economics, 107(2), 573–597.
- Machina (1982) Machina, M. J. (1982): “‘Expected Utility’ Analysis without the Independence Axiom,” Econometrica, 50(2), 277–323.
- Manzini and Mariotti (2008) Manzini, P., and M. Mariotti (2008): “On the Representation of Incomplete Preferences over Risky Alternatives,” Theory and Decision, 65, 303–323.
- Masatlioglu and Ok (2005) Masatlioglu, Y., and E. A. Ok (2005): “Rational Choice with Status Quo Bias,” Journal of Economic Theory, 121(1), 1–29.
- Masatlioglu and Ok (2014) (2014): “A Canonical Model of Choice with Initial Endowments,” Review of Economic Studies, 81(2), 851–883.
- Nelson (2002) Nelson, Jr., W. R. (2002): “Equity or Intention: It is the Thought that Counts,” Journal of Economic Behavior & Organization, 48(4), 423–430.
- Nielsen and Rehbeck (2022) Nielsen, K., and J. Rehbeck (2022): “When Choices are Mistakes,” American Economic Review, 112(7), 2237–2268.
- Noor (2011) Noor, J. (2011): “Temptation and Revealed Preference,” Econometrica, 79(2), 601–644.
- Noor and Takeoka (2015) Noor, J., and N. Takeoka (2015): “Menu-dependent Self-control,” Journal of Mathematical Economics, 61, 1–20.
- Ok and Benoît (2007) Ok, E. A., and J.-P. Benoît (2007): “Delay Aversion,” Theoretical Economics, 2(1), 71–113.
- Ok, Ortoleva, and Riella (2015) Ok, E. A., P. Ortoleva, and G. Riella (2015): “Revealed (P)Reference Theory,” American Economic Review, 105(1), 299–321.
- Orhun (2009) Orhun, A. Y. (2009): “Optimal Product Line Design when Consumers Exhibit Choice Set-Dependent Preferences,” Marketing Science, 28(5), 868–886.
- Ortoleva (2010) Ortoleva, P. (2010): “Status Quo Bias, Multiple Priors and Uncertainty Aversion,” Games and Economic Behavior, 69(2), 411–424.
- Phelps and Pollak (1968) Phelps, E. S., and R. A. Pollak (1968): “On Second-best National Saving and Game-equilibrium Growth,” Review of Economic Studies, 35(2), 185–199.
- Polisson, Quah, and Renou (2020) Polisson, M., J. K.-H. Quah, and L. Renou (2020): “Revealed Preferences over Risk and Uncertainty,” American Economic Review, 110(6), 1782–1820.
- Quiggin (1982) Quiggin, J. (1982): “A Theory of Anticipated Utility,” Journal of Economic Behavior & Organization, 3(4), 323–343.
- Rabin (1993) Rabin, M. (1993): “Incorporating Fairness into Game Theory and Economics,” American Economic Review, 83(5), 1281–1302.
- Ravid and Steverson (2021) Ravid, D., and K. Steverson (2021): “Bad Temptation,” Journal of Mathematical Economics, 95, 102480.
- Rubinstein and Salant (2006) Rubinstein, A., and Y. Salant (2006): “Two Comments on the Principle of Revealed Preference,” Unpublished.
- Rubinstein and Zhou (1999) Rubinstein, A., and L. Zhou (1999): “Choice Problems with a ’Reference’ Point,” Mathematical Social Sciences, 37(3), 205–209.
- Sagi (2006) Sagi, J. S. (2006): “Anchored Preference Relations,” Journal of Economic Theory, 130(1), 283–295.
- Samuelson (1948) Samuelson, P. A. (1948): “Consumption Theory in Terms of Revealed Preference,” Economica, 15(60), 243–253.
- Simonson (1989) Simonson, I. (1989): “Choice Based on Reasons: The Case of Attraction and Compromise Effects,” Journal of Consumer Research, 16(2), 158–174.
- Stango and Zinman (2023) Stango, V., and J. Zinman (2023): “We Are All Behavioural, More, or Less: A Taxonomy of Consumer Decision-Making,” Review of Economic Studies, 90(3), 1470–1498.
- Sutter (2007) Sutter, M. (2007): “Outcomes versus Intentions: On the Nature of Fair Behavior and its Development with Age,” Journal of Economic Psychology, 28(1), 69–78.
- Tserenjigmid (2019) Tserenjigmid, G. (2019): “Choosing with the Worst in Mind: A Reference-Dependent Model,” Journal of Economic Behavior & Organization, 157, 631–652.
- Tversky and Kahneman (1991) Tversky, A., and D. Kahneman (1991): “Loss Aversion in Riskless Choice: A Reference-Dependent Model,” Quarterly Journal of Economics, 106(4), 1039–1061.
- van Bruggen, Heufer, and Yang (2023) van Bruggen, P., J. Heufer, and J. Yang (2023): “Giving According to Agreement,” Unpublished.
- Wakker and Deneffe (1996) Wakker, P., and D. Deneffe (1996): “Eliciting von Neumann-Morgenstern Utilities when Probabilities are Distorted or Unknown,” Management Science, 42(8), 1131–1150.
Appendix A Appendix: Proofs
1, 2, 3, and 4 require a technical result, 2, that generalizes a large class of behavioral postulates called finite theories in a reference dependent manner. The result is stated now but formally introduced and proved in B. The definition of a finite theory is also given in B, it includes WARP, Independence, Stationarity, and Quasi-linearity.
A correspondence where is called an correspondence if for all , if and , then . Given a linear order , let denote the unique element such that for all . A linear order is called -consistent if for all , .
Lemma 2.
Consider a choice correspondence , a finite theory , and an correspondence . The following are equivalent:
-
1.
(Reference Dependence) For every choice set , satisfies over for some .
-
2.
There exists a -consistent linear order such that for all , satisfies over .
A.1 Proof of Theorem 1
Lemma 3.
Suppose is finite. A choice correspondence satisfies 2.1 if and only if it admits an ORDU representation.
Proof.
“If” is straightforward. I prove “only if”. Denote by the set of alternatives in such that satisfies WARP over , guaranteed to be non-empty by 2.1. Create a list in the following way: List elements of with an arbitrary order. Since is again finite, continue listing elements of with an arbitrary order; continue until every is listed. Finally, let denote the position of in the list. For any , construct if .
For each , it maximizes among alternatives in , hence by construction satisfies WARP over . Now construct . Set for all . For each , since , we set or or both according to . For each such that and , since , we set or or both according to , this is guaranteed by the fact that satisfies WARP over . Now, is complete on the set , which we call the prediction set of . Now consider . Set for all and set for all , . The constructed is now complete. For transitivity, suppose and , and that (if any of them is in then the argument is straightforward by ), hence and . Furthermore, since , we have , and satisfies WARP over implies , and hence , which implies . So is transitive.
Finally, we show that and explain . For any , since is finite and is a linear order, there is a unique maximizer , hence . Suppose for contradiction but , so for some . Then . Since is a subset of , and both are in , this is a violation of the statement satisfies WARP on , hence a contradiction. Suppose for contradiction but . Consider any , since , . Since is a subset of , and both are in , this is a violation of the statement satisfies WARP on . Hence . It remains to show that each can be represented by a utility function, but this is standard since is finite and is complete and transitive. ∎
Now we prove the general case where is not finite. “If” is straightforward. I prove “only if”. Using 2, let be WARP and let be the identify function, then there exists a linear order such that satisfies WARP over for every . It is obviously -consistent. Proceed to build using the method outlined in the proof of 3, which gives us a complete and transitive for each such that .
It remains to show that each can be represented by a utility function. Based on our construction, is complete and transitive on . Moreover, it is continuous (, , and for each implies ) when restricted to the prediction set , otherwise a contradiction of Continuity would be detected in the choices from a sequence of choice problems of form that converges to ( guarantees that will not be the only one chosen in any of these sets, so that a contradiction of, say, , will be substantiated in choice: ) . Therefore, along with the fact is a subset of the separable metric space , admits a (continuous) utility function that represents when restricted to the alternatives in . Now define for all . Now also represents for all and and for all . And we are done. Finally, since our system of and explains , which satisfies Continuity, so has a closed graph.
A.2 Proof Outline of Theorems 2, 3, 4
The proofs for these theorems are repetitive and cannot be streamlined due to domain-specific details, I outline key ideas here and relegate complete proofs to B.
Step 1: Reference order
In their respective domains, 4, 7, and 10 prescribe ’s that are -correspondences. Moreover, WARP, Independence, Stationarity, and Quasi-linearity are finite theories. Therefore, Risk Reference Dependence (3.2), Time Reference Dependence (4.2), and Equality Reference Dependence (5.2) each qualifies as a special case of the “meta” axiom Reference Dependence. By invoking 2, we obtain a linear order that is -consistent such that for all (resp. and ), satisfies WARP and Independence (resp. Stationarity, Quasi-linearity) over .
Step 2: Fixed reference, standard representation
Next is to show that for each alternative , the subcorrespondence admits a standard representation of its respective domain (i.e., expected utility, exponential discounting, quasi-linear utility). This is not obvious; for example in the risk domain, satisfies WARP and Independence (and Continuity) over , which is a strict subset of all choice problems, so standard postulates could be insufficient.333333For example, if and where , , and , then satisfies WARP and Independence over but does not admit an expected utility representation (because, even though the lines and are parallel, are not related to by a common mixture). This issue is resolved by exploiting the structure provided by a -consistent linear order. In each domain, it guarantees that (for each ) the strict prediction set is rich in a sense that behavior inconsistent with the structural properties can always be substantiated with observations from within . For example in the risk domain, we first show that an expected utility representation, with , can be obtained for subcorrespondence where is a subset of and is a linear transformation of a dimensional simplex set; the existence of is given by the -consistent linear order, which determines which alternatives are in and in turn determines which choice sets are in . Then, for in but possibly outside , if , it can be shown that there exist common mixtures , in such that , and Independence requires (assuming ). Analogous methods, all derived using features of -consistent linear orders, guarantee the sufficiency of standard postulates in the time and social preference domains.
Step 3: Reference-dependent preferences
3.3, 4.3, and 5.3 each provides a “direction” for preference change, along the reference order, that must has been satisfied in the constructed representations. 3.3, 4.3, and 5.3 impose restrictions on behavior when a choice set expands, which necessarily imply that a reference point, if it changes, ranks higher in . If the constructed representations violate the direction of preference change from reference to where , then it can be shown that there exist choice problems and such that where 3.3 / 4.3 / 5.3 is violated when we compare and . Like in Step 2, the existence of axiom-violating choice behavior in the underlying subcorrespondences and is guaranteed by -consistent linear orders. For the time domain, 4.4 additionally guarantees a persistent consumption utility, so that reference effect can be summarized by changes in discount factors (in general, both discount factor and consumption utility can change).
A.3 Proof of Lemma 1
Only if: Fix and . Consider . For any , since , satisfies WARP and Stationarity over . Therefore, satisfies WARP and Stationarity over (because WARP and Stationarity are restrictions on pairs of choices). If: Take any such that . Take . Consider . Since and are both finite, is finite, and therefore . Since , , and so satisfies WARP and Stationarity over , which contains and by construction and we are done.
Supplemental Materials
(Online)
Appendix B Online Appendix: Omitted Proofs and Results
Let be an arbitrary set of alternatives and let be the set of all finite and nonempty subsets of , called choice problems or choice sets. Let be the set of all general choice correspondences such that and for all . For a general choice correspondence with domain , we simply call it a choice correspondence. Call a subcorrespondence of if and whenever defined. If, furthermore, is finite, then call a finite subcorrespondence of .
A behavioral postulate imposed on general choice correspondences can be captured using a subset of , where some general choice correspondences are admitted and others excluded. In line with how behavioral postulates are typically introduced, I focus on postulates that are easier to satisfy when fewer observations are considered, and call them theories.
Definition 12.
-
1.
is a theory if for all , every subcorrespondence of is in .
-
2.
is a finite theory if it is a theory and for all , there exists a finite subcorrespondence of that is not in .
Postulates that place restrictions on finitely many choice sets at a time are finite theories, such as the common definitions of WARP, monotonicity, transitivity, convexity, betweenness, separability, independence, stationarity, and many others. These are the cases where non-compliance can always be concluded using finitely many observations. An empirically falsifiable property need not be a finite theory, but a finite theory is empirically falsifiable unless it is trivial (i.e., ).343434It is commonly understood that an empirically falsifiable property is one that can be falsified with finitely many observations (i.e., there exists such that ). Consider the combination of WARP and some version of continuity, it is a theory, and it is empirically falsifiable since WARP needs just two observations to falsify. Yet in the absence of WARP violations, a choice correspondence can still violate continuity, which is a non-compliance that cannot be substantiated with finitely many observations. Non-examples include various versions of continuity and infinite acyclicity since they require an infinite number of observations to substantiate a violation. When is finite, every theory is trivially a finite theory.
Imposing multiple postulates, and , is equivalent to taking the intersection . Because taking intersection of theories (resp. finite theories) yields a theory (resp. finite theory), this characterization can simultaneously account for multiple postulates (or, a model).353535Theory: Consider any . For any where , since are theories, we have , and hence , so is a theory. Finite theory: Suppose and are finite theories, which are theories, and so is a theory. Consider any . Without loss of generality say , so by definition of finite theory we can find a finite subcorrespondence of where , which means .
B.1 Reference-dependent
In general, it is possible that is not in but its subcorrespondence is in , for which I say “ satisfies over ”. 2 provides the foundation for all four models in this paper. It introduces a reference-dependent generalization of a generic behavioral postulate, , and shows that it is equivalent to a representation in which observations are partitioned using a reference order such that holds within each part.3636362 falls short of delivering the target utility representation (of due to the well-known limitation of an incomplete dataset—when only a subset of choices are observed, canonical postulates may not be sufficient for canonical utility representation.
When is the identity function, the first condition in 2 is satisfied when, for each choice problem , some alternative serves as an anchor that guarantees compliance with finite theory in subsets of . In anticipation, this anchor is a potential reference alternative for , so the condition can be understood as “there is a reference in every ”. When is not the identity function, we further demand that a potential reference alternative can be found in a predetermined subset of the choice problem, , making reference formation less subjective. The case of fully objective reference is captured when is a singleton for all , since it fully pins down the reference.
Note that since every choice set is finite, Reference Dependence is both falsifiable (whenever is) and can be written without an explicit existential quantifier. However, the current formulation may be most suitable for describing a universal template of reference-dependent generalization. Applications of this formulation without an existential quantifier are considered in 4 (time preference) and 5 (social preference).
B.2 Proof of Lemma 2
The proof for (2) implies (1) is straightforward: For every (finite) set , the maximizer of the linear order is an “” in (1). We focus on the proof for (1) implies (2). The proof for (2) implies (1) begins with an observation using Zermelo’s well-ordering theorem and transfinite recursion, and then uses it build a reference order given an arbitrary finite theory (12).
Lemma 4.
Let be a set and let be the set of all finite and nonempty subsets of . Let be a self-map on such that . Suppose for all and such that , if , then (property ). Then, there exists a self-map on such that
-
(i)
For all , .
-
(ii)
For all and such that , if , then (property ), and
-
(iii)
For all ,
Proof.
We prove this by construction. Assume and invoke Zermelo’s theorem (also known as the well-ordering theorem) to well-order the set of all doubletons in the domain of . Now we start the transfinite recursion using this order.
In the zero case, we have . This correspondence satisfies and is nonempty-valued ( for all ).
For the successor ordinal , having supposed satisfies and is nonempty-valued, we take the corresponding doubleton and take such that , . Suppose such an does not exist, then for both , there are and such that and since is nonempty-valued. Consider . Since is nonempty-valued, . But since satisfies , it must be that , hence . Suppose without loss , then due to again and that , it must be that , which contradicts . (That is, we showed that with nonempty-valuedness and , no two elements can each have a unique appearance in the -image of a set containing those two elements.) Hence, such that , . Define from in the following way: , . Note: (i) Since is deleted from only if it is also deleted (if it is in it at all) from , we are preserving , and (ii) since is never the unique element in , we preserve nonempty-valuedness.
For a limit ordinal , define . Note that since , . Furthermore, for any , is constructed such that and nonempty-valuedness are preserved. Hence satisfies and is nonempty-valued.
Note that this process terminates when all the doubletons have been visited, for we would otherwise have constructed an injection from the class of all ordinals to the set of all doubletons in , which is impossible.
Finally, we check that for all . Suppose not, hence such that . Then by we have , which is not possible as the recursion process has visited and deleted something from . Now set and we are done. ∎
For notational convenience, subcorrespondence of is referred to as , as in “ restricted to ”. Given , for any and , define
Given and a general choice correspondence , let denote the set of reference alternatives of (note that need not be in ). The following observations are obtained when is a finite theory.
Lemma 5.
Let be a general choice correspondence and a finite theory. Consider .
-
1.
If and , then .
-
2.
If for all finite , then .
Proof.
Since implies and since , is a direct consequence of the definition of a theory. For (2), suppose for contradiction . Because is a finite theory, we can find a finite set of choice problems such that . Since the set is finite, . Note that , so the definition of a theory gives , a contradiction. ∎
Now I prove (1) implies (2) in 2. Let be a set-valued function that picks out reference alternatives, formally . Since is a finite theory, by point 1 of 5, satisfies property (defined in 4). Furthermore, (1) in 2 guarantees that is nonempty for all . Finally, define by . Since both and satisfy property , satisfies property .
Putting the we just built through 4, we get a function that picks one thing from every set and satisfies property . With this, we build the order by setting if and for all . It is well-known that this results in a linear order such that for all . Since for all , this means is also -consistent.
Finally, consider the set of alternatives that are “reference dominated” by according to (including itself), denoted by
For any finite subset such that , we have , which by definition implies is a reference alternative of . Using point 2 of 5, we conclude that is reference alternative for , which need not be finite.
To summarize, we have created a partition of where the parts are characterized by . To see this, take any , since is a linear order, there is a unique such that for all , and so and for any . Furthermore for each part , is in . Since is simply , the proof is complete.
B.3 Proof of Theorem 2
“If” is straightforward. I prove “only if”. We interpret as a dimensional simplex, and full-dimensional means dimensional. Also, where denotes the set of lotteries that only put non-zero probabilities on prizes and , we partition into three parts: , , and . Stage 1 builds the reference order . Stage 2 provides basic results about the prediction set of each reference lottery. Stage 3 builds a (Bernoulli) utility function for each and Stage 4 shows that they are related by concave transformations. Stage 5 deals with .
For any , let . For any , let . We call these prediction sets. Note that if , then the fact that satisfies WARP over implies . For any and lotteries , we call a -common mixture of if for some and , we have , , and .
Stage 1: Reference order
Stage 2: Technical Preparations
The next results guarantee that the revealed preference relation constructed using subcorrespondence , where is given, is complete and transitive on a full-dimensional convex subset of . This is due in large part to being risk-consistent, and because of it, choices that further satisfy Independence will have an expected utility representation.
Lemma 6.
For any and any open ball that contains , contains a full-dimensional convex subset of .
Proof.
Take any . By definition, for some . Consider the set . It consists of , all extreme spreads of , and all of their mean-preserving spreads.
To see is convex: First note that since is a convex set and is on the boundary of , so is convex. Take any two lotteries and consider their convex combination for some . Since , there exist such that either or and either or . If for both , then and by convexity of we are done. Suppose for some , then since the mean-preserving spread relation is preserved under convex combinations, we have . Then, since by the convexity of , we have .
To see is full-dimensional: For any , is dimensional, and it is a subset of the dimensional space defined by lotteries that have the same mean as . But contains lotteries that do not have the same mean as , and therefore is full-dimensional. This means is full dimensional as well since it contains for some .
To see : If , since is risk-consistent. If , for some since is risk-consistent, and by transitivity of we have . Since is also a full-dimensional and convex set, is a full-dimensional convex subset of . ∎
Lemma 7.
For any , contains a full-dimensional convex subset of .
Proof.
Fix . Note that contains an extreme spread of (else, there is a sequence of alternatives such that converges from above to such that for all , which by Continuity means , a violation of FOSD (3.1)). Consider . Since , . Since satisfies Independence over and , we establish . By Continuity, there exists an open ball around such that for all . By 6, contains a full-dimensional convex subset of . Moreover, , hence contains a full-dimensional convex subset of . ∎
Lemma 8.
Fix any . If , then contains a full-dimensional convex subset of . If , then contains a full-dimensional convex subset of .
Proof.
Fix and . Since , the set of extreme spreads of , , is nonempty. Also, . Since satisfies WARP over and , for all , so we can use the same technique in the proof of 7 to establish that contains an extreme spread of (else, by Continuity, which violates FOSD (3.1)). Consider . Because and , so . Since satisfies Independence over and , we establish . By Continuity, there exists an open ball around such that for all . By 6, contains a full-dimensional convex subset of . Moreover, , hence contains a full-dimensional convex subset of . The second statement is given by the first statement and the observation that satisfies WARP over implies . ∎
Stage 3: Expected utility when
7 and 8 establish that when , contains a full-dimensional convex subset of . The next result shows that for every , the subcorrespondence admits an expected utility representation.
Lemma 9.
For any , if contains a full-dimensional convex subset of , then there exists a strictly increasing utility function , unique up to a positive affine transformation, such that for all .
Proof.
Since contains a full-dimensional convex subset of , consider a subset that is a linear transformation of a dimensional simplex (hence also full-dimensional and convex). First, notice that for all , we have and . Recall that satisfies WARP and Independence over . By letting if , we obtain a binary relation that is complete, transitive, continuous, and satisfies the standard von Neumann-Morgenstern Independence, and it is well-known that there exists a utility function , unique up to a positive affine transformation, such that for all . Since satisfies FOSD (3.1), is strictly increasing. We normalize this function to where and .
We now show that this utility function can explain . First, note that for any two lotteries , there exist two (possibly different) lotteries such that is a -common mixture of . This can be done by taking an arbitrary and large enough so that both and enter (this is why we need to be full-dimensional and convex). Now consider any and let be a -common mixture of . Since satisfies Independence over , for , if and only if . Now take any such that and , then again by Independence over , if and only if , where is a -common mixture of . We have thus shown that for all and for all with and . Since satisfies WARP over , showing for all is straightforward from here. ∎
Stage 4: Concave transformations when
Lemma 10.
For any , if , then for some concave and strictly increasing function .
Proof.
This proof uses 3.3. Take any such that . Consider the function whose domain is the set of numbers such that . Since and are strictly increasing, is strictly increasing in its domain.
We show that if , then where solves . Suppose not, then there exists , strictly greater than , such that . Consider the lotteries and . The above equations give and . Let be a -common mixture of where is a full-dimensional convex subset of if and of otherwise (8 guarantees the existence of ). Let be a -common mixture of where is a full-dimensional convex subset of . Since explains and explains , we have and . Now consider , which is in , and so . Because we have established , , and for (the first two inequality are due to the way was picked), so we know . But and jointly violate 3.3.
To complete the proof, extend to a concave function (for example by connecting points with lines). ∎
Lemma 11.
For any and , either first-order stochastically dominates or first-order stochastically dominates .
Proof.
Take and , . Let , then . If and , then is an extreme spread of and , so . Furthermore, it is not possible that and since . Hence either and with at least one strict inequality, so first-order stochastically dominates , or and with at least one strict inequality, so first-order stochastically dominates . ∎
Lemma 12.
For any , if , then for some concave and increasing function .
Proof.
We use the proof in 10 with the following modifications. When , let be a -common mixture of , where is a full-dimensional convex subset of . (Before, we let be a full-dimensional convex subset of when , but now such a subset may not exist since ). Since and 11 guarantees and each first-order stochastically dominates , we replace the argument “” with “”. Everything else goes through according to the proof in 10, giving the desired result. ∎
Stage 5: Expected utility when and concave transformations by construction
We are left with , the alternatives in that are weakly preferred to everything they reference dominate. The construction of can be partly arbitrary, where the main goal is to make sure they are related by concave transformations to other utility functions.
By definition of , , so by 11 and FOSD (3.1), first-order stochastically dominates for all . For any , FOSD requires the choice to obey first order stochastic dominance. Together, any strictly increasing utility function will accomplish for all .
We now construct so that it is related to other utility functions by concave transformations. For any strictly increasing utility function , consider the object such that for all ,
(B.1) |
(so satisfies ). There is a one-to-one relationship between and . Also, it is an algebraic exercise to show that for some concave and strictly increasing if and only if for all .
Fix . Let , where , and subsequently construct using (LABEL:rhos), which is possible as long as is nonempty. Note that when , must be the mean preserving spread of something in , so is nonempty, and so is nonempty. In the exception where and is empty, this implies for all . Then, we let
for all and construct using (LABEL:rhos). For any , this construction results in for all , with equality for that also falls into this exception (there are at most two of them, and ).
We now show that for any where , we have for all . This is already shown for any by 12. It is also already shown for the special cases in the preceding paragraph, by careful construction. Hence, we restrict attention to the remaining cases. Suppose for some . Then , so for some . However, since is transitive, implies ; and since , this contradicts 12. Say , , but for some . Then , so for all . But , a contradiction. Finally, for , either or . If it is the former, it is immediate that . If it is the later, then for all , as desired.
Thus, we have now shown that for any such that , for all , or equivalently for some concave and strictly increasing .
B.4 Proof of Theorem 3
“If” is straightforward, where compliance with 4.4 is shown in a footnote. I prove “only if”. In Stage 1, we show that with 4.1 and 4.2, for any time , the set of all choice problems such that the earliest payment arrives at time can be explained by a nonempty set of Discounted Utility specifications, where a typical element of this set is a utility function and a discount factor. In Stage 2, we show that at least one (consumption) utility function can be supported for all , and for each we set as the corresponding discount factor associated with for ; this is the more involved portion of the proof and it uses 4.4. In Stage 3, with 4.3, we show the desired relationship between and for any two . Note that the representation constructed has discount factors indexed by time, not alternatives, so in Stage 4 we convert them back to alternatives.
Stage 1: DU representation for each
By 1 and 2, for any and , satisfies WARP and Stationarity over (the collection of choice sets such that the earliest timed payment is ). In fact, WARP and Stationarity hold even when we consolidate the collection of choice problems where the earliest payment arrives at the same time (although the payments themselves may be different), which we now show. Let .
Lemma 13.
For any , satisfies WARP and Stationarity over .
Proof.
Take any two choice sets . Suppose it is not true that satisfies WARP or Stationarity over . Therefore, it must be that . Now let’s take the worse payment at for each set: such that for all and such that for all . Suppose without loss of generality (due to ). By 4.1, adding to would not alter the choice, i.e., . Let ; note that and are both in , and therefore satisfies WARP or Stationarity over . If it is Stationarity that is violated between and , then it is also violated between and , a contradiction. If it is WARP that is violated between and , it remains to show, due to , if then and there is a contradiction, whereas if then and there is a contradiction. ∎
We just established that satisfies WARP and Stationarity over . This will give us, from the choices in , a revealed preference relation on that is complete, transitive, continuous, and satisfies stationarity, and then it is well-known (Fishburn and Rubinstein (1982)) that along with 4.1 we obtain (many) Discounted Utility (DU) representations, for instance by translating the time-index by so that time is, in that instance, time .
Stage 2: can coincide with for each
With existence guaranteed, arbitrarily pick a DU representation with parameters that explains . Define by . For every , arbitrarily pick a DU representation that explains and define by . We proceed to show that for every , there exists a DU representation that explains where . Fix a . This boils down to identifying a certain relationship between and due to the fact that they are DU representations and 4.4—indifferences are preserved under a common delay multiplier .
Fact 1.
For any , , and , if and only if .
Lemma 14.
For any (resp. and ), there exists an open interval (resp. proper interval where and proper interval where ) that contains such that for some unique , if and only if for all .
Proof.
Fix any . Consider . Since is continuous and decreasing in it’s second argument, there exists such that . Since there exists an open interval in that contains , by continuity of , there exists an open interval in that contains such that implies for some . Observation: for every such that , since we have for some , for some , and 13, we have in .
Now consider an open interval that contains . Consider any where . We show an intermediate result that (i) if and only if (ii) . Say (i) holds (for some ). Due to the observation, , and 13, we have where . Due to the observation and , we have for some . Consider the choice set , and note that is related to by transforming the time of each timed payment in from to , where and . Then, invoking 4.4 gives , which gives (ii) as desired. The converse, (ii) implies (i), can be shown analogously. Due to 1, we also note that if and only if .
Consider any . There exist such that for all . The intermediate result gives, for all , if and only if , so if and only if , so by 1, if and only if , which means if and only if where . Note that is independent of , hence the same applies to relate and to relate . Invoking 1 once more completes the proof for the existence of . Since , where are the unique solutions to and , therefore is unique (for the given ).
For and , the proof is similar other than we replace open intervals with half-open intervals and . ∎
Lemma 15.
There exists such that for all , if and only if . Moreover, is unique.
Proof.
Let be the collection intervals guaranteed by 14. Note that is an open cover of the closed and bounded interval , so a finite subcover is guaranteed by the Heine–Borel theorem. Consider a finite sequence of intervals in , , such that the first interval is , last interval is , and for all , . This is guaranteed by the fact that is a cover of and the intervals in are open except for and . Then, for every two consecutive intervals , the unique ’s guaranteed by 14, one for and another for , must coincide due to the nondegenerate intersection . Iterating through this finite sequence of intersecting consecutive intervals guarantees, for every , an increasing sequence of payments such that , , and for some , if and only if for all . The rest is straightforward using 1 (for example if , we have if and only if , which completes the proof since ). ∎
To recover , take any such that . For some and , and . Then since we must have , we have . With 15, we conclude that where and is a DU representation for .
The analysis thus far was for . When , since every choice problem in contains only timed payments that pay at time , a DU representation is trivially established with any positive and any strictly increasing . Therefore, we set and (this is why we cannot guarantee , even if 4.1 gives us for all ). From now on, we remove subscript from and simply write .
Stage 3: for all
If , this is trivial from the construction of . Consider any . Continuity of and guarantee the existence of such that and for some , with which we obtain and . Note that by 4.1, , so and . Suppose for contradiction , then , or equivalently . Note also that implies . So implies . Putting together what we established, we have , , , and , which implies . By Continuity of and , if we consider for some sufficiently small, we have and , which jointly contradict 4.3 because and where .
Stage 4: and
Create a complete, transitive, and antisymmetric on such that implies , which involves an arbitrary completion between when and when , and set, for every , .
B.5 Proof of Theorem 4
“If” is straightforward. I prove “only if”. Stage 1 and Stage 2 show that with 5.2 and 5.1, for each Gini coefficient , the set of all choice problems where the most balanced alternative has Gini coefficient can be explained by the maximization of for some unique . Stage 3 shows that implies for all . Stage 4 builds the reference order using Gini coefficient and arbitrary completion.
Stage 1: for each alternative
Fix . Like before, let , , and .
Note that for all , if and only if . Since satisfies Quasi-linearity over (5.2), restricted to the domain (which contains itself) must be a strictly increasing transformation of for some unique . B.1 provides an illustration of how is constructed, and is our target, quasi-linear, utility function. It is straightforward that for all such that , the maximization of gives . Next, we show that this consistency applies to other . For any , there is no such that , so we just need to guarantee . Suppose for contradiction this inequality fails. Since for some we have , and therefore , the fact that and are both in but (because ) contradicts satisfies Quasi-linearity over . It remains to consider the consistency of for alternative , but this is immediate since there is no such that . So explains .

Stage 2: for each Gini coefficient
Fix , we now show that must coincide for all where . Consider the collection of choice sets . It turns out that satisfies WARP and Quasi-linearity over . To see this, take any two choice problems in . For each , there must be an alternative such that and for all other in . Consider an income distribution such that and and . Due to , 5.2, and 5.1, we have for . But , so by 5.2 again, between and , which as established are equal to and respectively, WARP and Quasi-linearity must hold. Since satisfies WARP and Quasi-linearity over , there is a unique such that the utility function explains . But every constructed in Stage 1 is also unique, and if , so must coincide for all such that .
Stage 3: implies for all
Finally we show that the constructed are systematically related. Consider any such that (reminder: lower implies greater attainable equality) and any such that . Define and . We want to show . Suppose not, our goal is to find a contradiction of 5.3 in choice behavior.
Let be a number such that . Consider such that and , which exist because is continuous and increasing in from to and . Consider , for some such that and , where exists because for any fixed , is asymptotically increasing in and , and . Essentially, we have introduced reference points that will not be chosen (due in part to 5.1), forcing the choice to be between and .
We now use the constructed alternatives, , to demonstrate a violation of 5.3. For the choice problem , is the reference (so is used) and cannot be chosen. Since , or equivalently , we have , and therefore
(B.2) |
By analogous arguments, gives ( is used), which also gives
(B.3) |
due to 5.2 and . Since , (LABEL:fspu_increasing_v_4) and (LABEL:fspu_increasing_v_5) jointly contradict 5.3.
Stage 4: on
Create a complete, transitive, and antisymmetric on such that implies , which involves an arbitrary completion when .
B.6 Proof of Propositions 1, 2, 3
I focus on showing that WARP (1) and structural postulate (2) are independently sufficient for the standard model (3). The remaining statements, that WARP and structural postulates are necessary for standard models ((1) if (3) and (2) if (3)), and that WARP is sufficient and necessary for a (general) utility representation ((1) if and only if (4)), are well-known and omitted.
Proof of 1: (1) / (2) implies (3)
Suppose a choice correspondence admits an AREU representation with specification . Suppose satisfies WARP or Independence (or both). We first show that for all . Suppose without loss of generality . Suppose for contradiction , then the fact that is a concave transformation of and that both functions are normalized to implies for all . Consider the set . The interior of this set, , consists of lotteries that are extreme spreads of , hence . By 3.1, . Then by Continuity, there exist open balls around , and , such that they contain lotteries that are chosen over and respectively. Now consider an open subset of . Since for all , we can find lotteries such that and . This means
(B.4) | ||||
(B.5) |
Consider , , and , then , and therefore
(B.6) |
Finally we conclude that (LABEL:warp1_1) and (LABEL:warp1_2) jointly violate WARP, whereas (LABEL:warp1_1) and (LABEL:warp1_3) jointly violate Independence.
Next we turn to and show that is either identical, or has the freedom to be identical, to where . If or or , then any strictly increasing can explain over , so we can just pick one that is identical to for every . If doesn’t satisfy any of those conditions, then there exists such that is an extreme spread of and there exists . This implies is a concave transformation of (because ) and is a concave transformation of (because ), but we already showed that (since , so is identical to for all . We conclude that if either WARP or Independence (or both) holds, then admits an expected utility representation.
Proof of 2: (1) / (2) implies (3)
Suppose a choice correspondence admits a PEDU representation with specification . We show that if for some , then violates both WARP and Stationarity. ( only plays a role for choice problems where every alternative has , so we set it as .)
Suppose for contradiction for some . Say without loss of generality , then . Recall that . Consider alternatives such that (i) , (ii) , (iii) , and (iv) , which is possible due in part to the assumption that . Note that (i) and (ii) guarantee . Then, (iii) gives
(B.7) |
and (iv) gives
(B.8) | ||||
(B.9) |
where (LABEL:warp2_7) is due in part to the assumption that as it excludes from being uniquely chosen. Now note that (LABEL:warp2_5) and (LABEL:warp2_6) jointly violate Stationarity, whereas (LABEL:warp2_5) and (LABEL:warp2_7) jointly violate WARP. We conclude that if either WARP or Stationarity (or both) holds, then admits an exponential discounting utility representation.
Proof of 3: (1) / (2) implies (3)
Suppose a choice correspondence admits an FSPU representation with specification . We show that if for some and , then violates both WARP and Quasi-linearity.
Suppose for contradiction for some and . Without loss of generality, say . Then , and therefore there exist such that and . Consider such that and , which is possible since is continuous and increasing in it’s first argument from to . Since for any fixed , is asymptotically increasing in it’s first argument, there exists such that and . Let and . We have now established that (i) , (ii) , (iii) , and (iv) .
Then, (i), (ii), and (iii) together give
(B.10) |
whereas (i) and (iv) together give
(B.11) | ||||
(B.12) |
Note that (LABEL:warp3_3) and (LABEL:warp3_5) jointly violate Quasi-linearity. Separately, by WARP, (LABEL:warp3_3) and (LABEL:warp3_4) imply and respectively, which is also a contradiction. We conclude that if either WARP or Quasi-linearity (or both) holds, then admits a quasi-linear utility representation.