Top Banner
Estimating Risk Preferences from Deductible Choice By ALMA COHEN AND LIRAN EINAV* We develop a structural econometric model to estimate risk preferences from data on deductible choices in auto insurance contracts. We account for adverse selection by modeling unobserved heterogeneity in both risk (claim rate) and risk aversion. We find large and skewed heterogeneity in risk attitudes. In addition, women are more risk averse than men, risk aversion exhibits a U-shape with respect to age, and proxies for income and wealth are positively associated with absolute risk aversion. Finally, unobserved heterogeneity in risk aversion is greater than that of risk, and, as we illustrate, has important implications for insurance pricing. (JEL D81, G22) The analysis of decisions under uncertainty is central to many fields in economics, such as macroeconomics, finance, and insurance. In many of these applications it is important to know the degree of risk aversion, how hetero- geneous individuals are in their attitudes toward risk, and how these attitudes vary with individ- uals’ characteristics. Somewhat surprisingly, these questions have received only little atten- tion in empirical microeconomics, so answering them using direct evidence from risky decisions made by actual market participants is important. In this study, we address these questions by estimating risk preferences from the choice of deductible in insurance contracts. We use a rich dataset of more than 100,000 individuals choos- ing from an individual-specific menu of deduct- ible and premium combinations offered by an Israeli auto insurance company. An individual who chooses a low deductible is exposed to less risk, but is faced with a higher level of expected expenditure. Thus, an individual’s decision to choose a low (high) deductible provides a lower (upper) bound for his coefficient of absolute risk aversion. Inferring risk preferences from insurance data is particularly appealing, as risk aversion is the primary reason for the existence of insurance markets. To the extent that extrapolating utility parameters from one market context to another necessitates additional assumptions, there is an advantage to obtaining such parameters from the same markets to which they are subse- quently applied. The deductible choice is (al- most) an ideal setting for estimating risk aversion in this context. Other insurance deci- sions, such as the choice among health plans, annuities, or just whether to insure or not, may involve additional preference-based explana- tions that are unrelated to financial risk and make inference about risk aversion difficult. 1 In * Cohen: Department of Economics, Tel Aviv Univer- sity, P.O. Box 39040, Ramat Aviv, Tel Aviv 69978, Israel, National Bureau of Economic Research, and Harvard Law School John M. Olin Research Center for Law, Economics, and Business (e-mail: [email protected]); Einav: Depart- ment of Economics, Stanford University, Stanford, CA 94305-6072, and NBER (e-mail: [email protected]). We thank two anonymous referees and Judy Chevalier (the coeditor) for many helpful comments, which greatly im- proved the paper. We are also grateful to Susan Athey, Lucian Bebchuk, Lanier Benkard, Tim Bresnahan, Raj Chetty, Ignacio Esponda, Amy Finkelstein, Igal Hendel, Mark Israel, Felix Kubler, Jon Levin, Aviv Nevo, Harry Paarsch, Daniele Paserman, Peter Rossi, Esteban Rossi- Hansberg, Morten Sorensen, Manuel Trajtenberg, Frank Wolak, and seminar participants at University of California at Berkeley, CEMFI, University of Chicago Graduate School of Business, Duke University, Haas School of Busi- ness, Harvard University, the Hoover Institution, the Inter- disciplinary Center in Herzliya, University of Minnesota, Northwestern University, Penn State University, Stanford University, Tel Aviv University, University of Wisconsin— Madison, The Wharton School, SITE 2004, the Economet- ric Society 2005 and 2006 winter meetings, and the NBER IO 2005 winter meeting for many useful discussions and suggestions. All remaining errors are, of course, ours. Einav acknowledges financial support from the National Science Foundation (SES-0452555) and from Stanford’s Office of Technology Licensing, as well as the hospitality of the Hoover Institution. 1 For example, Matthew Rabin and Richard H. Thaler (2001, fn. 2) point out that one of their colleagues buys the insurance analyzed by Charles J. Cicchetti and Jeffrey A. Dubin (1994) in order to improve the service he will get in the event of a claim. We think that our deductible choice analysis is immune to such critique. 745
44

Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

Jun 26, 2020

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

Estimating Risk Preferences from Deductible Choice

By ALMA COHEN AND LIRAN EINAV

We develop a structural econometric model to estimate risk preferences from dataon deductible choices in auto insurance contracts We account for adverse selectionby modeling unobserved heterogeneity in both risk (claim rate) and risk aversionWe find large and skewed heterogeneity in risk attitudes In addition women aremore risk averse than men risk aversion exhibits a U-shape with respect to age andproxies for income and wealth are positively associated with absolute risk aversionFinally unobserved heterogeneity in risk aversion is greater than that of risk andas we illustrate has important implications for insurance pricing (JEL D81 G22)

The analysis of decisions under uncertainty iscentral to many fields in economics such asmacroeconomics finance and insurance Inmany of these applications it is important toknow the degree of risk aversion how hetero-geneous individuals are in their attitudes towardrisk and how these attitudes vary with individ-ualsrsquo characteristics Somewhat surprisinglythese questions have received only little atten-tion in empirical microeconomics so answering

them using direct evidence from risky decisionsmade by actual market participants is important

In this study we address these questions byestimating risk preferences from the choice ofdeductible in insurance contracts We use a richdataset of more than 100000 individuals choos-ing from an individual-specific menu of deduct-ible and premium combinations offered by anIsraeli auto insurance company An individualwho chooses a low deductible is exposed to lessrisk but is faced with a higher level of expectedexpenditure Thus an individualrsquos decision tochoose a low (high) deductible provides a lower(upper) bound for his coefficient of absolute riskaversion

Inferring risk preferences from insurance datais particularly appealing as risk aversion is theprimary reason for the existence of insurancemarkets To the extent that extrapolating utilityparameters from one market context to anothernecessitates additional assumptions there is anadvantage to obtaining such parameters fromthe same markets to which they are subse-quently applied The deductible choice is (al-most) an ideal setting for estimating riskaversion in this context Other insurance deci-sions such as the choice among health plansannuities or just whether to insure or not mayinvolve additional preference-based explana-tions that are unrelated to financial risk andmake inference about risk aversion difficult1 In

Cohen Department of Economics Tel Aviv Univer-sity PO Box 39040 Ramat Aviv Tel Aviv 69978 IsraelNational Bureau of Economic Research and Harvard LawSchool John M Olin Research Center for Law Economicsand Business (e-mail almacposttauacil) Einav Depart-ment of Economics Stanford University Stanford CA94305-6072 and NBER (e-mail leinavstanfordedu) Wethank two anonymous referees and Judy Chevalier (thecoeditor) for many helpful comments which greatly im-proved the paper We are also grateful to Susan AtheyLucian Bebchuk Lanier Benkard Tim Bresnahan RajChetty Ignacio Esponda Amy Finkelstein Igal HendelMark Israel Felix Kubler Jon Levin Aviv Nevo HarryPaarsch Daniele Paserman Peter Rossi Esteban Rossi-Hansberg Morten Sorensen Manuel Trajtenberg FrankWolak and seminar participants at University of Californiaat Berkeley CEMFI University of Chicago GraduateSchool of Business Duke University Haas School of Busi-ness Harvard University the Hoover Institution the Inter-disciplinary Center in Herzliya University of MinnesotaNorthwestern University Penn State University StanfordUniversity Tel Aviv University University of WisconsinmdashMadison The Wharton School SITE 2004 the Economet-ric Society 2005 and 2006 winter meetings and the NBERIO 2005 winter meeting for many useful discussions andsuggestions All remaining errors are of course ours Einavacknowledges financial support from the National ScienceFoundation (SES-0452555) and from Stanfordrsquos Office ofTechnology Licensing as well as the hospitality of theHoover Institution

1 For example Matthew Rabin and Richard H Thaler(2001 fn 2) point out that one of their colleagues buys theinsurance analyzed by Charles J Cicchetti and Jeffrey ADubin (1994) in order to improve the service he will get inthe event of a claim We think that our deductible choiceanalysis is immune to such critique

745

contrast the choice among different alternativesthat vary only in their financial parameters (thelevels of deductibles and premiums) is a case inwhich the effect of risk aversion can be moreplausibly isolated and estimated

The average deductible menu in our dataoffers an individual to pay an additional pre-mium of $55 (US) in order to save $182 indeductible payments in the event of a claim2 Arisk-neutral individual should choose a low de-ductible if and only if his claim propensity isgreater than the ratio between the premium($55) and the potential saving ($182) which is03 Although this pricing is actuarially unfairwith respect to the average claim rate of 024518 percent of the sample choose to purchase itAre these individuals exposed to greater riskthan the average individual are they more riskaverse or are they a combination of both Weanswer this question by developing a structuraleconometric model and estimating the joint dis-tribution of risk and risk aversion

Our benchmark specification uses expectedutility theory to model individualsrsquo deductiblechoices as a function of two utility parametersthe coefficient of absolute risk aversion and aclaim rate We allow both utility parameters todepend on individualsrsquo observable and unob-servable characteristics and assume that there isno moral hazard Two key assumptionsmdashthatclaims are generated by a Poisson process at theindividual level and that individuals have per-fect information about their Poisson claimratesmdashallow us to use data on (ex post) realizedclaims to estimate the distribution of (ex ante)claim rates Variation in the deductible menusacross individuals and their choices from thesemenus are then used to estimate the distributionof risk aversion in the sample and the correla-tion between risk aversion and claim risk Thuswe can estimate heterogeneous risk preferencesfrom deductible choices accounting for adverseselection (unobserved heterogeneity in claimrisk) which is an important confoundingfactor3

Our results suggest that heterogeneity inrisk preferences is rather large While themajority of the individuals are estimated to beclose to risk neutral with respect to lotteriesof $100 magnitude a significant fraction ofthe individuals in our sample exhibit signifi-cant levels of risk aversion even with respectto such relatively small bets Overall an in-dividual with the average risk aversion pa-rameter in our sample is indifferent aboutparticipating in a 50 ndash50 lottery in which hegains $100 or loses $56 We find that womenare more risk averse than men that risk pref-erences exhibit a U-shape with respect to ageand interestingly that most proxies for incomeand wealth are positively associated with abso-lute risk aversion

We perform an array of tests to verify thatthese qualitative results are robust to deviationfrom the modeling assumptions In particularwe explore alternative distributional assump-tions alternative restrictions on the von Neu-mannndashMorgenstern (vNM) utility function anda case in which individuals are allowed to makeldquomistakesrdquo in their coverage choices due to in-complete information about their own risk typesWe also show that the risk preferences we es-timate are stable over time and help predictother (but closely related) insurance decisionsFinally we justify our assumption to abstractfrom moral hazard and we discuss the way thisand other features of the setup (sample selectionand additional costs associated with an acci-dent) may affect the interpretation of the result

Throughout we focus primarily on absolute(rather than relative) risk aversion4 This allowsus to take a neutral position with respect to therecent debate over the empirical relevance ofexpected utility theory (Matthew Rabin 2000Rabin and Richard H Thaler 2001 Ariel Ru-binstein 2001 Richard Watt 2002 NicholasBarberis Ming Huang and Thaler 2006 Igna-

2 For ease of comparison we convert many of the re-ported figures from New Israeli Shekels to US dollars It isimportant to keep in mind however that GDP per capita inIsrael was 052ndash056 of that in the United States (067ndash070 when adjusted for PPP) during the observation period

3 Throughout the paper we use the term adverse selec-tion to denote selection on risk while selection on risk

aversion is just selection Some of the literature refers toboth selection mechanisms as adverse selection with thedistinction being between common values (selection on risk)and private values (selection on risk aversion)

4 Primarily as a way to compare our results with otherestimates in the literature Section III also provides esti-mates of relative risk aversion by following the literatureand multiplying our estimates for absolute risk aversion byannual income (in Israel) We obtain high double-digitestimates for the mean individual but below 05 for themedian

746 THE AMERICAN ECONOMIC REVIEW JUNE 2007

cio Palacios-Huerta and Roberto Serrano 2006)While the debate focuses on how the curvatureof the vNM utility function varies with wealthor across different settings we measure thiscurvature only at a particular wealth levelwhatever this wealth level may be By allowingunobserved heterogeneity in this curvatureacross individuals we place no conceptual re-strictions on the relationship between wealthand risk aversion Our estimated distribution ofrisk preferences can be thought of as a convo-lution of the distribution of (relevant) wealthand risk attitudes Avoiding this debate is also adrawback Without taking a stand on the wayabsolute risk preferences vary with sizes andcontexts we cannot discuss how relevant ourestimates are for other settings Obviously wethink they are But since statements about theirexternal relevance are mainly informed by whatwe think and less by what we do we defer thisdiscussion to the concluding section

Our analysis also provides two results regard-ing the relationship between the distribution ofrisk preferences and that of risk First we findthat unobserved heterogeneity in risk aversionis greater and more important (for profits andpricing) than unobserved heterogeneity in riskThis is consistent with the motivation for recenttheoretical work which emphasizes the impor-tance of allowing for preference heterogeneityin analyzing insurance markets (Michael Lands-berger and Isaac Meilijson 1999 Michael Smart2000 David de Meza and David C Webb 2001Bertrand Villeneuve 2003 Pierre-Andre Chiap-pori et al 2006 Bruno Jullien Bernard Salanieand Francois Salanie 2007) Second we findthat unobserved risk has a strong positive cor-relation with unobserved risk aversion and dis-cuss possible interpretations of it This isencouraging from a theoretical standpoint as itretains a single crossing property which weillustrate in the counterfactual exercise Thisfinding contrasts the negative correlation re-ported by Amy Finkelstein and Kathleen Mc-Garry (2006) for the long-term care insurancemarket and by Mark Israel (2005) for automo-bile insurance in Illinois We view these differ-ent results as cautioning against interpreting thiscorrelation parameter outside the context inwhich it is estimated Even if risk preferencesare stable across contexts risk is not and there-fore neither is the correlation structure

This study is related to two important strands

of literature The first shares our main goal ofmeasuring risk aversion Much of the existingevidence about risk preferences is based onintrospection laboratory experiments (Steven JKachelmeier and Mohamed Shehata 1992 Ver-non L Smith and James M Walker 1993Charles A Holt and Susan K Laury 2002) dataon bettors or television game show participants(Robert Gertner 1993 Andrew Metrick 1995Bruno Jullien and Salanie 2000 Roel MBeetsma and Peter C Schotman 2001 MatildeBombardini and Francesco Trebbi 2005) an-swers given by individuals to hypothetical sur-vey questions (W Kip Viscusi and William NEvans 1990 Evans and Viscusi 1991 Robert BBarsky et al 1997 Bas Donkers BertrandMelenberg and Arthur van Soest 2001 JoopHartog Ada Ferrer-i-Carbonell and NicoleJonker 2002) and estimates that are driven bythe imposed functional form relationship be-tween static risk-taking behavior and intertem-poral substitution5 We are aware of only a fewattempts to recover risk preferences from deci-sions of regular market participants Atanu Saha(1997) looks at firmsrsquo production decisionsand Raj Chetty (2006) recovers risk preferencesfrom labor supply In the context of insuranceCicchetti and Dubin (1994) look at individualsrsquodecisions whether to insure against failure ofinterior telephone wires Compared to theirsetting in our setting events are more frequentand commonly observed stakes are higher thepotential loss (the difference between the de-ductible amounts) is known and the deductiblechoice we analyze is more immune to alter-native preference-based explanations Finallyin a recent working paper Justin Sydnor (2006)uses data on deductible choices in homeownerrsquosinsurance to calibrate a bound for the impliedlevel of risk aversion6 An important differencebetween our paper and these papers is thatthey all rely on a representative individual

5 Much of the finance and macroeconomics literaturegoing back to Irwin Friend and Marshall E Blume (1975)relies on this assumption As noted by Narayana R Kocher-lakota (1996) in a review of this literature the level of staticrisk aversion is still a fairly open question

6 The possibility of using deductibles to make inferencesabout risk aversion was first pointed out by Jacques HDreze (1981) Dreze suggests however relying on theoptimality of the observed contracts (ldquosupply siderdquo infor-mation) while we rely on individualsrsquo choices of deduct-ibles (ldquodemand siderdquo information)

747VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

framework and therefore focus only on thelevel of risk aversion7 In contrast we explicitlymodel observed and unobserved heterogeneityin risk aversion as well as in risk We cantherefore provide results regarding the hetero-geneity in risk preferences and its relationshipwith risk which have potentially important im-plications for welfare and policy A representa-tive individual framework cannot address suchquestions

The second strand of related literature isthe recent empirical literature on adverse se-lection in insurance markets Much of thisliterature addresses the important question ofwhether adverse selection exists in differentmarkets As suggested by the influential workof Chiappori and Salanie (2000) it uses ldquore-duced formrdquo specifications to test whether aftercontrolling for observables accident outcomesand coverage choices are significantly corre-lated (Georges Dionne and Charles Vanasse1992 Robert Puelz and Arthur Snow 1994John Cawley and Tomas Philipson 1999Finkelstein and James Poterba 2004 Finkel-stein and McGarry 2006) Cohen (2005) appliesthis test to our data and finds evidence consis-tent with adverse selection As our main goal isquite different we take a more structural ap-proach By assuming a structure for the adverseselection mechanism we can account for itwhen estimating the distribution of risk prefer-ences While the structure of adverse selectionis assumed its relative importance is not im-posed The structural assumptions allow us toestimate the importance of adverse selectionrelative to the selection induced by unobservedheterogeneity in risk attitudes As we discussin Section IIC this approach is conceptuallysimilar to that of James H Cardon and IgalHendel (2001) who model health insurancechoices and also allow for two dimensions ofunobserved heterogeneity8

The rest of the paper is organized as followsSection I describes the environment the setup

and the data Section II lays out the theoreticalmodel and the related econometric model anddescribes its estimation and identification Sec-tion III describes the results We first provide aset of reduced-form estimates which motivatethe more structural approach We then presentestimates from the benchmark specification aswell as estimates from various extensions androbustness tests We discuss and justify some ofthe modeling assumptions and perform counter-factual analysis as a way to illustrate the impli-cations of the results to profits and pricingSection IV concludes by discussing the rele-vance of the results to other settings

I Data

A Economic Environment and Data Sources

We obtained data from a single insurancecompany that operates in the market for auto-mobile insurance in Israel The data containinformation about all 105800 new policyhold-ers who purchased (annual) policies from thecompany during the first five years of its oper-ation from November 1994 to October 1999Although many of these individuals stayed withthe insurer in subsequent years we focusthrough most of the paper on deductible choicesmade by individuals in their first contract withthe company This allows us to abstract fromthe selection implied by the endogenous choiceof individuals whether to remain with the com-pany or not (Cohen 2003 2005)

The company studied was the first company inthe Israeli auto insurance market that marketedinsurance to customers directly rather thanthrough insurance agents By the end of the stud-ied period the company sold about 7 percent ofthe automobile insurance policies issued in IsraelDirect insurers operate in many countries and ap-pear to have a significant cost advantage (J DavidCummins and Jack L Van Derhei 1979) Thestudied company estimated that selling insurancedirectly results in a cost advantage of roughly 25percent of the administrative costs involved inmarketing and handling policies Despite theircost advantage direct insurers generally have haddifficulty in making inroads beyond a part of themarket because the product does not provide theldquoamenityrdquo of having an agent to work with andturn to (Stephen P DrsquoArcy and Neil A Doherty1990) This aspect of the company clearly makes

7 An exception is Syngjoo Choi et al (2006) who use alaboratory experiment and similar to us find a high degreeof heterogeneity in risk attitudes across individuals

8 In an ongoing project Pierre-Andre Chiappori andBernard Salanie (2006) estimate an equilibrium model ofthe French auto insurance market where their model of thedemand side of the market is conceptually similar to the onewe estimate in this paper

748 THE AMERICAN ECONOMIC REVIEW JUNE 2007

the results of the paper applicable only to thoseconsumers who seriously consider buying directinsurance Section IIID discusses this selection inmore detail

While we focus primarily on the demand sideof the market by modeling the deductible choicethe supply side (pricing) will be relevant for anycounterfactual exercise as well as for understand-ing the viability of the outside option (which wedo not observe and do not model) During the firsttwo years of the companyrsquos operations the pricesit offered were lower by about 20 percent thanthose offered by other conventional insurersThus due to its differentiation and cost advantagethe company had market power with respect toindividuals who were more sensitive to price thanto the disamenity of not having an agent Thismakes monopolistic screening models apply morenaturally than competitive models of insurance(eg Michael Rothschild and Joseph E Stiglitz1976) During the companyrsquos third year of oper-ation (December 1996 to March 1998) it facedmore competitive conditions when the estab-lished companies trying to fight off the new en-trant lowered the premiums for policies withregular deductibles to the levels offered by thecompany In their remaining period included inthe data the established companies raisedtheir premiums back to previous levels leav-ing the company again with a substantialprice advantage9

For each policy our dataset includes all theinsurerrsquos information about the characteristicsof the policyholder demographic characteris-tics vehicle characteristics and details abouthis driving experience The Appendix providesa list of variables and their definitions andTable 1 provides summary statistics In addi-tion our data include the individual-specificmenu of four deductible and premium combi-nations that the individual was offered (see be-low) the individualrsquos choice from this menuand the realization of risks covered by the pol-icy the length of the period over which it was ineffect the number of claims submitted by the

policyholder and the amounts of the submittedclaims10 Finally we use the zip codes of thepolicyholdersrsquo home addresses11 to augment thedata with proxies for individualsrsquo wealth basedon the Israeli 1995 census12

The policies offered by the insurer (as all pol-icies offered in the studied market) are one-periodpolicies with no commitment on the part of eitherthe insurer or the policyholder13 The policy re-sembles the US version of ldquocomprehensiverdquo in-surance It is not mandatory but it is held by alarge fraction of Israeli car owners (above 70percent according to the companyrsquos executives)The policy does not cover death or injuries to thepolicyholder or to third parties which areinsured through a separate mandatory policyInsurance policies for car audio equipmentwindshield replacement car and towing ser-vices are structured and priced separately Cer-tain types of coverage do not carry a deductibleand are therefore not used in the analysis14

Throughout the paper we use and reportmonetary amounts in current (nominal) NewIsraeli Shekels (NIS) to avoid creating artificialvariation in the data Consequently the follow-ing facts may be useful for interpretation andcomparison with other papers in the literature

9 During this last period two other companies offeringinsurance directly were established Due to first-mover advan-tage (as viewed by the companyrsquos management) which helpedthe company maintain a strong position in the market thesetwo new companies did not affect pricing policies much untilthe end of our observation period Right in the end of thisperiod the studied company acquired one of these entrants

10 Throughout the analysis we make the assumption thatthe main policyholder is the individual who makes the deduct-ible choice Clearly to the extent that this is not always thecase the results should be interpreted accordingly

11 The company has the addresses on record for billingpurposes Although in principle the company could haveused these data for pricing they do not do so

12 The Israeli Central Bureau of Statistics (CBS) associateseach census respondent with a unique ldquostatistical areardquo eachincluding between 1000 and 10000 residents We matchedthese census tracts with zip codes based on street addressesand constructed variables at the zip code level These con-structed variables are available for more than 80 percent of thepolicyholders As a proxy for wealth we use (gross) monthlyincome which is based on self-reported income by censusrespondents augmented (by the CBS) with Social Security data

13 There is a substantial literature that studies the optimaldesign of policies that commit customers to a multiperiodcontract or that include a one-sided commitment of theinsurer to offer the policyholder certain terms in subsequentperiods (Georges Dionne and Pierre Lasserre 1985 RussellCooper and Beth Hayes 1987 Georges Dionne and Neil ADoherty 1994 Igal Hendel and Alessandro Lizzeri 2003)Although such policies are observed in certain countries(Dionne and Vanasse 1992) many insurance markets includ-ing the one we study use only one-period no-commitmentpolicies (Howard Kunreuther and Mark V Pauly 1985)

14 These include auto theft total loss accidents and notldquoat faultrdquo accidents

749VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

The exchange rate between NIS and US dollarsmonotonically increased from 301 in 1995 to414 in 1999 (on average it was 352)15 Annualinflation was about 8 percent on average andcumulative inflation over the observation periodwas 48 percent We will account for these effects

as well as other general trends by using yeardummy variables throughout the analysis

B The Menu of Deductibles and Premiums

Let xi be the vector of characteristics individuali reports to the insurance company After learningxi the insurer offered individual i a menu of fourcontract choices One option offered a ldquoregularrdquodeductible and a ldquoregularrdquo premium The term

15 PPP figures were about 10 percent lower than the nom-inal exchange rates running from 260 in 1995 to 374 in 1999

TABLE 1mdashSUMMARY STATISTICSmdashCOVARIATES

Variable Mean Std dev Min Max

Demographics Age 41137 1237 1806 8943Female 0316 047 0 1Family Single 0143 035 0 1

Married 0779 042 0 1Divorced 0057 023 0 1Widower 0020 014 0 1Refused to say 0001 004 0 1

Education Elementary 0016 012 0 1High school 0230 042 0 1Technical 0053 022 0 1College 0233 042 0 1No response 0468 050 0 1

Emigrant 0335 047 0 1

Car attributes Value (current NIS)a 66958 37377 4000 617000Car age 3952 287 0 14Commercial car 0083 028 0 1Engine size (cc) 1568 385 700 5000

Driving License years 18178 1007 0 63Good driver 0548 050 0 1Any driver 0743 044 0 1Secondary car 0151 036 0 1Business use 0082 027 0 1Estimated mileage (km)b 14031 5891 1000 32200History length 2847 061 0 3Claims history 0060 015 0 2

Young driver Young 0192 039 0 1Gender Male 0113 032 0 1

Female 0079 027 0 1Age 17ndash19 0029 017 0 1

19ndash21 0051 022 0 121ndash24 0089 029 0 124 0022 015 0 1

Experience 1 0042 020 0 11ndash3 0071 026 0 13 0079 027 0 1

Company year First year 0207 041 0 1Second year 0225 042 0 1Third year 0194 040 0 1Fourth year 0178 038 0 1Fifth year 0195 040 0 1

Note The table is based on all 105800 new customers in the dataa The average exchange rate throughout the sample period was approximately $1 per 35 NIS starting at 13 in late 1994

and reaching 14 in late 1999b The estimated mileage is not reported by everyone It is available for only 60422 new customers

750 THE AMERICAN ECONOMIC REVIEW JUNE 2007

regular was used for this deductible level bothbecause it was relatively similar to the deductiblelevels offered by other insurers and because mostpolicyholders chose it The regular premium var-ied across individuals according to some deter-ministic function (unknown to us) pit ft(xi)which was quite stable over time The regulardeductible level was directly linked to the regularpremium according to

(1) dit min12

pit capt

That is it was set to one-half of the regular pre-mium subject to a deductible cap capt whichvaried over time but not across individuals Thepremiums associated with the other options werecomputed by multiplying pit by three differentconstants 106 for ldquolowrdquo deductible 0875 forldquohighrdquo deductible and 08 for ldquovery highrdquo de-ductible The regular deductible dit was con-verted to the other three deductible levels in asimilar way using multipliers of 06 for low 18for high and 26 for very high

There are two main sources of exogenous vari-ation in prices The first arises from companyexperimentation The multipliers described abovewere fixed across individuals and over time formost of the observation period but there was asix-month period during the insurerrsquos first year ofoperation (May 1995 to October 1995) in whichthe insurer experimented with slightly modifiedmultipliers16 This modified formula covers al-most 10 percent of the sample The secondsource of variation arises from discrete ad-justments to the uniform cap The cap variedover time due to inflation competitive condi-tions and as the company gained more expe-rience (Figure 1) The cap was binding for

16 For individuals with low levels of regular premiumsduring the specified period the regular deductible was set at53 percent (instead of 50 percent) of the regular premiumthe low deductible was set at 33 percent (instead of 30percent) of the regular premium and so on

FIGURE 1 VARIATION IN THE DEDUCTIBLE CAP OVER TIME

Notes This figure presents the variation in the deductible cap over time which is one of the main sources of pricing variation inthe data We do not observe the cap directly but it can be calculated from the observed menus The graph plots the maximalregular deductible offered to anyone who bought insurance from the company over a moving seven-day window Thelarge jumps in the graph reflect changes in the deductible cap There are three reasons why the graph is not perfectlysmooth First in a few holiday periods (eg October 1995) there are not enough sales within a seven-day window sonone of those sales hits the cap This gives rise to temporary jumps downward Second the pricing rule applies at thedate of the price quote given to the potential customer Our recorded date is the first date the policy becomes effectiveThe price quote is held for a period of two to four weeks so in periods in which the pricing changes we may still seenew policies sold using earlier quotes made according to a previous pricing regime Finally even within periods of constantcap the maximal deductible varies slightly (variation of less than 05 percent) We do not know the source of this variation

751VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

about a third of the policyholders in our dataAll these individuals would be affected by achange in the cap Much of the variation ofmenus in the data is driven by the exogenousshifts in the uniform deductible cap The un-derlying assumption is that conditional onobservables these sources of variation pri-marily affect the deductible choice of newcustomers but they do not have a significantimpact on the probability of purchasing insurancefrom the company Indeed this assumption holdsin the data with respect to observables there is nodistinguishable difference in the distribution ofobservable characteristics of consumers who buyinsurance just before and those who buy just aftera change in the deductible cap

C Summary Statistics

The top of Table 2A summarizes the deduct-ible menus all are calculated according to theformula described earlier Only 1 percent of thepolicyholders chose the high or very high de-ductible options Therefore for the rest of theanalysis we focus only on the choice betweenregular and low deductible options (chosen by811 and 178 percent of the individuals respec-tively)17 Focusing only on these options doesnot create any selection bias because we do notomit individuals who chose high or very highdeductibles For these individuals we assumethat they chose a regular deductible This as-sumption is consistent with the structural modelwe develop in the next section which impliesthat conditional on choosing high or very highdeductibles an individual would almost alwaysprefer the regular over the low deductible

The bottom of Table 2A as well as Table2B present summary statistics for the policy re-alizations We focus only on claim rates and noton the amounts of the claims This is because anyamount above the higher deductible level is cov-

ered irrespective of the deductible choice and thevast majority of the claims fit in this category (seeSection IIIE) For all these claims the gain fromchoosing a low deductible is the same in the eventof a claim and is equal to the difference betweenthe two deductible levels Therefore the claimamount is rarely relevant for the deductible choice(and likewise for the companyrsquos pricing decisionwe analyze in Section IIIF)

Averaging over all individuals the annualclaim rate was 0245 One can clearly observesome initial evidence of adverse selection Onaverage individuals who chose a low deductiblehad higher claim rates (0309) than those whochose the regular deductible (0232) Those whochose high and very high deductibles had muchlower claim rates (0128 and 0133 respectively)These figures can be interpreted in the context ofthe pricing formula A risk-neutral individual willchoose the low deductible if and only if her claimrate is higher than (pd) (plow pregular)(dregular dlow) When the deductible cap is notbinding which is the case for about two-thirds ofthe sample this ratio is given directly by thepricing formula and is equal to 03 Thus anyindividual with a claim rate higher than 03 willbenefit from buying the additional coverage pro-vided by a low deductible even without any riskaversion The claim data suggest that the offeredmenu is cheaper than an actuarially fair contractfor a nonnegligible part of the population (13percent according to the benchmark estimates re-ported below) This observation is in sharp con-trast to other types of insurance contracts such asappliance warranties which are much more ex-pensive than the actuarially fair price (Rabin andThaler 2001)

II The Empirical Model

A A Model of Deductible Choice

Let wi be individual irsquos wealth (pih di

h) theinsurance contract (premium and deductible re-spectively) with high deductible (pi

l dil) the

insurance contract with low deductible ti theduration of the policy and ui(w) individual irsquosvNM utility function We assume that the num-ber of insurance claims is drawn from a Poissondistribution with an annual claim rate iThrough most of the paper we assume that i isknown to the individual We also assume that iis independent of the deductible choice ie that

17 The small frequency of ldquohighrdquo and ldquovery highrdquochoices provides important information about the lowerends of the risk and risk aversion distributions but (for thatsame reason) makes the analysis sensitive to functionalform Considering these options or the option of not buyinginsurance creates a sharp lower bound on risk aversion forthe majority of the observations making the estimates muchhigher Given that these options are rarely selected how-ever it is not clear to us whether they were regularlymentioned during the insurance sales process renderingtheir use somewhat inappropriate

752 THE AMERICAN ECONOMIC REVIEW JUNE 2007

there is no moral hazard Finally we assumethat in the event of an accident the value of theclaim is greater than di

h We revisit all theseassumptions in Sections IIID and IIIE For therest of this section i subscripts are suppressedfor convenience

In the market we study insurance policies are

typically held for a full year after which theycan be automatically renewed with no commit-ment by either the company or the individualMoreover all auto-insurance policies sold inIsrael can be canceled without prior notice bythe policyholder with premium payments beinglinearly prorated Both the premium and the

TABLE 2AmdashSUMMARY STATISTICSmdashMENUS CHOICES AND OUTCOMES

Variable Obs Mean Std dev Min Max

Menu Deductible (current NIS)a Low 105800 87548 12101 37492 103911Regular 105800 145299 19779 62486 171543High 105800 260802 35291 112475 308778Very high 105800 376305 50853 162464 446013

Premium (current NIS)a Low 105800 338057 91404 132471 1923962Regular 105800 318922 8623 124972 1815058High 105800 279057 75451 109351 1588176Very high 105800 255137 68984 99978 1452046

pd 105800 0328 006 03 18

Realization Choice Low 105800 0178 038 0 1Regular 105800 0811 039 0 1High 105800 0006 008 0 1Very high 105800 0005 007 0 1

Policy termination Active 105800 0150 036 0 1Canceled 105800 0143 035 0 1Expired 105800 0707 046 0 1

Policy duration (years) 105800 0848 028 0005 108Claims All 105800 0208 048 0 5

Low 18799 0280 055 0 5Regular 85840 0194 046 0 5High 654 0109 034 0 3Very high 507 0107 032 0 2

Claims per yearb All 105800 0245 066 0 19882Low 18799 0309 066 0 9264Regular 85840 0232 066 0 19882High 654 0128 062 0 12636Very high 507 0133 050 0 3326

a The average exchange rate throughout the sample period was approximately $1 per 35 NIS starting at 13 in late 1994and reaching 14 in late 1999

b The mean and standard deviation of the claims per year are weighted by the observed policy duration to adjust forvariation in the exposure period These are the maximum likelihood estimates of a simple Poisson model with no covariates

TABLE 2BmdashSUMMARY STATISTICSmdashCONTRACT CHOICES AND REALIZATIONS

Claims Low Regular High Very high Total Share

0 11929 (0193) 49281 (0796) 412 (0007) 299 (0005) 61921 (100) 080341 3124 (0239) 9867 (0755) 47 (0004) 35 (0003) 13073 (100) 016962 565 (0308) 1261 (0688) 4 (0002) 2 (0001) 1832 (100) 002383 71 (0314) 154 (0681) 1 (0004) 0 (0000) 226 (100) 000294 6 (0353) 11 (0647) 0 (0000) 0 (0000) 17 (100) 000025 1 (0500) 1 (0500) 0 (0000) 0 (0000) 2 (100) 000003

Notes The table presents tabulation of choices and number of claims For comparability the figures are computed only forindividuals whose policies lasted at least 09 years (about 73 percent of the data) The bottom rows of Table 2A providedescriptive figures for the full dataset The numbers in parentheses in each cell represent percentages within each row Theright-hand-side column presents the marginal distribution of the number of claims

753VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

probability of a claim are proportional to thelength of the time interval taken into account soit is convenient to think of the contract choice asa commitment for only a short amount of timeThis approach has several advantages First ithelps to account for early cancellations andtruncated policies which together constitute 30percent of the policies in the data18 Second itmakes the deductible choice independent ofother longer-term uncertainties faced by the in-dividual so we can focus on static risk-takingbehavior Third this formulation helps to obtaina simple framework for analysis which is attrac-tive both analytically and computationally19

The expected utility that the individual obtainsfrom the choice of a contract (p d) is given by

(2) v p d

1 tuw pt tuw pt d

We characterize the set of parameters that willmake the individual indifferent between the twooffered contracts This set provides a lower (up-per) bound on the level of risk aversion for indi-viduals who choose the low (high) deductible (fora given ) Thus we analyze the equation v(phdh) v(pl dl) By taking limits with respect to t(and applying LrsquoHopitalrsquos rule) we obtain

(3)

limt30

1

t(u(w pht) u(w plt))

(u(w pht) u(w pht dh)) (u(w plt) u(w plt dl))

pl phuw

uw dl uw dh

or

(4)

pl phuw uw dl uw dh

The last expression has a simple intuition Theright-hand side is the expected gain (in utils) perunit of time from choosing a low deductibleThe left-hand side is the cost of such a choiceper unit of time For the individual to be indif-ferent the expected gains must equal the costs

In our benchmark specification we assumethat the third derivative of the vNM utilityfunction is not too large A Taylor expansion forboth terms on the right-hand side of equation(4) ie u(w d) u(w) du(w) (d22)u13(w) implies that

(5)pl ph

uw dh dluw

1

2dh dldh dlu13w

Let d dh dl 0 p pl ph 0 andd 1frasl2 (dh dl) to obtain

(6)p

duw uw d u13w

or

(7) r u13w

uw

p

d 1

d

where r is the coefficient of absolute risk aver-sion at wealth level w

18 As can be seen in Table 2A 70 percent of the policiesare observed through their full duration (one year) About15 percent are truncated by the end of our observationperiod and the remaining 15 percent are canceled for var-ious reasons such as change in car ownership total-lossaccident or a unilateral decision of the policyholder tochange insurance providers

19 This specification ignores the option value associatedwith not canceling a policy This is not very restrictiveSince experience rating is small and menus do not changeby much this option value is likely to be close to zero Asimple alternative is to assume that individuals behave as ifthey commit for a full year of coverage In such a case themodel will be similar to the one we estimate but willdepend on the functional form of the vNM utility functionand would generally require taking infinite sums (over thepotential realizations for the number of claims within the year)In the special case of quadratic expected utility maximizerswho care only about the mean and variance of the number ofclaims this is easy to solve The result is almost identical to theexpression we subsequently derive in equation (7)

754 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Equation (7) defines an indifference set in thespace of risk and risk aversion which we willrefer to by (r() ) and ((r) r) interchange-ably Both r() and (r) have a closed-formrepresentation a property that will be computa-tionally attractive for estimation20 Both termsare individual specific as they depend on thedeductible menu which varies across individu-als For the rest of the paper we regard eachindividual as associated with a two-dimensionaltype (ri i) An individual with a risk parameter

i who is offered a menu (pih di

h) (pil di

l)will choose the low-deductible contract if andonly if his coefficient of absolute risk aversionsatisfies ri ri(i) Figure 2 presents a graphi-cal illustration

B The Benchmark Econometric Model

The Econometric ModelmdashThe econometricmodel we estimate is fully described by the fiveequations presented in this section Our ob-jective is to estimate the joint distribution of(i ri)mdashthe claim rate and coefficient of abso-lute risk aversionmdashin the population of policy-holders conditional on observables xi The

20 For example estimating the CARA version of themodel (Section IIID) for which r() does not have aclosed-form representation takes almost ten times longer

FIGURE 2 THE INDIVIDUALrsquoS DECISIONmdashA GRAPHICAL ILLUSTRATION

Notes This graph illustrates the individualrsquos decision problem The solid line presents the indifference setmdashequation (7)mdashappliedfor the menu faced by the average individual in the sample Individuals are represented by points in the two-dimensional spaceabove In particular the scattered points are 10000 draws from the joint distribution of risk and risk aversion for the averageindividual (on observables) in the data based on the point estimates of the benchmark model (Table 4) If an individual is eitherto the right of the line (high risk) or above the line (high risk aversion) the low deductible is optimal Adverse selection is capturedby the fact that the line is downward sloping higher-risk individuals require lower levels of risk aversion to choose the lowdeductible Thus in the absence of correlation between risk and risk aversion higher-risk individuals are more likely to choosehigher levels of insurance An individual with i (pidi) will choose a lower deductible even if he is risk neutral ie withprobability one (we do not allow individuals to be risk loving) This does not create an estimation problem because i is notobserved only a posterior distribution for it Any such distribution will have a positive weight on values of i that are below(pidi) Second the indifference set is a function of the menu and in particular of (pidi) and d An increase in (pidi) willshift the set up and to the right and an increase in d will shift the set down and to the left Therefore exogenous shiftsof the menus that make both arguments change in the same direction can make the sets ldquocrossrdquo thereby allowing us toidentify the correlation between risk and risk aversion nonparametrically With positive correlation (shown in the figureby the ldquoright-bendingrdquo shape of the simulated draws) the marginal individuals are relatively high risk therefore creatinga stronger incentive for the insurer to raise the price of the low deductible

755VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

benchmark formulation assumes that (i ri) fol-lows a bivariate lognormal distribution Thuswe can write the model as

(8) ln i xi i

(9) ln ri xi vi

with

(10) i

vi

iid

N00

2 r

r r2

Neither i nor ri is directly observed Thereforewe treat both as latent variables Loosely speak-ing they can be thought of as random effects Weobserve two variables the number of claims andthe deductible choice which are related to thesetwo unobserved components Thus to completeour econometric model we have to specify therelationship between the observed variables andthe latent ones This is done by making two struc-tural assumptions First we assume that the num-ber of claims is drawn from a Poisson distributionnamely

(11) claimsi Poisson(i ti )

where ti is the observed duration of the policySecond we assume that when individuals maketheir deductible choices they follow the theoreti-cal model described in the previous section Themodel implies that individual i chooses the lowdeductible (choicei 1) if and only if ri ri(i)where ri is defined in equation (7) Thus theempirical model for deductible choice is given by

(12) Prchoicei 1 Prri

pi

idi 1

d i

Prexp(xi vi)

pi

exp(xi i)di 1

d i

With no unobserved heterogeneity in risk (i 0) equation (12) reduces to a simple probit In

such a case one can perfectly predict risk from thedata denote it by (xi) and construct an addi-tional covariate ln([pi((xi)di) 1]d i) Giventhe assumption that risk aversion is distributedlognormally running the probit regressionabove and renormalizing the coefficient onthe constructed covariate to 1 (instead ofthe typical normalization of the variance of theerror term to 1) has a structural interpretation withln (ri) as the dependent variable However Cohen(2005) provides evidence of adverse selection inthe data implying the existence of unobservedheterogeneity in risk This makes the simple probitregression misspecified Estimation of the fullmodel is more complicated Once we allow forunobserved heterogeneity in both unobserved riskaversion (vi) and claim rate (i) we have to inte-grate over the two-dimensional region depicted inFigure 2 for estimation

EstimationmdashA natural way to proceed is toestimate the model by maximum likelihoodwhere the likelihood of the data as a function ofthe parameters can be written by integrating outthe latent variables namely

(13) Lclaimsi choicei

Prclaimsi choiceii riPri ri

where is a vector of parameters to be esti-mated While formulating the empirical modelusing likelihood may help our thinking aboutthe data-generating process using maximumlikelihood (or generalized method of moments(GMM)) for estimation is computationally cum-bersome This is because in each iteration itrequires evaluating a separate complex integralfor each individual in the data In contrastMarkov Chain Monte Carlo (MCMC) Gibbssampling is quite attractive in such a case Us-ing data augmentation of latent variables (Mar-tin A Tanner and Wing Hung Wong 1987)according to which we simulate (i ri) and latertreat those simulations as if they are part ofthe data one can avoid evaluating the complexintegrals by sampling from truncated normaldistributions which is significantly less compu-tationally demanding (eg Luc Devroye 1986)This feature combined with the idea of aldquosliced samplerrdquo (Paul Damien John Wake-field and Stephen Walker 1999) to sample from

756 THE AMERICAN ECONOMIC REVIEW JUNE 2007

an unfamiliar posterior distribution makes theuse of a Gibbs sampler quite efficient for ourpurposes Finally the lognormality assumptionimplies that F(ln(i)ri) and F(ln(ri)i) follow a(conditional) normal distribution allowing us torestrict attention to univariate draws furtherreducing the computational burden

The Appendix provides a full description ofthe Gibbs sampler including the conditionaldistributions and the (flat) prior distributions weuse The basic intuition is that conditional onobserving (i ri) for each individual we have asimple linear regression model with two equa-tions The less standard part is to generate drawsfor (i ri) We do this iteratively Conditionalon i the posterior distribution for ln(ri) fol-lows a truncated normal distribution where thetruncation point depends on the menu individ-ual i faces and its direction (from above orbelow) depends on individual irsquos deductiblechoice The final step is to sample from theposterior distribution of ln(i) conditional on riThis is more complicated as we have bothtruncation which arises from adverse selection(just as we do when sampling for ri) and thenumber of claims which provides additionalinformation about the posterior of i Thus theposterior for i takes an unfamiliar form forwhich we use a ldquosliced samplerrdquo

We use 100000 iterations of the Gibbs sam-pler It seems to converge to the stationarydistribution after about 5000 iterations There-fore we drop the first 10000 draws and use thelast 90000 draws of each variable to report ourresults The results are based on the posteriormean and posterior standard deviation fromthese 90000 draws Note that each iterationinvolves generating separate draws of (i ri) foreach individual Performing 100000 iterationsof the benchmark specification (coded in Mat-lab) takes about 60 hours on a Dell Precision530 workstation

C Identification

The parametric version of the model is iden-tified mechanically There are more equationsthan unknowns and no linear dependenciesamong them so (as also verified using MonteCarlo exercises) the model parameters can bebacked out from simulated data Our goal in thissection is not to provide a formal identificationproof Rather we want to provide intuition for

which features of the data allow us to identifyparticular parameters of the model The discus-sion also highlights the assumptions that areessential for identification vis-a-vis those thatare made only for computational convenience(making them in principle testable)

Discussion of Nonparametric Identifica-tionmdashThe main difficulty in identifying themodel arises from the gap between the (exante) risk type i which individuals usewhen choosing a deductible and the (ex post)realization of the number of claims we ob-serve We identify between the variation inrisk types and the variation in risk realizationsusing our parametric distributional assump-tions The key is that the distribution of risktypes can be uniquely backed out from theclaim data alone This allows us to use thedeductible choice as an additional layer ofinformation which identifies unobserved het-erogeneity in risk aversion21 Any distribu-tional assumption that allows us to uniquelyback out the distribution of risk types fromclaim data would be sufficient to identify thedistribution of risk aversion As is customaryin the analysis of count processes we make aparametric assumption that claims are gener-ated by a lognormal mixture of Poisson dis-tributions (Section IIID discusses this furtherand explores an alternative) Using a mixtureenables us to account for adverse selectionthrough unobserved heterogeneity in risk Italso allows us to better fit the tails of theclaim distribution In principle a more flexi-ble mixture or a more flexible claim-generat-ing process could be identified as long as theclaims data are sufficiently rich22

21 Cardon and Hendel (2001) face a similar identificationproblem in the context of health insurance They use vari-ation in coverage choice (analogous to deductible choice) toidentify the variation in health-status signals (analogous torisk types) from the variation in health expenditure (analo-gous to number of claims) They can rely on the coveragechoice to identify this because they make an assumptionregarding unobserved heterogeneity in preferences (iidlogit) We take a different approach as our main goal is toestimate rather than assume the distribution of preferences

22 Although it may seem that the claim data are limited(as they take only integer values between 0 and 5 in ourdata) variation in policy duration generates continuousvariation in the observed claim propensity Of course thisvariation also introduces an additional selection into themodel due to policy cancellations which are potentially

757VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Once the distribution of risk types is iden-tified from claims data the marginal distribu-tion of risk aversion (and its relationship tothe distribution of risk types) is nonparametri-cally identified from the variation in theoffered menus discussed in Section I Thisvariation implies different deductible and pre-mium options to identical (on observables)individuals who purchased insurance at dif-ferent times Different menus lead to differentindifference sets (similar to the one depictedin Figure 2) These sets often cross each otherand nonparametrically identify the distribu-tion of risk aversion and the correlation struc-ture at least within the region in which theindifference sets vary For the tails of thedistribution as is typically the case we haveto rely on parametric assumptions or usebounds The parametric assumption of log-normality we use for most of the paper ismade only for computational convenience

Intuition for the Parametric IdentificationMechanismmdashVariation in the offered menus isimportant for the nonparametric identificationThe parametric assumptions could identify themodel without such variation Thus to keep theintuition simple let us take the bivariate log-normal distribution as given and contrary to thedata assume that all individuals are identical onobservables and face the same menu Supposealso that all individuals are observed for exactlyone year and have up to two claims23 In thissimplified case the model has five parametersto be estimated the mean and variance of risk and

2 the mean and variance of risk aver-sion r and r

2 and the correlation parameter The data can be summarized by five numbersLet 130 131 and 132 1 131 130 be thefraction of individuals with zero one and twoclaims respectively Let 0 1 and 2 be theproportion of individuals who chose a low de-ductible within each ldquoclaim grouprdquo Given ourdistributional assumption about the claim-

generating process we can use 130 and 131 touniquely identify and

2 Loosely isidentified by the average claim rate in the dataand

2 is identified by how fat the tail of theclaim distribution is ie by how large (132131)is compared to (131130) Given and

2 and thelognormality assumption we can (implicitly)construct a posterior distribution of risk typesfor each claim group F(r claims c) andintegrate over it when predicting the deductiblechoice This provides us with three additionalmoments each of the form

(14) Ec Prchoice 1r

dFr claims c dFr

for c 0 1 2 These moments identify thethree remaining parameters of the model rr

2 and Let us now provide more economic content to

the identification argument Using the same ex-ample and conditional on identifying and

2

from the claim data one can think about thedeductible choice data 0 1 2 as a graph(c) The absolute level of the graph identifiesr In the absence of correlation between riskand risk aversion the slope of the graph iden-tifies r

2 with no correlation the slope shouldalways be positive (due to adverse selection)but higher r

2 would imply a flatter graph be-cause more variation in the deductible choiceswill be attributed to variation in risk aversionFinally is identified by the curvature of thegraph The more convex (concave) the graph isthe more positive (negative) is the estimated For example if 0 05 1 051 and 2 099 it is likely that r

2 is high (explaining why0 and 1 are so close) and is highly positive(explaining why 2 is not also close to 1) Incontrast if 0 1 it must mean that thecorrelation between risk and risk aversion isnegative which is the only way the originalpositive correlation induced by adverse selec-tion can be offset This intuition also clarifiesthat the identification of relies on observingindividuals with multiple claims (or differentpolicy durations) and that it is likely to besensitive to the distributional assumptions Thedata (Table 2B) provide a direct (positive) cor-relation between deductible choice and claims

endogenous The results are similar when we use onlyexpired and truncated policies

23 This variation is sufficient (and necessary) to identifythe benchmark model The data provide more variation weobserve up to five claims per individual we observe con-tinuous variation in the policy durations we observe vari-ation in prices and we exploit distributional restrictionsacross individuals with different observable characteristics

758 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The structural assumptions allow us to explainhow much of this correlation can be attributedto adverse selection The remaining correlation(positive or negative) is therefore attributed tocorrelation in the underlying distribution of riskand risk aversion

III Results

A Reduced-Form Estimation

To get an initial sense for the levels of abso-lute risk aversion implied by the data we use asimple back-of-the-envelope exercise We com-pute unconditional averages of p d dh dland d (Table 2A)24 and substitute these valuesin equation (7) The implied coefficient of ab-solute risk aversion is 29 104 NIS125 Thisfigure could be thought of as the average indif-ference point implying that about 18 percent ofthe policyholders have coefficients of absoluterisk aversion exceeding it To convert to USdollar amounts one needs to multiply thesefigures by the average exchange rate (352)resulting in an average indifference point of102 103 $US1 This figure is less than halfof a similar estimate reported by Sydnor (2006)for buyers of homeownerrsquos insurance but about3 and 13 times higher than comparable fig-ures reported by Gertner (1993) and Metrick(1995) respectively for television game showparticipants

Table 3 provides reduced-form analysis ofthe relationship between the observables andour two left-hand-side variables the number ofclaims and the deductible choice26 Column 1reports the estimates from a Poisson regressionof the number of claims on observed character-istics This regression is closely related to therisk equation we estimate in the benchmarkmodel It shows that older people women andpeople with a college education are less likely to

have an accident Bigger more expensiveolder and noncommercial cars are more likelyto be involved in an accident Driving experi-ence and variables associated with less intenseuse of the car reduce accident rates As could beimagined claim propensity is highly correlatedover time past claims are a strong predictor offuture claims Young drivers are 50 to 70 per-cent more likely to be involved in an accidentwith young men significantly more likely thanyoung women Finally as indicated by the trendin the estimated year dummies the accident ratesignificantly declined over time Part of thisdecline is likely due to the decline in accidentrates in Israel in general27 This decline mightalso be partly due to the better selection ofindividuals the company obtained over time asit gained more experience (Cohen 2003)

Columns 2 and 3 of Table 3 present estimatesfrom probit regressions in which the dependentvariable is equal to one if the policyholder chosea low deductible and is equal to zero otherwiseColumn 3 shows the marginal effects of thecovariates on the propensity to choose a lowdeductible These marginal effects do not have astructural interpretation as the choice of lowdeductible depends on its price on risk aver-sion and on risk In this regression we againobserve a strong trend over time Fewer policy-holders chose the low deductible as time wentby One reason for this trend according to thecompany executives is that over time the com-panyrsquos sales persons were directed to focusmainly on the ldquodefaultrdquo regular deductible op-tion28 The effect of other covariates will bediscussed later in the context of the full modelIn unreported probit regressions we also test thequalitative assumptions of the structural model byadding three additional regressors the price ratio(pidi) the average deductible offered d i

24 The unconditional is computed by maximum likeli-hood using the data on claims and observed durations of thepolicies

25 Using the CARA specification as in equation (16) weobtain a slightly lower value of 25 104 NIS1

26 We find positive correlation in the errors of these tworegressions suggesting the existence of adverse selection inthe data and motivating a model with unobserved heteroge-neity in risk This test is similar to the bivariate probit testproposed by Chiappori and Salanie (2000) and replicatesearlier results reported in Cohen (2005)

27 In particular traffic fatalities and counts of trafficaccidents in Israel fell by 11 percent and 18 percent during1998 and 1999 respectively

28 Such biased marketing efforts will bias consumersagainst choosing the low deductible thus making them lookless risk averse This would make our estimate a lowerbound on the true level of risk aversion If only sophisti-cated consumers could see beyond the marketing effort andthis sophistication were related to observables (eg educa-tion) the coefficients on such observables would be biasedupward This is not a major concern given that the coefficientson most of the covariates are fairly stable when we estimatethe benchmark model separately for different years

759VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

TABLE 3mdashNO HETEROGENEITY IN RISK

Variable

Poisson regressiona Probit regressionsc

Dep var Number of claims Dep var 1 if low deductible

(1) (2) (3)

Coeff Std Err IRRb Coeff Std Err dPdX dPdXDemographics Constant 43535 02744 mdash 224994 23562 mdash

Age 00074 00052 09926 01690 00464 00034 00042Age2 989 105 536 105 10001 00017 00005 339 105 455 105Female 00456 00161 09555 08003 01316 00165 00134Family Single omitted omitted omitted

Married 01115 00217 08945 06417 01950 00128 00045Divorced 00717 00346 10743 03191 03127 00064 00044Widower 00155 00527 10156 03441 04573 00069 00025Other (NA) 01347 01755 11442 29518 19836 00511 00370

Education Elementary 00522 00550 09491 02927 04278 00061 00016High school omitted omitted omittedTechnical 00373 00308 10380 05037 02520 00105 00138College 00745 00197 09282 05279 01546 00109 00053Other (NA) 00116 00184 10116 00630 01540 00013 00025

Emigrant 00210 00163 10213 00497 01328 00010 00003

Car attributes Log(Value) 01227 00281 11306 13285 02342 00271 00299Car age 00187 00042 10189 01603 00357 00033 00017Commercial car 01394 00326 08699 09038 02781 00177 00294Log(engine size) 02972 00459 13461 06924 03952 00141 00075

Driving License years 00204 00034 09798 01043 00312 00021 00005License years2 00002 677 105 10002 00015 00005 299 105 167 105

Good driver 00176 00191 09825 07207 01574 00148 00152Any driver 00564 00169 10580 10923 01321 00217 00258Secondary car 00859 00209 09177 00038 01626 00001 00070Business use 01852 00293 12034 07951 02737 00156 00017History length 00527 00110 09486 11218 01450 00228 00171Claims history 06577 00390 19304 04654 05460 00095 00496

Young driver Young driver 05235 00361 16879 28847 06305 00524 00012Gender Male omitted omitted omitted

Female 01475 00288 08629 17959 03135 00396 00195Age 17ndash19 omitted omitted omitted

19ndash21 00701 00532 10726 07800 07509 00153 0014721ndash24 00267 00574 09737 05746 07773 00114 0015624 02082 00567 0812 17869 07328 00397 00136

Experience 1 omitted omitted omitted1ndash3 02416 00458 07854 10175 06577 00217 000093 02827 00532 07538 32513 07386 00762 00410

Company year First year omitted omitted omittedSecond year 00888 00198 09150 44513 01506 00792 00859Third year 00690 00222 09334 85888 01820 01303 01367Fourth year 01817 00232 08339 118277 02102 01616 01734Fifth year 05431 0028 05810 143206 02778 01920 02085

105586Obs 105800 94000 105800Pseudo R2 00162 01364 01296Log likelihood 577459 369595 430869

Significant at the 5 percent confidence levela Maximum likelihood estimates Variation in exposure (policy duration) is accounted forb IRR Incidence rate ratio Each figure should be interpreted as the increasedecrease in claim probability as a result of an increase of one unit in the

right-hand-side variablec There are two separate probit regressions reported in this table Column 2 relies on the deductible choice model and the lognormality assumption As discussed

in the text by including an additional regressor ln([pi((xi)di) 1]d i) (with (xi) predicted from column 1 above) and normalizing its coefficient to 1 we obtaina structural interpretation of this regression with ln (ri) as the dependent variable Thus the reported coefficients are comparable to those estimated for the benchmarkmodel One should be cautious however in interpreting these coefficients Unlike the benchmark model this regression does not allow unobserved heterogeneity inrisk and suffers from some selection bias because observations with high predicted risk rate are omitted (which is why the number of observations is 94000 ratherthan the full sample of 105800) For comparison column 3 reports the marginal effects from a comparable probit regression that uses the full sample and does notcontrol for pricing and predicted risk through the additional structural regressor Column 3 does not have a structural interpretation and its (unreported) coefficientscannot be compared to those estimated from the benchmark model

760 THE AMERICAN ECONOMIC REVIEW JUNE 2007

and the risk rate (xi) as predicted from thePoisson regression of column 1 All three addi-tional regressors enter with the predicted signand with large and highly significant marginaleffects29

Finally column 2 of Table 3 presents animportant specification of the probit regres-sion in which ln([pi(( xi)di) 1]d i) isadded as a regressor and its coefficient isnormalized to 130 As already mentioned ifwe assume that there is no unobserved heter-ogeneity in risk then column 2 is analogousto the ln(ri) equation of the benchmark modelThis restriction of the model is rejected by thedata About 10 percent of the individuals arepredicted to have (xi) (pi di) implying achoice of low deductible for any level of riskaversion Many of these individuals howeverstill choose higher deductible levels Column2 reports the results for the remaining indi-viduals (94000 out of 105800) for which theregression can be run While the signs of theestimated coefficients are similar to those inthe benchmark model presented below therestricted version of the model suggests muchhigher levels and dispersion of risk aversionwell above any reasonable level31 The fullestimation of the benchmark model clearly re-jects this restriction on the model

B Estimation of the Benchmark Model

The Effect of Individual Characteristics onRisk AversionmdashTable 4 presents the estimationresults from the benchmark model The secondcolumn shows how the level of absolute riskaversion is related to individual characteristicsAs the dependent variable is in natural loga-

rithm coefficients on dummy variables can bedirectly interpreted as approximate percentagechanges The vast majority of these coefficientsare quite stable across a wide range of specifi-cations which are mentioned later

The results indicate that women are more riskaverse than men and have a coefficient of ab-solute risk aversion about 20 percent greaterthan that of men These results are consistentwith those of Donkers et al (2001) and Hartoget al (2002) The estimated effect of age sug-gests a nonmonotone pattern of risk preferencesover the life cycle The estimated coefficientsimply that initially (that is at age 18 the youngestindividual in the data) individuals become less riskaverse with age but around the age of 48 indi-viduals start becoming more risk averse32 Mar-ried individuals are estimated to be significantlymore risk averse compared to singles while di-vorced individuals are less (although the coeffi-cient is statistically insignificant)

The analysis suggests that variables that arelikely to be correlated with income or wealthsuch as post-high-school education and thevalue of the car have a positive coefficientindicating that wealthier people have higherlevels of absolute risk aversion Although we donot have data on individualsrsquo income or wealthwe use other proxies for income in additionalspecifications and obtain mixed results Whenwe include as a covariate the average householdincome among those who live in the same zipcode we obtain a significant and negative co-efficient of 0333 (0154) (standard deviationin parentheses) When we match zip code in-come on demographics of the individuals thecoefficient is effectively zero 0036 (0047)and when we use a socioeconomic index ofthe locality in which the individual lives thecoefficient is positive 0127 (0053)33 Thus

29 The estimated marginal effect (z-statistic in parenthe-ses) is 0352 (1376) 16 104 (1481) and 0154(255) for (pidi) d i and (xi) respectively

30 The level is a normalization The sign is estimatedHad the sign on this regressor been positive this would haveimplied a rejection of the model

31 For the implied median level of risk aversion the re-stricted model produces a similar estimate to the estimate wereport below for the benchmark model However since someindividuals who chose the low deductible are estimated to havevery low claim rates the restricted model is ldquoforcedrdquo to estimatevery high risk aversion for these individuals (in contrast thebenchmark model can explain such choices by positive risk re-siduals) resulting in very high dispersion and (due to the lognor-mality assumption) very high average risk aversion which isabout 1025 higher than the benchmark estimates we report below

32 This nonmonotone pattern may explain why age enterswith different signs in the estimation results of Donkers et al(2001) and Hartog et al (2002) A somewhat similar U-shapepattern with respect to age is also reported by Sumit Agarwalet al (2006) in the context of consumer credit markets

33 The full results from these specifications are providedin the online Appendix (available at httpwwwe-aerorgdatajune0720050644_apppdf ) The reason we do not usethese income variables in the benchmark specification istheir imperfect coverage which would require us to omitalmost 20 percent of the individuals Other results fromthese regressions are similar to those we report for thebenchmark specification

761VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

overall we interpret our findings as suggestiveof a nonnegative and perhaps positive associ-ation between incomewealth and absolute riskaversion

At first glance these results may appear tobe inconsistent with the widely held beliefthat absolute risk aversion declines withwealth One should distinguish however be-tween two questions (a) whether for a givenindividual the vNM utility function exhibitsdecreasing absolute risk aversion and (b)how risk preferences vary across individuals

Our results do not speak to the first questionand should not be thought of as a test of thedecreasing absolute risk aversion propertyTesting this property would require observingthe same individual making multiple choicesat different wealth levels Rather our resultsindicate that individuals with greater wealthhave utility functions that involve a greaterdegree of risk aversion It might be thatwealth is endogenous and that risk aversion(or unobserved individual characteristics thatare correlated with it) leads individuals to

TABLE 4mdashTHE BENCHMARK MODEL

Variable Ln() equation Ln(r) equation Additional quantities

Demographics Constant 15406 (00073) 118118 (01032) Var-covar matrix ()

Age 00001 (00026) 00623 (00213) 01498 (00097)Age2 624 106 (263 105) 644 104 (211 104) r 31515 (00773)Female 00006 (00086) 02049 (00643) 08391 (00265)Family Single Omitted Omitted

Married 00198 (00115) 01927 (00974) Unconditional statisticsa

Divorced 00396 (00155) 01754 (01495) Mean 02196 (00013)Widower 00135 (00281) 01320 (02288) Median 02174 (00017)Other (NA) 00557 (00968) 04599 (07397) Std Dev 00483 (00019)

Education Elementary 00194 (00333) 01283 (02156) Mean r 00019 (00002)High school Omitted Omitted Median r 727 106 (756 107)Technical 00017 (00189) 02306 (01341) Std Dev r 00197 (00015)College 00277 (00124) 02177 (00840) Corr(r ) 02067 (00085)Other (NA) 00029 (00107) 00128 (00819)

Emigrant 00030 (00090) 00001 (00651) Obs 105800

Car attributes Log(value) 00794 (00177) 07244 (01272)Car age 00053 (00023) 00411 (00176)Commercial car 00719 (00187) 00313 (01239)Log(engine size) 01299 (00235) 03195 (01847)

Driving License years 00015 (00017) 00157 (00137)License years2 183 105 (351 105) 148 104 (254 104)Good driver 00635 (00112) 00317 (00822)Any driver 00360 (00105) 03000 (00722)Secondary car 00415 (00141) 01209 (00875)Business use 00614 (00134) 03790 (01124)History length 00012 (00052) 03092 (00518)Claims history 01295 (00154) 00459 (01670)

Young driver Young driver 00525 (00253) 02499 (02290)Gender Male Omitted mdash

Female 00355 (00061) mdashAge 17ndash19 Omitted mdash

19ndash21 00387 (00121) mdash21ndash24 00445 (00124) mdash24 00114 (00119) mdash

Experience 1 Omitted mdash1ndash3 00059 (00104) mdash3 00762 (00121) mdash

Company year First year Omitted OmittedSecond year 00771 (00122) 14334 (00853)Third year 00857 (00137) 28459 (01191)Fourth year 01515 (00160) 38089 (01343)Fifth year 04062 (00249) 39525 (01368)

Note Standard deviations based on the draws from the posterior distribution in parentheses Significant at the 5 percent confidence levela Unconditional statistics represent implied quantities for the sample population as a whole ie integrating over the distribution of covariates in the sample (as

well as over the unobserved components)

762 THE AMERICAN ECONOMIC REVIEW JUNE 2007

save more to obtain more education or totake other actions that lead to greaterwealth34

Let us make several additional observationsFirst while owners of more expensive cars ap-pear to have both higher risk exposure andhigher levels of risk aversion owners of biggercars have higher risk exposure but lower levelsof risk aversion This should indicate that thestructure of the model does not constrain therelationship between the coefficients in the twoequations Rather it is the data that ldquospeak uprdquoSecond individuals who are classified by theinsurer as ldquogood driversrdquo indeed have lowerrisk but also appear to have lower risk aversionThis result is somewhat similar to the positivecorrelation between unobserved risk and unob-served risk aversion which we report belowThird policyholders who tend to use the car forbusiness are less risk averse This could bebecause uninsured costs of accidents occurringto such policyholders are tax deductible Fourthpolicyholders who reported three full years ofclaims history are more risk averse but are notdifferent in their risk exposure The attitude thatleads such policyholders to comply with therequest to (voluntarily) report three full years ofclaims history is apparently and not surpris-ingly correlated with higher levels of risk aver-sion In contrast while past claims indicate highrisk they have no significant relationship withrisk aversion Finally we find a strong trendtoward lower levels of risk aversion over timeThis is a replication of the probit results dis-cussed earlier

The Effect of Individual Characteristics onClaim RiskmdashThe first column of Table 4 de-scribes the relationship between observablesand risk exposure Accident risk is higher fordivorced individuals and lower for people witha college education Bigger older more expen-sive and noncommercial cars are all morelikely to be involved in an accident Drivingexperience reduces accident rates as do mea-sures of less intense use of the car while youngdrivers are more exposed to risk Claim propen-sity is highly correlated over time the voluntaryreport of past claims is a strong predictor offuture claims This risk equation produces re-sults that are similar to those of the simplerPoisson regression reported in Table 3 Al-though some of the coefficients lose signifi-cance the magnitude of most coefficients isquite similar The similarity between these twosets of results is to be expected as the riskregression is identified primarily from the dataon claims so incorporating the information ondeductible choice does not qualitatively changethe conceptual identification strategy (see Sec-tion IIC) If the results were not similar thiscould have indicated a misspecification of themodel The slight differences between the riskregressions in Table 3 and Table 4 are drivenmainly by the structural assumptions First thebenchmark model estimates a lognormal mix-ture of Poisson rates rather than a single Pois-son model By incorporating the fatter tails ofthe claim distribution it slightly changes theresults increases the standard errors and de-creases the average predicted claim rate Sec-ond the information on deductible choiceslightly helps us in obtaining more precise es-timates through the correlation structure be-tween the error terms in the two equations

The Implied Level of Risk AversionmdashOne ofthe main goals of the estimation is to obtainmeasures of the level of risk aversion in thepopulation we study Since we use Gibbs sam-pling and augment the latent coefficients ofabsolute risk aversion we can directly obtainthe posterior distribution of various moments ofthis distribution At each iteration of the Gibbssampler we compute the mean standard devi-ation (across individuals) and various percen-tiles of the simulated draws of i and ri and thecorrelation between them The far-right columnof Table 4 reports the averages and standard

34 One may be tempted to interpret the positive wealtheffects as an indirect indication of credit constraints wealth-ier individuals are less credit constrained and therefore canafford to purchase more insurance We do not share thisinterpretation for two reasons First the insurance companyobserves these proxies for wealth and conditions on themwhen setting prices Since the willingness to pay for insur-ance is likely to be correlated with the willingness to pay forthe additional insurance provided by the low deductibleoption premiums already reflect this variation We condi-tion on the level of the premium Second paying less exante implies paying more ex post so applying the creditconstraint argument only to the ex ante payment but not tothe probabilistic ex post deductible payments has no theo-retical foundation Essentially the setup of the model linksthe ex ante decisions to the ex post losses which are bothdriven by the curvature of the vNM utility function

763VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

deviations of these computed quantities over theiterations of the Gibbs sampler35 The impliedrisk aversion of the mean individual is 00019which is about seven times greater than theback-of-the-envelope calculation presented inthe previous section As we assume a lognormal

distribution and estimate a relatively high dis-persion coefficient r the estimates also implysignificantly lower values for the median levelof risk aversion As shown later the qualitativepattern of these results is quite robust acrossspecifications

Table 5 summarizes our findings regardingthe level and dispersion of risk aversion Itpresents two ways to interpret the estimates weobtained from the benchmark model as well ascomparisons to specifications with constant ab-solute risk aversion (CARA) utility and with

35 Note that these estimated quantities are unconditionalIn computing these quantities we integrate over the distri-bution of observable characteristics in the data so onecannot compute these estimates from the estimated param-eter directly

TABLE 5mdashRISK-AVERSION ESTIMATES

Specificationa Absolute risk aversionb Interpretationc Relative risk aversiond

Back-of-the-envelope 10 103 9070 1484Benchmark model

Mean individual 67 103 5605 972225th percentile 23 106 9998 003Median individual 26 105 9974 03775th percentile 29 104 9714 42790th percentile 27 103 7834 390295th percentile 99 103 4937 14327

CARA utilityMean individual 31 103 7651 4436Median individual 34 105 9966 050

Learning modelMean individual 42 103 6886 6140Median individual 56 106 9995 008

Comparable estimatesGertner (1993) 31 104 9699 479Metrick (1995) 66 105 9934 102Holt and Laury (2002)e 32 102 2096 86575Sydnor (2006) 20 103 8329 5395

a The table summarizes the results with respect to the level of risk aversion ldquoBack-of-the-enveloperdquo refers to thecalculation we report in the beginning of Section III ldquobenchmark modelrdquo refers to the results from the benchmark model(Table 4) ldquoCARA utilityrdquo refers to a specification of a CARA utility function and ldquolearning modelrdquo refers to a specificationin which individuals do not know their risk types perfectly (see Section IIID) The last four rows are the closest comparableresults available in the literature

b The second column presents the point estimates for the coefficient of absolute risk aversion converted to $US1 unitsFor the comparable estimates this is given by their estimate of a representative CARA utility maximizer For all otherspecifications this is given by computing the unconditional mean and median in the population using the figures we reportin the mean and median columns of risk aversion in Table 6 (multiplied by 352 the average exchange rate during the periodto convert to US dollars)

c To interpret the absolute risk aversion estimates (ARA) we translate them into x u(w) 1frasl2 u(w 100) 1frasl2 u(w x) That is we report x such that an individual with the estimated ARA is indifferent about participating in a 50ndash50lottery of gaining 100 US dollars and losing x US dollars Note that since our estimate is of absolute risk aversion the quantityx is independent of w To be consistent with the specification we use a quadratic utility function for the back-of-the-envelopebenchmark and learning models and use a CARA utility function for the others

d The last column attempts to translate the ARA estimates into relative risk aversion We follow the literature and do soby multiplying the ARA estimate by average annual income We use the average annual income (after tax) in Israel in 1995(51168 NIS from the Israeli Census) for all our specifications and we use average disposable income in the United Statesin 1987 ($15437) for Gertner (1993) and Metrick (1995) For Holt and Laury (2002) and Sydnor (2006) we use a similarfigure for 2002 ($26974)

e Holt and Laury (2002) do not report a comparable estimate The estimate we provide above is based on estimating aCARA utility model for the 18 subjects in their experiment who participated in the ldquo90rdquo treatment which involved stakescomparable to our setting For these subjects we assume a CARA utility function and a lognormal distribution of theircoefficient of absolute risk aversion The table reports the point estimate of the mean from this distribution

764 THE AMERICAN ECONOMIC REVIEW JUNE 2007

incomplete-information (discussed later) and toother comparable figures in the literature Ourbenchmark estimate suggests that an averagequadratic utility maximizer36 will be indifferentabout participating in a 50ndash50 lottery in whichhe gains $100 or loses $5605 A CARA spec-ification suggests a significantly lower value foraverage risk aversion making the mean individ-ual indifferent about a 50ndash50 lottery of gaining$100 or losing $7651 The results from theincomplete information model for the meanlevel of risk aversion are in between these esti-mates All the results suggest that although themean individual exhibits a significant level ofrisk aversion heterogeneity in risk preferencesis important and the median individual is al-most risk neutral with respect to lotteries of$100 magnitude Although the mean is alwaysgreater than the median under the lognormaldistribution the large difference we find is notimposed In principle we could have obtained ahigh level of risk aversion with less heteroge-neity thereby leading to a smaller differencebetween the mean and the median (the esti-mated distribution of risk types is an example)

Let us briefly discuss the relevance of thecomparison to Gertner (1993) and Metrick(1995) There are two ways one can reconcilethe differences between the estimates Firstboth these papers measure risk aversion fortelevision game show participants these arehighly selected groups in a rather ldquorisk-friendlyrdquo environment37 Second the magni-tudes of the stakes are higher The showparticipants bet on several thousand dollars andmore while our average individual risks muchlower stakes in the range of $100 Thus thedifference in the results may be due to the issuesraised in Rabin (2000) regarding the compara-bility of behavior across different contexts andbet sizes We discuss this further in Section IV

A different way to quantify our estimate is byreporting them in relative terms There is noconsensus in the literature as to the relevantwealth that is taken into account in such deci-

sions Therefore for comparability we followthe closest papers in this respect (eg Gertner1993) and use annual income as the relevantwealth We multiply the estimated coefficient ofabsolute risk aversion by the average annualincome in Israel during the observation periodUnder the (questionable) assumption that an-nual income is a good proxy for the relevantwealth at the time of decision making thisproduct would be a proxy for the coefficient ofrelative risk aversion As Table 5 indicates ourbenchmark specification results in an impliedcoefficient of relative risk aversion of about 97A CARA specification results in a lower coef-ficient of 44 On the other hand the medianestimate for relative risk aversion is well belowone Thus the widely used estimate of a lowsingle-digit coefficient of relative risk aversionfalls somewhere between the median and themean and between the median and the seventy-fifth percentile of the risk aversion distribution

The Relationship between Unobserved Riskand Unobserved Risk AversionmdashTable 4 al-lows us to make observations about the relation-ship between risk and risk aversion We firstdiscuss the relative importance of unobservedheterogeneity of both dimensions In the popu-lation we study unobserved heterogeneity inrisk aversion (r) is much greater than unob-served heterogeneity in risk () This is trueboth in absolute terms (315 compared to 015respectively) and after normalizing by the cor-responding mean level38 using the coefficientof variation as a measure of dispersion (027compared to 01 respectively) It is also true forthe overall unconditional dispersion This couldindicate that selection on risk aversion is moreimportant in our data than adverse selection39 Theright metric to use for such statements is not

36 For such an individual the second-order Taylor ex-pansion we use in Section IIA is exact

37 We suspect that individuals who participate in televi-sion game shows are more adventuresome than the generalpopulation Moreover knowing that the audience mightwish to see them keep betting is likely to further encourageparticipants to take risks

38 All covariates are measured in deviations from theirsample mean so the estimated constant in each equation isthe estimated mean of the left-hand-side variable

39 An additional observation is that given our estimatesobservables account for slightly less than 50 percent of thevariation in ln(ri) but for almost 65 percent of the variation inln(i ) This may seem surprising given the finding that disper-sion in risk aversion is more important and thus should be thefocus of insurance companies However this finding is con-sistent with the conventional wisdom that insurance companiesspend much effort and resources on collecting information thathelps in risk classification but only little effort on informationthat predicts willingness to pay

765VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

entirely clear however as one should projectthese estimated variances onto the same scaleof say willingness to pay or profits Profits forexample are affected directly by risk but not byrisk aversion so the comparison above could bemisleading Therefore we relegate the discus-sion of this issue to Section IIIF where weshow that even when we look at pricing andprofits heterogeneity in risk aversion is moreimportant

Table 4 also indicates a strong and signif-icant positive correlation of 084 between un-observed risk aversion and unobserved riskThis result may be surprising because it isnatural to think that risk aversion with respectto financial decisions is likely to be associatedwith a greater tendency to take precautionsand therefore with lower risk Indeed a recentpaper by Finkelstein and McGarry (2006)supports such intuition by documenting anegative correlation between risk aversionand risk in the market for long-term careinsurance (see also Israel 2005 for the autoinsurance market in Illinois) Our marketmight however be special in ways that couldproduce a positive correlation First in con-trast to most insurance markets where apolicyholderrsquos risk depends on the policy-holderrsquos precautions but not on the precau-tions of others accident risk in the autoinsurance market is a result of an interactionbetween onersquos driving habits and those ofother drivers Second the correlation coeffi-cient may be highly sensitive to the particularway we measure risk and risk aversion Thereare many unobserved omitted factors that arelikely to be related to both dimensions Theintensity of vehicle use for example mightbe an important determinant of risk If indi-viduals who are more risk averse also drivemore miles per year a positive correlationbetween risk and risk aversion could emergeThus our results caution against assumingthat risk and risk aversion are always nega-tively correlated Whether this is the case maydepend on the characteristics of the particularmarket one studies and on the particular mea-sure for risk Indeed one can use estimatedannual mileage to control for one omittedvariable that may potentially work to producea positive correlation between risk aversionand risk Despite its partial coverage in thedata and being considered (by the company)

as unreliable40 controlling for annual mileagereported by policyholders reduces the esti-mated correlation coefficient to 06841 Weview this result as consistent with the possi-bility that underlying unobserved factors thataffect risk play an important role in generat-ing the estimated positive correlation betweenrisk and risk aversion A third explanation forthe positive estimated correlation is the dis-tributional assumption As discussed in Sec-tion IIC the correlation coefficient isprobably the parameter that is most sensitiveto these assumptions Indeed as discussedlater when we change our assumptions aboutthe Poisson process and use an alternativedistribution with extremely thin tails the es-timated correlation coefficient reverses signsWe of course view the assumptions of thebenchmark model as more appropriate andtherefore maintain the view that the data sug-gest a positive correlation Finally note thatwhile the correlation parameter we estimate ishigh the implied unconditional correlationbetween risk and risk aversion is less than025 across all reported specifications This isbecause the coefficients on the same covariate(for example the size of the car or whetherthe car is used for business) often affect riskand risk aversion in opposite directions andbecause of the logarithmic transformation

C Stability of the Risk Aversion Coefficients

Estimating risk preferences is motivated bythe idea that the same (or similar) risk aversionparameter may explain risky decisions acrossmultiple contexts This idea is at the heart of thecurrent debate we mention in the introductionregarding the empirical relevance of expectedutility and whether the standard construct of a

40 Insurance companies typically do not use these self-reported mileage estimates as they are considered unreli-able While companies could verify these estimates at thetime of a claim such reports are hard to enforce Anindividual can always claim that her ex ante estimate waslower than it turned out to be Indeed the estimated elas-ticity of risk with respect to estimated mileage is 006(0012) which seems low suggesting a bias downwardpotentially due to ldquoerrors in variablesrdquo bias

41 The full results from this specification are providedin the online Appendix Other results from this specifi-cation are similar to those we report for the benchmarkspecification

766 THE AMERICAN ECONOMIC REVIEW JUNE 2007

single risk aversion parameter that appliesacross many types of decisions is the right wayto think about consumer behavior To the bestof our knowledge there is no direct empiricalevidence on this issue While we defer much ofthis discussion to the concluding section weverify below that at least over a limited set ofchoices in which we observe the same individ-uals the estimated risk preferences help to ex-plain choices over time and across differentcontexts

Stability across ContextsmdashIdeally onewould like to show that the estimated risk pref-erences are stable across multiple lines of insur-ance and perhaps even across other riskydecisions individuals face Absent data on suchchoices we provide evidence that the estimatedrisk aversion coefficients help in predictingother related risky choices Individuals in ourdata had to make three additional coveragechoices in addition to the main policy choicethat we have analyzed so far They had tochoose whether to purchase coverage for caraudio equipment (bought by 498 percent of thesample of new policyholders) for towing ser-vices (bought by 929 percent) and for wind-shield damage (bought by 954 percent) Thesetypes of coverage are sold by the same companybut are administered by a third party so we donot have information about their realizationTherefore we cannot perform a similar exerciseto the one we perform for the deductible choiceOf course one should not expect these addi-tional coverage choices to be perfectly corre-lated with the deductible choice Even if thesame risk preferences are an important factorfor all of these decisions variation in risk acrosscoverage is also important For example own-ership of an expensive car audio system is likelyto affect the purchase of audio coverage inde-pendent of risk preferences Similarly ownersof old cars may value more coverage for towingservices Risk preferences however should en-ter into all of these coverage choices and there-fore we expect these additional types ofcoverages to be positively correlated with thechoice of a low deductible

We coded these three additional coveragechoices as three dummy variables and use themas additional covariates to verify that they canhelp explain unobserved risk aversion Firstwe add these variables to the probit regression

reported in column 3 of Table 3 They all havea positive statistically and economicallysignificant power in predicting the deductiblechoice42 To verify that these reported correla-tions are not driven by correlation in risks wealso estimate the benchmark model with theseadditional variables as covariates The esti-mated coefficients on the three additional cov-erage choices are positive and two of the threeare significant in both the risk and risk aversionequations they are 0053 (0019) 00002 (0021)and 0057 (0023) in the risk equation and0354 (0064) 0123 (0122) and 0813 (0174)in the risk aversion equation for audio towingand windshield coverage respectively (standarddeviations in parentheses)43 This suggests thatthe estimated risk preferences help in explainingmultiple coverage choices across these relatedcontexts

Stability over TimemdashWe now provide evi-dence that the estimated risk preferences arealso stable over time All the individuals whodecide to renew their policy with the companyafter it expires are at least in principle free tochange their deductible choice However anoverwhelming majority of individuals (morethan 97 percent) do not change their deductiblechoices when they renew Even individuals whoinitially chose the rarely chosen ldquohighrdquo andldquovery highrdquo deductible levels typically do notchange their choices Therefore it is also notsurprising that estimating the model on the firstdeductible choice of individuals and the seconddeductible choice of the same individuals yieldssimilar results as shown at the end of the nextsection Of course changes in deductiblechoices do not necessarily imply changes inrisk preferences Risk may also change overtime and may drive such changes At thesame time while persistence in deductiblechoices over time is consistent with stablerisk preferences it may also be driven bymany other factors For example it is reason-able to think that individuals do not devote

42 They are estimated to increase on average the prob-ability of choosing a low deductible by 0019 (819) 0023(527) and 0041 (758) for audio towing and windshieldcoverage respectively (z stat in parentheses)

43 The full results from this specification are provided in theonline Appendix Other results from this specification aresimilar to those we report for the benchmark specification

767VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

the same amount of thought to their renewaldecision as to their initial choice leading tothe observed persistence This caveat is themain reason why we focus most of the analysisonly on new policies

D Robustness

This section discusses the robustness of themain results Results from all specifications wemention in this section are summarized in Table6 The full results from these specifications arerelegated to the online Appendix

The von NeumannndashMorgenstern Utility Func-tionmdashTo derive the benchmark model we as-sume a negligible third derivative of the vNMutility function This is attractive as it allowsus to summarize risk preferences by a one-dimensional parameter r [u13(w)u(w)]There is however a large literature (eg MilesS Kimball and N Gregory Mankiw 1989) em-phasizing the importance of a (positive) third

derivative of the utility function which leads toprecautionary saving It is therefore importantto address the sensitivity of the results to thisassumption

By allowing a third-order Taylor approxima-tion of equation (4) we obtain

(15) r u13w

uw

p

d 1

d

uw

uw

dh2 dh dl dl

2

6d

which reduces to equation (7) when u(w) 0This illustrates that a positive third derivativeprovides an additional (precautionary) incentiveto insure It also shows that in order to fullydescribe preferences in the presence of a thirdderivative a one-dimensional parameter doesnot suffice one needs to know both r [u13(w)u(w)] and [u(w)u(w)] This makesit less attractive for estimation

TABLE 6mdashROBUSTNESS

Specificationa Sample Obs

Claim risk () Absolute risk aversion (r)

Corr(r ) Mean Median Std dev Mean Median Std dev

Baseline estimatesBenchmark model All new customers 105800 0220 0217 0048 19 103 73 106 0020 0207 0839

The vNM utility functionCARA utility All new customers 105800 0219 0217 0048 87 104 b 98 106 b 0009b 0201 0826

The claim generating processBenchmark model No multiple claims 103260 0182 0180 0040 20 103 28 105 0018 0135 0547Thinner-tail risk distribution All new customers 105800 0205c 0171c 0155c 17 103 19 106 0020 0076 0916

The distribution of risk aversionLower-bound procedure All new customers 105800 mdash mdash mdash 37 104 0 0002 mdash mdash

Incomplete information about riskBenchmark model Experienced drivers 82966 0214 0211 0051 21 103 83 106 0021 0200 0761Benchmark model Inexperienced drivers 22834 0230 0220 0073 30 103 12 107 0032 0186 0572Learning model All new customers 105800 0204 0191 0084 12 103 16 106 0016 0200 0772

Sample SelectionBenchmark model First two years 45739 0244 0235 0066 31 103 26 105 0026 0225 0699Benchmark model Last three years 60061 0203 0201 0043 16 103 34 107 0021 0113 0611Benchmark model Referred by a friend 26434 0213 0205 0065 30 103 84 107 0031 0155 0480Benchmark model Referred by advertising 79366 0219 0216 0051 21 103 76 106 0022 0212 0806Benchmark model Non-stayers 48387 0226 0240 0057 23 103 77 107 0026 0149 0848Benchmark model Stayers 1st choice 57413 0190 0182 0057 29 103 29 105 0024 0152 0463Benchmark model Stayers 2nd choice 57413 0208 0200 0065 30 103 16 105 0026 0211 0637

Note The table presents the key figures from various specifications and subsamples tracing the order they are presented in Section IIID Full results (in the format

of Table 4) from all these specifications are available in the online Appendixa ldquoBenchmark modelrdquo refers to the benchmark specification estimated on various subsamples (the first row replicates the estimates from Table 4) The other

specifications are slightly different and are all described in more detail in the corresponding parts of Section IIIDb The interpretation of r in the CARA model takes a slightly different quantitative meaning when applied to noninfinitesimal lotteries (such as the approximately

$100 stakes we analyze) This is due to the positive third derivative of the CARA utility function compared to the benchmark model in which we assume a small

third derivative Thus these numbers are not fully comparable to the corresponding figures in the other specificationsc The interpretation of in the thinner-tail distribution we estimate is slightly different from the standard Poisson rate which is assumed in the other specifications

Thus these numbers are not fully comparable to the corresponding figures in the other specifications

768 THE AMERICAN ECONOMIC REVIEW JUNE 2007

We can allow for a positive third derivativewithout expanding the dimensionality of pref-erences by imposing a parametric functionalform on the utility function Essentially suchparametric form imposes a relationship between[u13(w)u(w)] and [u(w)u(w)] Two standardforms are those that exhibit CARA and thosethat exhibit constant relative risk aversion(CRRA) CRRA requires us to make additionalassumptions about the relevant (and unob-served) wealth level of each individual makingit less attractive The CARA case is imple-mented below

With CARA utility we substitute u(w) exp(rw) in equation (4) and rearrange toobtain

(16) rp

exprdh exprdl

This equation defines the indifference set Unlikethe benchmark model there is no closed-formrepresentation for ri() a property that makesestimation significantly slower Due to the precau-tionary saving incentive arising from the thirdderivative we can use a lower level of absoluterisk aversion to rationalize the low deductiblechoice given In other words the CARA indif-ference set is flatter in comparison to the bench-mark case depicted in Figure 2 Thus the CARAspecification will generally lead to lower estimatesof the coefficient of absolute risk aversion asshown in Table 6 The mean and dispersion ofestimated absolute risk aversion (but not the me-dian) are smaller by about a factor of two Thegeneral qualitative results however remain thesame As shown in Table 5 the mean individual isquite risk averse with respect to lotteries of $100magnitude the median is close to risk neutral withrespect to such lotteries and heterogeneity in riskaversion is important

The Claim-Generating ProcessmdashIn thebenchmark model we assume that claims aregenerated by a Poisson model This assumptionis both analytically attractive (it is important inderiving equation (4)) and in our view a rea-sonable approximation of the data-generatingprocess for claims The Poisson distribution hasbeen widely used in economics to model acci-dent counts (eg Nancy L Rose 1990 Dionneand Vanasse 1992 Ronald Michener and CarlaTighe 1992) In using this distribution re-

searchers have followed the long history of theuse of Poisson by actuaries and insurance spe-cialists going back to Ladislaus Bortkiewicz(1898)

An important restriction of the Poisson dis-tribution is that its mean and variance are thesame Although some economic studies con-firmed this assumption for particular accidentdata (eg Christopher M Auld et al 2001) it isoften the case that this restriction is falsifiedThe most common deviation from the Poissonrestriction is that of fat tails ie variance that ishigher than the mean This led researchers touse a negative binomial distribution to accom-modate this regularity by introducing a secondparameter which delinks the relationship be-tween the mean and the variance One naturalinterpretation of the fat tails is that of unob-served heterogeneity and the negative binomialdistribution can be viewed as a Gamma mixtureof Poisson processes Consistent with this viewwe assume that the claim-generating processfollows a Poisson process at the individuallevel but allows unobserved heterogeneity inrisk and estimates a lognormal mixture of Pois-son processes which is similar to a negativebinomial The dispersion parameter we esti-mate is a free parameter which is identi-fied by the fatness of the tails of the claimdistribution

While one would like to allow fat tails of theaggregate claim distribution one may criticizethe assumption of a Poisson distribution at theindividual level for having tails that may be ldquotoofatrdquo Recently Jaap H Abbring et al (2003) andMark Israel (2004) provided evidence for neg-ative state dependence in data from auto insur-ance claims similar to the ones we useControlling for heterogeneity across individu-als these papers show that a second claim isless likely than a first This may happen due toexperience rating or perhaps to more carefuldriving and less intense use of a car after anaccident The Poisson distribution assumes thatthe second accident is just as likely so negativestate dependence may suggest thinner tails atthe individual level

To verify that the main results are not sensi-tive to this restriction of the Poisson model weperform two tests First we estimate the bench-mark model on a sample that includes onlyindividuals with one or no claims The model isstill identified using variation in policy durations

769VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

The estimates of the level of risk aversion andits dispersion across individuals remain similarto those of the benchmark model (Table 6) Theestimates of risk and the correlation with riskaversion are significantly lower But this isexpected we selected out of the sample thehigh-risk individuals

As a further test for the sensitivity of theresults to possible negative state dependencewe estimated the model with a different distri-bution at the individual level which gives riseto much thinner tails We could not find anyknown distribution of count variables that givesrise to tails thinner than Poissonrsquos so we madeone up In particular for a given Poisson ratelet pn be the probability of observing n occur-rences The distribution we take to the data issuch that

(17) Prn pn2

yenm0 pm2

Thus this distribution makes the probability ofmultiple claims much lower probably muchmore so than any realistic negative state depen-dence in the data Essentially such a distribu-tion makes the econometrician view individualswith multiple claims as high-risk individualswith much more certainty as it requires muchmore ldquobad luckrdquo for low-risk individuals tohave multiple claims The estimates for thelevel of risk aversion and its dispersion arealmost the same as in the benchmark model(Table 6) The rest of the summary figures aredifferent The difference in the level of risk isnot informative the interpretation of isslightly different given the change in the un-derlying distribution The dispersion in ismuch higher This is a direct result of the thin-tail assumption The dispersion of is identifiedby the tails of the claim distribution at theaggregate level With thinner tails imposed onthe individual distribution more heterogeneityis needed to match the observed tails at theaggregate Finally the correlation coefficient inthis specification changes signs and is now neg-ative and close to 1 This is consistent withour discussion in Section IIC which emphasizesthat the identification of the correlation coeffi-cient is closely tied to the structural assump-tions Once these are changed the estimatedcorrelation coefficient would change too Most

importantly however the mean and dispersionof risk aversion are stable

The Distribution of Absolute Risk Aver-sionmdashIn the benchmark model we assume thatthe coefficient of absolute risk aversion is lognor-mally distributed across individuals Since onlyfew previous studies focused on heterogeneity inrisk preferences there is not much existing evi-dence regarding the distribution of risk prefer-ences The only evidence we are aware of is theexperimental results presented by Steffen Andersenet al (2005) which show a skewed distributionwith a fat right tail which is qualitatively consis-tent with the lognormal distribution we assume

An additional advantage of the normality as-sumption is computational It provides a closed-form conditional distribution allowing us to useonly univariate (rather than bivariate) draws inthe estimation procedure significantly reducingthe computational burden One may be con-cerned about the sensitivity of the results to thisdistributional assumption For example it maydrive the result that the median level of riskaversion is much lower than the mean

Incorporating alternative distributional as-sumptions significantly complicates and slowsthe estimation procedure44 As an alternativewe develop a procedure that we believe pro-vides some guidance as to the sensitivity of theresults to this distributional assumption Thedisadvantage of the procedure is that it cannotaccount for adverse selection Since we foundthat adverse selection is not that important thisexercise is informative The exercise conveysthat the main qualitative results are not drivenby the (fat) tails of the lognormal distributionwe impose Rather they are driven by the highfraction of individuals who chose a low deduct-ible despite being of low risk The model im-plies that such individuals must have a fairlyhigh level of risk aversion

The exercise uses a Gibbs sampler to estimatethe lognormal distribution of i given the co-variates and the observed number of claims Ineach iteration of this Gibbs sampler conditional

44 A simple discrete-type distribution of risk aversion isnot well identified with the given data This is due to thenature of the exercise The distribution is identified bythe binary deductible choice and by the ldquolocalrdquo variation inthe pricing menu This variation is not enough to pin downthe discrete types accurately

770 THE AMERICAN ECONOMIC REVIEW JUNE 2007

on the data and the most recent draws for theindividual irsquos we compute lower bounds forthe level and dispersion of risk aversion (inaddition to generating the draws for ri from theconditional lognormal distribution) To deter-mine the lower bound for the level of risk aver-sion we compute

(18) r i i 0 if choicei 0

max0 pi

idi 113d i

if choicei 1

Namely we make ri as low as possible giventhe assumptions of the model We assume thatindividuals who chose a regular deductible arerisk neutral while individuals who chose a lowdeductible are just indifferent between the twodeductible levels (unless i is high enough inwhich case they are also assumed to be riskneutral) We then compute the average over therirsquos To compute the lower bound for the dis-persion we search for values of rirsquos that areconsistent with the assumptions of the modeland the observed data and that minimize thevariance of ri This turns out to be a simplesearch procedure that is linear in the number ofindividuals

The lower bound of the level of risk aversionthat we obtain from this exercise is 368 104

NIS1 (with standard deviation of 638 106)This level is five times lower than the analogouspoint estimate reported in Table 4 This trans-lates into indifference about participating in a50ndash50 lottery of gaining $100 and losing $885Thus it still suggests a significant level of riskaversion for lotteries of $100 magnitude Simi-larly the conversion to relative risk aversion asin Table 5 implies a relative risk aversion co-efficient of 1883 The result for the lowerbound of the dispersion is 189 103 (withstandard deviation of 816 105) which is tentimes lower than the benchmark estimate Thusthe coefficient of variation for absolute riskaversion declines only by two suggesting ahigh degree of heterogeneity Finally oneshould note that these bounds involve extremeassumptions and are computed separately andtherefore cannot be binding at the same timeThus the ldquocorrectrdquo estimates for both quantities

are likely to be higher closer to the results wereport for the benchmark model

Incomplete Information about RiskmdashThroughout we assume that individuals haveperfect information about their individual-specific objective risk type i This is a strongerassumption than we need Because expectedutility is linear in probabilities it suffices thatindividualsrsquo expected risk rate is equal to theirobjective risk rate ie i E(iIi) i wherei is a random variable representing individualirsquos perceived risk rate and Ii is individual irsquosinformation at the time of the coverage choiceNamely individuals could be uncertain abouttheir risk type but their point estimate has to becorrect

There are several channels through whichincomplete information may operate Let usconsider two such cases First suppose thatindividuals are correct but only on average iethat i i i where E(i) 0 The intuitionfor this case is similar to an ldquoerrors in variablesrdquomodel and in principle will result in an evenless important role for adverse selection Giventhat we find a relatively small effect of adverseselection this bias will not change this conclu-sion This may be even more pronounced ifcorr(i i) 0 which reflects a reasonableassumption of ldquoreversion to the meanrdquo ie thatan individualrsquos estimate of his risk type is someweighted average between his true risk type andthe average risk type of individuals who aresimilar (on observables) to him The conclusionmay go in the other way only if the mistakes goin the other direction according to which indi-viduals who are riskier than average believe thatthey are even more risky than they truly areThis we believe is less plausible

As a test for the sensitivity of the results tothe complete information assumption we per-form two exercises The first exercise estimatesthe benchmark model separately for experi-enced drivers and inexperienced drivers wherewe define an individual to be an experienceddriver if he has ten years or more driving expe-rience This is a conservative definition com-pared to those used in the literature (Chiapporiand Salanie 2000 Cohen 2005) The underlyingassumption consistent with evidence providedin Cohen (2005) is that drivers learn about theirown risk types as they gain more driving expe-rience but that most of this learning occurs

771VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

within the first ten years of driving Thus ourcomplete information model is a more appeal-ing description of experienced drivers Theresults for experienced drivers are almost iden-tical to the results from the benchmark specifi-cation (Table 6) suggesting that the mainresults are not driven by the fraction of individ-uals who are less likely to know their risk typesThe results for inexperienced drivers show asimilar pattern and magnitude Consistent withthe learning hypothesis however they show alarger dispersion in risk preferences which maybe due to incomplete information and there-fore to more random choices of deductiblelevels

The second exercise we perform is a morestructural version of the learning story whichallows even experienced drivers to have incom-plete information regarding their risk types Wedo so by estimating a different specification ofthe model that assumes that individuals areBayesian and update their information abouttheir own (stable) risk types over time usingonly information about the number of claimsthey make each year While we do not observecomplete claims histories of individuals wecan simulate such histories and integrate overthese simulations Thus individualsrsquo informa-tion would be related to their true type andwould be more precise with longer driving his-tories The full modeling and estimation detailsof this specification are provided in the Appen-dix We view this model as an extreme versionof incomplete information as there are manyother sources through which individuals maylearn about their own types and thereby havebetter information about their types than whatwe estimate them to have While the resultsimply that the levels of risk aversion and heter-ogeneity are lower than the benchmark esti-mates (Table 6) the order of magnitude andqualitative pattern are quite similar suggestingthat the main qualitative findings are robust tothe information structure

Sample SelectionmdashThere are two differentways to think about the contract selection pro-cess One possibility is that individuals firstselect an insurer based on advertisement wordof mouth or access to an agent Then individ-uals select the insurance contract from amongseveral contracts the selected insurer offersThis selection process is consistent with the way

the company executives view their business45

Another possibility is that individuals first col-lect information about all available insurancecontracts and then choose their most preferredone According to the industryrsquos conventionalwisdom individuals do not shop much acrossinsurers and therefore this selection processseems less important

The unique characteristics of direct insurersmay attract individuals who are more likely toexperiment with new ways of doing businessand may therefore be less risk averse than thegeneral population In Table 7 we compare thedemographics of our sample of policyholderswith those of the general Israeli populationThis comparison reflects a similar intuitioncompared with the general population our av-erage policyholder is slightly younger moreeducated more likely to be male and less likelyto be married or an immigrant This direction ofselection may also apply to unobserved riskpreferences thereby making our policyholderson average less risk averse than a representa-tive individual This suggests that the level ofrisk aversion that we find may be viewed as alower bound on the level of risk aversion in thegeneral Israeli population

One potential way to model sample selectionis to allow for an additional outside option to beselected For the vast majority of individuals weobserve the outside option is to purchase sim-ilar insurance from competing insurance agen-cies Unfortunately data on the structure ofcompeting contracts their prices and the waythey vary with individual characteristics are un-available This makes us uncomfortable to try tomodel sample selection as results from anysuch model will be driven by our assumptionsrather than by meaningful variation in the dataThe results are still meaningful for two reasonsFirst this is a large population accounting forabout 7 percent of all drivers in Israel Secondto the extent that our estimates suggest higherlevels of risk aversion than previously esti-mated and that the sample selection is likely to

45 As discussed in Section I the company was the firstdirect insurance provider and it offered significantly lowerpremiums than those offered by competitors due to signif-icant cost advantage In Section I we also discuss theliterature which emphasizes that choice of direct insurers isdriven primarily by nonmonetary ldquoamenitiesrdquo

772 THE AMERICAN ECONOMIC REVIEW JUNE 2007

bias these estimates downward the results arestill informative

To assess the magnitude of sample selectionin driving the main results we perform severaltests First we estimate the benchmark modelseparately for the first two years of the compa-nyrsquos operation during which the company wasthe only company to sell insurance directly andfor the remaining three years after additionaldirect insurers entered the market Second weestimate the benchmark model separately forindividuals who were referred to the companyby word of mouth and for those who heardabout the company through advertisement (pri-marily on television) The latter may havesearched more and might be more price sensi-tive Third we estimate the benchmark modelseparately for individuals who renewed theirpolicy with the company and for those who didnot renew It seems likely that individuals whodid not renew are less selected as most of themswitch back to regular insurers who insure themajority of the population Fourth we estimatethe model for the second deductible choicemade by those individuals who renew It couldbe argued that switching to other companies is

costly so outside options are not as important indriving deductible choices for those who renewAll these results are summarized in Table 6 andshow some important differences between thetwo groups within each pair of subsamples prob-ably reflecting selection For all groups howeverthe qualitative pattern of the results and the orderof magnitude and dispersion of risk aversion aresimilar to those of the full sample This is sugges-tive that correcting for sample selection is unlikelyto change the qualitative results

E Caveats

Moral HazardmdashThroughout our analysiswe abstract from moral hazard ie we assumethat i can vary across individuals but is invari-ant to the coverage choice There are two typesof moral hazard that may play a role in thiscontext First individuals with less coveragemay take greater precaution and drive morecarefully thereby reducing their claim risk rateSecond conditional on a claim event peoplewith higher deductibles are less likely to file aclaim there exists a range of claims for which

TABLE 7mdashREPRESENTATIVENESS

Variable Sampleb Populationc Car ownersd

Agea 4114 (1237) 4255 (1801) 4511 (1413)Female 0316 0518 0367Family Single 0143 0233 0067

Married 0780 0651 0838Divorced 0057 0043 0043Widower 0020 0074 0052

Education Elementary 0029 0329 0266High school 0433 0384 0334Technical 0100 0131 0165College 0438 0155 0234

Emigrant 0335 0445 0447

Obs 105800 723615 255435

a For the age variable the only continuous variable in the table we provide both the mean and the standard deviation (inparentheses)

b The figures are derived from Table 1 The family and education variables are renormalized so they add up to one weignore those individuals for whom we do not have family status or education level This is particularly relevant for theeducation variable which is not available for about half of the sample it seems likely that unreported education levels arenot random but associated with lower levels of education This may help in explaining at least some of the gap in educationlevels across the columns

c This column is based on a random sample of the Israeli population as of 1995 We use only adult population ieindividuals who are 18 years of age or older

d This column is based on a subsample of the population sample The data provide information about car ownership onlyat the household level not at the individual level Thus we define an individual as a car owner if (a) the household owns atleast one car and the individual is the head of the household or (b) the household owns at least two cars and the individualis the spouse of the head of the household

773VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

filing is profitable only under a low deductible(the literature often refers to this second effectas ldquoex post moral hazardrdquo)

To the extent that moral hazard exists abstract-ing from it will likely bias our estimates of riskaversion downward To see this note that adjust-ing behavior will help individuals to self-insureagainst uninsured costs Similarly ex post moralhazard would reduce the value of a low deduct-ible as in the event of a claim the gain from alower deductible would sometimes be less thanthe difference between the two deductible levelsBoth these effects will make a low deductible lessattractive requiring individuals to be even morerisk averse than we estimate in order to purchasemore coverage Below we discuss why in ourview abstracting from moral hazard is a reason-able approximation in this setting

It seems reasonable to conjecture that ceterisparibus insured individuals will drive less care-fully than uninsured ones It may also seemreasonable that the existence of a deductiblemay make individuals more careful about smalldamages to their car However when all choicesinclude a deductible and deductibles are similarin their magnitude it seems less likely thatdriving or care behavior will be affected46 Fi-nally to separately identify moral hazard onewould need another dimension of the data overwhich risk types remain fixed but coveragechoices vary exogenously (Israel 2004 Jaap HAbbring et al forthcoming)

We rely on the data to justify why we abstractfrom the second potential effect that of ex postmoral hazard Figure 3 presents data on theclaim amounts and shows that about 99 percentof the claims filed by policyholders with low-deductible policies were for amounts greaterthan the higher deductible level If the distribu-tion of amounts of potential claims does notvary with the deductible choice and if individ-uals file a claim for any loss that exceeds their

deductible level this suggests that 99 percent ofthe claims would have been filed under eitherdeductible choice making the assumption toabstract from moral hazard not very restrictive

Individuals however may choose not to filea claim even when the claim amount exceedsthe deductible level This may happen due toexperience rating which increases future insur-ance premiums These dynamic effects do notdepend on the deductible level at the time of theclaim so they simply enter in an additive wayUsing our data on individuals who renew theirpolicies with the company we can assess howbig the dynamic effects are These data showthat the price effect of a claim lasts for threeyears and is highest when an individual files hissecond claim within a year In such a case hewould face about a 20 percent increase in hisinsurance premium in the subsequent year 10percent in the year after and 5 percent in thethird year after the claim The regular premiumis about twice the regular deductible amount soan upper bound for the dynamic costs is 70percent of the regular deductible In most casesthe actual dynamic costs are much lower thanthis upper bound (the dynamic costs of say thefirst claim within a year are close to zero) Inaddition an individual can opt out of the con-tract and switch to a different insurance pro-vider This is likely to reduce his dynamic costsbecause in Israel unlike in the United Statesand many other countries there is no publicrecord of past claims Therefore insurance pro-viders can take full advantage of past recordsonly for their past customers For this reasonnew customers will of course face higher pre-miums than existing ones but the premium in-crease would not be as high as it would havebeen with the old insurance provider This isdue to the presence of ldquoinnocentrdquo new custom-ers who are pooled together with the switchers(Cohen 2003)47 Using 70 percent as a conser-vative upper bound Figure 3 shows that about93 percent of those claims filed by individualswith a low deductible would have also been46 This assumption is consistent with Cohen and Einav

(2003) who find no evidence for behavioral response tochanges in seat belt laws The following anecdotal obser-vation also supports this In an informal survey we con-ducted among our colleagues all of them were aware of adeductible in their auto insurance policy but fewer than 20percent knew its level This does not imply that 80 percentof our colleagues did not pay attention to their deductiblechoice at the time the choice was made It does implyhowever that their driving or care behavior cannot dependon the deductible level

47 New customers may voluntarily report their claimshistory to their new insurance provider Voluntary disclo-sure of past claims is as may be expected not truthful Ourdata suggest an unconditional claim rate of 02453 in oursample population Our data on claims history as voluntar-ily disclosed by the same individuals suggest a claim rate of00604 which is four times lower

774 THE AMERICAN ECONOMIC REVIEW JUNE 2007

filed with a regular deductible While this is notnegligible it applies for only a small fraction ofindividuals For the vast majority of them the99 percent discussed above is a more appropri-ate benchmark Therefore ex post moral hazardis unlikely to play a major role in this settingand one can abstract from the loss distributionand focus on claim rates as we do in thispaper48

Additional Cost of an AccidentmdashOur modelassumes that in the event of an accident theonly incurred costs are those associated with thedeductible payment In practice however other

transaction costs may be incurred such as thetime spent for appraisal of the damage the costsassociated with renting a replacement car for theduration of a repair etc Such costs could bereadily incorporated into the model To illus-trate we assume that these costs are known inadvance and are given by a constant c (whichcould in principle vary with each individual)Since c will not vary with the chosen level ofdeductible it will not affect the value d andwill enter the empirical model only through itseffect on d In particular equation (7) willchange to

(19) r

p

d 1

d c

and everything else will remain the same

48 We would not be as comfortable with this statementfor the choice of high and very high deductibles which areat much higher levels This is one additional reason to focusonly on the choice between low and regular deductibles

FIGURE 3 CLAIM DISTRIBUTIONS

Notes This figure plots kernel densities of the claim amounts estimated separately depending on the deductible choice Forease of comparison we normalize the claim amounts by the level of the regular deductible (ie the normalization is invariantto the deductible choice) and truncate the distribution at ten (the truncated part which includes a fat tail outside of the figureaccounts for about 25 percent of the distribution and is roughly similar for both deductible choices) The thick line presentsthe distribution of the claim amounts for individuals who chose a low deductible while the thin line does the same for thosewho chose a regular deductible Clearly both distributions are truncated from below at the deductible level The figure showsthat the distributions are fairly similar Assuming that the claim amount distribution is the same the area below the thickerline between 06 and 1 is the fraction of claims that would fall between the two deductible levels and therefore (absentdynamic incentives) would be filed only if a low deductible were chosen This area (between the two dotted vertical lines)amounts to 13 percent implying that the potential bias arising from restricting attention to claim rate (and abstracting fromthe claim distribution) is quite limited As we discuss in the text dynamic incentives due to experience rating may increasethe costs of filing a claim shifting the region in which the deductible choice matters to the right an upper bound to these costsis about 70 percent of the regular deductible covering an area (between the two dashed vertical lines) that integrates to morethan 7 percent Note however that these dynamic incentives are a very conservative upper bound they apply to less than 15percent of the individuals and do not account for the exit option which significantly reduces these dynamic costs

775VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

This implies that in principle such costs willhave no effect on the results of the counterfac-tual exercise we present later The costs willhowever affect the interpretation of the esti-mates of risk aversion In particular instead ofthe distribution of r we will now be estimatingthe distribution of r(d c)d so the reportedestimates of the coefficient of absolute riskaversion will be biased upward The magnitudeof the bias depends on the size of this transac-tion cost c compared to the average deductibled If c is relatively small the bias is negligibleIf however c is as big as the (average) deduct-ible level all our reported estimates of the levelof risk aversion should be divided by two (butthe coefficients on observables which are semi-elasticities will not change) The intuitionwould be similar but more involved if c variesacross individuals but not proportionally to d

Data about such transaction costs are ofcourse not available The following back-of-the-envelope exercise may provide some guidanceas to the magnitude of such costs We collecteddata from the Israeli police about the annual num-bers of accidents49 accidents with fatalities andaccidents with severe injuries in Israel for theyears 1996ndash1999 We then divided these numbersby our estimate of the total number of auto insur-ance claims in Israel (161859)50 We obtain that159 percent of the claims involve reported acci-dents 22 percent involve accidents with severeinjuries and 03 percent involve fatal accidentsThus the majority of claims reflect small unre-ported accidents perhaps suggesting that theseadditional costs of a claim are often not too large

Deviations from Expected Utility TheorymdashThroughout the paper we restrict attention toexpected utility maximizers Despite much ev-idence in the literature against some of the pre-dictions of expected utility theory it still seemsto us the most natural benchmark to specify andone that facilitates comparison to previous stud-ies We note that expected utility theory is as-sumed it is not and cannot be tested within our

framework Given our cross-sectional analysiswhich in principle allows flexible forms ofunobserved heterogeneity in risk preferencesthere are no testable restrictions imposed byexpected utility theory We should also note thatmuch (but not all) of the documented evidenceagainst expected utility theory arises with ex-treme risk probabilities which are close to zeroor one Our data (and our estimates) are basedon risk probabilities that are all in the range of010 to 035 Over this range expected utilityseems to perform better Finally it is importantto stress two points First at the conceptuallevel it is straightforward to use an alternativetheory of decisions under uncertainty If con-ditional on objective risk individuals vary in asingle dimension the same conceptual modeland empirical strategy can be applied All oneneeds to do is to specify the parameter overwhich decisions vary and construct an indiffer-ence set in the space of the specified parameterand (objective) risk types similar to the onepresented in Figure 2 Second any alternativemodel of decisions under uncertainty would re-quire us to take an even stronger view regardingthe parameterized objective function For exam-ple prospect theory (Daniel Kahneman andAmos Tversky 1979) would require us to pa-rameterize not only the curvature of individu-alsrsquo utility functions but also their referencepoints for which there is no natural choice inour context Similar issues arise if we try toapply decision weights (Tversky and PeterWakker 1995) or measures of overconfidencewith respect to driving ability

F Implications for Profits and Pricing

We now look at how firm profits vary withalternative pricing schemes This exercise isinteresting for several reasons First althoughwe do not use supply-side information for esti-mation it shows how one may incorporate suchinformation in estimation Second it illustratesthe conceptual trade-off faced by a monopolistthat operates in a market with adverse selectionAlthough the conceptual trade-off betweenhigher demand and worse selection is wellknown and has been extensively analyzed in thetheoretical literature quantifying it is importantto understand its empirical relevance Finallywe argued earlier that unobserved heterogeneityin risk aversion seems more important than un-

49 An accident is counted in this measure if it was re-ported to the police and a police officer arrived at the scene

50 This is estimated by taking the number of activepolicies at the end of our sample (45871) dividing it by ourbest guess for the share of the market the company had atthe time (7 percent) and multiplying it by the estimatedclaim rate in our data as computed in Table 2 (0245)

776 THE AMERICAN ECONOMIC REVIEW JUNE 2007

observed heterogeneity in risk The current ex-ercise shows that this conclusion also translatesto pricing and profits

Throughout this section we hold fixed thedistribution of risk and risk aversion the firmfaces Since we have little information about thedeterminants of overall demand faced by thefirm and hence how restrictive it is to hold thedistribution fixed we make the simplifying as-sumption that individuals make their choicessequentially They first choose the insuranceprovider by observing only the price of theregular deductible Once they decide to buy apolicy from the insurer they choose the deduct-ible level This seems a reasonable approxima-tion as the regular deductible is the one alwaysadvertised and initially quoted while the otheroptions are revealed only once the potentialcustomer and the insurance sales person ldquogetinto detailsrdquo Consistent with this assumptionwe assume that the regular premium and de-ductible are dictated by competitive conditionsand we focus on the choice of the low deduct-ible and its associated premium

From the companyrsquos standpoint each indi-vidual can be represented by a random draw of(i ri) from the conditional (on observables)distribution of risk and risk aversion

(20) ln i

ln ri Nxi

xi

2 r

r r2

When analyzing the optimal menu to offer suchan individual the company is assumed to berisk neutral and to maximize expected profitsSuppose the company offered only the regulardeductible and premium (dh ph) Let the ex-pected profits from this strategy be 0 Givenour assumptions we proceed by analyzing howthe firmrsquos profits are affected by a choice of anadditional low deductible option (dl pl) withdl dh and pl ph It is easy to use a changein variables and analyze the choice of d dh dl and p pl ph Expected profits arenow given by

(21) maxdp

0 Prri rii d pp

d Eiri rii d p

The trade-off in the companyrsquos decision isstraightforward Each new customer who chooses

the low combination pays an additional p up-front but saves d for each accident she is in-volved in This translates into two effects thatenter the companyrsquos decision problem The first issimilar to a standard pricing problem higher(lower) price difference (deductible difference)p (d) leads to a higher markup (on those indi-viduals who select the low deductible) but tolower quantity (or probability of purchase) asfewer individuals elect to choose the low deduct-ible This effect enters the profit function throughD(d p) Pr(ri ri(i d p)) The secondcomposition effect arises because of adverse se-lection As the price of the low deductible in-creases those individuals who still elect to choosethe low combination are ceteris paribus thosewith higher risk This effect enters through (dp) E(iri ri(i d p)) Its magnitude andsign depend on the relative heterogeneity of i andri and on the correlation between them Sinceneither D(d p) nor (d p) has a closed-form solution we analyze this decision problemgraphically where D(d p) and (d p) arenumerically computed using simulations from thejoint distribution

We illustrate our analysis by using the meanindividual in the data whose expected ln(i)and ln(ri) are 154 and 1181 respectively(based on Table 4) Such an individual has tochoose between a regular contract of ( phdh) (3189 1595) (in NIS) and a lowcontract of ( pl dl) (3381 957) ie (pd) (191 638) Below we discuss severaladditional counterfactual cases First we con-sider a case with a negative correlation be-tween risk and risk aversion (with the samemagnitude ie 084) Second we con-sider cases when the company ignores unob-served heterogeneity in one of the dimensionsie it views individuals as a draw from theestimated marginal distribution on one dimen-sion with the other dimension known and fixedat its estimated mean We do this exercise foreach dimension separately

To get intuition for the different effects Figure2 presents the estimated distribution in the spaceof (i ri) A small increase (decrease) in p (d)shifts the indifference setup to the right therebymaking some marginal individuals who were pre-viously just to the right of it switch to choosingthe regular deductible The demand trade-off isjust the comparison between the marginal loss ofthe company from all the marginal individuals

777VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

who no longer buy higher coverage with thehigher profits made from the inframarginal indi-viduals who still elect to choose higher coverageFigure 2 also helps in illustrating the effect ofadverse selection and the importance of the cor-relation coefficient As the menu shifts to the rightthe positive correlation implies that the marginalindividuals have higher risk than the average Thismeans that ldquolosingrdquo them (namely havingthem buy less coverage) is not as costly forthe insurance company as such individualsare on average more adversely selected Anegative correlation for example would havemade these marginal individuals more valuablethereby decreasing the incentive to increase pricesor deductibles from the current levels

Figure 4 presents the effects of pricing onprofits by varying the low-deductible levelkeeping the premium charged for it fixed at theobserved price (of p 191) It implies that the

current low deductible benefit of 638 NIS re-sults in additional annual profits of about 368NIS per customer This is about 037 percent oftotal operating profits per customer which isabout 1000 NIS Note however that after sub-tracting the administrative and claim-handlingcosts associated with each customer and claimthe relative magnitude of this effect will behigher Note also that the estimates imply thatthe current low-deductible level is suboptimalBy setting a smaller low-deductible benefit ofd 355 NIS (ie increasing the current lowdeductible by 283 NIS) additional profits canbe increased to 659 NIS51 Of course the find-

51 There is no reason of course to limit the choice of thecompany to only one additional deductible level Moredegrees of freedom in choosing the pricing menu will leadto higher profits

FIGURE 4 COUNTERFACTUALSmdashPROFITS

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) We plot the additional profits fromoffering a low deductible as a function of the attractiveness of a low deductible d dh dl fixing its price at the observedlevel The thick solid line presents the counterfactual profits as implied by the estimated benchmark model The other threecurves illustrate how the profits change in response to changes in the assumptions when the correlation between riskand risk aversion is negative (thin solid line) when there is no heterogeneity in risk aversion (dot-dashed line) and whenthere is no heterogeneity in risk (dashed line) The maxima (argmax) of the four curves respectively are 659 (355)714 (583) 0 and 704 (500) The dotted vertical line represents the observed level of d (638) which implies that theadditional profits from the observed low deductible are 368 NIS per policy

778 THE AMERICAN ECONOMIC REVIEW JUNE 2007

ing that the current pricing is suboptimal mayalso be due to a limitation of the model

Consistent with the intuition discussed aboveFigure 4 also shows that the incentive to in-crease prices (or lower the deductible) is higherwith positive correlation When the correlationbetween risk and risk aversion is negative theoptimal low deductible is lower It also showsthat ignoring either dimension of unobservedheterogeneity has an important effect on pric-ing decisions However while ignoring het-erogeneity in risk does not change thequalitative pattern by much ignoring hetero-geneity in risk aversion completely changesthe pricing analysis In fact given the esti-mated coefficients and the observed prices ifindividuals vary only in risk offering a lowdeductible does not increase profits due to ad-verse selection

Figure 5 breaks down the effect on profits bypresenting the pricing effect on the demand forlow deductible D(d p) and on the compo-sition effect (d p) The former is simplygenerated by the distribution of certainty equiv-alents implied by the joint distribution of i andri (Landsberger and Meilijson 1999) It is S-shaped due to the assumption of lognormal dis-tribution The shape of the composition effect isdriven by the relative variance of i and ri andby the correlation coefficient As the estimatesimply that most of the variation in certaintyequivalents is driven by variation in ri thestrong positive correlation implies that the com-position effect is monotonically decreasing inthe deductible level As the low-deductible op-tion becomes more favorable more peoplechoose it with the most risky individuals choos-ing it first The effect of the deductible level onthe composition effect is dramatically differentwhen the correlation between risk and risk aver-sion is negative When this is the case theobserved relationship between the deductiblelevel and the composition effect is mostly re-versed This is because the effect of risk aver-sion dominates that of adverse selection due toits higher variance

IV Concluding Remarks

The paper makes two separate contributionsFirst from a methodological standpoint we layout a conceptual framework through which one

can formulate and estimate a demand system forindividually customized contracts The key datarequirements for this approach are contractchoices individual choice sets and ex post riskrealizations Since such data may be available inmany other contexts the methodological frame-work may be useful to uncover structural pa-rameters in such settings As an example onecould consider annuity data and use guaranteeperiod choices and mortality data to identifyheterogeneity in risk (mortality) and in prefer-ences for wealth after death (Einav Finkelsteinand Paul Schrimpf 2006) Similarly one couldconsider loan data and use down-paymentchoices and default data to identify heterogene-ity in risk (default) and in liquidity (Will Ad-ams Einav and Jonathan D Levin 2006)Second from an economic standpoint we pro-vide a new set of estimates for the degree andheterogeneity of (absolute) risk aversion and itsrelationship with risk We discuss these below

While our estimates of risk aversion help topredict other related insurance decisions it isnatural to ask to what extent these parametersare relevant in other contexts This is essentiallyan empirical question which can be answeredonly by estimating risk aversion parameters fora variety of bet sizes and in a variety of con-texts Since isolating risk preferences in manycontexts is hard such exercises are rare leavingus with no definite answer for the scope ofmarkets for which our estimates may be rele-vant On one hand Rabin (2000) and Rabin andThaler (2001) argue that different decisions inlife are taken in different contexts and thereforemay be subject to different parameters in theutility function On the other hand classicaltheory suggests that each individual has onevalue function over her lifetime wealth so allrisky decisions take into account the same valuefunction and are therefore subject to the samerisk preferences Our view is somewhere inbetween We are more comfortable with extrap-olation of our risk aversion estimates to setupsthat are closer to the auto insurance marketcontext in which these parameters are esti-mated To assess ldquoclosenessrdquo it is important toconsider various factors over which contextsmay differ Such factors may include bet size aswell as informational and behavioral effectssuch as default options framing and rarity of theevents Since we use claim data to identify risktype we leave everything else to be interpreted as

779VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

FIGURE 5 COUNTERFACTUALSmdashSELECTION

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) Parallel to Figure 4 we breakdown the effects on profits to the share of consumers who choose a low deductible (bottom panel) and to the expected riskof this group (top panel) This is presented by the thick solid line for the estimates of the benchmark model As in Figure 4 wealso present these effects for three additional cases when the correlation between risk and risk aversion is negative (thin solidline) when there is no heterogeneity in risk aversion (dot-dashed line) and when there is no heterogeneity in risk (dashedline) The dotted vertical lines represent the observed level of d (638) for which the share of low deductible is 16 percentand their expected risk is 0264 This may be compared with the corresponding figures in Table 2A of 178 and 0309respectively Note however that the figure presents estimated quantities for the average individual in the data while Table2A presents the average quantities in the data so one should not expect the numbers to fit perfectly

780 THE AMERICAN ECONOMIC REVIEW JUNE 2007

risk aversion As an example overconfidence willbe captured by a lower level of estimated riskaversion so if overconfidence is more importantin auto insurance than in health insurance whenextrapolated to health insurance individuals maybehave as if they are more risk averse than weestimate them to be

With this caveat in mind let us discuss ourfour main findings First we find large hetero-geneity in risk preferences across individualsThis heterogeneity is important in various con-texts (a) it cautions against using pure cross-sectional variation to test expected utilitytheory (b) it may create strong selection ofparticipants into particular markets this maymake participants in voluntary markets quitedifferent in their risk preferences from those inmarkets in which participation is either manda-tory or driven by other factors and (c) it maymake the marginal individual quite differentfrom the average one Models in macroeconom-ics and finance which often use a representativeindividual framework may not be able to cap-ture and account for such differences

Our second set of findings concerns the wayrisk aversion relates to observable characteris-tics Our finding that women are more riskaverse than men has been documented in othersettings The finding that risk preferences ex-hibit a U-shaped pattern over the life cycle maybe interesting to explore further in other con-texts Other findings suggest that the estimatedcoefficient of risk aversion increases with ob-servables that are related to income and wealthAs we agree with the widely held belief of thedecreasing absolute risk aversion property ourpreferred interpretation for this finding is thatwealth and income may be endogenous gener-ating the estimated cross-sectional relationshipWhile lower risk aversion may be associatedwith higher propensity to become an entrepre-neur and thereby have higher wealth it mayalso be associated with lower propensity to saveor invest in education affecting wealth the otherway Therefore one important message of thisfinding is that accounting for heterogeneity inpreferences may be important as representativeconsumer models may provide misleading in-terpretations for otherwise natural results

The last two sets of findings concern therelationship between risk and preferences Sincerisk is particular to the context in which it ismeasured these findings may be sensitive to the

market context and may change once risk takesother forms Even within the auto insurancemarket it is important how risk is measuredMoreover risk in the auto insurance marketmay be conceptually different from risk in othermarkets In many markets risk is independentacross individuals In auto insurance howevermuch of the risk depends on coordinationamong drivers and therefore may be more re-lated to relative not absolute characteristicsFor this reason our finding of positive correla-tion between risk and risk aversion can coexistwith findings of negative correlation in othercontexts (Israel 2005 Finkelstein and McGarry2006) The positive correlation we find is alsoconsistent with the fact that the bivariate probittest in our data provides evidence for adverseselection (Cohen 2005) while similar reduced-form tests in other contexts do not

Finally we find that unobserved heterogene-ity in risk preferences is more important thanheterogeneity in risk This may be driven by thecasual evidence that insurance companies exertmuch effort and resources in collecting con-sumer data which are informative about riskclassification but not about preferences52 Weillustrate the empirical importance of our find-ings for the analysis of optimal contracts in autoinsurance The presence of more than one di-mension of unobserved heterogeneity may dra-matically change the nature of these contractsTheory is still not fully developed for suchmultidimensional screening problems as it typ-ically requires a small number of types (Lands-berger and Meilijson 1999) restricts the twodimensions to be independent of each other(Jean-Charles Rochet and Lars A Stole 2002)or assumes that the number of instrumentsavailable to the monopolist is not smaller thanthe dimension of unobserved heterogeneity(Steven Matthews and John Moore 1987 Rich-ard Arnott and Joseph E Stiglitz 1988)53 MarkArmstrong (1999) may be the closest theoreticalwork to the framework suggested here It cannot

52 This choice of data collection efforts may be justifiedif it is easier for such firms to price discriminate based onrisk but harder to price discriminate based on preferencesA common belief is that without cost-based (ie risk-based) justification for prices price discrimination may leadto consumer backlash

53 See also Smart (2000) Villeneuve (2003) and Jullienet al (2007) for related theoretical results

781VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

be directly applied however as it uses simpli-fying linearity assumptions which would behard to impose in the current context Our re-sults indicate that many applications can benefitfrom extending the theory to include the more

general case such as the one analyzed hereSuch a theory may also serve as a guide forusing supply-side restrictions in similar con-texts Our counterfactual analysis is a very pre-liminary start in this direction

APPENDIX A DESCRIPTION OF THE GIBBS SAMPLER

In this appendix we describe the setup of the Gibbs sampler that we use to estimate the modelOne of the main advantages of the Gibbs sampler is its ability to allow for data augmentation oflatent variables (Tanner and Wong 1987) In our context this amounts to augmenting the individual-specific risk aversion and risk type namely i rii1

n as additional parametersWe can write the model as follows

(22) ln i xi i

(23) ln ri xi vi

(24) choicei 1 if ri ri ()0 if ri ri ()

(25) claimsi Poissoni ti

(26) i

vi

iid

N00

2 r

r r2

Let

2 r

r r2 X

x1

xn

0

0 x1

xn

y r and ui i

vi

The set of parameters for which we want to have a posterior distribution is given by uii1

n The prior specifies that are independent of uii1n have a conventional

diffuse prior We adopt a hierarchical prior for uii1N

(27) uii 1n

iid

N0

(28) 1 Wishart2a Q

782 THE AMERICAN ECONOMIC REVIEW JUNE 2007

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 2: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

contrast the choice among different alternativesthat vary only in their financial parameters (thelevels of deductibles and premiums) is a case inwhich the effect of risk aversion can be moreplausibly isolated and estimated

The average deductible menu in our dataoffers an individual to pay an additional pre-mium of $55 (US) in order to save $182 indeductible payments in the event of a claim2 Arisk-neutral individual should choose a low de-ductible if and only if his claim propensity isgreater than the ratio between the premium($55) and the potential saving ($182) which is03 Although this pricing is actuarially unfairwith respect to the average claim rate of 024518 percent of the sample choose to purchase itAre these individuals exposed to greater riskthan the average individual are they more riskaverse or are they a combination of both Weanswer this question by developing a structuraleconometric model and estimating the joint dis-tribution of risk and risk aversion

Our benchmark specification uses expectedutility theory to model individualsrsquo deductiblechoices as a function of two utility parametersthe coefficient of absolute risk aversion and aclaim rate We allow both utility parameters todepend on individualsrsquo observable and unob-servable characteristics and assume that there isno moral hazard Two key assumptionsmdashthatclaims are generated by a Poisson process at theindividual level and that individuals have per-fect information about their Poisson claimratesmdashallow us to use data on (ex post) realizedclaims to estimate the distribution of (ex ante)claim rates Variation in the deductible menusacross individuals and their choices from thesemenus are then used to estimate the distributionof risk aversion in the sample and the correla-tion between risk aversion and claim risk Thuswe can estimate heterogeneous risk preferencesfrom deductible choices accounting for adverseselection (unobserved heterogeneity in claimrisk) which is an important confoundingfactor3

Our results suggest that heterogeneity inrisk preferences is rather large While themajority of the individuals are estimated to beclose to risk neutral with respect to lotteriesof $100 magnitude a significant fraction ofthe individuals in our sample exhibit signifi-cant levels of risk aversion even with respectto such relatively small bets Overall an in-dividual with the average risk aversion pa-rameter in our sample is indifferent aboutparticipating in a 50 ndash50 lottery in which hegains $100 or loses $56 We find that womenare more risk averse than men that risk pref-erences exhibit a U-shape with respect to ageand interestingly that most proxies for incomeand wealth are positively associated with abso-lute risk aversion

We perform an array of tests to verify thatthese qualitative results are robust to deviationfrom the modeling assumptions In particularwe explore alternative distributional assump-tions alternative restrictions on the von Neu-mannndashMorgenstern (vNM) utility function anda case in which individuals are allowed to makeldquomistakesrdquo in their coverage choices due to in-complete information about their own risk typesWe also show that the risk preferences we es-timate are stable over time and help predictother (but closely related) insurance decisionsFinally we justify our assumption to abstractfrom moral hazard and we discuss the way thisand other features of the setup (sample selectionand additional costs associated with an acci-dent) may affect the interpretation of the result

Throughout we focus primarily on absolute(rather than relative) risk aversion4 This allowsus to take a neutral position with respect to therecent debate over the empirical relevance ofexpected utility theory (Matthew Rabin 2000Rabin and Richard H Thaler 2001 Ariel Ru-binstein 2001 Richard Watt 2002 NicholasBarberis Ming Huang and Thaler 2006 Igna-

2 For ease of comparison we convert many of the re-ported figures from New Israeli Shekels to US dollars It isimportant to keep in mind however that GDP per capita inIsrael was 052ndash056 of that in the United States (067ndash070 when adjusted for PPP) during the observation period

3 Throughout the paper we use the term adverse selec-tion to denote selection on risk while selection on risk

aversion is just selection Some of the literature refers toboth selection mechanisms as adverse selection with thedistinction being between common values (selection on risk)and private values (selection on risk aversion)

4 Primarily as a way to compare our results with otherestimates in the literature Section III also provides esti-mates of relative risk aversion by following the literatureand multiplying our estimates for absolute risk aversion byannual income (in Israel) We obtain high double-digitestimates for the mean individual but below 05 for themedian

746 THE AMERICAN ECONOMIC REVIEW JUNE 2007

cio Palacios-Huerta and Roberto Serrano 2006)While the debate focuses on how the curvatureof the vNM utility function varies with wealthor across different settings we measure thiscurvature only at a particular wealth levelwhatever this wealth level may be By allowingunobserved heterogeneity in this curvatureacross individuals we place no conceptual re-strictions on the relationship between wealthand risk aversion Our estimated distribution ofrisk preferences can be thought of as a convo-lution of the distribution of (relevant) wealthand risk attitudes Avoiding this debate is also adrawback Without taking a stand on the wayabsolute risk preferences vary with sizes andcontexts we cannot discuss how relevant ourestimates are for other settings Obviously wethink they are But since statements about theirexternal relevance are mainly informed by whatwe think and less by what we do we defer thisdiscussion to the concluding section

Our analysis also provides two results regard-ing the relationship between the distribution ofrisk preferences and that of risk First we findthat unobserved heterogeneity in risk aversionis greater and more important (for profits andpricing) than unobserved heterogeneity in riskThis is consistent with the motivation for recenttheoretical work which emphasizes the impor-tance of allowing for preference heterogeneityin analyzing insurance markets (Michael Lands-berger and Isaac Meilijson 1999 Michael Smart2000 David de Meza and David C Webb 2001Bertrand Villeneuve 2003 Pierre-Andre Chiap-pori et al 2006 Bruno Jullien Bernard Salanieand Francois Salanie 2007) Second we findthat unobserved risk has a strong positive cor-relation with unobserved risk aversion and dis-cuss possible interpretations of it This isencouraging from a theoretical standpoint as itretains a single crossing property which weillustrate in the counterfactual exercise Thisfinding contrasts the negative correlation re-ported by Amy Finkelstein and Kathleen Mc-Garry (2006) for the long-term care insurancemarket and by Mark Israel (2005) for automo-bile insurance in Illinois We view these differ-ent results as cautioning against interpreting thiscorrelation parameter outside the context inwhich it is estimated Even if risk preferencesare stable across contexts risk is not and there-fore neither is the correlation structure

This study is related to two important strands

of literature The first shares our main goal ofmeasuring risk aversion Much of the existingevidence about risk preferences is based onintrospection laboratory experiments (Steven JKachelmeier and Mohamed Shehata 1992 Ver-non L Smith and James M Walker 1993Charles A Holt and Susan K Laury 2002) dataon bettors or television game show participants(Robert Gertner 1993 Andrew Metrick 1995Bruno Jullien and Salanie 2000 Roel MBeetsma and Peter C Schotman 2001 MatildeBombardini and Francesco Trebbi 2005) an-swers given by individuals to hypothetical sur-vey questions (W Kip Viscusi and William NEvans 1990 Evans and Viscusi 1991 Robert BBarsky et al 1997 Bas Donkers BertrandMelenberg and Arthur van Soest 2001 JoopHartog Ada Ferrer-i-Carbonell and NicoleJonker 2002) and estimates that are driven bythe imposed functional form relationship be-tween static risk-taking behavior and intertem-poral substitution5 We are aware of only a fewattempts to recover risk preferences from deci-sions of regular market participants Atanu Saha(1997) looks at firmsrsquo production decisionsand Raj Chetty (2006) recovers risk preferencesfrom labor supply In the context of insuranceCicchetti and Dubin (1994) look at individualsrsquodecisions whether to insure against failure ofinterior telephone wires Compared to theirsetting in our setting events are more frequentand commonly observed stakes are higher thepotential loss (the difference between the de-ductible amounts) is known and the deductiblechoice we analyze is more immune to alter-native preference-based explanations Finallyin a recent working paper Justin Sydnor (2006)uses data on deductible choices in homeownerrsquosinsurance to calibrate a bound for the impliedlevel of risk aversion6 An important differencebetween our paper and these papers is thatthey all rely on a representative individual

5 Much of the finance and macroeconomics literaturegoing back to Irwin Friend and Marshall E Blume (1975)relies on this assumption As noted by Narayana R Kocher-lakota (1996) in a review of this literature the level of staticrisk aversion is still a fairly open question

6 The possibility of using deductibles to make inferencesabout risk aversion was first pointed out by Jacques HDreze (1981) Dreze suggests however relying on theoptimality of the observed contracts (ldquosupply siderdquo infor-mation) while we rely on individualsrsquo choices of deduct-ibles (ldquodemand siderdquo information)

747VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

framework and therefore focus only on thelevel of risk aversion7 In contrast we explicitlymodel observed and unobserved heterogeneityin risk aversion as well as in risk We cantherefore provide results regarding the hetero-geneity in risk preferences and its relationshipwith risk which have potentially important im-plications for welfare and policy A representa-tive individual framework cannot address suchquestions

The second strand of related literature isthe recent empirical literature on adverse se-lection in insurance markets Much of thisliterature addresses the important question ofwhether adverse selection exists in differentmarkets As suggested by the influential workof Chiappori and Salanie (2000) it uses ldquore-duced formrdquo specifications to test whether aftercontrolling for observables accident outcomesand coverage choices are significantly corre-lated (Georges Dionne and Charles Vanasse1992 Robert Puelz and Arthur Snow 1994John Cawley and Tomas Philipson 1999Finkelstein and James Poterba 2004 Finkel-stein and McGarry 2006) Cohen (2005) appliesthis test to our data and finds evidence consis-tent with adverse selection As our main goal isquite different we take a more structural ap-proach By assuming a structure for the adverseselection mechanism we can account for itwhen estimating the distribution of risk prefer-ences While the structure of adverse selectionis assumed its relative importance is not im-posed The structural assumptions allow us toestimate the importance of adverse selectionrelative to the selection induced by unobservedheterogeneity in risk attitudes As we discussin Section IIC this approach is conceptuallysimilar to that of James H Cardon and IgalHendel (2001) who model health insurancechoices and also allow for two dimensions ofunobserved heterogeneity8

The rest of the paper is organized as followsSection I describes the environment the setup

and the data Section II lays out the theoreticalmodel and the related econometric model anddescribes its estimation and identification Sec-tion III describes the results We first provide aset of reduced-form estimates which motivatethe more structural approach We then presentestimates from the benchmark specification aswell as estimates from various extensions androbustness tests We discuss and justify some ofthe modeling assumptions and perform counter-factual analysis as a way to illustrate the impli-cations of the results to profits and pricingSection IV concludes by discussing the rele-vance of the results to other settings

I Data

A Economic Environment and Data Sources

We obtained data from a single insurancecompany that operates in the market for auto-mobile insurance in Israel The data containinformation about all 105800 new policyhold-ers who purchased (annual) policies from thecompany during the first five years of its oper-ation from November 1994 to October 1999Although many of these individuals stayed withthe insurer in subsequent years we focusthrough most of the paper on deductible choicesmade by individuals in their first contract withthe company This allows us to abstract fromthe selection implied by the endogenous choiceof individuals whether to remain with the com-pany or not (Cohen 2003 2005)

The company studied was the first company inthe Israeli auto insurance market that marketedinsurance to customers directly rather thanthrough insurance agents By the end of the stud-ied period the company sold about 7 percent ofthe automobile insurance policies issued in IsraelDirect insurers operate in many countries and ap-pear to have a significant cost advantage (J DavidCummins and Jack L Van Derhei 1979) Thestudied company estimated that selling insurancedirectly results in a cost advantage of roughly 25percent of the administrative costs involved inmarketing and handling policies Despite theircost advantage direct insurers generally have haddifficulty in making inroads beyond a part of themarket because the product does not provide theldquoamenityrdquo of having an agent to work with andturn to (Stephen P DrsquoArcy and Neil A Doherty1990) This aspect of the company clearly makes

7 An exception is Syngjoo Choi et al (2006) who use alaboratory experiment and similar to us find a high degreeof heterogeneity in risk attitudes across individuals

8 In an ongoing project Pierre-Andre Chiappori andBernard Salanie (2006) estimate an equilibrium model ofthe French auto insurance market where their model of thedemand side of the market is conceptually similar to the onewe estimate in this paper

748 THE AMERICAN ECONOMIC REVIEW JUNE 2007

the results of the paper applicable only to thoseconsumers who seriously consider buying directinsurance Section IIID discusses this selection inmore detail

While we focus primarily on the demand sideof the market by modeling the deductible choicethe supply side (pricing) will be relevant for anycounterfactual exercise as well as for understand-ing the viability of the outside option (which wedo not observe and do not model) During the firsttwo years of the companyrsquos operations the pricesit offered were lower by about 20 percent thanthose offered by other conventional insurersThus due to its differentiation and cost advantagethe company had market power with respect toindividuals who were more sensitive to price thanto the disamenity of not having an agent Thismakes monopolistic screening models apply morenaturally than competitive models of insurance(eg Michael Rothschild and Joseph E Stiglitz1976) During the companyrsquos third year of oper-ation (December 1996 to March 1998) it facedmore competitive conditions when the estab-lished companies trying to fight off the new en-trant lowered the premiums for policies withregular deductibles to the levels offered by thecompany In their remaining period included inthe data the established companies raisedtheir premiums back to previous levels leav-ing the company again with a substantialprice advantage9

For each policy our dataset includes all theinsurerrsquos information about the characteristicsof the policyholder demographic characteris-tics vehicle characteristics and details abouthis driving experience The Appendix providesa list of variables and their definitions andTable 1 provides summary statistics In addi-tion our data include the individual-specificmenu of four deductible and premium combi-nations that the individual was offered (see be-low) the individualrsquos choice from this menuand the realization of risks covered by the pol-icy the length of the period over which it was ineffect the number of claims submitted by the

policyholder and the amounts of the submittedclaims10 Finally we use the zip codes of thepolicyholdersrsquo home addresses11 to augment thedata with proxies for individualsrsquo wealth basedon the Israeli 1995 census12

The policies offered by the insurer (as all pol-icies offered in the studied market) are one-periodpolicies with no commitment on the part of eitherthe insurer or the policyholder13 The policy re-sembles the US version of ldquocomprehensiverdquo in-surance It is not mandatory but it is held by alarge fraction of Israeli car owners (above 70percent according to the companyrsquos executives)The policy does not cover death or injuries to thepolicyholder or to third parties which areinsured through a separate mandatory policyInsurance policies for car audio equipmentwindshield replacement car and towing ser-vices are structured and priced separately Cer-tain types of coverage do not carry a deductibleand are therefore not used in the analysis14

Throughout the paper we use and reportmonetary amounts in current (nominal) NewIsraeli Shekels (NIS) to avoid creating artificialvariation in the data Consequently the follow-ing facts may be useful for interpretation andcomparison with other papers in the literature

9 During this last period two other companies offeringinsurance directly were established Due to first-mover advan-tage (as viewed by the companyrsquos management) which helpedthe company maintain a strong position in the market thesetwo new companies did not affect pricing policies much untilthe end of our observation period Right in the end of thisperiod the studied company acquired one of these entrants

10 Throughout the analysis we make the assumption thatthe main policyholder is the individual who makes the deduct-ible choice Clearly to the extent that this is not always thecase the results should be interpreted accordingly

11 The company has the addresses on record for billingpurposes Although in principle the company could haveused these data for pricing they do not do so

12 The Israeli Central Bureau of Statistics (CBS) associateseach census respondent with a unique ldquostatistical areardquo eachincluding between 1000 and 10000 residents We matchedthese census tracts with zip codes based on street addressesand constructed variables at the zip code level These con-structed variables are available for more than 80 percent of thepolicyholders As a proxy for wealth we use (gross) monthlyincome which is based on self-reported income by censusrespondents augmented (by the CBS) with Social Security data

13 There is a substantial literature that studies the optimaldesign of policies that commit customers to a multiperiodcontract or that include a one-sided commitment of theinsurer to offer the policyholder certain terms in subsequentperiods (Georges Dionne and Pierre Lasserre 1985 RussellCooper and Beth Hayes 1987 Georges Dionne and Neil ADoherty 1994 Igal Hendel and Alessandro Lizzeri 2003)Although such policies are observed in certain countries(Dionne and Vanasse 1992) many insurance markets includ-ing the one we study use only one-period no-commitmentpolicies (Howard Kunreuther and Mark V Pauly 1985)

14 These include auto theft total loss accidents and notldquoat faultrdquo accidents

749VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

The exchange rate between NIS and US dollarsmonotonically increased from 301 in 1995 to414 in 1999 (on average it was 352)15 Annualinflation was about 8 percent on average andcumulative inflation over the observation periodwas 48 percent We will account for these effects

as well as other general trends by using yeardummy variables throughout the analysis

B The Menu of Deductibles and Premiums

Let xi be the vector of characteristics individuali reports to the insurance company After learningxi the insurer offered individual i a menu of fourcontract choices One option offered a ldquoregularrdquodeductible and a ldquoregularrdquo premium The term

15 PPP figures were about 10 percent lower than the nom-inal exchange rates running from 260 in 1995 to 374 in 1999

TABLE 1mdashSUMMARY STATISTICSmdashCOVARIATES

Variable Mean Std dev Min Max

Demographics Age 41137 1237 1806 8943Female 0316 047 0 1Family Single 0143 035 0 1

Married 0779 042 0 1Divorced 0057 023 0 1Widower 0020 014 0 1Refused to say 0001 004 0 1

Education Elementary 0016 012 0 1High school 0230 042 0 1Technical 0053 022 0 1College 0233 042 0 1No response 0468 050 0 1

Emigrant 0335 047 0 1

Car attributes Value (current NIS)a 66958 37377 4000 617000Car age 3952 287 0 14Commercial car 0083 028 0 1Engine size (cc) 1568 385 700 5000

Driving License years 18178 1007 0 63Good driver 0548 050 0 1Any driver 0743 044 0 1Secondary car 0151 036 0 1Business use 0082 027 0 1Estimated mileage (km)b 14031 5891 1000 32200History length 2847 061 0 3Claims history 0060 015 0 2

Young driver Young 0192 039 0 1Gender Male 0113 032 0 1

Female 0079 027 0 1Age 17ndash19 0029 017 0 1

19ndash21 0051 022 0 121ndash24 0089 029 0 124 0022 015 0 1

Experience 1 0042 020 0 11ndash3 0071 026 0 13 0079 027 0 1

Company year First year 0207 041 0 1Second year 0225 042 0 1Third year 0194 040 0 1Fourth year 0178 038 0 1Fifth year 0195 040 0 1

Note The table is based on all 105800 new customers in the dataa The average exchange rate throughout the sample period was approximately $1 per 35 NIS starting at 13 in late 1994

and reaching 14 in late 1999b The estimated mileage is not reported by everyone It is available for only 60422 new customers

750 THE AMERICAN ECONOMIC REVIEW JUNE 2007

regular was used for this deductible level bothbecause it was relatively similar to the deductiblelevels offered by other insurers and because mostpolicyholders chose it The regular premium var-ied across individuals according to some deter-ministic function (unknown to us) pit ft(xi)which was quite stable over time The regulardeductible level was directly linked to the regularpremium according to

(1) dit min12

pit capt

That is it was set to one-half of the regular pre-mium subject to a deductible cap capt whichvaried over time but not across individuals Thepremiums associated with the other options werecomputed by multiplying pit by three differentconstants 106 for ldquolowrdquo deductible 0875 forldquohighrdquo deductible and 08 for ldquovery highrdquo de-ductible The regular deductible dit was con-verted to the other three deductible levels in asimilar way using multipliers of 06 for low 18for high and 26 for very high

There are two main sources of exogenous vari-ation in prices The first arises from companyexperimentation The multipliers described abovewere fixed across individuals and over time formost of the observation period but there was asix-month period during the insurerrsquos first year ofoperation (May 1995 to October 1995) in whichthe insurer experimented with slightly modifiedmultipliers16 This modified formula covers al-most 10 percent of the sample The secondsource of variation arises from discrete ad-justments to the uniform cap The cap variedover time due to inflation competitive condi-tions and as the company gained more expe-rience (Figure 1) The cap was binding for

16 For individuals with low levels of regular premiumsduring the specified period the regular deductible was set at53 percent (instead of 50 percent) of the regular premiumthe low deductible was set at 33 percent (instead of 30percent) of the regular premium and so on

FIGURE 1 VARIATION IN THE DEDUCTIBLE CAP OVER TIME

Notes This figure presents the variation in the deductible cap over time which is one of the main sources of pricing variation inthe data We do not observe the cap directly but it can be calculated from the observed menus The graph plots the maximalregular deductible offered to anyone who bought insurance from the company over a moving seven-day window Thelarge jumps in the graph reflect changes in the deductible cap There are three reasons why the graph is not perfectlysmooth First in a few holiday periods (eg October 1995) there are not enough sales within a seven-day window sonone of those sales hits the cap This gives rise to temporary jumps downward Second the pricing rule applies at thedate of the price quote given to the potential customer Our recorded date is the first date the policy becomes effectiveThe price quote is held for a period of two to four weeks so in periods in which the pricing changes we may still seenew policies sold using earlier quotes made according to a previous pricing regime Finally even within periods of constantcap the maximal deductible varies slightly (variation of less than 05 percent) We do not know the source of this variation

751VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

about a third of the policyholders in our dataAll these individuals would be affected by achange in the cap Much of the variation ofmenus in the data is driven by the exogenousshifts in the uniform deductible cap The un-derlying assumption is that conditional onobservables these sources of variation pri-marily affect the deductible choice of newcustomers but they do not have a significantimpact on the probability of purchasing insurancefrom the company Indeed this assumption holdsin the data with respect to observables there is nodistinguishable difference in the distribution ofobservable characteristics of consumers who buyinsurance just before and those who buy just aftera change in the deductible cap

C Summary Statistics

The top of Table 2A summarizes the deduct-ible menus all are calculated according to theformula described earlier Only 1 percent of thepolicyholders chose the high or very high de-ductible options Therefore for the rest of theanalysis we focus only on the choice betweenregular and low deductible options (chosen by811 and 178 percent of the individuals respec-tively)17 Focusing only on these options doesnot create any selection bias because we do notomit individuals who chose high or very highdeductibles For these individuals we assumethat they chose a regular deductible This as-sumption is consistent with the structural modelwe develop in the next section which impliesthat conditional on choosing high or very highdeductibles an individual would almost alwaysprefer the regular over the low deductible

The bottom of Table 2A as well as Table2B present summary statistics for the policy re-alizations We focus only on claim rates and noton the amounts of the claims This is because anyamount above the higher deductible level is cov-

ered irrespective of the deductible choice and thevast majority of the claims fit in this category (seeSection IIIE) For all these claims the gain fromchoosing a low deductible is the same in the eventof a claim and is equal to the difference betweenthe two deductible levels Therefore the claimamount is rarely relevant for the deductible choice(and likewise for the companyrsquos pricing decisionwe analyze in Section IIIF)

Averaging over all individuals the annualclaim rate was 0245 One can clearly observesome initial evidence of adverse selection Onaverage individuals who chose a low deductiblehad higher claim rates (0309) than those whochose the regular deductible (0232) Those whochose high and very high deductibles had muchlower claim rates (0128 and 0133 respectively)These figures can be interpreted in the context ofthe pricing formula A risk-neutral individual willchoose the low deductible if and only if her claimrate is higher than (pd) (plow pregular)(dregular dlow) When the deductible cap is notbinding which is the case for about two-thirds ofthe sample this ratio is given directly by thepricing formula and is equal to 03 Thus anyindividual with a claim rate higher than 03 willbenefit from buying the additional coverage pro-vided by a low deductible even without any riskaversion The claim data suggest that the offeredmenu is cheaper than an actuarially fair contractfor a nonnegligible part of the population (13percent according to the benchmark estimates re-ported below) This observation is in sharp con-trast to other types of insurance contracts such asappliance warranties which are much more ex-pensive than the actuarially fair price (Rabin andThaler 2001)

II The Empirical Model

A A Model of Deductible Choice

Let wi be individual irsquos wealth (pih di

h) theinsurance contract (premium and deductible re-spectively) with high deductible (pi

l dil) the

insurance contract with low deductible ti theduration of the policy and ui(w) individual irsquosvNM utility function We assume that the num-ber of insurance claims is drawn from a Poissondistribution with an annual claim rate iThrough most of the paper we assume that i isknown to the individual We also assume that iis independent of the deductible choice ie that

17 The small frequency of ldquohighrdquo and ldquovery highrdquochoices provides important information about the lowerends of the risk and risk aversion distributions but (for thatsame reason) makes the analysis sensitive to functionalform Considering these options or the option of not buyinginsurance creates a sharp lower bound on risk aversion forthe majority of the observations making the estimates muchhigher Given that these options are rarely selected how-ever it is not clear to us whether they were regularlymentioned during the insurance sales process renderingtheir use somewhat inappropriate

752 THE AMERICAN ECONOMIC REVIEW JUNE 2007

there is no moral hazard Finally we assumethat in the event of an accident the value of theclaim is greater than di

h We revisit all theseassumptions in Sections IIID and IIIE For therest of this section i subscripts are suppressedfor convenience

In the market we study insurance policies are

typically held for a full year after which theycan be automatically renewed with no commit-ment by either the company or the individualMoreover all auto-insurance policies sold inIsrael can be canceled without prior notice bythe policyholder with premium payments beinglinearly prorated Both the premium and the

TABLE 2AmdashSUMMARY STATISTICSmdashMENUS CHOICES AND OUTCOMES

Variable Obs Mean Std dev Min Max

Menu Deductible (current NIS)a Low 105800 87548 12101 37492 103911Regular 105800 145299 19779 62486 171543High 105800 260802 35291 112475 308778Very high 105800 376305 50853 162464 446013

Premium (current NIS)a Low 105800 338057 91404 132471 1923962Regular 105800 318922 8623 124972 1815058High 105800 279057 75451 109351 1588176Very high 105800 255137 68984 99978 1452046

pd 105800 0328 006 03 18

Realization Choice Low 105800 0178 038 0 1Regular 105800 0811 039 0 1High 105800 0006 008 0 1Very high 105800 0005 007 0 1

Policy termination Active 105800 0150 036 0 1Canceled 105800 0143 035 0 1Expired 105800 0707 046 0 1

Policy duration (years) 105800 0848 028 0005 108Claims All 105800 0208 048 0 5

Low 18799 0280 055 0 5Regular 85840 0194 046 0 5High 654 0109 034 0 3Very high 507 0107 032 0 2

Claims per yearb All 105800 0245 066 0 19882Low 18799 0309 066 0 9264Regular 85840 0232 066 0 19882High 654 0128 062 0 12636Very high 507 0133 050 0 3326

a The average exchange rate throughout the sample period was approximately $1 per 35 NIS starting at 13 in late 1994and reaching 14 in late 1999

b The mean and standard deviation of the claims per year are weighted by the observed policy duration to adjust forvariation in the exposure period These are the maximum likelihood estimates of a simple Poisson model with no covariates

TABLE 2BmdashSUMMARY STATISTICSmdashCONTRACT CHOICES AND REALIZATIONS

Claims Low Regular High Very high Total Share

0 11929 (0193) 49281 (0796) 412 (0007) 299 (0005) 61921 (100) 080341 3124 (0239) 9867 (0755) 47 (0004) 35 (0003) 13073 (100) 016962 565 (0308) 1261 (0688) 4 (0002) 2 (0001) 1832 (100) 002383 71 (0314) 154 (0681) 1 (0004) 0 (0000) 226 (100) 000294 6 (0353) 11 (0647) 0 (0000) 0 (0000) 17 (100) 000025 1 (0500) 1 (0500) 0 (0000) 0 (0000) 2 (100) 000003

Notes The table presents tabulation of choices and number of claims For comparability the figures are computed only forindividuals whose policies lasted at least 09 years (about 73 percent of the data) The bottom rows of Table 2A providedescriptive figures for the full dataset The numbers in parentheses in each cell represent percentages within each row Theright-hand-side column presents the marginal distribution of the number of claims

753VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

probability of a claim are proportional to thelength of the time interval taken into account soit is convenient to think of the contract choice asa commitment for only a short amount of timeThis approach has several advantages First ithelps to account for early cancellations andtruncated policies which together constitute 30percent of the policies in the data18 Second itmakes the deductible choice independent ofother longer-term uncertainties faced by the in-dividual so we can focus on static risk-takingbehavior Third this formulation helps to obtaina simple framework for analysis which is attrac-tive both analytically and computationally19

The expected utility that the individual obtainsfrom the choice of a contract (p d) is given by

(2) v p d

1 tuw pt tuw pt d

We characterize the set of parameters that willmake the individual indifferent between the twooffered contracts This set provides a lower (up-per) bound on the level of risk aversion for indi-viduals who choose the low (high) deductible (fora given ) Thus we analyze the equation v(phdh) v(pl dl) By taking limits with respect to t(and applying LrsquoHopitalrsquos rule) we obtain

(3)

limt30

1

t(u(w pht) u(w plt))

(u(w pht) u(w pht dh)) (u(w plt) u(w plt dl))

pl phuw

uw dl uw dh

or

(4)

pl phuw uw dl uw dh

The last expression has a simple intuition Theright-hand side is the expected gain (in utils) perunit of time from choosing a low deductibleThe left-hand side is the cost of such a choiceper unit of time For the individual to be indif-ferent the expected gains must equal the costs

In our benchmark specification we assumethat the third derivative of the vNM utilityfunction is not too large A Taylor expansion forboth terms on the right-hand side of equation(4) ie u(w d) u(w) du(w) (d22)u13(w) implies that

(5)pl ph

uw dh dluw

1

2dh dldh dlu13w

Let d dh dl 0 p pl ph 0 andd 1frasl2 (dh dl) to obtain

(6)p

duw uw d u13w

or

(7) r u13w

uw

p

d 1

d

where r is the coefficient of absolute risk aver-sion at wealth level w

18 As can be seen in Table 2A 70 percent of the policiesare observed through their full duration (one year) About15 percent are truncated by the end of our observationperiod and the remaining 15 percent are canceled for var-ious reasons such as change in car ownership total-lossaccident or a unilateral decision of the policyholder tochange insurance providers

19 This specification ignores the option value associatedwith not canceling a policy This is not very restrictiveSince experience rating is small and menus do not changeby much this option value is likely to be close to zero Asimple alternative is to assume that individuals behave as ifthey commit for a full year of coverage In such a case themodel will be similar to the one we estimate but willdepend on the functional form of the vNM utility functionand would generally require taking infinite sums (over thepotential realizations for the number of claims within the year)In the special case of quadratic expected utility maximizerswho care only about the mean and variance of the number ofclaims this is easy to solve The result is almost identical to theexpression we subsequently derive in equation (7)

754 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Equation (7) defines an indifference set in thespace of risk and risk aversion which we willrefer to by (r() ) and ((r) r) interchange-ably Both r() and (r) have a closed-formrepresentation a property that will be computa-tionally attractive for estimation20 Both termsare individual specific as they depend on thedeductible menu which varies across individu-als For the rest of the paper we regard eachindividual as associated with a two-dimensionaltype (ri i) An individual with a risk parameter

i who is offered a menu (pih di

h) (pil di

l)will choose the low-deductible contract if andonly if his coefficient of absolute risk aversionsatisfies ri ri(i) Figure 2 presents a graphi-cal illustration

B The Benchmark Econometric Model

The Econometric ModelmdashThe econometricmodel we estimate is fully described by the fiveequations presented in this section Our ob-jective is to estimate the joint distribution of(i ri)mdashthe claim rate and coefficient of abso-lute risk aversionmdashin the population of policy-holders conditional on observables xi The

20 For example estimating the CARA version of themodel (Section IIID) for which r() does not have aclosed-form representation takes almost ten times longer

FIGURE 2 THE INDIVIDUALrsquoS DECISIONmdashA GRAPHICAL ILLUSTRATION

Notes This graph illustrates the individualrsquos decision problem The solid line presents the indifference setmdashequation (7)mdashappliedfor the menu faced by the average individual in the sample Individuals are represented by points in the two-dimensional spaceabove In particular the scattered points are 10000 draws from the joint distribution of risk and risk aversion for the averageindividual (on observables) in the data based on the point estimates of the benchmark model (Table 4) If an individual is eitherto the right of the line (high risk) or above the line (high risk aversion) the low deductible is optimal Adverse selection is capturedby the fact that the line is downward sloping higher-risk individuals require lower levels of risk aversion to choose the lowdeductible Thus in the absence of correlation between risk and risk aversion higher-risk individuals are more likely to choosehigher levels of insurance An individual with i (pidi) will choose a lower deductible even if he is risk neutral ie withprobability one (we do not allow individuals to be risk loving) This does not create an estimation problem because i is notobserved only a posterior distribution for it Any such distribution will have a positive weight on values of i that are below(pidi) Second the indifference set is a function of the menu and in particular of (pidi) and d An increase in (pidi) willshift the set up and to the right and an increase in d will shift the set down and to the left Therefore exogenous shiftsof the menus that make both arguments change in the same direction can make the sets ldquocrossrdquo thereby allowing us toidentify the correlation between risk and risk aversion nonparametrically With positive correlation (shown in the figureby the ldquoright-bendingrdquo shape of the simulated draws) the marginal individuals are relatively high risk therefore creatinga stronger incentive for the insurer to raise the price of the low deductible

755VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

benchmark formulation assumes that (i ri) fol-lows a bivariate lognormal distribution Thuswe can write the model as

(8) ln i xi i

(9) ln ri xi vi

with

(10) i

vi

iid

N00

2 r

r r2

Neither i nor ri is directly observed Thereforewe treat both as latent variables Loosely speak-ing they can be thought of as random effects Weobserve two variables the number of claims andthe deductible choice which are related to thesetwo unobserved components Thus to completeour econometric model we have to specify therelationship between the observed variables andthe latent ones This is done by making two struc-tural assumptions First we assume that the num-ber of claims is drawn from a Poisson distributionnamely

(11) claimsi Poisson(i ti )

where ti is the observed duration of the policySecond we assume that when individuals maketheir deductible choices they follow the theoreti-cal model described in the previous section Themodel implies that individual i chooses the lowdeductible (choicei 1) if and only if ri ri(i)where ri is defined in equation (7) Thus theempirical model for deductible choice is given by

(12) Prchoicei 1 Prri

pi

idi 1

d i

Prexp(xi vi)

pi

exp(xi i)di 1

d i

With no unobserved heterogeneity in risk (i 0) equation (12) reduces to a simple probit In

such a case one can perfectly predict risk from thedata denote it by (xi) and construct an addi-tional covariate ln([pi((xi)di) 1]d i) Giventhe assumption that risk aversion is distributedlognormally running the probit regressionabove and renormalizing the coefficient onthe constructed covariate to 1 (instead ofthe typical normalization of the variance of theerror term to 1) has a structural interpretation withln (ri) as the dependent variable However Cohen(2005) provides evidence of adverse selection inthe data implying the existence of unobservedheterogeneity in risk This makes the simple probitregression misspecified Estimation of the fullmodel is more complicated Once we allow forunobserved heterogeneity in both unobserved riskaversion (vi) and claim rate (i) we have to inte-grate over the two-dimensional region depicted inFigure 2 for estimation

EstimationmdashA natural way to proceed is toestimate the model by maximum likelihoodwhere the likelihood of the data as a function ofthe parameters can be written by integrating outthe latent variables namely

(13) Lclaimsi choicei

Prclaimsi choiceii riPri ri

where is a vector of parameters to be esti-mated While formulating the empirical modelusing likelihood may help our thinking aboutthe data-generating process using maximumlikelihood (or generalized method of moments(GMM)) for estimation is computationally cum-bersome This is because in each iteration itrequires evaluating a separate complex integralfor each individual in the data In contrastMarkov Chain Monte Carlo (MCMC) Gibbssampling is quite attractive in such a case Us-ing data augmentation of latent variables (Mar-tin A Tanner and Wing Hung Wong 1987)according to which we simulate (i ri) and latertreat those simulations as if they are part ofthe data one can avoid evaluating the complexintegrals by sampling from truncated normaldistributions which is significantly less compu-tationally demanding (eg Luc Devroye 1986)This feature combined with the idea of aldquosliced samplerrdquo (Paul Damien John Wake-field and Stephen Walker 1999) to sample from

756 THE AMERICAN ECONOMIC REVIEW JUNE 2007

an unfamiliar posterior distribution makes theuse of a Gibbs sampler quite efficient for ourpurposes Finally the lognormality assumptionimplies that F(ln(i)ri) and F(ln(ri)i) follow a(conditional) normal distribution allowing us torestrict attention to univariate draws furtherreducing the computational burden

The Appendix provides a full description ofthe Gibbs sampler including the conditionaldistributions and the (flat) prior distributions weuse The basic intuition is that conditional onobserving (i ri) for each individual we have asimple linear regression model with two equa-tions The less standard part is to generate drawsfor (i ri) We do this iteratively Conditionalon i the posterior distribution for ln(ri) fol-lows a truncated normal distribution where thetruncation point depends on the menu individ-ual i faces and its direction (from above orbelow) depends on individual irsquos deductiblechoice The final step is to sample from theposterior distribution of ln(i) conditional on riThis is more complicated as we have bothtruncation which arises from adverse selection(just as we do when sampling for ri) and thenumber of claims which provides additionalinformation about the posterior of i Thus theposterior for i takes an unfamiliar form forwhich we use a ldquosliced samplerrdquo

We use 100000 iterations of the Gibbs sam-pler It seems to converge to the stationarydistribution after about 5000 iterations There-fore we drop the first 10000 draws and use thelast 90000 draws of each variable to report ourresults The results are based on the posteriormean and posterior standard deviation fromthese 90000 draws Note that each iterationinvolves generating separate draws of (i ri) foreach individual Performing 100000 iterationsof the benchmark specification (coded in Mat-lab) takes about 60 hours on a Dell Precision530 workstation

C Identification

The parametric version of the model is iden-tified mechanically There are more equationsthan unknowns and no linear dependenciesamong them so (as also verified using MonteCarlo exercises) the model parameters can bebacked out from simulated data Our goal in thissection is not to provide a formal identificationproof Rather we want to provide intuition for

which features of the data allow us to identifyparticular parameters of the model The discus-sion also highlights the assumptions that areessential for identification vis-a-vis those thatare made only for computational convenience(making them in principle testable)

Discussion of Nonparametric Identifica-tionmdashThe main difficulty in identifying themodel arises from the gap between the (exante) risk type i which individuals usewhen choosing a deductible and the (ex post)realization of the number of claims we ob-serve We identify between the variation inrisk types and the variation in risk realizationsusing our parametric distributional assump-tions The key is that the distribution of risktypes can be uniquely backed out from theclaim data alone This allows us to use thedeductible choice as an additional layer ofinformation which identifies unobserved het-erogeneity in risk aversion21 Any distribu-tional assumption that allows us to uniquelyback out the distribution of risk types fromclaim data would be sufficient to identify thedistribution of risk aversion As is customaryin the analysis of count processes we make aparametric assumption that claims are gener-ated by a lognormal mixture of Poisson dis-tributions (Section IIID discusses this furtherand explores an alternative) Using a mixtureenables us to account for adverse selectionthrough unobserved heterogeneity in risk Italso allows us to better fit the tails of theclaim distribution In principle a more flexi-ble mixture or a more flexible claim-generat-ing process could be identified as long as theclaims data are sufficiently rich22

21 Cardon and Hendel (2001) face a similar identificationproblem in the context of health insurance They use vari-ation in coverage choice (analogous to deductible choice) toidentify the variation in health-status signals (analogous torisk types) from the variation in health expenditure (analo-gous to number of claims) They can rely on the coveragechoice to identify this because they make an assumptionregarding unobserved heterogeneity in preferences (iidlogit) We take a different approach as our main goal is toestimate rather than assume the distribution of preferences

22 Although it may seem that the claim data are limited(as they take only integer values between 0 and 5 in ourdata) variation in policy duration generates continuousvariation in the observed claim propensity Of course thisvariation also introduces an additional selection into themodel due to policy cancellations which are potentially

757VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Once the distribution of risk types is iden-tified from claims data the marginal distribu-tion of risk aversion (and its relationship tothe distribution of risk types) is nonparametri-cally identified from the variation in theoffered menus discussed in Section I Thisvariation implies different deductible and pre-mium options to identical (on observables)individuals who purchased insurance at dif-ferent times Different menus lead to differentindifference sets (similar to the one depictedin Figure 2) These sets often cross each otherand nonparametrically identify the distribu-tion of risk aversion and the correlation struc-ture at least within the region in which theindifference sets vary For the tails of thedistribution as is typically the case we haveto rely on parametric assumptions or usebounds The parametric assumption of log-normality we use for most of the paper ismade only for computational convenience

Intuition for the Parametric IdentificationMechanismmdashVariation in the offered menus isimportant for the nonparametric identificationThe parametric assumptions could identify themodel without such variation Thus to keep theintuition simple let us take the bivariate log-normal distribution as given and contrary to thedata assume that all individuals are identical onobservables and face the same menu Supposealso that all individuals are observed for exactlyone year and have up to two claims23 In thissimplified case the model has five parametersto be estimated the mean and variance of risk and

2 the mean and variance of risk aver-sion r and r

2 and the correlation parameter The data can be summarized by five numbersLet 130 131 and 132 1 131 130 be thefraction of individuals with zero one and twoclaims respectively Let 0 1 and 2 be theproportion of individuals who chose a low de-ductible within each ldquoclaim grouprdquo Given ourdistributional assumption about the claim-

generating process we can use 130 and 131 touniquely identify and

2 Loosely isidentified by the average claim rate in the dataand

2 is identified by how fat the tail of theclaim distribution is ie by how large (132131)is compared to (131130) Given and

2 and thelognormality assumption we can (implicitly)construct a posterior distribution of risk typesfor each claim group F(r claims c) andintegrate over it when predicting the deductiblechoice This provides us with three additionalmoments each of the form

(14) Ec Prchoice 1r

dFr claims c dFr

for c 0 1 2 These moments identify thethree remaining parameters of the model rr

2 and Let us now provide more economic content to

the identification argument Using the same ex-ample and conditional on identifying and

2

from the claim data one can think about thedeductible choice data 0 1 2 as a graph(c) The absolute level of the graph identifiesr In the absence of correlation between riskand risk aversion the slope of the graph iden-tifies r

2 with no correlation the slope shouldalways be positive (due to adverse selection)but higher r

2 would imply a flatter graph be-cause more variation in the deductible choiceswill be attributed to variation in risk aversionFinally is identified by the curvature of thegraph The more convex (concave) the graph isthe more positive (negative) is the estimated For example if 0 05 1 051 and 2 099 it is likely that r

2 is high (explaining why0 and 1 are so close) and is highly positive(explaining why 2 is not also close to 1) Incontrast if 0 1 it must mean that thecorrelation between risk and risk aversion isnegative which is the only way the originalpositive correlation induced by adverse selec-tion can be offset This intuition also clarifiesthat the identification of relies on observingindividuals with multiple claims (or differentpolicy durations) and that it is likely to besensitive to the distributional assumptions Thedata (Table 2B) provide a direct (positive) cor-relation between deductible choice and claims

endogenous The results are similar when we use onlyexpired and truncated policies

23 This variation is sufficient (and necessary) to identifythe benchmark model The data provide more variation weobserve up to five claims per individual we observe con-tinuous variation in the policy durations we observe vari-ation in prices and we exploit distributional restrictionsacross individuals with different observable characteristics

758 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The structural assumptions allow us to explainhow much of this correlation can be attributedto adverse selection The remaining correlation(positive or negative) is therefore attributed tocorrelation in the underlying distribution of riskand risk aversion

III Results

A Reduced-Form Estimation

To get an initial sense for the levels of abso-lute risk aversion implied by the data we use asimple back-of-the-envelope exercise We com-pute unconditional averages of p d dh dland d (Table 2A)24 and substitute these valuesin equation (7) The implied coefficient of ab-solute risk aversion is 29 104 NIS125 Thisfigure could be thought of as the average indif-ference point implying that about 18 percent ofthe policyholders have coefficients of absoluterisk aversion exceeding it To convert to USdollar amounts one needs to multiply thesefigures by the average exchange rate (352)resulting in an average indifference point of102 103 $US1 This figure is less than halfof a similar estimate reported by Sydnor (2006)for buyers of homeownerrsquos insurance but about3 and 13 times higher than comparable fig-ures reported by Gertner (1993) and Metrick(1995) respectively for television game showparticipants

Table 3 provides reduced-form analysis ofthe relationship between the observables andour two left-hand-side variables the number ofclaims and the deductible choice26 Column 1reports the estimates from a Poisson regressionof the number of claims on observed character-istics This regression is closely related to therisk equation we estimate in the benchmarkmodel It shows that older people women andpeople with a college education are less likely to

have an accident Bigger more expensiveolder and noncommercial cars are more likelyto be involved in an accident Driving experi-ence and variables associated with less intenseuse of the car reduce accident rates As could beimagined claim propensity is highly correlatedover time past claims are a strong predictor offuture claims Young drivers are 50 to 70 per-cent more likely to be involved in an accidentwith young men significantly more likely thanyoung women Finally as indicated by the trendin the estimated year dummies the accident ratesignificantly declined over time Part of thisdecline is likely due to the decline in accidentrates in Israel in general27 This decline mightalso be partly due to the better selection ofindividuals the company obtained over time asit gained more experience (Cohen 2003)

Columns 2 and 3 of Table 3 present estimatesfrom probit regressions in which the dependentvariable is equal to one if the policyholder chosea low deductible and is equal to zero otherwiseColumn 3 shows the marginal effects of thecovariates on the propensity to choose a lowdeductible These marginal effects do not have astructural interpretation as the choice of lowdeductible depends on its price on risk aver-sion and on risk In this regression we againobserve a strong trend over time Fewer policy-holders chose the low deductible as time wentby One reason for this trend according to thecompany executives is that over time the com-panyrsquos sales persons were directed to focusmainly on the ldquodefaultrdquo regular deductible op-tion28 The effect of other covariates will bediscussed later in the context of the full modelIn unreported probit regressions we also test thequalitative assumptions of the structural model byadding three additional regressors the price ratio(pidi) the average deductible offered d i

24 The unconditional is computed by maximum likeli-hood using the data on claims and observed durations of thepolicies

25 Using the CARA specification as in equation (16) weobtain a slightly lower value of 25 104 NIS1

26 We find positive correlation in the errors of these tworegressions suggesting the existence of adverse selection inthe data and motivating a model with unobserved heteroge-neity in risk This test is similar to the bivariate probit testproposed by Chiappori and Salanie (2000) and replicatesearlier results reported in Cohen (2005)

27 In particular traffic fatalities and counts of trafficaccidents in Israel fell by 11 percent and 18 percent during1998 and 1999 respectively

28 Such biased marketing efforts will bias consumersagainst choosing the low deductible thus making them lookless risk averse This would make our estimate a lowerbound on the true level of risk aversion If only sophisti-cated consumers could see beyond the marketing effort andthis sophistication were related to observables (eg educa-tion) the coefficients on such observables would be biasedupward This is not a major concern given that the coefficientson most of the covariates are fairly stable when we estimatethe benchmark model separately for different years

759VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

TABLE 3mdashNO HETEROGENEITY IN RISK

Variable

Poisson regressiona Probit regressionsc

Dep var Number of claims Dep var 1 if low deductible

(1) (2) (3)

Coeff Std Err IRRb Coeff Std Err dPdX dPdXDemographics Constant 43535 02744 mdash 224994 23562 mdash

Age 00074 00052 09926 01690 00464 00034 00042Age2 989 105 536 105 10001 00017 00005 339 105 455 105Female 00456 00161 09555 08003 01316 00165 00134Family Single omitted omitted omitted

Married 01115 00217 08945 06417 01950 00128 00045Divorced 00717 00346 10743 03191 03127 00064 00044Widower 00155 00527 10156 03441 04573 00069 00025Other (NA) 01347 01755 11442 29518 19836 00511 00370

Education Elementary 00522 00550 09491 02927 04278 00061 00016High school omitted omitted omittedTechnical 00373 00308 10380 05037 02520 00105 00138College 00745 00197 09282 05279 01546 00109 00053Other (NA) 00116 00184 10116 00630 01540 00013 00025

Emigrant 00210 00163 10213 00497 01328 00010 00003

Car attributes Log(Value) 01227 00281 11306 13285 02342 00271 00299Car age 00187 00042 10189 01603 00357 00033 00017Commercial car 01394 00326 08699 09038 02781 00177 00294Log(engine size) 02972 00459 13461 06924 03952 00141 00075

Driving License years 00204 00034 09798 01043 00312 00021 00005License years2 00002 677 105 10002 00015 00005 299 105 167 105

Good driver 00176 00191 09825 07207 01574 00148 00152Any driver 00564 00169 10580 10923 01321 00217 00258Secondary car 00859 00209 09177 00038 01626 00001 00070Business use 01852 00293 12034 07951 02737 00156 00017History length 00527 00110 09486 11218 01450 00228 00171Claims history 06577 00390 19304 04654 05460 00095 00496

Young driver Young driver 05235 00361 16879 28847 06305 00524 00012Gender Male omitted omitted omitted

Female 01475 00288 08629 17959 03135 00396 00195Age 17ndash19 omitted omitted omitted

19ndash21 00701 00532 10726 07800 07509 00153 0014721ndash24 00267 00574 09737 05746 07773 00114 0015624 02082 00567 0812 17869 07328 00397 00136

Experience 1 omitted omitted omitted1ndash3 02416 00458 07854 10175 06577 00217 000093 02827 00532 07538 32513 07386 00762 00410

Company year First year omitted omitted omittedSecond year 00888 00198 09150 44513 01506 00792 00859Third year 00690 00222 09334 85888 01820 01303 01367Fourth year 01817 00232 08339 118277 02102 01616 01734Fifth year 05431 0028 05810 143206 02778 01920 02085

105586Obs 105800 94000 105800Pseudo R2 00162 01364 01296Log likelihood 577459 369595 430869

Significant at the 5 percent confidence levela Maximum likelihood estimates Variation in exposure (policy duration) is accounted forb IRR Incidence rate ratio Each figure should be interpreted as the increasedecrease in claim probability as a result of an increase of one unit in the

right-hand-side variablec There are two separate probit regressions reported in this table Column 2 relies on the deductible choice model and the lognormality assumption As discussed

in the text by including an additional regressor ln([pi((xi)di) 1]d i) (with (xi) predicted from column 1 above) and normalizing its coefficient to 1 we obtaina structural interpretation of this regression with ln (ri) as the dependent variable Thus the reported coefficients are comparable to those estimated for the benchmarkmodel One should be cautious however in interpreting these coefficients Unlike the benchmark model this regression does not allow unobserved heterogeneity inrisk and suffers from some selection bias because observations with high predicted risk rate are omitted (which is why the number of observations is 94000 ratherthan the full sample of 105800) For comparison column 3 reports the marginal effects from a comparable probit regression that uses the full sample and does notcontrol for pricing and predicted risk through the additional structural regressor Column 3 does not have a structural interpretation and its (unreported) coefficientscannot be compared to those estimated from the benchmark model

760 THE AMERICAN ECONOMIC REVIEW JUNE 2007

and the risk rate (xi) as predicted from thePoisson regression of column 1 All three addi-tional regressors enter with the predicted signand with large and highly significant marginaleffects29

Finally column 2 of Table 3 presents animportant specification of the probit regres-sion in which ln([pi(( xi)di) 1]d i) isadded as a regressor and its coefficient isnormalized to 130 As already mentioned ifwe assume that there is no unobserved heter-ogeneity in risk then column 2 is analogousto the ln(ri) equation of the benchmark modelThis restriction of the model is rejected by thedata About 10 percent of the individuals arepredicted to have (xi) (pi di) implying achoice of low deductible for any level of riskaversion Many of these individuals howeverstill choose higher deductible levels Column2 reports the results for the remaining indi-viduals (94000 out of 105800) for which theregression can be run While the signs of theestimated coefficients are similar to those inthe benchmark model presented below therestricted version of the model suggests muchhigher levels and dispersion of risk aversionwell above any reasonable level31 The fullestimation of the benchmark model clearly re-jects this restriction on the model

B Estimation of the Benchmark Model

The Effect of Individual Characteristics onRisk AversionmdashTable 4 presents the estimationresults from the benchmark model The secondcolumn shows how the level of absolute riskaversion is related to individual characteristicsAs the dependent variable is in natural loga-

rithm coefficients on dummy variables can bedirectly interpreted as approximate percentagechanges The vast majority of these coefficientsare quite stable across a wide range of specifi-cations which are mentioned later

The results indicate that women are more riskaverse than men and have a coefficient of ab-solute risk aversion about 20 percent greaterthan that of men These results are consistentwith those of Donkers et al (2001) and Hartoget al (2002) The estimated effect of age sug-gests a nonmonotone pattern of risk preferencesover the life cycle The estimated coefficientsimply that initially (that is at age 18 the youngestindividual in the data) individuals become less riskaverse with age but around the age of 48 indi-viduals start becoming more risk averse32 Mar-ried individuals are estimated to be significantlymore risk averse compared to singles while di-vorced individuals are less (although the coeffi-cient is statistically insignificant)

The analysis suggests that variables that arelikely to be correlated with income or wealthsuch as post-high-school education and thevalue of the car have a positive coefficientindicating that wealthier people have higherlevels of absolute risk aversion Although we donot have data on individualsrsquo income or wealthwe use other proxies for income in additionalspecifications and obtain mixed results Whenwe include as a covariate the average householdincome among those who live in the same zipcode we obtain a significant and negative co-efficient of 0333 (0154) (standard deviationin parentheses) When we match zip code in-come on demographics of the individuals thecoefficient is effectively zero 0036 (0047)and when we use a socioeconomic index ofthe locality in which the individual lives thecoefficient is positive 0127 (0053)33 Thus

29 The estimated marginal effect (z-statistic in parenthe-ses) is 0352 (1376) 16 104 (1481) and 0154(255) for (pidi) d i and (xi) respectively

30 The level is a normalization The sign is estimatedHad the sign on this regressor been positive this would haveimplied a rejection of the model

31 For the implied median level of risk aversion the re-stricted model produces a similar estimate to the estimate wereport below for the benchmark model However since someindividuals who chose the low deductible are estimated to havevery low claim rates the restricted model is ldquoforcedrdquo to estimatevery high risk aversion for these individuals (in contrast thebenchmark model can explain such choices by positive risk re-siduals) resulting in very high dispersion and (due to the lognor-mality assumption) very high average risk aversion which isabout 1025 higher than the benchmark estimates we report below

32 This nonmonotone pattern may explain why age enterswith different signs in the estimation results of Donkers et al(2001) and Hartog et al (2002) A somewhat similar U-shapepattern with respect to age is also reported by Sumit Agarwalet al (2006) in the context of consumer credit markets

33 The full results from these specifications are providedin the online Appendix (available at httpwwwe-aerorgdatajune0720050644_apppdf ) The reason we do not usethese income variables in the benchmark specification istheir imperfect coverage which would require us to omitalmost 20 percent of the individuals Other results fromthese regressions are similar to those we report for thebenchmark specification

761VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

overall we interpret our findings as suggestiveof a nonnegative and perhaps positive associ-ation between incomewealth and absolute riskaversion

At first glance these results may appear tobe inconsistent with the widely held beliefthat absolute risk aversion declines withwealth One should distinguish however be-tween two questions (a) whether for a givenindividual the vNM utility function exhibitsdecreasing absolute risk aversion and (b)how risk preferences vary across individuals

Our results do not speak to the first questionand should not be thought of as a test of thedecreasing absolute risk aversion propertyTesting this property would require observingthe same individual making multiple choicesat different wealth levels Rather our resultsindicate that individuals with greater wealthhave utility functions that involve a greaterdegree of risk aversion It might be thatwealth is endogenous and that risk aversion(or unobserved individual characteristics thatare correlated with it) leads individuals to

TABLE 4mdashTHE BENCHMARK MODEL

Variable Ln() equation Ln(r) equation Additional quantities

Demographics Constant 15406 (00073) 118118 (01032) Var-covar matrix ()

Age 00001 (00026) 00623 (00213) 01498 (00097)Age2 624 106 (263 105) 644 104 (211 104) r 31515 (00773)Female 00006 (00086) 02049 (00643) 08391 (00265)Family Single Omitted Omitted

Married 00198 (00115) 01927 (00974) Unconditional statisticsa

Divorced 00396 (00155) 01754 (01495) Mean 02196 (00013)Widower 00135 (00281) 01320 (02288) Median 02174 (00017)Other (NA) 00557 (00968) 04599 (07397) Std Dev 00483 (00019)

Education Elementary 00194 (00333) 01283 (02156) Mean r 00019 (00002)High school Omitted Omitted Median r 727 106 (756 107)Technical 00017 (00189) 02306 (01341) Std Dev r 00197 (00015)College 00277 (00124) 02177 (00840) Corr(r ) 02067 (00085)Other (NA) 00029 (00107) 00128 (00819)

Emigrant 00030 (00090) 00001 (00651) Obs 105800

Car attributes Log(value) 00794 (00177) 07244 (01272)Car age 00053 (00023) 00411 (00176)Commercial car 00719 (00187) 00313 (01239)Log(engine size) 01299 (00235) 03195 (01847)

Driving License years 00015 (00017) 00157 (00137)License years2 183 105 (351 105) 148 104 (254 104)Good driver 00635 (00112) 00317 (00822)Any driver 00360 (00105) 03000 (00722)Secondary car 00415 (00141) 01209 (00875)Business use 00614 (00134) 03790 (01124)History length 00012 (00052) 03092 (00518)Claims history 01295 (00154) 00459 (01670)

Young driver Young driver 00525 (00253) 02499 (02290)Gender Male Omitted mdash

Female 00355 (00061) mdashAge 17ndash19 Omitted mdash

19ndash21 00387 (00121) mdash21ndash24 00445 (00124) mdash24 00114 (00119) mdash

Experience 1 Omitted mdash1ndash3 00059 (00104) mdash3 00762 (00121) mdash

Company year First year Omitted OmittedSecond year 00771 (00122) 14334 (00853)Third year 00857 (00137) 28459 (01191)Fourth year 01515 (00160) 38089 (01343)Fifth year 04062 (00249) 39525 (01368)

Note Standard deviations based on the draws from the posterior distribution in parentheses Significant at the 5 percent confidence levela Unconditional statistics represent implied quantities for the sample population as a whole ie integrating over the distribution of covariates in the sample (as

well as over the unobserved components)

762 THE AMERICAN ECONOMIC REVIEW JUNE 2007

save more to obtain more education or totake other actions that lead to greaterwealth34

Let us make several additional observationsFirst while owners of more expensive cars ap-pear to have both higher risk exposure andhigher levels of risk aversion owners of biggercars have higher risk exposure but lower levelsof risk aversion This should indicate that thestructure of the model does not constrain therelationship between the coefficients in the twoequations Rather it is the data that ldquospeak uprdquoSecond individuals who are classified by theinsurer as ldquogood driversrdquo indeed have lowerrisk but also appear to have lower risk aversionThis result is somewhat similar to the positivecorrelation between unobserved risk and unob-served risk aversion which we report belowThird policyholders who tend to use the car forbusiness are less risk averse This could bebecause uninsured costs of accidents occurringto such policyholders are tax deductible Fourthpolicyholders who reported three full years ofclaims history are more risk averse but are notdifferent in their risk exposure The attitude thatleads such policyholders to comply with therequest to (voluntarily) report three full years ofclaims history is apparently and not surpris-ingly correlated with higher levels of risk aver-sion In contrast while past claims indicate highrisk they have no significant relationship withrisk aversion Finally we find a strong trendtoward lower levels of risk aversion over timeThis is a replication of the probit results dis-cussed earlier

The Effect of Individual Characteristics onClaim RiskmdashThe first column of Table 4 de-scribes the relationship between observablesand risk exposure Accident risk is higher fordivorced individuals and lower for people witha college education Bigger older more expen-sive and noncommercial cars are all morelikely to be involved in an accident Drivingexperience reduces accident rates as do mea-sures of less intense use of the car while youngdrivers are more exposed to risk Claim propen-sity is highly correlated over time the voluntaryreport of past claims is a strong predictor offuture claims This risk equation produces re-sults that are similar to those of the simplerPoisson regression reported in Table 3 Al-though some of the coefficients lose signifi-cance the magnitude of most coefficients isquite similar The similarity between these twosets of results is to be expected as the riskregression is identified primarily from the dataon claims so incorporating the information ondeductible choice does not qualitatively changethe conceptual identification strategy (see Sec-tion IIC) If the results were not similar thiscould have indicated a misspecification of themodel The slight differences between the riskregressions in Table 3 and Table 4 are drivenmainly by the structural assumptions First thebenchmark model estimates a lognormal mix-ture of Poisson rates rather than a single Pois-son model By incorporating the fatter tails ofthe claim distribution it slightly changes theresults increases the standard errors and de-creases the average predicted claim rate Sec-ond the information on deductible choiceslightly helps us in obtaining more precise es-timates through the correlation structure be-tween the error terms in the two equations

The Implied Level of Risk AversionmdashOne ofthe main goals of the estimation is to obtainmeasures of the level of risk aversion in thepopulation we study Since we use Gibbs sam-pling and augment the latent coefficients ofabsolute risk aversion we can directly obtainthe posterior distribution of various moments ofthis distribution At each iteration of the Gibbssampler we compute the mean standard devi-ation (across individuals) and various percen-tiles of the simulated draws of i and ri and thecorrelation between them The far-right columnof Table 4 reports the averages and standard

34 One may be tempted to interpret the positive wealtheffects as an indirect indication of credit constraints wealth-ier individuals are less credit constrained and therefore canafford to purchase more insurance We do not share thisinterpretation for two reasons First the insurance companyobserves these proxies for wealth and conditions on themwhen setting prices Since the willingness to pay for insur-ance is likely to be correlated with the willingness to pay forthe additional insurance provided by the low deductibleoption premiums already reflect this variation We condi-tion on the level of the premium Second paying less exante implies paying more ex post so applying the creditconstraint argument only to the ex ante payment but not tothe probabilistic ex post deductible payments has no theo-retical foundation Essentially the setup of the model linksthe ex ante decisions to the ex post losses which are bothdriven by the curvature of the vNM utility function

763VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

deviations of these computed quantities over theiterations of the Gibbs sampler35 The impliedrisk aversion of the mean individual is 00019which is about seven times greater than theback-of-the-envelope calculation presented inthe previous section As we assume a lognormal

distribution and estimate a relatively high dis-persion coefficient r the estimates also implysignificantly lower values for the median levelof risk aversion As shown later the qualitativepattern of these results is quite robust acrossspecifications

Table 5 summarizes our findings regardingthe level and dispersion of risk aversion Itpresents two ways to interpret the estimates weobtained from the benchmark model as well ascomparisons to specifications with constant ab-solute risk aversion (CARA) utility and with

35 Note that these estimated quantities are unconditionalIn computing these quantities we integrate over the distri-bution of observable characteristics in the data so onecannot compute these estimates from the estimated param-eter directly

TABLE 5mdashRISK-AVERSION ESTIMATES

Specificationa Absolute risk aversionb Interpretationc Relative risk aversiond

Back-of-the-envelope 10 103 9070 1484Benchmark model

Mean individual 67 103 5605 972225th percentile 23 106 9998 003Median individual 26 105 9974 03775th percentile 29 104 9714 42790th percentile 27 103 7834 390295th percentile 99 103 4937 14327

CARA utilityMean individual 31 103 7651 4436Median individual 34 105 9966 050

Learning modelMean individual 42 103 6886 6140Median individual 56 106 9995 008

Comparable estimatesGertner (1993) 31 104 9699 479Metrick (1995) 66 105 9934 102Holt and Laury (2002)e 32 102 2096 86575Sydnor (2006) 20 103 8329 5395

a The table summarizes the results with respect to the level of risk aversion ldquoBack-of-the-enveloperdquo refers to thecalculation we report in the beginning of Section III ldquobenchmark modelrdquo refers to the results from the benchmark model(Table 4) ldquoCARA utilityrdquo refers to a specification of a CARA utility function and ldquolearning modelrdquo refers to a specificationin which individuals do not know their risk types perfectly (see Section IIID) The last four rows are the closest comparableresults available in the literature

b The second column presents the point estimates for the coefficient of absolute risk aversion converted to $US1 unitsFor the comparable estimates this is given by their estimate of a representative CARA utility maximizer For all otherspecifications this is given by computing the unconditional mean and median in the population using the figures we reportin the mean and median columns of risk aversion in Table 6 (multiplied by 352 the average exchange rate during the periodto convert to US dollars)

c To interpret the absolute risk aversion estimates (ARA) we translate them into x u(w) 1frasl2 u(w 100) 1frasl2 u(w x) That is we report x such that an individual with the estimated ARA is indifferent about participating in a 50ndash50lottery of gaining 100 US dollars and losing x US dollars Note that since our estimate is of absolute risk aversion the quantityx is independent of w To be consistent with the specification we use a quadratic utility function for the back-of-the-envelopebenchmark and learning models and use a CARA utility function for the others

d The last column attempts to translate the ARA estimates into relative risk aversion We follow the literature and do soby multiplying the ARA estimate by average annual income We use the average annual income (after tax) in Israel in 1995(51168 NIS from the Israeli Census) for all our specifications and we use average disposable income in the United Statesin 1987 ($15437) for Gertner (1993) and Metrick (1995) For Holt and Laury (2002) and Sydnor (2006) we use a similarfigure for 2002 ($26974)

e Holt and Laury (2002) do not report a comparable estimate The estimate we provide above is based on estimating aCARA utility model for the 18 subjects in their experiment who participated in the ldquo90rdquo treatment which involved stakescomparable to our setting For these subjects we assume a CARA utility function and a lognormal distribution of theircoefficient of absolute risk aversion The table reports the point estimate of the mean from this distribution

764 THE AMERICAN ECONOMIC REVIEW JUNE 2007

incomplete-information (discussed later) and toother comparable figures in the literature Ourbenchmark estimate suggests that an averagequadratic utility maximizer36 will be indifferentabout participating in a 50ndash50 lottery in whichhe gains $100 or loses $5605 A CARA spec-ification suggests a significantly lower value foraverage risk aversion making the mean individ-ual indifferent about a 50ndash50 lottery of gaining$100 or losing $7651 The results from theincomplete information model for the meanlevel of risk aversion are in between these esti-mates All the results suggest that although themean individual exhibits a significant level ofrisk aversion heterogeneity in risk preferencesis important and the median individual is al-most risk neutral with respect to lotteries of$100 magnitude Although the mean is alwaysgreater than the median under the lognormaldistribution the large difference we find is notimposed In principle we could have obtained ahigh level of risk aversion with less heteroge-neity thereby leading to a smaller differencebetween the mean and the median (the esti-mated distribution of risk types is an example)

Let us briefly discuss the relevance of thecomparison to Gertner (1993) and Metrick(1995) There are two ways one can reconcilethe differences between the estimates Firstboth these papers measure risk aversion fortelevision game show participants these arehighly selected groups in a rather ldquorisk-friendlyrdquo environment37 Second the magni-tudes of the stakes are higher The showparticipants bet on several thousand dollars andmore while our average individual risks muchlower stakes in the range of $100 Thus thedifference in the results may be due to the issuesraised in Rabin (2000) regarding the compara-bility of behavior across different contexts andbet sizes We discuss this further in Section IV

A different way to quantify our estimate is byreporting them in relative terms There is noconsensus in the literature as to the relevantwealth that is taken into account in such deci-

sions Therefore for comparability we followthe closest papers in this respect (eg Gertner1993) and use annual income as the relevantwealth We multiply the estimated coefficient ofabsolute risk aversion by the average annualincome in Israel during the observation periodUnder the (questionable) assumption that an-nual income is a good proxy for the relevantwealth at the time of decision making thisproduct would be a proxy for the coefficient ofrelative risk aversion As Table 5 indicates ourbenchmark specification results in an impliedcoefficient of relative risk aversion of about 97A CARA specification results in a lower coef-ficient of 44 On the other hand the medianestimate for relative risk aversion is well belowone Thus the widely used estimate of a lowsingle-digit coefficient of relative risk aversionfalls somewhere between the median and themean and between the median and the seventy-fifth percentile of the risk aversion distribution

The Relationship between Unobserved Riskand Unobserved Risk AversionmdashTable 4 al-lows us to make observations about the relation-ship between risk and risk aversion We firstdiscuss the relative importance of unobservedheterogeneity of both dimensions In the popu-lation we study unobserved heterogeneity inrisk aversion (r) is much greater than unob-served heterogeneity in risk () This is trueboth in absolute terms (315 compared to 015respectively) and after normalizing by the cor-responding mean level38 using the coefficientof variation as a measure of dispersion (027compared to 01 respectively) It is also true forthe overall unconditional dispersion This couldindicate that selection on risk aversion is moreimportant in our data than adverse selection39 Theright metric to use for such statements is not

36 For such an individual the second-order Taylor ex-pansion we use in Section IIA is exact

37 We suspect that individuals who participate in televi-sion game shows are more adventuresome than the generalpopulation Moreover knowing that the audience mightwish to see them keep betting is likely to further encourageparticipants to take risks

38 All covariates are measured in deviations from theirsample mean so the estimated constant in each equation isthe estimated mean of the left-hand-side variable

39 An additional observation is that given our estimatesobservables account for slightly less than 50 percent of thevariation in ln(ri) but for almost 65 percent of the variation inln(i ) This may seem surprising given the finding that disper-sion in risk aversion is more important and thus should be thefocus of insurance companies However this finding is con-sistent with the conventional wisdom that insurance companiesspend much effort and resources on collecting information thathelps in risk classification but only little effort on informationthat predicts willingness to pay

765VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

entirely clear however as one should projectthese estimated variances onto the same scaleof say willingness to pay or profits Profits forexample are affected directly by risk but not byrisk aversion so the comparison above could bemisleading Therefore we relegate the discus-sion of this issue to Section IIIF where weshow that even when we look at pricing andprofits heterogeneity in risk aversion is moreimportant

Table 4 also indicates a strong and signif-icant positive correlation of 084 between un-observed risk aversion and unobserved riskThis result may be surprising because it isnatural to think that risk aversion with respectto financial decisions is likely to be associatedwith a greater tendency to take precautionsand therefore with lower risk Indeed a recentpaper by Finkelstein and McGarry (2006)supports such intuition by documenting anegative correlation between risk aversionand risk in the market for long-term careinsurance (see also Israel 2005 for the autoinsurance market in Illinois) Our marketmight however be special in ways that couldproduce a positive correlation First in con-trast to most insurance markets where apolicyholderrsquos risk depends on the policy-holderrsquos precautions but not on the precau-tions of others accident risk in the autoinsurance market is a result of an interactionbetween onersquos driving habits and those ofother drivers Second the correlation coeffi-cient may be highly sensitive to the particularway we measure risk and risk aversion Thereare many unobserved omitted factors that arelikely to be related to both dimensions Theintensity of vehicle use for example mightbe an important determinant of risk If indi-viduals who are more risk averse also drivemore miles per year a positive correlationbetween risk and risk aversion could emergeThus our results caution against assumingthat risk and risk aversion are always nega-tively correlated Whether this is the case maydepend on the characteristics of the particularmarket one studies and on the particular mea-sure for risk Indeed one can use estimatedannual mileage to control for one omittedvariable that may potentially work to producea positive correlation between risk aversionand risk Despite its partial coverage in thedata and being considered (by the company)

as unreliable40 controlling for annual mileagereported by policyholders reduces the esti-mated correlation coefficient to 06841 Weview this result as consistent with the possi-bility that underlying unobserved factors thataffect risk play an important role in generat-ing the estimated positive correlation betweenrisk and risk aversion A third explanation forthe positive estimated correlation is the dis-tributional assumption As discussed in Sec-tion IIC the correlation coefficient isprobably the parameter that is most sensitiveto these assumptions Indeed as discussedlater when we change our assumptions aboutthe Poisson process and use an alternativedistribution with extremely thin tails the es-timated correlation coefficient reverses signsWe of course view the assumptions of thebenchmark model as more appropriate andtherefore maintain the view that the data sug-gest a positive correlation Finally note thatwhile the correlation parameter we estimate ishigh the implied unconditional correlationbetween risk and risk aversion is less than025 across all reported specifications This isbecause the coefficients on the same covariate(for example the size of the car or whetherthe car is used for business) often affect riskand risk aversion in opposite directions andbecause of the logarithmic transformation

C Stability of the Risk Aversion Coefficients

Estimating risk preferences is motivated bythe idea that the same (or similar) risk aversionparameter may explain risky decisions acrossmultiple contexts This idea is at the heart of thecurrent debate we mention in the introductionregarding the empirical relevance of expectedutility and whether the standard construct of a

40 Insurance companies typically do not use these self-reported mileage estimates as they are considered unreli-able While companies could verify these estimates at thetime of a claim such reports are hard to enforce Anindividual can always claim that her ex ante estimate waslower than it turned out to be Indeed the estimated elas-ticity of risk with respect to estimated mileage is 006(0012) which seems low suggesting a bias downwardpotentially due to ldquoerrors in variablesrdquo bias

41 The full results from this specification are providedin the online Appendix Other results from this specifi-cation are similar to those we report for the benchmarkspecification

766 THE AMERICAN ECONOMIC REVIEW JUNE 2007

single risk aversion parameter that appliesacross many types of decisions is the right wayto think about consumer behavior To the bestof our knowledge there is no direct empiricalevidence on this issue While we defer much ofthis discussion to the concluding section weverify below that at least over a limited set ofchoices in which we observe the same individ-uals the estimated risk preferences help to ex-plain choices over time and across differentcontexts

Stability across ContextsmdashIdeally onewould like to show that the estimated risk pref-erences are stable across multiple lines of insur-ance and perhaps even across other riskydecisions individuals face Absent data on suchchoices we provide evidence that the estimatedrisk aversion coefficients help in predictingother related risky choices Individuals in ourdata had to make three additional coveragechoices in addition to the main policy choicethat we have analyzed so far They had tochoose whether to purchase coverage for caraudio equipment (bought by 498 percent of thesample of new policyholders) for towing ser-vices (bought by 929 percent) and for wind-shield damage (bought by 954 percent) Thesetypes of coverage are sold by the same companybut are administered by a third party so we donot have information about their realizationTherefore we cannot perform a similar exerciseto the one we perform for the deductible choiceOf course one should not expect these addi-tional coverage choices to be perfectly corre-lated with the deductible choice Even if thesame risk preferences are an important factorfor all of these decisions variation in risk acrosscoverage is also important For example own-ership of an expensive car audio system is likelyto affect the purchase of audio coverage inde-pendent of risk preferences Similarly ownersof old cars may value more coverage for towingservices Risk preferences however should en-ter into all of these coverage choices and there-fore we expect these additional types ofcoverages to be positively correlated with thechoice of a low deductible

We coded these three additional coveragechoices as three dummy variables and use themas additional covariates to verify that they canhelp explain unobserved risk aversion Firstwe add these variables to the probit regression

reported in column 3 of Table 3 They all havea positive statistically and economicallysignificant power in predicting the deductiblechoice42 To verify that these reported correla-tions are not driven by correlation in risks wealso estimate the benchmark model with theseadditional variables as covariates The esti-mated coefficients on the three additional cov-erage choices are positive and two of the threeare significant in both the risk and risk aversionequations they are 0053 (0019) 00002 (0021)and 0057 (0023) in the risk equation and0354 (0064) 0123 (0122) and 0813 (0174)in the risk aversion equation for audio towingand windshield coverage respectively (standarddeviations in parentheses)43 This suggests thatthe estimated risk preferences help in explainingmultiple coverage choices across these relatedcontexts

Stability over TimemdashWe now provide evi-dence that the estimated risk preferences arealso stable over time All the individuals whodecide to renew their policy with the companyafter it expires are at least in principle free tochange their deductible choice However anoverwhelming majority of individuals (morethan 97 percent) do not change their deductiblechoices when they renew Even individuals whoinitially chose the rarely chosen ldquohighrdquo andldquovery highrdquo deductible levels typically do notchange their choices Therefore it is also notsurprising that estimating the model on the firstdeductible choice of individuals and the seconddeductible choice of the same individuals yieldssimilar results as shown at the end of the nextsection Of course changes in deductiblechoices do not necessarily imply changes inrisk preferences Risk may also change overtime and may drive such changes At thesame time while persistence in deductiblechoices over time is consistent with stablerisk preferences it may also be driven bymany other factors For example it is reason-able to think that individuals do not devote

42 They are estimated to increase on average the prob-ability of choosing a low deductible by 0019 (819) 0023(527) and 0041 (758) for audio towing and windshieldcoverage respectively (z stat in parentheses)

43 The full results from this specification are provided in theonline Appendix Other results from this specification aresimilar to those we report for the benchmark specification

767VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

the same amount of thought to their renewaldecision as to their initial choice leading tothe observed persistence This caveat is themain reason why we focus most of the analysisonly on new policies

D Robustness

This section discusses the robustness of themain results Results from all specifications wemention in this section are summarized in Table6 The full results from these specifications arerelegated to the online Appendix

The von NeumannndashMorgenstern Utility Func-tionmdashTo derive the benchmark model we as-sume a negligible third derivative of the vNMutility function This is attractive as it allowsus to summarize risk preferences by a one-dimensional parameter r [u13(w)u(w)]There is however a large literature (eg MilesS Kimball and N Gregory Mankiw 1989) em-phasizing the importance of a (positive) third

derivative of the utility function which leads toprecautionary saving It is therefore importantto address the sensitivity of the results to thisassumption

By allowing a third-order Taylor approxima-tion of equation (4) we obtain

(15) r u13w

uw

p

d 1

d

uw

uw

dh2 dh dl dl

2

6d

which reduces to equation (7) when u(w) 0This illustrates that a positive third derivativeprovides an additional (precautionary) incentiveto insure It also shows that in order to fullydescribe preferences in the presence of a thirdderivative a one-dimensional parameter doesnot suffice one needs to know both r [u13(w)u(w)] and [u(w)u(w)] This makesit less attractive for estimation

TABLE 6mdashROBUSTNESS

Specificationa Sample Obs

Claim risk () Absolute risk aversion (r)

Corr(r ) Mean Median Std dev Mean Median Std dev

Baseline estimatesBenchmark model All new customers 105800 0220 0217 0048 19 103 73 106 0020 0207 0839

The vNM utility functionCARA utility All new customers 105800 0219 0217 0048 87 104 b 98 106 b 0009b 0201 0826

The claim generating processBenchmark model No multiple claims 103260 0182 0180 0040 20 103 28 105 0018 0135 0547Thinner-tail risk distribution All new customers 105800 0205c 0171c 0155c 17 103 19 106 0020 0076 0916

The distribution of risk aversionLower-bound procedure All new customers 105800 mdash mdash mdash 37 104 0 0002 mdash mdash

Incomplete information about riskBenchmark model Experienced drivers 82966 0214 0211 0051 21 103 83 106 0021 0200 0761Benchmark model Inexperienced drivers 22834 0230 0220 0073 30 103 12 107 0032 0186 0572Learning model All new customers 105800 0204 0191 0084 12 103 16 106 0016 0200 0772

Sample SelectionBenchmark model First two years 45739 0244 0235 0066 31 103 26 105 0026 0225 0699Benchmark model Last three years 60061 0203 0201 0043 16 103 34 107 0021 0113 0611Benchmark model Referred by a friend 26434 0213 0205 0065 30 103 84 107 0031 0155 0480Benchmark model Referred by advertising 79366 0219 0216 0051 21 103 76 106 0022 0212 0806Benchmark model Non-stayers 48387 0226 0240 0057 23 103 77 107 0026 0149 0848Benchmark model Stayers 1st choice 57413 0190 0182 0057 29 103 29 105 0024 0152 0463Benchmark model Stayers 2nd choice 57413 0208 0200 0065 30 103 16 105 0026 0211 0637

Note The table presents the key figures from various specifications and subsamples tracing the order they are presented in Section IIID Full results (in the format

of Table 4) from all these specifications are available in the online Appendixa ldquoBenchmark modelrdquo refers to the benchmark specification estimated on various subsamples (the first row replicates the estimates from Table 4) The other

specifications are slightly different and are all described in more detail in the corresponding parts of Section IIIDb The interpretation of r in the CARA model takes a slightly different quantitative meaning when applied to noninfinitesimal lotteries (such as the approximately

$100 stakes we analyze) This is due to the positive third derivative of the CARA utility function compared to the benchmark model in which we assume a small

third derivative Thus these numbers are not fully comparable to the corresponding figures in the other specificationsc The interpretation of in the thinner-tail distribution we estimate is slightly different from the standard Poisson rate which is assumed in the other specifications

Thus these numbers are not fully comparable to the corresponding figures in the other specifications

768 THE AMERICAN ECONOMIC REVIEW JUNE 2007

We can allow for a positive third derivativewithout expanding the dimensionality of pref-erences by imposing a parametric functionalform on the utility function Essentially suchparametric form imposes a relationship between[u13(w)u(w)] and [u(w)u(w)] Two standardforms are those that exhibit CARA and thosethat exhibit constant relative risk aversion(CRRA) CRRA requires us to make additionalassumptions about the relevant (and unob-served) wealth level of each individual makingit less attractive The CARA case is imple-mented below

With CARA utility we substitute u(w) exp(rw) in equation (4) and rearrange toobtain

(16) rp

exprdh exprdl

This equation defines the indifference set Unlikethe benchmark model there is no closed-formrepresentation for ri() a property that makesestimation significantly slower Due to the precau-tionary saving incentive arising from the thirdderivative we can use a lower level of absoluterisk aversion to rationalize the low deductiblechoice given In other words the CARA indif-ference set is flatter in comparison to the bench-mark case depicted in Figure 2 Thus the CARAspecification will generally lead to lower estimatesof the coefficient of absolute risk aversion asshown in Table 6 The mean and dispersion ofestimated absolute risk aversion (but not the me-dian) are smaller by about a factor of two Thegeneral qualitative results however remain thesame As shown in Table 5 the mean individual isquite risk averse with respect to lotteries of $100magnitude the median is close to risk neutral withrespect to such lotteries and heterogeneity in riskaversion is important

The Claim-Generating ProcessmdashIn thebenchmark model we assume that claims aregenerated by a Poisson model This assumptionis both analytically attractive (it is important inderiving equation (4)) and in our view a rea-sonable approximation of the data-generatingprocess for claims The Poisson distribution hasbeen widely used in economics to model acci-dent counts (eg Nancy L Rose 1990 Dionneand Vanasse 1992 Ronald Michener and CarlaTighe 1992) In using this distribution re-

searchers have followed the long history of theuse of Poisson by actuaries and insurance spe-cialists going back to Ladislaus Bortkiewicz(1898)

An important restriction of the Poisson dis-tribution is that its mean and variance are thesame Although some economic studies con-firmed this assumption for particular accidentdata (eg Christopher M Auld et al 2001) it isoften the case that this restriction is falsifiedThe most common deviation from the Poissonrestriction is that of fat tails ie variance that ishigher than the mean This led researchers touse a negative binomial distribution to accom-modate this regularity by introducing a secondparameter which delinks the relationship be-tween the mean and the variance One naturalinterpretation of the fat tails is that of unob-served heterogeneity and the negative binomialdistribution can be viewed as a Gamma mixtureof Poisson processes Consistent with this viewwe assume that the claim-generating processfollows a Poisson process at the individuallevel but allows unobserved heterogeneity inrisk and estimates a lognormal mixture of Pois-son processes which is similar to a negativebinomial The dispersion parameter we esti-mate is a free parameter which is identi-fied by the fatness of the tails of the claimdistribution

While one would like to allow fat tails of theaggregate claim distribution one may criticizethe assumption of a Poisson distribution at theindividual level for having tails that may be ldquotoofatrdquo Recently Jaap H Abbring et al (2003) andMark Israel (2004) provided evidence for neg-ative state dependence in data from auto insur-ance claims similar to the ones we useControlling for heterogeneity across individu-als these papers show that a second claim isless likely than a first This may happen due toexperience rating or perhaps to more carefuldriving and less intense use of a car after anaccident The Poisson distribution assumes thatthe second accident is just as likely so negativestate dependence may suggest thinner tails atthe individual level

To verify that the main results are not sensi-tive to this restriction of the Poisson model weperform two tests First we estimate the bench-mark model on a sample that includes onlyindividuals with one or no claims The model isstill identified using variation in policy durations

769VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

The estimates of the level of risk aversion andits dispersion across individuals remain similarto those of the benchmark model (Table 6) Theestimates of risk and the correlation with riskaversion are significantly lower But this isexpected we selected out of the sample thehigh-risk individuals

As a further test for the sensitivity of theresults to possible negative state dependencewe estimated the model with a different distri-bution at the individual level which gives riseto much thinner tails We could not find anyknown distribution of count variables that givesrise to tails thinner than Poissonrsquos so we madeone up In particular for a given Poisson ratelet pn be the probability of observing n occur-rences The distribution we take to the data issuch that

(17) Prn pn2

yenm0 pm2

Thus this distribution makes the probability ofmultiple claims much lower probably muchmore so than any realistic negative state depen-dence in the data Essentially such a distribu-tion makes the econometrician view individualswith multiple claims as high-risk individualswith much more certainty as it requires muchmore ldquobad luckrdquo for low-risk individuals tohave multiple claims The estimates for thelevel of risk aversion and its dispersion arealmost the same as in the benchmark model(Table 6) The rest of the summary figures aredifferent The difference in the level of risk isnot informative the interpretation of isslightly different given the change in the un-derlying distribution The dispersion in ismuch higher This is a direct result of the thin-tail assumption The dispersion of is identifiedby the tails of the claim distribution at theaggregate level With thinner tails imposed onthe individual distribution more heterogeneityis needed to match the observed tails at theaggregate Finally the correlation coefficient inthis specification changes signs and is now neg-ative and close to 1 This is consistent withour discussion in Section IIC which emphasizesthat the identification of the correlation coeffi-cient is closely tied to the structural assump-tions Once these are changed the estimatedcorrelation coefficient would change too Most

importantly however the mean and dispersionof risk aversion are stable

The Distribution of Absolute Risk Aver-sionmdashIn the benchmark model we assume thatthe coefficient of absolute risk aversion is lognor-mally distributed across individuals Since onlyfew previous studies focused on heterogeneity inrisk preferences there is not much existing evi-dence regarding the distribution of risk prefer-ences The only evidence we are aware of is theexperimental results presented by Steffen Andersenet al (2005) which show a skewed distributionwith a fat right tail which is qualitatively consis-tent with the lognormal distribution we assume

An additional advantage of the normality as-sumption is computational It provides a closed-form conditional distribution allowing us to useonly univariate (rather than bivariate) draws inthe estimation procedure significantly reducingthe computational burden One may be con-cerned about the sensitivity of the results to thisdistributional assumption For example it maydrive the result that the median level of riskaversion is much lower than the mean

Incorporating alternative distributional as-sumptions significantly complicates and slowsthe estimation procedure44 As an alternativewe develop a procedure that we believe pro-vides some guidance as to the sensitivity of theresults to this distributional assumption Thedisadvantage of the procedure is that it cannotaccount for adverse selection Since we foundthat adverse selection is not that important thisexercise is informative The exercise conveysthat the main qualitative results are not drivenby the (fat) tails of the lognormal distributionwe impose Rather they are driven by the highfraction of individuals who chose a low deduct-ible despite being of low risk The model im-plies that such individuals must have a fairlyhigh level of risk aversion

The exercise uses a Gibbs sampler to estimatethe lognormal distribution of i given the co-variates and the observed number of claims Ineach iteration of this Gibbs sampler conditional

44 A simple discrete-type distribution of risk aversion isnot well identified with the given data This is due to thenature of the exercise The distribution is identified bythe binary deductible choice and by the ldquolocalrdquo variation inthe pricing menu This variation is not enough to pin downthe discrete types accurately

770 THE AMERICAN ECONOMIC REVIEW JUNE 2007

on the data and the most recent draws for theindividual irsquos we compute lower bounds forthe level and dispersion of risk aversion (inaddition to generating the draws for ri from theconditional lognormal distribution) To deter-mine the lower bound for the level of risk aver-sion we compute

(18) r i i 0 if choicei 0

max0 pi

idi 113d i

if choicei 1

Namely we make ri as low as possible giventhe assumptions of the model We assume thatindividuals who chose a regular deductible arerisk neutral while individuals who chose a lowdeductible are just indifferent between the twodeductible levels (unless i is high enough inwhich case they are also assumed to be riskneutral) We then compute the average over therirsquos To compute the lower bound for the dis-persion we search for values of rirsquos that areconsistent with the assumptions of the modeland the observed data and that minimize thevariance of ri This turns out to be a simplesearch procedure that is linear in the number ofindividuals

The lower bound of the level of risk aversionthat we obtain from this exercise is 368 104

NIS1 (with standard deviation of 638 106)This level is five times lower than the analogouspoint estimate reported in Table 4 This trans-lates into indifference about participating in a50ndash50 lottery of gaining $100 and losing $885Thus it still suggests a significant level of riskaversion for lotteries of $100 magnitude Simi-larly the conversion to relative risk aversion asin Table 5 implies a relative risk aversion co-efficient of 1883 The result for the lowerbound of the dispersion is 189 103 (withstandard deviation of 816 105) which is tentimes lower than the benchmark estimate Thusthe coefficient of variation for absolute riskaversion declines only by two suggesting ahigh degree of heterogeneity Finally oneshould note that these bounds involve extremeassumptions and are computed separately andtherefore cannot be binding at the same timeThus the ldquocorrectrdquo estimates for both quantities

are likely to be higher closer to the results wereport for the benchmark model

Incomplete Information about RiskmdashThroughout we assume that individuals haveperfect information about their individual-specific objective risk type i This is a strongerassumption than we need Because expectedutility is linear in probabilities it suffices thatindividualsrsquo expected risk rate is equal to theirobjective risk rate ie i E(iIi) i wherei is a random variable representing individualirsquos perceived risk rate and Ii is individual irsquosinformation at the time of the coverage choiceNamely individuals could be uncertain abouttheir risk type but their point estimate has to becorrect

There are several channels through whichincomplete information may operate Let usconsider two such cases First suppose thatindividuals are correct but only on average iethat i i i where E(i) 0 The intuitionfor this case is similar to an ldquoerrors in variablesrdquomodel and in principle will result in an evenless important role for adverse selection Giventhat we find a relatively small effect of adverseselection this bias will not change this conclu-sion This may be even more pronounced ifcorr(i i) 0 which reflects a reasonableassumption of ldquoreversion to the meanrdquo ie thatan individualrsquos estimate of his risk type is someweighted average between his true risk type andthe average risk type of individuals who aresimilar (on observables) to him The conclusionmay go in the other way only if the mistakes goin the other direction according to which indi-viduals who are riskier than average believe thatthey are even more risky than they truly areThis we believe is less plausible

As a test for the sensitivity of the results tothe complete information assumption we per-form two exercises The first exercise estimatesthe benchmark model separately for experi-enced drivers and inexperienced drivers wherewe define an individual to be an experienceddriver if he has ten years or more driving expe-rience This is a conservative definition com-pared to those used in the literature (Chiapporiand Salanie 2000 Cohen 2005) The underlyingassumption consistent with evidence providedin Cohen (2005) is that drivers learn about theirown risk types as they gain more driving expe-rience but that most of this learning occurs

771VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

within the first ten years of driving Thus ourcomplete information model is a more appeal-ing description of experienced drivers Theresults for experienced drivers are almost iden-tical to the results from the benchmark specifi-cation (Table 6) suggesting that the mainresults are not driven by the fraction of individ-uals who are less likely to know their risk typesThe results for inexperienced drivers show asimilar pattern and magnitude Consistent withthe learning hypothesis however they show alarger dispersion in risk preferences which maybe due to incomplete information and there-fore to more random choices of deductiblelevels

The second exercise we perform is a morestructural version of the learning story whichallows even experienced drivers to have incom-plete information regarding their risk types Wedo so by estimating a different specification ofthe model that assumes that individuals areBayesian and update their information abouttheir own (stable) risk types over time usingonly information about the number of claimsthey make each year While we do not observecomplete claims histories of individuals wecan simulate such histories and integrate overthese simulations Thus individualsrsquo informa-tion would be related to their true type andwould be more precise with longer driving his-tories The full modeling and estimation detailsof this specification are provided in the Appen-dix We view this model as an extreme versionof incomplete information as there are manyother sources through which individuals maylearn about their own types and thereby havebetter information about their types than whatwe estimate them to have While the resultsimply that the levels of risk aversion and heter-ogeneity are lower than the benchmark esti-mates (Table 6) the order of magnitude andqualitative pattern are quite similar suggestingthat the main qualitative findings are robust tothe information structure

Sample SelectionmdashThere are two differentways to think about the contract selection pro-cess One possibility is that individuals firstselect an insurer based on advertisement wordof mouth or access to an agent Then individ-uals select the insurance contract from amongseveral contracts the selected insurer offersThis selection process is consistent with the way

the company executives view their business45

Another possibility is that individuals first col-lect information about all available insurancecontracts and then choose their most preferredone According to the industryrsquos conventionalwisdom individuals do not shop much acrossinsurers and therefore this selection processseems less important

The unique characteristics of direct insurersmay attract individuals who are more likely toexperiment with new ways of doing businessand may therefore be less risk averse than thegeneral population In Table 7 we compare thedemographics of our sample of policyholderswith those of the general Israeli populationThis comparison reflects a similar intuitioncompared with the general population our av-erage policyholder is slightly younger moreeducated more likely to be male and less likelyto be married or an immigrant This direction ofselection may also apply to unobserved riskpreferences thereby making our policyholderson average less risk averse than a representa-tive individual This suggests that the level ofrisk aversion that we find may be viewed as alower bound on the level of risk aversion in thegeneral Israeli population

One potential way to model sample selectionis to allow for an additional outside option to beselected For the vast majority of individuals weobserve the outside option is to purchase sim-ilar insurance from competing insurance agen-cies Unfortunately data on the structure ofcompeting contracts their prices and the waythey vary with individual characteristics are un-available This makes us uncomfortable to try tomodel sample selection as results from anysuch model will be driven by our assumptionsrather than by meaningful variation in the dataThe results are still meaningful for two reasonsFirst this is a large population accounting forabout 7 percent of all drivers in Israel Secondto the extent that our estimates suggest higherlevels of risk aversion than previously esti-mated and that the sample selection is likely to

45 As discussed in Section I the company was the firstdirect insurance provider and it offered significantly lowerpremiums than those offered by competitors due to signif-icant cost advantage In Section I we also discuss theliterature which emphasizes that choice of direct insurers isdriven primarily by nonmonetary ldquoamenitiesrdquo

772 THE AMERICAN ECONOMIC REVIEW JUNE 2007

bias these estimates downward the results arestill informative

To assess the magnitude of sample selectionin driving the main results we perform severaltests First we estimate the benchmark modelseparately for the first two years of the compa-nyrsquos operation during which the company wasthe only company to sell insurance directly andfor the remaining three years after additionaldirect insurers entered the market Second weestimate the benchmark model separately forindividuals who were referred to the companyby word of mouth and for those who heardabout the company through advertisement (pri-marily on television) The latter may havesearched more and might be more price sensi-tive Third we estimate the benchmark modelseparately for individuals who renewed theirpolicy with the company and for those who didnot renew It seems likely that individuals whodid not renew are less selected as most of themswitch back to regular insurers who insure themajority of the population Fourth we estimatethe model for the second deductible choicemade by those individuals who renew It couldbe argued that switching to other companies is

costly so outside options are not as important indriving deductible choices for those who renewAll these results are summarized in Table 6 andshow some important differences between thetwo groups within each pair of subsamples prob-ably reflecting selection For all groups howeverthe qualitative pattern of the results and the orderof magnitude and dispersion of risk aversion aresimilar to those of the full sample This is sugges-tive that correcting for sample selection is unlikelyto change the qualitative results

E Caveats

Moral HazardmdashThroughout our analysiswe abstract from moral hazard ie we assumethat i can vary across individuals but is invari-ant to the coverage choice There are two typesof moral hazard that may play a role in thiscontext First individuals with less coveragemay take greater precaution and drive morecarefully thereby reducing their claim risk rateSecond conditional on a claim event peoplewith higher deductibles are less likely to file aclaim there exists a range of claims for which

TABLE 7mdashREPRESENTATIVENESS

Variable Sampleb Populationc Car ownersd

Agea 4114 (1237) 4255 (1801) 4511 (1413)Female 0316 0518 0367Family Single 0143 0233 0067

Married 0780 0651 0838Divorced 0057 0043 0043Widower 0020 0074 0052

Education Elementary 0029 0329 0266High school 0433 0384 0334Technical 0100 0131 0165College 0438 0155 0234

Emigrant 0335 0445 0447

Obs 105800 723615 255435

a For the age variable the only continuous variable in the table we provide both the mean and the standard deviation (inparentheses)

b The figures are derived from Table 1 The family and education variables are renormalized so they add up to one weignore those individuals for whom we do not have family status or education level This is particularly relevant for theeducation variable which is not available for about half of the sample it seems likely that unreported education levels arenot random but associated with lower levels of education This may help in explaining at least some of the gap in educationlevels across the columns

c This column is based on a random sample of the Israeli population as of 1995 We use only adult population ieindividuals who are 18 years of age or older

d This column is based on a subsample of the population sample The data provide information about car ownership onlyat the household level not at the individual level Thus we define an individual as a car owner if (a) the household owns atleast one car and the individual is the head of the household or (b) the household owns at least two cars and the individualis the spouse of the head of the household

773VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

filing is profitable only under a low deductible(the literature often refers to this second effectas ldquoex post moral hazardrdquo)

To the extent that moral hazard exists abstract-ing from it will likely bias our estimates of riskaversion downward To see this note that adjust-ing behavior will help individuals to self-insureagainst uninsured costs Similarly ex post moralhazard would reduce the value of a low deduct-ible as in the event of a claim the gain from alower deductible would sometimes be less thanthe difference between the two deductible levelsBoth these effects will make a low deductible lessattractive requiring individuals to be even morerisk averse than we estimate in order to purchasemore coverage Below we discuss why in ourview abstracting from moral hazard is a reason-able approximation in this setting

It seems reasonable to conjecture that ceterisparibus insured individuals will drive less care-fully than uninsured ones It may also seemreasonable that the existence of a deductiblemay make individuals more careful about smalldamages to their car However when all choicesinclude a deductible and deductibles are similarin their magnitude it seems less likely thatdriving or care behavior will be affected46 Fi-nally to separately identify moral hazard onewould need another dimension of the data overwhich risk types remain fixed but coveragechoices vary exogenously (Israel 2004 Jaap HAbbring et al forthcoming)

We rely on the data to justify why we abstractfrom the second potential effect that of ex postmoral hazard Figure 3 presents data on theclaim amounts and shows that about 99 percentof the claims filed by policyholders with low-deductible policies were for amounts greaterthan the higher deductible level If the distribu-tion of amounts of potential claims does notvary with the deductible choice and if individ-uals file a claim for any loss that exceeds their

deductible level this suggests that 99 percent ofthe claims would have been filed under eitherdeductible choice making the assumption toabstract from moral hazard not very restrictive

Individuals however may choose not to filea claim even when the claim amount exceedsthe deductible level This may happen due toexperience rating which increases future insur-ance premiums These dynamic effects do notdepend on the deductible level at the time of theclaim so they simply enter in an additive wayUsing our data on individuals who renew theirpolicies with the company we can assess howbig the dynamic effects are These data showthat the price effect of a claim lasts for threeyears and is highest when an individual files hissecond claim within a year In such a case hewould face about a 20 percent increase in hisinsurance premium in the subsequent year 10percent in the year after and 5 percent in thethird year after the claim The regular premiumis about twice the regular deductible amount soan upper bound for the dynamic costs is 70percent of the regular deductible In most casesthe actual dynamic costs are much lower thanthis upper bound (the dynamic costs of say thefirst claim within a year are close to zero) Inaddition an individual can opt out of the con-tract and switch to a different insurance pro-vider This is likely to reduce his dynamic costsbecause in Israel unlike in the United Statesand many other countries there is no publicrecord of past claims Therefore insurance pro-viders can take full advantage of past recordsonly for their past customers For this reasonnew customers will of course face higher pre-miums than existing ones but the premium in-crease would not be as high as it would havebeen with the old insurance provider This isdue to the presence of ldquoinnocentrdquo new custom-ers who are pooled together with the switchers(Cohen 2003)47 Using 70 percent as a conser-vative upper bound Figure 3 shows that about93 percent of those claims filed by individualswith a low deductible would have also been46 This assumption is consistent with Cohen and Einav

(2003) who find no evidence for behavioral response tochanges in seat belt laws The following anecdotal obser-vation also supports this In an informal survey we con-ducted among our colleagues all of them were aware of adeductible in their auto insurance policy but fewer than 20percent knew its level This does not imply that 80 percentof our colleagues did not pay attention to their deductiblechoice at the time the choice was made It does implyhowever that their driving or care behavior cannot dependon the deductible level

47 New customers may voluntarily report their claimshistory to their new insurance provider Voluntary disclo-sure of past claims is as may be expected not truthful Ourdata suggest an unconditional claim rate of 02453 in oursample population Our data on claims history as voluntar-ily disclosed by the same individuals suggest a claim rate of00604 which is four times lower

774 THE AMERICAN ECONOMIC REVIEW JUNE 2007

filed with a regular deductible While this is notnegligible it applies for only a small fraction ofindividuals For the vast majority of them the99 percent discussed above is a more appropri-ate benchmark Therefore ex post moral hazardis unlikely to play a major role in this settingand one can abstract from the loss distributionand focus on claim rates as we do in thispaper48

Additional Cost of an AccidentmdashOur modelassumes that in the event of an accident theonly incurred costs are those associated with thedeductible payment In practice however other

transaction costs may be incurred such as thetime spent for appraisal of the damage the costsassociated with renting a replacement car for theduration of a repair etc Such costs could bereadily incorporated into the model To illus-trate we assume that these costs are known inadvance and are given by a constant c (whichcould in principle vary with each individual)Since c will not vary with the chosen level ofdeductible it will not affect the value d andwill enter the empirical model only through itseffect on d In particular equation (7) willchange to

(19) r

p

d 1

d c

and everything else will remain the same

48 We would not be as comfortable with this statementfor the choice of high and very high deductibles which areat much higher levels This is one additional reason to focusonly on the choice between low and regular deductibles

FIGURE 3 CLAIM DISTRIBUTIONS

Notes This figure plots kernel densities of the claim amounts estimated separately depending on the deductible choice Forease of comparison we normalize the claim amounts by the level of the regular deductible (ie the normalization is invariantto the deductible choice) and truncate the distribution at ten (the truncated part which includes a fat tail outside of the figureaccounts for about 25 percent of the distribution and is roughly similar for both deductible choices) The thick line presentsthe distribution of the claim amounts for individuals who chose a low deductible while the thin line does the same for thosewho chose a regular deductible Clearly both distributions are truncated from below at the deductible level The figure showsthat the distributions are fairly similar Assuming that the claim amount distribution is the same the area below the thickerline between 06 and 1 is the fraction of claims that would fall between the two deductible levels and therefore (absentdynamic incentives) would be filed only if a low deductible were chosen This area (between the two dotted vertical lines)amounts to 13 percent implying that the potential bias arising from restricting attention to claim rate (and abstracting fromthe claim distribution) is quite limited As we discuss in the text dynamic incentives due to experience rating may increasethe costs of filing a claim shifting the region in which the deductible choice matters to the right an upper bound to these costsis about 70 percent of the regular deductible covering an area (between the two dashed vertical lines) that integrates to morethan 7 percent Note however that these dynamic incentives are a very conservative upper bound they apply to less than 15percent of the individuals and do not account for the exit option which significantly reduces these dynamic costs

775VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

This implies that in principle such costs willhave no effect on the results of the counterfac-tual exercise we present later The costs willhowever affect the interpretation of the esti-mates of risk aversion In particular instead ofthe distribution of r we will now be estimatingthe distribution of r(d c)d so the reportedestimates of the coefficient of absolute riskaversion will be biased upward The magnitudeof the bias depends on the size of this transac-tion cost c compared to the average deductibled If c is relatively small the bias is negligibleIf however c is as big as the (average) deduct-ible level all our reported estimates of the levelof risk aversion should be divided by two (butthe coefficients on observables which are semi-elasticities will not change) The intuitionwould be similar but more involved if c variesacross individuals but not proportionally to d

Data about such transaction costs are ofcourse not available The following back-of-the-envelope exercise may provide some guidanceas to the magnitude of such costs We collecteddata from the Israeli police about the annual num-bers of accidents49 accidents with fatalities andaccidents with severe injuries in Israel for theyears 1996ndash1999 We then divided these numbersby our estimate of the total number of auto insur-ance claims in Israel (161859)50 We obtain that159 percent of the claims involve reported acci-dents 22 percent involve accidents with severeinjuries and 03 percent involve fatal accidentsThus the majority of claims reflect small unre-ported accidents perhaps suggesting that theseadditional costs of a claim are often not too large

Deviations from Expected Utility TheorymdashThroughout the paper we restrict attention toexpected utility maximizers Despite much ev-idence in the literature against some of the pre-dictions of expected utility theory it still seemsto us the most natural benchmark to specify andone that facilitates comparison to previous stud-ies We note that expected utility theory is as-sumed it is not and cannot be tested within our

framework Given our cross-sectional analysiswhich in principle allows flexible forms ofunobserved heterogeneity in risk preferencesthere are no testable restrictions imposed byexpected utility theory We should also note thatmuch (but not all) of the documented evidenceagainst expected utility theory arises with ex-treme risk probabilities which are close to zeroor one Our data (and our estimates) are basedon risk probabilities that are all in the range of010 to 035 Over this range expected utilityseems to perform better Finally it is importantto stress two points First at the conceptuallevel it is straightforward to use an alternativetheory of decisions under uncertainty If con-ditional on objective risk individuals vary in asingle dimension the same conceptual modeland empirical strategy can be applied All oneneeds to do is to specify the parameter overwhich decisions vary and construct an indiffer-ence set in the space of the specified parameterand (objective) risk types similar to the onepresented in Figure 2 Second any alternativemodel of decisions under uncertainty would re-quire us to take an even stronger view regardingthe parameterized objective function For exam-ple prospect theory (Daniel Kahneman andAmos Tversky 1979) would require us to pa-rameterize not only the curvature of individu-alsrsquo utility functions but also their referencepoints for which there is no natural choice inour context Similar issues arise if we try toapply decision weights (Tversky and PeterWakker 1995) or measures of overconfidencewith respect to driving ability

F Implications for Profits and Pricing

We now look at how firm profits vary withalternative pricing schemes This exercise isinteresting for several reasons First althoughwe do not use supply-side information for esti-mation it shows how one may incorporate suchinformation in estimation Second it illustratesthe conceptual trade-off faced by a monopolistthat operates in a market with adverse selectionAlthough the conceptual trade-off betweenhigher demand and worse selection is wellknown and has been extensively analyzed in thetheoretical literature quantifying it is importantto understand its empirical relevance Finallywe argued earlier that unobserved heterogeneityin risk aversion seems more important than un-

49 An accident is counted in this measure if it was re-ported to the police and a police officer arrived at the scene

50 This is estimated by taking the number of activepolicies at the end of our sample (45871) dividing it by ourbest guess for the share of the market the company had atthe time (7 percent) and multiplying it by the estimatedclaim rate in our data as computed in Table 2 (0245)

776 THE AMERICAN ECONOMIC REVIEW JUNE 2007

observed heterogeneity in risk The current ex-ercise shows that this conclusion also translatesto pricing and profits

Throughout this section we hold fixed thedistribution of risk and risk aversion the firmfaces Since we have little information about thedeterminants of overall demand faced by thefirm and hence how restrictive it is to hold thedistribution fixed we make the simplifying as-sumption that individuals make their choicessequentially They first choose the insuranceprovider by observing only the price of theregular deductible Once they decide to buy apolicy from the insurer they choose the deduct-ible level This seems a reasonable approxima-tion as the regular deductible is the one alwaysadvertised and initially quoted while the otheroptions are revealed only once the potentialcustomer and the insurance sales person ldquogetinto detailsrdquo Consistent with this assumptionwe assume that the regular premium and de-ductible are dictated by competitive conditionsand we focus on the choice of the low deduct-ible and its associated premium

From the companyrsquos standpoint each indi-vidual can be represented by a random draw of(i ri) from the conditional (on observables)distribution of risk and risk aversion

(20) ln i

ln ri Nxi

xi

2 r

r r2

When analyzing the optimal menu to offer suchan individual the company is assumed to berisk neutral and to maximize expected profitsSuppose the company offered only the regulardeductible and premium (dh ph) Let the ex-pected profits from this strategy be 0 Givenour assumptions we proceed by analyzing howthe firmrsquos profits are affected by a choice of anadditional low deductible option (dl pl) withdl dh and pl ph It is easy to use a changein variables and analyze the choice of d dh dl and p pl ph Expected profits arenow given by

(21) maxdp

0 Prri rii d pp

d Eiri rii d p

The trade-off in the companyrsquos decision isstraightforward Each new customer who chooses

the low combination pays an additional p up-front but saves d for each accident she is in-volved in This translates into two effects thatenter the companyrsquos decision problem The first issimilar to a standard pricing problem higher(lower) price difference (deductible difference)p (d) leads to a higher markup (on those indi-viduals who select the low deductible) but tolower quantity (or probability of purchase) asfewer individuals elect to choose the low deduct-ible This effect enters the profit function throughD(d p) Pr(ri ri(i d p)) The secondcomposition effect arises because of adverse se-lection As the price of the low deductible in-creases those individuals who still elect to choosethe low combination are ceteris paribus thosewith higher risk This effect enters through (dp) E(iri ri(i d p)) Its magnitude andsign depend on the relative heterogeneity of i andri and on the correlation between them Sinceneither D(d p) nor (d p) has a closed-form solution we analyze this decision problemgraphically where D(d p) and (d p) arenumerically computed using simulations from thejoint distribution

We illustrate our analysis by using the meanindividual in the data whose expected ln(i)and ln(ri) are 154 and 1181 respectively(based on Table 4) Such an individual has tochoose between a regular contract of ( phdh) (3189 1595) (in NIS) and a lowcontract of ( pl dl) (3381 957) ie (pd) (191 638) Below we discuss severaladditional counterfactual cases First we con-sider a case with a negative correlation be-tween risk and risk aversion (with the samemagnitude ie 084) Second we con-sider cases when the company ignores unob-served heterogeneity in one of the dimensionsie it views individuals as a draw from theestimated marginal distribution on one dimen-sion with the other dimension known and fixedat its estimated mean We do this exercise foreach dimension separately

To get intuition for the different effects Figure2 presents the estimated distribution in the spaceof (i ri) A small increase (decrease) in p (d)shifts the indifference setup to the right therebymaking some marginal individuals who were pre-viously just to the right of it switch to choosingthe regular deductible The demand trade-off isjust the comparison between the marginal loss ofthe company from all the marginal individuals

777VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

who no longer buy higher coverage with thehigher profits made from the inframarginal indi-viduals who still elect to choose higher coverageFigure 2 also helps in illustrating the effect ofadverse selection and the importance of the cor-relation coefficient As the menu shifts to the rightthe positive correlation implies that the marginalindividuals have higher risk than the average Thismeans that ldquolosingrdquo them (namely havingthem buy less coverage) is not as costly forthe insurance company as such individualsare on average more adversely selected Anegative correlation for example would havemade these marginal individuals more valuablethereby decreasing the incentive to increase pricesor deductibles from the current levels

Figure 4 presents the effects of pricing onprofits by varying the low-deductible levelkeeping the premium charged for it fixed at theobserved price (of p 191) It implies that the

current low deductible benefit of 638 NIS re-sults in additional annual profits of about 368NIS per customer This is about 037 percent oftotal operating profits per customer which isabout 1000 NIS Note however that after sub-tracting the administrative and claim-handlingcosts associated with each customer and claimthe relative magnitude of this effect will behigher Note also that the estimates imply thatthe current low-deductible level is suboptimalBy setting a smaller low-deductible benefit ofd 355 NIS (ie increasing the current lowdeductible by 283 NIS) additional profits canbe increased to 659 NIS51 Of course the find-

51 There is no reason of course to limit the choice of thecompany to only one additional deductible level Moredegrees of freedom in choosing the pricing menu will leadto higher profits

FIGURE 4 COUNTERFACTUALSmdashPROFITS

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) We plot the additional profits fromoffering a low deductible as a function of the attractiveness of a low deductible d dh dl fixing its price at the observedlevel The thick solid line presents the counterfactual profits as implied by the estimated benchmark model The other threecurves illustrate how the profits change in response to changes in the assumptions when the correlation between riskand risk aversion is negative (thin solid line) when there is no heterogeneity in risk aversion (dot-dashed line) and whenthere is no heterogeneity in risk (dashed line) The maxima (argmax) of the four curves respectively are 659 (355)714 (583) 0 and 704 (500) The dotted vertical line represents the observed level of d (638) which implies that theadditional profits from the observed low deductible are 368 NIS per policy

778 THE AMERICAN ECONOMIC REVIEW JUNE 2007

ing that the current pricing is suboptimal mayalso be due to a limitation of the model

Consistent with the intuition discussed aboveFigure 4 also shows that the incentive to in-crease prices (or lower the deductible) is higherwith positive correlation When the correlationbetween risk and risk aversion is negative theoptimal low deductible is lower It also showsthat ignoring either dimension of unobservedheterogeneity has an important effect on pric-ing decisions However while ignoring het-erogeneity in risk does not change thequalitative pattern by much ignoring hetero-geneity in risk aversion completely changesthe pricing analysis In fact given the esti-mated coefficients and the observed prices ifindividuals vary only in risk offering a lowdeductible does not increase profits due to ad-verse selection

Figure 5 breaks down the effect on profits bypresenting the pricing effect on the demand forlow deductible D(d p) and on the compo-sition effect (d p) The former is simplygenerated by the distribution of certainty equiv-alents implied by the joint distribution of i andri (Landsberger and Meilijson 1999) It is S-shaped due to the assumption of lognormal dis-tribution The shape of the composition effect isdriven by the relative variance of i and ri andby the correlation coefficient As the estimatesimply that most of the variation in certaintyequivalents is driven by variation in ri thestrong positive correlation implies that the com-position effect is monotonically decreasing inthe deductible level As the low-deductible op-tion becomes more favorable more peoplechoose it with the most risky individuals choos-ing it first The effect of the deductible level onthe composition effect is dramatically differentwhen the correlation between risk and risk aver-sion is negative When this is the case theobserved relationship between the deductiblelevel and the composition effect is mostly re-versed This is because the effect of risk aver-sion dominates that of adverse selection due toits higher variance

IV Concluding Remarks

The paper makes two separate contributionsFirst from a methodological standpoint we layout a conceptual framework through which one

can formulate and estimate a demand system forindividually customized contracts The key datarequirements for this approach are contractchoices individual choice sets and ex post riskrealizations Since such data may be available inmany other contexts the methodological frame-work may be useful to uncover structural pa-rameters in such settings As an example onecould consider annuity data and use guaranteeperiod choices and mortality data to identifyheterogeneity in risk (mortality) and in prefer-ences for wealth after death (Einav Finkelsteinand Paul Schrimpf 2006) Similarly one couldconsider loan data and use down-paymentchoices and default data to identify heterogene-ity in risk (default) and in liquidity (Will Ad-ams Einav and Jonathan D Levin 2006)Second from an economic standpoint we pro-vide a new set of estimates for the degree andheterogeneity of (absolute) risk aversion and itsrelationship with risk We discuss these below

While our estimates of risk aversion help topredict other related insurance decisions it isnatural to ask to what extent these parametersare relevant in other contexts This is essentiallyan empirical question which can be answeredonly by estimating risk aversion parameters fora variety of bet sizes and in a variety of con-texts Since isolating risk preferences in manycontexts is hard such exercises are rare leavingus with no definite answer for the scope ofmarkets for which our estimates may be rele-vant On one hand Rabin (2000) and Rabin andThaler (2001) argue that different decisions inlife are taken in different contexts and thereforemay be subject to different parameters in theutility function On the other hand classicaltheory suggests that each individual has onevalue function over her lifetime wealth so allrisky decisions take into account the same valuefunction and are therefore subject to the samerisk preferences Our view is somewhere inbetween We are more comfortable with extrap-olation of our risk aversion estimates to setupsthat are closer to the auto insurance marketcontext in which these parameters are esti-mated To assess ldquoclosenessrdquo it is important toconsider various factors over which contextsmay differ Such factors may include bet size aswell as informational and behavioral effectssuch as default options framing and rarity of theevents Since we use claim data to identify risktype we leave everything else to be interpreted as

779VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

FIGURE 5 COUNTERFACTUALSmdashSELECTION

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) Parallel to Figure 4 we breakdown the effects on profits to the share of consumers who choose a low deductible (bottom panel) and to the expected riskof this group (top panel) This is presented by the thick solid line for the estimates of the benchmark model As in Figure 4 wealso present these effects for three additional cases when the correlation between risk and risk aversion is negative (thin solidline) when there is no heterogeneity in risk aversion (dot-dashed line) and when there is no heterogeneity in risk (dashedline) The dotted vertical lines represent the observed level of d (638) for which the share of low deductible is 16 percentand their expected risk is 0264 This may be compared with the corresponding figures in Table 2A of 178 and 0309respectively Note however that the figure presents estimated quantities for the average individual in the data while Table2A presents the average quantities in the data so one should not expect the numbers to fit perfectly

780 THE AMERICAN ECONOMIC REVIEW JUNE 2007

risk aversion As an example overconfidence willbe captured by a lower level of estimated riskaversion so if overconfidence is more importantin auto insurance than in health insurance whenextrapolated to health insurance individuals maybehave as if they are more risk averse than weestimate them to be

With this caveat in mind let us discuss ourfour main findings First we find large hetero-geneity in risk preferences across individualsThis heterogeneity is important in various con-texts (a) it cautions against using pure cross-sectional variation to test expected utilitytheory (b) it may create strong selection ofparticipants into particular markets this maymake participants in voluntary markets quitedifferent in their risk preferences from those inmarkets in which participation is either manda-tory or driven by other factors and (c) it maymake the marginal individual quite differentfrom the average one Models in macroeconom-ics and finance which often use a representativeindividual framework may not be able to cap-ture and account for such differences

Our second set of findings concerns the wayrisk aversion relates to observable characteris-tics Our finding that women are more riskaverse than men has been documented in othersettings The finding that risk preferences ex-hibit a U-shaped pattern over the life cycle maybe interesting to explore further in other con-texts Other findings suggest that the estimatedcoefficient of risk aversion increases with ob-servables that are related to income and wealthAs we agree with the widely held belief of thedecreasing absolute risk aversion property ourpreferred interpretation for this finding is thatwealth and income may be endogenous gener-ating the estimated cross-sectional relationshipWhile lower risk aversion may be associatedwith higher propensity to become an entrepre-neur and thereby have higher wealth it mayalso be associated with lower propensity to saveor invest in education affecting wealth the otherway Therefore one important message of thisfinding is that accounting for heterogeneity inpreferences may be important as representativeconsumer models may provide misleading in-terpretations for otherwise natural results

The last two sets of findings concern therelationship between risk and preferences Sincerisk is particular to the context in which it ismeasured these findings may be sensitive to the

market context and may change once risk takesother forms Even within the auto insurancemarket it is important how risk is measuredMoreover risk in the auto insurance marketmay be conceptually different from risk in othermarkets In many markets risk is independentacross individuals In auto insurance howevermuch of the risk depends on coordinationamong drivers and therefore may be more re-lated to relative not absolute characteristicsFor this reason our finding of positive correla-tion between risk and risk aversion can coexistwith findings of negative correlation in othercontexts (Israel 2005 Finkelstein and McGarry2006) The positive correlation we find is alsoconsistent with the fact that the bivariate probittest in our data provides evidence for adverseselection (Cohen 2005) while similar reduced-form tests in other contexts do not

Finally we find that unobserved heterogene-ity in risk preferences is more important thanheterogeneity in risk This may be driven by thecasual evidence that insurance companies exertmuch effort and resources in collecting con-sumer data which are informative about riskclassification but not about preferences52 Weillustrate the empirical importance of our find-ings for the analysis of optimal contracts in autoinsurance The presence of more than one di-mension of unobserved heterogeneity may dra-matically change the nature of these contractsTheory is still not fully developed for suchmultidimensional screening problems as it typ-ically requires a small number of types (Lands-berger and Meilijson 1999) restricts the twodimensions to be independent of each other(Jean-Charles Rochet and Lars A Stole 2002)or assumes that the number of instrumentsavailable to the monopolist is not smaller thanthe dimension of unobserved heterogeneity(Steven Matthews and John Moore 1987 Rich-ard Arnott and Joseph E Stiglitz 1988)53 MarkArmstrong (1999) may be the closest theoreticalwork to the framework suggested here It cannot

52 This choice of data collection efforts may be justifiedif it is easier for such firms to price discriminate based onrisk but harder to price discriminate based on preferencesA common belief is that without cost-based (ie risk-based) justification for prices price discrimination may leadto consumer backlash

53 See also Smart (2000) Villeneuve (2003) and Jullienet al (2007) for related theoretical results

781VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

be directly applied however as it uses simpli-fying linearity assumptions which would behard to impose in the current context Our re-sults indicate that many applications can benefitfrom extending the theory to include the more

general case such as the one analyzed hereSuch a theory may also serve as a guide forusing supply-side restrictions in similar con-texts Our counterfactual analysis is a very pre-liminary start in this direction

APPENDIX A DESCRIPTION OF THE GIBBS SAMPLER

In this appendix we describe the setup of the Gibbs sampler that we use to estimate the modelOne of the main advantages of the Gibbs sampler is its ability to allow for data augmentation oflatent variables (Tanner and Wong 1987) In our context this amounts to augmenting the individual-specific risk aversion and risk type namely i rii1

n as additional parametersWe can write the model as follows

(22) ln i xi i

(23) ln ri xi vi

(24) choicei 1 if ri ri ()0 if ri ri ()

(25) claimsi Poissoni ti

(26) i

vi

iid

N00

2 r

r r2

Let

2 r

r r2 X

x1

xn

0

0 x1

xn

y r and ui i

vi

The set of parameters for which we want to have a posterior distribution is given by uii1

n The prior specifies that are independent of uii1n have a conventional

diffuse prior We adopt a hierarchical prior for uii1N

(27) uii 1n

iid

N0

(28) 1 Wishart2a Q

782 THE AMERICAN ECONOMIC REVIEW JUNE 2007

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 3: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

cio Palacios-Huerta and Roberto Serrano 2006)While the debate focuses on how the curvatureof the vNM utility function varies with wealthor across different settings we measure thiscurvature only at a particular wealth levelwhatever this wealth level may be By allowingunobserved heterogeneity in this curvatureacross individuals we place no conceptual re-strictions on the relationship between wealthand risk aversion Our estimated distribution ofrisk preferences can be thought of as a convo-lution of the distribution of (relevant) wealthand risk attitudes Avoiding this debate is also adrawback Without taking a stand on the wayabsolute risk preferences vary with sizes andcontexts we cannot discuss how relevant ourestimates are for other settings Obviously wethink they are But since statements about theirexternal relevance are mainly informed by whatwe think and less by what we do we defer thisdiscussion to the concluding section

Our analysis also provides two results regard-ing the relationship between the distribution ofrisk preferences and that of risk First we findthat unobserved heterogeneity in risk aversionis greater and more important (for profits andpricing) than unobserved heterogeneity in riskThis is consistent with the motivation for recenttheoretical work which emphasizes the impor-tance of allowing for preference heterogeneityin analyzing insurance markets (Michael Lands-berger and Isaac Meilijson 1999 Michael Smart2000 David de Meza and David C Webb 2001Bertrand Villeneuve 2003 Pierre-Andre Chiap-pori et al 2006 Bruno Jullien Bernard Salanieand Francois Salanie 2007) Second we findthat unobserved risk has a strong positive cor-relation with unobserved risk aversion and dis-cuss possible interpretations of it This isencouraging from a theoretical standpoint as itretains a single crossing property which weillustrate in the counterfactual exercise Thisfinding contrasts the negative correlation re-ported by Amy Finkelstein and Kathleen Mc-Garry (2006) for the long-term care insurancemarket and by Mark Israel (2005) for automo-bile insurance in Illinois We view these differ-ent results as cautioning against interpreting thiscorrelation parameter outside the context inwhich it is estimated Even if risk preferencesare stable across contexts risk is not and there-fore neither is the correlation structure

This study is related to two important strands

of literature The first shares our main goal ofmeasuring risk aversion Much of the existingevidence about risk preferences is based onintrospection laboratory experiments (Steven JKachelmeier and Mohamed Shehata 1992 Ver-non L Smith and James M Walker 1993Charles A Holt and Susan K Laury 2002) dataon bettors or television game show participants(Robert Gertner 1993 Andrew Metrick 1995Bruno Jullien and Salanie 2000 Roel MBeetsma and Peter C Schotman 2001 MatildeBombardini and Francesco Trebbi 2005) an-swers given by individuals to hypothetical sur-vey questions (W Kip Viscusi and William NEvans 1990 Evans and Viscusi 1991 Robert BBarsky et al 1997 Bas Donkers BertrandMelenberg and Arthur van Soest 2001 JoopHartog Ada Ferrer-i-Carbonell and NicoleJonker 2002) and estimates that are driven bythe imposed functional form relationship be-tween static risk-taking behavior and intertem-poral substitution5 We are aware of only a fewattempts to recover risk preferences from deci-sions of regular market participants Atanu Saha(1997) looks at firmsrsquo production decisionsand Raj Chetty (2006) recovers risk preferencesfrom labor supply In the context of insuranceCicchetti and Dubin (1994) look at individualsrsquodecisions whether to insure against failure ofinterior telephone wires Compared to theirsetting in our setting events are more frequentand commonly observed stakes are higher thepotential loss (the difference between the de-ductible amounts) is known and the deductiblechoice we analyze is more immune to alter-native preference-based explanations Finallyin a recent working paper Justin Sydnor (2006)uses data on deductible choices in homeownerrsquosinsurance to calibrate a bound for the impliedlevel of risk aversion6 An important differencebetween our paper and these papers is thatthey all rely on a representative individual

5 Much of the finance and macroeconomics literaturegoing back to Irwin Friend and Marshall E Blume (1975)relies on this assumption As noted by Narayana R Kocher-lakota (1996) in a review of this literature the level of staticrisk aversion is still a fairly open question

6 The possibility of using deductibles to make inferencesabout risk aversion was first pointed out by Jacques HDreze (1981) Dreze suggests however relying on theoptimality of the observed contracts (ldquosupply siderdquo infor-mation) while we rely on individualsrsquo choices of deduct-ibles (ldquodemand siderdquo information)

747VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

framework and therefore focus only on thelevel of risk aversion7 In contrast we explicitlymodel observed and unobserved heterogeneityin risk aversion as well as in risk We cantherefore provide results regarding the hetero-geneity in risk preferences and its relationshipwith risk which have potentially important im-plications for welfare and policy A representa-tive individual framework cannot address suchquestions

The second strand of related literature isthe recent empirical literature on adverse se-lection in insurance markets Much of thisliterature addresses the important question ofwhether adverse selection exists in differentmarkets As suggested by the influential workof Chiappori and Salanie (2000) it uses ldquore-duced formrdquo specifications to test whether aftercontrolling for observables accident outcomesand coverage choices are significantly corre-lated (Georges Dionne and Charles Vanasse1992 Robert Puelz and Arthur Snow 1994John Cawley and Tomas Philipson 1999Finkelstein and James Poterba 2004 Finkel-stein and McGarry 2006) Cohen (2005) appliesthis test to our data and finds evidence consis-tent with adverse selection As our main goal isquite different we take a more structural ap-proach By assuming a structure for the adverseselection mechanism we can account for itwhen estimating the distribution of risk prefer-ences While the structure of adverse selectionis assumed its relative importance is not im-posed The structural assumptions allow us toestimate the importance of adverse selectionrelative to the selection induced by unobservedheterogeneity in risk attitudes As we discussin Section IIC this approach is conceptuallysimilar to that of James H Cardon and IgalHendel (2001) who model health insurancechoices and also allow for two dimensions ofunobserved heterogeneity8

The rest of the paper is organized as followsSection I describes the environment the setup

and the data Section II lays out the theoreticalmodel and the related econometric model anddescribes its estimation and identification Sec-tion III describes the results We first provide aset of reduced-form estimates which motivatethe more structural approach We then presentestimates from the benchmark specification aswell as estimates from various extensions androbustness tests We discuss and justify some ofthe modeling assumptions and perform counter-factual analysis as a way to illustrate the impli-cations of the results to profits and pricingSection IV concludes by discussing the rele-vance of the results to other settings

I Data

A Economic Environment and Data Sources

We obtained data from a single insurancecompany that operates in the market for auto-mobile insurance in Israel The data containinformation about all 105800 new policyhold-ers who purchased (annual) policies from thecompany during the first five years of its oper-ation from November 1994 to October 1999Although many of these individuals stayed withthe insurer in subsequent years we focusthrough most of the paper on deductible choicesmade by individuals in their first contract withthe company This allows us to abstract fromthe selection implied by the endogenous choiceof individuals whether to remain with the com-pany or not (Cohen 2003 2005)

The company studied was the first company inthe Israeli auto insurance market that marketedinsurance to customers directly rather thanthrough insurance agents By the end of the stud-ied period the company sold about 7 percent ofthe automobile insurance policies issued in IsraelDirect insurers operate in many countries and ap-pear to have a significant cost advantage (J DavidCummins and Jack L Van Derhei 1979) Thestudied company estimated that selling insurancedirectly results in a cost advantage of roughly 25percent of the administrative costs involved inmarketing and handling policies Despite theircost advantage direct insurers generally have haddifficulty in making inroads beyond a part of themarket because the product does not provide theldquoamenityrdquo of having an agent to work with andturn to (Stephen P DrsquoArcy and Neil A Doherty1990) This aspect of the company clearly makes

7 An exception is Syngjoo Choi et al (2006) who use alaboratory experiment and similar to us find a high degreeof heterogeneity in risk attitudes across individuals

8 In an ongoing project Pierre-Andre Chiappori andBernard Salanie (2006) estimate an equilibrium model ofthe French auto insurance market where their model of thedemand side of the market is conceptually similar to the onewe estimate in this paper

748 THE AMERICAN ECONOMIC REVIEW JUNE 2007

the results of the paper applicable only to thoseconsumers who seriously consider buying directinsurance Section IIID discusses this selection inmore detail

While we focus primarily on the demand sideof the market by modeling the deductible choicethe supply side (pricing) will be relevant for anycounterfactual exercise as well as for understand-ing the viability of the outside option (which wedo not observe and do not model) During the firsttwo years of the companyrsquos operations the pricesit offered were lower by about 20 percent thanthose offered by other conventional insurersThus due to its differentiation and cost advantagethe company had market power with respect toindividuals who were more sensitive to price thanto the disamenity of not having an agent Thismakes monopolistic screening models apply morenaturally than competitive models of insurance(eg Michael Rothschild and Joseph E Stiglitz1976) During the companyrsquos third year of oper-ation (December 1996 to March 1998) it facedmore competitive conditions when the estab-lished companies trying to fight off the new en-trant lowered the premiums for policies withregular deductibles to the levels offered by thecompany In their remaining period included inthe data the established companies raisedtheir premiums back to previous levels leav-ing the company again with a substantialprice advantage9

For each policy our dataset includes all theinsurerrsquos information about the characteristicsof the policyholder demographic characteris-tics vehicle characteristics and details abouthis driving experience The Appendix providesa list of variables and their definitions andTable 1 provides summary statistics In addi-tion our data include the individual-specificmenu of four deductible and premium combi-nations that the individual was offered (see be-low) the individualrsquos choice from this menuand the realization of risks covered by the pol-icy the length of the period over which it was ineffect the number of claims submitted by the

policyholder and the amounts of the submittedclaims10 Finally we use the zip codes of thepolicyholdersrsquo home addresses11 to augment thedata with proxies for individualsrsquo wealth basedon the Israeli 1995 census12

The policies offered by the insurer (as all pol-icies offered in the studied market) are one-periodpolicies with no commitment on the part of eitherthe insurer or the policyholder13 The policy re-sembles the US version of ldquocomprehensiverdquo in-surance It is not mandatory but it is held by alarge fraction of Israeli car owners (above 70percent according to the companyrsquos executives)The policy does not cover death or injuries to thepolicyholder or to third parties which areinsured through a separate mandatory policyInsurance policies for car audio equipmentwindshield replacement car and towing ser-vices are structured and priced separately Cer-tain types of coverage do not carry a deductibleand are therefore not used in the analysis14

Throughout the paper we use and reportmonetary amounts in current (nominal) NewIsraeli Shekels (NIS) to avoid creating artificialvariation in the data Consequently the follow-ing facts may be useful for interpretation andcomparison with other papers in the literature

9 During this last period two other companies offeringinsurance directly were established Due to first-mover advan-tage (as viewed by the companyrsquos management) which helpedthe company maintain a strong position in the market thesetwo new companies did not affect pricing policies much untilthe end of our observation period Right in the end of thisperiod the studied company acquired one of these entrants

10 Throughout the analysis we make the assumption thatthe main policyholder is the individual who makes the deduct-ible choice Clearly to the extent that this is not always thecase the results should be interpreted accordingly

11 The company has the addresses on record for billingpurposes Although in principle the company could haveused these data for pricing they do not do so

12 The Israeli Central Bureau of Statistics (CBS) associateseach census respondent with a unique ldquostatistical areardquo eachincluding between 1000 and 10000 residents We matchedthese census tracts with zip codes based on street addressesand constructed variables at the zip code level These con-structed variables are available for more than 80 percent of thepolicyholders As a proxy for wealth we use (gross) monthlyincome which is based on self-reported income by censusrespondents augmented (by the CBS) with Social Security data

13 There is a substantial literature that studies the optimaldesign of policies that commit customers to a multiperiodcontract or that include a one-sided commitment of theinsurer to offer the policyholder certain terms in subsequentperiods (Georges Dionne and Pierre Lasserre 1985 RussellCooper and Beth Hayes 1987 Georges Dionne and Neil ADoherty 1994 Igal Hendel and Alessandro Lizzeri 2003)Although such policies are observed in certain countries(Dionne and Vanasse 1992) many insurance markets includ-ing the one we study use only one-period no-commitmentpolicies (Howard Kunreuther and Mark V Pauly 1985)

14 These include auto theft total loss accidents and notldquoat faultrdquo accidents

749VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

The exchange rate between NIS and US dollarsmonotonically increased from 301 in 1995 to414 in 1999 (on average it was 352)15 Annualinflation was about 8 percent on average andcumulative inflation over the observation periodwas 48 percent We will account for these effects

as well as other general trends by using yeardummy variables throughout the analysis

B The Menu of Deductibles and Premiums

Let xi be the vector of characteristics individuali reports to the insurance company After learningxi the insurer offered individual i a menu of fourcontract choices One option offered a ldquoregularrdquodeductible and a ldquoregularrdquo premium The term

15 PPP figures were about 10 percent lower than the nom-inal exchange rates running from 260 in 1995 to 374 in 1999

TABLE 1mdashSUMMARY STATISTICSmdashCOVARIATES

Variable Mean Std dev Min Max

Demographics Age 41137 1237 1806 8943Female 0316 047 0 1Family Single 0143 035 0 1

Married 0779 042 0 1Divorced 0057 023 0 1Widower 0020 014 0 1Refused to say 0001 004 0 1

Education Elementary 0016 012 0 1High school 0230 042 0 1Technical 0053 022 0 1College 0233 042 0 1No response 0468 050 0 1

Emigrant 0335 047 0 1

Car attributes Value (current NIS)a 66958 37377 4000 617000Car age 3952 287 0 14Commercial car 0083 028 0 1Engine size (cc) 1568 385 700 5000

Driving License years 18178 1007 0 63Good driver 0548 050 0 1Any driver 0743 044 0 1Secondary car 0151 036 0 1Business use 0082 027 0 1Estimated mileage (km)b 14031 5891 1000 32200History length 2847 061 0 3Claims history 0060 015 0 2

Young driver Young 0192 039 0 1Gender Male 0113 032 0 1

Female 0079 027 0 1Age 17ndash19 0029 017 0 1

19ndash21 0051 022 0 121ndash24 0089 029 0 124 0022 015 0 1

Experience 1 0042 020 0 11ndash3 0071 026 0 13 0079 027 0 1

Company year First year 0207 041 0 1Second year 0225 042 0 1Third year 0194 040 0 1Fourth year 0178 038 0 1Fifth year 0195 040 0 1

Note The table is based on all 105800 new customers in the dataa The average exchange rate throughout the sample period was approximately $1 per 35 NIS starting at 13 in late 1994

and reaching 14 in late 1999b The estimated mileage is not reported by everyone It is available for only 60422 new customers

750 THE AMERICAN ECONOMIC REVIEW JUNE 2007

regular was used for this deductible level bothbecause it was relatively similar to the deductiblelevels offered by other insurers and because mostpolicyholders chose it The regular premium var-ied across individuals according to some deter-ministic function (unknown to us) pit ft(xi)which was quite stable over time The regulardeductible level was directly linked to the regularpremium according to

(1) dit min12

pit capt

That is it was set to one-half of the regular pre-mium subject to a deductible cap capt whichvaried over time but not across individuals Thepremiums associated with the other options werecomputed by multiplying pit by three differentconstants 106 for ldquolowrdquo deductible 0875 forldquohighrdquo deductible and 08 for ldquovery highrdquo de-ductible The regular deductible dit was con-verted to the other three deductible levels in asimilar way using multipliers of 06 for low 18for high and 26 for very high

There are two main sources of exogenous vari-ation in prices The first arises from companyexperimentation The multipliers described abovewere fixed across individuals and over time formost of the observation period but there was asix-month period during the insurerrsquos first year ofoperation (May 1995 to October 1995) in whichthe insurer experimented with slightly modifiedmultipliers16 This modified formula covers al-most 10 percent of the sample The secondsource of variation arises from discrete ad-justments to the uniform cap The cap variedover time due to inflation competitive condi-tions and as the company gained more expe-rience (Figure 1) The cap was binding for

16 For individuals with low levels of regular premiumsduring the specified period the regular deductible was set at53 percent (instead of 50 percent) of the regular premiumthe low deductible was set at 33 percent (instead of 30percent) of the regular premium and so on

FIGURE 1 VARIATION IN THE DEDUCTIBLE CAP OVER TIME

Notes This figure presents the variation in the deductible cap over time which is one of the main sources of pricing variation inthe data We do not observe the cap directly but it can be calculated from the observed menus The graph plots the maximalregular deductible offered to anyone who bought insurance from the company over a moving seven-day window Thelarge jumps in the graph reflect changes in the deductible cap There are three reasons why the graph is not perfectlysmooth First in a few holiday periods (eg October 1995) there are not enough sales within a seven-day window sonone of those sales hits the cap This gives rise to temporary jumps downward Second the pricing rule applies at thedate of the price quote given to the potential customer Our recorded date is the first date the policy becomes effectiveThe price quote is held for a period of two to four weeks so in periods in which the pricing changes we may still seenew policies sold using earlier quotes made according to a previous pricing regime Finally even within periods of constantcap the maximal deductible varies slightly (variation of less than 05 percent) We do not know the source of this variation

751VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

about a third of the policyholders in our dataAll these individuals would be affected by achange in the cap Much of the variation ofmenus in the data is driven by the exogenousshifts in the uniform deductible cap The un-derlying assumption is that conditional onobservables these sources of variation pri-marily affect the deductible choice of newcustomers but they do not have a significantimpact on the probability of purchasing insurancefrom the company Indeed this assumption holdsin the data with respect to observables there is nodistinguishable difference in the distribution ofobservable characteristics of consumers who buyinsurance just before and those who buy just aftera change in the deductible cap

C Summary Statistics

The top of Table 2A summarizes the deduct-ible menus all are calculated according to theformula described earlier Only 1 percent of thepolicyholders chose the high or very high de-ductible options Therefore for the rest of theanalysis we focus only on the choice betweenregular and low deductible options (chosen by811 and 178 percent of the individuals respec-tively)17 Focusing only on these options doesnot create any selection bias because we do notomit individuals who chose high or very highdeductibles For these individuals we assumethat they chose a regular deductible This as-sumption is consistent with the structural modelwe develop in the next section which impliesthat conditional on choosing high or very highdeductibles an individual would almost alwaysprefer the regular over the low deductible

The bottom of Table 2A as well as Table2B present summary statistics for the policy re-alizations We focus only on claim rates and noton the amounts of the claims This is because anyamount above the higher deductible level is cov-

ered irrespective of the deductible choice and thevast majority of the claims fit in this category (seeSection IIIE) For all these claims the gain fromchoosing a low deductible is the same in the eventof a claim and is equal to the difference betweenthe two deductible levels Therefore the claimamount is rarely relevant for the deductible choice(and likewise for the companyrsquos pricing decisionwe analyze in Section IIIF)

Averaging over all individuals the annualclaim rate was 0245 One can clearly observesome initial evidence of adverse selection Onaverage individuals who chose a low deductiblehad higher claim rates (0309) than those whochose the regular deductible (0232) Those whochose high and very high deductibles had muchlower claim rates (0128 and 0133 respectively)These figures can be interpreted in the context ofthe pricing formula A risk-neutral individual willchoose the low deductible if and only if her claimrate is higher than (pd) (plow pregular)(dregular dlow) When the deductible cap is notbinding which is the case for about two-thirds ofthe sample this ratio is given directly by thepricing formula and is equal to 03 Thus anyindividual with a claim rate higher than 03 willbenefit from buying the additional coverage pro-vided by a low deductible even without any riskaversion The claim data suggest that the offeredmenu is cheaper than an actuarially fair contractfor a nonnegligible part of the population (13percent according to the benchmark estimates re-ported below) This observation is in sharp con-trast to other types of insurance contracts such asappliance warranties which are much more ex-pensive than the actuarially fair price (Rabin andThaler 2001)

II The Empirical Model

A A Model of Deductible Choice

Let wi be individual irsquos wealth (pih di

h) theinsurance contract (premium and deductible re-spectively) with high deductible (pi

l dil) the

insurance contract with low deductible ti theduration of the policy and ui(w) individual irsquosvNM utility function We assume that the num-ber of insurance claims is drawn from a Poissondistribution with an annual claim rate iThrough most of the paper we assume that i isknown to the individual We also assume that iis independent of the deductible choice ie that

17 The small frequency of ldquohighrdquo and ldquovery highrdquochoices provides important information about the lowerends of the risk and risk aversion distributions but (for thatsame reason) makes the analysis sensitive to functionalform Considering these options or the option of not buyinginsurance creates a sharp lower bound on risk aversion forthe majority of the observations making the estimates muchhigher Given that these options are rarely selected how-ever it is not clear to us whether they were regularlymentioned during the insurance sales process renderingtheir use somewhat inappropriate

752 THE AMERICAN ECONOMIC REVIEW JUNE 2007

there is no moral hazard Finally we assumethat in the event of an accident the value of theclaim is greater than di

h We revisit all theseassumptions in Sections IIID and IIIE For therest of this section i subscripts are suppressedfor convenience

In the market we study insurance policies are

typically held for a full year after which theycan be automatically renewed with no commit-ment by either the company or the individualMoreover all auto-insurance policies sold inIsrael can be canceled without prior notice bythe policyholder with premium payments beinglinearly prorated Both the premium and the

TABLE 2AmdashSUMMARY STATISTICSmdashMENUS CHOICES AND OUTCOMES

Variable Obs Mean Std dev Min Max

Menu Deductible (current NIS)a Low 105800 87548 12101 37492 103911Regular 105800 145299 19779 62486 171543High 105800 260802 35291 112475 308778Very high 105800 376305 50853 162464 446013

Premium (current NIS)a Low 105800 338057 91404 132471 1923962Regular 105800 318922 8623 124972 1815058High 105800 279057 75451 109351 1588176Very high 105800 255137 68984 99978 1452046

pd 105800 0328 006 03 18

Realization Choice Low 105800 0178 038 0 1Regular 105800 0811 039 0 1High 105800 0006 008 0 1Very high 105800 0005 007 0 1

Policy termination Active 105800 0150 036 0 1Canceled 105800 0143 035 0 1Expired 105800 0707 046 0 1

Policy duration (years) 105800 0848 028 0005 108Claims All 105800 0208 048 0 5

Low 18799 0280 055 0 5Regular 85840 0194 046 0 5High 654 0109 034 0 3Very high 507 0107 032 0 2

Claims per yearb All 105800 0245 066 0 19882Low 18799 0309 066 0 9264Regular 85840 0232 066 0 19882High 654 0128 062 0 12636Very high 507 0133 050 0 3326

a The average exchange rate throughout the sample period was approximately $1 per 35 NIS starting at 13 in late 1994and reaching 14 in late 1999

b The mean and standard deviation of the claims per year are weighted by the observed policy duration to adjust forvariation in the exposure period These are the maximum likelihood estimates of a simple Poisson model with no covariates

TABLE 2BmdashSUMMARY STATISTICSmdashCONTRACT CHOICES AND REALIZATIONS

Claims Low Regular High Very high Total Share

0 11929 (0193) 49281 (0796) 412 (0007) 299 (0005) 61921 (100) 080341 3124 (0239) 9867 (0755) 47 (0004) 35 (0003) 13073 (100) 016962 565 (0308) 1261 (0688) 4 (0002) 2 (0001) 1832 (100) 002383 71 (0314) 154 (0681) 1 (0004) 0 (0000) 226 (100) 000294 6 (0353) 11 (0647) 0 (0000) 0 (0000) 17 (100) 000025 1 (0500) 1 (0500) 0 (0000) 0 (0000) 2 (100) 000003

Notes The table presents tabulation of choices and number of claims For comparability the figures are computed only forindividuals whose policies lasted at least 09 years (about 73 percent of the data) The bottom rows of Table 2A providedescriptive figures for the full dataset The numbers in parentheses in each cell represent percentages within each row Theright-hand-side column presents the marginal distribution of the number of claims

753VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

probability of a claim are proportional to thelength of the time interval taken into account soit is convenient to think of the contract choice asa commitment for only a short amount of timeThis approach has several advantages First ithelps to account for early cancellations andtruncated policies which together constitute 30percent of the policies in the data18 Second itmakes the deductible choice independent ofother longer-term uncertainties faced by the in-dividual so we can focus on static risk-takingbehavior Third this formulation helps to obtaina simple framework for analysis which is attrac-tive both analytically and computationally19

The expected utility that the individual obtainsfrom the choice of a contract (p d) is given by

(2) v p d

1 tuw pt tuw pt d

We characterize the set of parameters that willmake the individual indifferent between the twooffered contracts This set provides a lower (up-per) bound on the level of risk aversion for indi-viduals who choose the low (high) deductible (fora given ) Thus we analyze the equation v(phdh) v(pl dl) By taking limits with respect to t(and applying LrsquoHopitalrsquos rule) we obtain

(3)

limt30

1

t(u(w pht) u(w plt))

(u(w pht) u(w pht dh)) (u(w plt) u(w plt dl))

pl phuw

uw dl uw dh

or

(4)

pl phuw uw dl uw dh

The last expression has a simple intuition Theright-hand side is the expected gain (in utils) perunit of time from choosing a low deductibleThe left-hand side is the cost of such a choiceper unit of time For the individual to be indif-ferent the expected gains must equal the costs

In our benchmark specification we assumethat the third derivative of the vNM utilityfunction is not too large A Taylor expansion forboth terms on the right-hand side of equation(4) ie u(w d) u(w) du(w) (d22)u13(w) implies that

(5)pl ph

uw dh dluw

1

2dh dldh dlu13w

Let d dh dl 0 p pl ph 0 andd 1frasl2 (dh dl) to obtain

(6)p

duw uw d u13w

or

(7) r u13w

uw

p

d 1

d

where r is the coefficient of absolute risk aver-sion at wealth level w

18 As can be seen in Table 2A 70 percent of the policiesare observed through their full duration (one year) About15 percent are truncated by the end of our observationperiod and the remaining 15 percent are canceled for var-ious reasons such as change in car ownership total-lossaccident or a unilateral decision of the policyholder tochange insurance providers

19 This specification ignores the option value associatedwith not canceling a policy This is not very restrictiveSince experience rating is small and menus do not changeby much this option value is likely to be close to zero Asimple alternative is to assume that individuals behave as ifthey commit for a full year of coverage In such a case themodel will be similar to the one we estimate but willdepend on the functional form of the vNM utility functionand would generally require taking infinite sums (over thepotential realizations for the number of claims within the year)In the special case of quadratic expected utility maximizerswho care only about the mean and variance of the number ofclaims this is easy to solve The result is almost identical to theexpression we subsequently derive in equation (7)

754 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Equation (7) defines an indifference set in thespace of risk and risk aversion which we willrefer to by (r() ) and ((r) r) interchange-ably Both r() and (r) have a closed-formrepresentation a property that will be computa-tionally attractive for estimation20 Both termsare individual specific as they depend on thedeductible menu which varies across individu-als For the rest of the paper we regard eachindividual as associated with a two-dimensionaltype (ri i) An individual with a risk parameter

i who is offered a menu (pih di

h) (pil di

l)will choose the low-deductible contract if andonly if his coefficient of absolute risk aversionsatisfies ri ri(i) Figure 2 presents a graphi-cal illustration

B The Benchmark Econometric Model

The Econometric ModelmdashThe econometricmodel we estimate is fully described by the fiveequations presented in this section Our ob-jective is to estimate the joint distribution of(i ri)mdashthe claim rate and coefficient of abso-lute risk aversionmdashin the population of policy-holders conditional on observables xi The

20 For example estimating the CARA version of themodel (Section IIID) for which r() does not have aclosed-form representation takes almost ten times longer

FIGURE 2 THE INDIVIDUALrsquoS DECISIONmdashA GRAPHICAL ILLUSTRATION

Notes This graph illustrates the individualrsquos decision problem The solid line presents the indifference setmdashequation (7)mdashappliedfor the menu faced by the average individual in the sample Individuals are represented by points in the two-dimensional spaceabove In particular the scattered points are 10000 draws from the joint distribution of risk and risk aversion for the averageindividual (on observables) in the data based on the point estimates of the benchmark model (Table 4) If an individual is eitherto the right of the line (high risk) or above the line (high risk aversion) the low deductible is optimal Adverse selection is capturedby the fact that the line is downward sloping higher-risk individuals require lower levels of risk aversion to choose the lowdeductible Thus in the absence of correlation between risk and risk aversion higher-risk individuals are more likely to choosehigher levels of insurance An individual with i (pidi) will choose a lower deductible even if he is risk neutral ie withprobability one (we do not allow individuals to be risk loving) This does not create an estimation problem because i is notobserved only a posterior distribution for it Any such distribution will have a positive weight on values of i that are below(pidi) Second the indifference set is a function of the menu and in particular of (pidi) and d An increase in (pidi) willshift the set up and to the right and an increase in d will shift the set down and to the left Therefore exogenous shiftsof the menus that make both arguments change in the same direction can make the sets ldquocrossrdquo thereby allowing us toidentify the correlation between risk and risk aversion nonparametrically With positive correlation (shown in the figureby the ldquoright-bendingrdquo shape of the simulated draws) the marginal individuals are relatively high risk therefore creatinga stronger incentive for the insurer to raise the price of the low deductible

755VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

benchmark formulation assumes that (i ri) fol-lows a bivariate lognormal distribution Thuswe can write the model as

(8) ln i xi i

(9) ln ri xi vi

with

(10) i

vi

iid

N00

2 r

r r2

Neither i nor ri is directly observed Thereforewe treat both as latent variables Loosely speak-ing they can be thought of as random effects Weobserve two variables the number of claims andthe deductible choice which are related to thesetwo unobserved components Thus to completeour econometric model we have to specify therelationship between the observed variables andthe latent ones This is done by making two struc-tural assumptions First we assume that the num-ber of claims is drawn from a Poisson distributionnamely

(11) claimsi Poisson(i ti )

where ti is the observed duration of the policySecond we assume that when individuals maketheir deductible choices they follow the theoreti-cal model described in the previous section Themodel implies that individual i chooses the lowdeductible (choicei 1) if and only if ri ri(i)where ri is defined in equation (7) Thus theempirical model for deductible choice is given by

(12) Prchoicei 1 Prri

pi

idi 1

d i

Prexp(xi vi)

pi

exp(xi i)di 1

d i

With no unobserved heterogeneity in risk (i 0) equation (12) reduces to a simple probit In

such a case one can perfectly predict risk from thedata denote it by (xi) and construct an addi-tional covariate ln([pi((xi)di) 1]d i) Giventhe assumption that risk aversion is distributedlognormally running the probit regressionabove and renormalizing the coefficient onthe constructed covariate to 1 (instead ofthe typical normalization of the variance of theerror term to 1) has a structural interpretation withln (ri) as the dependent variable However Cohen(2005) provides evidence of adverse selection inthe data implying the existence of unobservedheterogeneity in risk This makes the simple probitregression misspecified Estimation of the fullmodel is more complicated Once we allow forunobserved heterogeneity in both unobserved riskaversion (vi) and claim rate (i) we have to inte-grate over the two-dimensional region depicted inFigure 2 for estimation

EstimationmdashA natural way to proceed is toestimate the model by maximum likelihoodwhere the likelihood of the data as a function ofthe parameters can be written by integrating outthe latent variables namely

(13) Lclaimsi choicei

Prclaimsi choiceii riPri ri

where is a vector of parameters to be esti-mated While formulating the empirical modelusing likelihood may help our thinking aboutthe data-generating process using maximumlikelihood (or generalized method of moments(GMM)) for estimation is computationally cum-bersome This is because in each iteration itrequires evaluating a separate complex integralfor each individual in the data In contrastMarkov Chain Monte Carlo (MCMC) Gibbssampling is quite attractive in such a case Us-ing data augmentation of latent variables (Mar-tin A Tanner and Wing Hung Wong 1987)according to which we simulate (i ri) and latertreat those simulations as if they are part ofthe data one can avoid evaluating the complexintegrals by sampling from truncated normaldistributions which is significantly less compu-tationally demanding (eg Luc Devroye 1986)This feature combined with the idea of aldquosliced samplerrdquo (Paul Damien John Wake-field and Stephen Walker 1999) to sample from

756 THE AMERICAN ECONOMIC REVIEW JUNE 2007

an unfamiliar posterior distribution makes theuse of a Gibbs sampler quite efficient for ourpurposes Finally the lognormality assumptionimplies that F(ln(i)ri) and F(ln(ri)i) follow a(conditional) normal distribution allowing us torestrict attention to univariate draws furtherreducing the computational burden

The Appendix provides a full description ofthe Gibbs sampler including the conditionaldistributions and the (flat) prior distributions weuse The basic intuition is that conditional onobserving (i ri) for each individual we have asimple linear regression model with two equa-tions The less standard part is to generate drawsfor (i ri) We do this iteratively Conditionalon i the posterior distribution for ln(ri) fol-lows a truncated normal distribution where thetruncation point depends on the menu individ-ual i faces and its direction (from above orbelow) depends on individual irsquos deductiblechoice The final step is to sample from theposterior distribution of ln(i) conditional on riThis is more complicated as we have bothtruncation which arises from adverse selection(just as we do when sampling for ri) and thenumber of claims which provides additionalinformation about the posterior of i Thus theposterior for i takes an unfamiliar form forwhich we use a ldquosliced samplerrdquo

We use 100000 iterations of the Gibbs sam-pler It seems to converge to the stationarydistribution after about 5000 iterations There-fore we drop the first 10000 draws and use thelast 90000 draws of each variable to report ourresults The results are based on the posteriormean and posterior standard deviation fromthese 90000 draws Note that each iterationinvolves generating separate draws of (i ri) foreach individual Performing 100000 iterationsof the benchmark specification (coded in Mat-lab) takes about 60 hours on a Dell Precision530 workstation

C Identification

The parametric version of the model is iden-tified mechanically There are more equationsthan unknowns and no linear dependenciesamong them so (as also verified using MonteCarlo exercises) the model parameters can bebacked out from simulated data Our goal in thissection is not to provide a formal identificationproof Rather we want to provide intuition for

which features of the data allow us to identifyparticular parameters of the model The discus-sion also highlights the assumptions that areessential for identification vis-a-vis those thatare made only for computational convenience(making them in principle testable)

Discussion of Nonparametric Identifica-tionmdashThe main difficulty in identifying themodel arises from the gap between the (exante) risk type i which individuals usewhen choosing a deductible and the (ex post)realization of the number of claims we ob-serve We identify between the variation inrisk types and the variation in risk realizationsusing our parametric distributional assump-tions The key is that the distribution of risktypes can be uniquely backed out from theclaim data alone This allows us to use thedeductible choice as an additional layer ofinformation which identifies unobserved het-erogeneity in risk aversion21 Any distribu-tional assumption that allows us to uniquelyback out the distribution of risk types fromclaim data would be sufficient to identify thedistribution of risk aversion As is customaryin the analysis of count processes we make aparametric assumption that claims are gener-ated by a lognormal mixture of Poisson dis-tributions (Section IIID discusses this furtherand explores an alternative) Using a mixtureenables us to account for adverse selectionthrough unobserved heterogeneity in risk Italso allows us to better fit the tails of theclaim distribution In principle a more flexi-ble mixture or a more flexible claim-generat-ing process could be identified as long as theclaims data are sufficiently rich22

21 Cardon and Hendel (2001) face a similar identificationproblem in the context of health insurance They use vari-ation in coverage choice (analogous to deductible choice) toidentify the variation in health-status signals (analogous torisk types) from the variation in health expenditure (analo-gous to number of claims) They can rely on the coveragechoice to identify this because they make an assumptionregarding unobserved heterogeneity in preferences (iidlogit) We take a different approach as our main goal is toestimate rather than assume the distribution of preferences

22 Although it may seem that the claim data are limited(as they take only integer values between 0 and 5 in ourdata) variation in policy duration generates continuousvariation in the observed claim propensity Of course thisvariation also introduces an additional selection into themodel due to policy cancellations which are potentially

757VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Once the distribution of risk types is iden-tified from claims data the marginal distribu-tion of risk aversion (and its relationship tothe distribution of risk types) is nonparametri-cally identified from the variation in theoffered menus discussed in Section I Thisvariation implies different deductible and pre-mium options to identical (on observables)individuals who purchased insurance at dif-ferent times Different menus lead to differentindifference sets (similar to the one depictedin Figure 2) These sets often cross each otherand nonparametrically identify the distribu-tion of risk aversion and the correlation struc-ture at least within the region in which theindifference sets vary For the tails of thedistribution as is typically the case we haveto rely on parametric assumptions or usebounds The parametric assumption of log-normality we use for most of the paper ismade only for computational convenience

Intuition for the Parametric IdentificationMechanismmdashVariation in the offered menus isimportant for the nonparametric identificationThe parametric assumptions could identify themodel without such variation Thus to keep theintuition simple let us take the bivariate log-normal distribution as given and contrary to thedata assume that all individuals are identical onobservables and face the same menu Supposealso that all individuals are observed for exactlyone year and have up to two claims23 In thissimplified case the model has five parametersto be estimated the mean and variance of risk and

2 the mean and variance of risk aver-sion r and r

2 and the correlation parameter The data can be summarized by five numbersLet 130 131 and 132 1 131 130 be thefraction of individuals with zero one and twoclaims respectively Let 0 1 and 2 be theproportion of individuals who chose a low de-ductible within each ldquoclaim grouprdquo Given ourdistributional assumption about the claim-

generating process we can use 130 and 131 touniquely identify and

2 Loosely isidentified by the average claim rate in the dataand

2 is identified by how fat the tail of theclaim distribution is ie by how large (132131)is compared to (131130) Given and

2 and thelognormality assumption we can (implicitly)construct a posterior distribution of risk typesfor each claim group F(r claims c) andintegrate over it when predicting the deductiblechoice This provides us with three additionalmoments each of the form

(14) Ec Prchoice 1r

dFr claims c dFr

for c 0 1 2 These moments identify thethree remaining parameters of the model rr

2 and Let us now provide more economic content to

the identification argument Using the same ex-ample and conditional on identifying and

2

from the claim data one can think about thedeductible choice data 0 1 2 as a graph(c) The absolute level of the graph identifiesr In the absence of correlation between riskand risk aversion the slope of the graph iden-tifies r

2 with no correlation the slope shouldalways be positive (due to adverse selection)but higher r

2 would imply a flatter graph be-cause more variation in the deductible choiceswill be attributed to variation in risk aversionFinally is identified by the curvature of thegraph The more convex (concave) the graph isthe more positive (negative) is the estimated For example if 0 05 1 051 and 2 099 it is likely that r

2 is high (explaining why0 and 1 are so close) and is highly positive(explaining why 2 is not also close to 1) Incontrast if 0 1 it must mean that thecorrelation between risk and risk aversion isnegative which is the only way the originalpositive correlation induced by adverse selec-tion can be offset This intuition also clarifiesthat the identification of relies on observingindividuals with multiple claims (or differentpolicy durations) and that it is likely to besensitive to the distributional assumptions Thedata (Table 2B) provide a direct (positive) cor-relation between deductible choice and claims

endogenous The results are similar when we use onlyexpired and truncated policies

23 This variation is sufficient (and necessary) to identifythe benchmark model The data provide more variation weobserve up to five claims per individual we observe con-tinuous variation in the policy durations we observe vari-ation in prices and we exploit distributional restrictionsacross individuals with different observable characteristics

758 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The structural assumptions allow us to explainhow much of this correlation can be attributedto adverse selection The remaining correlation(positive or negative) is therefore attributed tocorrelation in the underlying distribution of riskand risk aversion

III Results

A Reduced-Form Estimation

To get an initial sense for the levels of abso-lute risk aversion implied by the data we use asimple back-of-the-envelope exercise We com-pute unconditional averages of p d dh dland d (Table 2A)24 and substitute these valuesin equation (7) The implied coefficient of ab-solute risk aversion is 29 104 NIS125 Thisfigure could be thought of as the average indif-ference point implying that about 18 percent ofthe policyholders have coefficients of absoluterisk aversion exceeding it To convert to USdollar amounts one needs to multiply thesefigures by the average exchange rate (352)resulting in an average indifference point of102 103 $US1 This figure is less than halfof a similar estimate reported by Sydnor (2006)for buyers of homeownerrsquos insurance but about3 and 13 times higher than comparable fig-ures reported by Gertner (1993) and Metrick(1995) respectively for television game showparticipants

Table 3 provides reduced-form analysis ofthe relationship between the observables andour two left-hand-side variables the number ofclaims and the deductible choice26 Column 1reports the estimates from a Poisson regressionof the number of claims on observed character-istics This regression is closely related to therisk equation we estimate in the benchmarkmodel It shows that older people women andpeople with a college education are less likely to

have an accident Bigger more expensiveolder and noncommercial cars are more likelyto be involved in an accident Driving experi-ence and variables associated with less intenseuse of the car reduce accident rates As could beimagined claim propensity is highly correlatedover time past claims are a strong predictor offuture claims Young drivers are 50 to 70 per-cent more likely to be involved in an accidentwith young men significantly more likely thanyoung women Finally as indicated by the trendin the estimated year dummies the accident ratesignificantly declined over time Part of thisdecline is likely due to the decline in accidentrates in Israel in general27 This decline mightalso be partly due to the better selection ofindividuals the company obtained over time asit gained more experience (Cohen 2003)

Columns 2 and 3 of Table 3 present estimatesfrom probit regressions in which the dependentvariable is equal to one if the policyholder chosea low deductible and is equal to zero otherwiseColumn 3 shows the marginal effects of thecovariates on the propensity to choose a lowdeductible These marginal effects do not have astructural interpretation as the choice of lowdeductible depends on its price on risk aver-sion and on risk In this regression we againobserve a strong trend over time Fewer policy-holders chose the low deductible as time wentby One reason for this trend according to thecompany executives is that over time the com-panyrsquos sales persons were directed to focusmainly on the ldquodefaultrdquo regular deductible op-tion28 The effect of other covariates will bediscussed later in the context of the full modelIn unreported probit regressions we also test thequalitative assumptions of the structural model byadding three additional regressors the price ratio(pidi) the average deductible offered d i

24 The unconditional is computed by maximum likeli-hood using the data on claims and observed durations of thepolicies

25 Using the CARA specification as in equation (16) weobtain a slightly lower value of 25 104 NIS1

26 We find positive correlation in the errors of these tworegressions suggesting the existence of adverse selection inthe data and motivating a model with unobserved heteroge-neity in risk This test is similar to the bivariate probit testproposed by Chiappori and Salanie (2000) and replicatesearlier results reported in Cohen (2005)

27 In particular traffic fatalities and counts of trafficaccidents in Israel fell by 11 percent and 18 percent during1998 and 1999 respectively

28 Such biased marketing efforts will bias consumersagainst choosing the low deductible thus making them lookless risk averse This would make our estimate a lowerbound on the true level of risk aversion If only sophisti-cated consumers could see beyond the marketing effort andthis sophistication were related to observables (eg educa-tion) the coefficients on such observables would be biasedupward This is not a major concern given that the coefficientson most of the covariates are fairly stable when we estimatethe benchmark model separately for different years

759VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

TABLE 3mdashNO HETEROGENEITY IN RISK

Variable

Poisson regressiona Probit regressionsc

Dep var Number of claims Dep var 1 if low deductible

(1) (2) (3)

Coeff Std Err IRRb Coeff Std Err dPdX dPdXDemographics Constant 43535 02744 mdash 224994 23562 mdash

Age 00074 00052 09926 01690 00464 00034 00042Age2 989 105 536 105 10001 00017 00005 339 105 455 105Female 00456 00161 09555 08003 01316 00165 00134Family Single omitted omitted omitted

Married 01115 00217 08945 06417 01950 00128 00045Divorced 00717 00346 10743 03191 03127 00064 00044Widower 00155 00527 10156 03441 04573 00069 00025Other (NA) 01347 01755 11442 29518 19836 00511 00370

Education Elementary 00522 00550 09491 02927 04278 00061 00016High school omitted omitted omittedTechnical 00373 00308 10380 05037 02520 00105 00138College 00745 00197 09282 05279 01546 00109 00053Other (NA) 00116 00184 10116 00630 01540 00013 00025

Emigrant 00210 00163 10213 00497 01328 00010 00003

Car attributes Log(Value) 01227 00281 11306 13285 02342 00271 00299Car age 00187 00042 10189 01603 00357 00033 00017Commercial car 01394 00326 08699 09038 02781 00177 00294Log(engine size) 02972 00459 13461 06924 03952 00141 00075

Driving License years 00204 00034 09798 01043 00312 00021 00005License years2 00002 677 105 10002 00015 00005 299 105 167 105

Good driver 00176 00191 09825 07207 01574 00148 00152Any driver 00564 00169 10580 10923 01321 00217 00258Secondary car 00859 00209 09177 00038 01626 00001 00070Business use 01852 00293 12034 07951 02737 00156 00017History length 00527 00110 09486 11218 01450 00228 00171Claims history 06577 00390 19304 04654 05460 00095 00496

Young driver Young driver 05235 00361 16879 28847 06305 00524 00012Gender Male omitted omitted omitted

Female 01475 00288 08629 17959 03135 00396 00195Age 17ndash19 omitted omitted omitted

19ndash21 00701 00532 10726 07800 07509 00153 0014721ndash24 00267 00574 09737 05746 07773 00114 0015624 02082 00567 0812 17869 07328 00397 00136

Experience 1 omitted omitted omitted1ndash3 02416 00458 07854 10175 06577 00217 000093 02827 00532 07538 32513 07386 00762 00410

Company year First year omitted omitted omittedSecond year 00888 00198 09150 44513 01506 00792 00859Third year 00690 00222 09334 85888 01820 01303 01367Fourth year 01817 00232 08339 118277 02102 01616 01734Fifth year 05431 0028 05810 143206 02778 01920 02085

105586Obs 105800 94000 105800Pseudo R2 00162 01364 01296Log likelihood 577459 369595 430869

Significant at the 5 percent confidence levela Maximum likelihood estimates Variation in exposure (policy duration) is accounted forb IRR Incidence rate ratio Each figure should be interpreted as the increasedecrease in claim probability as a result of an increase of one unit in the

right-hand-side variablec There are two separate probit regressions reported in this table Column 2 relies on the deductible choice model and the lognormality assumption As discussed

in the text by including an additional regressor ln([pi((xi)di) 1]d i) (with (xi) predicted from column 1 above) and normalizing its coefficient to 1 we obtaina structural interpretation of this regression with ln (ri) as the dependent variable Thus the reported coefficients are comparable to those estimated for the benchmarkmodel One should be cautious however in interpreting these coefficients Unlike the benchmark model this regression does not allow unobserved heterogeneity inrisk and suffers from some selection bias because observations with high predicted risk rate are omitted (which is why the number of observations is 94000 ratherthan the full sample of 105800) For comparison column 3 reports the marginal effects from a comparable probit regression that uses the full sample and does notcontrol for pricing and predicted risk through the additional structural regressor Column 3 does not have a structural interpretation and its (unreported) coefficientscannot be compared to those estimated from the benchmark model

760 THE AMERICAN ECONOMIC REVIEW JUNE 2007

and the risk rate (xi) as predicted from thePoisson regression of column 1 All three addi-tional regressors enter with the predicted signand with large and highly significant marginaleffects29

Finally column 2 of Table 3 presents animportant specification of the probit regres-sion in which ln([pi(( xi)di) 1]d i) isadded as a regressor and its coefficient isnormalized to 130 As already mentioned ifwe assume that there is no unobserved heter-ogeneity in risk then column 2 is analogousto the ln(ri) equation of the benchmark modelThis restriction of the model is rejected by thedata About 10 percent of the individuals arepredicted to have (xi) (pi di) implying achoice of low deductible for any level of riskaversion Many of these individuals howeverstill choose higher deductible levels Column2 reports the results for the remaining indi-viduals (94000 out of 105800) for which theregression can be run While the signs of theestimated coefficients are similar to those inthe benchmark model presented below therestricted version of the model suggests muchhigher levels and dispersion of risk aversionwell above any reasonable level31 The fullestimation of the benchmark model clearly re-jects this restriction on the model

B Estimation of the Benchmark Model

The Effect of Individual Characteristics onRisk AversionmdashTable 4 presents the estimationresults from the benchmark model The secondcolumn shows how the level of absolute riskaversion is related to individual characteristicsAs the dependent variable is in natural loga-

rithm coefficients on dummy variables can bedirectly interpreted as approximate percentagechanges The vast majority of these coefficientsare quite stable across a wide range of specifi-cations which are mentioned later

The results indicate that women are more riskaverse than men and have a coefficient of ab-solute risk aversion about 20 percent greaterthan that of men These results are consistentwith those of Donkers et al (2001) and Hartoget al (2002) The estimated effect of age sug-gests a nonmonotone pattern of risk preferencesover the life cycle The estimated coefficientsimply that initially (that is at age 18 the youngestindividual in the data) individuals become less riskaverse with age but around the age of 48 indi-viduals start becoming more risk averse32 Mar-ried individuals are estimated to be significantlymore risk averse compared to singles while di-vorced individuals are less (although the coeffi-cient is statistically insignificant)

The analysis suggests that variables that arelikely to be correlated with income or wealthsuch as post-high-school education and thevalue of the car have a positive coefficientindicating that wealthier people have higherlevels of absolute risk aversion Although we donot have data on individualsrsquo income or wealthwe use other proxies for income in additionalspecifications and obtain mixed results Whenwe include as a covariate the average householdincome among those who live in the same zipcode we obtain a significant and negative co-efficient of 0333 (0154) (standard deviationin parentheses) When we match zip code in-come on demographics of the individuals thecoefficient is effectively zero 0036 (0047)and when we use a socioeconomic index ofthe locality in which the individual lives thecoefficient is positive 0127 (0053)33 Thus

29 The estimated marginal effect (z-statistic in parenthe-ses) is 0352 (1376) 16 104 (1481) and 0154(255) for (pidi) d i and (xi) respectively

30 The level is a normalization The sign is estimatedHad the sign on this regressor been positive this would haveimplied a rejection of the model

31 For the implied median level of risk aversion the re-stricted model produces a similar estimate to the estimate wereport below for the benchmark model However since someindividuals who chose the low deductible are estimated to havevery low claim rates the restricted model is ldquoforcedrdquo to estimatevery high risk aversion for these individuals (in contrast thebenchmark model can explain such choices by positive risk re-siduals) resulting in very high dispersion and (due to the lognor-mality assumption) very high average risk aversion which isabout 1025 higher than the benchmark estimates we report below

32 This nonmonotone pattern may explain why age enterswith different signs in the estimation results of Donkers et al(2001) and Hartog et al (2002) A somewhat similar U-shapepattern with respect to age is also reported by Sumit Agarwalet al (2006) in the context of consumer credit markets

33 The full results from these specifications are providedin the online Appendix (available at httpwwwe-aerorgdatajune0720050644_apppdf ) The reason we do not usethese income variables in the benchmark specification istheir imperfect coverage which would require us to omitalmost 20 percent of the individuals Other results fromthese regressions are similar to those we report for thebenchmark specification

761VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

overall we interpret our findings as suggestiveof a nonnegative and perhaps positive associ-ation between incomewealth and absolute riskaversion

At first glance these results may appear tobe inconsistent with the widely held beliefthat absolute risk aversion declines withwealth One should distinguish however be-tween two questions (a) whether for a givenindividual the vNM utility function exhibitsdecreasing absolute risk aversion and (b)how risk preferences vary across individuals

Our results do not speak to the first questionand should not be thought of as a test of thedecreasing absolute risk aversion propertyTesting this property would require observingthe same individual making multiple choicesat different wealth levels Rather our resultsindicate that individuals with greater wealthhave utility functions that involve a greaterdegree of risk aversion It might be thatwealth is endogenous and that risk aversion(or unobserved individual characteristics thatare correlated with it) leads individuals to

TABLE 4mdashTHE BENCHMARK MODEL

Variable Ln() equation Ln(r) equation Additional quantities

Demographics Constant 15406 (00073) 118118 (01032) Var-covar matrix ()

Age 00001 (00026) 00623 (00213) 01498 (00097)Age2 624 106 (263 105) 644 104 (211 104) r 31515 (00773)Female 00006 (00086) 02049 (00643) 08391 (00265)Family Single Omitted Omitted

Married 00198 (00115) 01927 (00974) Unconditional statisticsa

Divorced 00396 (00155) 01754 (01495) Mean 02196 (00013)Widower 00135 (00281) 01320 (02288) Median 02174 (00017)Other (NA) 00557 (00968) 04599 (07397) Std Dev 00483 (00019)

Education Elementary 00194 (00333) 01283 (02156) Mean r 00019 (00002)High school Omitted Omitted Median r 727 106 (756 107)Technical 00017 (00189) 02306 (01341) Std Dev r 00197 (00015)College 00277 (00124) 02177 (00840) Corr(r ) 02067 (00085)Other (NA) 00029 (00107) 00128 (00819)

Emigrant 00030 (00090) 00001 (00651) Obs 105800

Car attributes Log(value) 00794 (00177) 07244 (01272)Car age 00053 (00023) 00411 (00176)Commercial car 00719 (00187) 00313 (01239)Log(engine size) 01299 (00235) 03195 (01847)

Driving License years 00015 (00017) 00157 (00137)License years2 183 105 (351 105) 148 104 (254 104)Good driver 00635 (00112) 00317 (00822)Any driver 00360 (00105) 03000 (00722)Secondary car 00415 (00141) 01209 (00875)Business use 00614 (00134) 03790 (01124)History length 00012 (00052) 03092 (00518)Claims history 01295 (00154) 00459 (01670)

Young driver Young driver 00525 (00253) 02499 (02290)Gender Male Omitted mdash

Female 00355 (00061) mdashAge 17ndash19 Omitted mdash

19ndash21 00387 (00121) mdash21ndash24 00445 (00124) mdash24 00114 (00119) mdash

Experience 1 Omitted mdash1ndash3 00059 (00104) mdash3 00762 (00121) mdash

Company year First year Omitted OmittedSecond year 00771 (00122) 14334 (00853)Third year 00857 (00137) 28459 (01191)Fourth year 01515 (00160) 38089 (01343)Fifth year 04062 (00249) 39525 (01368)

Note Standard deviations based on the draws from the posterior distribution in parentheses Significant at the 5 percent confidence levela Unconditional statistics represent implied quantities for the sample population as a whole ie integrating over the distribution of covariates in the sample (as

well as over the unobserved components)

762 THE AMERICAN ECONOMIC REVIEW JUNE 2007

save more to obtain more education or totake other actions that lead to greaterwealth34

Let us make several additional observationsFirst while owners of more expensive cars ap-pear to have both higher risk exposure andhigher levels of risk aversion owners of biggercars have higher risk exposure but lower levelsof risk aversion This should indicate that thestructure of the model does not constrain therelationship between the coefficients in the twoequations Rather it is the data that ldquospeak uprdquoSecond individuals who are classified by theinsurer as ldquogood driversrdquo indeed have lowerrisk but also appear to have lower risk aversionThis result is somewhat similar to the positivecorrelation between unobserved risk and unob-served risk aversion which we report belowThird policyholders who tend to use the car forbusiness are less risk averse This could bebecause uninsured costs of accidents occurringto such policyholders are tax deductible Fourthpolicyholders who reported three full years ofclaims history are more risk averse but are notdifferent in their risk exposure The attitude thatleads such policyholders to comply with therequest to (voluntarily) report three full years ofclaims history is apparently and not surpris-ingly correlated with higher levels of risk aver-sion In contrast while past claims indicate highrisk they have no significant relationship withrisk aversion Finally we find a strong trendtoward lower levels of risk aversion over timeThis is a replication of the probit results dis-cussed earlier

The Effect of Individual Characteristics onClaim RiskmdashThe first column of Table 4 de-scribes the relationship between observablesand risk exposure Accident risk is higher fordivorced individuals and lower for people witha college education Bigger older more expen-sive and noncommercial cars are all morelikely to be involved in an accident Drivingexperience reduces accident rates as do mea-sures of less intense use of the car while youngdrivers are more exposed to risk Claim propen-sity is highly correlated over time the voluntaryreport of past claims is a strong predictor offuture claims This risk equation produces re-sults that are similar to those of the simplerPoisson regression reported in Table 3 Al-though some of the coefficients lose signifi-cance the magnitude of most coefficients isquite similar The similarity between these twosets of results is to be expected as the riskregression is identified primarily from the dataon claims so incorporating the information ondeductible choice does not qualitatively changethe conceptual identification strategy (see Sec-tion IIC) If the results were not similar thiscould have indicated a misspecification of themodel The slight differences between the riskregressions in Table 3 and Table 4 are drivenmainly by the structural assumptions First thebenchmark model estimates a lognormal mix-ture of Poisson rates rather than a single Pois-son model By incorporating the fatter tails ofthe claim distribution it slightly changes theresults increases the standard errors and de-creases the average predicted claim rate Sec-ond the information on deductible choiceslightly helps us in obtaining more precise es-timates through the correlation structure be-tween the error terms in the two equations

The Implied Level of Risk AversionmdashOne ofthe main goals of the estimation is to obtainmeasures of the level of risk aversion in thepopulation we study Since we use Gibbs sam-pling and augment the latent coefficients ofabsolute risk aversion we can directly obtainthe posterior distribution of various moments ofthis distribution At each iteration of the Gibbssampler we compute the mean standard devi-ation (across individuals) and various percen-tiles of the simulated draws of i and ri and thecorrelation between them The far-right columnof Table 4 reports the averages and standard

34 One may be tempted to interpret the positive wealtheffects as an indirect indication of credit constraints wealth-ier individuals are less credit constrained and therefore canafford to purchase more insurance We do not share thisinterpretation for two reasons First the insurance companyobserves these proxies for wealth and conditions on themwhen setting prices Since the willingness to pay for insur-ance is likely to be correlated with the willingness to pay forthe additional insurance provided by the low deductibleoption premiums already reflect this variation We condi-tion on the level of the premium Second paying less exante implies paying more ex post so applying the creditconstraint argument only to the ex ante payment but not tothe probabilistic ex post deductible payments has no theo-retical foundation Essentially the setup of the model linksthe ex ante decisions to the ex post losses which are bothdriven by the curvature of the vNM utility function

763VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

deviations of these computed quantities over theiterations of the Gibbs sampler35 The impliedrisk aversion of the mean individual is 00019which is about seven times greater than theback-of-the-envelope calculation presented inthe previous section As we assume a lognormal

distribution and estimate a relatively high dis-persion coefficient r the estimates also implysignificantly lower values for the median levelof risk aversion As shown later the qualitativepattern of these results is quite robust acrossspecifications

Table 5 summarizes our findings regardingthe level and dispersion of risk aversion Itpresents two ways to interpret the estimates weobtained from the benchmark model as well ascomparisons to specifications with constant ab-solute risk aversion (CARA) utility and with

35 Note that these estimated quantities are unconditionalIn computing these quantities we integrate over the distri-bution of observable characteristics in the data so onecannot compute these estimates from the estimated param-eter directly

TABLE 5mdashRISK-AVERSION ESTIMATES

Specificationa Absolute risk aversionb Interpretationc Relative risk aversiond

Back-of-the-envelope 10 103 9070 1484Benchmark model

Mean individual 67 103 5605 972225th percentile 23 106 9998 003Median individual 26 105 9974 03775th percentile 29 104 9714 42790th percentile 27 103 7834 390295th percentile 99 103 4937 14327

CARA utilityMean individual 31 103 7651 4436Median individual 34 105 9966 050

Learning modelMean individual 42 103 6886 6140Median individual 56 106 9995 008

Comparable estimatesGertner (1993) 31 104 9699 479Metrick (1995) 66 105 9934 102Holt and Laury (2002)e 32 102 2096 86575Sydnor (2006) 20 103 8329 5395

a The table summarizes the results with respect to the level of risk aversion ldquoBack-of-the-enveloperdquo refers to thecalculation we report in the beginning of Section III ldquobenchmark modelrdquo refers to the results from the benchmark model(Table 4) ldquoCARA utilityrdquo refers to a specification of a CARA utility function and ldquolearning modelrdquo refers to a specificationin which individuals do not know their risk types perfectly (see Section IIID) The last four rows are the closest comparableresults available in the literature

b The second column presents the point estimates for the coefficient of absolute risk aversion converted to $US1 unitsFor the comparable estimates this is given by their estimate of a representative CARA utility maximizer For all otherspecifications this is given by computing the unconditional mean and median in the population using the figures we reportin the mean and median columns of risk aversion in Table 6 (multiplied by 352 the average exchange rate during the periodto convert to US dollars)

c To interpret the absolute risk aversion estimates (ARA) we translate them into x u(w) 1frasl2 u(w 100) 1frasl2 u(w x) That is we report x such that an individual with the estimated ARA is indifferent about participating in a 50ndash50lottery of gaining 100 US dollars and losing x US dollars Note that since our estimate is of absolute risk aversion the quantityx is independent of w To be consistent with the specification we use a quadratic utility function for the back-of-the-envelopebenchmark and learning models and use a CARA utility function for the others

d The last column attempts to translate the ARA estimates into relative risk aversion We follow the literature and do soby multiplying the ARA estimate by average annual income We use the average annual income (after tax) in Israel in 1995(51168 NIS from the Israeli Census) for all our specifications and we use average disposable income in the United Statesin 1987 ($15437) for Gertner (1993) and Metrick (1995) For Holt and Laury (2002) and Sydnor (2006) we use a similarfigure for 2002 ($26974)

e Holt and Laury (2002) do not report a comparable estimate The estimate we provide above is based on estimating aCARA utility model for the 18 subjects in their experiment who participated in the ldquo90rdquo treatment which involved stakescomparable to our setting For these subjects we assume a CARA utility function and a lognormal distribution of theircoefficient of absolute risk aversion The table reports the point estimate of the mean from this distribution

764 THE AMERICAN ECONOMIC REVIEW JUNE 2007

incomplete-information (discussed later) and toother comparable figures in the literature Ourbenchmark estimate suggests that an averagequadratic utility maximizer36 will be indifferentabout participating in a 50ndash50 lottery in whichhe gains $100 or loses $5605 A CARA spec-ification suggests a significantly lower value foraverage risk aversion making the mean individ-ual indifferent about a 50ndash50 lottery of gaining$100 or losing $7651 The results from theincomplete information model for the meanlevel of risk aversion are in between these esti-mates All the results suggest that although themean individual exhibits a significant level ofrisk aversion heterogeneity in risk preferencesis important and the median individual is al-most risk neutral with respect to lotteries of$100 magnitude Although the mean is alwaysgreater than the median under the lognormaldistribution the large difference we find is notimposed In principle we could have obtained ahigh level of risk aversion with less heteroge-neity thereby leading to a smaller differencebetween the mean and the median (the esti-mated distribution of risk types is an example)

Let us briefly discuss the relevance of thecomparison to Gertner (1993) and Metrick(1995) There are two ways one can reconcilethe differences between the estimates Firstboth these papers measure risk aversion fortelevision game show participants these arehighly selected groups in a rather ldquorisk-friendlyrdquo environment37 Second the magni-tudes of the stakes are higher The showparticipants bet on several thousand dollars andmore while our average individual risks muchlower stakes in the range of $100 Thus thedifference in the results may be due to the issuesraised in Rabin (2000) regarding the compara-bility of behavior across different contexts andbet sizes We discuss this further in Section IV

A different way to quantify our estimate is byreporting them in relative terms There is noconsensus in the literature as to the relevantwealth that is taken into account in such deci-

sions Therefore for comparability we followthe closest papers in this respect (eg Gertner1993) and use annual income as the relevantwealth We multiply the estimated coefficient ofabsolute risk aversion by the average annualincome in Israel during the observation periodUnder the (questionable) assumption that an-nual income is a good proxy for the relevantwealth at the time of decision making thisproduct would be a proxy for the coefficient ofrelative risk aversion As Table 5 indicates ourbenchmark specification results in an impliedcoefficient of relative risk aversion of about 97A CARA specification results in a lower coef-ficient of 44 On the other hand the medianestimate for relative risk aversion is well belowone Thus the widely used estimate of a lowsingle-digit coefficient of relative risk aversionfalls somewhere between the median and themean and between the median and the seventy-fifth percentile of the risk aversion distribution

The Relationship between Unobserved Riskand Unobserved Risk AversionmdashTable 4 al-lows us to make observations about the relation-ship between risk and risk aversion We firstdiscuss the relative importance of unobservedheterogeneity of both dimensions In the popu-lation we study unobserved heterogeneity inrisk aversion (r) is much greater than unob-served heterogeneity in risk () This is trueboth in absolute terms (315 compared to 015respectively) and after normalizing by the cor-responding mean level38 using the coefficientof variation as a measure of dispersion (027compared to 01 respectively) It is also true forthe overall unconditional dispersion This couldindicate that selection on risk aversion is moreimportant in our data than adverse selection39 Theright metric to use for such statements is not

36 For such an individual the second-order Taylor ex-pansion we use in Section IIA is exact

37 We suspect that individuals who participate in televi-sion game shows are more adventuresome than the generalpopulation Moreover knowing that the audience mightwish to see them keep betting is likely to further encourageparticipants to take risks

38 All covariates are measured in deviations from theirsample mean so the estimated constant in each equation isthe estimated mean of the left-hand-side variable

39 An additional observation is that given our estimatesobservables account for slightly less than 50 percent of thevariation in ln(ri) but for almost 65 percent of the variation inln(i ) This may seem surprising given the finding that disper-sion in risk aversion is more important and thus should be thefocus of insurance companies However this finding is con-sistent with the conventional wisdom that insurance companiesspend much effort and resources on collecting information thathelps in risk classification but only little effort on informationthat predicts willingness to pay

765VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

entirely clear however as one should projectthese estimated variances onto the same scaleof say willingness to pay or profits Profits forexample are affected directly by risk but not byrisk aversion so the comparison above could bemisleading Therefore we relegate the discus-sion of this issue to Section IIIF where weshow that even when we look at pricing andprofits heterogeneity in risk aversion is moreimportant

Table 4 also indicates a strong and signif-icant positive correlation of 084 between un-observed risk aversion and unobserved riskThis result may be surprising because it isnatural to think that risk aversion with respectto financial decisions is likely to be associatedwith a greater tendency to take precautionsand therefore with lower risk Indeed a recentpaper by Finkelstein and McGarry (2006)supports such intuition by documenting anegative correlation between risk aversionand risk in the market for long-term careinsurance (see also Israel 2005 for the autoinsurance market in Illinois) Our marketmight however be special in ways that couldproduce a positive correlation First in con-trast to most insurance markets where apolicyholderrsquos risk depends on the policy-holderrsquos precautions but not on the precau-tions of others accident risk in the autoinsurance market is a result of an interactionbetween onersquos driving habits and those ofother drivers Second the correlation coeffi-cient may be highly sensitive to the particularway we measure risk and risk aversion Thereare many unobserved omitted factors that arelikely to be related to both dimensions Theintensity of vehicle use for example mightbe an important determinant of risk If indi-viduals who are more risk averse also drivemore miles per year a positive correlationbetween risk and risk aversion could emergeThus our results caution against assumingthat risk and risk aversion are always nega-tively correlated Whether this is the case maydepend on the characteristics of the particularmarket one studies and on the particular mea-sure for risk Indeed one can use estimatedannual mileage to control for one omittedvariable that may potentially work to producea positive correlation between risk aversionand risk Despite its partial coverage in thedata and being considered (by the company)

as unreliable40 controlling for annual mileagereported by policyholders reduces the esti-mated correlation coefficient to 06841 Weview this result as consistent with the possi-bility that underlying unobserved factors thataffect risk play an important role in generat-ing the estimated positive correlation betweenrisk and risk aversion A third explanation forthe positive estimated correlation is the dis-tributional assumption As discussed in Sec-tion IIC the correlation coefficient isprobably the parameter that is most sensitiveto these assumptions Indeed as discussedlater when we change our assumptions aboutthe Poisson process and use an alternativedistribution with extremely thin tails the es-timated correlation coefficient reverses signsWe of course view the assumptions of thebenchmark model as more appropriate andtherefore maintain the view that the data sug-gest a positive correlation Finally note thatwhile the correlation parameter we estimate ishigh the implied unconditional correlationbetween risk and risk aversion is less than025 across all reported specifications This isbecause the coefficients on the same covariate(for example the size of the car or whetherthe car is used for business) often affect riskand risk aversion in opposite directions andbecause of the logarithmic transformation

C Stability of the Risk Aversion Coefficients

Estimating risk preferences is motivated bythe idea that the same (or similar) risk aversionparameter may explain risky decisions acrossmultiple contexts This idea is at the heart of thecurrent debate we mention in the introductionregarding the empirical relevance of expectedutility and whether the standard construct of a

40 Insurance companies typically do not use these self-reported mileage estimates as they are considered unreli-able While companies could verify these estimates at thetime of a claim such reports are hard to enforce Anindividual can always claim that her ex ante estimate waslower than it turned out to be Indeed the estimated elas-ticity of risk with respect to estimated mileage is 006(0012) which seems low suggesting a bias downwardpotentially due to ldquoerrors in variablesrdquo bias

41 The full results from this specification are providedin the online Appendix Other results from this specifi-cation are similar to those we report for the benchmarkspecification

766 THE AMERICAN ECONOMIC REVIEW JUNE 2007

single risk aversion parameter that appliesacross many types of decisions is the right wayto think about consumer behavior To the bestof our knowledge there is no direct empiricalevidence on this issue While we defer much ofthis discussion to the concluding section weverify below that at least over a limited set ofchoices in which we observe the same individ-uals the estimated risk preferences help to ex-plain choices over time and across differentcontexts

Stability across ContextsmdashIdeally onewould like to show that the estimated risk pref-erences are stable across multiple lines of insur-ance and perhaps even across other riskydecisions individuals face Absent data on suchchoices we provide evidence that the estimatedrisk aversion coefficients help in predictingother related risky choices Individuals in ourdata had to make three additional coveragechoices in addition to the main policy choicethat we have analyzed so far They had tochoose whether to purchase coverage for caraudio equipment (bought by 498 percent of thesample of new policyholders) for towing ser-vices (bought by 929 percent) and for wind-shield damage (bought by 954 percent) Thesetypes of coverage are sold by the same companybut are administered by a third party so we donot have information about their realizationTherefore we cannot perform a similar exerciseto the one we perform for the deductible choiceOf course one should not expect these addi-tional coverage choices to be perfectly corre-lated with the deductible choice Even if thesame risk preferences are an important factorfor all of these decisions variation in risk acrosscoverage is also important For example own-ership of an expensive car audio system is likelyto affect the purchase of audio coverage inde-pendent of risk preferences Similarly ownersof old cars may value more coverage for towingservices Risk preferences however should en-ter into all of these coverage choices and there-fore we expect these additional types ofcoverages to be positively correlated with thechoice of a low deductible

We coded these three additional coveragechoices as three dummy variables and use themas additional covariates to verify that they canhelp explain unobserved risk aversion Firstwe add these variables to the probit regression

reported in column 3 of Table 3 They all havea positive statistically and economicallysignificant power in predicting the deductiblechoice42 To verify that these reported correla-tions are not driven by correlation in risks wealso estimate the benchmark model with theseadditional variables as covariates The esti-mated coefficients on the three additional cov-erage choices are positive and two of the threeare significant in both the risk and risk aversionequations they are 0053 (0019) 00002 (0021)and 0057 (0023) in the risk equation and0354 (0064) 0123 (0122) and 0813 (0174)in the risk aversion equation for audio towingand windshield coverage respectively (standarddeviations in parentheses)43 This suggests thatthe estimated risk preferences help in explainingmultiple coverage choices across these relatedcontexts

Stability over TimemdashWe now provide evi-dence that the estimated risk preferences arealso stable over time All the individuals whodecide to renew their policy with the companyafter it expires are at least in principle free tochange their deductible choice However anoverwhelming majority of individuals (morethan 97 percent) do not change their deductiblechoices when they renew Even individuals whoinitially chose the rarely chosen ldquohighrdquo andldquovery highrdquo deductible levels typically do notchange their choices Therefore it is also notsurprising that estimating the model on the firstdeductible choice of individuals and the seconddeductible choice of the same individuals yieldssimilar results as shown at the end of the nextsection Of course changes in deductiblechoices do not necessarily imply changes inrisk preferences Risk may also change overtime and may drive such changes At thesame time while persistence in deductiblechoices over time is consistent with stablerisk preferences it may also be driven bymany other factors For example it is reason-able to think that individuals do not devote

42 They are estimated to increase on average the prob-ability of choosing a low deductible by 0019 (819) 0023(527) and 0041 (758) for audio towing and windshieldcoverage respectively (z stat in parentheses)

43 The full results from this specification are provided in theonline Appendix Other results from this specification aresimilar to those we report for the benchmark specification

767VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

the same amount of thought to their renewaldecision as to their initial choice leading tothe observed persistence This caveat is themain reason why we focus most of the analysisonly on new policies

D Robustness

This section discusses the robustness of themain results Results from all specifications wemention in this section are summarized in Table6 The full results from these specifications arerelegated to the online Appendix

The von NeumannndashMorgenstern Utility Func-tionmdashTo derive the benchmark model we as-sume a negligible third derivative of the vNMutility function This is attractive as it allowsus to summarize risk preferences by a one-dimensional parameter r [u13(w)u(w)]There is however a large literature (eg MilesS Kimball and N Gregory Mankiw 1989) em-phasizing the importance of a (positive) third

derivative of the utility function which leads toprecautionary saving It is therefore importantto address the sensitivity of the results to thisassumption

By allowing a third-order Taylor approxima-tion of equation (4) we obtain

(15) r u13w

uw

p

d 1

d

uw

uw

dh2 dh dl dl

2

6d

which reduces to equation (7) when u(w) 0This illustrates that a positive third derivativeprovides an additional (precautionary) incentiveto insure It also shows that in order to fullydescribe preferences in the presence of a thirdderivative a one-dimensional parameter doesnot suffice one needs to know both r [u13(w)u(w)] and [u(w)u(w)] This makesit less attractive for estimation

TABLE 6mdashROBUSTNESS

Specificationa Sample Obs

Claim risk () Absolute risk aversion (r)

Corr(r ) Mean Median Std dev Mean Median Std dev

Baseline estimatesBenchmark model All new customers 105800 0220 0217 0048 19 103 73 106 0020 0207 0839

The vNM utility functionCARA utility All new customers 105800 0219 0217 0048 87 104 b 98 106 b 0009b 0201 0826

The claim generating processBenchmark model No multiple claims 103260 0182 0180 0040 20 103 28 105 0018 0135 0547Thinner-tail risk distribution All new customers 105800 0205c 0171c 0155c 17 103 19 106 0020 0076 0916

The distribution of risk aversionLower-bound procedure All new customers 105800 mdash mdash mdash 37 104 0 0002 mdash mdash

Incomplete information about riskBenchmark model Experienced drivers 82966 0214 0211 0051 21 103 83 106 0021 0200 0761Benchmark model Inexperienced drivers 22834 0230 0220 0073 30 103 12 107 0032 0186 0572Learning model All new customers 105800 0204 0191 0084 12 103 16 106 0016 0200 0772

Sample SelectionBenchmark model First two years 45739 0244 0235 0066 31 103 26 105 0026 0225 0699Benchmark model Last three years 60061 0203 0201 0043 16 103 34 107 0021 0113 0611Benchmark model Referred by a friend 26434 0213 0205 0065 30 103 84 107 0031 0155 0480Benchmark model Referred by advertising 79366 0219 0216 0051 21 103 76 106 0022 0212 0806Benchmark model Non-stayers 48387 0226 0240 0057 23 103 77 107 0026 0149 0848Benchmark model Stayers 1st choice 57413 0190 0182 0057 29 103 29 105 0024 0152 0463Benchmark model Stayers 2nd choice 57413 0208 0200 0065 30 103 16 105 0026 0211 0637

Note The table presents the key figures from various specifications and subsamples tracing the order they are presented in Section IIID Full results (in the format

of Table 4) from all these specifications are available in the online Appendixa ldquoBenchmark modelrdquo refers to the benchmark specification estimated on various subsamples (the first row replicates the estimates from Table 4) The other

specifications are slightly different and are all described in more detail in the corresponding parts of Section IIIDb The interpretation of r in the CARA model takes a slightly different quantitative meaning when applied to noninfinitesimal lotteries (such as the approximately

$100 stakes we analyze) This is due to the positive third derivative of the CARA utility function compared to the benchmark model in which we assume a small

third derivative Thus these numbers are not fully comparable to the corresponding figures in the other specificationsc The interpretation of in the thinner-tail distribution we estimate is slightly different from the standard Poisson rate which is assumed in the other specifications

Thus these numbers are not fully comparable to the corresponding figures in the other specifications

768 THE AMERICAN ECONOMIC REVIEW JUNE 2007

We can allow for a positive third derivativewithout expanding the dimensionality of pref-erences by imposing a parametric functionalform on the utility function Essentially suchparametric form imposes a relationship between[u13(w)u(w)] and [u(w)u(w)] Two standardforms are those that exhibit CARA and thosethat exhibit constant relative risk aversion(CRRA) CRRA requires us to make additionalassumptions about the relevant (and unob-served) wealth level of each individual makingit less attractive The CARA case is imple-mented below

With CARA utility we substitute u(w) exp(rw) in equation (4) and rearrange toobtain

(16) rp

exprdh exprdl

This equation defines the indifference set Unlikethe benchmark model there is no closed-formrepresentation for ri() a property that makesestimation significantly slower Due to the precau-tionary saving incentive arising from the thirdderivative we can use a lower level of absoluterisk aversion to rationalize the low deductiblechoice given In other words the CARA indif-ference set is flatter in comparison to the bench-mark case depicted in Figure 2 Thus the CARAspecification will generally lead to lower estimatesof the coefficient of absolute risk aversion asshown in Table 6 The mean and dispersion ofestimated absolute risk aversion (but not the me-dian) are smaller by about a factor of two Thegeneral qualitative results however remain thesame As shown in Table 5 the mean individual isquite risk averse with respect to lotteries of $100magnitude the median is close to risk neutral withrespect to such lotteries and heterogeneity in riskaversion is important

The Claim-Generating ProcessmdashIn thebenchmark model we assume that claims aregenerated by a Poisson model This assumptionis both analytically attractive (it is important inderiving equation (4)) and in our view a rea-sonable approximation of the data-generatingprocess for claims The Poisson distribution hasbeen widely used in economics to model acci-dent counts (eg Nancy L Rose 1990 Dionneand Vanasse 1992 Ronald Michener and CarlaTighe 1992) In using this distribution re-

searchers have followed the long history of theuse of Poisson by actuaries and insurance spe-cialists going back to Ladislaus Bortkiewicz(1898)

An important restriction of the Poisson dis-tribution is that its mean and variance are thesame Although some economic studies con-firmed this assumption for particular accidentdata (eg Christopher M Auld et al 2001) it isoften the case that this restriction is falsifiedThe most common deviation from the Poissonrestriction is that of fat tails ie variance that ishigher than the mean This led researchers touse a negative binomial distribution to accom-modate this regularity by introducing a secondparameter which delinks the relationship be-tween the mean and the variance One naturalinterpretation of the fat tails is that of unob-served heterogeneity and the negative binomialdistribution can be viewed as a Gamma mixtureof Poisson processes Consistent with this viewwe assume that the claim-generating processfollows a Poisson process at the individuallevel but allows unobserved heterogeneity inrisk and estimates a lognormal mixture of Pois-son processes which is similar to a negativebinomial The dispersion parameter we esti-mate is a free parameter which is identi-fied by the fatness of the tails of the claimdistribution

While one would like to allow fat tails of theaggregate claim distribution one may criticizethe assumption of a Poisson distribution at theindividual level for having tails that may be ldquotoofatrdquo Recently Jaap H Abbring et al (2003) andMark Israel (2004) provided evidence for neg-ative state dependence in data from auto insur-ance claims similar to the ones we useControlling for heterogeneity across individu-als these papers show that a second claim isless likely than a first This may happen due toexperience rating or perhaps to more carefuldriving and less intense use of a car after anaccident The Poisson distribution assumes thatthe second accident is just as likely so negativestate dependence may suggest thinner tails atthe individual level

To verify that the main results are not sensi-tive to this restriction of the Poisson model weperform two tests First we estimate the bench-mark model on a sample that includes onlyindividuals with one or no claims The model isstill identified using variation in policy durations

769VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

The estimates of the level of risk aversion andits dispersion across individuals remain similarto those of the benchmark model (Table 6) Theestimates of risk and the correlation with riskaversion are significantly lower But this isexpected we selected out of the sample thehigh-risk individuals

As a further test for the sensitivity of theresults to possible negative state dependencewe estimated the model with a different distri-bution at the individual level which gives riseto much thinner tails We could not find anyknown distribution of count variables that givesrise to tails thinner than Poissonrsquos so we madeone up In particular for a given Poisson ratelet pn be the probability of observing n occur-rences The distribution we take to the data issuch that

(17) Prn pn2

yenm0 pm2

Thus this distribution makes the probability ofmultiple claims much lower probably muchmore so than any realistic negative state depen-dence in the data Essentially such a distribu-tion makes the econometrician view individualswith multiple claims as high-risk individualswith much more certainty as it requires muchmore ldquobad luckrdquo for low-risk individuals tohave multiple claims The estimates for thelevel of risk aversion and its dispersion arealmost the same as in the benchmark model(Table 6) The rest of the summary figures aredifferent The difference in the level of risk isnot informative the interpretation of isslightly different given the change in the un-derlying distribution The dispersion in ismuch higher This is a direct result of the thin-tail assumption The dispersion of is identifiedby the tails of the claim distribution at theaggregate level With thinner tails imposed onthe individual distribution more heterogeneityis needed to match the observed tails at theaggregate Finally the correlation coefficient inthis specification changes signs and is now neg-ative and close to 1 This is consistent withour discussion in Section IIC which emphasizesthat the identification of the correlation coeffi-cient is closely tied to the structural assump-tions Once these are changed the estimatedcorrelation coefficient would change too Most

importantly however the mean and dispersionof risk aversion are stable

The Distribution of Absolute Risk Aver-sionmdashIn the benchmark model we assume thatthe coefficient of absolute risk aversion is lognor-mally distributed across individuals Since onlyfew previous studies focused on heterogeneity inrisk preferences there is not much existing evi-dence regarding the distribution of risk prefer-ences The only evidence we are aware of is theexperimental results presented by Steffen Andersenet al (2005) which show a skewed distributionwith a fat right tail which is qualitatively consis-tent with the lognormal distribution we assume

An additional advantage of the normality as-sumption is computational It provides a closed-form conditional distribution allowing us to useonly univariate (rather than bivariate) draws inthe estimation procedure significantly reducingthe computational burden One may be con-cerned about the sensitivity of the results to thisdistributional assumption For example it maydrive the result that the median level of riskaversion is much lower than the mean

Incorporating alternative distributional as-sumptions significantly complicates and slowsthe estimation procedure44 As an alternativewe develop a procedure that we believe pro-vides some guidance as to the sensitivity of theresults to this distributional assumption Thedisadvantage of the procedure is that it cannotaccount for adverse selection Since we foundthat adverse selection is not that important thisexercise is informative The exercise conveysthat the main qualitative results are not drivenby the (fat) tails of the lognormal distributionwe impose Rather they are driven by the highfraction of individuals who chose a low deduct-ible despite being of low risk The model im-plies that such individuals must have a fairlyhigh level of risk aversion

The exercise uses a Gibbs sampler to estimatethe lognormal distribution of i given the co-variates and the observed number of claims Ineach iteration of this Gibbs sampler conditional

44 A simple discrete-type distribution of risk aversion isnot well identified with the given data This is due to thenature of the exercise The distribution is identified bythe binary deductible choice and by the ldquolocalrdquo variation inthe pricing menu This variation is not enough to pin downthe discrete types accurately

770 THE AMERICAN ECONOMIC REVIEW JUNE 2007

on the data and the most recent draws for theindividual irsquos we compute lower bounds forthe level and dispersion of risk aversion (inaddition to generating the draws for ri from theconditional lognormal distribution) To deter-mine the lower bound for the level of risk aver-sion we compute

(18) r i i 0 if choicei 0

max0 pi

idi 113d i

if choicei 1

Namely we make ri as low as possible giventhe assumptions of the model We assume thatindividuals who chose a regular deductible arerisk neutral while individuals who chose a lowdeductible are just indifferent between the twodeductible levels (unless i is high enough inwhich case they are also assumed to be riskneutral) We then compute the average over therirsquos To compute the lower bound for the dis-persion we search for values of rirsquos that areconsistent with the assumptions of the modeland the observed data and that minimize thevariance of ri This turns out to be a simplesearch procedure that is linear in the number ofindividuals

The lower bound of the level of risk aversionthat we obtain from this exercise is 368 104

NIS1 (with standard deviation of 638 106)This level is five times lower than the analogouspoint estimate reported in Table 4 This trans-lates into indifference about participating in a50ndash50 lottery of gaining $100 and losing $885Thus it still suggests a significant level of riskaversion for lotteries of $100 magnitude Simi-larly the conversion to relative risk aversion asin Table 5 implies a relative risk aversion co-efficient of 1883 The result for the lowerbound of the dispersion is 189 103 (withstandard deviation of 816 105) which is tentimes lower than the benchmark estimate Thusthe coefficient of variation for absolute riskaversion declines only by two suggesting ahigh degree of heterogeneity Finally oneshould note that these bounds involve extremeassumptions and are computed separately andtherefore cannot be binding at the same timeThus the ldquocorrectrdquo estimates for both quantities

are likely to be higher closer to the results wereport for the benchmark model

Incomplete Information about RiskmdashThroughout we assume that individuals haveperfect information about their individual-specific objective risk type i This is a strongerassumption than we need Because expectedutility is linear in probabilities it suffices thatindividualsrsquo expected risk rate is equal to theirobjective risk rate ie i E(iIi) i wherei is a random variable representing individualirsquos perceived risk rate and Ii is individual irsquosinformation at the time of the coverage choiceNamely individuals could be uncertain abouttheir risk type but their point estimate has to becorrect

There are several channels through whichincomplete information may operate Let usconsider two such cases First suppose thatindividuals are correct but only on average iethat i i i where E(i) 0 The intuitionfor this case is similar to an ldquoerrors in variablesrdquomodel and in principle will result in an evenless important role for adverse selection Giventhat we find a relatively small effect of adverseselection this bias will not change this conclu-sion This may be even more pronounced ifcorr(i i) 0 which reflects a reasonableassumption of ldquoreversion to the meanrdquo ie thatan individualrsquos estimate of his risk type is someweighted average between his true risk type andthe average risk type of individuals who aresimilar (on observables) to him The conclusionmay go in the other way only if the mistakes goin the other direction according to which indi-viduals who are riskier than average believe thatthey are even more risky than they truly areThis we believe is less plausible

As a test for the sensitivity of the results tothe complete information assumption we per-form two exercises The first exercise estimatesthe benchmark model separately for experi-enced drivers and inexperienced drivers wherewe define an individual to be an experienceddriver if he has ten years or more driving expe-rience This is a conservative definition com-pared to those used in the literature (Chiapporiand Salanie 2000 Cohen 2005) The underlyingassumption consistent with evidence providedin Cohen (2005) is that drivers learn about theirown risk types as they gain more driving expe-rience but that most of this learning occurs

771VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

within the first ten years of driving Thus ourcomplete information model is a more appeal-ing description of experienced drivers Theresults for experienced drivers are almost iden-tical to the results from the benchmark specifi-cation (Table 6) suggesting that the mainresults are not driven by the fraction of individ-uals who are less likely to know their risk typesThe results for inexperienced drivers show asimilar pattern and magnitude Consistent withthe learning hypothesis however they show alarger dispersion in risk preferences which maybe due to incomplete information and there-fore to more random choices of deductiblelevels

The second exercise we perform is a morestructural version of the learning story whichallows even experienced drivers to have incom-plete information regarding their risk types Wedo so by estimating a different specification ofthe model that assumes that individuals areBayesian and update their information abouttheir own (stable) risk types over time usingonly information about the number of claimsthey make each year While we do not observecomplete claims histories of individuals wecan simulate such histories and integrate overthese simulations Thus individualsrsquo informa-tion would be related to their true type andwould be more precise with longer driving his-tories The full modeling and estimation detailsof this specification are provided in the Appen-dix We view this model as an extreme versionof incomplete information as there are manyother sources through which individuals maylearn about their own types and thereby havebetter information about their types than whatwe estimate them to have While the resultsimply that the levels of risk aversion and heter-ogeneity are lower than the benchmark esti-mates (Table 6) the order of magnitude andqualitative pattern are quite similar suggestingthat the main qualitative findings are robust tothe information structure

Sample SelectionmdashThere are two differentways to think about the contract selection pro-cess One possibility is that individuals firstselect an insurer based on advertisement wordof mouth or access to an agent Then individ-uals select the insurance contract from amongseveral contracts the selected insurer offersThis selection process is consistent with the way

the company executives view their business45

Another possibility is that individuals first col-lect information about all available insurancecontracts and then choose their most preferredone According to the industryrsquos conventionalwisdom individuals do not shop much acrossinsurers and therefore this selection processseems less important

The unique characteristics of direct insurersmay attract individuals who are more likely toexperiment with new ways of doing businessand may therefore be less risk averse than thegeneral population In Table 7 we compare thedemographics of our sample of policyholderswith those of the general Israeli populationThis comparison reflects a similar intuitioncompared with the general population our av-erage policyholder is slightly younger moreeducated more likely to be male and less likelyto be married or an immigrant This direction ofselection may also apply to unobserved riskpreferences thereby making our policyholderson average less risk averse than a representa-tive individual This suggests that the level ofrisk aversion that we find may be viewed as alower bound on the level of risk aversion in thegeneral Israeli population

One potential way to model sample selectionis to allow for an additional outside option to beselected For the vast majority of individuals weobserve the outside option is to purchase sim-ilar insurance from competing insurance agen-cies Unfortunately data on the structure ofcompeting contracts their prices and the waythey vary with individual characteristics are un-available This makes us uncomfortable to try tomodel sample selection as results from anysuch model will be driven by our assumptionsrather than by meaningful variation in the dataThe results are still meaningful for two reasonsFirst this is a large population accounting forabout 7 percent of all drivers in Israel Secondto the extent that our estimates suggest higherlevels of risk aversion than previously esti-mated and that the sample selection is likely to

45 As discussed in Section I the company was the firstdirect insurance provider and it offered significantly lowerpremiums than those offered by competitors due to signif-icant cost advantage In Section I we also discuss theliterature which emphasizes that choice of direct insurers isdriven primarily by nonmonetary ldquoamenitiesrdquo

772 THE AMERICAN ECONOMIC REVIEW JUNE 2007

bias these estimates downward the results arestill informative

To assess the magnitude of sample selectionin driving the main results we perform severaltests First we estimate the benchmark modelseparately for the first two years of the compa-nyrsquos operation during which the company wasthe only company to sell insurance directly andfor the remaining three years after additionaldirect insurers entered the market Second weestimate the benchmark model separately forindividuals who were referred to the companyby word of mouth and for those who heardabout the company through advertisement (pri-marily on television) The latter may havesearched more and might be more price sensi-tive Third we estimate the benchmark modelseparately for individuals who renewed theirpolicy with the company and for those who didnot renew It seems likely that individuals whodid not renew are less selected as most of themswitch back to regular insurers who insure themajority of the population Fourth we estimatethe model for the second deductible choicemade by those individuals who renew It couldbe argued that switching to other companies is

costly so outside options are not as important indriving deductible choices for those who renewAll these results are summarized in Table 6 andshow some important differences between thetwo groups within each pair of subsamples prob-ably reflecting selection For all groups howeverthe qualitative pattern of the results and the orderof magnitude and dispersion of risk aversion aresimilar to those of the full sample This is sugges-tive that correcting for sample selection is unlikelyto change the qualitative results

E Caveats

Moral HazardmdashThroughout our analysiswe abstract from moral hazard ie we assumethat i can vary across individuals but is invari-ant to the coverage choice There are two typesof moral hazard that may play a role in thiscontext First individuals with less coveragemay take greater precaution and drive morecarefully thereby reducing their claim risk rateSecond conditional on a claim event peoplewith higher deductibles are less likely to file aclaim there exists a range of claims for which

TABLE 7mdashREPRESENTATIVENESS

Variable Sampleb Populationc Car ownersd

Agea 4114 (1237) 4255 (1801) 4511 (1413)Female 0316 0518 0367Family Single 0143 0233 0067

Married 0780 0651 0838Divorced 0057 0043 0043Widower 0020 0074 0052

Education Elementary 0029 0329 0266High school 0433 0384 0334Technical 0100 0131 0165College 0438 0155 0234

Emigrant 0335 0445 0447

Obs 105800 723615 255435

a For the age variable the only continuous variable in the table we provide both the mean and the standard deviation (inparentheses)

b The figures are derived from Table 1 The family and education variables are renormalized so they add up to one weignore those individuals for whom we do not have family status or education level This is particularly relevant for theeducation variable which is not available for about half of the sample it seems likely that unreported education levels arenot random but associated with lower levels of education This may help in explaining at least some of the gap in educationlevels across the columns

c This column is based on a random sample of the Israeli population as of 1995 We use only adult population ieindividuals who are 18 years of age or older

d This column is based on a subsample of the population sample The data provide information about car ownership onlyat the household level not at the individual level Thus we define an individual as a car owner if (a) the household owns atleast one car and the individual is the head of the household or (b) the household owns at least two cars and the individualis the spouse of the head of the household

773VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

filing is profitable only under a low deductible(the literature often refers to this second effectas ldquoex post moral hazardrdquo)

To the extent that moral hazard exists abstract-ing from it will likely bias our estimates of riskaversion downward To see this note that adjust-ing behavior will help individuals to self-insureagainst uninsured costs Similarly ex post moralhazard would reduce the value of a low deduct-ible as in the event of a claim the gain from alower deductible would sometimes be less thanthe difference between the two deductible levelsBoth these effects will make a low deductible lessattractive requiring individuals to be even morerisk averse than we estimate in order to purchasemore coverage Below we discuss why in ourview abstracting from moral hazard is a reason-able approximation in this setting

It seems reasonable to conjecture that ceterisparibus insured individuals will drive less care-fully than uninsured ones It may also seemreasonable that the existence of a deductiblemay make individuals more careful about smalldamages to their car However when all choicesinclude a deductible and deductibles are similarin their magnitude it seems less likely thatdriving or care behavior will be affected46 Fi-nally to separately identify moral hazard onewould need another dimension of the data overwhich risk types remain fixed but coveragechoices vary exogenously (Israel 2004 Jaap HAbbring et al forthcoming)

We rely on the data to justify why we abstractfrom the second potential effect that of ex postmoral hazard Figure 3 presents data on theclaim amounts and shows that about 99 percentof the claims filed by policyholders with low-deductible policies were for amounts greaterthan the higher deductible level If the distribu-tion of amounts of potential claims does notvary with the deductible choice and if individ-uals file a claim for any loss that exceeds their

deductible level this suggests that 99 percent ofthe claims would have been filed under eitherdeductible choice making the assumption toabstract from moral hazard not very restrictive

Individuals however may choose not to filea claim even when the claim amount exceedsthe deductible level This may happen due toexperience rating which increases future insur-ance premiums These dynamic effects do notdepend on the deductible level at the time of theclaim so they simply enter in an additive wayUsing our data on individuals who renew theirpolicies with the company we can assess howbig the dynamic effects are These data showthat the price effect of a claim lasts for threeyears and is highest when an individual files hissecond claim within a year In such a case hewould face about a 20 percent increase in hisinsurance premium in the subsequent year 10percent in the year after and 5 percent in thethird year after the claim The regular premiumis about twice the regular deductible amount soan upper bound for the dynamic costs is 70percent of the regular deductible In most casesthe actual dynamic costs are much lower thanthis upper bound (the dynamic costs of say thefirst claim within a year are close to zero) Inaddition an individual can opt out of the con-tract and switch to a different insurance pro-vider This is likely to reduce his dynamic costsbecause in Israel unlike in the United Statesand many other countries there is no publicrecord of past claims Therefore insurance pro-viders can take full advantage of past recordsonly for their past customers For this reasonnew customers will of course face higher pre-miums than existing ones but the premium in-crease would not be as high as it would havebeen with the old insurance provider This isdue to the presence of ldquoinnocentrdquo new custom-ers who are pooled together with the switchers(Cohen 2003)47 Using 70 percent as a conser-vative upper bound Figure 3 shows that about93 percent of those claims filed by individualswith a low deductible would have also been46 This assumption is consistent with Cohen and Einav

(2003) who find no evidence for behavioral response tochanges in seat belt laws The following anecdotal obser-vation also supports this In an informal survey we con-ducted among our colleagues all of them were aware of adeductible in their auto insurance policy but fewer than 20percent knew its level This does not imply that 80 percentof our colleagues did not pay attention to their deductiblechoice at the time the choice was made It does implyhowever that their driving or care behavior cannot dependon the deductible level

47 New customers may voluntarily report their claimshistory to their new insurance provider Voluntary disclo-sure of past claims is as may be expected not truthful Ourdata suggest an unconditional claim rate of 02453 in oursample population Our data on claims history as voluntar-ily disclosed by the same individuals suggest a claim rate of00604 which is four times lower

774 THE AMERICAN ECONOMIC REVIEW JUNE 2007

filed with a regular deductible While this is notnegligible it applies for only a small fraction ofindividuals For the vast majority of them the99 percent discussed above is a more appropri-ate benchmark Therefore ex post moral hazardis unlikely to play a major role in this settingand one can abstract from the loss distributionand focus on claim rates as we do in thispaper48

Additional Cost of an AccidentmdashOur modelassumes that in the event of an accident theonly incurred costs are those associated with thedeductible payment In practice however other

transaction costs may be incurred such as thetime spent for appraisal of the damage the costsassociated with renting a replacement car for theduration of a repair etc Such costs could bereadily incorporated into the model To illus-trate we assume that these costs are known inadvance and are given by a constant c (whichcould in principle vary with each individual)Since c will not vary with the chosen level ofdeductible it will not affect the value d andwill enter the empirical model only through itseffect on d In particular equation (7) willchange to

(19) r

p

d 1

d c

and everything else will remain the same

48 We would not be as comfortable with this statementfor the choice of high and very high deductibles which areat much higher levels This is one additional reason to focusonly on the choice between low and regular deductibles

FIGURE 3 CLAIM DISTRIBUTIONS

Notes This figure plots kernel densities of the claim amounts estimated separately depending on the deductible choice Forease of comparison we normalize the claim amounts by the level of the regular deductible (ie the normalization is invariantto the deductible choice) and truncate the distribution at ten (the truncated part which includes a fat tail outside of the figureaccounts for about 25 percent of the distribution and is roughly similar for both deductible choices) The thick line presentsthe distribution of the claim amounts for individuals who chose a low deductible while the thin line does the same for thosewho chose a regular deductible Clearly both distributions are truncated from below at the deductible level The figure showsthat the distributions are fairly similar Assuming that the claim amount distribution is the same the area below the thickerline between 06 and 1 is the fraction of claims that would fall between the two deductible levels and therefore (absentdynamic incentives) would be filed only if a low deductible were chosen This area (between the two dotted vertical lines)amounts to 13 percent implying that the potential bias arising from restricting attention to claim rate (and abstracting fromthe claim distribution) is quite limited As we discuss in the text dynamic incentives due to experience rating may increasethe costs of filing a claim shifting the region in which the deductible choice matters to the right an upper bound to these costsis about 70 percent of the regular deductible covering an area (between the two dashed vertical lines) that integrates to morethan 7 percent Note however that these dynamic incentives are a very conservative upper bound they apply to less than 15percent of the individuals and do not account for the exit option which significantly reduces these dynamic costs

775VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

This implies that in principle such costs willhave no effect on the results of the counterfac-tual exercise we present later The costs willhowever affect the interpretation of the esti-mates of risk aversion In particular instead ofthe distribution of r we will now be estimatingthe distribution of r(d c)d so the reportedestimates of the coefficient of absolute riskaversion will be biased upward The magnitudeof the bias depends on the size of this transac-tion cost c compared to the average deductibled If c is relatively small the bias is negligibleIf however c is as big as the (average) deduct-ible level all our reported estimates of the levelof risk aversion should be divided by two (butthe coefficients on observables which are semi-elasticities will not change) The intuitionwould be similar but more involved if c variesacross individuals but not proportionally to d

Data about such transaction costs are ofcourse not available The following back-of-the-envelope exercise may provide some guidanceas to the magnitude of such costs We collecteddata from the Israeli police about the annual num-bers of accidents49 accidents with fatalities andaccidents with severe injuries in Israel for theyears 1996ndash1999 We then divided these numbersby our estimate of the total number of auto insur-ance claims in Israel (161859)50 We obtain that159 percent of the claims involve reported acci-dents 22 percent involve accidents with severeinjuries and 03 percent involve fatal accidentsThus the majority of claims reflect small unre-ported accidents perhaps suggesting that theseadditional costs of a claim are often not too large

Deviations from Expected Utility TheorymdashThroughout the paper we restrict attention toexpected utility maximizers Despite much ev-idence in the literature against some of the pre-dictions of expected utility theory it still seemsto us the most natural benchmark to specify andone that facilitates comparison to previous stud-ies We note that expected utility theory is as-sumed it is not and cannot be tested within our

framework Given our cross-sectional analysiswhich in principle allows flexible forms ofunobserved heterogeneity in risk preferencesthere are no testable restrictions imposed byexpected utility theory We should also note thatmuch (but not all) of the documented evidenceagainst expected utility theory arises with ex-treme risk probabilities which are close to zeroor one Our data (and our estimates) are basedon risk probabilities that are all in the range of010 to 035 Over this range expected utilityseems to perform better Finally it is importantto stress two points First at the conceptuallevel it is straightforward to use an alternativetheory of decisions under uncertainty If con-ditional on objective risk individuals vary in asingle dimension the same conceptual modeland empirical strategy can be applied All oneneeds to do is to specify the parameter overwhich decisions vary and construct an indiffer-ence set in the space of the specified parameterand (objective) risk types similar to the onepresented in Figure 2 Second any alternativemodel of decisions under uncertainty would re-quire us to take an even stronger view regardingthe parameterized objective function For exam-ple prospect theory (Daniel Kahneman andAmos Tversky 1979) would require us to pa-rameterize not only the curvature of individu-alsrsquo utility functions but also their referencepoints for which there is no natural choice inour context Similar issues arise if we try toapply decision weights (Tversky and PeterWakker 1995) or measures of overconfidencewith respect to driving ability

F Implications for Profits and Pricing

We now look at how firm profits vary withalternative pricing schemes This exercise isinteresting for several reasons First althoughwe do not use supply-side information for esti-mation it shows how one may incorporate suchinformation in estimation Second it illustratesthe conceptual trade-off faced by a monopolistthat operates in a market with adverse selectionAlthough the conceptual trade-off betweenhigher demand and worse selection is wellknown and has been extensively analyzed in thetheoretical literature quantifying it is importantto understand its empirical relevance Finallywe argued earlier that unobserved heterogeneityin risk aversion seems more important than un-

49 An accident is counted in this measure if it was re-ported to the police and a police officer arrived at the scene

50 This is estimated by taking the number of activepolicies at the end of our sample (45871) dividing it by ourbest guess for the share of the market the company had atthe time (7 percent) and multiplying it by the estimatedclaim rate in our data as computed in Table 2 (0245)

776 THE AMERICAN ECONOMIC REVIEW JUNE 2007

observed heterogeneity in risk The current ex-ercise shows that this conclusion also translatesto pricing and profits

Throughout this section we hold fixed thedistribution of risk and risk aversion the firmfaces Since we have little information about thedeterminants of overall demand faced by thefirm and hence how restrictive it is to hold thedistribution fixed we make the simplifying as-sumption that individuals make their choicessequentially They first choose the insuranceprovider by observing only the price of theregular deductible Once they decide to buy apolicy from the insurer they choose the deduct-ible level This seems a reasonable approxima-tion as the regular deductible is the one alwaysadvertised and initially quoted while the otheroptions are revealed only once the potentialcustomer and the insurance sales person ldquogetinto detailsrdquo Consistent with this assumptionwe assume that the regular premium and de-ductible are dictated by competitive conditionsand we focus on the choice of the low deduct-ible and its associated premium

From the companyrsquos standpoint each indi-vidual can be represented by a random draw of(i ri) from the conditional (on observables)distribution of risk and risk aversion

(20) ln i

ln ri Nxi

xi

2 r

r r2

When analyzing the optimal menu to offer suchan individual the company is assumed to berisk neutral and to maximize expected profitsSuppose the company offered only the regulardeductible and premium (dh ph) Let the ex-pected profits from this strategy be 0 Givenour assumptions we proceed by analyzing howthe firmrsquos profits are affected by a choice of anadditional low deductible option (dl pl) withdl dh and pl ph It is easy to use a changein variables and analyze the choice of d dh dl and p pl ph Expected profits arenow given by

(21) maxdp

0 Prri rii d pp

d Eiri rii d p

The trade-off in the companyrsquos decision isstraightforward Each new customer who chooses

the low combination pays an additional p up-front but saves d for each accident she is in-volved in This translates into two effects thatenter the companyrsquos decision problem The first issimilar to a standard pricing problem higher(lower) price difference (deductible difference)p (d) leads to a higher markup (on those indi-viduals who select the low deductible) but tolower quantity (or probability of purchase) asfewer individuals elect to choose the low deduct-ible This effect enters the profit function throughD(d p) Pr(ri ri(i d p)) The secondcomposition effect arises because of adverse se-lection As the price of the low deductible in-creases those individuals who still elect to choosethe low combination are ceteris paribus thosewith higher risk This effect enters through (dp) E(iri ri(i d p)) Its magnitude andsign depend on the relative heterogeneity of i andri and on the correlation between them Sinceneither D(d p) nor (d p) has a closed-form solution we analyze this decision problemgraphically where D(d p) and (d p) arenumerically computed using simulations from thejoint distribution

We illustrate our analysis by using the meanindividual in the data whose expected ln(i)and ln(ri) are 154 and 1181 respectively(based on Table 4) Such an individual has tochoose between a regular contract of ( phdh) (3189 1595) (in NIS) and a lowcontract of ( pl dl) (3381 957) ie (pd) (191 638) Below we discuss severaladditional counterfactual cases First we con-sider a case with a negative correlation be-tween risk and risk aversion (with the samemagnitude ie 084) Second we con-sider cases when the company ignores unob-served heterogeneity in one of the dimensionsie it views individuals as a draw from theestimated marginal distribution on one dimen-sion with the other dimension known and fixedat its estimated mean We do this exercise foreach dimension separately

To get intuition for the different effects Figure2 presents the estimated distribution in the spaceof (i ri) A small increase (decrease) in p (d)shifts the indifference setup to the right therebymaking some marginal individuals who were pre-viously just to the right of it switch to choosingthe regular deductible The demand trade-off isjust the comparison between the marginal loss ofthe company from all the marginal individuals

777VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

who no longer buy higher coverage with thehigher profits made from the inframarginal indi-viduals who still elect to choose higher coverageFigure 2 also helps in illustrating the effect ofadverse selection and the importance of the cor-relation coefficient As the menu shifts to the rightthe positive correlation implies that the marginalindividuals have higher risk than the average Thismeans that ldquolosingrdquo them (namely havingthem buy less coverage) is not as costly forthe insurance company as such individualsare on average more adversely selected Anegative correlation for example would havemade these marginal individuals more valuablethereby decreasing the incentive to increase pricesor deductibles from the current levels

Figure 4 presents the effects of pricing onprofits by varying the low-deductible levelkeeping the premium charged for it fixed at theobserved price (of p 191) It implies that the

current low deductible benefit of 638 NIS re-sults in additional annual profits of about 368NIS per customer This is about 037 percent oftotal operating profits per customer which isabout 1000 NIS Note however that after sub-tracting the administrative and claim-handlingcosts associated with each customer and claimthe relative magnitude of this effect will behigher Note also that the estimates imply thatthe current low-deductible level is suboptimalBy setting a smaller low-deductible benefit ofd 355 NIS (ie increasing the current lowdeductible by 283 NIS) additional profits canbe increased to 659 NIS51 Of course the find-

51 There is no reason of course to limit the choice of thecompany to only one additional deductible level Moredegrees of freedom in choosing the pricing menu will leadto higher profits

FIGURE 4 COUNTERFACTUALSmdashPROFITS

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) We plot the additional profits fromoffering a low deductible as a function of the attractiveness of a low deductible d dh dl fixing its price at the observedlevel The thick solid line presents the counterfactual profits as implied by the estimated benchmark model The other threecurves illustrate how the profits change in response to changes in the assumptions when the correlation between riskand risk aversion is negative (thin solid line) when there is no heterogeneity in risk aversion (dot-dashed line) and whenthere is no heterogeneity in risk (dashed line) The maxima (argmax) of the four curves respectively are 659 (355)714 (583) 0 and 704 (500) The dotted vertical line represents the observed level of d (638) which implies that theadditional profits from the observed low deductible are 368 NIS per policy

778 THE AMERICAN ECONOMIC REVIEW JUNE 2007

ing that the current pricing is suboptimal mayalso be due to a limitation of the model

Consistent with the intuition discussed aboveFigure 4 also shows that the incentive to in-crease prices (or lower the deductible) is higherwith positive correlation When the correlationbetween risk and risk aversion is negative theoptimal low deductible is lower It also showsthat ignoring either dimension of unobservedheterogeneity has an important effect on pric-ing decisions However while ignoring het-erogeneity in risk does not change thequalitative pattern by much ignoring hetero-geneity in risk aversion completely changesthe pricing analysis In fact given the esti-mated coefficients and the observed prices ifindividuals vary only in risk offering a lowdeductible does not increase profits due to ad-verse selection

Figure 5 breaks down the effect on profits bypresenting the pricing effect on the demand forlow deductible D(d p) and on the compo-sition effect (d p) The former is simplygenerated by the distribution of certainty equiv-alents implied by the joint distribution of i andri (Landsberger and Meilijson 1999) It is S-shaped due to the assumption of lognormal dis-tribution The shape of the composition effect isdriven by the relative variance of i and ri andby the correlation coefficient As the estimatesimply that most of the variation in certaintyequivalents is driven by variation in ri thestrong positive correlation implies that the com-position effect is monotonically decreasing inthe deductible level As the low-deductible op-tion becomes more favorable more peoplechoose it with the most risky individuals choos-ing it first The effect of the deductible level onthe composition effect is dramatically differentwhen the correlation between risk and risk aver-sion is negative When this is the case theobserved relationship between the deductiblelevel and the composition effect is mostly re-versed This is because the effect of risk aver-sion dominates that of adverse selection due toits higher variance

IV Concluding Remarks

The paper makes two separate contributionsFirst from a methodological standpoint we layout a conceptual framework through which one

can formulate and estimate a demand system forindividually customized contracts The key datarequirements for this approach are contractchoices individual choice sets and ex post riskrealizations Since such data may be available inmany other contexts the methodological frame-work may be useful to uncover structural pa-rameters in such settings As an example onecould consider annuity data and use guaranteeperiod choices and mortality data to identifyheterogeneity in risk (mortality) and in prefer-ences for wealth after death (Einav Finkelsteinand Paul Schrimpf 2006) Similarly one couldconsider loan data and use down-paymentchoices and default data to identify heterogene-ity in risk (default) and in liquidity (Will Ad-ams Einav and Jonathan D Levin 2006)Second from an economic standpoint we pro-vide a new set of estimates for the degree andheterogeneity of (absolute) risk aversion and itsrelationship with risk We discuss these below

While our estimates of risk aversion help topredict other related insurance decisions it isnatural to ask to what extent these parametersare relevant in other contexts This is essentiallyan empirical question which can be answeredonly by estimating risk aversion parameters fora variety of bet sizes and in a variety of con-texts Since isolating risk preferences in manycontexts is hard such exercises are rare leavingus with no definite answer for the scope ofmarkets for which our estimates may be rele-vant On one hand Rabin (2000) and Rabin andThaler (2001) argue that different decisions inlife are taken in different contexts and thereforemay be subject to different parameters in theutility function On the other hand classicaltheory suggests that each individual has onevalue function over her lifetime wealth so allrisky decisions take into account the same valuefunction and are therefore subject to the samerisk preferences Our view is somewhere inbetween We are more comfortable with extrap-olation of our risk aversion estimates to setupsthat are closer to the auto insurance marketcontext in which these parameters are esti-mated To assess ldquoclosenessrdquo it is important toconsider various factors over which contextsmay differ Such factors may include bet size aswell as informational and behavioral effectssuch as default options framing and rarity of theevents Since we use claim data to identify risktype we leave everything else to be interpreted as

779VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

FIGURE 5 COUNTERFACTUALSmdashSELECTION

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) Parallel to Figure 4 we breakdown the effects on profits to the share of consumers who choose a low deductible (bottom panel) and to the expected riskof this group (top panel) This is presented by the thick solid line for the estimates of the benchmark model As in Figure 4 wealso present these effects for three additional cases when the correlation between risk and risk aversion is negative (thin solidline) when there is no heterogeneity in risk aversion (dot-dashed line) and when there is no heterogeneity in risk (dashedline) The dotted vertical lines represent the observed level of d (638) for which the share of low deductible is 16 percentand their expected risk is 0264 This may be compared with the corresponding figures in Table 2A of 178 and 0309respectively Note however that the figure presents estimated quantities for the average individual in the data while Table2A presents the average quantities in the data so one should not expect the numbers to fit perfectly

780 THE AMERICAN ECONOMIC REVIEW JUNE 2007

risk aversion As an example overconfidence willbe captured by a lower level of estimated riskaversion so if overconfidence is more importantin auto insurance than in health insurance whenextrapolated to health insurance individuals maybehave as if they are more risk averse than weestimate them to be

With this caveat in mind let us discuss ourfour main findings First we find large hetero-geneity in risk preferences across individualsThis heterogeneity is important in various con-texts (a) it cautions against using pure cross-sectional variation to test expected utilitytheory (b) it may create strong selection ofparticipants into particular markets this maymake participants in voluntary markets quitedifferent in their risk preferences from those inmarkets in which participation is either manda-tory or driven by other factors and (c) it maymake the marginal individual quite differentfrom the average one Models in macroeconom-ics and finance which often use a representativeindividual framework may not be able to cap-ture and account for such differences

Our second set of findings concerns the wayrisk aversion relates to observable characteris-tics Our finding that women are more riskaverse than men has been documented in othersettings The finding that risk preferences ex-hibit a U-shaped pattern over the life cycle maybe interesting to explore further in other con-texts Other findings suggest that the estimatedcoefficient of risk aversion increases with ob-servables that are related to income and wealthAs we agree with the widely held belief of thedecreasing absolute risk aversion property ourpreferred interpretation for this finding is thatwealth and income may be endogenous gener-ating the estimated cross-sectional relationshipWhile lower risk aversion may be associatedwith higher propensity to become an entrepre-neur and thereby have higher wealth it mayalso be associated with lower propensity to saveor invest in education affecting wealth the otherway Therefore one important message of thisfinding is that accounting for heterogeneity inpreferences may be important as representativeconsumer models may provide misleading in-terpretations for otherwise natural results

The last two sets of findings concern therelationship between risk and preferences Sincerisk is particular to the context in which it ismeasured these findings may be sensitive to the

market context and may change once risk takesother forms Even within the auto insurancemarket it is important how risk is measuredMoreover risk in the auto insurance marketmay be conceptually different from risk in othermarkets In many markets risk is independentacross individuals In auto insurance howevermuch of the risk depends on coordinationamong drivers and therefore may be more re-lated to relative not absolute characteristicsFor this reason our finding of positive correla-tion between risk and risk aversion can coexistwith findings of negative correlation in othercontexts (Israel 2005 Finkelstein and McGarry2006) The positive correlation we find is alsoconsistent with the fact that the bivariate probittest in our data provides evidence for adverseselection (Cohen 2005) while similar reduced-form tests in other contexts do not

Finally we find that unobserved heterogene-ity in risk preferences is more important thanheterogeneity in risk This may be driven by thecasual evidence that insurance companies exertmuch effort and resources in collecting con-sumer data which are informative about riskclassification but not about preferences52 Weillustrate the empirical importance of our find-ings for the analysis of optimal contracts in autoinsurance The presence of more than one di-mension of unobserved heterogeneity may dra-matically change the nature of these contractsTheory is still not fully developed for suchmultidimensional screening problems as it typ-ically requires a small number of types (Lands-berger and Meilijson 1999) restricts the twodimensions to be independent of each other(Jean-Charles Rochet and Lars A Stole 2002)or assumes that the number of instrumentsavailable to the monopolist is not smaller thanthe dimension of unobserved heterogeneity(Steven Matthews and John Moore 1987 Rich-ard Arnott and Joseph E Stiglitz 1988)53 MarkArmstrong (1999) may be the closest theoreticalwork to the framework suggested here It cannot

52 This choice of data collection efforts may be justifiedif it is easier for such firms to price discriminate based onrisk but harder to price discriminate based on preferencesA common belief is that without cost-based (ie risk-based) justification for prices price discrimination may leadto consumer backlash

53 See also Smart (2000) Villeneuve (2003) and Jullienet al (2007) for related theoretical results

781VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

be directly applied however as it uses simpli-fying linearity assumptions which would behard to impose in the current context Our re-sults indicate that many applications can benefitfrom extending the theory to include the more

general case such as the one analyzed hereSuch a theory may also serve as a guide forusing supply-side restrictions in similar con-texts Our counterfactual analysis is a very pre-liminary start in this direction

APPENDIX A DESCRIPTION OF THE GIBBS SAMPLER

In this appendix we describe the setup of the Gibbs sampler that we use to estimate the modelOne of the main advantages of the Gibbs sampler is its ability to allow for data augmentation oflatent variables (Tanner and Wong 1987) In our context this amounts to augmenting the individual-specific risk aversion and risk type namely i rii1

n as additional parametersWe can write the model as follows

(22) ln i xi i

(23) ln ri xi vi

(24) choicei 1 if ri ri ()0 if ri ri ()

(25) claimsi Poissoni ti

(26) i

vi

iid

N00

2 r

r r2

Let

2 r

r r2 X

x1

xn

0

0 x1

xn

y r and ui i

vi

The set of parameters for which we want to have a posterior distribution is given by uii1

n The prior specifies that are independent of uii1n have a conventional

diffuse prior We adopt a hierarchical prior for uii1N

(27) uii 1n

iid

N0

(28) 1 Wishart2a Q

782 THE AMERICAN ECONOMIC REVIEW JUNE 2007

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 4: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

framework and therefore focus only on thelevel of risk aversion7 In contrast we explicitlymodel observed and unobserved heterogeneityin risk aversion as well as in risk We cantherefore provide results regarding the hetero-geneity in risk preferences and its relationshipwith risk which have potentially important im-plications for welfare and policy A representa-tive individual framework cannot address suchquestions

The second strand of related literature isthe recent empirical literature on adverse se-lection in insurance markets Much of thisliterature addresses the important question ofwhether adverse selection exists in differentmarkets As suggested by the influential workof Chiappori and Salanie (2000) it uses ldquore-duced formrdquo specifications to test whether aftercontrolling for observables accident outcomesand coverage choices are significantly corre-lated (Georges Dionne and Charles Vanasse1992 Robert Puelz and Arthur Snow 1994John Cawley and Tomas Philipson 1999Finkelstein and James Poterba 2004 Finkel-stein and McGarry 2006) Cohen (2005) appliesthis test to our data and finds evidence consis-tent with adverse selection As our main goal isquite different we take a more structural ap-proach By assuming a structure for the adverseselection mechanism we can account for itwhen estimating the distribution of risk prefer-ences While the structure of adverse selectionis assumed its relative importance is not im-posed The structural assumptions allow us toestimate the importance of adverse selectionrelative to the selection induced by unobservedheterogeneity in risk attitudes As we discussin Section IIC this approach is conceptuallysimilar to that of James H Cardon and IgalHendel (2001) who model health insurancechoices and also allow for two dimensions ofunobserved heterogeneity8

The rest of the paper is organized as followsSection I describes the environment the setup

and the data Section II lays out the theoreticalmodel and the related econometric model anddescribes its estimation and identification Sec-tion III describes the results We first provide aset of reduced-form estimates which motivatethe more structural approach We then presentestimates from the benchmark specification aswell as estimates from various extensions androbustness tests We discuss and justify some ofthe modeling assumptions and perform counter-factual analysis as a way to illustrate the impli-cations of the results to profits and pricingSection IV concludes by discussing the rele-vance of the results to other settings

I Data

A Economic Environment and Data Sources

We obtained data from a single insurancecompany that operates in the market for auto-mobile insurance in Israel The data containinformation about all 105800 new policyhold-ers who purchased (annual) policies from thecompany during the first five years of its oper-ation from November 1994 to October 1999Although many of these individuals stayed withthe insurer in subsequent years we focusthrough most of the paper on deductible choicesmade by individuals in their first contract withthe company This allows us to abstract fromthe selection implied by the endogenous choiceof individuals whether to remain with the com-pany or not (Cohen 2003 2005)

The company studied was the first company inthe Israeli auto insurance market that marketedinsurance to customers directly rather thanthrough insurance agents By the end of the stud-ied period the company sold about 7 percent ofthe automobile insurance policies issued in IsraelDirect insurers operate in many countries and ap-pear to have a significant cost advantage (J DavidCummins and Jack L Van Derhei 1979) Thestudied company estimated that selling insurancedirectly results in a cost advantage of roughly 25percent of the administrative costs involved inmarketing and handling policies Despite theircost advantage direct insurers generally have haddifficulty in making inroads beyond a part of themarket because the product does not provide theldquoamenityrdquo of having an agent to work with andturn to (Stephen P DrsquoArcy and Neil A Doherty1990) This aspect of the company clearly makes

7 An exception is Syngjoo Choi et al (2006) who use alaboratory experiment and similar to us find a high degreeof heterogeneity in risk attitudes across individuals

8 In an ongoing project Pierre-Andre Chiappori andBernard Salanie (2006) estimate an equilibrium model ofthe French auto insurance market where their model of thedemand side of the market is conceptually similar to the onewe estimate in this paper

748 THE AMERICAN ECONOMIC REVIEW JUNE 2007

the results of the paper applicable only to thoseconsumers who seriously consider buying directinsurance Section IIID discusses this selection inmore detail

While we focus primarily on the demand sideof the market by modeling the deductible choicethe supply side (pricing) will be relevant for anycounterfactual exercise as well as for understand-ing the viability of the outside option (which wedo not observe and do not model) During the firsttwo years of the companyrsquos operations the pricesit offered were lower by about 20 percent thanthose offered by other conventional insurersThus due to its differentiation and cost advantagethe company had market power with respect toindividuals who were more sensitive to price thanto the disamenity of not having an agent Thismakes monopolistic screening models apply morenaturally than competitive models of insurance(eg Michael Rothschild and Joseph E Stiglitz1976) During the companyrsquos third year of oper-ation (December 1996 to March 1998) it facedmore competitive conditions when the estab-lished companies trying to fight off the new en-trant lowered the premiums for policies withregular deductibles to the levels offered by thecompany In their remaining period included inthe data the established companies raisedtheir premiums back to previous levels leav-ing the company again with a substantialprice advantage9

For each policy our dataset includes all theinsurerrsquos information about the characteristicsof the policyholder demographic characteris-tics vehicle characteristics and details abouthis driving experience The Appendix providesa list of variables and their definitions andTable 1 provides summary statistics In addi-tion our data include the individual-specificmenu of four deductible and premium combi-nations that the individual was offered (see be-low) the individualrsquos choice from this menuand the realization of risks covered by the pol-icy the length of the period over which it was ineffect the number of claims submitted by the

policyholder and the amounts of the submittedclaims10 Finally we use the zip codes of thepolicyholdersrsquo home addresses11 to augment thedata with proxies for individualsrsquo wealth basedon the Israeli 1995 census12

The policies offered by the insurer (as all pol-icies offered in the studied market) are one-periodpolicies with no commitment on the part of eitherthe insurer or the policyholder13 The policy re-sembles the US version of ldquocomprehensiverdquo in-surance It is not mandatory but it is held by alarge fraction of Israeli car owners (above 70percent according to the companyrsquos executives)The policy does not cover death or injuries to thepolicyholder or to third parties which areinsured through a separate mandatory policyInsurance policies for car audio equipmentwindshield replacement car and towing ser-vices are structured and priced separately Cer-tain types of coverage do not carry a deductibleand are therefore not used in the analysis14

Throughout the paper we use and reportmonetary amounts in current (nominal) NewIsraeli Shekels (NIS) to avoid creating artificialvariation in the data Consequently the follow-ing facts may be useful for interpretation andcomparison with other papers in the literature

9 During this last period two other companies offeringinsurance directly were established Due to first-mover advan-tage (as viewed by the companyrsquos management) which helpedthe company maintain a strong position in the market thesetwo new companies did not affect pricing policies much untilthe end of our observation period Right in the end of thisperiod the studied company acquired one of these entrants

10 Throughout the analysis we make the assumption thatthe main policyholder is the individual who makes the deduct-ible choice Clearly to the extent that this is not always thecase the results should be interpreted accordingly

11 The company has the addresses on record for billingpurposes Although in principle the company could haveused these data for pricing they do not do so

12 The Israeli Central Bureau of Statistics (CBS) associateseach census respondent with a unique ldquostatistical areardquo eachincluding between 1000 and 10000 residents We matchedthese census tracts with zip codes based on street addressesand constructed variables at the zip code level These con-structed variables are available for more than 80 percent of thepolicyholders As a proxy for wealth we use (gross) monthlyincome which is based on self-reported income by censusrespondents augmented (by the CBS) with Social Security data

13 There is a substantial literature that studies the optimaldesign of policies that commit customers to a multiperiodcontract or that include a one-sided commitment of theinsurer to offer the policyholder certain terms in subsequentperiods (Georges Dionne and Pierre Lasserre 1985 RussellCooper and Beth Hayes 1987 Georges Dionne and Neil ADoherty 1994 Igal Hendel and Alessandro Lizzeri 2003)Although such policies are observed in certain countries(Dionne and Vanasse 1992) many insurance markets includ-ing the one we study use only one-period no-commitmentpolicies (Howard Kunreuther and Mark V Pauly 1985)

14 These include auto theft total loss accidents and notldquoat faultrdquo accidents

749VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

The exchange rate between NIS and US dollarsmonotonically increased from 301 in 1995 to414 in 1999 (on average it was 352)15 Annualinflation was about 8 percent on average andcumulative inflation over the observation periodwas 48 percent We will account for these effects

as well as other general trends by using yeardummy variables throughout the analysis

B The Menu of Deductibles and Premiums

Let xi be the vector of characteristics individuali reports to the insurance company After learningxi the insurer offered individual i a menu of fourcontract choices One option offered a ldquoregularrdquodeductible and a ldquoregularrdquo premium The term

15 PPP figures were about 10 percent lower than the nom-inal exchange rates running from 260 in 1995 to 374 in 1999

TABLE 1mdashSUMMARY STATISTICSmdashCOVARIATES

Variable Mean Std dev Min Max

Demographics Age 41137 1237 1806 8943Female 0316 047 0 1Family Single 0143 035 0 1

Married 0779 042 0 1Divorced 0057 023 0 1Widower 0020 014 0 1Refused to say 0001 004 0 1

Education Elementary 0016 012 0 1High school 0230 042 0 1Technical 0053 022 0 1College 0233 042 0 1No response 0468 050 0 1

Emigrant 0335 047 0 1

Car attributes Value (current NIS)a 66958 37377 4000 617000Car age 3952 287 0 14Commercial car 0083 028 0 1Engine size (cc) 1568 385 700 5000

Driving License years 18178 1007 0 63Good driver 0548 050 0 1Any driver 0743 044 0 1Secondary car 0151 036 0 1Business use 0082 027 0 1Estimated mileage (km)b 14031 5891 1000 32200History length 2847 061 0 3Claims history 0060 015 0 2

Young driver Young 0192 039 0 1Gender Male 0113 032 0 1

Female 0079 027 0 1Age 17ndash19 0029 017 0 1

19ndash21 0051 022 0 121ndash24 0089 029 0 124 0022 015 0 1

Experience 1 0042 020 0 11ndash3 0071 026 0 13 0079 027 0 1

Company year First year 0207 041 0 1Second year 0225 042 0 1Third year 0194 040 0 1Fourth year 0178 038 0 1Fifth year 0195 040 0 1

Note The table is based on all 105800 new customers in the dataa The average exchange rate throughout the sample period was approximately $1 per 35 NIS starting at 13 in late 1994

and reaching 14 in late 1999b The estimated mileage is not reported by everyone It is available for only 60422 new customers

750 THE AMERICAN ECONOMIC REVIEW JUNE 2007

regular was used for this deductible level bothbecause it was relatively similar to the deductiblelevels offered by other insurers and because mostpolicyholders chose it The regular premium var-ied across individuals according to some deter-ministic function (unknown to us) pit ft(xi)which was quite stable over time The regulardeductible level was directly linked to the regularpremium according to

(1) dit min12

pit capt

That is it was set to one-half of the regular pre-mium subject to a deductible cap capt whichvaried over time but not across individuals Thepremiums associated with the other options werecomputed by multiplying pit by three differentconstants 106 for ldquolowrdquo deductible 0875 forldquohighrdquo deductible and 08 for ldquovery highrdquo de-ductible The regular deductible dit was con-verted to the other three deductible levels in asimilar way using multipliers of 06 for low 18for high and 26 for very high

There are two main sources of exogenous vari-ation in prices The first arises from companyexperimentation The multipliers described abovewere fixed across individuals and over time formost of the observation period but there was asix-month period during the insurerrsquos first year ofoperation (May 1995 to October 1995) in whichthe insurer experimented with slightly modifiedmultipliers16 This modified formula covers al-most 10 percent of the sample The secondsource of variation arises from discrete ad-justments to the uniform cap The cap variedover time due to inflation competitive condi-tions and as the company gained more expe-rience (Figure 1) The cap was binding for

16 For individuals with low levels of regular premiumsduring the specified period the regular deductible was set at53 percent (instead of 50 percent) of the regular premiumthe low deductible was set at 33 percent (instead of 30percent) of the regular premium and so on

FIGURE 1 VARIATION IN THE DEDUCTIBLE CAP OVER TIME

Notes This figure presents the variation in the deductible cap over time which is one of the main sources of pricing variation inthe data We do not observe the cap directly but it can be calculated from the observed menus The graph plots the maximalregular deductible offered to anyone who bought insurance from the company over a moving seven-day window Thelarge jumps in the graph reflect changes in the deductible cap There are three reasons why the graph is not perfectlysmooth First in a few holiday periods (eg October 1995) there are not enough sales within a seven-day window sonone of those sales hits the cap This gives rise to temporary jumps downward Second the pricing rule applies at thedate of the price quote given to the potential customer Our recorded date is the first date the policy becomes effectiveThe price quote is held for a period of two to four weeks so in periods in which the pricing changes we may still seenew policies sold using earlier quotes made according to a previous pricing regime Finally even within periods of constantcap the maximal deductible varies slightly (variation of less than 05 percent) We do not know the source of this variation

751VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

about a third of the policyholders in our dataAll these individuals would be affected by achange in the cap Much of the variation ofmenus in the data is driven by the exogenousshifts in the uniform deductible cap The un-derlying assumption is that conditional onobservables these sources of variation pri-marily affect the deductible choice of newcustomers but they do not have a significantimpact on the probability of purchasing insurancefrom the company Indeed this assumption holdsin the data with respect to observables there is nodistinguishable difference in the distribution ofobservable characteristics of consumers who buyinsurance just before and those who buy just aftera change in the deductible cap

C Summary Statistics

The top of Table 2A summarizes the deduct-ible menus all are calculated according to theformula described earlier Only 1 percent of thepolicyholders chose the high or very high de-ductible options Therefore for the rest of theanalysis we focus only on the choice betweenregular and low deductible options (chosen by811 and 178 percent of the individuals respec-tively)17 Focusing only on these options doesnot create any selection bias because we do notomit individuals who chose high or very highdeductibles For these individuals we assumethat they chose a regular deductible This as-sumption is consistent with the structural modelwe develop in the next section which impliesthat conditional on choosing high or very highdeductibles an individual would almost alwaysprefer the regular over the low deductible

The bottom of Table 2A as well as Table2B present summary statistics for the policy re-alizations We focus only on claim rates and noton the amounts of the claims This is because anyamount above the higher deductible level is cov-

ered irrespective of the deductible choice and thevast majority of the claims fit in this category (seeSection IIIE) For all these claims the gain fromchoosing a low deductible is the same in the eventof a claim and is equal to the difference betweenthe two deductible levels Therefore the claimamount is rarely relevant for the deductible choice(and likewise for the companyrsquos pricing decisionwe analyze in Section IIIF)

Averaging over all individuals the annualclaim rate was 0245 One can clearly observesome initial evidence of adverse selection Onaverage individuals who chose a low deductiblehad higher claim rates (0309) than those whochose the regular deductible (0232) Those whochose high and very high deductibles had muchlower claim rates (0128 and 0133 respectively)These figures can be interpreted in the context ofthe pricing formula A risk-neutral individual willchoose the low deductible if and only if her claimrate is higher than (pd) (plow pregular)(dregular dlow) When the deductible cap is notbinding which is the case for about two-thirds ofthe sample this ratio is given directly by thepricing formula and is equal to 03 Thus anyindividual with a claim rate higher than 03 willbenefit from buying the additional coverage pro-vided by a low deductible even without any riskaversion The claim data suggest that the offeredmenu is cheaper than an actuarially fair contractfor a nonnegligible part of the population (13percent according to the benchmark estimates re-ported below) This observation is in sharp con-trast to other types of insurance contracts such asappliance warranties which are much more ex-pensive than the actuarially fair price (Rabin andThaler 2001)

II The Empirical Model

A A Model of Deductible Choice

Let wi be individual irsquos wealth (pih di

h) theinsurance contract (premium and deductible re-spectively) with high deductible (pi

l dil) the

insurance contract with low deductible ti theduration of the policy and ui(w) individual irsquosvNM utility function We assume that the num-ber of insurance claims is drawn from a Poissondistribution with an annual claim rate iThrough most of the paper we assume that i isknown to the individual We also assume that iis independent of the deductible choice ie that

17 The small frequency of ldquohighrdquo and ldquovery highrdquochoices provides important information about the lowerends of the risk and risk aversion distributions but (for thatsame reason) makes the analysis sensitive to functionalform Considering these options or the option of not buyinginsurance creates a sharp lower bound on risk aversion forthe majority of the observations making the estimates muchhigher Given that these options are rarely selected how-ever it is not clear to us whether they were regularlymentioned during the insurance sales process renderingtheir use somewhat inappropriate

752 THE AMERICAN ECONOMIC REVIEW JUNE 2007

there is no moral hazard Finally we assumethat in the event of an accident the value of theclaim is greater than di

h We revisit all theseassumptions in Sections IIID and IIIE For therest of this section i subscripts are suppressedfor convenience

In the market we study insurance policies are

typically held for a full year after which theycan be automatically renewed with no commit-ment by either the company or the individualMoreover all auto-insurance policies sold inIsrael can be canceled without prior notice bythe policyholder with premium payments beinglinearly prorated Both the premium and the

TABLE 2AmdashSUMMARY STATISTICSmdashMENUS CHOICES AND OUTCOMES

Variable Obs Mean Std dev Min Max

Menu Deductible (current NIS)a Low 105800 87548 12101 37492 103911Regular 105800 145299 19779 62486 171543High 105800 260802 35291 112475 308778Very high 105800 376305 50853 162464 446013

Premium (current NIS)a Low 105800 338057 91404 132471 1923962Regular 105800 318922 8623 124972 1815058High 105800 279057 75451 109351 1588176Very high 105800 255137 68984 99978 1452046

pd 105800 0328 006 03 18

Realization Choice Low 105800 0178 038 0 1Regular 105800 0811 039 0 1High 105800 0006 008 0 1Very high 105800 0005 007 0 1

Policy termination Active 105800 0150 036 0 1Canceled 105800 0143 035 0 1Expired 105800 0707 046 0 1

Policy duration (years) 105800 0848 028 0005 108Claims All 105800 0208 048 0 5

Low 18799 0280 055 0 5Regular 85840 0194 046 0 5High 654 0109 034 0 3Very high 507 0107 032 0 2

Claims per yearb All 105800 0245 066 0 19882Low 18799 0309 066 0 9264Regular 85840 0232 066 0 19882High 654 0128 062 0 12636Very high 507 0133 050 0 3326

a The average exchange rate throughout the sample period was approximately $1 per 35 NIS starting at 13 in late 1994and reaching 14 in late 1999

b The mean and standard deviation of the claims per year are weighted by the observed policy duration to adjust forvariation in the exposure period These are the maximum likelihood estimates of a simple Poisson model with no covariates

TABLE 2BmdashSUMMARY STATISTICSmdashCONTRACT CHOICES AND REALIZATIONS

Claims Low Regular High Very high Total Share

0 11929 (0193) 49281 (0796) 412 (0007) 299 (0005) 61921 (100) 080341 3124 (0239) 9867 (0755) 47 (0004) 35 (0003) 13073 (100) 016962 565 (0308) 1261 (0688) 4 (0002) 2 (0001) 1832 (100) 002383 71 (0314) 154 (0681) 1 (0004) 0 (0000) 226 (100) 000294 6 (0353) 11 (0647) 0 (0000) 0 (0000) 17 (100) 000025 1 (0500) 1 (0500) 0 (0000) 0 (0000) 2 (100) 000003

Notes The table presents tabulation of choices and number of claims For comparability the figures are computed only forindividuals whose policies lasted at least 09 years (about 73 percent of the data) The bottom rows of Table 2A providedescriptive figures for the full dataset The numbers in parentheses in each cell represent percentages within each row Theright-hand-side column presents the marginal distribution of the number of claims

753VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

probability of a claim are proportional to thelength of the time interval taken into account soit is convenient to think of the contract choice asa commitment for only a short amount of timeThis approach has several advantages First ithelps to account for early cancellations andtruncated policies which together constitute 30percent of the policies in the data18 Second itmakes the deductible choice independent ofother longer-term uncertainties faced by the in-dividual so we can focus on static risk-takingbehavior Third this formulation helps to obtaina simple framework for analysis which is attrac-tive both analytically and computationally19

The expected utility that the individual obtainsfrom the choice of a contract (p d) is given by

(2) v p d

1 tuw pt tuw pt d

We characterize the set of parameters that willmake the individual indifferent between the twooffered contracts This set provides a lower (up-per) bound on the level of risk aversion for indi-viduals who choose the low (high) deductible (fora given ) Thus we analyze the equation v(phdh) v(pl dl) By taking limits with respect to t(and applying LrsquoHopitalrsquos rule) we obtain

(3)

limt30

1

t(u(w pht) u(w plt))

(u(w pht) u(w pht dh)) (u(w plt) u(w plt dl))

pl phuw

uw dl uw dh

or

(4)

pl phuw uw dl uw dh

The last expression has a simple intuition Theright-hand side is the expected gain (in utils) perunit of time from choosing a low deductibleThe left-hand side is the cost of such a choiceper unit of time For the individual to be indif-ferent the expected gains must equal the costs

In our benchmark specification we assumethat the third derivative of the vNM utilityfunction is not too large A Taylor expansion forboth terms on the right-hand side of equation(4) ie u(w d) u(w) du(w) (d22)u13(w) implies that

(5)pl ph

uw dh dluw

1

2dh dldh dlu13w

Let d dh dl 0 p pl ph 0 andd 1frasl2 (dh dl) to obtain

(6)p

duw uw d u13w

or

(7) r u13w

uw

p

d 1

d

where r is the coefficient of absolute risk aver-sion at wealth level w

18 As can be seen in Table 2A 70 percent of the policiesare observed through their full duration (one year) About15 percent are truncated by the end of our observationperiod and the remaining 15 percent are canceled for var-ious reasons such as change in car ownership total-lossaccident or a unilateral decision of the policyholder tochange insurance providers

19 This specification ignores the option value associatedwith not canceling a policy This is not very restrictiveSince experience rating is small and menus do not changeby much this option value is likely to be close to zero Asimple alternative is to assume that individuals behave as ifthey commit for a full year of coverage In such a case themodel will be similar to the one we estimate but willdepend on the functional form of the vNM utility functionand would generally require taking infinite sums (over thepotential realizations for the number of claims within the year)In the special case of quadratic expected utility maximizerswho care only about the mean and variance of the number ofclaims this is easy to solve The result is almost identical to theexpression we subsequently derive in equation (7)

754 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Equation (7) defines an indifference set in thespace of risk and risk aversion which we willrefer to by (r() ) and ((r) r) interchange-ably Both r() and (r) have a closed-formrepresentation a property that will be computa-tionally attractive for estimation20 Both termsare individual specific as they depend on thedeductible menu which varies across individu-als For the rest of the paper we regard eachindividual as associated with a two-dimensionaltype (ri i) An individual with a risk parameter

i who is offered a menu (pih di

h) (pil di

l)will choose the low-deductible contract if andonly if his coefficient of absolute risk aversionsatisfies ri ri(i) Figure 2 presents a graphi-cal illustration

B The Benchmark Econometric Model

The Econometric ModelmdashThe econometricmodel we estimate is fully described by the fiveequations presented in this section Our ob-jective is to estimate the joint distribution of(i ri)mdashthe claim rate and coefficient of abso-lute risk aversionmdashin the population of policy-holders conditional on observables xi The

20 For example estimating the CARA version of themodel (Section IIID) for which r() does not have aclosed-form representation takes almost ten times longer

FIGURE 2 THE INDIVIDUALrsquoS DECISIONmdashA GRAPHICAL ILLUSTRATION

Notes This graph illustrates the individualrsquos decision problem The solid line presents the indifference setmdashequation (7)mdashappliedfor the menu faced by the average individual in the sample Individuals are represented by points in the two-dimensional spaceabove In particular the scattered points are 10000 draws from the joint distribution of risk and risk aversion for the averageindividual (on observables) in the data based on the point estimates of the benchmark model (Table 4) If an individual is eitherto the right of the line (high risk) or above the line (high risk aversion) the low deductible is optimal Adverse selection is capturedby the fact that the line is downward sloping higher-risk individuals require lower levels of risk aversion to choose the lowdeductible Thus in the absence of correlation between risk and risk aversion higher-risk individuals are more likely to choosehigher levels of insurance An individual with i (pidi) will choose a lower deductible even if he is risk neutral ie withprobability one (we do not allow individuals to be risk loving) This does not create an estimation problem because i is notobserved only a posterior distribution for it Any such distribution will have a positive weight on values of i that are below(pidi) Second the indifference set is a function of the menu and in particular of (pidi) and d An increase in (pidi) willshift the set up and to the right and an increase in d will shift the set down and to the left Therefore exogenous shiftsof the menus that make both arguments change in the same direction can make the sets ldquocrossrdquo thereby allowing us toidentify the correlation between risk and risk aversion nonparametrically With positive correlation (shown in the figureby the ldquoright-bendingrdquo shape of the simulated draws) the marginal individuals are relatively high risk therefore creatinga stronger incentive for the insurer to raise the price of the low deductible

755VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

benchmark formulation assumes that (i ri) fol-lows a bivariate lognormal distribution Thuswe can write the model as

(8) ln i xi i

(9) ln ri xi vi

with

(10) i

vi

iid

N00

2 r

r r2

Neither i nor ri is directly observed Thereforewe treat both as latent variables Loosely speak-ing they can be thought of as random effects Weobserve two variables the number of claims andthe deductible choice which are related to thesetwo unobserved components Thus to completeour econometric model we have to specify therelationship between the observed variables andthe latent ones This is done by making two struc-tural assumptions First we assume that the num-ber of claims is drawn from a Poisson distributionnamely

(11) claimsi Poisson(i ti )

where ti is the observed duration of the policySecond we assume that when individuals maketheir deductible choices they follow the theoreti-cal model described in the previous section Themodel implies that individual i chooses the lowdeductible (choicei 1) if and only if ri ri(i)where ri is defined in equation (7) Thus theempirical model for deductible choice is given by

(12) Prchoicei 1 Prri

pi

idi 1

d i

Prexp(xi vi)

pi

exp(xi i)di 1

d i

With no unobserved heterogeneity in risk (i 0) equation (12) reduces to a simple probit In

such a case one can perfectly predict risk from thedata denote it by (xi) and construct an addi-tional covariate ln([pi((xi)di) 1]d i) Giventhe assumption that risk aversion is distributedlognormally running the probit regressionabove and renormalizing the coefficient onthe constructed covariate to 1 (instead ofthe typical normalization of the variance of theerror term to 1) has a structural interpretation withln (ri) as the dependent variable However Cohen(2005) provides evidence of adverse selection inthe data implying the existence of unobservedheterogeneity in risk This makes the simple probitregression misspecified Estimation of the fullmodel is more complicated Once we allow forunobserved heterogeneity in both unobserved riskaversion (vi) and claim rate (i) we have to inte-grate over the two-dimensional region depicted inFigure 2 for estimation

EstimationmdashA natural way to proceed is toestimate the model by maximum likelihoodwhere the likelihood of the data as a function ofthe parameters can be written by integrating outthe latent variables namely

(13) Lclaimsi choicei

Prclaimsi choiceii riPri ri

where is a vector of parameters to be esti-mated While formulating the empirical modelusing likelihood may help our thinking aboutthe data-generating process using maximumlikelihood (or generalized method of moments(GMM)) for estimation is computationally cum-bersome This is because in each iteration itrequires evaluating a separate complex integralfor each individual in the data In contrastMarkov Chain Monte Carlo (MCMC) Gibbssampling is quite attractive in such a case Us-ing data augmentation of latent variables (Mar-tin A Tanner and Wing Hung Wong 1987)according to which we simulate (i ri) and latertreat those simulations as if they are part ofthe data one can avoid evaluating the complexintegrals by sampling from truncated normaldistributions which is significantly less compu-tationally demanding (eg Luc Devroye 1986)This feature combined with the idea of aldquosliced samplerrdquo (Paul Damien John Wake-field and Stephen Walker 1999) to sample from

756 THE AMERICAN ECONOMIC REVIEW JUNE 2007

an unfamiliar posterior distribution makes theuse of a Gibbs sampler quite efficient for ourpurposes Finally the lognormality assumptionimplies that F(ln(i)ri) and F(ln(ri)i) follow a(conditional) normal distribution allowing us torestrict attention to univariate draws furtherreducing the computational burden

The Appendix provides a full description ofthe Gibbs sampler including the conditionaldistributions and the (flat) prior distributions weuse The basic intuition is that conditional onobserving (i ri) for each individual we have asimple linear regression model with two equa-tions The less standard part is to generate drawsfor (i ri) We do this iteratively Conditionalon i the posterior distribution for ln(ri) fol-lows a truncated normal distribution where thetruncation point depends on the menu individ-ual i faces and its direction (from above orbelow) depends on individual irsquos deductiblechoice The final step is to sample from theposterior distribution of ln(i) conditional on riThis is more complicated as we have bothtruncation which arises from adverse selection(just as we do when sampling for ri) and thenumber of claims which provides additionalinformation about the posterior of i Thus theposterior for i takes an unfamiliar form forwhich we use a ldquosliced samplerrdquo

We use 100000 iterations of the Gibbs sam-pler It seems to converge to the stationarydistribution after about 5000 iterations There-fore we drop the first 10000 draws and use thelast 90000 draws of each variable to report ourresults The results are based on the posteriormean and posterior standard deviation fromthese 90000 draws Note that each iterationinvolves generating separate draws of (i ri) foreach individual Performing 100000 iterationsof the benchmark specification (coded in Mat-lab) takes about 60 hours on a Dell Precision530 workstation

C Identification

The parametric version of the model is iden-tified mechanically There are more equationsthan unknowns and no linear dependenciesamong them so (as also verified using MonteCarlo exercises) the model parameters can bebacked out from simulated data Our goal in thissection is not to provide a formal identificationproof Rather we want to provide intuition for

which features of the data allow us to identifyparticular parameters of the model The discus-sion also highlights the assumptions that areessential for identification vis-a-vis those thatare made only for computational convenience(making them in principle testable)

Discussion of Nonparametric Identifica-tionmdashThe main difficulty in identifying themodel arises from the gap between the (exante) risk type i which individuals usewhen choosing a deductible and the (ex post)realization of the number of claims we ob-serve We identify between the variation inrisk types and the variation in risk realizationsusing our parametric distributional assump-tions The key is that the distribution of risktypes can be uniquely backed out from theclaim data alone This allows us to use thedeductible choice as an additional layer ofinformation which identifies unobserved het-erogeneity in risk aversion21 Any distribu-tional assumption that allows us to uniquelyback out the distribution of risk types fromclaim data would be sufficient to identify thedistribution of risk aversion As is customaryin the analysis of count processes we make aparametric assumption that claims are gener-ated by a lognormal mixture of Poisson dis-tributions (Section IIID discusses this furtherand explores an alternative) Using a mixtureenables us to account for adverse selectionthrough unobserved heterogeneity in risk Italso allows us to better fit the tails of theclaim distribution In principle a more flexi-ble mixture or a more flexible claim-generat-ing process could be identified as long as theclaims data are sufficiently rich22

21 Cardon and Hendel (2001) face a similar identificationproblem in the context of health insurance They use vari-ation in coverage choice (analogous to deductible choice) toidentify the variation in health-status signals (analogous torisk types) from the variation in health expenditure (analo-gous to number of claims) They can rely on the coveragechoice to identify this because they make an assumptionregarding unobserved heterogeneity in preferences (iidlogit) We take a different approach as our main goal is toestimate rather than assume the distribution of preferences

22 Although it may seem that the claim data are limited(as they take only integer values between 0 and 5 in ourdata) variation in policy duration generates continuousvariation in the observed claim propensity Of course thisvariation also introduces an additional selection into themodel due to policy cancellations which are potentially

757VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Once the distribution of risk types is iden-tified from claims data the marginal distribu-tion of risk aversion (and its relationship tothe distribution of risk types) is nonparametri-cally identified from the variation in theoffered menus discussed in Section I Thisvariation implies different deductible and pre-mium options to identical (on observables)individuals who purchased insurance at dif-ferent times Different menus lead to differentindifference sets (similar to the one depictedin Figure 2) These sets often cross each otherand nonparametrically identify the distribu-tion of risk aversion and the correlation struc-ture at least within the region in which theindifference sets vary For the tails of thedistribution as is typically the case we haveto rely on parametric assumptions or usebounds The parametric assumption of log-normality we use for most of the paper ismade only for computational convenience

Intuition for the Parametric IdentificationMechanismmdashVariation in the offered menus isimportant for the nonparametric identificationThe parametric assumptions could identify themodel without such variation Thus to keep theintuition simple let us take the bivariate log-normal distribution as given and contrary to thedata assume that all individuals are identical onobservables and face the same menu Supposealso that all individuals are observed for exactlyone year and have up to two claims23 In thissimplified case the model has five parametersto be estimated the mean and variance of risk and

2 the mean and variance of risk aver-sion r and r

2 and the correlation parameter The data can be summarized by five numbersLet 130 131 and 132 1 131 130 be thefraction of individuals with zero one and twoclaims respectively Let 0 1 and 2 be theproportion of individuals who chose a low de-ductible within each ldquoclaim grouprdquo Given ourdistributional assumption about the claim-

generating process we can use 130 and 131 touniquely identify and

2 Loosely isidentified by the average claim rate in the dataand

2 is identified by how fat the tail of theclaim distribution is ie by how large (132131)is compared to (131130) Given and

2 and thelognormality assumption we can (implicitly)construct a posterior distribution of risk typesfor each claim group F(r claims c) andintegrate over it when predicting the deductiblechoice This provides us with three additionalmoments each of the form

(14) Ec Prchoice 1r

dFr claims c dFr

for c 0 1 2 These moments identify thethree remaining parameters of the model rr

2 and Let us now provide more economic content to

the identification argument Using the same ex-ample and conditional on identifying and

2

from the claim data one can think about thedeductible choice data 0 1 2 as a graph(c) The absolute level of the graph identifiesr In the absence of correlation between riskand risk aversion the slope of the graph iden-tifies r

2 with no correlation the slope shouldalways be positive (due to adverse selection)but higher r

2 would imply a flatter graph be-cause more variation in the deductible choiceswill be attributed to variation in risk aversionFinally is identified by the curvature of thegraph The more convex (concave) the graph isthe more positive (negative) is the estimated For example if 0 05 1 051 and 2 099 it is likely that r

2 is high (explaining why0 and 1 are so close) and is highly positive(explaining why 2 is not also close to 1) Incontrast if 0 1 it must mean that thecorrelation between risk and risk aversion isnegative which is the only way the originalpositive correlation induced by adverse selec-tion can be offset This intuition also clarifiesthat the identification of relies on observingindividuals with multiple claims (or differentpolicy durations) and that it is likely to besensitive to the distributional assumptions Thedata (Table 2B) provide a direct (positive) cor-relation between deductible choice and claims

endogenous The results are similar when we use onlyexpired and truncated policies

23 This variation is sufficient (and necessary) to identifythe benchmark model The data provide more variation weobserve up to five claims per individual we observe con-tinuous variation in the policy durations we observe vari-ation in prices and we exploit distributional restrictionsacross individuals with different observable characteristics

758 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The structural assumptions allow us to explainhow much of this correlation can be attributedto adverse selection The remaining correlation(positive or negative) is therefore attributed tocorrelation in the underlying distribution of riskand risk aversion

III Results

A Reduced-Form Estimation

To get an initial sense for the levels of abso-lute risk aversion implied by the data we use asimple back-of-the-envelope exercise We com-pute unconditional averages of p d dh dland d (Table 2A)24 and substitute these valuesin equation (7) The implied coefficient of ab-solute risk aversion is 29 104 NIS125 Thisfigure could be thought of as the average indif-ference point implying that about 18 percent ofthe policyholders have coefficients of absoluterisk aversion exceeding it To convert to USdollar amounts one needs to multiply thesefigures by the average exchange rate (352)resulting in an average indifference point of102 103 $US1 This figure is less than halfof a similar estimate reported by Sydnor (2006)for buyers of homeownerrsquos insurance but about3 and 13 times higher than comparable fig-ures reported by Gertner (1993) and Metrick(1995) respectively for television game showparticipants

Table 3 provides reduced-form analysis ofthe relationship between the observables andour two left-hand-side variables the number ofclaims and the deductible choice26 Column 1reports the estimates from a Poisson regressionof the number of claims on observed character-istics This regression is closely related to therisk equation we estimate in the benchmarkmodel It shows that older people women andpeople with a college education are less likely to

have an accident Bigger more expensiveolder and noncommercial cars are more likelyto be involved in an accident Driving experi-ence and variables associated with less intenseuse of the car reduce accident rates As could beimagined claim propensity is highly correlatedover time past claims are a strong predictor offuture claims Young drivers are 50 to 70 per-cent more likely to be involved in an accidentwith young men significantly more likely thanyoung women Finally as indicated by the trendin the estimated year dummies the accident ratesignificantly declined over time Part of thisdecline is likely due to the decline in accidentrates in Israel in general27 This decline mightalso be partly due to the better selection ofindividuals the company obtained over time asit gained more experience (Cohen 2003)

Columns 2 and 3 of Table 3 present estimatesfrom probit regressions in which the dependentvariable is equal to one if the policyholder chosea low deductible and is equal to zero otherwiseColumn 3 shows the marginal effects of thecovariates on the propensity to choose a lowdeductible These marginal effects do not have astructural interpretation as the choice of lowdeductible depends on its price on risk aver-sion and on risk In this regression we againobserve a strong trend over time Fewer policy-holders chose the low deductible as time wentby One reason for this trend according to thecompany executives is that over time the com-panyrsquos sales persons were directed to focusmainly on the ldquodefaultrdquo regular deductible op-tion28 The effect of other covariates will bediscussed later in the context of the full modelIn unreported probit regressions we also test thequalitative assumptions of the structural model byadding three additional regressors the price ratio(pidi) the average deductible offered d i

24 The unconditional is computed by maximum likeli-hood using the data on claims and observed durations of thepolicies

25 Using the CARA specification as in equation (16) weobtain a slightly lower value of 25 104 NIS1

26 We find positive correlation in the errors of these tworegressions suggesting the existence of adverse selection inthe data and motivating a model with unobserved heteroge-neity in risk This test is similar to the bivariate probit testproposed by Chiappori and Salanie (2000) and replicatesearlier results reported in Cohen (2005)

27 In particular traffic fatalities and counts of trafficaccidents in Israel fell by 11 percent and 18 percent during1998 and 1999 respectively

28 Such biased marketing efforts will bias consumersagainst choosing the low deductible thus making them lookless risk averse This would make our estimate a lowerbound on the true level of risk aversion If only sophisti-cated consumers could see beyond the marketing effort andthis sophistication were related to observables (eg educa-tion) the coefficients on such observables would be biasedupward This is not a major concern given that the coefficientson most of the covariates are fairly stable when we estimatethe benchmark model separately for different years

759VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

TABLE 3mdashNO HETEROGENEITY IN RISK

Variable

Poisson regressiona Probit regressionsc

Dep var Number of claims Dep var 1 if low deductible

(1) (2) (3)

Coeff Std Err IRRb Coeff Std Err dPdX dPdXDemographics Constant 43535 02744 mdash 224994 23562 mdash

Age 00074 00052 09926 01690 00464 00034 00042Age2 989 105 536 105 10001 00017 00005 339 105 455 105Female 00456 00161 09555 08003 01316 00165 00134Family Single omitted omitted omitted

Married 01115 00217 08945 06417 01950 00128 00045Divorced 00717 00346 10743 03191 03127 00064 00044Widower 00155 00527 10156 03441 04573 00069 00025Other (NA) 01347 01755 11442 29518 19836 00511 00370

Education Elementary 00522 00550 09491 02927 04278 00061 00016High school omitted omitted omittedTechnical 00373 00308 10380 05037 02520 00105 00138College 00745 00197 09282 05279 01546 00109 00053Other (NA) 00116 00184 10116 00630 01540 00013 00025

Emigrant 00210 00163 10213 00497 01328 00010 00003

Car attributes Log(Value) 01227 00281 11306 13285 02342 00271 00299Car age 00187 00042 10189 01603 00357 00033 00017Commercial car 01394 00326 08699 09038 02781 00177 00294Log(engine size) 02972 00459 13461 06924 03952 00141 00075

Driving License years 00204 00034 09798 01043 00312 00021 00005License years2 00002 677 105 10002 00015 00005 299 105 167 105

Good driver 00176 00191 09825 07207 01574 00148 00152Any driver 00564 00169 10580 10923 01321 00217 00258Secondary car 00859 00209 09177 00038 01626 00001 00070Business use 01852 00293 12034 07951 02737 00156 00017History length 00527 00110 09486 11218 01450 00228 00171Claims history 06577 00390 19304 04654 05460 00095 00496

Young driver Young driver 05235 00361 16879 28847 06305 00524 00012Gender Male omitted omitted omitted

Female 01475 00288 08629 17959 03135 00396 00195Age 17ndash19 omitted omitted omitted

19ndash21 00701 00532 10726 07800 07509 00153 0014721ndash24 00267 00574 09737 05746 07773 00114 0015624 02082 00567 0812 17869 07328 00397 00136

Experience 1 omitted omitted omitted1ndash3 02416 00458 07854 10175 06577 00217 000093 02827 00532 07538 32513 07386 00762 00410

Company year First year omitted omitted omittedSecond year 00888 00198 09150 44513 01506 00792 00859Third year 00690 00222 09334 85888 01820 01303 01367Fourth year 01817 00232 08339 118277 02102 01616 01734Fifth year 05431 0028 05810 143206 02778 01920 02085

105586Obs 105800 94000 105800Pseudo R2 00162 01364 01296Log likelihood 577459 369595 430869

Significant at the 5 percent confidence levela Maximum likelihood estimates Variation in exposure (policy duration) is accounted forb IRR Incidence rate ratio Each figure should be interpreted as the increasedecrease in claim probability as a result of an increase of one unit in the

right-hand-side variablec There are two separate probit regressions reported in this table Column 2 relies on the deductible choice model and the lognormality assumption As discussed

in the text by including an additional regressor ln([pi((xi)di) 1]d i) (with (xi) predicted from column 1 above) and normalizing its coefficient to 1 we obtaina structural interpretation of this regression with ln (ri) as the dependent variable Thus the reported coefficients are comparable to those estimated for the benchmarkmodel One should be cautious however in interpreting these coefficients Unlike the benchmark model this regression does not allow unobserved heterogeneity inrisk and suffers from some selection bias because observations with high predicted risk rate are omitted (which is why the number of observations is 94000 ratherthan the full sample of 105800) For comparison column 3 reports the marginal effects from a comparable probit regression that uses the full sample and does notcontrol for pricing and predicted risk through the additional structural regressor Column 3 does not have a structural interpretation and its (unreported) coefficientscannot be compared to those estimated from the benchmark model

760 THE AMERICAN ECONOMIC REVIEW JUNE 2007

and the risk rate (xi) as predicted from thePoisson regression of column 1 All three addi-tional regressors enter with the predicted signand with large and highly significant marginaleffects29

Finally column 2 of Table 3 presents animportant specification of the probit regres-sion in which ln([pi(( xi)di) 1]d i) isadded as a regressor and its coefficient isnormalized to 130 As already mentioned ifwe assume that there is no unobserved heter-ogeneity in risk then column 2 is analogousto the ln(ri) equation of the benchmark modelThis restriction of the model is rejected by thedata About 10 percent of the individuals arepredicted to have (xi) (pi di) implying achoice of low deductible for any level of riskaversion Many of these individuals howeverstill choose higher deductible levels Column2 reports the results for the remaining indi-viduals (94000 out of 105800) for which theregression can be run While the signs of theestimated coefficients are similar to those inthe benchmark model presented below therestricted version of the model suggests muchhigher levels and dispersion of risk aversionwell above any reasonable level31 The fullestimation of the benchmark model clearly re-jects this restriction on the model

B Estimation of the Benchmark Model

The Effect of Individual Characteristics onRisk AversionmdashTable 4 presents the estimationresults from the benchmark model The secondcolumn shows how the level of absolute riskaversion is related to individual characteristicsAs the dependent variable is in natural loga-

rithm coefficients on dummy variables can bedirectly interpreted as approximate percentagechanges The vast majority of these coefficientsare quite stable across a wide range of specifi-cations which are mentioned later

The results indicate that women are more riskaverse than men and have a coefficient of ab-solute risk aversion about 20 percent greaterthan that of men These results are consistentwith those of Donkers et al (2001) and Hartoget al (2002) The estimated effect of age sug-gests a nonmonotone pattern of risk preferencesover the life cycle The estimated coefficientsimply that initially (that is at age 18 the youngestindividual in the data) individuals become less riskaverse with age but around the age of 48 indi-viduals start becoming more risk averse32 Mar-ried individuals are estimated to be significantlymore risk averse compared to singles while di-vorced individuals are less (although the coeffi-cient is statistically insignificant)

The analysis suggests that variables that arelikely to be correlated with income or wealthsuch as post-high-school education and thevalue of the car have a positive coefficientindicating that wealthier people have higherlevels of absolute risk aversion Although we donot have data on individualsrsquo income or wealthwe use other proxies for income in additionalspecifications and obtain mixed results Whenwe include as a covariate the average householdincome among those who live in the same zipcode we obtain a significant and negative co-efficient of 0333 (0154) (standard deviationin parentheses) When we match zip code in-come on demographics of the individuals thecoefficient is effectively zero 0036 (0047)and when we use a socioeconomic index ofthe locality in which the individual lives thecoefficient is positive 0127 (0053)33 Thus

29 The estimated marginal effect (z-statistic in parenthe-ses) is 0352 (1376) 16 104 (1481) and 0154(255) for (pidi) d i and (xi) respectively

30 The level is a normalization The sign is estimatedHad the sign on this regressor been positive this would haveimplied a rejection of the model

31 For the implied median level of risk aversion the re-stricted model produces a similar estimate to the estimate wereport below for the benchmark model However since someindividuals who chose the low deductible are estimated to havevery low claim rates the restricted model is ldquoforcedrdquo to estimatevery high risk aversion for these individuals (in contrast thebenchmark model can explain such choices by positive risk re-siduals) resulting in very high dispersion and (due to the lognor-mality assumption) very high average risk aversion which isabout 1025 higher than the benchmark estimates we report below

32 This nonmonotone pattern may explain why age enterswith different signs in the estimation results of Donkers et al(2001) and Hartog et al (2002) A somewhat similar U-shapepattern with respect to age is also reported by Sumit Agarwalet al (2006) in the context of consumer credit markets

33 The full results from these specifications are providedin the online Appendix (available at httpwwwe-aerorgdatajune0720050644_apppdf ) The reason we do not usethese income variables in the benchmark specification istheir imperfect coverage which would require us to omitalmost 20 percent of the individuals Other results fromthese regressions are similar to those we report for thebenchmark specification

761VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

overall we interpret our findings as suggestiveof a nonnegative and perhaps positive associ-ation between incomewealth and absolute riskaversion

At first glance these results may appear tobe inconsistent with the widely held beliefthat absolute risk aversion declines withwealth One should distinguish however be-tween two questions (a) whether for a givenindividual the vNM utility function exhibitsdecreasing absolute risk aversion and (b)how risk preferences vary across individuals

Our results do not speak to the first questionand should not be thought of as a test of thedecreasing absolute risk aversion propertyTesting this property would require observingthe same individual making multiple choicesat different wealth levels Rather our resultsindicate that individuals with greater wealthhave utility functions that involve a greaterdegree of risk aversion It might be thatwealth is endogenous and that risk aversion(or unobserved individual characteristics thatare correlated with it) leads individuals to

TABLE 4mdashTHE BENCHMARK MODEL

Variable Ln() equation Ln(r) equation Additional quantities

Demographics Constant 15406 (00073) 118118 (01032) Var-covar matrix ()

Age 00001 (00026) 00623 (00213) 01498 (00097)Age2 624 106 (263 105) 644 104 (211 104) r 31515 (00773)Female 00006 (00086) 02049 (00643) 08391 (00265)Family Single Omitted Omitted

Married 00198 (00115) 01927 (00974) Unconditional statisticsa

Divorced 00396 (00155) 01754 (01495) Mean 02196 (00013)Widower 00135 (00281) 01320 (02288) Median 02174 (00017)Other (NA) 00557 (00968) 04599 (07397) Std Dev 00483 (00019)

Education Elementary 00194 (00333) 01283 (02156) Mean r 00019 (00002)High school Omitted Omitted Median r 727 106 (756 107)Technical 00017 (00189) 02306 (01341) Std Dev r 00197 (00015)College 00277 (00124) 02177 (00840) Corr(r ) 02067 (00085)Other (NA) 00029 (00107) 00128 (00819)

Emigrant 00030 (00090) 00001 (00651) Obs 105800

Car attributes Log(value) 00794 (00177) 07244 (01272)Car age 00053 (00023) 00411 (00176)Commercial car 00719 (00187) 00313 (01239)Log(engine size) 01299 (00235) 03195 (01847)

Driving License years 00015 (00017) 00157 (00137)License years2 183 105 (351 105) 148 104 (254 104)Good driver 00635 (00112) 00317 (00822)Any driver 00360 (00105) 03000 (00722)Secondary car 00415 (00141) 01209 (00875)Business use 00614 (00134) 03790 (01124)History length 00012 (00052) 03092 (00518)Claims history 01295 (00154) 00459 (01670)

Young driver Young driver 00525 (00253) 02499 (02290)Gender Male Omitted mdash

Female 00355 (00061) mdashAge 17ndash19 Omitted mdash

19ndash21 00387 (00121) mdash21ndash24 00445 (00124) mdash24 00114 (00119) mdash

Experience 1 Omitted mdash1ndash3 00059 (00104) mdash3 00762 (00121) mdash

Company year First year Omitted OmittedSecond year 00771 (00122) 14334 (00853)Third year 00857 (00137) 28459 (01191)Fourth year 01515 (00160) 38089 (01343)Fifth year 04062 (00249) 39525 (01368)

Note Standard deviations based on the draws from the posterior distribution in parentheses Significant at the 5 percent confidence levela Unconditional statistics represent implied quantities for the sample population as a whole ie integrating over the distribution of covariates in the sample (as

well as over the unobserved components)

762 THE AMERICAN ECONOMIC REVIEW JUNE 2007

save more to obtain more education or totake other actions that lead to greaterwealth34

Let us make several additional observationsFirst while owners of more expensive cars ap-pear to have both higher risk exposure andhigher levels of risk aversion owners of biggercars have higher risk exposure but lower levelsof risk aversion This should indicate that thestructure of the model does not constrain therelationship between the coefficients in the twoequations Rather it is the data that ldquospeak uprdquoSecond individuals who are classified by theinsurer as ldquogood driversrdquo indeed have lowerrisk but also appear to have lower risk aversionThis result is somewhat similar to the positivecorrelation between unobserved risk and unob-served risk aversion which we report belowThird policyholders who tend to use the car forbusiness are less risk averse This could bebecause uninsured costs of accidents occurringto such policyholders are tax deductible Fourthpolicyholders who reported three full years ofclaims history are more risk averse but are notdifferent in their risk exposure The attitude thatleads such policyholders to comply with therequest to (voluntarily) report three full years ofclaims history is apparently and not surpris-ingly correlated with higher levels of risk aver-sion In contrast while past claims indicate highrisk they have no significant relationship withrisk aversion Finally we find a strong trendtoward lower levels of risk aversion over timeThis is a replication of the probit results dis-cussed earlier

The Effect of Individual Characteristics onClaim RiskmdashThe first column of Table 4 de-scribes the relationship between observablesand risk exposure Accident risk is higher fordivorced individuals and lower for people witha college education Bigger older more expen-sive and noncommercial cars are all morelikely to be involved in an accident Drivingexperience reduces accident rates as do mea-sures of less intense use of the car while youngdrivers are more exposed to risk Claim propen-sity is highly correlated over time the voluntaryreport of past claims is a strong predictor offuture claims This risk equation produces re-sults that are similar to those of the simplerPoisson regression reported in Table 3 Al-though some of the coefficients lose signifi-cance the magnitude of most coefficients isquite similar The similarity between these twosets of results is to be expected as the riskregression is identified primarily from the dataon claims so incorporating the information ondeductible choice does not qualitatively changethe conceptual identification strategy (see Sec-tion IIC) If the results were not similar thiscould have indicated a misspecification of themodel The slight differences between the riskregressions in Table 3 and Table 4 are drivenmainly by the structural assumptions First thebenchmark model estimates a lognormal mix-ture of Poisson rates rather than a single Pois-son model By incorporating the fatter tails ofthe claim distribution it slightly changes theresults increases the standard errors and de-creases the average predicted claim rate Sec-ond the information on deductible choiceslightly helps us in obtaining more precise es-timates through the correlation structure be-tween the error terms in the two equations

The Implied Level of Risk AversionmdashOne ofthe main goals of the estimation is to obtainmeasures of the level of risk aversion in thepopulation we study Since we use Gibbs sam-pling and augment the latent coefficients ofabsolute risk aversion we can directly obtainthe posterior distribution of various moments ofthis distribution At each iteration of the Gibbssampler we compute the mean standard devi-ation (across individuals) and various percen-tiles of the simulated draws of i and ri and thecorrelation between them The far-right columnof Table 4 reports the averages and standard

34 One may be tempted to interpret the positive wealtheffects as an indirect indication of credit constraints wealth-ier individuals are less credit constrained and therefore canafford to purchase more insurance We do not share thisinterpretation for two reasons First the insurance companyobserves these proxies for wealth and conditions on themwhen setting prices Since the willingness to pay for insur-ance is likely to be correlated with the willingness to pay forthe additional insurance provided by the low deductibleoption premiums already reflect this variation We condi-tion on the level of the premium Second paying less exante implies paying more ex post so applying the creditconstraint argument only to the ex ante payment but not tothe probabilistic ex post deductible payments has no theo-retical foundation Essentially the setup of the model linksthe ex ante decisions to the ex post losses which are bothdriven by the curvature of the vNM utility function

763VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

deviations of these computed quantities over theiterations of the Gibbs sampler35 The impliedrisk aversion of the mean individual is 00019which is about seven times greater than theback-of-the-envelope calculation presented inthe previous section As we assume a lognormal

distribution and estimate a relatively high dis-persion coefficient r the estimates also implysignificantly lower values for the median levelof risk aversion As shown later the qualitativepattern of these results is quite robust acrossspecifications

Table 5 summarizes our findings regardingthe level and dispersion of risk aversion Itpresents two ways to interpret the estimates weobtained from the benchmark model as well ascomparisons to specifications with constant ab-solute risk aversion (CARA) utility and with

35 Note that these estimated quantities are unconditionalIn computing these quantities we integrate over the distri-bution of observable characteristics in the data so onecannot compute these estimates from the estimated param-eter directly

TABLE 5mdashRISK-AVERSION ESTIMATES

Specificationa Absolute risk aversionb Interpretationc Relative risk aversiond

Back-of-the-envelope 10 103 9070 1484Benchmark model

Mean individual 67 103 5605 972225th percentile 23 106 9998 003Median individual 26 105 9974 03775th percentile 29 104 9714 42790th percentile 27 103 7834 390295th percentile 99 103 4937 14327

CARA utilityMean individual 31 103 7651 4436Median individual 34 105 9966 050

Learning modelMean individual 42 103 6886 6140Median individual 56 106 9995 008

Comparable estimatesGertner (1993) 31 104 9699 479Metrick (1995) 66 105 9934 102Holt and Laury (2002)e 32 102 2096 86575Sydnor (2006) 20 103 8329 5395

a The table summarizes the results with respect to the level of risk aversion ldquoBack-of-the-enveloperdquo refers to thecalculation we report in the beginning of Section III ldquobenchmark modelrdquo refers to the results from the benchmark model(Table 4) ldquoCARA utilityrdquo refers to a specification of a CARA utility function and ldquolearning modelrdquo refers to a specificationin which individuals do not know their risk types perfectly (see Section IIID) The last four rows are the closest comparableresults available in the literature

b The second column presents the point estimates for the coefficient of absolute risk aversion converted to $US1 unitsFor the comparable estimates this is given by their estimate of a representative CARA utility maximizer For all otherspecifications this is given by computing the unconditional mean and median in the population using the figures we reportin the mean and median columns of risk aversion in Table 6 (multiplied by 352 the average exchange rate during the periodto convert to US dollars)

c To interpret the absolute risk aversion estimates (ARA) we translate them into x u(w) 1frasl2 u(w 100) 1frasl2 u(w x) That is we report x such that an individual with the estimated ARA is indifferent about participating in a 50ndash50lottery of gaining 100 US dollars and losing x US dollars Note that since our estimate is of absolute risk aversion the quantityx is independent of w To be consistent with the specification we use a quadratic utility function for the back-of-the-envelopebenchmark and learning models and use a CARA utility function for the others

d The last column attempts to translate the ARA estimates into relative risk aversion We follow the literature and do soby multiplying the ARA estimate by average annual income We use the average annual income (after tax) in Israel in 1995(51168 NIS from the Israeli Census) for all our specifications and we use average disposable income in the United Statesin 1987 ($15437) for Gertner (1993) and Metrick (1995) For Holt and Laury (2002) and Sydnor (2006) we use a similarfigure for 2002 ($26974)

e Holt and Laury (2002) do not report a comparable estimate The estimate we provide above is based on estimating aCARA utility model for the 18 subjects in their experiment who participated in the ldquo90rdquo treatment which involved stakescomparable to our setting For these subjects we assume a CARA utility function and a lognormal distribution of theircoefficient of absolute risk aversion The table reports the point estimate of the mean from this distribution

764 THE AMERICAN ECONOMIC REVIEW JUNE 2007

incomplete-information (discussed later) and toother comparable figures in the literature Ourbenchmark estimate suggests that an averagequadratic utility maximizer36 will be indifferentabout participating in a 50ndash50 lottery in whichhe gains $100 or loses $5605 A CARA spec-ification suggests a significantly lower value foraverage risk aversion making the mean individ-ual indifferent about a 50ndash50 lottery of gaining$100 or losing $7651 The results from theincomplete information model for the meanlevel of risk aversion are in between these esti-mates All the results suggest that although themean individual exhibits a significant level ofrisk aversion heterogeneity in risk preferencesis important and the median individual is al-most risk neutral with respect to lotteries of$100 magnitude Although the mean is alwaysgreater than the median under the lognormaldistribution the large difference we find is notimposed In principle we could have obtained ahigh level of risk aversion with less heteroge-neity thereby leading to a smaller differencebetween the mean and the median (the esti-mated distribution of risk types is an example)

Let us briefly discuss the relevance of thecomparison to Gertner (1993) and Metrick(1995) There are two ways one can reconcilethe differences between the estimates Firstboth these papers measure risk aversion fortelevision game show participants these arehighly selected groups in a rather ldquorisk-friendlyrdquo environment37 Second the magni-tudes of the stakes are higher The showparticipants bet on several thousand dollars andmore while our average individual risks muchlower stakes in the range of $100 Thus thedifference in the results may be due to the issuesraised in Rabin (2000) regarding the compara-bility of behavior across different contexts andbet sizes We discuss this further in Section IV

A different way to quantify our estimate is byreporting them in relative terms There is noconsensus in the literature as to the relevantwealth that is taken into account in such deci-

sions Therefore for comparability we followthe closest papers in this respect (eg Gertner1993) and use annual income as the relevantwealth We multiply the estimated coefficient ofabsolute risk aversion by the average annualincome in Israel during the observation periodUnder the (questionable) assumption that an-nual income is a good proxy for the relevantwealth at the time of decision making thisproduct would be a proxy for the coefficient ofrelative risk aversion As Table 5 indicates ourbenchmark specification results in an impliedcoefficient of relative risk aversion of about 97A CARA specification results in a lower coef-ficient of 44 On the other hand the medianestimate for relative risk aversion is well belowone Thus the widely used estimate of a lowsingle-digit coefficient of relative risk aversionfalls somewhere between the median and themean and between the median and the seventy-fifth percentile of the risk aversion distribution

The Relationship between Unobserved Riskand Unobserved Risk AversionmdashTable 4 al-lows us to make observations about the relation-ship between risk and risk aversion We firstdiscuss the relative importance of unobservedheterogeneity of both dimensions In the popu-lation we study unobserved heterogeneity inrisk aversion (r) is much greater than unob-served heterogeneity in risk () This is trueboth in absolute terms (315 compared to 015respectively) and after normalizing by the cor-responding mean level38 using the coefficientof variation as a measure of dispersion (027compared to 01 respectively) It is also true forthe overall unconditional dispersion This couldindicate that selection on risk aversion is moreimportant in our data than adverse selection39 Theright metric to use for such statements is not

36 For such an individual the second-order Taylor ex-pansion we use in Section IIA is exact

37 We suspect that individuals who participate in televi-sion game shows are more adventuresome than the generalpopulation Moreover knowing that the audience mightwish to see them keep betting is likely to further encourageparticipants to take risks

38 All covariates are measured in deviations from theirsample mean so the estimated constant in each equation isthe estimated mean of the left-hand-side variable

39 An additional observation is that given our estimatesobservables account for slightly less than 50 percent of thevariation in ln(ri) but for almost 65 percent of the variation inln(i ) This may seem surprising given the finding that disper-sion in risk aversion is more important and thus should be thefocus of insurance companies However this finding is con-sistent with the conventional wisdom that insurance companiesspend much effort and resources on collecting information thathelps in risk classification but only little effort on informationthat predicts willingness to pay

765VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

entirely clear however as one should projectthese estimated variances onto the same scaleof say willingness to pay or profits Profits forexample are affected directly by risk but not byrisk aversion so the comparison above could bemisleading Therefore we relegate the discus-sion of this issue to Section IIIF where weshow that even when we look at pricing andprofits heterogeneity in risk aversion is moreimportant

Table 4 also indicates a strong and signif-icant positive correlation of 084 between un-observed risk aversion and unobserved riskThis result may be surprising because it isnatural to think that risk aversion with respectto financial decisions is likely to be associatedwith a greater tendency to take precautionsand therefore with lower risk Indeed a recentpaper by Finkelstein and McGarry (2006)supports such intuition by documenting anegative correlation between risk aversionand risk in the market for long-term careinsurance (see also Israel 2005 for the autoinsurance market in Illinois) Our marketmight however be special in ways that couldproduce a positive correlation First in con-trast to most insurance markets where apolicyholderrsquos risk depends on the policy-holderrsquos precautions but not on the precau-tions of others accident risk in the autoinsurance market is a result of an interactionbetween onersquos driving habits and those ofother drivers Second the correlation coeffi-cient may be highly sensitive to the particularway we measure risk and risk aversion Thereare many unobserved omitted factors that arelikely to be related to both dimensions Theintensity of vehicle use for example mightbe an important determinant of risk If indi-viduals who are more risk averse also drivemore miles per year a positive correlationbetween risk and risk aversion could emergeThus our results caution against assumingthat risk and risk aversion are always nega-tively correlated Whether this is the case maydepend on the characteristics of the particularmarket one studies and on the particular mea-sure for risk Indeed one can use estimatedannual mileage to control for one omittedvariable that may potentially work to producea positive correlation between risk aversionand risk Despite its partial coverage in thedata and being considered (by the company)

as unreliable40 controlling for annual mileagereported by policyholders reduces the esti-mated correlation coefficient to 06841 Weview this result as consistent with the possi-bility that underlying unobserved factors thataffect risk play an important role in generat-ing the estimated positive correlation betweenrisk and risk aversion A third explanation forthe positive estimated correlation is the dis-tributional assumption As discussed in Sec-tion IIC the correlation coefficient isprobably the parameter that is most sensitiveto these assumptions Indeed as discussedlater when we change our assumptions aboutthe Poisson process and use an alternativedistribution with extremely thin tails the es-timated correlation coefficient reverses signsWe of course view the assumptions of thebenchmark model as more appropriate andtherefore maintain the view that the data sug-gest a positive correlation Finally note thatwhile the correlation parameter we estimate ishigh the implied unconditional correlationbetween risk and risk aversion is less than025 across all reported specifications This isbecause the coefficients on the same covariate(for example the size of the car or whetherthe car is used for business) often affect riskand risk aversion in opposite directions andbecause of the logarithmic transformation

C Stability of the Risk Aversion Coefficients

Estimating risk preferences is motivated bythe idea that the same (or similar) risk aversionparameter may explain risky decisions acrossmultiple contexts This idea is at the heart of thecurrent debate we mention in the introductionregarding the empirical relevance of expectedutility and whether the standard construct of a

40 Insurance companies typically do not use these self-reported mileage estimates as they are considered unreli-able While companies could verify these estimates at thetime of a claim such reports are hard to enforce Anindividual can always claim that her ex ante estimate waslower than it turned out to be Indeed the estimated elas-ticity of risk with respect to estimated mileage is 006(0012) which seems low suggesting a bias downwardpotentially due to ldquoerrors in variablesrdquo bias

41 The full results from this specification are providedin the online Appendix Other results from this specifi-cation are similar to those we report for the benchmarkspecification

766 THE AMERICAN ECONOMIC REVIEW JUNE 2007

single risk aversion parameter that appliesacross many types of decisions is the right wayto think about consumer behavior To the bestof our knowledge there is no direct empiricalevidence on this issue While we defer much ofthis discussion to the concluding section weverify below that at least over a limited set ofchoices in which we observe the same individ-uals the estimated risk preferences help to ex-plain choices over time and across differentcontexts

Stability across ContextsmdashIdeally onewould like to show that the estimated risk pref-erences are stable across multiple lines of insur-ance and perhaps even across other riskydecisions individuals face Absent data on suchchoices we provide evidence that the estimatedrisk aversion coefficients help in predictingother related risky choices Individuals in ourdata had to make three additional coveragechoices in addition to the main policy choicethat we have analyzed so far They had tochoose whether to purchase coverage for caraudio equipment (bought by 498 percent of thesample of new policyholders) for towing ser-vices (bought by 929 percent) and for wind-shield damage (bought by 954 percent) Thesetypes of coverage are sold by the same companybut are administered by a third party so we donot have information about their realizationTherefore we cannot perform a similar exerciseto the one we perform for the deductible choiceOf course one should not expect these addi-tional coverage choices to be perfectly corre-lated with the deductible choice Even if thesame risk preferences are an important factorfor all of these decisions variation in risk acrosscoverage is also important For example own-ership of an expensive car audio system is likelyto affect the purchase of audio coverage inde-pendent of risk preferences Similarly ownersof old cars may value more coverage for towingservices Risk preferences however should en-ter into all of these coverage choices and there-fore we expect these additional types ofcoverages to be positively correlated with thechoice of a low deductible

We coded these three additional coveragechoices as three dummy variables and use themas additional covariates to verify that they canhelp explain unobserved risk aversion Firstwe add these variables to the probit regression

reported in column 3 of Table 3 They all havea positive statistically and economicallysignificant power in predicting the deductiblechoice42 To verify that these reported correla-tions are not driven by correlation in risks wealso estimate the benchmark model with theseadditional variables as covariates The esti-mated coefficients on the three additional cov-erage choices are positive and two of the threeare significant in both the risk and risk aversionequations they are 0053 (0019) 00002 (0021)and 0057 (0023) in the risk equation and0354 (0064) 0123 (0122) and 0813 (0174)in the risk aversion equation for audio towingand windshield coverage respectively (standarddeviations in parentheses)43 This suggests thatthe estimated risk preferences help in explainingmultiple coverage choices across these relatedcontexts

Stability over TimemdashWe now provide evi-dence that the estimated risk preferences arealso stable over time All the individuals whodecide to renew their policy with the companyafter it expires are at least in principle free tochange their deductible choice However anoverwhelming majority of individuals (morethan 97 percent) do not change their deductiblechoices when they renew Even individuals whoinitially chose the rarely chosen ldquohighrdquo andldquovery highrdquo deductible levels typically do notchange their choices Therefore it is also notsurprising that estimating the model on the firstdeductible choice of individuals and the seconddeductible choice of the same individuals yieldssimilar results as shown at the end of the nextsection Of course changes in deductiblechoices do not necessarily imply changes inrisk preferences Risk may also change overtime and may drive such changes At thesame time while persistence in deductiblechoices over time is consistent with stablerisk preferences it may also be driven bymany other factors For example it is reason-able to think that individuals do not devote

42 They are estimated to increase on average the prob-ability of choosing a low deductible by 0019 (819) 0023(527) and 0041 (758) for audio towing and windshieldcoverage respectively (z stat in parentheses)

43 The full results from this specification are provided in theonline Appendix Other results from this specification aresimilar to those we report for the benchmark specification

767VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

the same amount of thought to their renewaldecision as to their initial choice leading tothe observed persistence This caveat is themain reason why we focus most of the analysisonly on new policies

D Robustness

This section discusses the robustness of themain results Results from all specifications wemention in this section are summarized in Table6 The full results from these specifications arerelegated to the online Appendix

The von NeumannndashMorgenstern Utility Func-tionmdashTo derive the benchmark model we as-sume a negligible third derivative of the vNMutility function This is attractive as it allowsus to summarize risk preferences by a one-dimensional parameter r [u13(w)u(w)]There is however a large literature (eg MilesS Kimball and N Gregory Mankiw 1989) em-phasizing the importance of a (positive) third

derivative of the utility function which leads toprecautionary saving It is therefore importantto address the sensitivity of the results to thisassumption

By allowing a third-order Taylor approxima-tion of equation (4) we obtain

(15) r u13w

uw

p

d 1

d

uw

uw

dh2 dh dl dl

2

6d

which reduces to equation (7) when u(w) 0This illustrates that a positive third derivativeprovides an additional (precautionary) incentiveto insure It also shows that in order to fullydescribe preferences in the presence of a thirdderivative a one-dimensional parameter doesnot suffice one needs to know both r [u13(w)u(w)] and [u(w)u(w)] This makesit less attractive for estimation

TABLE 6mdashROBUSTNESS

Specificationa Sample Obs

Claim risk () Absolute risk aversion (r)

Corr(r ) Mean Median Std dev Mean Median Std dev

Baseline estimatesBenchmark model All new customers 105800 0220 0217 0048 19 103 73 106 0020 0207 0839

The vNM utility functionCARA utility All new customers 105800 0219 0217 0048 87 104 b 98 106 b 0009b 0201 0826

The claim generating processBenchmark model No multiple claims 103260 0182 0180 0040 20 103 28 105 0018 0135 0547Thinner-tail risk distribution All new customers 105800 0205c 0171c 0155c 17 103 19 106 0020 0076 0916

The distribution of risk aversionLower-bound procedure All new customers 105800 mdash mdash mdash 37 104 0 0002 mdash mdash

Incomplete information about riskBenchmark model Experienced drivers 82966 0214 0211 0051 21 103 83 106 0021 0200 0761Benchmark model Inexperienced drivers 22834 0230 0220 0073 30 103 12 107 0032 0186 0572Learning model All new customers 105800 0204 0191 0084 12 103 16 106 0016 0200 0772

Sample SelectionBenchmark model First two years 45739 0244 0235 0066 31 103 26 105 0026 0225 0699Benchmark model Last three years 60061 0203 0201 0043 16 103 34 107 0021 0113 0611Benchmark model Referred by a friend 26434 0213 0205 0065 30 103 84 107 0031 0155 0480Benchmark model Referred by advertising 79366 0219 0216 0051 21 103 76 106 0022 0212 0806Benchmark model Non-stayers 48387 0226 0240 0057 23 103 77 107 0026 0149 0848Benchmark model Stayers 1st choice 57413 0190 0182 0057 29 103 29 105 0024 0152 0463Benchmark model Stayers 2nd choice 57413 0208 0200 0065 30 103 16 105 0026 0211 0637

Note The table presents the key figures from various specifications and subsamples tracing the order they are presented in Section IIID Full results (in the format

of Table 4) from all these specifications are available in the online Appendixa ldquoBenchmark modelrdquo refers to the benchmark specification estimated on various subsamples (the first row replicates the estimates from Table 4) The other

specifications are slightly different and are all described in more detail in the corresponding parts of Section IIIDb The interpretation of r in the CARA model takes a slightly different quantitative meaning when applied to noninfinitesimal lotteries (such as the approximately

$100 stakes we analyze) This is due to the positive third derivative of the CARA utility function compared to the benchmark model in which we assume a small

third derivative Thus these numbers are not fully comparable to the corresponding figures in the other specificationsc The interpretation of in the thinner-tail distribution we estimate is slightly different from the standard Poisson rate which is assumed in the other specifications

Thus these numbers are not fully comparable to the corresponding figures in the other specifications

768 THE AMERICAN ECONOMIC REVIEW JUNE 2007

We can allow for a positive third derivativewithout expanding the dimensionality of pref-erences by imposing a parametric functionalform on the utility function Essentially suchparametric form imposes a relationship between[u13(w)u(w)] and [u(w)u(w)] Two standardforms are those that exhibit CARA and thosethat exhibit constant relative risk aversion(CRRA) CRRA requires us to make additionalassumptions about the relevant (and unob-served) wealth level of each individual makingit less attractive The CARA case is imple-mented below

With CARA utility we substitute u(w) exp(rw) in equation (4) and rearrange toobtain

(16) rp

exprdh exprdl

This equation defines the indifference set Unlikethe benchmark model there is no closed-formrepresentation for ri() a property that makesestimation significantly slower Due to the precau-tionary saving incentive arising from the thirdderivative we can use a lower level of absoluterisk aversion to rationalize the low deductiblechoice given In other words the CARA indif-ference set is flatter in comparison to the bench-mark case depicted in Figure 2 Thus the CARAspecification will generally lead to lower estimatesof the coefficient of absolute risk aversion asshown in Table 6 The mean and dispersion ofestimated absolute risk aversion (but not the me-dian) are smaller by about a factor of two Thegeneral qualitative results however remain thesame As shown in Table 5 the mean individual isquite risk averse with respect to lotteries of $100magnitude the median is close to risk neutral withrespect to such lotteries and heterogeneity in riskaversion is important

The Claim-Generating ProcessmdashIn thebenchmark model we assume that claims aregenerated by a Poisson model This assumptionis both analytically attractive (it is important inderiving equation (4)) and in our view a rea-sonable approximation of the data-generatingprocess for claims The Poisson distribution hasbeen widely used in economics to model acci-dent counts (eg Nancy L Rose 1990 Dionneand Vanasse 1992 Ronald Michener and CarlaTighe 1992) In using this distribution re-

searchers have followed the long history of theuse of Poisson by actuaries and insurance spe-cialists going back to Ladislaus Bortkiewicz(1898)

An important restriction of the Poisson dis-tribution is that its mean and variance are thesame Although some economic studies con-firmed this assumption for particular accidentdata (eg Christopher M Auld et al 2001) it isoften the case that this restriction is falsifiedThe most common deviation from the Poissonrestriction is that of fat tails ie variance that ishigher than the mean This led researchers touse a negative binomial distribution to accom-modate this regularity by introducing a secondparameter which delinks the relationship be-tween the mean and the variance One naturalinterpretation of the fat tails is that of unob-served heterogeneity and the negative binomialdistribution can be viewed as a Gamma mixtureof Poisson processes Consistent with this viewwe assume that the claim-generating processfollows a Poisson process at the individuallevel but allows unobserved heterogeneity inrisk and estimates a lognormal mixture of Pois-son processes which is similar to a negativebinomial The dispersion parameter we esti-mate is a free parameter which is identi-fied by the fatness of the tails of the claimdistribution

While one would like to allow fat tails of theaggregate claim distribution one may criticizethe assumption of a Poisson distribution at theindividual level for having tails that may be ldquotoofatrdquo Recently Jaap H Abbring et al (2003) andMark Israel (2004) provided evidence for neg-ative state dependence in data from auto insur-ance claims similar to the ones we useControlling for heterogeneity across individu-als these papers show that a second claim isless likely than a first This may happen due toexperience rating or perhaps to more carefuldriving and less intense use of a car after anaccident The Poisson distribution assumes thatthe second accident is just as likely so negativestate dependence may suggest thinner tails atthe individual level

To verify that the main results are not sensi-tive to this restriction of the Poisson model weperform two tests First we estimate the bench-mark model on a sample that includes onlyindividuals with one or no claims The model isstill identified using variation in policy durations

769VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

The estimates of the level of risk aversion andits dispersion across individuals remain similarto those of the benchmark model (Table 6) Theestimates of risk and the correlation with riskaversion are significantly lower But this isexpected we selected out of the sample thehigh-risk individuals

As a further test for the sensitivity of theresults to possible negative state dependencewe estimated the model with a different distri-bution at the individual level which gives riseto much thinner tails We could not find anyknown distribution of count variables that givesrise to tails thinner than Poissonrsquos so we madeone up In particular for a given Poisson ratelet pn be the probability of observing n occur-rences The distribution we take to the data issuch that

(17) Prn pn2

yenm0 pm2

Thus this distribution makes the probability ofmultiple claims much lower probably muchmore so than any realistic negative state depen-dence in the data Essentially such a distribu-tion makes the econometrician view individualswith multiple claims as high-risk individualswith much more certainty as it requires muchmore ldquobad luckrdquo for low-risk individuals tohave multiple claims The estimates for thelevel of risk aversion and its dispersion arealmost the same as in the benchmark model(Table 6) The rest of the summary figures aredifferent The difference in the level of risk isnot informative the interpretation of isslightly different given the change in the un-derlying distribution The dispersion in ismuch higher This is a direct result of the thin-tail assumption The dispersion of is identifiedby the tails of the claim distribution at theaggregate level With thinner tails imposed onthe individual distribution more heterogeneityis needed to match the observed tails at theaggregate Finally the correlation coefficient inthis specification changes signs and is now neg-ative and close to 1 This is consistent withour discussion in Section IIC which emphasizesthat the identification of the correlation coeffi-cient is closely tied to the structural assump-tions Once these are changed the estimatedcorrelation coefficient would change too Most

importantly however the mean and dispersionof risk aversion are stable

The Distribution of Absolute Risk Aver-sionmdashIn the benchmark model we assume thatthe coefficient of absolute risk aversion is lognor-mally distributed across individuals Since onlyfew previous studies focused on heterogeneity inrisk preferences there is not much existing evi-dence regarding the distribution of risk prefer-ences The only evidence we are aware of is theexperimental results presented by Steffen Andersenet al (2005) which show a skewed distributionwith a fat right tail which is qualitatively consis-tent with the lognormal distribution we assume

An additional advantage of the normality as-sumption is computational It provides a closed-form conditional distribution allowing us to useonly univariate (rather than bivariate) draws inthe estimation procedure significantly reducingthe computational burden One may be con-cerned about the sensitivity of the results to thisdistributional assumption For example it maydrive the result that the median level of riskaversion is much lower than the mean

Incorporating alternative distributional as-sumptions significantly complicates and slowsthe estimation procedure44 As an alternativewe develop a procedure that we believe pro-vides some guidance as to the sensitivity of theresults to this distributional assumption Thedisadvantage of the procedure is that it cannotaccount for adverse selection Since we foundthat adverse selection is not that important thisexercise is informative The exercise conveysthat the main qualitative results are not drivenby the (fat) tails of the lognormal distributionwe impose Rather they are driven by the highfraction of individuals who chose a low deduct-ible despite being of low risk The model im-plies that such individuals must have a fairlyhigh level of risk aversion

The exercise uses a Gibbs sampler to estimatethe lognormal distribution of i given the co-variates and the observed number of claims Ineach iteration of this Gibbs sampler conditional

44 A simple discrete-type distribution of risk aversion isnot well identified with the given data This is due to thenature of the exercise The distribution is identified bythe binary deductible choice and by the ldquolocalrdquo variation inthe pricing menu This variation is not enough to pin downthe discrete types accurately

770 THE AMERICAN ECONOMIC REVIEW JUNE 2007

on the data and the most recent draws for theindividual irsquos we compute lower bounds forthe level and dispersion of risk aversion (inaddition to generating the draws for ri from theconditional lognormal distribution) To deter-mine the lower bound for the level of risk aver-sion we compute

(18) r i i 0 if choicei 0

max0 pi

idi 113d i

if choicei 1

Namely we make ri as low as possible giventhe assumptions of the model We assume thatindividuals who chose a regular deductible arerisk neutral while individuals who chose a lowdeductible are just indifferent between the twodeductible levels (unless i is high enough inwhich case they are also assumed to be riskneutral) We then compute the average over therirsquos To compute the lower bound for the dis-persion we search for values of rirsquos that areconsistent with the assumptions of the modeland the observed data and that minimize thevariance of ri This turns out to be a simplesearch procedure that is linear in the number ofindividuals

The lower bound of the level of risk aversionthat we obtain from this exercise is 368 104

NIS1 (with standard deviation of 638 106)This level is five times lower than the analogouspoint estimate reported in Table 4 This trans-lates into indifference about participating in a50ndash50 lottery of gaining $100 and losing $885Thus it still suggests a significant level of riskaversion for lotteries of $100 magnitude Simi-larly the conversion to relative risk aversion asin Table 5 implies a relative risk aversion co-efficient of 1883 The result for the lowerbound of the dispersion is 189 103 (withstandard deviation of 816 105) which is tentimes lower than the benchmark estimate Thusthe coefficient of variation for absolute riskaversion declines only by two suggesting ahigh degree of heterogeneity Finally oneshould note that these bounds involve extremeassumptions and are computed separately andtherefore cannot be binding at the same timeThus the ldquocorrectrdquo estimates for both quantities

are likely to be higher closer to the results wereport for the benchmark model

Incomplete Information about RiskmdashThroughout we assume that individuals haveperfect information about their individual-specific objective risk type i This is a strongerassumption than we need Because expectedutility is linear in probabilities it suffices thatindividualsrsquo expected risk rate is equal to theirobjective risk rate ie i E(iIi) i wherei is a random variable representing individualirsquos perceived risk rate and Ii is individual irsquosinformation at the time of the coverage choiceNamely individuals could be uncertain abouttheir risk type but their point estimate has to becorrect

There are several channels through whichincomplete information may operate Let usconsider two such cases First suppose thatindividuals are correct but only on average iethat i i i where E(i) 0 The intuitionfor this case is similar to an ldquoerrors in variablesrdquomodel and in principle will result in an evenless important role for adverse selection Giventhat we find a relatively small effect of adverseselection this bias will not change this conclu-sion This may be even more pronounced ifcorr(i i) 0 which reflects a reasonableassumption of ldquoreversion to the meanrdquo ie thatan individualrsquos estimate of his risk type is someweighted average between his true risk type andthe average risk type of individuals who aresimilar (on observables) to him The conclusionmay go in the other way only if the mistakes goin the other direction according to which indi-viduals who are riskier than average believe thatthey are even more risky than they truly areThis we believe is less plausible

As a test for the sensitivity of the results tothe complete information assumption we per-form two exercises The first exercise estimatesthe benchmark model separately for experi-enced drivers and inexperienced drivers wherewe define an individual to be an experienceddriver if he has ten years or more driving expe-rience This is a conservative definition com-pared to those used in the literature (Chiapporiand Salanie 2000 Cohen 2005) The underlyingassumption consistent with evidence providedin Cohen (2005) is that drivers learn about theirown risk types as they gain more driving expe-rience but that most of this learning occurs

771VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

within the first ten years of driving Thus ourcomplete information model is a more appeal-ing description of experienced drivers Theresults for experienced drivers are almost iden-tical to the results from the benchmark specifi-cation (Table 6) suggesting that the mainresults are not driven by the fraction of individ-uals who are less likely to know their risk typesThe results for inexperienced drivers show asimilar pattern and magnitude Consistent withthe learning hypothesis however they show alarger dispersion in risk preferences which maybe due to incomplete information and there-fore to more random choices of deductiblelevels

The second exercise we perform is a morestructural version of the learning story whichallows even experienced drivers to have incom-plete information regarding their risk types Wedo so by estimating a different specification ofthe model that assumes that individuals areBayesian and update their information abouttheir own (stable) risk types over time usingonly information about the number of claimsthey make each year While we do not observecomplete claims histories of individuals wecan simulate such histories and integrate overthese simulations Thus individualsrsquo informa-tion would be related to their true type andwould be more precise with longer driving his-tories The full modeling and estimation detailsof this specification are provided in the Appen-dix We view this model as an extreme versionof incomplete information as there are manyother sources through which individuals maylearn about their own types and thereby havebetter information about their types than whatwe estimate them to have While the resultsimply that the levels of risk aversion and heter-ogeneity are lower than the benchmark esti-mates (Table 6) the order of magnitude andqualitative pattern are quite similar suggestingthat the main qualitative findings are robust tothe information structure

Sample SelectionmdashThere are two differentways to think about the contract selection pro-cess One possibility is that individuals firstselect an insurer based on advertisement wordof mouth or access to an agent Then individ-uals select the insurance contract from amongseveral contracts the selected insurer offersThis selection process is consistent with the way

the company executives view their business45

Another possibility is that individuals first col-lect information about all available insurancecontracts and then choose their most preferredone According to the industryrsquos conventionalwisdom individuals do not shop much acrossinsurers and therefore this selection processseems less important

The unique characteristics of direct insurersmay attract individuals who are more likely toexperiment with new ways of doing businessand may therefore be less risk averse than thegeneral population In Table 7 we compare thedemographics of our sample of policyholderswith those of the general Israeli populationThis comparison reflects a similar intuitioncompared with the general population our av-erage policyholder is slightly younger moreeducated more likely to be male and less likelyto be married or an immigrant This direction ofselection may also apply to unobserved riskpreferences thereby making our policyholderson average less risk averse than a representa-tive individual This suggests that the level ofrisk aversion that we find may be viewed as alower bound on the level of risk aversion in thegeneral Israeli population

One potential way to model sample selectionis to allow for an additional outside option to beselected For the vast majority of individuals weobserve the outside option is to purchase sim-ilar insurance from competing insurance agen-cies Unfortunately data on the structure ofcompeting contracts their prices and the waythey vary with individual characteristics are un-available This makes us uncomfortable to try tomodel sample selection as results from anysuch model will be driven by our assumptionsrather than by meaningful variation in the dataThe results are still meaningful for two reasonsFirst this is a large population accounting forabout 7 percent of all drivers in Israel Secondto the extent that our estimates suggest higherlevels of risk aversion than previously esti-mated and that the sample selection is likely to

45 As discussed in Section I the company was the firstdirect insurance provider and it offered significantly lowerpremiums than those offered by competitors due to signif-icant cost advantage In Section I we also discuss theliterature which emphasizes that choice of direct insurers isdriven primarily by nonmonetary ldquoamenitiesrdquo

772 THE AMERICAN ECONOMIC REVIEW JUNE 2007

bias these estimates downward the results arestill informative

To assess the magnitude of sample selectionin driving the main results we perform severaltests First we estimate the benchmark modelseparately for the first two years of the compa-nyrsquos operation during which the company wasthe only company to sell insurance directly andfor the remaining three years after additionaldirect insurers entered the market Second weestimate the benchmark model separately forindividuals who were referred to the companyby word of mouth and for those who heardabout the company through advertisement (pri-marily on television) The latter may havesearched more and might be more price sensi-tive Third we estimate the benchmark modelseparately for individuals who renewed theirpolicy with the company and for those who didnot renew It seems likely that individuals whodid not renew are less selected as most of themswitch back to regular insurers who insure themajority of the population Fourth we estimatethe model for the second deductible choicemade by those individuals who renew It couldbe argued that switching to other companies is

costly so outside options are not as important indriving deductible choices for those who renewAll these results are summarized in Table 6 andshow some important differences between thetwo groups within each pair of subsamples prob-ably reflecting selection For all groups howeverthe qualitative pattern of the results and the orderof magnitude and dispersion of risk aversion aresimilar to those of the full sample This is sugges-tive that correcting for sample selection is unlikelyto change the qualitative results

E Caveats

Moral HazardmdashThroughout our analysiswe abstract from moral hazard ie we assumethat i can vary across individuals but is invari-ant to the coverage choice There are two typesof moral hazard that may play a role in thiscontext First individuals with less coveragemay take greater precaution and drive morecarefully thereby reducing their claim risk rateSecond conditional on a claim event peoplewith higher deductibles are less likely to file aclaim there exists a range of claims for which

TABLE 7mdashREPRESENTATIVENESS

Variable Sampleb Populationc Car ownersd

Agea 4114 (1237) 4255 (1801) 4511 (1413)Female 0316 0518 0367Family Single 0143 0233 0067

Married 0780 0651 0838Divorced 0057 0043 0043Widower 0020 0074 0052

Education Elementary 0029 0329 0266High school 0433 0384 0334Technical 0100 0131 0165College 0438 0155 0234

Emigrant 0335 0445 0447

Obs 105800 723615 255435

a For the age variable the only continuous variable in the table we provide both the mean and the standard deviation (inparentheses)

b The figures are derived from Table 1 The family and education variables are renormalized so they add up to one weignore those individuals for whom we do not have family status or education level This is particularly relevant for theeducation variable which is not available for about half of the sample it seems likely that unreported education levels arenot random but associated with lower levels of education This may help in explaining at least some of the gap in educationlevels across the columns

c This column is based on a random sample of the Israeli population as of 1995 We use only adult population ieindividuals who are 18 years of age or older

d This column is based on a subsample of the population sample The data provide information about car ownership onlyat the household level not at the individual level Thus we define an individual as a car owner if (a) the household owns atleast one car and the individual is the head of the household or (b) the household owns at least two cars and the individualis the spouse of the head of the household

773VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

filing is profitable only under a low deductible(the literature often refers to this second effectas ldquoex post moral hazardrdquo)

To the extent that moral hazard exists abstract-ing from it will likely bias our estimates of riskaversion downward To see this note that adjust-ing behavior will help individuals to self-insureagainst uninsured costs Similarly ex post moralhazard would reduce the value of a low deduct-ible as in the event of a claim the gain from alower deductible would sometimes be less thanthe difference between the two deductible levelsBoth these effects will make a low deductible lessattractive requiring individuals to be even morerisk averse than we estimate in order to purchasemore coverage Below we discuss why in ourview abstracting from moral hazard is a reason-able approximation in this setting

It seems reasonable to conjecture that ceterisparibus insured individuals will drive less care-fully than uninsured ones It may also seemreasonable that the existence of a deductiblemay make individuals more careful about smalldamages to their car However when all choicesinclude a deductible and deductibles are similarin their magnitude it seems less likely thatdriving or care behavior will be affected46 Fi-nally to separately identify moral hazard onewould need another dimension of the data overwhich risk types remain fixed but coveragechoices vary exogenously (Israel 2004 Jaap HAbbring et al forthcoming)

We rely on the data to justify why we abstractfrom the second potential effect that of ex postmoral hazard Figure 3 presents data on theclaim amounts and shows that about 99 percentof the claims filed by policyholders with low-deductible policies were for amounts greaterthan the higher deductible level If the distribu-tion of amounts of potential claims does notvary with the deductible choice and if individ-uals file a claim for any loss that exceeds their

deductible level this suggests that 99 percent ofthe claims would have been filed under eitherdeductible choice making the assumption toabstract from moral hazard not very restrictive

Individuals however may choose not to filea claim even when the claim amount exceedsthe deductible level This may happen due toexperience rating which increases future insur-ance premiums These dynamic effects do notdepend on the deductible level at the time of theclaim so they simply enter in an additive wayUsing our data on individuals who renew theirpolicies with the company we can assess howbig the dynamic effects are These data showthat the price effect of a claim lasts for threeyears and is highest when an individual files hissecond claim within a year In such a case hewould face about a 20 percent increase in hisinsurance premium in the subsequent year 10percent in the year after and 5 percent in thethird year after the claim The regular premiumis about twice the regular deductible amount soan upper bound for the dynamic costs is 70percent of the regular deductible In most casesthe actual dynamic costs are much lower thanthis upper bound (the dynamic costs of say thefirst claim within a year are close to zero) Inaddition an individual can opt out of the con-tract and switch to a different insurance pro-vider This is likely to reduce his dynamic costsbecause in Israel unlike in the United Statesand many other countries there is no publicrecord of past claims Therefore insurance pro-viders can take full advantage of past recordsonly for their past customers For this reasonnew customers will of course face higher pre-miums than existing ones but the premium in-crease would not be as high as it would havebeen with the old insurance provider This isdue to the presence of ldquoinnocentrdquo new custom-ers who are pooled together with the switchers(Cohen 2003)47 Using 70 percent as a conser-vative upper bound Figure 3 shows that about93 percent of those claims filed by individualswith a low deductible would have also been46 This assumption is consistent with Cohen and Einav

(2003) who find no evidence for behavioral response tochanges in seat belt laws The following anecdotal obser-vation also supports this In an informal survey we con-ducted among our colleagues all of them were aware of adeductible in their auto insurance policy but fewer than 20percent knew its level This does not imply that 80 percentof our colleagues did not pay attention to their deductiblechoice at the time the choice was made It does implyhowever that their driving or care behavior cannot dependon the deductible level

47 New customers may voluntarily report their claimshistory to their new insurance provider Voluntary disclo-sure of past claims is as may be expected not truthful Ourdata suggest an unconditional claim rate of 02453 in oursample population Our data on claims history as voluntar-ily disclosed by the same individuals suggest a claim rate of00604 which is four times lower

774 THE AMERICAN ECONOMIC REVIEW JUNE 2007

filed with a regular deductible While this is notnegligible it applies for only a small fraction ofindividuals For the vast majority of them the99 percent discussed above is a more appropri-ate benchmark Therefore ex post moral hazardis unlikely to play a major role in this settingand one can abstract from the loss distributionand focus on claim rates as we do in thispaper48

Additional Cost of an AccidentmdashOur modelassumes that in the event of an accident theonly incurred costs are those associated with thedeductible payment In practice however other

transaction costs may be incurred such as thetime spent for appraisal of the damage the costsassociated with renting a replacement car for theduration of a repair etc Such costs could bereadily incorporated into the model To illus-trate we assume that these costs are known inadvance and are given by a constant c (whichcould in principle vary with each individual)Since c will not vary with the chosen level ofdeductible it will not affect the value d andwill enter the empirical model only through itseffect on d In particular equation (7) willchange to

(19) r

p

d 1

d c

and everything else will remain the same

48 We would not be as comfortable with this statementfor the choice of high and very high deductibles which areat much higher levels This is one additional reason to focusonly on the choice between low and regular deductibles

FIGURE 3 CLAIM DISTRIBUTIONS

Notes This figure plots kernel densities of the claim amounts estimated separately depending on the deductible choice Forease of comparison we normalize the claim amounts by the level of the regular deductible (ie the normalization is invariantto the deductible choice) and truncate the distribution at ten (the truncated part which includes a fat tail outside of the figureaccounts for about 25 percent of the distribution and is roughly similar for both deductible choices) The thick line presentsthe distribution of the claim amounts for individuals who chose a low deductible while the thin line does the same for thosewho chose a regular deductible Clearly both distributions are truncated from below at the deductible level The figure showsthat the distributions are fairly similar Assuming that the claim amount distribution is the same the area below the thickerline between 06 and 1 is the fraction of claims that would fall between the two deductible levels and therefore (absentdynamic incentives) would be filed only if a low deductible were chosen This area (between the two dotted vertical lines)amounts to 13 percent implying that the potential bias arising from restricting attention to claim rate (and abstracting fromthe claim distribution) is quite limited As we discuss in the text dynamic incentives due to experience rating may increasethe costs of filing a claim shifting the region in which the deductible choice matters to the right an upper bound to these costsis about 70 percent of the regular deductible covering an area (between the two dashed vertical lines) that integrates to morethan 7 percent Note however that these dynamic incentives are a very conservative upper bound they apply to less than 15percent of the individuals and do not account for the exit option which significantly reduces these dynamic costs

775VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

This implies that in principle such costs willhave no effect on the results of the counterfac-tual exercise we present later The costs willhowever affect the interpretation of the esti-mates of risk aversion In particular instead ofthe distribution of r we will now be estimatingthe distribution of r(d c)d so the reportedestimates of the coefficient of absolute riskaversion will be biased upward The magnitudeof the bias depends on the size of this transac-tion cost c compared to the average deductibled If c is relatively small the bias is negligibleIf however c is as big as the (average) deduct-ible level all our reported estimates of the levelof risk aversion should be divided by two (butthe coefficients on observables which are semi-elasticities will not change) The intuitionwould be similar but more involved if c variesacross individuals but not proportionally to d

Data about such transaction costs are ofcourse not available The following back-of-the-envelope exercise may provide some guidanceas to the magnitude of such costs We collecteddata from the Israeli police about the annual num-bers of accidents49 accidents with fatalities andaccidents with severe injuries in Israel for theyears 1996ndash1999 We then divided these numbersby our estimate of the total number of auto insur-ance claims in Israel (161859)50 We obtain that159 percent of the claims involve reported acci-dents 22 percent involve accidents with severeinjuries and 03 percent involve fatal accidentsThus the majority of claims reflect small unre-ported accidents perhaps suggesting that theseadditional costs of a claim are often not too large

Deviations from Expected Utility TheorymdashThroughout the paper we restrict attention toexpected utility maximizers Despite much ev-idence in the literature against some of the pre-dictions of expected utility theory it still seemsto us the most natural benchmark to specify andone that facilitates comparison to previous stud-ies We note that expected utility theory is as-sumed it is not and cannot be tested within our

framework Given our cross-sectional analysiswhich in principle allows flexible forms ofunobserved heterogeneity in risk preferencesthere are no testable restrictions imposed byexpected utility theory We should also note thatmuch (but not all) of the documented evidenceagainst expected utility theory arises with ex-treme risk probabilities which are close to zeroor one Our data (and our estimates) are basedon risk probabilities that are all in the range of010 to 035 Over this range expected utilityseems to perform better Finally it is importantto stress two points First at the conceptuallevel it is straightforward to use an alternativetheory of decisions under uncertainty If con-ditional on objective risk individuals vary in asingle dimension the same conceptual modeland empirical strategy can be applied All oneneeds to do is to specify the parameter overwhich decisions vary and construct an indiffer-ence set in the space of the specified parameterand (objective) risk types similar to the onepresented in Figure 2 Second any alternativemodel of decisions under uncertainty would re-quire us to take an even stronger view regardingthe parameterized objective function For exam-ple prospect theory (Daniel Kahneman andAmos Tversky 1979) would require us to pa-rameterize not only the curvature of individu-alsrsquo utility functions but also their referencepoints for which there is no natural choice inour context Similar issues arise if we try toapply decision weights (Tversky and PeterWakker 1995) or measures of overconfidencewith respect to driving ability

F Implications for Profits and Pricing

We now look at how firm profits vary withalternative pricing schemes This exercise isinteresting for several reasons First althoughwe do not use supply-side information for esti-mation it shows how one may incorporate suchinformation in estimation Second it illustratesthe conceptual trade-off faced by a monopolistthat operates in a market with adverse selectionAlthough the conceptual trade-off betweenhigher demand and worse selection is wellknown and has been extensively analyzed in thetheoretical literature quantifying it is importantto understand its empirical relevance Finallywe argued earlier that unobserved heterogeneityin risk aversion seems more important than un-

49 An accident is counted in this measure if it was re-ported to the police and a police officer arrived at the scene

50 This is estimated by taking the number of activepolicies at the end of our sample (45871) dividing it by ourbest guess for the share of the market the company had atthe time (7 percent) and multiplying it by the estimatedclaim rate in our data as computed in Table 2 (0245)

776 THE AMERICAN ECONOMIC REVIEW JUNE 2007

observed heterogeneity in risk The current ex-ercise shows that this conclusion also translatesto pricing and profits

Throughout this section we hold fixed thedistribution of risk and risk aversion the firmfaces Since we have little information about thedeterminants of overall demand faced by thefirm and hence how restrictive it is to hold thedistribution fixed we make the simplifying as-sumption that individuals make their choicessequentially They first choose the insuranceprovider by observing only the price of theregular deductible Once they decide to buy apolicy from the insurer they choose the deduct-ible level This seems a reasonable approxima-tion as the regular deductible is the one alwaysadvertised and initially quoted while the otheroptions are revealed only once the potentialcustomer and the insurance sales person ldquogetinto detailsrdquo Consistent with this assumptionwe assume that the regular premium and de-ductible are dictated by competitive conditionsand we focus on the choice of the low deduct-ible and its associated premium

From the companyrsquos standpoint each indi-vidual can be represented by a random draw of(i ri) from the conditional (on observables)distribution of risk and risk aversion

(20) ln i

ln ri Nxi

xi

2 r

r r2

When analyzing the optimal menu to offer suchan individual the company is assumed to berisk neutral and to maximize expected profitsSuppose the company offered only the regulardeductible and premium (dh ph) Let the ex-pected profits from this strategy be 0 Givenour assumptions we proceed by analyzing howthe firmrsquos profits are affected by a choice of anadditional low deductible option (dl pl) withdl dh and pl ph It is easy to use a changein variables and analyze the choice of d dh dl and p pl ph Expected profits arenow given by

(21) maxdp

0 Prri rii d pp

d Eiri rii d p

The trade-off in the companyrsquos decision isstraightforward Each new customer who chooses

the low combination pays an additional p up-front but saves d for each accident she is in-volved in This translates into two effects thatenter the companyrsquos decision problem The first issimilar to a standard pricing problem higher(lower) price difference (deductible difference)p (d) leads to a higher markup (on those indi-viduals who select the low deductible) but tolower quantity (or probability of purchase) asfewer individuals elect to choose the low deduct-ible This effect enters the profit function throughD(d p) Pr(ri ri(i d p)) The secondcomposition effect arises because of adverse se-lection As the price of the low deductible in-creases those individuals who still elect to choosethe low combination are ceteris paribus thosewith higher risk This effect enters through (dp) E(iri ri(i d p)) Its magnitude andsign depend on the relative heterogeneity of i andri and on the correlation between them Sinceneither D(d p) nor (d p) has a closed-form solution we analyze this decision problemgraphically where D(d p) and (d p) arenumerically computed using simulations from thejoint distribution

We illustrate our analysis by using the meanindividual in the data whose expected ln(i)and ln(ri) are 154 and 1181 respectively(based on Table 4) Such an individual has tochoose between a regular contract of ( phdh) (3189 1595) (in NIS) and a lowcontract of ( pl dl) (3381 957) ie (pd) (191 638) Below we discuss severaladditional counterfactual cases First we con-sider a case with a negative correlation be-tween risk and risk aversion (with the samemagnitude ie 084) Second we con-sider cases when the company ignores unob-served heterogeneity in one of the dimensionsie it views individuals as a draw from theestimated marginal distribution on one dimen-sion with the other dimension known and fixedat its estimated mean We do this exercise foreach dimension separately

To get intuition for the different effects Figure2 presents the estimated distribution in the spaceof (i ri) A small increase (decrease) in p (d)shifts the indifference setup to the right therebymaking some marginal individuals who were pre-viously just to the right of it switch to choosingthe regular deductible The demand trade-off isjust the comparison between the marginal loss ofthe company from all the marginal individuals

777VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

who no longer buy higher coverage with thehigher profits made from the inframarginal indi-viduals who still elect to choose higher coverageFigure 2 also helps in illustrating the effect ofadverse selection and the importance of the cor-relation coefficient As the menu shifts to the rightthe positive correlation implies that the marginalindividuals have higher risk than the average Thismeans that ldquolosingrdquo them (namely havingthem buy less coverage) is not as costly forthe insurance company as such individualsare on average more adversely selected Anegative correlation for example would havemade these marginal individuals more valuablethereby decreasing the incentive to increase pricesor deductibles from the current levels

Figure 4 presents the effects of pricing onprofits by varying the low-deductible levelkeeping the premium charged for it fixed at theobserved price (of p 191) It implies that the

current low deductible benefit of 638 NIS re-sults in additional annual profits of about 368NIS per customer This is about 037 percent oftotal operating profits per customer which isabout 1000 NIS Note however that after sub-tracting the administrative and claim-handlingcosts associated with each customer and claimthe relative magnitude of this effect will behigher Note also that the estimates imply thatthe current low-deductible level is suboptimalBy setting a smaller low-deductible benefit ofd 355 NIS (ie increasing the current lowdeductible by 283 NIS) additional profits canbe increased to 659 NIS51 Of course the find-

51 There is no reason of course to limit the choice of thecompany to only one additional deductible level Moredegrees of freedom in choosing the pricing menu will leadto higher profits

FIGURE 4 COUNTERFACTUALSmdashPROFITS

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) We plot the additional profits fromoffering a low deductible as a function of the attractiveness of a low deductible d dh dl fixing its price at the observedlevel The thick solid line presents the counterfactual profits as implied by the estimated benchmark model The other threecurves illustrate how the profits change in response to changes in the assumptions when the correlation between riskand risk aversion is negative (thin solid line) when there is no heterogeneity in risk aversion (dot-dashed line) and whenthere is no heterogeneity in risk (dashed line) The maxima (argmax) of the four curves respectively are 659 (355)714 (583) 0 and 704 (500) The dotted vertical line represents the observed level of d (638) which implies that theadditional profits from the observed low deductible are 368 NIS per policy

778 THE AMERICAN ECONOMIC REVIEW JUNE 2007

ing that the current pricing is suboptimal mayalso be due to a limitation of the model

Consistent with the intuition discussed aboveFigure 4 also shows that the incentive to in-crease prices (or lower the deductible) is higherwith positive correlation When the correlationbetween risk and risk aversion is negative theoptimal low deductible is lower It also showsthat ignoring either dimension of unobservedheterogeneity has an important effect on pric-ing decisions However while ignoring het-erogeneity in risk does not change thequalitative pattern by much ignoring hetero-geneity in risk aversion completely changesthe pricing analysis In fact given the esti-mated coefficients and the observed prices ifindividuals vary only in risk offering a lowdeductible does not increase profits due to ad-verse selection

Figure 5 breaks down the effect on profits bypresenting the pricing effect on the demand forlow deductible D(d p) and on the compo-sition effect (d p) The former is simplygenerated by the distribution of certainty equiv-alents implied by the joint distribution of i andri (Landsberger and Meilijson 1999) It is S-shaped due to the assumption of lognormal dis-tribution The shape of the composition effect isdriven by the relative variance of i and ri andby the correlation coefficient As the estimatesimply that most of the variation in certaintyequivalents is driven by variation in ri thestrong positive correlation implies that the com-position effect is monotonically decreasing inthe deductible level As the low-deductible op-tion becomes more favorable more peoplechoose it with the most risky individuals choos-ing it first The effect of the deductible level onthe composition effect is dramatically differentwhen the correlation between risk and risk aver-sion is negative When this is the case theobserved relationship between the deductiblelevel and the composition effect is mostly re-versed This is because the effect of risk aver-sion dominates that of adverse selection due toits higher variance

IV Concluding Remarks

The paper makes two separate contributionsFirst from a methodological standpoint we layout a conceptual framework through which one

can formulate and estimate a demand system forindividually customized contracts The key datarequirements for this approach are contractchoices individual choice sets and ex post riskrealizations Since such data may be available inmany other contexts the methodological frame-work may be useful to uncover structural pa-rameters in such settings As an example onecould consider annuity data and use guaranteeperiod choices and mortality data to identifyheterogeneity in risk (mortality) and in prefer-ences for wealth after death (Einav Finkelsteinand Paul Schrimpf 2006) Similarly one couldconsider loan data and use down-paymentchoices and default data to identify heterogene-ity in risk (default) and in liquidity (Will Ad-ams Einav and Jonathan D Levin 2006)Second from an economic standpoint we pro-vide a new set of estimates for the degree andheterogeneity of (absolute) risk aversion and itsrelationship with risk We discuss these below

While our estimates of risk aversion help topredict other related insurance decisions it isnatural to ask to what extent these parametersare relevant in other contexts This is essentiallyan empirical question which can be answeredonly by estimating risk aversion parameters fora variety of bet sizes and in a variety of con-texts Since isolating risk preferences in manycontexts is hard such exercises are rare leavingus with no definite answer for the scope ofmarkets for which our estimates may be rele-vant On one hand Rabin (2000) and Rabin andThaler (2001) argue that different decisions inlife are taken in different contexts and thereforemay be subject to different parameters in theutility function On the other hand classicaltheory suggests that each individual has onevalue function over her lifetime wealth so allrisky decisions take into account the same valuefunction and are therefore subject to the samerisk preferences Our view is somewhere inbetween We are more comfortable with extrap-olation of our risk aversion estimates to setupsthat are closer to the auto insurance marketcontext in which these parameters are esti-mated To assess ldquoclosenessrdquo it is important toconsider various factors over which contextsmay differ Such factors may include bet size aswell as informational and behavioral effectssuch as default options framing and rarity of theevents Since we use claim data to identify risktype we leave everything else to be interpreted as

779VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

FIGURE 5 COUNTERFACTUALSmdashSELECTION

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) Parallel to Figure 4 we breakdown the effects on profits to the share of consumers who choose a low deductible (bottom panel) and to the expected riskof this group (top panel) This is presented by the thick solid line for the estimates of the benchmark model As in Figure 4 wealso present these effects for three additional cases when the correlation between risk and risk aversion is negative (thin solidline) when there is no heterogeneity in risk aversion (dot-dashed line) and when there is no heterogeneity in risk (dashedline) The dotted vertical lines represent the observed level of d (638) for which the share of low deductible is 16 percentand their expected risk is 0264 This may be compared with the corresponding figures in Table 2A of 178 and 0309respectively Note however that the figure presents estimated quantities for the average individual in the data while Table2A presents the average quantities in the data so one should not expect the numbers to fit perfectly

780 THE AMERICAN ECONOMIC REVIEW JUNE 2007

risk aversion As an example overconfidence willbe captured by a lower level of estimated riskaversion so if overconfidence is more importantin auto insurance than in health insurance whenextrapolated to health insurance individuals maybehave as if they are more risk averse than weestimate them to be

With this caveat in mind let us discuss ourfour main findings First we find large hetero-geneity in risk preferences across individualsThis heterogeneity is important in various con-texts (a) it cautions against using pure cross-sectional variation to test expected utilitytheory (b) it may create strong selection ofparticipants into particular markets this maymake participants in voluntary markets quitedifferent in their risk preferences from those inmarkets in which participation is either manda-tory or driven by other factors and (c) it maymake the marginal individual quite differentfrom the average one Models in macroeconom-ics and finance which often use a representativeindividual framework may not be able to cap-ture and account for such differences

Our second set of findings concerns the wayrisk aversion relates to observable characteris-tics Our finding that women are more riskaverse than men has been documented in othersettings The finding that risk preferences ex-hibit a U-shaped pattern over the life cycle maybe interesting to explore further in other con-texts Other findings suggest that the estimatedcoefficient of risk aversion increases with ob-servables that are related to income and wealthAs we agree with the widely held belief of thedecreasing absolute risk aversion property ourpreferred interpretation for this finding is thatwealth and income may be endogenous gener-ating the estimated cross-sectional relationshipWhile lower risk aversion may be associatedwith higher propensity to become an entrepre-neur and thereby have higher wealth it mayalso be associated with lower propensity to saveor invest in education affecting wealth the otherway Therefore one important message of thisfinding is that accounting for heterogeneity inpreferences may be important as representativeconsumer models may provide misleading in-terpretations for otherwise natural results

The last two sets of findings concern therelationship between risk and preferences Sincerisk is particular to the context in which it ismeasured these findings may be sensitive to the

market context and may change once risk takesother forms Even within the auto insurancemarket it is important how risk is measuredMoreover risk in the auto insurance marketmay be conceptually different from risk in othermarkets In many markets risk is independentacross individuals In auto insurance howevermuch of the risk depends on coordinationamong drivers and therefore may be more re-lated to relative not absolute characteristicsFor this reason our finding of positive correla-tion between risk and risk aversion can coexistwith findings of negative correlation in othercontexts (Israel 2005 Finkelstein and McGarry2006) The positive correlation we find is alsoconsistent with the fact that the bivariate probittest in our data provides evidence for adverseselection (Cohen 2005) while similar reduced-form tests in other contexts do not

Finally we find that unobserved heterogene-ity in risk preferences is more important thanheterogeneity in risk This may be driven by thecasual evidence that insurance companies exertmuch effort and resources in collecting con-sumer data which are informative about riskclassification but not about preferences52 Weillustrate the empirical importance of our find-ings for the analysis of optimal contracts in autoinsurance The presence of more than one di-mension of unobserved heterogeneity may dra-matically change the nature of these contractsTheory is still not fully developed for suchmultidimensional screening problems as it typ-ically requires a small number of types (Lands-berger and Meilijson 1999) restricts the twodimensions to be independent of each other(Jean-Charles Rochet and Lars A Stole 2002)or assumes that the number of instrumentsavailable to the monopolist is not smaller thanthe dimension of unobserved heterogeneity(Steven Matthews and John Moore 1987 Rich-ard Arnott and Joseph E Stiglitz 1988)53 MarkArmstrong (1999) may be the closest theoreticalwork to the framework suggested here It cannot

52 This choice of data collection efforts may be justifiedif it is easier for such firms to price discriminate based onrisk but harder to price discriminate based on preferencesA common belief is that without cost-based (ie risk-based) justification for prices price discrimination may leadto consumer backlash

53 See also Smart (2000) Villeneuve (2003) and Jullienet al (2007) for related theoretical results

781VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

be directly applied however as it uses simpli-fying linearity assumptions which would behard to impose in the current context Our re-sults indicate that many applications can benefitfrom extending the theory to include the more

general case such as the one analyzed hereSuch a theory may also serve as a guide forusing supply-side restrictions in similar con-texts Our counterfactual analysis is a very pre-liminary start in this direction

APPENDIX A DESCRIPTION OF THE GIBBS SAMPLER

In this appendix we describe the setup of the Gibbs sampler that we use to estimate the modelOne of the main advantages of the Gibbs sampler is its ability to allow for data augmentation oflatent variables (Tanner and Wong 1987) In our context this amounts to augmenting the individual-specific risk aversion and risk type namely i rii1

n as additional parametersWe can write the model as follows

(22) ln i xi i

(23) ln ri xi vi

(24) choicei 1 if ri ri ()0 if ri ri ()

(25) claimsi Poissoni ti

(26) i

vi

iid

N00

2 r

r r2

Let

2 r

r r2 X

x1

xn

0

0 x1

xn

y r and ui i

vi

The set of parameters for which we want to have a posterior distribution is given by uii1

n The prior specifies that are independent of uii1n have a conventional

diffuse prior We adopt a hierarchical prior for uii1N

(27) uii 1n

iid

N0

(28) 1 Wishart2a Q

782 THE AMERICAN ECONOMIC REVIEW JUNE 2007

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 5: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

the results of the paper applicable only to thoseconsumers who seriously consider buying directinsurance Section IIID discusses this selection inmore detail

While we focus primarily on the demand sideof the market by modeling the deductible choicethe supply side (pricing) will be relevant for anycounterfactual exercise as well as for understand-ing the viability of the outside option (which wedo not observe and do not model) During the firsttwo years of the companyrsquos operations the pricesit offered were lower by about 20 percent thanthose offered by other conventional insurersThus due to its differentiation and cost advantagethe company had market power with respect toindividuals who were more sensitive to price thanto the disamenity of not having an agent Thismakes monopolistic screening models apply morenaturally than competitive models of insurance(eg Michael Rothschild and Joseph E Stiglitz1976) During the companyrsquos third year of oper-ation (December 1996 to March 1998) it facedmore competitive conditions when the estab-lished companies trying to fight off the new en-trant lowered the premiums for policies withregular deductibles to the levels offered by thecompany In their remaining period included inthe data the established companies raisedtheir premiums back to previous levels leav-ing the company again with a substantialprice advantage9

For each policy our dataset includes all theinsurerrsquos information about the characteristicsof the policyholder demographic characteris-tics vehicle characteristics and details abouthis driving experience The Appendix providesa list of variables and their definitions andTable 1 provides summary statistics In addi-tion our data include the individual-specificmenu of four deductible and premium combi-nations that the individual was offered (see be-low) the individualrsquos choice from this menuand the realization of risks covered by the pol-icy the length of the period over which it was ineffect the number of claims submitted by the

policyholder and the amounts of the submittedclaims10 Finally we use the zip codes of thepolicyholdersrsquo home addresses11 to augment thedata with proxies for individualsrsquo wealth basedon the Israeli 1995 census12

The policies offered by the insurer (as all pol-icies offered in the studied market) are one-periodpolicies with no commitment on the part of eitherthe insurer or the policyholder13 The policy re-sembles the US version of ldquocomprehensiverdquo in-surance It is not mandatory but it is held by alarge fraction of Israeli car owners (above 70percent according to the companyrsquos executives)The policy does not cover death or injuries to thepolicyholder or to third parties which areinsured through a separate mandatory policyInsurance policies for car audio equipmentwindshield replacement car and towing ser-vices are structured and priced separately Cer-tain types of coverage do not carry a deductibleand are therefore not used in the analysis14

Throughout the paper we use and reportmonetary amounts in current (nominal) NewIsraeli Shekels (NIS) to avoid creating artificialvariation in the data Consequently the follow-ing facts may be useful for interpretation andcomparison with other papers in the literature

9 During this last period two other companies offeringinsurance directly were established Due to first-mover advan-tage (as viewed by the companyrsquos management) which helpedthe company maintain a strong position in the market thesetwo new companies did not affect pricing policies much untilthe end of our observation period Right in the end of thisperiod the studied company acquired one of these entrants

10 Throughout the analysis we make the assumption thatthe main policyholder is the individual who makes the deduct-ible choice Clearly to the extent that this is not always thecase the results should be interpreted accordingly

11 The company has the addresses on record for billingpurposes Although in principle the company could haveused these data for pricing they do not do so

12 The Israeli Central Bureau of Statistics (CBS) associateseach census respondent with a unique ldquostatistical areardquo eachincluding between 1000 and 10000 residents We matchedthese census tracts with zip codes based on street addressesand constructed variables at the zip code level These con-structed variables are available for more than 80 percent of thepolicyholders As a proxy for wealth we use (gross) monthlyincome which is based on self-reported income by censusrespondents augmented (by the CBS) with Social Security data

13 There is a substantial literature that studies the optimaldesign of policies that commit customers to a multiperiodcontract or that include a one-sided commitment of theinsurer to offer the policyholder certain terms in subsequentperiods (Georges Dionne and Pierre Lasserre 1985 RussellCooper and Beth Hayes 1987 Georges Dionne and Neil ADoherty 1994 Igal Hendel and Alessandro Lizzeri 2003)Although such policies are observed in certain countries(Dionne and Vanasse 1992) many insurance markets includ-ing the one we study use only one-period no-commitmentpolicies (Howard Kunreuther and Mark V Pauly 1985)

14 These include auto theft total loss accidents and notldquoat faultrdquo accidents

749VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

The exchange rate between NIS and US dollarsmonotonically increased from 301 in 1995 to414 in 1999 (on average it was 352)15 Annualinflation was about 8 percent on average andcumulative inflation over the observation periodwas 48 percent We will account for these effects

as well as other general trends by using yeardummy variables throughout the analysis

B The Menu of Deductibles and Premiums

Let xi be the vector of characteristics individuali reports to the insurance company After learningxi the insurer offered individual i a menu of fourcontract choices One option offered a ldquoregularrdquodeductible and a ldquoregularrdquo premium The term

15 PPP figures were about 10 percent lower than the nom-inal exchange rates running from 260 in 1995 to 374 in 1999

TABLE 1mdashSUMMARY STATISTICSmdashCOVARIATES

Variable Mean Std dev Min Max

Demographics Age 41137 1237 1806 8943Female 0316 047 0 1Family Single 0143 035 0 1

Married 0779 042 0 1Divorced 0057 023 0 1Widower 0020 014 0 1Refused to say 0001 004 0 1

Education Elementary 0016 012 0 1High school 0230 042 0 1Technical 0053 022 0 1College 0233 042 0 1No response 0468 050 0 1

Emigrant 0335 047 0 1

Car attributes Value (current NIS)a 66958 37377 4000 617000Car age 3952 287 0 14Commercial car 0083 028 0 1Engine size (cc) 1568 385 700 5000

Driving License years 18178 1007 0 63Good driver 0548 050 0 1Any driver 0743 044 0 1Secondary car 0151 036 0 1Business use 0082 027 0 1Estimated mileage (km)b 14031 5891 1000 32200History length 2847 061 0 3Claims history 0060 015 0 2

Young driver Young 0192 039 0 1Gender Male 0113 032 0 1

Female 0079 027 0 1Age 17ndash19 0029 017 0 1

19ndash21 0051 022 0 121ndash24 0089 029 0 124 0022 015 0 1

Experience 1 0042 020 0 11ndash3 0071 026 0 13 0079 027 0 1

Company year First year 0207 041 0 1Second year 0225 042 0 1Third year 0194 040 0 1Fourth year 0178 038 0 1Fifth year 0195 040 0 1

Note The table is based on all 105800 new customers in the dataa The average exchange rate throughout the sample period was approximately $1 per 35 NIS starting at 13 in late 1994

and reaching 14 in late 1999b The estimated mileage is not reported by everyone It is available for only 60422 new customers

750 THE AMERICAN ECONOMIC REVIEW JUNE 2007

regular was used for this deductible level bothbecause it was relatively similar to the deductiblelevels offered by other insurers and because mostpolicyholders chose it The regular premium var-ied across individuals according to some deter-ministic function (unknown to us) pit ft(xi)which was quite stable over time The regulardeductible level was directly linked to the regularpremium according to

(1) dit min12

pit capt

That is it was set to one-half of the regular pre-mium subject to a deductible cap capt whichvaried over time but not across individuals Thepremiums associated with the other options werecomputed by multiplying pit by three differentconstants 106 for ldquolowrdquo deductible 0875 forldquohighrdquo deductible and 08 for ldquovery highrdquo de-ductible The regular deductible dit was con-verted to the other three deductible levels in asimilar way using multipliers of 06 for low 18for high and 26 for very high

There are two main sources of exogenous vari-ation in prices The first arises from companyexperimentation The multipliers described abovewere fixed across individuals and over time formost of the observation period but there was asix-month period during the insurerrsquos first year ofoperation (May 1995 to October 1995) in whichthe insurer experimented with slightly modifiedmultipliers16 This modified formula covers al-most 10 percent of the sample The secondsource of variation arises from discrete ad-justments to the uniform cap The cap variedover time due to inflation competitive condi-tions and as the company gained more expe-rience (Figure 1) The cap was binding for

16 For individuals with low levels of regular premiumsduring the specified period the regular deductible was set at53 percent (instead of 50 percent) of the regular premiumthe low deductible was set at 33 percent (instead of 30percent) of the regular premium and so on

FIGURE 1 VARIATION IN THE DEDUCTIBLE CAP OVER TIME

Notes This figure presents the variation in the deductible cap over time which is one of the main sources of pricing variation inthe data We do not observe the cap directly but it can be calculated from the observed menus The graph plots the maximalregular deductible offered to anyone who bought insurance from the company over a moving seven-day window Thelarge jumps in the graph reflect changes in the deductible cap There are three reasons why the graph is not perfectlysmooth First in a few holiday periods (eg October 1995) there are not enough sales within a seven-day window sonone of those sales hits the cap This gives rise to temporary jumps downward Second the pricing rule applies at thedate of the price quote given to the potential customer Our recorded date is the first date the policy becomes effectiveThe price quote is held for a period of two to four weeks so in periods in which the pricing changes we may still seenew policies sold using earlier quotes made according to a previous pricing regime Finally even within periods of constantcap the maximal deductible varies slightly (variation of less than 05 percent) We do not know the source of this variation

751VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

about a third of the policyholders in our dataAll these individuals would be affected by achange in the cap Much of the variation ofmenus in the data is driven by the exogenousshifts in the uniform deductible cap The un-derlying assumption is that conditional onobservables these sources of variation pri-marily affect the deductible choice of newcustomers but they do not have a significantimpact on the probability of purchasing insurancefrom the company Indeed this assumption holdsin the data with respect to observables there is nodistinguishable difference in the distribution ofobservable characteristics of consumers who buyinsurance just before and those who buy just aftera change in the deductible cap

C Summary Statistics

The top of Table 2A summarizes the deduct-ible menus all are calculated according to theformula described earlier Only 1 percent of thepolicyholders chose the high or very high de-ductible options Therefore for the rest of theanalysis we focus only on the choice betweenregular and low deductible options (chosen by811 and 178 percent of the individuals respec-tively)17 Focusing only on these options doesnot create any selection bias because we do notomit individuals who chose high or very highdeductibles For these individuals we assumethat they chose a regular deductible This as-sumption is consistent with the structural modelwe develop in the next section which impliesthat conditional on choosing high or very highdeductibles an individual would almost alwaysprefer the regular over the low deductible

The bottom of Table 2A as well as Table2B present summary statistics for the policy re-alizations We focus only on claim rates and noton the amounts of the claims This is because anyamount above the higher deductible level is cov-

ered irrespective of the deductible choice and thevast majority of the claims fit in this category (seeSection IIIE) For all these claims the gain fromchoosing a low deductible is the same in the eventof a claim and is equal to the difference betweenthe two deductible levels Therefore the claimamount is rarely relevant for the deductible choice(and likewise for the companyrsquos pricing decisionwe analyze in Section IIIF)

Averaging over all individuals the annualclaim rate was 0245 One can clearly observesome initial evidence of adverse selection Onaverage individuals who chose a low deductiblehad higher claim rates (0309) than those whochose the regular deductible (0232) Those whochose high and very high deductibles had muchlower claim rates (0128 and 0133 respectively)These figures can be interpreted in the context ofthe pricing formula A risk-neutral individual willchoose the low deductible if and only if her claimrate is higher than (pd) (plow pregular)(dregular dlow) When the deductible cap is notbinding which is the case for about two-thirds ofthe sample this ratio is given directly by thepricing formula and is equal to 03 Thus anyindividual with a claim rate higher than 03 willbenefit from buying the additional coverage pro-vided by a low deductible even without any riskaversion The claim data suggest that the offeredmenu is cheaper than an actuarially fair contractfor a nonnegligible part of the population (13percent according to the benchmark estimates re-ported below) This observation is in sharp con-trast to other types of insurance contracts such asappliance warranties which are much more ex-pensive than the actuarially fair price (Rabin andThaler 2001)

II The Empirical Model

A A Model of Deductible Choice

Let wi be individual irsquos wealth (pih di

h) theinsurance contract (premium and deductible re-spectively) with high deductible (pi

l dil) the

insurance contract with low deductible ti theduration of the policy and ui(w) individual irsquosvNM utility function We assume that the num-ber of insurance claims is drawn from a Poissondistribution with an annual claim rate iThrough most of the paper we assume that i isknown to the individual We also assume that iis independent of the deductible choice ie that

17 The small frequency of ldquohighrdquo and ldquovery highrdquochoices provides important information about the lowerends of the risk and risk aversion distributions but (for thatsame reason) makes the analysis sensitive to functionalform Considering these options or the option of not buyinginsurance creates a sharp lower bound on risk aversion forthe majority of the observations making the estimates muchhigher Given that these options are rarely selected how-ever it is not clear to us whether they were regularlymentioned during the insurance sales process renderingtheir use somewhat inappropriate

752 THE AMERICAN ECONOMIC REVIEW JUNE 2007

there is no moral hazard Finally we assumethat in the event of an accident the value of theclaim is greater than di

h We revisit all theseassumptions in Sections IIID and IIIE For therest of this section i subscripts are suppressedfor convenience

In the market we study insurance policies are

typically held for a full year after which theycan be automatically renewed with no commit-ment by either the company or the individualMoreover all auto-insurance policies sold inIsrael can be canceled without prior notice bythe policyholder with premium payments beinglinearly prorated Both the premium and the

TABLE 2AmdashSUMMARY STATISTICSmdashMENUS CHOICES AND OUTCOMES

Variable Obs Mean Std dev Min Max

Menu Deductible (current NIS)a Low 105800 87548 12101 37492 103911Regular 105800 145299 19779 62486 171543High 105800 260802 35291 112475 308778Very high 105800 376305 50853 162464 446013

Premium (current NIS)a Low 105800 338057 91404 132471 1923962Regular 105800 318922 8623 124972 1815058High 105800 279057 75451 109351 1588176Very high 105800 255137 68984 99978 1452046

pd 105800 0328 006 03 18

Realization Choice Low 105800 0178 038 0 1Regular 105800 0811 039 0 1High 105800 0006 008 0 1Very high 105800 0005 007 0 1

Policy termination Active 105800 0150 036 0 1Canceled 105800 0143 035 0 1Expired 105800 0707 046 0 1

Policy duration (years) 105800 0848 028 0005 108Claims All 105800 0208 048 0 5

Low 18799 0280 055 0 5Regular 85840 0194 046 0 5High 654 0109 034 0 3Very high 507 0107 032 0 2

Claims per yearb All 105800 0245 066 0 19882Low 18799 0309 066 0 9264Regular 85840 0232 066 0 19882High 654 0128 062 0 12636Very high 507 0133 050 0 3326

a The average exchange rate throughout the sample period was approximately $1 per 35 NIS starting at 13 in late 1994and reaching 14 in late 1999

b The mean and standard deviation of the claims per year are weighted by the observed policy duration to adjust forvariation in the exposure period These are the maximum likelihood estimates of a simple Poisson model with no covariates

TABLE 2BmdashSUMMARY STATISTICSmdashCONTRACT CHOICES AND REALIZATIONS

Claims Low Regular High Very high Total Share

0 11929 (0193) 49281 (0796) 412 (0007) 299 (0005) 61921 (100) 080341 3124 (0239) 9867 (0755) 47 (0004) 35 (0003) 13073 (100) 016962 565 (0308) 1261 (0688) 4 (0002) 2 (0001) 1832 (100) 002383 71 (0314) 154 (0681) 1 (0004) 0 (0000) 226 (100) 000294 6 (0353) 11 (0647) 0 (0000) 0 (0000) 17 (100) 000025 1 (0500) 1 (0500) 0 (0000) 0 (0000) 2 (100) 000003

Notes The table presents tabulation of choices and number of claims For comparability the figures are computed only forindividuals whose policies lasted at least 09 years (about 73 percent of the data) The bottom rows of Table 2A providedescriptive figures for the full dataset The numbers in parentheses in each cell represent percentages within each row Theright-hand-side column presents the marginal distribution of the number of claims

753VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

probability of a claim are proportional to thelength of the time interval taken into account soit is convenient to think of the contract choice asa commitment for only a short amount of timeThis approach has several advantages First ithelps to account for early cancellations andtruncated policies which together constitute 30percent of the policies in the data18 Second itmakes the deductible choice independent ofother longer-term uncertainties faced by the in-dividual so we can focus on static risk-takingbehavior Third this formulation helps to obtaina simple framework for analysis which is attrac-tive both analytically and computationally19

The expected utility that the individual obtainsfrom the choice of a contract (p d) is given by

(2) v p d

1 tuw pt tuw pt d

We characterize the set of parameters that willmake the individual indifferent between the twooffered contracts This set provides a lower (up-per) bound on the level of risk aversion for indi-viduals who choose the low (high) deductible (fora given ) Thus we analyze the equation v(phdh) v(pl dl) By taking limits with respect to t(and applying LrsquoHopitalrsquos rule) we obtain

(3)

limt30

1

t(u(w pht) u(w plt))

(u(w pht) u(w pht dh)) (u(w plt) u(w plt dl))

pl phuw

uw dl uw dh

or

(4)

pl phuw uw dl uw dh

The last expression has a simple intuition Theright-hand side is the expected gain (in utils) perunit of time from choosing a low deductibleThe left-hand side is the cost of such a choiceper unit of time For the individual to be indif-ferent the expected gains must equal the costs

In our benchmark specification we assumethat the third derivative of the vNM utilityfunction is not too large A Taylor expansion forboth terms on the right-hand side of equation(4) ie u(w d) u(w) du(w) (d22)u13(w) implies that

(5)pl ph

uw dh dluw

1

2dh dldh dlu13w

Let d dh dl 0 p pl ph 0 andd 1frasl2 (dh dl) to obtain

(6)p

duw uw d u13w

or

(7) r u13w

uw

p

d 1

d

where r is the coefficient of absolute risk aver-sion at wealth level w

18 As can be seen in Table 2A 70 percent of the policiesare observed through their full duration (one year) About15 percent are truncated by the end of our observationperiod and the remaining 15 percent are canceled for var-ious reasons such as change in car ownership total-lossaccident or a unilateral decision of the policyholder tochange insurance providers

19 This specification ignores the option value associatedwith not canceling a policy This is not very restrictiveSince experience rating is small and menus do not changeby much this option value is likely to be close to zero Asimple alternative is to assume that individuals behave as ifthey commit for a full year of coverage In such a case themodel will be similar to the one we estimate but willdepend on the functional form of the vNM utility functionand would generally require taking infinite sums (over thepotential realizations for the number of claims within the year)In the special case of quadratic expected utility maximizerswho care only about the mean and variance of the number ofclaims this is easy to solve The result is almost identical to theexpression we subsequently derive in equation (7)

754 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Equation (7) defines an indifference set in thespace of risk and risk aversion which we willrefer to by (r() ) and ((r) r) interchange-ably Both r() and (r) have a closed-formrepresentation a property that will be computa-tionally attractive for estimation20 Both termsare individual specific as they depend on thedeductible menu which varies across individu-als For the rest of the paper we regard eachindividual as associated with a two-dimensionaltype (ri i) An individual with a risk parameter

i who is offered a menu (pih di

h) (pil di

l)will choose the low-deductible contract if andonly if his coefficient of absolute risk aversionsatisfies ri ri(i) Figure 2 presents a graphi-cal illustration

B The Benchmark Econometric Model

The Econometric ModelmdashThe econometricmodel we estimate is fully described by the fiveequations presented in this section Our ob-jective is to estimate the joint distribution of(i ri)mdashthe claim rate and coefficient of abso-lute risk aversionmdashin the population of policy-holders conditional on observables xi The

20 For example estimating the CARA version of themodel (Section IIID) for which r() does not have aclosed-form representation takes almost ten times longer

FIGURE 2 THE INDIVIDUALrsquoS DECISIONmdashA GRAPHICAL ILLUSTRATION

Notes This graph illustrates the individualrsquos decision problem The solid line presents the indifference setmdashequation (7)mdashappliedfor the menu faced by the average individual in the sample Individuals are represented by points in the two-dimensional spaceabove In particular the scattered points are 10000 draws from the joint distribution of risk and risk aversion for the averageindividual (on observables) in the data based on the point estimates of the benchmark model (Table 4) If an individual is eitherto the right of the line (high risk) or above the line (high risk aversion) the low deductible is optimal Adverse selection is capturedby the fact that the line is downward sloping higher-risk individuals require lower levels of risk aversion to choose the lowdeductible Thus in the absence of correlation between risk and risk aversion higher-risk individuals are more likely to choosehigher levels of insurance An individual with i (pidi) will choose a lower deductible even if he is risk neutral ie withprobability one (we do not allow individuals to be risk loving) This does not create an estimation problem because i is notobserved only a posterior distribution for it Any such distribution will have a positive weight on values of i that are below(pidi) Second the indifference set is a function of the menu and in particular of (pidi) and d An increase in (pidi) willshift the set up and to the right and an increase in d will shift the set down and to the left Therefore exogenous shiftsof the menus that make both arguments change in the same direction can make the sets ldquocrossrdquo thereby allowing us toidentify the correlation between risk and risk aversion nonparametrically With positive correlation (shown in the figureby the ldquoright-bendingrdquo shape of the simulated draws) the marginal individuals are relatively high risk therefore creatinga stronger incentive for the insurer to raise the price of the low deductible

755VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

benchmark formulation assumes that (i ri) fol-lows a bivariate lognormal distribution Thuswe can write the model as

(8) ln i xi i

(9) ln ri xi vi

with

(10) i

vi

iid

N00

2 r

r r2

Neither i nor ri is directly observed Thereforewe treat both as latent variables Loosely speak-ing they can be thought of as random effects Weobserve two variables the number of claims andthe deductible choice which are related to thesetwo unobserved components Thus to completeour econometric model we have to specify therelationship between the observed variables andthe latent ones This is done by making two struc-tural assumptions First we assume that the num-ber of claims is drawn from a Poisson distributionnamely

(11) claimsi Poisson(i ti )

where ti is the observed duration of the policySecond we assume that when individuals maketheir deductible choices they follow the theoreti-cal model described in the previous section Themodel implies that individual i chooses the lowdeductible (choicei 1) if and only if ri ri(i)where ri is defined in equation (7) Thus theempirical model for deductible choice is given by

(12) Prchoicei 1 Prri

pi

idi 1

d i

Prexp(xi vi)

pi

exp(xi i)di 1

d i

With no unobserved heterogeneity in risk (i 0) equation (12) reduces to a simple probit In

such a case one can perfectly predict risk from thedata denote it by (xi) and construct an addi-tional covariate ln([pi((xi)di) 1]d i) Giventhe assumption that risk aversion is distributedlognormally running the probit regressionabove and renormalizing the coefficient onthe constructed covariate to 1 (instead ofthe typical normalization of the variance of theerror term to 1) has a structural interpretation withln (ri) as the dependent variable However Cohen(2005) provides evidence of adverse selection inthe data implying the existence of unobservedheterogeneity in risk This makes the simple probitregression misspecified Estimation of the fullmodel is more complicated Once we allow forunobserved heterogeneity in both unobserved riskaversion (vi) and claim rate (i) we have to inte-grate over the two-dimensional region depicted inFigure 2 for estimation

EstimationmdashA natural way to proceed is toestimate the model by maximum likelihoodwhere the likelihood of the data as a function ofthe parameters can be written by integrating outthe latent variables namely

(13) Lclaimsi choicei

Prclaimsi choiceii riPri ri

where is a vector of parameters to be esti-mated While formulating the empirical modelusing likelihood may help our thinking aboutthe data-generating process using maximumlikelihood (or generalized method of moments(GMM)) for estimation is computationally cum-bersome This is because in each iteration itrequires evaluating a separate complex integralfor each individual in the data In contrastMarkov Chain Monte Carlo (MCMC) Gibbssampling is quite attractive in such a case Us-ing data augmentation of latent variables (Mar-tin A Tanner and Wing Hung Wong 1987)according to which we simulate (i ri) and latertreat those simulations as if they are part ofthe data one can avoid evaluating the complexintegrals by sampling from truncated normaldistributions which is significantly less compu-tationally demanding (eg Luc Devroye 1986)This feature combined with the idea of aldquosliced samplerrdquo (Paul Damien John Wake-field and Stephen Walker 1999) to sample from

756 THE AMERICAN ECONOMIC REVIEW JUNE 2007

an unfamiliar posterior distribution makes theuse of a Gibbs sampler quite efficient for ourpurposes Finally the lognormality assumptionimplies that F(ln(i)ri) and F(ln(ri)i) follow a(conditional) normal distribution allowing us torestrict attention to univariate draws furtherreducing the computational burden

The Appendix provides a full description ofthe Gibbs sampler including the conditionaldistributions and the (flat) prior distributions weuse The basic intuition is that conditional onobserving (i ri) for each individual we have asimple linear regression model with two equa-tions The less standard part is to generate drawsfor (i ri) We do this iteratively Conditionalon i the posterior distribution for ln(ri) fol-lows a truncated normal distribution where thetruncation point depends on the menu individ-ual i faces and its direction (from above orbelow) depends on individual irsquos deductiblechoice The final step is to sample from theposterior distribution of ln(i) conditional on riThis is more complicated as we have bothtruncation which arises from adverse selection(just as we do when sampling for ri) and thenumber of claims which provides additionalinformation about the posterior of i Thus theposterior for i takes an unfamiliar form forwhich we use a ldquosliced samplerrdquo

We use 100000 iterations of the Gibbs sam-pler It seems to converge to the stationarydistribution after about 5000 iterations There-fore we drop the first 10000 draws and use thelast 90000 draws of each variable to report ourresults The results are based on the posteriormean and posterior standard deviation fromthese 90000 draws Note that each iterationinvolves generating separate draws of (i ri) foreach individual Performing 100000 iterationsof the benchmark specification (coded in Mat-lab) takes about 60 hours on a Dell Precision530 workstation

C Identification

The parametric version of the model is iden-tified mechanically There are more equationsthan unknowns and no linear dependenciesamong them so (as also verified using MonteCarlo exercises) the model parameters can bebacked out from simulated data Our goal in thissection is not to provide a formal identificationproof Rather we want to provide intuition for

which features of the data allow us to identifyparticular parameters of the model The discus-sion also highlights the assumptions that areessential for identification vis-a-vis those thatare made only for computational convenience(making them in principle testable)

Discussion of Nonparametric Identifica-tionmdashThe main difficulty in identifying themodel arises from the gap between the (exante) risk type i which individuals usewhen choosing a deductible and the (ex post)realization of the number of claims we ob-serve We identify between the variation inrisk types and the variation in risk realizationsusing our parametric distributional assump-tions The key is that the distribution of risktypes can be uniquely backed out from theclaim data alone This allows us to use thedeductible choice as an additional layer ofinformation which identifies unobserved het-erogeneity in risk aversion21 Any distribu-tional assumption that allows us to uniquelyback out the distribution of risk types fromclaim data would be sufficient to identify thedistribution of risk aversion As is customaryin the analysis of count processes we make aparametric assumption that claims are gener-ated by a lognormal mixture of Poisson dis-tributions (Section IIID discusses this furtherand explores an alternative) Using a mixtureenables us to account for adverse selectionthrough unobserved heterogeneity in risk Italso allows us to better fit the tails of theclaim distribution In principle a more flexi-ble mixture or a more flexible claim-generat-ing process could be identified as long as theclaims data are sufficiently rich22

21 Cardon and Hendel (2001) face a similar identificationproblem in the context of health insurance They use vari-ation in coverage choice (analogous to deductible choice) toidentify the variation in health-status signals (analogous torisk types) from the variation in health expenditure (analo-gous to number of claims) They can rely on the coveragechoice to identify this because they make an assumptionregarding unobserved heterogeneity in preferences (iidlogit) We take a different approach as our main goal is toestimate rather than assume the distribution of preferences

22 Although it may seem that the claim data are limited(as they take only integer values between 0 and 5 in ourdata) variation in policy duration generates continuousvariation in the observed claim propensity Of course thisvariation also introduces an additional selection into themodel due to policy cancellations which are potentially

757VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Once the distribution of risk types is iden-tified from claims data the marginal distribu-tion of risk aversion (and its relationship tothe distribution of risk types) is nonparametri-cally identified from the variation in theoffered menus discussed in Section I Thisvariation implies different deductible and pre-mium options to identical (on observables)individuals who purchased insurance at dif-ferent times Different menus lead to differentindifference sets (similar to the one depictedin Figure 2) These sets often cross each otherand nonparametrically identify the distribu-tion of risk aversion and the correlation struc-ture at least within the region in which theindifference sets vary For the tails of thedistribution as is typically the case we haveto rely on parametric assumptions or usebounds The parametric assumption of log-normality we use for most of the paper ismade only for computational convenience

Intuition for the Parametric IdentificationMechanismmdashVariation in the offered menus isimportant for the nonparametric identificationThe parametric assumptions could identify themodel without such variation Thus to keep theintuition simple let us take the bivariate log-normal distribution as given and contrary to thedata assume that all individuals are identical onobservables and face the same menu Supposealso that all individuals are observed for exactlyone year and have up to two claims23 In thissimplified case the model has five parametersto be estimated the mean and variance of risk and

2 the mean and variance of risk aver-sion r and r

2 and the correlation parameter The data can be summarized by five numbersLet 130 131 and 132 1 131 130 be thefraction of individuals with zero one and twoclaims respectively Let 0 1 and 2 be theproportion of individuals who chose a low de-ductible within each ldquoclaim grouprdquo Given ourdistributional assumption about the claim-

generating process we can use 130 and 131 touniquely identify and

2 Loosely isidentified by the average claim rate in the dataand

2 is identified by how fat the tail of theclaim distribution is ie by how large (132131)is compared to (131130) Given and

2 and thelognormality assumption we can (implicitly)construct a posterior distribution of risk typesfor each claim group F(r claims c) andintegrate over it when predicting the deductiblechoice This provides us with three additionalmoments each of the form

(14) Ec Prchoice 1r

dFr claims c dFr

for c 0 1 2 These moments identify thethree remaining parameters of the model rr

2 and Let us now provide more economic content to

the identification argument Using the same ex-ample and conditional on identifying and

2

from the claim data one can think about thedeductible choice data 0 1 2 as a graph(c) The absolute level of the graph identifiesr In the absence of correlation between riskand risk aversion the slope of the graph iden-tifies r

2 with no correlation the slope shouldalways be positive (due to adverse selection)but higher r

2 would imply a flatter graph be-cause more variation in the deductible choiceswill be attributed to variation in risk aversionFinally is identified by the curvature of thegraph The more convex (concave) the graph isthe more positive (negative) is the estimated For example if 0 05 1 051 and 2 099 it is likely that r

2 is high (explaining why0 and 1 are so close) and is highly positive(explaining why 2 is not also close to 1) Incontrast if 0 1 it must mean that thecorrelation between risk and risk aversion isnegative which is the only way the originalpositive correlation induced by adverse selec-tion can be offset This intuition also clarifiesthat the identification of relies on observingindividuals with multiple claims (or differentpolicy durations) and that it is likely to besensitive to the distributional assumptions Thedata (Table 2B) provide a direct (positive) cor-relation between deductible choice and claims

endogenous The results are similar when we use onlyexpired and truncated policies

23 This variation is sufficient (and necessary) to identifythe benchmark model The data provide more variation weobserve up to five claims per individual we observe con-tinuous variation in the policy durations we observe vari-ation in prices and we exploit distributional restrictionsacross individuals with different observable characteristics

758 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The structural assumptions allow us to explainhow much of this correlation can be attributedto adverse selection The remaining correlation(positive or negative) is therefore attributed tocorrelation in the underlying distribution of riskand risk aversion

III Results

A Reduced-Form Estimation

To get an initial sense for the levels of abso-lute risk aversion implied by the data we use asimple back-of-the-envelope exercise We com-pute unconditional averages of p d dh dland d (Table 2A)24 and substitute these valuesin equation (7) The implied coefficient of ab-solute risk aversion is 29 104 NIS125 Thisfigure could be thought of as the average indif-ference point implying that about 18 percent ofthe policyholders have coefficients of absoluterisk aversion exceeding it To convert to USdollar amounts one needs to multiply thesefigures by the average exchange rate (352)resulting in an average indifference point of102 103 $US1 This figure is less than halfof a similar estimate reported by Sydnor (2006)for buyers of homeownerrsquos insurance but about3 and 13 times higher than comparable fig-ures reported by Gertner (1993) and Metrick(1995) respectively for television game showparticipants

Table 3 provides reduced-form analysis ofthe relationship between the observables andour two left-hand-side variables the number ofclaims and the deductible choice26 Column 1reports the estimates from a Poisson regressionof the number of claims on observed character-istics This regression is closely related to therisk equation we estimate in the benchmarkmodel It shows that older people women andpeople with a college education are less likely to

have an accident Bigger more expensiveolder and noncommercial cars are more likelyto be involved in an accident Driving experi-ence and variables associated with less intenseuse of the car reduce accident rates As could beimagined claim propensity is highly correlatedover time past claims are a strong predictor offuture claims Young drivers are 50 to 70 per-cent more likely to be involved in an accidentwith young men significantly more likely thanyoung women Finally as indicated by the trendin the estimated year dummies the accident ratesignificantly declined over time Part of thisdecline is likely due to the decline in accidentrates in Israel in general27 This decline mightalso be partly due to the better selection ofindividuals the company obtained over time asit gained more experience (Cohen 2003)

Columns 2 and 3 of Table 3 present estimatesfrom probit regressions in which the dependentvariable is equal to one if the policyholder chosea low deductible and is equal to zero otherwiseColumn 3 shows the marginal effects of thecovariates on the propensity to choose a lowdeductible These marginal effects do not have astructural interpretation as the choice of lowdeductible depends on its price on risk aver-sion and on risk In this regression we againobserve a strong trend over time Fewer policy-holders chose the low deductible as time wentby One reason for this trend according to thecompany executives is that over time the com-panyrsquos sales persons were directed to focusmainly on the ldquodefaultrdquo regular deductible op-tion28 The effect of other covariates will bediscussed later in the context of the full modelIn unreported probit regressions we also test thequalitative assumptions of the structural model byadding three additional regressors the price ratio(pidi) the average deductible offered d i

24 The unconditional is computed by maximum likeli-hood using the data on claims and observed durations of thepolicies

25 Using the CARA specification as in equation (16) weobtain a slightly lower value of 25 104 NIS1

26 We find positive correlation in the errors of these tworegressions suggesting the existence of adverse selection inthe data and motivating a model with unobserved heteroge-neity in risk This test is similar to the bivariate probit testproposed by Chiappori and Salanie (2000) and replicatesearlier results reported in Cohen (2005)

27 In particular traffic fatalities and counts of trafficaccidents in Israel fell by 11 percent and 18 percent during1998 and 1999 respectively

28 Such biased marketing efforts will bias consumersagainst choosing the low deductible thus making them lookless risk averse This would make our estimate a lowerbound on the true level of risk aversion If only sophisti-cated consumers could see beyond the marketing effort andthis sophistication were related to observables (eg educa-tion) the coefficients on such observables would be biasedupward This is not a major concern given that the coefficientson most of the covariates are fairly stable when we estimatethe benchmark model separately for different years

759VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

TABLE 3mdashNO HETEROGENEITY IN RISK

Variable

Poisson regressiona Probit regressionsc

Dep var Number of claims Dep var 1 if low deductible

(1) (2) (3)

Coeff Std Err IRRb Coeff Std Err dPdX dPdXDemographics Constant 43535 02744 mdash 224994 23562 mdash

Age 00074 00052 09926 01690 00464 00034 00042Age2 989 105 536 105 10001 00017 00005 339 105 455 105Female 00456 00161 09555 08003 01316 00165 00134Family Single omitted omitted omitted

Married 01115 00217 08945 06417 01950 00128 00045Divorced 00717 00346 10743 03191 03127 00064 00044Widower 00155 00527 10156 03441 04573 00069 00025Other (NA) 01347 01755 11442 29518 19836 00511 00370

Education Elementary 00522 00550 09491 02927 04278 00061 00016High school omitted omitted omittedTechnical 00373 00308 10380 05037 02520 00105 00138College 00745 00197 09282 05279 01546 00109 00053Other (NA) 00116 00184 10116 00630 01540 00013 00025

Emigrant 00210 00163 10213 00497 01328 00010 00003

Car attributes Log(Value) 01227 00281 11306 13285 02342 00271 00299Car age 00187 00042 10189 01603 00357 00033 00017Commercial car 01394 00326 08699 09038 02781 00177 00294Log(engine size) 02972 00459 13461 06924 03952 00141 00075

Driving License years 00204 00034 09798 01043 00312 00021 00005License years2 00002 677 105 10002 00015 00005 299 105 167 105

Good driver 00176 00191 09825 07207 01574 00148 00152Any driver 00564 00169 10580 10923 01321 00217 00258Secondary car 00859 00209 09177 00038 01626 00001 00070Business use 01852 00293 12034 07951 02737 00156 00017History length 00527 00110 09486 11218 01450 00228 00171Claims history 06577 00390 19304 04654 05460 00095 00496

Young driver Young driver 05235 00361 16879 28847 06305 00524 00012Gender Male omitted omitted omitted

Female 01475 00288 08629 17959 03135 00396 00195Age 17ndash19 omitted omitted omitted

19ndash21 00701 00532 10726 07800 07509 00153 0014721ndash24 00267 00574 09737 05746 07773 00114 0015624 02082 00567 0812 17869 07328 00397 00136

Experience 1 omitted omitted omitted1ndash3 02416 00458 07854 10175 06577 00217 000093 02827 00532 07538 32513 07386 00762 00410

Company year First year omitted omitted omittedSecond year 00888 00198 09150 44513 01506 00792 00859Third year 00690 00222 09334 85888 01820 01303 01367Fourth year 01817 00232 08339 118277 02102 01616 01734Fifth year 05431 0028 05810 143206 02778 01920 02085

105586Obs 105800 94000 105800Pseudo R2 00162 01364 01296Log likelihood 577459 369595 430869

Significant at the 5 percent confidence levela Maximum likelihood estimates Variation in exposure (policy duration) is accounted forb IRR Incidence rate ratio Each figure should be interpreted as the increasedecrease in claim probability as a result of an increase of one unit in the

right-hand-side variablec There are two separate probit regressions reported in this table Column 2 relies on the deductible choice model and the lognormality assumption As discussed

in the text by including an additional regressor ln([pi((xi)di) 1]d i) (with (xi) predicted from column 1 above) and normalizing its coefficient to 1 we obtaina structural interpretation of this regression with ln (ri) as the dependent variable Thus the reported coefficients are comparable to those estimated for the benchmarkmodel One should be cautious however in interpreting these coefficients Unlike the benchmark model this regression does not allow unobserved heterogeneity inrisk and suffers from some selection bias because observations with high predicted risk rate are omitted (which is why the number of observations is 94000 ratherthan the full sample of 105800) For comparison column 3 reports the marginal effects from a comparable probit regression that uses the full sample and does notcontrol for pricing and predicted risk through the additional structural regressor Column 3 does not have a structural interpretation and its (unreported) coefficientscannot be compared to those estimated from the benchmark model

760 THE AMERICAN ECONOMIC REVIEW JUNE 2007

and the risk rate (xi) as predicted from thePoisson regression of column 1 All three addi-tional regressors enter with the predicted signand with large and highly significant marginaleffects29

Finally column 2 of Table 3 presents animportant specification of the probit regres-sion in which ln([pi(( xi)di) 1]d i) isadded as a regressor and its coefficient isnormalized to 130 As already mentioned ifwe assume that there is no unobserved heter-ogeneity in risk then column 2 is analogousto the ln(ri) equation of the benchmark modelThis restriction of the model is rejected by thedata About 10 percent of the individuals arepredicted to have (xi) (pi di) implying achoice of low deductible for any level of riskaversion Many of these individuals howeverstill choose higher deductible levels Column2 reports the results for the remaining indi-viduals (94000 out of 105800) for which theregression can be run While the signs of theestimated coefficients are similar to those inthe benchmark model presented below therestricted version of the model suggests muchhigher levels and dispersion of risk aversionwell above any reasonable level31 The fullestimation of the benchmark model clearly re-jects this restriction on the model

B Estimation of the Benchmark Model

The Effect of Individual Characteristics onRisk AversionmdashTable 4 presents the estimationresults from the benchmark model The secondcolumn shows how the level of absolute riskaversion is related to individual characteristicsAs the dependent variable is in natural loga-

rithm coefficients on dummy variables can bedirectly interpreted as approximate percentagechanges The vast majority of these coefficientsare quite stable across a wide range of specifi-cations which are mentioned later

The results indicate that women are more riskaverse than men and have a coefficient of ab-solute risk aversion about 20 percent greaterthan that of men These results are consistentwith those of Donkers et al (2001) and Hartoget al (2002) The estimated effect of age sug-gests a nonmonotone pattern of risk preferencesover the life cycle The estimated coefficientsimply that initially (that is at age 18 the youngestindividual in the data) individuals become less riskaverse with age but around the age of 48 indi-viduals start becoming more risk averse32 Mar-ried individuals are estimated to be significantlymore risk averse compared to singles while di-vorced individuals are less (although the coeffi-cient is statistically insignificant)

The analysis suggests that variables that arelikely to be correlated with income or wealthsuch as post-high-school education and thevalue of the car have a positive coefficientindicating that wealthier people have higherlevels of absolute risk aversion Although we donot have data on individualsrsquo income or wealthwe use other proxies for income in additionalspecifications and obtain mixed results Whenwe include as a covariate the average householdincome among those who live in the same zipcode we obtain a significant and negative co-efficient of 0333 (0154) (standard deviationin parentheses) When we match zip code in-come on demographics of the individuals thecoefficient is effectively zero 0036 (0047)and when we use a socioeconomic index ofthe locality in which the individual lives thecoefficient is positive 0127 (0053)33 Thus

29 The estimated marginal effect (z-statistic in parenthe-ses) is 0352 (1376) 16 104 (1481) and 0154(255) for (pidi) d i and (xi) respectively

30 The level is a normalization The sign is estimatedHad the sign on this regressor been positive this would haveimplied a rejection of the model

31 For the implied median level of risk aversion the re-stricted model produces a similar estimate to the estimate wereport below for the benchmark model However since someindividuals who chose the low deductible are estimated to havevery low claim rates the restricted model is ldquoforcedrdquo to estimatevery high risk aversion for these individuals (in contrast thebenchmark model can explain such choices by positive risk re-siduals) resulting in very high dispersion and (due to the lognor-mality assumption) very high average risk aversion which isabout 1025 higher than the benchmark estimates we report below

32 This nonmonotone pattern may explain why age enterswith different signs in the estimation results of Donkers et al(2001) and Hartog et al (2002) A somewhat similar U-shapepattern with respect to age is also reported by Sumit Agarwalet al (2006) in the context of consumer credit markets

33 The full results from these specifications are providedin the online Appendix (available at httpwwwe-aerorgdatajune0720050644_apppdf ) The reason we do not usethese income variables in the benchmark specification istheir imperfect coverage which would require us to omitalmost 20 percent of the individuals Other results fromthese regressions are similar to those we report for thebenchmark specification

761VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

overall we interpret our findings as suggestiveof a nonnegative and perhaps positive associ-ation between incomewealth and absolute riskaversion

At first glance these results may appear tobe inconsistent with the widely held beliefthat absolute risk aversion declines withwealth One should distinguish however be-tween two questions (a) whether for a givenindividual the vNM utility function exhibitsdecreasing absolute risk aversion and (b)how risk preferences vary across individuals

Our results do not speak to the first questionand should not be thought of as a test of thedecreasing absolute risk aversion propertyTesting this property would require observingthe same individual making multiple choicesat different wealth levels Rather our resultsindicate that individuals with greater wealthhave utility functions that involve a greaterdegree of risk aversion It might be thatwealth is endogenous and that risk aversion(or unobserved individual characteristics thatare correlated with it) leads individuals to

TABLE 4mdashTHE BENCHMARK MODEL

Variable Ln() equation Ln(r) equation Additional quantities

Demographics Constant 15406 (00073) 118118 (01032) Var-covar matrix ()

Age 00001 (00026) 00623 (00213) 01498 (00097)Age2 624 106 (263 105) 644 104 (211 104) r 31515 (00773)Female 00006 (00086) 02049 (00643) 08391 (00265)Family Single Omitted Omitted

Married 00198 (00115) 01927 (00974) Unconditional statisticsa

Divorced 00396 (00155) 01754 (01495) Mean 02196 (00013)Widower 00135 (00281) 01320 (02288) Median 02174 (00017)Other (NA) 00557 (00968) 04599 (07397) Std Dev 00483 (00019)

Education Elementary 00194 (00333) 01283 (02156) Mean r 00019 (00002)High school Omitted Omitted Median r 727 106 (756 107)Technical 00017 (00189) 02306 (01341) Std Dev r 00197 (00015)College 00277 (00124) 02177 (00840) Corr(r ) 02067 (00085)Other (NA) 00029 (00107) 00128 (00819)

Emigrant 00030 (00090) 00001 (00651) Obs 105800

Car attributes Log(value) 00794 (00177) 07244 (01272)Car age 00053 (00023) 00411 (00176)Commercial car 00719 (00187) 00313 (01239)Log(engine size) 01299 (00235) 03195 (01847)

Driving License years 00015 (00017) 00157 (00137)License years2 183 105 (351 105) 148 104 (254 104)Good driver 00635 (00112) 00317 (00822)Any driver 00360 (00105) 03000 (00722)Secondary car 00415 (00141) 01209 (00875)Business use 00614 (00134) 03790 (01124)History length 00012 (00052) 03092 (00518)Claims history 01295 (00154) 00459 (01670)

Young driver Young driver 00525 (00253) 02499 (02290)Gender Male Omitted mdash

Female 00355 (00061) mdashAge 17ndash19 Omitted mdash

19ndash21 00387 (00121) mdash21ndash24 00445 (00124) mdash24 00114 (00119) mdash

Experience 1 Omitted mdash1ndash3 00059 (00104) mdash3 00762 (00121) mdash

Company year First year Omitted OmittedSecond year 00771 (00122) 14334 (00853)Third year 00857 (00137) 28459 (01191)Fourth year 01515 (00160) 38089 (01343)Fifth year 04062 (00249) 39525 (01368)

Note Standard deviations based on the draws from the posterior distribution in parentheses Significant at the 5 percent confidence levela Unconditional statistics represent implied quantities for the sample population as a whole ie integrating over the distribution of covariates in the sample (as

well as over the unobserved components)

762 THE AMERICAN ECONOMIC REVIEW JUNE 2007

save more to obtain more education or totake other actions that lead to greaterwealth34

Let us make several additional observationsFirst while owners of more expensive cars ap-pear to have both higher risk exposure andhigher levels of risk aversion owners of biggercars have higher risk exposure but lower levelsof risk aversion This should indicate that thestructure of the model does not constrain therelationship between the coefficients in the twoequations Rather it is the data that ldquospeak uprdquoSecond individuals who are classified by theinsurer as ldquogood driversrdquo indeed have lowerrisk but also appear to have lower risk aversionThis result is somewhat similar to the positivecorrelation between unobserved risk and unob-served risk aversion which we report belowThird policyholders who tend to use the car forbusiness are less risk averse This could bebecause uninsured costs of accidents occurringto such policyholders are tax deductible Fourthpolicyholders who reported three full years ofclaims history are more risk averse but are notdifferent in their risk exposure The attitude thatleads such policyholders to comply with therequest to (voluntarily) report three full years ofclaims history is apparently and not surpris-ingly correlated with higher levels of risk aver-sion In contrast while past claims indicate highrisk they have no significant relationship withrisk aversion Finally we find a strong trendtoward lower levels of risk aversion over timeThis is a replication of the probit results dis-cussed earlier

The Effect of Individual Characteristics onClaim RiskmdashThe first column of Table 4 de-scribes the relationship between observablesand risk exposure Accident risk is higher fordivorced individuals and lower for people witha college education Bigger older more expen-sive and noncommercial cars are all morelikely to be involved in an accident Drivingexperience reduces accident rates as do mea-sures of less intense use of the car while youngdrivers are more exposed to risk Claim propen-sity is highly correlated over time the voluntaryreport of past claims is a strong predictor offuture claims This risk equation produces re-sults that are similar to those of the simplerPoisson regression reported in Table 3 Al-though some of the coefficients lose signifi-cance the magnitude of most coefficients isquite similar The similarity between these twosets of results is to be expected as the riskregression is identified primarily from the dataon claims so incorporating the information ondeductible choice does not qualitatively changethe conceptual identification strategy (see Sec-tion IIC) If the results were not similar thiscould have indicated a misspecification of themodel The slight differences between the riskregressions in Table 3 and Table 4 are drivenmainly by the structural assumptions First thebenchmark model estimates a lognormal mix-ture of Poisson rates rather than a single Pois-son model By incorporating the fatter tails ofthe claim distribution it slightly changes theresults increases the standard errors and de-creases the average predicted claim rate Sec-ond the information on deductible choiceslightly helps us in obtaining more precise es-timates through the correlation structure be-tween the error terms in the two equations

The Implied Level of Risk AversionmdashOne ofthe main goals of the estimation is to obtainmeasures of the level of risk aversion in thepopulation we study Since we use Gibbs sam-pling and augment the latent coefficients ofabsolute risk aversion we can directly obtainthe posterior distribution of various moments ofthis distribution At each iteration of the Gibbssampler we compute the mean standard devi-ation (across individuals) and various percen-tiles of the simulated draws of i and ri and thecorrelation between them The far-right columnof Table 4 reports the averages and standard

34 One may be tempted to interpret the positive wealtheffects as an indirect indication of credit constraints wealth-ier individuals are less credit constrained and therefore canafford to purchase more insurance We do not share thisinterpretation for two reasons First the insurance companyobserves these proxies for wealth and conditions on themwhen setting prices Since the willingness to pay for insur-ance is likely to be correlated with the willingness to pay forthe additional insurance provided by the low deductibleoption premiums already reflect this variation We condi-tion on the level of the premium Second paying less exante implies paying more ex post so applying the creditconstraint argument only to the ex ante payment but not tothe probabilistic ex post deductible payments has no theo-retical foundation Essentially the setup of the model linksthe ex ante decisions to the ex post losses which are bothdriven by the curvature of the vNM utility function

763VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

deviations of these computed quantities over theiterations of the Gibbs sampler35 The impliedrisk aversion of the mean individual is 00019which is about seven times greater than theback-of-the-envelope calculation presented inthe previous section As we assume a lognormal

distribution and estimate a relatively high dis-persion coefficient r the estimates also implysignificantly lower values for the median levelof risk aversion As shown later the qualitativepattern of these results is quite robust acrossspecifications

Table 5 summarizes our findings regardingthe level and dispersion of risk aversion Itpresents two ways to interpret the estimates weobtained from the benchmark model as well ascomparisons to specifications with constant ab-solute risk aversion (CARA) utility and with

35 Note that these estimated quantities are unconditionalIn computing these quantities we integrate over the distri-bution of observable characteristics in the data so onecannot compute these estimates from the estimated param-eter directly

TABLE 5mdashRISK-AVERSION ESTIMATES

Specificationa Absolute risk aversionb Interpretationc Relative risk aversiond

Back-of-the-envelope 10 103 9070 1484Benchmark model

Mean individual 67 103 5605 972225th percentile 23 106 9998 003Median individual 26 105 9974 03775th percentile 29 104 9714 42790th percentile 27 103 7834 390295th percentile 99 103 4937 14327

CARA utilityMean individual 31 103 7651 4436Median individual 34 105 9966 050

Learning modelMean individual 42 103 6886 6140Median individual 56 106 9995 008

Comparable estimatesGertner (1993) 31 104 9699 479Metrick (1995) 66 105 9934 102Holt and Laury (2002)e 32 102 2096 86575Sydnor (2006) 20 103 8329 5395

a The table summarizes the results with respect to the level of risk aversion ldquoBack-of-the-enveloperdquo refers to thecalculation we report in the beginning of Section III ldquobenchmark modelrdquo refers to the results from the benchmark model(Table 4) ldquoCARA utilityrdquo refers to a specification of a CARA utility function and ldquolearning modelrdquo refers to a specificationin which individuals do not know their risk types perfectly (see Section IIID) The last four rows are the closest comparableresults available in the literature

b The second column presents the point estimates for the coefficient of absolute risk aversion converted to $US1 unitsFor the comparable estimates this is given by their estimate of a representative CARA utility maximizer For all otherspecifications this is given by computing the unconditional mean and median in the population using the figures we reportin the mean and median columns of risk aversion in Table 6 (multiplied by 352 the average exchange rate during the periodto convert to US dollars)

c To interpret the absolute risk aversion estimates (ARA) we translate them into x u(w) 1frasl2 u(w 100) 1frasl2 u(w x) That is we report x such that an individual with the estimated ARA is indifferent about participating in a 50ndash50lottery of gaining 100 US dollars and losing x US dollars Note that since our estimate is of absolute risk aversion the quantityx is independent of w To be consistent with the specification we use a quadratic utility function for the back-of-the-envelopebenchmark and learning models and use a CARA utility function for the others

d The last column attempts to translate the ARA estimates into relative risk aversion We follow the literature and do soby multiplying the ARA estimate by average annual income We use the average annual income (after tax) in Israel in 1995(51168 NIS from the Israeli Census) for all our specifications and we use average disposable income in the United Statesin 1987 ($15437) for Gertner (1993) and Metrick (1995) For Holt and Laury (2002) and Sydnor (2006) we use a similarfigure for 2002 ($26974)

e Holt and Laury (2002) do not report a comparable estimate The estimate we provide above is based on estimating aCARA utility model for the 18 subjects in their experiment who participated in the ldquo90rdquo treatment which involved stakescomparable to our setting For these subjects we assume a CARA utility function and a lognormal distribution of theircoefficient of absolute risk aversion The table reports the point estimate of the mean from this distribution

764 THE AMERICAN ECONOMIC REVIEW JUNE 2007

incomplete-information (discussed later) and toother comparable figures in the literature Ourbenchmark estimate suggests that an averagequadratic utility maximizer36 will be indifferentabout participating in a 50ndash50 lottery in whichhe gains $100 or loses $5605 A CARA spec-ification suggests a significantly lower value foraverage risk aversion making the mean individ-ual indifferent about a 50ndash50 lottery of gaining$100 or losing $7651 The results from theincomplete information model for the meanlevel of risk aversion are in between these esti-mates All the results suggest that although themean individual exhibits a significant level ofrisk aversion heterogeneity in risk preferencesis important and the median individual is al-most risk neutral with respect to lotteries of$100 magnitude Although the mean is alwaysgreater than the median under the lognormaldistribution the large difference we find is notimposed In principle we could have obtained ahigh level of risk aversion with less heteroge-neity thereby leading to a smaller differencebetween the mean and the median (the esti-mated distribution of risk types is an example)

Let us briefly discuss the relevance of thecomparison to Gertner (1993) and Metrick(1995) There are two ways one can reconcilethe differences between the estimates Firstboth these papers measure risk aversion fortelevision game show participants these arehighly selected groups in a rather ldquorisk-friendlyrdquo environment37 Second the magni-tudes of the stakes are higher The showparticipants bet on several thousand dollars andmore while our average individual risks muchlower stakes in the range of $100 Thus thedifference in the results may be due to the issuesraised in Rabin (2000) regarding the compara-bility of behavior across different contexts andbet sizes We discuss this further in Section IV

A different way to quantify our estimate is byreporting them in relative terms There is noconsensus in the literature as to the relevantwealth that is taken into account in such deci-

sions Therefore for comparability we followthe closest papers in this respect (eg Gertner1993) and use annual income as the relevantwealth We multiply the estimated coefficient ofabsolute risk aversion by the average annualincome in Israel during the observation periodUnder the (questionable) assumption that an-nual income is a good proxy for the relevantwealth at the time of decision making thisproduct would be a proxy for the coefficient ofrelative risk aversion As Table 5 indicates ourbenchmark specification results in an impliedcoefficient of relative risk aversion of about 97A CARA specification results in a lower coef-ficient of 44 On the other hand the medianestimate for relative risk aversion is well belowone Thus the widely used estimate of a lowsingle-digit coefficient of relative risk aversionfalls somewhere between the median and themean and between the median and the seventy-fifth percentile of the risk aversion distribution

The Relationship between Unobserved Riskand Unobserved Risk AversionmdashTable 4 al-lows us to make observations about the relation-ship between risk and risk aversion We firstdiscuss the relative importance of unobservedheterogeneity of both dimensions In the popu-lation we study unobserved heterogeneity inrisk aversion (r) is much greater than unob-served heterogeneity in risk () This is trueboth in absolute terms (315 compared to 015respectively) and after normalizing by the cor-responding mean level38 using the coefficientof variation as a measure of dispersion (027compared to 01 respectively) It is also true forthe overall unconditional dispersion This couldindicate that selection on risk aversion is moreimportant in our data than adverse selection39 Theright metric to use for such statements is not

36 For such an individual the second-order Taylor ex-pansion we use in Section IIA is exact

37 We suspect that individuals who participate in televi-sion game shows are more adventuresome than the generalpopulation Moreover knowing that the audience mightwish to see them keep betting is likely to further encourageparticipants to take risks

38 All covariates are measured in deviations from theirsample mean so the estimated constant in each equation isthe estimated mean of the left-hand-side variable

39 An additional observation is that given our estimatesobservables account for slightly less than 50 percent of thevariation in ln(ri) but for almost 65 percent of the variation inln(i ) This may seem surprising given the finding that disper-sion in risk aversion is more important and thus should be thefocus of insurance companies However this finding is con-sistent with the conventional wisdom that insurance companiesspend much effort and resources on collecting information thathelps in risk classification but only little effort on informationthat predicts willingness to pay

765VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

entirely clear however as one should projectthese estimated variances onto the same scaleof say willingness to pay or profits Profits forexample are affected directly by risk but not byrisk aversion so the comparison above could bemisleading Therefore we relegate the discus-sion of this issue to Section IIIF where weshow that even when we look at pricing andprofits heterogeneity in risk aversion is moreimportant

Table 4 also indicates a strong and signif-icant positive correlation of 084 between un-observed risk aversion and unobserved riskThis result may be surprising because it isnatural to think that risk aversion with respectto financial decisions is likely to be associatedwith a greater tendency to take precautionsand therefore with lower risk Indeed a recentpaper by Finkelstein and McGarry (2006)supports such intuition by documenting anegative correlation between risk aversionand risk in the market for long-term careinsurance (see also Israel 2005 for the autoinsurance market in Illinois) Our marketmight however be special in ways that couldproduce a positive correlation First in con-trast to most insurance markets where apolicyholderrsquos risk depends on the policy-holderrsquos precautions but not on the precau-tions of others accident risk in the autoinsurance market is a result of an interactionbetween onersquos driving habits and those ofother drivers Second the correlation coeffi-cient may be highly sensitive to the particularway we measure risk and risk aversion Thereare many unobserved omitted factors that arelikely to be related to both dimensions Theintensity of vehicle use for example mightbe an important determinant of risk If indi-viduals who are more risk averse also drivemore miles per year a positive correlationbetween risk and risk aversion could emergeThus our results caution against assumingthat risk and risk aversion are always nega-tively correlated Whether this is the case maydepend on the characteristics of the particularmarket one studies and on the particular mea-sure for risk Indeed one can use estimatedannual mileage to control for one omittedvariable that may potentially work to producea positive correlation between risk aversionand risk Despite its partial coverage in thedata and being considered (by the company)

as unreliable40 controlling for annual mileagereported by policyholders reduces the esti-mated correlation coefficient to 06841 Weview this result as consistent with the possi-bility that underlying unobserved factors thataffect risk play an important role in generat-ing the estimated positive correlation betweenrisk and risk aversion A third explanation forthe positive estimated correlation is the dis-tributional assumption As discussed in Sec-tion IIC the correlation coefficient isprobably the parameter that is most sensitiveto these assumptions Indeed as discussedlater when we change our assumptions aboutthe Poisson process and use an alternativedistribution with extremely thin tails the es-timated correlation coefficient reverses signsWe of course view the assumptions of thebenchmark model as more appropriate andtherefore maintain the view that the data sug-gest a positive correlation Finally note thatwhile the correlation parameter we estimate ishigh the implied unconditional correlationbetween risk and risk aversion is less than025 across all reported specifications This isbecause the coefficients on the same covariate(for example the size of the car or whetherthe car is used for business) often affect riskand risk aversion in opposite directions andbecause of the logarithmic transformation

C Stability of the Risk Aversion Coefficients

Estimating risk preferences is motivated bythe idea that the same (or similar) risk aversionparameter may explain risky decisions acrossmultiple contexts This idea is at the heart of thecurrent debate we mention in the introductionregarding the empirical relevance of expectedutility and whether the standard construct of a

40 Insurance companies typically do not use these self-reported mileage estimates as they are considered unreli-able While companies could verify these estimates at thetime of a claim such reports are hard to enforce Anindividual can always claim that her ex ante estimate waslower than it turned out to be Indeed the estimated elas-ticity of risk with respect to estimated mileage is 006(0012) which seems low suggesting a bias downwardpotentially due to ldquoerrors in variablesrdquo bias

41 The full results from this specification are providedin the online Appendix Other results from this specifi-cation are similar to those we report for the benchmarkspecification

766 THE AMERICAN ECONOMIC REVIEW JUNE 2007

single risk aversion parameter that appliesacross many types of decisions is the right wayto think about consumer behavior To the bestof our knowledge there is no direct empiricalevidence on this issue While we defer much ofthis discussion to the concluding section weverify below that at least over a limited set ofchoices in which we observe the same individ-uals the estimated risk preferences help to ex-plain choices over time and across differentcontexts

Stability across ContextsmdashIdeally onewould like to show that the estimated risk pref-erences are stable across multiple lines of insur-ance and perhaps even across other riskydecisions individuals face Absent data on suchchoices we provide evidence that the estimatedrisk aversion coefficients help in predictingother related risky choices Individuals in ourdata had to make three additional coveragechoices in addition to the main policy choicethat we have analyzed so far They had tochoose whether to purchase coverage for caraudio equipment (bought by 498 percent of thesample of new policyholders) for towing ser-vices (bought by 929 percent) and for wind-shield damage (bought by 954 percent) Thesetypes of coverage are sold by the same companybut are administered by a third party so we donot have information about their realizationTherefore we cannot perform a similar exerciseto the one we perform for the deductible choiceOf course one should not expect these addi-tional coverage choices to be perfectly corre-lated with the deductible choice Even if thesame risk preferences are an important factorfor all of these decisions variation in risk acrosscoverage is also important For example own-ership of an expensive car audio system is likelyto affect the purchase of audio coverage inde-pendent of risk preferences Similarly ownersof old cars may value more coverage for towingservices Risk preferences however should en-ter into all of these coverage choices and there-fore we expect these additional types ofcoverages to be positively correlated with thechoice of a low deductible

We coded these three additional coveragechoices as three dummy variables and use themas additional covariates to verify that they canhelp explain unobserved risk aversion Firstwe add these variables to the probit regression

reported in column 3 of Table 3 They all havea positive statistically and economicallysignificant power in predicting the deductiblechoice42 To verify that these reported correla-tions are not driven by correlation in risks wealso estimate the benchmark model with theseadditional variables as covariates The esti-mated coefficients on the three additional cov-erage choices are positive and two of the threeare significant in both the risk and risk aversionequations they are 0053 (0019) 00002 (0021)and 0057 (0023) in the risk equation and0354 (0064) 0123 (0122) and 0813 (0174)in the risk aversion equation for audio towingand windshield coverage respectively (standarddeviations in parentheses)43 This suggests thatthe estimated risk preferences help in explainingmultiple coverage choices across these relatedcontexts

Stability over TimemdashWe now provide evi-dence that the estimated risk preferences arealso stable over time All the individuals whodecide to renew their policy with the companyafter it expires are at least in principle free tochange their deductible choice However anoverwhelming majority of individuals (morethan 97 percent) do not change their deductiblechoices when they renew Even individuals whoinitially chose the rarely chosen ldquohighrdquo andldquovery highrdquo deductible levels typically do notchange their choices Therefore it is also notsurprising that estimating the model on the firstdeductible choice of individuals and the seconddeductible choice of the same individuals yieldssimilar results as shown at the end of the nextsection Of course changes in deductiblechoices do not necessarily imply changes inrisk preferences Risk may also change overtime and may drive such changes At thesame time while persistence in deductiblechoices over time is consistent with stablerisk preferences it may also be driven bymany other factors For example it is reason-able to think that individuals do not devote

42 They are estimated to increase on average the prob-ability of choosing a low deductible by 0019 (819) 0023(527) and 0041 (758) for audio towing and windshieldcoverage respectively (z stat in parentheses)

43 The full results from this specification are provided in theonline Appendix Other results from this specification aresimilar to those we report for the benchmark specification

767VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

the same amount of thought to their renewaldecision as to their initial choice leading tothe observed persistence This caveat is themain reason why we focus most of the analysisonly on new policies

D Robustness

This section discusses the robustness of themain results Results from all specifications wemention in this section are summarized in Table6 The full results from these specifications arerelegated to the online Appendix

The von NeumannndashMorgenstern Utility Func-tionmdashTo derive the benchmark model we as-sume a negligible third derivative of the vNMutility function This is attractive as it allowsus to summarize risk preferences by a one-dimensional parameter r [u13(w)u(w)]There is however a large literature (eg MilesS Kimball and N Gregory Mankiw 1989) em-phasizing the importance of a (positive) third

derivative of the utility function which leads toprecautionary saving It is therefore importantto address the sensitivity of the results to thisassumption

By allowing a third-order Taylor approxima-tion of equation (4) we obtain

(15) r u13w

uw

p

d 1

d

uw

uw

dh2 dh dl dl

2

6d

which reduces to equation (7) when u(w) 0This illustrates that a positive third derivativeprovides an additional (precautionary) incentiveto insure It also shows that in order to fullydescribe preferences in the presence of a thirdderivative a one-dimensional parameter doesnot suffice one needs to know both r [u13(w)u(w)] and [u(w)u(w)] This makesit less attractive for estimation

TABLE 6mdashROBUSTNESS

Specificationa Sample Obs

Claim risk () Absolute risk aversion (r)

Corr(r ) Mean Median Std dev Mean Median Std dev

Baseline estimatesBenchmark model All new customers 105800 0220 0217 0048 19 103 73 106 0020 0207 0839

The vNM utility functionCARA utility All new customers 105800 0219 0217 0048 87 104 b 98 106 b 0009b 0201 0826

The claim generating processBenchmark model No multiple claims 103260 0182 0180 0040 20 103 28 105 0018 0135 0547Thinner-tail risk distribution All new customers 105800 0205c 0171c 0155c 17 103 19 106 0020 0076 0916

The distribution of risk aversionLower-bound procedure All new customers 105800 mdash mdash mdash 37 104 0 0002 mdash mdash

Incomplete information about riskBenchmark model Experienced drivers 82966 0214 0211 0051 21 103 83 106 0021 0200 0761Benchmark model Inexperienced drivers 22834 0230 0220 0073 30 103 12 107 0032 0186 0572Learning model All new customers 105800 0204 0191 0084 12 103 16 106 0016 0200 0772

Sample SelectionBenchmark model First two years 45739 0244 0235 0066 31 103 26 105 0026 0225 0699Benchmark model Last three years 60061 0203 0201 0043 16 103 34 107 0021 0113 0611Benchmark model Referred by a friend 26434 0213 0205 0065 30 103 84 107 0031 0155 0480Benchmark model Referred by advertising 79366 0219 0216 0051 21 103 76 106 0022 0212 0806Benchmark model Non-stayers 48387 0226 0240 0057 23 103 77 107 0026 0149 0848Benchmark model Stayers 1st choice 57413 0190 0182 0057 29 103 29 105 0024 0152 0463Benchmark model Stayers 2nd choice 57413 0208 0200 0065 30 103 16 105 0026 0211 0637

Note The table presents the key figures from various specifications and subsamples tracing the order they are presented in Section IIID Full results (in the format

of Table 4) from all these specifications are available in the online Appendixa ldquoBenchmark modelrdquo refers to the benchmark specification estimated on various subsamples (the first row replicates the estimates from Table 4) The other

specifications are slightly different and are all described in more detail in the corresponding parts of Section IIIDb The interpretation of r in the CARA model takes a slightly different quantitative meaning when applied to noninfinitesimal lotteries (such as the approximately

$100 stakes we analyze) This is due to the positive third derivative of the CARA utility function compared to the benchmark model in which we assume a small

third derivative Thus these numbers are not fully comparable to the corresponding figures in the other specificationsc The interpretation of in the thinner-tail distribution we estimate is slightly different from the standard Poisson rate which is assumed in the other specifications

Thus these numbers are not fully comparable to the corresponding figures in the other specifications

768 THE AMERICAN ECONOMIC REVIEW JUNE 2007

We can allow for a positive third derivativewithout expanding the dimensionality of pref-erences by imposing a parametric functionalform on the utility function Essentially suchparametric form imposes a relationship between[u13(w)u(w)] and [u(w)u(w)] Two standardforms are those that exhibit CARA and thosethat exhibit constant relative risk aversion(CRRA) CRRA requires us to make additionalassumptions about the relevant (and unob-served) wealth level of each individual makingit less attractive The CARA case is imple-mented below

With CARA utility we substitute u(w) exp(rw) in equation (4) and rearrange toobtain

(16) rp

exprdh exprdl

This equation defines the indifference set Unlikethe benchmark model there is no closed-formrepresentation for ri() a property that makesestimation significantly slower Due to the precau-tionary saving incentive arising from the thirdderivative we can use a lower level of absoluterisk aversion to rationalize the low deductiblechoice given In other words the CARA indif-ference set is flatter in comparison to the bench-mark case depicted in Figure 2 Thus the CARAspecification will generally lead to lower estimatesof the coefficient of absolute risk aversion asshown in Table 6 The mean and dispersion ofestimated absolute risk aversion (but not the me-dian) are smaller by about a factor of two Thegeneral qualitative results however remain thesame As shown in Table 5 the mean individual isquite risk averse with respect to lotteries of $100magnitude the median is close to risk neutral withrespect to such lotteries and heterogeneity in riskaversion is important

The Claim-Generating ProcessmdashIn thebenchmark model we assume that claims aregenerated by a Poisson model This assumptionis both analytically attractive (it is important inderiving equation (4)) and in our view a rea-sonable approximation of the data-generatingprocess for claims The Poisson distribution hasbeen widely used in economics to model acci-dent counts (eg Nancy L Rose 1990 Dionneand Vanasse 1992 Ronald Michener and CarlaTighe 1992) In using this distribution re-

searchers have followed the long history of theuse of Poisson by actuaries and insurance spe-cialists going back to Ladislaus Bortkiewicz(1898)

An important restriction of the Poisson dis-tribution is that its mean and variance are thesame Although some economic studies con-firmed this assumption for particular accidentdata (eg Christopher M Auld et al 2001) it isoften the case that this restriction is falsifiedThe most common deviation from the Poissonrestriction is that of fat tails ie variance that ishigher than the mean This led researchers touse a negative binomial distribution to accom-modate this regularity by introducing a secondparameter which delinks the relationship be-tween the mean and the variance One naturalinterpretation of the fat tails is that of unob-served heterogeneity and the negative binomialdistribution can be viewed as a Gamma mixtureof Poisson processes Consistent with this viewwe assume that the claim-generating processfollows a Poisson process at the individuallevel but allows unobserved heterogeneity inrisk and estimates a lognormal mixture of Pois-son processes which is similar to a negativebinomial The dispersion parameter we esti-mate is a free parameter which is identi-fied by the fatness of the tails of the claimdistribution

While one would like to allow fat tails of theaggregate claim distribution one may criticizethe assumption of a Poisson distribution at theindividual level for having tails that may be ldquotoofatrdquo Recently Jaap H Abbring et al (2003) andMark Israel (2004) provided evidence for neg-ative state dependence in data from auto insur-ance claims similar to the ones we useControlling for heterogeneity across individu-als these papers show that a second claim isless likely than a first This may happen due toexperience rating or perhaps to more carefuldriving and less intense use of a car after anaccident The Poisson distribution assumes thatthe second accident is just as likely so negativestate dependence may suggest thinner tails atthe individual level

To verify that the main results are not sensi-tive to this restriction of the Poisson model weperform two tests First we estimate the bench-mark model on a sample that includes onlyindividuals with one or no claims The model isstill identified using variation in policy durations

769VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

The estimates of the level of risk aversion andits dispersion across individuals remain similarto those of the benchmark model (Table 6) Theestimates of risk and the correlation with riskaversion are significantly lower But this isexpected we selected out of the sample thehigh-risk individuals

As a further test for the sensitivity of theresults to possible negative state dependencewe estimated the model with a different distri-bution at the individual level which gives riseto much thinner tails We could not find anyknown distribution of count variables that givesrise to tails thinner than Poissonrsquos so we madeone up In particular for a given Poisson ratelet pn be the probability of observing n occur-rences The distribution we take to the data issuch that

(17) Prn pn2

yenm0 pm2

Thus this distribution makes the probability ofmultiple claims much lower probably muchmore so than any realistic negative state depen-dence in the data Essentially such a distribu-tion makes the econometrician view individualswith multiple claims as high-risk individualswith much more certainty as it requires muchmore ldquobad luckrdquo for low-risk individuals tohave multiple claims The estimates for thelevel of risk aversion and its dispersion arealmost the same as in the benchmark model(Table 6) The rest of the summary figures aredifferent The difference in the level of risk isnot informative the interpretation of isslightly different given the change in the un-derlying distribution The dispersion in ismuch higher This is a direct result of the thin-tail assumption The dispersion of is identifiedby the tails of the claim distribution at theaggregate level With thinner tails imposed onthe individual distribution more heterogeneityis needed to match the observed tails at theaggregate Finally the correlation coefficient inthis specification changes signs and is now neg-ative and close to 1 This is consistent withour discussion in Section IIC which emphasizesthat the identification of the correlation coeffi-cient is closely tied to the structural assump-tions Once these are changed the estimatedcorrelation coefficient would change too Most

importantly however the mean and dispersionof risk aversion are stable

The Distribution of Absolute Risk Aver-sionmdashIn the benchmark model we assume thatthe coefficient of absolute risk aversion is lognor-mally distributed across individuals Since onlyfew previous studies focused on heterogeneity inrisk preferences there is not much existing evi-dence regarding the distribution of risk prefer-ences The only evidence we are aware of is theexperimental results presented by Steffen Andersenet al (2005) which show a skewed distributionwith a fat right tail which is qualitatively consis-tent with the lognormal distribution we assume

An additional advantage of the normality as-sumption is computational It provides a closed-form conditional distribution allowing us to useonly univariate (rather than bivariate) draws inthe estimation procedure significantly reducingthe computational burden One may be con-cerned about the sensitivity of the results to thisdistributional assumption For example it maydrive the result that the median level of riskaversion is much lower than the mean

Incorporating alternative distributional as-sumptions significantly complicates and slowsthe estimation procedure44 As an alternativewe develop a procedure that we believe pro-vides some guidance as to the sensitivity of theresults to this distributional assumption Thedisadvantage of the procedure is that it cannotaccount for adverse selection Since we foundthat adverse selection is not that important thisexercise is informative The exercise conveysthat the main qualitative results are not drivenby the (fat) tails of the lognormal distributionwe impose Rather they are driven by the highfraction of individuals who chose a low deduct-ible despite being of low risk The model im-plies that such individuals must have a fairlyhigh level of risk aversion

The exercise uses a Gibbs sampler to estimatethe lognormal distribution of i given the co-variates and the observed number of claims Ineach iteration of this Gibbs sampler conditional

44 A simple discrete-type distribution of risk aversion isnot well identified with the given data This is due to thenature of the exercise The distribution is identified bythe binary deductible choice and by the ldquolocalrdquo variation inthe pricing menu This variation is not enough to pin downthe discrete types accurately

770 THE AMERICAN ECONOMIC REVIEW JUNE 2007

on the data and the most recent draws for theindividual irsquos we compute lower bounds forthe level and dispersion of risk aversion (inaddition to generating the draws for ri from theconditional lognormal distribution) To deter-mine the lower bound for the level of risk aver-sion we compute

(18) r i i 0 if choicei 0

max0 pi

idi 113d i

if choicei 1

Namely we make ri as low as possible giventhe assumptions of the model We assume thatindividuals who chose a regular deductible arerisk neutral while individuals who chose a lowdeductible are just indifferent between the twodeductible levels (unless i is high enough inwhich case they are also assumed to be riskneutral) We then compute the average over therirsquos To compute the lower bound for the dis-persion we search for values of rirsquos that areconsistent with the assumptions of the modeland the observed data and that minimize thevariance of ri This turns out to be a simplesearch procedure that is linear in the number ofindividuals

The lower bound of the level of risk aversionthat we obtain from this exercise is 368 104

NIS1 (with standard deviation of 638 106)This level is five times lower than the analogouspoint estimate reported in Table 4 This trans-lates into indifference about participating in a50ndash50 lottery of gaining $100 and losing $885Thus it still suggests a significant level of riskaversion for lotteries of $100 magnitude Simi-larly the conversion to relative risk aversion asin Table 5 implies a relative risk aversion co-efficient of 1883 The result for the lowerbound of the dispersion is 189 103 (withstandard deviation of 816 105) which is tentimes lower than the benchmark estimate Thusthe coefficient of variation for absolute riskaversion declines only by two suggesting ahigh degree of heterogeneity Finally oneshould note that these bounds involve extremeassumptions and are computed separately andtherefore cannot be binding at the same timeThus the ldquocorrectrdquo estimates for both quantities

are likely to be higher closer to the results wereport for the benchmark model

Incomplete Information about RiskmdashThroughout we assume that individuals haveperfect information about their individual-specific objective risk type i This is a strongerassumption than we need Because expectedutility is linear in probabilities it suffices thatindividualsrsquo expected risk rate is equal to theirobjective risk rate ie i E(iIi) i wherei is a random variable representing individualirsquos perceived risk rate and Ii is individual irsquosinformation at the time of the coverage choiceNamely individuals could be uncertain abouttheir risk type but their point estimate has to becorrect

There are several channels through whichincomplete information may operate Let usconsider two such cases First suppose thatindividuals are correct but only on average iethat i i i where E(i) 0 The intuitionfor this case is similar to an ldquoerrors in variablesrdquomodel and in principle will result in an evenless important role for adverse selection Giventhat we find a relatively small effect of adverseselection this bias will not change this conclu-sion This may be even more pronounced ifcorr(i i) 0 which reflects a reasonableassumption of ldquoreversion to the meanrdquo ie thatan individualrsquos estimate of his risk type is someweighted average between his true risk type andthe average risk type of individuals who aresimilar (on observables) to him The conclusionmay go in the other way only if the mistakes goin the other direction according to which indi-viduals who are riskier than average believe thatthey are even more risky than they truly areThis we believe is less plausible

As a test for the sensitivity of the results tothe complete information assumption we per-form two exercises The first exercise estimatesthe benchmark model separately for experi-enced drivers and inexperienced drivers wherewe define an individual to be an experienceddriver if he has ten years or more driving expe-rience This is a conservative definition com-pared to those used in the literature (Chiapporiand Salanie 2000 Cohen 2005) The underlyingassumption consistent with evidence providedin Cohen (2005) is that drivers learn about theirown risk types as they gain more driving expe-rience but that most of this learning occurs

771VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

within the first ten years of driving Thus ourcomplete information model is a more appeal-ing description of experienced drivers Theresults for experienced drivers are almost iden-tical to the results from the benchmark specifi-cation (Table 6) suggesting that the mainresults are not driven by the fraction of individ-uals who are less likely to know their risk typesThe results for inexperienced drivers show asimilar pattern and magnitude Consistent withthe learning hypothesis however they show alarger dispersion in risk preferences which maybe due to incomplete information and there-fore to more random choices of deductiblelevels

The second exercise we perform is a morestructural version of the learning story whichallows even experienced drivers to have incom-plete information regarding their risk types Wedo so by estimating a different specification ofthe model that assumes that individuals areBayesian and update their information abouttheir own (stable) risk types over time usingonly information about the number of claimsthey make each year While we do not observecomplete claims histories of individuals wecan simulate such histories and integrate overthese simulations Thus individualsrsquo informa-tion would be related to their true type andwould be more precise with longer driving his-tories The full modeling and estimation detailsof this specification are provided in the Appen-dix We view this model as an extreme versionof incomplete information as there are manyother sources through which individuals maylearn about their own types and thereby havebetter information about their types than whatwe estimate them to have While the resultsimply that the levels of risk aversion and heter-ogeneity are lower than the benchmark esti-mates (Table 6) the order of magnitude andqualitative pattern are quite similar suggestingthat the main qualitative findings are robust tothe information structure

Sample SelectionmdashThere are two differentways to think about the contract selection pro-cess One possibility is that individuals firstselect an insurer based on advertisement wordof mouth or access to an agent Then individ-uals select the insurance contract from amongseveral contracts the selected insurer offersThis selection process is consistent with the way

the company executives view their business45

Another possibility is that individuals first col-lect information about all available insurancecontracts and then choose their most preferredone According to the industryrsquos conventionalwisdom individuals do not shop much acrossinsurers and therefore this selection processseems less important

The unique characteristics of direct insurersmay attract individuals who are more likely toexperiment with new ways of doing businessand may therefore be less risk averse than thegeneral population In Table 7 we compare thedemographics of our sample of policyholderswith those of the general Israeli populationThis comparison reflects a similar intuitioncompared with the general population our av-erage policyholder is slightly younger moreeducated more likely to be male and less likelyto be married or an immigrant This direction ofselection may also apply to unobserved riskpreferences thereby making our policyholderson average less risk averse than a representa-tive individual This suggests that the level ofrisk aversion that we find may be viewed as alower bound on the level of risk aversion in thegeneral Israeli population

One potential way to model sample selectionis to allow for an additional outside option to beselected For the vast majority of individuals weobserve the outside option is to purchase sim-ilar insurance from competing insurance agen-cies Unfortunately data on the structure ofcompeting contracts their prices and the waythey vary with individual characteristics are un-available This makes us uncomfortable to try tomodel sample selection as results from anysuch model will be driven by our assumptionsrather than by meaningful variation in the dataThe results are still meaningful for two reasonsFirst this is a large population accounting forabout 7 percent of all drivers in Israel Secondto the extent that our estimates suggest higherlevels of risk aversion than previously esti-mated and that the sample selection is likely to

45 As discussed in Section I the company was the firstdirect insurance provider and it offered significantly lowerpremiums than those offered by competitors due to signif-icant cost advantage In Section I we also discuss theliterature which emphasizes that choice of direct insurers isdriven primarily by nonmonetary ldquoamenitiesrdquo

772 THE AMERICAN ECONOMIC REVIEW JUNE 2007

bias these estimates downward the results arestill informative

To assess the magnitude of sample selectionin driving the main results we perform severaltests First we estimate the benchmark modelseparately for the first two years of the compa-nyrsquos operation during which the company wasthe only company to sell insurance directly andfor the remaining three years after additionaldirect insurers entered the market Second weestimate the benchmark model separately forindividuals who were referred to the companyby word of mouth and for those who heardabout the company through advertisement (pri-marily on television) The latter may havesearched more and might be more price sensi-tive Third we estimate the benchmark modelseparately for individuals who renewed theirpolicy with the company and for those who didnot renew It seems likely that individuals whodid not renew are less selected as most of themswitch back to regular insurers who insure themajority of the population Fourth we estimatethe model for the second deductible choicemade by those individuals who renew It couldbe argued that switching to other companies is

costly so outside options are not as important indriving deductible choices for those who renewAll these results are summarized in Table 6 andshow some important differences between thetwo groups within each pair of subsamples prob-ably reflecting selection For all groups howeverthe qualitative pattern of the results and the orderof magnitude and dispersion of risk aversion aresimilar to those of the full sample This is sugges-tive that correcting for sample selection is unlikelyto change the qualitative results

E Caveats

Moral HazardmdashThroughout our analysiswe abstract from moral hazard ie we assumethat i can vary across individuals but is invari-ant to the coverage choice There are two typesof moral hazard that may play a role in thiscontext First individuals with less coveragemay take greater precaution and drive morecarefully thereby reducing their claim risk rateSecond conditional on a claim event peoplewith higher deductibles are less likely to file aclaim there exists a range of claims for which

TABLE 7mdashREPRESENTATIVENESS

Variable Sampleb Populationc Car ownersd

Agea 4114 (1237) 4255 (1801) 4511 (1413)Female 0316 0518 0367Family Single 0143 0233 0067

Married 0780 0651 0838Divorced 0057 0043 0043Widower 0020 0074 0052

Education Elementary 0029 0329 0266High school 0433 0384 0334Technical 0100 0131 0165College 0438 0155 0234

Emigrant 0335 0445 0447

Obs 105800 723615 255435

a For the age variable the only continuous variable in the table we provide both the mean and the standard deviation (inparentheses)

b The figures are derived from Table 1 The family and education variables are renormalized so they add up to one weignore those individuals for whom we do not have family status or education level This is particularly relevant for theeducation variable which is not available for about half of the sample it seems likely that unreported education levels arenot random but associated with lower levels of education This may help in explaining at least some of the gap in educationlevels across the columns

c This column is based on a random sample of the Israeli population as of 1995 We use only adult population ieindividuals who are 18 years of age or older

d This column is based on a subsample of the population sample The data provide information about car ownership onlyat the household level not at the individual level Thus we define an individual as a car owner if (a) the household owns atleast one car and the individual is the head of the household or (b) the household owns at least two cars and the individualis the spouse of the head of the household

773VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

filing is profitable only under a low deductible(the literature often refers to this second effectas ldquoex post moral hazardrdquo)

To the extent that moral hazard exists abstract-ing from it will likely bias our estimates of riskaversion downward To see this note that adjust-ing behavior will help individuals to self-insureagainst uninsured costs Similarly ex post moralhazard would reduce the value of a low deduct-ible as in the event of a claim the gain from alower deductible would sometimes be less thanthe difference between the two deductible levelsBoth these effects will make a low deductible lessattractive requiring individuals to be even morerisk averse than we estimate in order to purchasemore coverage Below we discuss why in ourview abstracting from moral hazard is a reason-able approximation in this setting

It seems reasonable to conjecture that ceterisparibus insured individuals will drive less care-fully than uninsured ones It may also seemreasonable that the existence of a deductiblemay make individuals more careful about smalldamages to their car However when all choicesinclude a deductible and deductibles are similarin their magnitude it seems less likely thatdriving or care behavior will be affected46 Fi-nally to separately identify moral hazard onewould need another dimension of the data overwhich risk types remain fixed but coveragechoices vary exogenously (Israel 2004 Jaap HAbbring et al forthcoming)

We rely on the data to justify why we abstractfrom the second potential effect that of ex postmoral hazard Figure 3 presents data on theclaim amounts and shows that about 99 percentof the claims filed by policyholders with low-deductible policies were for amounts greaterthan the higher deductible level If the distribu-tion of amounts of potential claims does notvary with the deductible choice and if individ-uals file a claim for any loss that exceeds their

deductible level this suggests that 99 percent ofthe claims would have been filed under eitherdeductible choice making the assumption toabstract from moral hazard not very restrictive

Individuals however may choose not to filea claim even when the claim amount exceedsthe deductible level This may happen due toexperience rating which increases future insur-ance premiums These dynamic effects do notdepend on the deductible level at the time of theclaim so they simply enter in an additive wayUsing our data on individuals who renew theirpolicies with the company we can assess howbig the dynamic effects are These data showthat the price effect of a claim lasts for threeyears and is highest when an individual files hissecond claim within a year In such a case hewould face about a 20 percent increase in hisinsurance premium in the subsequent year 10percent in the year after and 5 percent in thethird year after the claim The regular premiumis about twice the regular deductible amount soan upper bound for the dynamic costs is 70percent of the regular deductible In most casesthe actual dynamic costs are much lower thanthis upper bound (the dynamic costs of say thefirst claim within a year are close to zero) Inaddition an individual can opt out of the con-tract and switch to a different insurance pro-vider This is likely to reduce his dynamic costsbecause in Israel unlike in the United Statesand many other countries there is no publicrecord of past claims Therefore insurance pro-viders can take full advantage of past recordsonly for their past customers For this reasonnew customers will of course face higher pre-miums than existing ones but the premium in-crease would not be as high as it would havebeen with the old insurance provider This isdue to the presence of ldquoinnocentrdquo new custom-ers who are pooled together with the switchers(Cohen 2003)47 Using 70 percent as a conser-vative upper bound Figure 3 shows that about93 percent of those claims filed by individualswith a low deductible would have also been46 This assumption is consistent with Cohen and Einav

(2003) who find no evidence for behavioral response tochanges in seat belt laws The following anecdotal obser-vation also supports this In an informal survey we con-ducted among our colleagues all of them were aware of adeductible in their auto insurance policy but fewer than 20percent knew its level This does not imply that 80 percentof our colleagues did not pay attention to their deductiblechoice at the time the choice was made It does implyhowever that their driving or care behavior cannot dependon the deductible level

47 New customers may voluntarily report their claimshistory to their new insurance provider Voluntary disclo-sure of past claims is as may be expected not truthful Ourdata suggest an unconditional claim rate of 02453 in oursample population Our data on claims history as voluntar-ily disclosed by the same individuals suggest a claim rate of00604 which is four times lower

774 THE AMERICAN ECONOMIC REVIEW JUNE 2007

filed with a regular deductible While this is notnegligible it applies for only a small fraction ofindividuals For the vast majority of them the99 percent discussed above is a more appropri-ate benchmark Therefore ex post moral hazardis unlikely to play a major role in this settingand one can abstract from the loss distributionand focus on claim rates as we do in thispaper48

Additional Cost of an AccidentmdashOur modelassumes that in the event of an accident theonly incurred costs are those associated with thedeductible payment In practice however other

transaction costs may be incurred such as thetime spent for appraisal of the damage the costsassociated with renting a replacement car for theduration of a repair etc Such costs could bereadily incorporated into the model To illus-trate we assume that these costs are known inadvance and are given by a constant c (whichcould in principle vary with each individual)Since c will not vary with the chosen level ofdeductible it will not affect the value d andwill enter the empirical model only through itseffect on d In particular equation (7) willchange to

(19) r

p

d 1

d c

and everything else will remain the same

48 We would not be as comfortable with this statementfor the choice of high and very high deductibles which areat much higher levels This is one additional reason to focusonly on the choice between low and regular deductibles

FIGURE 3 CLAIM DISTRIBUTIONS

Notes This figure plots kernel densities of the claim amounts estimated separately depending on the deductible choice Forease of comparison we normalize the claim amounts by the level of the regular deductible (ie the normalization is invariantto the deductible choice) and truncate the distribution at ten (the truncated part which includes a fat tail outside of the figureaccounts for about 25 percent of the distribution and is roughly similar for both deductible choices) The thick line presentsthe distribution of the claim amounts for individuals who chose a low deductible while the thin line does the same for thosewho chose a regular deductible Clearly both distributions are truncated from below at the deductible level The figure showsthat the distributions are fairly similar Assuming that the claim amount distribution is the same the area below the thickerline between 06 and 1 is the fraction of claims that would fall between the two deductible levels and therefore (absentdynamic incentives) would be filed only if a low deductible were chosen This area (between the two dotted vertical lines)amounts to 13 percent implying that the potential bias arising from restricting attention to claim rate (and abstracting fromthe claim distribution) is quite limited As we discuss in the text dynamic incentives due to experience rating may increasethe costs of filing a claim shifting the region in which the deductible choice matters to the right an upper bound to these costsis about 70 percent of the regular deductible covering an area (between the two dashed vertical lines) that integrates to morethan 7 percent Note however that these dynamic incentives are a very conservative upper bound they apply to less than 15percent of the individuals and do not account for the exit option which significantly reduces these dynamic costs

775VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

This implies that in principle such costs willhave no effect on the results of the counterfac-tual exercise we present later The costs willhowever affect the interpretation of the esti-mates of risk aversion In particular instead ofthe distribution of r we will now be estimatingthe distribution of r(d c)d so the reportedestimates of the coefficient of absolute riskaversion will be biased upward The magnitudeof the bias depends on the size of this transac-tion cost c compared to the average deductibled If c is relatively small the bias is negligibleIf however c is as big as the (average) deduct-ible level all our reported estimates of the levelof risk aversion should be divided by two (butthe coefficients on observables which are semi-elasticities will not change) The intuitionwould be similar but more involved if c variesacross individuals but not proportionally to d

Data about such transaction costs are ofcourse not available The following back-of-the-envelope exercise may provide some guidanceas to the magnitude of such costs We collecteddata from the Israeli police about the annual num-bers of accidents49 accidents with fatalities andaccidents with severe injuries in Israel for theyears 1996ndash1999 We then divided these numbersby our estimate of the total number of auto insur-ance claims in Israel (161859)50 We obtain that159 percent of the claims involve reported acci-dents 22 percent involve accidents with severeinjuries and 03 percent involve fatal accidentsThus the majority of claims reflect small unre-ported accidents perhaps suggesting that theseadditional costs of a claim are often not too large

Deviations from Expected Utility TheorymdashThroughout the paper we restrict attention toexpected utility maximizers Despite much ev-idence in the literature against some of the pre-dictions of expected utility theory it still seemsto us the most natural benchmark to specify andone that facilitates comparison to previous stud-ies We note that expected utility theory is as-sumed it is not and cannot be tested within our

framework Given our cross-sectional analysiswhich in principle allows flexible forms ofunobserved heterogeneity in risk preferencesthere are no testable restrictions imposed byexpected utility theory We should also note thatmuch (but not all) of the documented evidenceagainst expected utility theory arises with ex-treme risk probabilities which are close to zeroor one Our data (and our estimates) are basedon risk probabilities that are all in the range of010 to 035 Over this range expected utilityseems to perform better Finally it is importantto stress two points First at the conceptuallevel it is straightforward to use an alternativetheory of decisions under uncertainty If con-ditional on objective risk individuals vary in asingle dimension the same conceptual modeland empirical strategy can be applied All oneneeds to do is to specify the parameter overwhich decisions vary and construct an indiffer-ence set in the space of the specified parameterand (objective) risk types similar to the onepresented in Figure 2 Second any alternativemodel of decisions under uncertainty would re-quire us to take an even stronger view regardingthe parameterized objective function For exam-ple prospect theory (Daniel Kahneman andAmos Tversky 1979) would require us to pa-rameterize not only the curvature of individu-alsrsquo utility functions but also their referencepoints for which there is no natural choice inour context Similar issues arise if we try toapply decision weights (Tversky and PeterWakker 1995) or measures of overconfidencewith respect to driving ability

F Implications for Profits and Pricing

We now look at how firm profits vary withalternative pricing schemes This exercise isinteresting for several reasons First althoughwe do not use supply-side information for esti-mation it shows how one may incorporate suchinformation in estimation Second it illustratesthe conceptual trade-off faced by a monopolistthat operates in a market with adverse selectionAlthough the conceptual trade-off betweenhigher demand and worse selection is wellknown and has been extensively analyzed in thetheoretical literature quantifying it is importantto understand its empirical relevance Finallywe argued earlier that unobserved heterogeneityin risk aversion seems more important than un-

49 An accident is counted in this measure if it was re-ported to the police and a police officer arrived at the scene

50 This is estimated by taking the number of activepolicies at the end of our sample (45871) dividing it by ourbest guess for the share of the market the company had atthe time (7 percent) and multiplying it by the estimatedclaim rate in our data as computed in Table 2 (0245)

776 THE AMERICAN ECONOMIC REVIEW JUNE 2007

observed heterogeneity in risk The current ex-ercise shows that this conclusion also translatesto pricing and profits

Throughout this section we hold fixed thedistribution of risk and risk aversion the firmfaces Since we have little information about thedeterminants of overall demand faced by thefirm and hence how restrictive it is to hold thedistribution fixed we make the simplifying as-sumption that individuals make their choicessequentially They first choose the insuranceprovider by observing only the price of theregular deductible Once they decide to buy apolicy from the insurer they choose the deduct-ible level This seems a reasonable approxima-tion as the regular deductible is the one alwaysadvertised and initially quoted while the otheroptions are revealed only once the potentialcustomer and the insurance sales person ldquogetinto detailsrdquo Consistent with this assumptionwe assume that the regular premium and de-ductible are dictated by competitive conditionsand we focus on the choice of the low deduct-ible and its associated premium

From the companyrsquos standpoint each indi-vidual can be represented by a random draw of(i ri) from the conditional (on observables)distribution of risk and risk aversion

(20) ln i

ln ri Nxi

xi

2 r

r r2

When analyzing the optimal menu to offer suchan individual the company is assumed to berisk neutral and to maximize expected profitsSuppose the company offered only the regulardeductible and premium (dh ph) Let the ex-pected profits from this strategy be 0 Givenour assumptions we proceed by analyzing howthe firmrsquos profits are affected by a choice of anadditional low deductible option (dl pl) withdl dh and pl ph It is easy to use a changein variables and analyze the choice of d dh dl and p pl ph Expected profits arenow given by

(21) maxdp

0 Prri rii d pp

d Eiri rii d p

The trade-off in the companyrsquos decision isstraightforward Each new customer who chooses

the low combination pays an additional p up-front but saves d for each accident she is in-volved in This translates into two effects thatenter the companyrsquos decision problem The first issimilar to a standard pricing problem higher(lower) price difference (deductible difference)p (d) leads to a higher markup (on those indi-viduals who select the low deductible) but tolower quantity (or probability of purchase) asfewer individuals elect to choose the low deduct-ible This effect enters the profit function throughD(d p) Pr(ri ri(i d p)) The secondcomposition effect arises because of adverse se-lection As the price of the low deductible in-creases those individuals who still elect to choosethe low combination are ceteris paribus thosewith higher risk This effect enters through (dp) E(iri ri(i d p)) Its magnitude andsign depend on the relative heterogeneity of i andri and on the correlation between them Sinceneither D(d p) nor (d p) has a closed-form solution we analyze this decision problemgraphically where D(d p) and (d p) arenumerically computed using simulations from thejoint distribution

We illustrate our analysis by using the meanindividual in the data whose expected ln(i)and ln(ri) are 154 and 1181 respectively(based on Table 4) Such an individual has tochoose between a regular contract of ( phdh) (3189 1595) (in NIS) and a lowcontract of ( pl dl) (3381 957) ie (pd) (191 638) Below we discuss severaladditional counterfactual cases First we con-sider a case with a negative correlation be-tween risk and risk aversion (with the samemagnitude ie 084) Second we con-sider cases when the company ignores unob-served heterogeneity in one of the dimensionsie it views individuals as a draw from theestimated marginal distribution on one dimen-sion with the other dimension known and fixedat its estimated mean We do this exercise foreach dimension separately

To get intuition for the different effects Figure2 presents the estimated distribution in the spaceof (i ri) A small increase (decrease) in p (d)shifts the indifference setup to the right therebymaking some marginal individuals who were pre-viously just to the right of it switch to choosingthe regular deductible The demand trade-off isjust the comparison between the marginal loss ofthe company from all the marginal individuals

777VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

who no longer buy higher coverage with thehigher profits made from the inframarginal indi-viduals who still elect to choose higher coverageFigure 2 also helps in illustrating the effect ofadverse selection and the importance of the cor-relation coefficient As the menu shifts to the rightthe positive correlation implies that the marginalindividuals have higher risk than the average Thismeans that ldquolosingrdquo them (namely havingthem buy less coverage) is not as costly forthe insurance company as such individualsare on average more adversely selected Anegative correlation for example would havemade these marginal individuals more valuablethereby decreasing the incentive to increase pricesor deductibles from the current levels

Figure 4 presents the effects of pricing onprofits by varying the low-deductible levelkeeping the premium charged for it fixed at theobserved price (of p 191) It implies that the

current low deductible benefit of 638 NIS re-sults in additional annual profits of about 368NIS per customer This is about 037 percent oftotal operating profits per customer which isabout 1000 NIS Note however that after sub-tracting the administrative and claim-handlingcosts associated with each customer and claimthe relative magnitude of this effect will behigher Note also that the estimates imply thatthe current low-deductible level is suboptimalBy setting a smaller low-deductible benefit ofd 355 NIS (ie increasing the current lowdeductible by 283 NIS) additional profits canbe increased to 659 NIS51 Of course the find-

51 There is no reason of course to limit the choice of thecompany to only one additional deductible level Moredegrees of freedom in choosing the pricing menu will leadto higher profits

FIGURE 4 COUNTERFACTUALSmdashPROFITS

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) We plot the additional profits fromoffering a low deductible as a function of the attractiveness of a low deductible d dh dl fixing its price at the observedlevel The thick solid line presents the counterfactual profits as implied by the estimated benchmark model The other threecurves illustrate how the profits change in response to changes in the assumptions when the correlation between riskand risk aversion is negative (thin solid line) when there is no heterogeneity in risk aversion (dot-dashed line) and whenthere is no heterogeneity in risk (dashed line) The maxima (argmax) of the four curves respectively are 659 (355)714 (583) 0 and 704 (500) The dotted vertical line represents the observed level of d (638) which implies that theadditional profits from the observed low deductible are 368 NIS per policy

778 THE AMERICAN ECONOMIC REVIEW JUNE 2007

ing that the current pricing is suboptimal mayalso be due to a limitation of the model

Consistent with the intuition discussed aboveFigure 4 also shows that the incentive to in-crease prices (or lower the deductible) is higherwith positive correlation When the correlationbetween risk and risk aversion is negative theoptimal low deductible is lower It also showsthat ignoring either dimension of unobservedheterogeneity has an important effect on pric-ing decisions However while ignoring het-erogeneity in risk does not change thequalitative pattern by much ignoring hetero-geneity in risk aversion completely changesthe pricing analysis In fact given the esti-mated coefficients and the observed prices ifindividuals vary only in risk offering a lowdeductible does not increase profits due to ad-verse selection

Figure 5 breaks down the effect on profits bypresenting the pricing effect on the demand forlow deductible D(d p) and on the compo-sition effect (d p) The former is simplygenerated by the distribution of certainty equiv-alents implied by the joint distribution of i andri (Landsberger and Meilijson 1999) It is S-shaped due to the assumption of lognormal dis-tribution The shape of the composition effect isdriven by the relative variance of i and ri andby the correlation coefficient As the estimatesimply that most of the variation in certaintyequivalents is driven by variation in ri thestrong positive correlation implies that the com-position effect is monotonically decreasing inthe deductible level As the low-deductible op-tion becomes more favorable more peoplechoose it with the most risky individuals choos-ing it first The effect of the deductible level onthe composition effect is dramatically differentwhen the correlation between risk and risk aver-sion is negative When this is the case theobserved relationship between the deductiblelevel and the composition effect is mostly re-versed This is because the effect of risk aver-sion dominates that of adverse selection due toits higher variance

IV Concluding Remarks

The paper makes two separate contributionsFirst from a methodological standpoint we layout a conceptual framework through which one

can formulate and estimate a demand system forindividually customized contracts The key datarequirements for this approach are contractchoices individual choice sets and ex post riskrealizations Since such data may be available inmany other contexts the methodological frame-work may be useful to uncover structural pa-rameters in such settings As an example onecould consider annuity data and use guaranteeperiod choices and mortality data to identifyheterogeneity in risk (mortality) and in prefer-ences for wealth after death (Einav Finkelsteinand Paul Schrimpf 2006) Similarly one couldconsider loan data and use down-paymentchoices and default data to identify heterogene-ity in risk (default) and in liquidity (Will Ad-ams Einav and Jonathan D Levin 2006)Second from an economic standpoint we pro-vide a new set of estimates for the degree andheterogeneity of (absolute) risk aversion and itsrelationship with risk We discuss these below

While our estimates of risk aversion help topredict other related insurance decisions it isnatural to ask to what extent these parametersare relevant in other contexts This is essentiallyan empirical question which can be answeredonly by estimating risk aversion parameters fora variety of bet sizes and in a variety of con-texts Since isolating risk preferences in manycontexts is hard such exercises are rare leavingus with no definite answer for the scope ofmarkets for which our estimates may be rele-vant On one hand Rabin (2000) and Rabin andThaler (2001) argue that different decisions inlife are taken in different contexts and thereforemay be subject to different parameters in theutility function On the other hand classicaltheory suggests that each individual has onevalue function over her lifetime wealth so allrisky decisions take into account the same valuefunction and are therefore subject to the samerisk preferences Our view is somewhere inbetween We are more comfortable with extrap-olation of our risk aversion estimates to setupsthat are closer to the auto insurance marketcontext in which these parameters are esti-mated To assess ldquoclosenessrdquo it is important toconsider various factors over which contextsmay differ Such factors may include bet size aswell as informational and behavioral effectssuch as default options framing and rarity of theevents Since we use claim data to identify risktype we leave everything else to be interpreted as

779VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

FIGURE 5 COUNTERFACTUALSmdashSELECTION

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) Parallel to Figure 4 we breakdown the effects on profits to the share of consumers who choose a low deductible (bottom panel) and to the expected riskof this group (top panel) This is presented by the thick solid line for the estimates of the benchmark model As in Figure 4 wealso present these effects for three additional cases when the correlation between risk and risk aversion is negative (thin solidline) when there is no heterogeneity in risk aversion (dot-dashed line) and when there is no heterogeneity in risk (dashedline) The dotted vertical lines represent the observed level of d (638) for which the share of low deductible is 16 percentand their expected risk is 0264 This may be compared with the corresponding figures in Table 2A of 178 and 0309respectively Note however that the figure presents estimated quantities for the average individual in the data while Table2A presents the average quantities in the data so one should not expect the numbers to fit perfectly

780 THE AMERICAN ECONOMIC REVIEW JUNE 2007

risk aversion As an example overconfidence willbe captured by a lower level of estimated riskaversion so if overconfidence is more importantin auto insurance than in health insurance whenextrapolated to health insurance individuals maybehave as if they are more risk averse than weestimate them to be

With this caveat in mind let us discuss ourfour main findings First we find large hetero-geneity in risk preferences across individualsThis heterogeneity is important in various con-texts (a) it cautions against using pure cross-sectional variation to test expected utilitytheory (b) it may create strong selection ofparticipants into particular markets this maymake participants in voluntary markets quitedifferent in their risk preferences from those inmarkets in which participation is either manda-tory or driven by other factors and (c) it maymake the marginal individual quite differentfrom the average one Models in macroeconom-ics and finance which often use a representativeindividual framework may not be able to cap-ture and account for such differences

Our second set of findings concerns the wayrisk aversion relates to observable characteris-tics Our finding that women are more riskaverse than men has been documented in othersettings The finding that risk preferences ex-hibit a U-shaped pattern over the life cycle maybe interesting to explore further in other con-texts Other findings suggest that the estimatedcoefficient of risk aversion increases with ob-servables that are related to income and wealthAs we agree with the widely held belief of thedecreasing absolute risk aversion property ourpreferred interpretation for this finding is thatwealth and income may be endogenous gener-ating the estimated cross-sectional relationshipWhile lower risk aversion may be associatedwith higher propensity to become an entrepre-neur and thereby have higher wealth it mayalso be associated with lower propensity to saveor invest in education affecting wealth the otherway Therefore one important message of thisfinding is that accounting for heterogeneity inpreferences may be important as representativeconsumer models may provide misleading in-terpretations for otherwise natural results

The last two sets of findings concern therelationship between risk and preferences Sincerisk is particular to the context in which it ismeasured these findings may be sensitive to the

market context and may change once risk takesother forms Even within the auto insurancemarket it is important how risk is measuredMoreover risk in the auto insurance marketmay be conceptually different from risk in othermarkets In many markets risk is independentacross individuals In auto insurance howevermuch of the risk depends on coordinationamong drivers and therefore may be more re-lated to relative not absolute characteristicsFor this reason our finding of positive correla-tion between risk and risk aversion can coexistwith findings of negative correlation in othercontexts (Israel 2005 Finkelstein and McGarry2006) The positive correlation we find is alsoconsistent with the fact that the bivariate probittest in our data provides evidence for adverseselection (Cohen 2005) while similar reduced-form tests in other contexts do not

Finally we find that unobserved heterogene-ity in risk preferences is more important thanheterogeneity in risk This may be driven by thecasual evidence that insurance companies exertmuch effort and resources in collecting con-sumer data which are informative about riskclassification but not about preferences52 Weillustrate the empirical importance of our find-ings for the analysis of optimal contracts in autoinsurance The presence of more than one di-mension of unobserved heterogeneity may dra-matically change the nature of these contractsTheory is still not fully developed for suchmultidimensional screening problems as it typ-ically requires a small number of types (Lands-berger and Meilijson 1999) restricts the twodimensions to be independent of each other(Jean-Charles Rochet and Lars A Stole 2002)or assumes that the number of instrumentsavailable to the monopolist is not smaller thanthe dimension of unobserved heterogeneity(Steven Matthews and John Moore 1987 Rich-ard Arnott and Joseph E Stiglitz 1988)53 MarkArmstrong (1999) may be the closest theoreticalwork to the framework suggested here It cannot

52 This choice of data collection efforts may be justifiedif it is easier for such firms to price discriminate based onrisk but harder to price discriminate based on preferencesA common belief is that without cost-based (ie risk-based) justification for prices price discrimination may leadto consumer backlash

53 See also Smart (2000) Villeneuve (2003) and Jullienet al (2007) for related theoretical results

781VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

be directly applied however as it uses simpli-fying linearity assumptions which would behard to impose in the current context Our re-sults indicate that many applications can benefitfrom extending the theory to include the more

general case such as the one analyzed hereSuch a theory may also serve as a guide forusing supply-side restrictions in similar con-texts Our counterfactual analysis is a very pre-liminary start in this direction

APPENDIX A DESCRIPTION OF THE GIBBS SAMPLER

In this appendix we describe the setup of the Gibbs sampler that we use to estimate the modelOne of the main advantages of the Gibbs sampler is its ability to allow for data augmentation oflatent variables (Tanner and Wong 1987) In our context this amounts to augmenting the individual-specific risk aversion and risk type namely i rii1

n as additional parametersWe can write the model as follows

(22) ln i xi i

(23) ln ri xi vi

(24) choicei 1 if ri ri ()0 if ri ri ()

(25) claimsi Poissoni ti

(26) i

vi

iid

N00

2 r

r r2

Let

2 r

r r2 X

x1

xn

0

0 x1

xn

y r and ui i

vi

The set of parameters for which we want to have a posterior distribution is given by uii1

n The prior specifies that are independent of uii1n have a conventional

diffuse prior We adopt a hierarchical prior for uii1N

(27) uii 1n

iid

N0

(28) 1 Wishart2a Q

782 THE AMERICAN ECONOMIC REVIEW JUNE 2007

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 6: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

The exchange rate between NIS and US dollarsmonotonically increased from 301 in 1995 to414 in 1999 (on average it was 352)15 Annualinflation was about 8 percent on average andcumulative inflation over the observation periodwas 48 percent We will account for these effects

as well as other general trends by using yeardummy variables throughout the analysis

B The Menu of Deductibles and Premiums

Let xi be the vector of characteristics individuali reports to the insurance company After learningxi the insurer offered individual i a menu of fourcontract choices One option offered a ldquoregularrdquodeductible and a ldquoregularrdquo premium The term

15 PPP figures were about 10 percent lower than the nom-inal exchange rates running from 260 in 1995 to 374 in 1999

TABLE 1mdashSUMMARY STATISTICSmdashCOVARIATES

Variable Mean Std dev Min Max

Demographics Age 41137 1237 1806 8943Female 0316 047 0 1Family Single 0143 035 0 1

Married 0779 042 0 1Divorced 0057 023 0 1Widower 0020 014 0 1Refused to say 0001 004 0 1

Education Elementary 0016 012 0 1High school 0230 042 0 1Technical 0053 022 0 1College 0233 042 0 1No response 0468 050 0 1

Emigrant 0335 047 0 1

Car attributes Value (current NIS)a 66958 37377 4000 617000Car age 3952 287 0 14Commercial car 0083 028 0 1Engine size (cc) 1568 385 700 5000

Driving License years 18178 1007 0 63Good driver 0548 050 0 1Any driver 0743 044 0 1Secondary car 0151 036 0 1Business use 0082 027 0 1Estimated mileage (km)b 14031 5891 1000 32200History length 2847 061 0 3Claims history 0060 015 0 2

Young driver Young 0192 039 0 1Gender Male 0113 032 0 1

Female 0079 027 0 1Age 17ndash19 0029 017 0 1

19ndash21 0051 022 0 121ndash24 0089 029 0 124 0022 015 0 1

Experience 1 0042 020 0 11ndash3 0071 026 0 13 0079 027 0 1

Company year First year 0207 041 0 1Second year 0225 042 0 1Third year 0194 040 0 1Fourth year 0178 038 0 1Fifth year 0195 040 0 1

Note The table is based on all 105800 new customers in the dataa The average exchange rate throughout the sample period was approximately $1 per 35 NIS starting at 13 in late 1994

and reaching 14 in late 1999b The estimated mileage is not reported by everyone It is available for only 60422 new customers

750 THE AMERICAN ECONOMIC REVIEW JUNE 2007

regular was used for this deductible level bothbecause it was relatively similar to the deductiblelevels offered by other insurers and because mostpolicyholders chose it The regular premium var-ied across individuals according to some deter-ministic function (unknown to us) pit ft(xi)which was quite stable over time The regulardeductible level was directly linked to the regularpremium according to

(1) dit min12

pit capt

That is it was set to one-half of the regular pre-mium subject to a deductible cap capt whichvaried over time but not across individuals Thepremiums associated with the other options werecomputed by multiplying pit by three differentconstants 106 for ldquolowrdquo deductible 0875 forldquohighrdquo deductible and 08 for ldquovery highrdquo de-ductible The regular deductible dit was con-verted to the other three deductible levels in asimilar way using multipliers of 06 for low 18for high and 26 for very high

There are two main sources of exogenous vari-ation in prices The first arises from companyexperimentation The multipliers described abovewere fixed across individuals and over time formost of the observation period but there was asix-month period during the insurerrsquos first year ofoperation (May 1995 to October 1995) in whichthe insurer experimented with slightly modifiedmultipliers16 This modified formula covers al-most 10 percent of the sample The secondsource of variation arises from discrete ad-justments to the uniform cap The cap variedover time due to inflation competitive condi-tions and as the company gained more expe-rience (Figure 1) The cap was binding for

16 For individuals with low levels of regular premiumsduring the specified period the regular deductible was set at53 percent (instead of 50 percent) of the regular premiumthe low deductible was set at 33 percent (instead of 30percent) of the regular premium and so on

FIGURE 1 VARIATION IN THE DEDUCTIBLE CAP OVER TIME

Notes This figure presents the variation in the deductible cap over time which is one of the main sources of pricing variation inthe data We do not observe the cap directly but it can be calculated from the observed menus The graph plots the maximalregular deductible offered to anyone who bought insurance from the company over a moving seven-day window Thelarge jumps in the graph reflect changes in the deductible cap There are three reasons why the graph is not perfectlysmooth First in a few holiday periods (eg October 1995) there are not enough sales within a seven-day window sonone of those sales hits the cap This gives rise to temporary jumps downward Second the pricing rule applies at thedate of the price quote given to the potential customer Our recorded date is the first date the policy becomes effectiveThe price quote is held for a period of two to four weeks so in periods in which the pricing changes we may still seenew policies sold using earlier quotes made according to a previous pricing regime Finally even within periods of constantcap the maximal deductible varies slightly (variation of less than 05 percent) We do not know the source of this variation

751VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

about a third of the policyholders in our dataAll these individuals would be affected by achange in the cap Much of the variation ofmenus in the data is driven by the exogenousshifts in the uniform deductible cap The un-derlying assumption is that conditional onobservables these sources of variation pri-marily affect the deductible choice of newcustomers but they do not have a significantimpact on the probability of purchasing insurancefrom the company Indeed this assumption holdsin the data with respect to observables there is nodistinguishable difference in the distribution ofobservable characteristics of consumers who buyinsurance just before and those who buy just aftera change in the deductible cap

C Summary Statistics

The top of Table 2A summarizes the deduct-ible menus all are calculated according to theformula described earlier Only 1 percent of thepolicyholders chose the high or very high de-ductible options Therefore for the rest of theanalysis we focus only on the choice betweenregular and low deductible options (chosen by811 and 178 percent of the individuals respec-tively)17 Focusing only on these options doesnot create any selection bias because we do notomit individuals who chose high or very highdeductibles For these individuals we assumethat they chose a regular deductible This as-sumption is consistent with the structural modelwe develop in the next section which impliesthat conditional on choosing high or very highdeductibles an individual would almost alwaysprefer the regular over the low deductible

The bottom of Table 2A as well as Table2B present summary statistics for the policy re-alizations We focus only on claim rates and noton the amounts of the claims This is because anyamount above the higher deductible level is cov-

ered irrespective of the deductible choice and thevast majority of the claims fit in this category (seeSection IIIE) For all these claims the gain fromchoosing a low deductible is the same in the eventof a claim and is equal to the difference betweenthe two deductible levels Therefore the claimamount is rarely relevant for the deductible choice(and likewise for the companyrsquos pricing decisionwe analyze in Section IIIF)

Averaging over all individuals the annualclaim rate was 0245 One can clearly observesome initial evidence of adverse selection Onaverage individuals who chose a low deductiblehad higher claim rates (0309) than those whochose the regular deductible (0232) Those whochose high and very high deductibles had muchlower claim rates (0128 and 0133 respectively)These figures can be interpreted in the context ofthe pricing formula A risk-neutral individual willchoose the low deductible if and only if her claimrate is higher than (pd) (plow pregular)(dregular dlow) When the deductible cap is notbinding which is the case for about two-thirds ofthe sample this ratio is given directly by thepricing formula and is equal to 03 Thus anyindividual with a claim rate higher than 03 willbenefit from buying the additional coverage pro-vided by a low deductible even without any riskaversion The claim data suggest that the offeredmenu is cheaper than an actuarially fair contractfor a nonnegligible part of the population (13percent according to the benchmark estimates re-ported below) This observation is in sharp con-trast to other types of insurance contracts such asappliance warranties which are much more ex-pensive than the actuarially fair price (Rabin andThaler 2001)

II The Empirical Model

A A Model of Deductible Choice

Let wi be individual irsquos wealth (pih di

h) theinsurance contract (premium and deductible re-spectively) with high deductible (pi

l dil) the

insurance contract with low deductible ti theduration of the policy and ui(w) individual irsquosvNM utility function We assume that the num-ber of insurance claims is drawn from a Poissondistribution with an annual claim rate iThrough most of the paper we assume that i isknown to the individual We also assume that iis independent of the deductible choice ie that

17 The small frequency of ldquohighrdquo and ldquovery highrdquochoices provides important information about the lowerends of the risk and risk aversion distributions but (for thatsame reason) makes the analysis sensitive to functionalform Considering these options or the option of not buyinginsurance creates a sharp lower bound on risk aversion forthe majority of the observations making the estimates muchhigher Given that these options are rarely selected how-ever it is not clear to us whether they were regularlymentioned during the insurance sales process renderingtheir use somewhat inappropriate

752 THE AMERICAN ECONOMIC REVIEW JUNE 2007

there is no moral hazard Finally we assumethat in the event of an accident the value of theclaim is greater than di

h We revisit all theseassumptions in Sections IIID and IIIE For therest of this section i subscripts are suppressedfor convenience

In the market we study insurance policies are

typically held for a full year after which theycan be automatically renewed with no commit-ment by either the company or the individualMoreover all auto-insurance policies sold inIsrael can be canceled without prior notice bythe policyholder with premium payments beinglinearly prorated Both the premium and the

TABLE 2AmdashSUMMARY STATISTICSmdashMENUS CHOICES AND OUTCOMES

Variable Obs Mean Std dev Min Max

Menu Deductible (current NIS)a Low 105800 87548 12101 37492 103911Regular 105800 145299 19779 62486 171543High 105800 260802 35291 112475 308778Very high 105800 376305 50853 162464 446013

Premium (current NIS)a Low 105800 338057 91404 132471 1923962Regular 105800 318922 8623 124972 1815058High 105800 279057 75451 109351 1588176Very high 105800 255137 68984 99978 1452046

pd 105800 0328 006 03 18

Realization Choice Low 105800 0178 038 0 1Regular 105800 0811 039 0 1High 105800 0006 008 0 1Very high 105800 0005 007 0 1

Policy termination Active 105800 0150 036 0 1Canceled 105800 0143 035 0 1Expired 105800 0707 046 0 1

Policy duration (years) 105800 0848 028 0005 108Claims All 105800 0208 048 0 5

Low 18799 0280 055 0 5Regular 85840 0194 046 0 5High 654 0109 034 0 3Very high 507 0107 032 0 2

Claims per yearb All 105800 0245 066 0 19882Low 18799 0309 066 0 9264Regular 85840 0232 066 0 19882High 654 0128 062 0 12636Very high 507 0133 050 0 3326

a The average exchange rate throughout the sample period was approximately $1 per 35 NIS starting at 13 in late 1994and reaching 14 in late 1999

b The mean and standard deviation of the claims per year are weighted by the observed policy duration to adjust forvariation in the exposure period These are the maximum likelihood estimates of a simple Poisson model with no covariates

TABLE 2BmdashSUMMARY STATISTICSmdashCONTRACT CHOICES AND REALIZATIONS

Claims Low Regular High Very high Total Share

0 11929 (0193) 49281 (0796) 412 (0007) 299 (0005) 61921 (100) 080341 3124 (0239) 9867 (0755) 47 (0004) 35 (0003) 13073 (100) 016962 565 (0308) 1261 (0688) 4 (0002) 2 (0001) 1832 (100) 002383 71 (0314) 154 (0681) 1 (0004) 0 (0000) 226 (100) 000294 6 (0353) 11 (0647) 0 (0000) 0 (0000) 17 (100) 000025 1 (0500) 1 (0500) 0 (0000) 0 (0000) 2 (100) 000003

Notes The table presents tabulation of choices and number of claims For comparability the figures are computed only forindividuals whose policies lasted at least 09 years (about 73 percent of the data) The bottom rows of Table 2A providedescriptive figures for the full dataset The numbers in parentheses in each cell represent percentages within each row Theright-hand-side column presents the marginal distribution of the number of claims

753VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

probability of a claim are proportional to thelength of the time interval taken into account soit is convenient to think of the contract choice asa commitment for only a short amount of timeThis approach has several advantages First ithelps to account for early cancellations andtruncated policies which together constitute 30percent of the policies in the data18 Second itmakes the deductible choice independent ofother longer-term uncertainties faced by the in-dividual so we can focus on static risk-takingbehavior Third this formulation helps to obtaina simple framework for analysis which is attrac-tive both analytically and computationally19

The expected utility that the individual obtainsfrom the choice of a contract (p d) is given by

(2) v p d

1 tuw pt tuw pt d

We characterize the set of parameters that willmake the individual indifferent between the twooffered contracts This set provides a lower (up-per) bound on the level of risk aversion for indi-viduals who choose the low (high) deductible (fora given ) Thus we analyze the equation v(phdh) v(pl dl) By taking limits with respect to t(and applying LrsquoHopitalrsquos rule) we obtain

(3)

limt30

1

t(u(w pht) u(w plt))

(u(w pht) u(w pht dh)) (u(w plt) u(w plt dl))

pl phuw

uw dl uw dh

or

(4)

pl phuw uw dl uw dh

The last expression has a simple intuition Theright-hand side is the expected gain (in utils) perunit of time from choosing a low deductibleThe left-hand side is the cost of such a choiceper unit of time For the individual to be indif-ferent the expected gains must equal the costs

In our benchmark specification we assumethat the third derivative of the vNM utilityfunction is not too large A Taylor expansion forboth terms on the right-hand side of equation(4) ie u(w d) u(w) du(w) (d22)u13(w) implies that

(5)pl ph

uw dh dluw

1

2dh dldh dlu13w

Let d dh dl 0 p pl ph 0 andd 1frasl2 (dh dl) to obtain

(6)p

duw uw d u13w

or

(7) r u13w

uw

p

d 1

d

where r is the coefficient of absolute risk aver-sion at wealth level w

18 As can be seen in Table 2A 70 percent of the policiesare observed through their full duration (one year) About15 percent are truncated by the end of our observationperiod and the remaining 15 percent are canceled for var-ious reasons such as change in car ownership total-lossaccident or a unilateral decision of the policyholder tochange insurance providers

19 This specification ignores the option value associatedwith not canceling a policy This is not very restrictiveSince experience rating is small and menus do not changeby much this option value is likely to be close to zero Asimple alternative is to assume that individuals behave as ifthey commit for a full year of coverage In such a case themodel will be similar to the one we estimate but willdepend on the functional form of the vNM utility functionand would generally require taking infinite sums (over thepotential realizations for the number of claims within the year)In the special case of quadratic expected utility maximizerswho care only about the mean and variance of the number ofclaims this is easy to solve The result is almost identical to theexpression we subsequently derive in equation (7)

754 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Equation (7) defines an indifference set in thespace of risk and risk aversion which we willrefer to by (r() ) and ((r) r) interchange-ably Both r() and (r) have a closed-formrepresentation a property that will be computa-tionally attractive for estimation20 Both termsare individual specific as they depend on thedeductible menu which varies across individu-als For the rest of the paper we regard eachindividual as associated with a two-dimensionaltype (ri i) An individual with a risk parameter

i who is offered a menu (pih di

h) (pil di

l)will choose the low-deductible contract if andonly if his coefficient of absolute risk aversionsatisfies ri ri(i) Figure 2 presents a graphi-cal illustration

B The Benchmark Econometric Model

The Econometric ModelmdashThe econometricmodel we estimate is fully described by the fiveequations presented in this section Our ob-jective is to estimate the joint distribution of(i ri)mdashthe claim rate and coefficient of abso-lute risk aversionmdashin the population of policy-holders conditional on observables xi The

20 For example estimating the CARA version of themodel (Section IIID) for which r() does not have aclosed-form representation takes almost ten times longer

FIGURE 2 THE INDIVIDUALrsquoS DECISIONmdashA GRAPHICAL ILLUSTRATION

Notes This graph illustrates the individualrsquos decision problem The solid line presents the indifference setmdashequation (7)mdashappliedfor the menu faced by the average individual in the sample Individuals are represented by points in the two-dimensional spaceabove In particular the scattered points are 10000 draws from the joint distribution of risk and risk aversion for the averageindividual (on observables) in the data based on the point estimates of the benchmark model (Table 4) If an individual is eitherto the right of the line (high risk) or above the line (high risk aversion) the low deductible is optimal Adverse selection is capturedby the fact that the line is downward sloping higher-risk individuals require lower levels of risk aversion to choose the lowdeductible Thus in the absence of correlation between risk and risk aversion higher-risk individuals are more likely to choosehigher levels of insurance An individual with i (pidi) will choose a lower deductible even if he is risk neutral ie withprobability one (we do not allow individuals to be risk loving) This does not create an estimation problem because i is notobserved only a posterior distribution for it Any such distribution will have a positive weight on values of i that are below(pidi) Second the indifference set is a function of the menu and in particular of (pidi) and d An increase in (pidi) willshift the set up and to the right and an increase in d will shift the set down and to the left Therefore exogenous shiftsof the menus that make both arguments change in the same direction can make the sets ldquocrossrdquo thereby allowing us toidentify the correlation between risk and risk aversion nonparametrically With positive correlation (shown in the figureby the ldquoright-bendingrdquo shape of the simulated draws) the marginal individuals are relatively high risk therefore creatinga stronger incentive for the insurer to raise the price of the low deductible

755VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

benchmark formulation assumes that (i ri) fol-lows a bivariate lognormal distribution Thuswe can write the model as

(8) ln i xi i

(9) ln ri xi vi

with

(10) i

vi

iid

N00

2 r

r r2

Neither i nor ri is directly observed Thereforewe treat both as latent variables Loosely speak-ing they can be thought of as random effects Weobserve two variables the number of claims andthe deductible choice which are related to thesetwo unobserved components Thus to completeour econometric model we have to specify therelationship between the observed variables andthe latent ones This is done by making two struc-tural assumptions First we assume that the num-ber of claims is drawn from a Poisson distributionnamely

(11) claimsi Poisson(i ti )

where ti is the observed duration of the policySecond we assume that when individuals maketheir deductible choices they follow the theoreti-cal model described in the previous section Themodel implies that individual i chooses the lowdeductible (choicei 1) if and only if ri ri(i)where ri is defined in equation (7) Thus theempirical model for deductible choice is given by

(12) Prchoicei 1 Prri

pi

idi 1

d i

Prexp(xi vi)

pi

exp(xi i)di 1

d i

With no unobserved heterogeneity in risk (i 0) equation (12) reduces to a simple probit In

such a case one can perfectly predict risk from thedata denote it by (xi) and construct an addi-tional covariate ln([pi((xi)di) 1]d i) Giventhe assumption that risk aversion is distributedlognormally running the probit regressionabove and renormalizing the coefficient onthe constructed covariate to 1 (instead ofthe typical normalization of the variance of theerror term to 1) has a structural interpretation withln (ri) as the dependent variable However Cohen(2005) provides evidence of adverse selection inthe data implying the existence of unobservedheterogeneity in risk This makes the simple probitregression misspecified Estimation of the fullmodel is more complicated Once we allow forunobserved heterogeneity in both unobserved riskaversion (vi) and claim rate (i) we have to inte-grate over the two-dimensional region depicted inFigure 2 for estimation

EstimationmdashA natural way to proceed is toestimate the model by maximum likelihoodwhere the likelihood of the data as a function ofthe parameters can be written by integrating outthe latent variables namely

(13) Lclaimsi choicei

Prclaimsi choiceii riPri ri

where is a vector of parameters to be esti-mated While formulating the empirical modelusing likelihood may help our thinking aboutthe data-generating process using maximumlikelihood (or generalized method of moments(GMM)) for estimation is computationally cum-bersome This is because in each iteration itrequires evaluating a separate complex integralfor each individual in the data In contrastMarkov Chain Monte Carlo (MCMC) Gibbssampling is quite attractive in such a case Us-ing data augmentation of latent variables (Mar-tin A Tanner and Wing Hung Wong 1987)according to which we simulate (i ri) and latertreat those simulations as if they are part ofthe data one can avoid evaluating the complexintegrals by sampling from truncated normaldistributions which is significantly less compu-tationally demanding (eg Luc Devroye 1986)This feature combined with the idea of aldquosliced samplerrdquo (Paul Damien John Wake-field and Stephen Walker 1999) to sample from

756 THE AMERICAN ECONOMIC REVIEW JUNE 2007

an unfamiliar posterior distribution makes theuse of a Gibbs sampler quite efficient for ourpurposes Finally the lognormality assumptionimplies that F(ln(i)ri) and F(ln(ri)i) follow a(conditional) normal distribution allowing us torestrict attention to univariate draws furtherreducing the computational burden

The Appendix provides a full description ofthe Gibbs sampler including the conditionaldistributions and the (flat) prior distributions weuse The basic intuition is that conditional onobserving (i ri) for each individual we have asimple linear regression model with two equa-tions The less standard part is to generate drawsfor (i ri) We do this iteratively Conditionalon i the posterior distribution for ln(ri) fol-lows a truncated normal distribution where thetruncation point depends on the menu individ-ual i faces and its direction (from above orbelow) depends on individual irsquos deductiblechoice The final step is to sample from theposterior distribution of ln(i) conditional on riThis is more complicated as we have bothtruncation which arises from adverse selection(just as we do when sampling for ri) and thenumber of claims which provides additionalinformation about the posterior of i Thus theposterior for i takes an unfamiliar form forwhich we use a ldquosliced samplerrdquo

We use 100000 iterations of the Gibbs sam-pler It seems to converge to the stationarydistribution after about 5000 iterations There-fore we drop the first 10000 draws and use thelast 90000 draws of each variable to report ourresults The results are based on the posteriormean and posterior standard deviation fromthese 90000 draws Note that each iterationinvolves generating separate draws of (i ri) foreach individual Performing 100000 iterationsof the benchmark specification (coded in Mat-lab) takes about 60 hours on a Dell Precision530 workstation

C Identification

The parametric version of the model is iden-tified mechanically There are more equationsthan unknowns and no linear dependenciesamong them so (as also verified using MonteCarlo exercises) the model parameters can bebacked out from simulated data Our goal in thissection is not to provide a formal identificationproof Rather we want to provide intuition for

which features of the data allow us to identifyparticular parameters of the model The discus-sion also highlights the assumptions that areessential for identification vis-a-vis those thatare made only for computational convenience(making them in principle testable)

Discussion of Nonparametric Identifica-tionmdashThe main difficulty in identifying themodel arises from the gap between the (exante) risk type i which individuals usewhen choosing a deductible and the (ex post)realization of the number of claims we ob-serve We identify between the variation inrisk types and the variation in risk realizationsusing our parametric distributional assump-tions The key is that the distribution of risktypes can be uniquely backed out from theclaim data alone This allows us to use thedeductible choice as an additional layer ofinformation which identifies unobserved het-erogeneity in risk aversion21 Any distribu-tional assumption that allows us to uniquelyback out the distribution of risk types fromclaim data would be sufficient to identify thedistribution of risk aversion As is customaryin the analysis of count processes we make aparametric assumption that claims are gener-ated by a lognormal mixture of Poisson dis-tributions (Section IIID discusses this furtherand explores an alternative) Using a mixtureenables us to account for adverse selectionthrough unobserved heterogeneity in risk Italso allows us to better fit the tails of theclaim distribution In principle a more flexi-ble mixture or a more flexible claim-generat-ing process could be identified as long as theclaims data are sufficiently rich22

21 Cardon and Hendel (2001) face a similar identificationproblem in the context of health insurance They use vari-ation in coverage choice (analogous to deductible choice) toidentify the variation in health-status signals (analogous torisk types) from the variation in health expenditure (analo-gous to number of claims) They can rely on the coveragechoice to identify this because they make an assumptionregarding unobserved heterogeneity in preferences (iidlogit) We take a different approach as our main goal is toestimate rather than assume the distribution of preferences

22 Although it may seem that the claim data are limited(as they take only integer values between 0 and 5 in ourdata) variation in policy duration generates continuousvariation in the observed claim propensity Of course thisvariation also introduces an additional selection into themodel due to policy cancellations which are potentially

757VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Once the distribution of risk types is iden-tified from claims data the marginal distribu-tion of risk aversion (and its relationship tothe distribution of risk types) is nonparametri-cally identified from the variation in theoffered menus discussed in Section I Thisvariation implies different deductible and pre-mium options to identical (on observables)individuals who purchased insurance at dif-ferent times Different menus lead to differentindifference sets (similar to the one depictedin Figure 2) These sets often cross each otherand nonparametrically identify the distribu-tion of risk aversion and the correlation struc-ture at least within the region in which theindifference sets vary For the tails of thedistribution as is typically the case we haveto rely on parametric assumptions or usebounds The parametric assumption of log-normality we use for most of the paper ismade only for computational convenience

Intuition for the Parametric IdentificationMechanismmdashVariation in the offered menus isimportant for the nonparametric identificationThe parametric assumptions could identify themodel without such variation Thus to keep theintuition simple let us take the bivariate log-normal distribution as given and contrary to thedata assume that all individuals are identical onobservables and face the same menu Supposealso that all individuals are observed for exactlyone year and have up to two claims23 In thissimplified case the model has five parametersto be estimated the mean and variance of risk and

2 the mean and variance of risk aver-sion r and r

2 and the correlation parameter The data can be summarized by five numbersLet 130 131 and 132 1 131 130 be thefraction of individuals with zero one and twoclaims respectively Let 0 1 and 2 be theproportion of individuals who chose a low de-ductible within each ldquoclaim grouprdquo Given ourdistributional assumption about the claim-

generating process we can use 130 and 131 touniquely identify and

2 Loosely isidentified by the average claim rate in the dataand

2 is identified by how fat the tail of theclaim distribution is ie by how large (132131)is compared to (131130) Given and

2 and thelognormality assumption we can (implicitly)construct a posterior distribution of risk typesfor each claim group F(r claims c) andintegrate over it when predicting the deductiblechoice This provides us with three additionalmoments each of the form

(14) Ec Prchoice 1r

dFr claims c dFr

for c 0 1 2 These moments identify thethree remaining parameters of the model rr

2 and Let us now provide more economic content to

the identification argument Using the same ex-ample and conditional on identifying and

2

from the claim data one can think about thedeductible choice data 0 1 2 as a graph(c) The absolute level of the graph identifiesr In the absence of correlation between riskand risk aversion the slope of the graph iden-tifies r

2 with no correlation the slope shouldalways be positive (due to adverse selection)but higher r

2 would imply a flatter graph be-cause more variation in the deductible choiceswill be attributed to variation in risk aversionFinally is identified by the curvature of thegraph The more convex (concave) the graph isthe more positive (negative) is the estimated For example if 0 05 1 051 and 2 099 it is likely that r

2 is high (explaining why0 and 1 are so close) and is highly positive(explaining why 2 is not also close to 1) Incontrast if 0 1 it must mean that thecorrelation between risk and risk aversion isnegative which is the only way the originalpositive correlation induced by adverse selec-tion can be offset This intuition also clarifiesthat the identification of relies on observingindividuals with multiple claims (or differentpolicy durations) and that it is likely to besensitive to the distributional assumptions Thedata (Table 2B) provide a direct (positive) cor-relation between deductible choice and claims

endogenous The results are similar when we use onlyexpired and truncated policies

23 This variation is sufficient (and necessary) to identifythe benchmark model The data provide more variation weobserve up to five claims per individual we observe con-tinuous variation in the policy durations we observe vari-ation in prices and we exploit distributional restrictionsacross individuals with different observable characteristics

758 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The structural assumptions allow us to explainhow much of this correlation can be attributedto adverse selection The remaining correlation(positive or negative) is therefore attributed tocorrelation in the underlying distribution of riskand risk aversion

III Results

A Reduced-Form Estimation

To get an initial sense for the levels of abso-lute risk aversion implied by the data we use asimple back-of-the-envelope exercise We com-pute unconditional averages of p d dh dland d (Table 2A)24 and substitute these valuesin equation (7) The implied coefficient of ab-solute risk aversion is 29 104 NIS125 Thisfigure could be thought of as the average indif-ference point implying that about 18 percent ofthe policyholders have coefficients of absoluterisk aversion exceeding it To convert to USdollar amounts one needs to multiply thesefigures by the average exchange rate (352)resulting in an average indifference point of102 103 $US1 This figure is less than halfof a similar estimate reported by Sydnor (2006)for buyers of homeownerrsquos insurance but about3 and 13 times higher than comparable fig-ures reported by Gertner (1993) and Metrick(1995) respectively for television game showparticipants

Table 3 provides reduced-form analysis ofthe relationship between the observables andour two left-hand-side variables the number ofclaims and the deductible choice26 Column 1reports the estimates from a Poisson regressionof the number of claims on observed character-istics This regression is closely related to therisk equation we estimate in the benchmarkmodel It shows that older people women andpeople with a college education are less likely to

have an accident Bigger more expensiveolder and noncommercial cars are more likelyto be involved in an accident Driving experi-ence and variables associated with less intenseuse of the car reduce accident rates As could beimagined claim propensity is highly correlatedover time past claims are a strong predictor offuture claims Young drivers are 50 to 70 per-cent more likely to be involved in an accidentwith young men significantly more likely thanyoung women Finally as indicated by the trendin the estimated year dummies the accident ratesignificantly declined over time Part of thisdecline is likely due to the decline in accidentrates in Israel in general27 This decline mightalso be partly due to the better selection ofindividuals the company obtained over time asit gained more experience (Cohen 2003)

Columns 2 and 3 of Table 3 present estimatesfrom probit regressions in which the dependentvariable is equal to one if the policyholder chosea low deductible and is equal to zero otherwiseColumn 3 shows the marginal effects of thecovariates on the propensity to choose a lowdeductible These marginal effects do not have astructural interpretation as the choice of lowdeductible depends on its price on risk aver-sion and on risk In this regression we againobserve a strong trend over time Fewer policy-holders chose the low deductible as time wentby One reason for this trend according to thecompany executives is that over time the com-panyrsquos sales persons were directed to focusmainly on the ldquodefaultrdquo regular deductible op-tion28 The effect of other covariates will bediscussed later in the context of the full modelIn unreported probit regressions we also test thequalitative assumptions of the structural model byadding three additional regressors the price ratio(pidi) the average deductible offered d i

24 The unconditional is computed by maximum likeli-hood using the data on claims and observed durations of thepolicies

25 Using the CARA specification as in equation (16) weobtain a slightly lower value of 25 104 NIS1

26 We find positive correlation in the errors of these tworegressions suggesting the existence of adverse selection inthe data and motivating a model with unobserved heteroge-neity in risk This test is similar to the bivariate probit testproposed by Chiappori and Salanie (2000) and replicatesearlier results reported in Cohen (2005)

27 In particular traffic fatalities and counts of trafficaccidents in Israel fell by 11 percent and 18 percent during1998 and 1999 respectively

28 Such biased marketing efforts will bias consumersagainst choosing the low deductible thus making them lookless risk averse This would make our estimate a lowerbound on the true level of risk aversion If only sophisti-cated consumers could see beyond the marketing effort andthis sophistication were related to observables (eg educa-tion) the coefficients on such observables would be biasedupward This is not a major concern given that the coefficientson most of the covariates are fairly stable when we estimatethe benchmark model separately for different years

759VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

TABLE 3mdashNO HETEROGENEITY IN RISK

Variable

Poisson regressiona Probit regressionsc

Dep var Number of claims Dep var 1 if low deductible

(1) (2) (3)

Coeff Std Err IRRb Coeff Std Err dPdX dPdXDemographics Constant 43535 02744 mdash 224994 23562 mdash

Age 00074 00052 09926 01690 00464 00034 00042Age2 989 105 536 105 10001 00017 00005 339 105 455 105Female 00456 00161 09555 08003 01316 00165 00134Family Single omitted omitted omitted

Married 01115 00217 08945 06417 01950 00128 00045Divorced 00717 00346 10743 03191 03127 00064 00044Widower 00155 00527 10156 03441 04573 00069 00025Other (NA) 01347 01755 11442 29518 19836 00511 00370

Education Elementary 00522 00550 09491 02927 04278 00061 00016High school omitted omitted omittedTechnical 00373 00308 10380 05037 02520 00105 00138College 00745 00197 09282 05279 01546 00109 00053Other (NA) 00116 00184 10116 00630 01540 00013 00025

Emigrant 00210 00163 10213 00497 01328 00010 00003

Car attributes Log(Value) 01227 00281 11306 13285 02342 00271 00299Car age 00187 00042 10189 01603 00357 00033 00017Commercial car 01394 00326 08699 09038 02781 00177 00294Log(engine size) 02972 00459 13461 06924 03952 00141 00075

Driving License years 00204 00034 09798 01043 00312 00021 00005License years2 00002 677 105 10002 00015 00005 299 105 167 105

Good driver 00176 00191 09825 07207 01574 00148 00152Any driver 00564 00169 10580 10923 01321 00217 00258Secondary car 00859 00209 09177 00038 01626 00001 00070Business use 01852 00293 12034 07951 02737 00156 00017History length 00527 00110 09486 11218 01450 00228 00171Claims history 06577 00390 19304 04654 05460 00095 00496

Young driver Young driver 05235 00361 16879 28847 06305 00524 00012Gender Male omitted omitted omitted

Female 01475 00288 08629 17959 03135 00396 00195Age 17ndash19 omitted omitted omitted

19ndash21 00701 00532 10726 07800 07509 00153 0014721ndash24 00267 00574 09737 05746 07773 00114 0015624 02082 00567 0812 17869 07328 00397 00136

Experience 1 omitted omitted omitted1ndash3 02416 00458 07854 10175 06577 00217 000093 02827 00532 07538 32513 07386 00762 00410

Company year First year omitted omitted omittedSecond year 00888 00198 09150 44513 01506 00792 00859Third year 00690 00222 09334 85888 01820 01303 01367Fourth year 01817 00232 08339 118277 02102 01616 01734Fifth year 05431 0028 05810 143206 02778 01920 02085

105586Obs 105800 94000 105800Pseudo R2 00162 01364 01296Log likelihood 577459 369595 430869

Significant at the 5 percent confidence levela Maximum likelihood estimates Variation in exposure (policy duration) is accounted forb IRR Incidence rate ratio Each figure should be interpreted as the increasedecrease in claim probability as a result of an increase of one unit in the

right-hand-side variablec There are two separate probit regressions reported in this table Column 2 relies on the deductible choice model and the lognormality assumption As discussed

in the text by including an additional regressor ln([pi((xi)di) 1]d i) (with (xi) predicted from column 1 above) and normalizing its coefficient to 1 we obtaina structural interpretation of this regression with ln (ri) as the dependent variable Thus the reported coefficients are comparable to those estimated for the benchmarkmodel One should be cautious however in interpreting these coefficients Unlike the benchmark model this regression does not allow unobserved heterogeneity inrisk and suffers from some selection bias because observations with high predicted risk rate are omitted (which is why the number of observations is 94000 ratherthan the full sample of 105800) For comparison column 3 reports the marginal effects from a comparable probit regression that uses the full sample and does notcontrol for pricing and predicted risk through the additional structural regressor Column 3 does not have a structural interpretation and its (unreported) coefficientscannot be compared to those estimated from the benchmark model

760 THE AMERICAN ECONOMIC REVIEW JUNE 2007

and the risk rate (xi) as predicted from thePoisson regression of column 1 All three addi-tional regressors enter with the predicted signand with large and highly significant marginaleffects29

Finally column 2 of Table 3 presents animportant specification of the probit regres-sion in which ln([pi(( xi)di) 1]d i) isadded as a regressor and its coefficient isnormalized to 130 As already mentioned ifwe assume that there is no unobserved heter-ogeneity in risk then column 2 is analogousto the ln(ri) equation of the benchmark modelThis restriction of the model is rejected by thedata About 10 percent of the individuals arepredicted to have (xi) (pi di) implying achoice of low deductible for any level of riskaversion Many of these individuals howeverstill choose higher deductible levels Column2 reports the results for the remaining indi-viduals (94000 out of 105800) for which theregression can be run While the signs of theestimated coefficients are similar to those inthe benchmark model presented below therestricted version of the model suggests muchhigher levels and dispersion of risk aversionwell above any reasonable level31 The fullestimation of the benchmark model clearly re-jects this restriction on the model

B Estimation of the Benchmark Model

The Effect of Individual Characteristics onRisk AversionmdashTable 4 presents the estimationresults from the benchmark model The secondcolumn shows how the level of absolute riskaversion is related to individual characteristicsAs the dependent variable is in natural loga-

rithm coefficients on dummy variables can bedirectly interpreted as approximate percentagechanges The vast majority of these coefficientsare quite stable across a wide range of specifi-cations which are mentioned later

The results indicate that women are more riskaverse than men and have a coefficient of ab-solute risk aversion about 20 percent greaterthan that of men These results are consistentwith those of Donkers et al (2001) and Hartoget al (2002) The estimated effect of age sug-gests a nonmonotone pattern of risk preferencesover the life cycle The estimated coefficientsimply that initially (that is at age 18 the youngestindividual in the data) individuals become less riskaverse with age but around the age of 48 indi-viduals start becoming more risk averse32 Mar-ried individuals are estimated to be significantlymore risk averse compared to singles while di-vorced individuals are less (although the coeffi-cient is statistically insignificant)

The analysis suggests that variables that arelikely to be correlated with income or wealthsuch as post-high-school education and thevalue of the car have a positive coefficientindicating that wealthier people have higherlevels of absolute risk aversion Although we donot have data on individualsrsquo income or wealthwe use other proxies for income in additionalspecifications and obtain mixed results Whenwe include as a covariate the average householdincome among those who live in the same zipcode we obtain a significant and negative co-efficient of 0333 (0154) (standard deviationin parentheses) When we match zip code in-come on demographics of the individuals thecoefficient is effectively zero 0036 (0047)and when we use a socioeconomic index ofthe locality in which the individual lives thecoefficient is positive 0127 (0053)33 Thus

29 The estimated marginal effect (z-statistic in parenthe-ses) is 0352 (1376) 16 104 (1481) and 0154(255) for (pidi) d i and (xi) respectively

30 The level is a normalization The sign is estimatedHad the sign on this regressor been positive this would haveimplied a rejection of the model

31 For the implied median level of risk aversion the re-stricted model produces a similar estimate to the estimate wereport below for the benchmark model However since someindividuals who chose the low deductible are estimated to havevery low claim rates the restricted model is ldquoforcedrdquo to estimatevery high risk aversion for these individuals (in contrast thebenchmark model can explain such choices by positive risk re-siduals) resulting in very high dispersion and (due to the lognor-mality assumption) very high average risk aversion which isabout 1025 higher than the benchmark estimates we report below

32 This nonmonotone pattern may explain why age enterswith different signs in the estimation results of Donkers et al(2001) and Hartog et al (2002) A somewhat similar U-shapepattern with respect to age is also reported by Sumit Agarwalet al (2006) in the context of consumer credit markets

33 The full results from these specifications are providedin the online Appendix (available at httpwwwe-aerorgdatajune0720050644_apppdf ) The reason we do not usethese income variables in the benchmark specification istheir imperfect coverage which would require us to omitalmost 20 percent of the individuals Other results fromthese regressions are similar to those we report for thebenchmark specification

761VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

overall we interpret our findings as suggestiveof a nonnegative and perhaps positive associ-ation between incomewealth and absolute riskaversion

At first glance these results may appear tobe inconsistent with the widely held beliefthat absolute risk aversion declines withwealth One should distinguish however be-tween two questions (a) whether for a givenindividual the vNM utility function exhibitsdecreasing absolute risk aversion and (b)how risk preferences vary across individuals

Our results do not speak to the first questionand should not be thought of as a test of thedecreasing absolute risk aversion propertyTesting this property would require observingthe same individual making multiple choicesat different wealth levels Rather our resultsindicate that individuals with greater wealthhave utility functions that involve a greaterdegree of risk aversion It might be thatwealth is endogenous and that risk aversion(or unobserved individual characteristics thatare correlated with it) leads individuals to

TABLE 4mdashTHE BENCHMARK MODEL

Variable Ln() equation Ln(r) equation Additional quantities

Demographics Constant 15406 (00073) 118118 (01032) Var-covar matrix ()

Age 00001 (00026) 00623 (00213) 01498 (00097)Age2 624 106 (263 105) 644 104 (211 104) r 31515 (00773)Female 00006 (00086) 02049 (00643) 08391 (00265)Family Single Omitted Omitted

Married 00198 (00115) 01927 (00974) Unconditional statisticsa

Divorced 00396 (00155) 01754 (01495) Mean 02196 (00013)Widower 00135 (00281) 01320 (02288) Median 02174 (00017)Other (NA) 00557 (00968) 04599 (07397) Std Dev 00483 (00019)

Education Elementary 00194 (00333) 01283 (02156) Mean r 00019 (00002)High school Omitted Omitted Median r 727 106 (756 107)Technical 00017 (00189) 02306 (01341) Std Dev r 00197 (00015)College 00277 (00124) 02177 (00840) Corr(r ) 02067 (00085)Other (NA) 00029 (00107) 00128 (00819)

Emigrant 00030 (00090) 00001 (00651) Obs 105800

Car attributes Log(value) 00794 (00177) 07244 (01272)Car age 00053 (00023) 00411 (00176)Commercial car 00719 (00187) 00313 (01239)Log(engine size) 01299 (00235) 03195 (01847)

Driving License years 00015 (00017) 00157 (00137)License years2 183 105 (351 105) 148 104 (254 104)Good driver 00635 (00112) 00317 (00822)Any driver 00360 (00105) 03000 (00722)Secondary car 00415 (00141) 01209 (00875)Business use 00614 (00134) 03790 (01124)History length 00012 (00052) 03092 (00518)Claims history 01295 (00154) 00459 (01670)

Young driver Young driver 00525 (00253) 02499 (02290)Gender Male Omitted mdash

Female 00355 (00061) mdashAge 17ndash19 Omitted mdash

19ndash21 00387 (00121) mdash21ndash24 00445 (00124) mdash24 00114 (00119) mdash

Experience 1 Omitted mdash1ndash3 00059 (00104) mdash3 00762 (00121) mdash

Company year First year Omitted OmittedSecond year 00771 (00122) 14334 (00853)Third year 00857 (00137) 28459 (01191)Fourth year 01515 (00160) 38089 (01343)Fifth year 04062 (00249) 39525 (01368)

Note Standard deviations based on the draws from the posterior distribution in parentheses Significant at the 5 percent confidence levela Unconditional statistics represent implied quantities for the sample population as a whole ie integrating over the distribution of covariates in the sample (as

well as over the unobserved components)

762 THE AMERICAN ECONOMIC REVIEW JUNE 2007

save more to obtain more education or totake other actions that lead to greaterwealth34

Let us make several additional observationsFirst while owners of more expensive cars ap-pear to have both higher risk exposure andhigher levels of risk aversion owners of biggercars have higher risk exposure but lower levelsof risk aversion This should indicate that thestructure of the model does not constrain therelationship between the coefficients in the twoequations Rather it is the data that ldquospeak uprdquoSecond individuals who are classified by theinsurer as ldquogood driversrdquo indeed have lowerrisk but also appear to have lower risk aversionThis result is somewhat similar to the positivecorrelation between unobserved risk and unob-served risk aversion which we report belowThird policyholders who tend to use the car forbusiness are less risk averse This could bebecause uninsured costs of accidents occurringto such policyholders are tax deductible Fourthpolicyholders who reported three full years ofclaims history are more risk averse but are notdifferent in their risk exposure The attitude thatleads such policyholders to comply with therequest to (voluntarily) report three full years ofclaims history is apparently and not surpris-ingly correlated with higher levels of risk aver-sion In contrast while past claims indicate highrisk they have no significant relationship withrisk aversion Finally we find a strong trendtoward lower levels of risk aversion over timeThis is a replication of the probit results dis-cussed earlier

The Effect of Individual Characteristics onClaim RiskmdashThe first column of Table 4 de-scribes the relationship between observablesand risk exposure Accident risk is higher fordivorced individuals and lower for people witha college education Bigger older more expen-sive and noncommercial cars are all morelikely to be involved in an accident Drivingexperience reduces accident rates as do mea-sures of less intense use of the car while youngdrivers are more exposed to risk Claim propen-sity is highly correlated over time the voluntaryreport of past claims is a strong predictor offuture claims This risk equation produces re-sults that are similar to those of the simplerPoisson regression reported in Table 3 Al-though some of the coefficients lose signifi-cance the magnitude of most coefficients isquite similar The similarity between these twosets of results is to be expected as the riskregression is identified primarily from the dataon claims so incorporating the information ondeductible choice does not qualitatively changethe conceptual identification strategy (see Sec-tion IIC) If the results were not similar thiscould have indicated a misspecification of themodel The slight differences between the riskregressions in Table 3 and Table 4 are drivenmainly by the structural assumptions First thebenchmark model estimates a lognormal mix-ture of Poisson rates rather than a single Pois-son model By incorporating the fatter tails ofthe claim distribution it slightly changes theresults increases the standard errors and de-creases the average predicted claim rate Sec-ond the information on deductible choiceslightly helps us in obtaining more precise es-timates through the correlation structure be-tween the error terms in the two equations

The Implied Level of Risk AversionmdashOne ofthe main goals of the estimation is to obtainmeasures of the level of risk aversion in thepopulation we study Since we use Gibbs sam-pling and augment the latent coefficients ofabsolute risk aversion we can directly obtainthe posterior distribution of various moments ofthis distribution At each iteration of the Gibbssampler we compute the mean standard devi-ation (across individuals) and various percen-tiles of the simulated draws of i and ri and thecorrelation between them The far-right columnof Table 4 reports the averages and standard

34 One may be tempted to interpret the positive wealtheffects as an indirect indication of credit constraints wealth-ier individuals are less credit constrained and therefore canafford to purchase more insurance We do not share thisinterpretation for two reasons First the insurance companyobserves these proxies for wealth and conditions on themwhen setting prices Since the willingness to pay for insur-ance is likely to be correlated with the willingness to pay forthe additional insurance provided by the low deductibleoption premiums already reflect this variation We condi-tion on the level of the premium Second paying less exante implies paying more ex post so applying the creditconstraint argument only to the ex ante payment but not tothe probabilistic ex post deductible payments has no theo-retical foundation Essentially the setup of the model linksthe ex ante decisions to the ex post losses which are bothdriven by the curvature of the vNM utility function

763VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

deviations of these computed quantities over theiterations of the Gibbs sampler35 The impliedrisk aversion of the mean individual is 00019which is about seven times greater than theback-of-the-envelope calculation presented inthe previous section As we assume a lognormal

distribution and estimate a relatively high dis-persion coefficient r the estimates also implysignificantly lower values for the median levelof risk aversion As shown later the qualitativepattern of these results is quite robust acrossspecifications

Table 5 summarizes our findings regardingthe level and dispersion of risk aversion Itpresents two ways to interpret the estimates weobtained from the benchmark model as well ascomparisons to specifications with constant ab-solute risk aversion (CARA) utility and with

35 Note that these estimated quantities are unconditionalIn computing these quantities we integrate over the distri-bution of observable characteristics in the data so onecannot compute these estimates from the estimated param-eter directly

TABLE 5mdashRISK-AVERSION ESTIMATES

Specificationa Absolute risk aversionb Interpretationc Relative risk aversiond

Back-of-the-envelope 10 103 9070 1484Benchmark model

Mean individual 67 103 5605 972225th percentile 23 106 9998 003Median individual 26 105 9974 03775th percentile 29 104 9714 42790th percentile 27 103 7834 390295th percentile 99 103 4937 14327

CARA utilityMean individual 31 103 7651 4436Median individual 34 105 9966 050

Learning modelMean individual 42 103 6886 6140Median individual 56 106 9995 008

Comparable estimatesGertner (1993) 31 104 9699 479Metrick (1995) 66 105 9934 102Holt and Laury (2002)e 32 102 2096 86575Sydnor (2006) 20 103 8329 5395

a The table summarizes the results with respect to the level of risk aversion ldquoBack-of-the-enveloperdquo refers to thecalculation we report in the beginning of Section III ldquobenchmark modelrdquo refers to the results from the benchmark model(Table 4) ldquoCARA utilityrdquo refers to a specification of a CARA utility function and ldquolearning modelrdquo refers to a specificationin which individuals do not know their risk types perfectly (see Section IIID) The last four rows are the closest comparableresults available in the literature

b The second column presents the point estimates for the coefficient of absolute risk aversion converted to $US1 unitsFor the comparable estimates this is given by their estimate of a representative CARA utility maximizer For all otherspecifications this is given by computing the unconditional mean and median in the population using the figures we reportin the mean and median columns of risk aversion in Table 6 (multiplied by 352 the average exchange rate during the periodto convert to US dollars)

c To interpret the absolute risk aversion estimates (ARA) we translate them into x u(w) 1frasl2 u(w 100) 1frasl2 u(w x) That is we report x such that an individual with the estimated ARA is indifferent about participating in a 50ndash50lottery of gaining 100 US dollars and losing x US dollars Note that since our estimate is of absolute risk aversion the quantityx is independent of w To be consistent with the specification we use a quadratic utility function for the back-of-the-envelopebenchmark and learning models and use a CARA utility function for the others

d The last column attempts to translate the ARA estimates into relative risk aversion We follow the literature and do soby multiplying the ARA estimate by average annual income We use the average annual income (after tax) in Israel in 1995(51168 NIS from the Israeli Census) for all our specifications and we use average disposable income in the United Statesin 1987 ($15437) for Gertner (1993) and Metrick (1995) For Holt and Laury (2002) and Sydnor (2006) we use a similarfigure for 2002 ($26974)

e Holt and Laury (2002) do not report a comparable estimate The estimate we provide above is based on estimating aCARA utility model for the 18 subjects in their experiment who participated in the ldquo90rdquo treatment which involved stakescomparable to our setting For these subjects we assume a CARA utility function and a lognormal distribution of theircoefficient of absolute risk aversion The table reports the point estimate of the mean from this distribution

764 THE AMERICAN ECONOMIC REVIEW JUNE 2007

incomplete-information (discussed later) and toother comparable figures in the literature Ourbenchmark estimate suggests that an averagequadratic utility maximizer36 will be indifferentabout participating in a 50ndash50 lottery in whichhe gains $100 or loses $5605 A CARA spec-ification suggests a significantly lower value foraverage risk aversion making the mean individ-ual indifferent about a 50ndash50 lottery of gaining$100 or losing $7651 The results from theincomplete information model for the meanlevel of risk aversion are in between these esti-mates All the results suggest that although themean individual exhibits a significant level ofrisk aversion heterogeneity in risk preferencesis important and the median individual is al-most risk neutral with respect to lotteries of$100 magnitude Although the mean is alwaysgreater than the median under the lognormaldistribution the large difference we find is notimposed In principle we could have obtained ahigh level of risk aversion with less heteroge-neity thereby leading to a smaller differencebetween the mean and the median (the esti-mated distribution of risk types is an example)

Let us briefly discuss the relevance of thecomparison to Gertner (1993) and Metrick(1995) There are two ways one can reconcilethe differences between the estimates Firstboth these papers measure risk aversion fortelevision game show participants these arehighly selected groups in a rather ldquorisk-friendlyrdquo environment37 Second the magni-tudes of the stakes are higher The showparticipants bet on several thousand dollars andmore while our average individual risks muchlower stakes in the range of $100 Thus thedifference in the results may be due to the issuesraised in Rabin (2000) regarding the compara-bility of behavior across different contexts andbet sizes We discuss this further in Section IV

A different way to quantify our estimate is byreporting them in relative terms There is noconsensus in the literature as to the relevantwealth that is taken into account in such deci-

sions Therefore for comparability we followthe closest papers in this respect (eg Gertner1993) and use annual income as the relevantwealth We multiply the estimated coefficient ofabsolute risk aversion by the average annualincome in Israel during the observation periodUnder the (questionable) assumption that an-nual income is a good proxy for the relevantwealth at the time of decision making thisproduct would be a proxy for the coefficient ofrelative risk aversion As Table 5 indicates ourbenchmark specification results in an impliedcoefficient of relative risk aversion of about 97A CARA specification results in a lower coef-ficient of 44 On the other hand the medianestimate for relative risk aversion is well belowone Thus the widely used estimate of a lowsingle-digit coefficient of relative risk aversionfalls somewhere between the median and themean and between the median and the seventy-fifth percentile of the risk aversion distribution

The Relationship between Unobserved Riskand Unobserved Risk AversionmdashTable 4 al-lows us to make observations about the relation-ship between risk and risk aversion We firstdiscuss the relative importance of unobservedheterogeneity of both dimensions In the popu-lation we study unobserved heterogeneity inrisk aversion (r) is much greater than unob-served heterogeneity in risk () This is trueboth in absolute terms (315 compared to 015respectively) and after normalizing by the cor-responding mean level38 using the coefficientof variation as a measure of dispersion (027compared to 01 respectively) It is also true forthe overall unconditional dispersion This couldindicate that selection on risk aversion is moreimportant in our data than adverse selection39 Theright metric to use for such statements is not

36 For such an individual the second-order Taylor ex-pansion we use in Section IIA is exact

37 We suspect that individuals who participate in televi-sion game shows are more adventuresome than the generalpopulation Moreover knowing that the audience mightwish to see them keep betting is likely to further encourageparticipants to take risks

38 All covariates are measured in deviations from theirsample mean so the estimated constant in each equation isthe estimated mean of the left-hand-side variable

39 An additional observation is that given our estimatesobservables account for slightly less than 50 percent of thevariation in ln(ri) but for almost 65 percent of the variation inln(i ) This may seem surprising given the finding that disper-sion in risk aversion is more important and thus should be thefocus of insurance companies However this finding is con-sistent with the conventional wisdom that insurance companiesspend much effort and resources on collecting information thathelps in risk classification but only little effort on informationthat predicts willingness to pay

765VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

entirely clear however as one should projectthese estimated variances onto the same scaleof say willingness to pay or profits Profits forexample are affected directly by risk but not byrisk aversion so the comparison above could bemisleading Therefore we relegate the discus-sion of this issue to Section IIIF where weshow that even when we look at pricing andprofits heterogeneity in risk aversion is moreimportant

Table 4 also indicates a strong and signif-icant positive correlation of 084 between un-observed risk aversion and unobserved riskThis result may be surprising because it isnatural to think that risk aversion with respectto financial decisions is likely to be associatedwith a greater tendency to take precautionsand therefore with lower risk Indeed a recentpaper by Finkelstein and McGarry (2006)supports such intuition by documenting anegative correlation between risk aversionand risk in the market for long-term careinsurance (see also Israel 2005 for the autoinsurance market in Illinois) Our marketmight however be special in ways that couldproduce a positive correlation First in con-trast to most insurance markets where apolicyholderrsquos risk depends on the policy-holderrsquos precautions but not on the precau-tions of others accident risk in the autoinsurance market is a result of an interactionbetween onersquos driving habits and those ofother drivers Second the correlation coeffi-cient may be highly sensitive to the particularway we measure risk and risk aversion Thereare many unobserved omitted factors that arelikely to be related to both dimensions Theintensity of vehicle use for example mightbe an important determinant of risk If indi-viduals who are more risk averse also drivemore miles per year a positive correlationbetween risk and risk aversion could emergeThus our results caution against assumingthat risk and risk aversion are always nega-tively correlated Whether this is the case maydepend on the characteristics of the particularmarket one studies and on the particular mea-sure for risk Indeed one can use estimatedannual mileage to control for one omittedvariable that may potentially work to producea positive correlation between risk aversionand risk Despite its partial coverage in thedata and being considered (by the company)

as unreliable40 controlling for annual mileagereported by policyholders reduces the esti-mated correlation coefficient to 06841 Weview this result as consistent with the possi-bility that underlying unobserved factors thataffect risk play an important role in generat-ing the estimated positive correlation betweenrisk and risk aversion A third explanation forthe positive estimated correlation is the dis-tributional assumption As discussed in Sec-tion IIC the correlation coefficient isprobably the parameter that is most sensitiveto these assumptions Indeed as discussedlater when we change our assumptions aboutthe Poisson process and use an alternativedistribution with extremely thin tails the es-timated correlation coefficient reverses signsWe of course view the assumptions of thebenchmark model as more appropriate andtherefore maintain the view that the data sug-gest a positive correlation Finally note thatwhile the correlation parameter we estimate ishigh the implied unconditional correlationbetween risk and risk aversion is less than025 across all reported specifications This isbecause the coefficients on the same covariate(for example the size of the car or whetherthe car is used for business) often affect riskand risk aversion in opposite directions andbecause of the logarithmic transformation

C Stability of the Risk Aversion Coefficients

Estimating risk preferences is motivated bythe idea that the same (or similar) risk aversionparameter may explain risky decisions acrossmultiple contexts This idea is at the heart of thecurrent debate we mention in the introductionregarding the empirical relevance of expectedutility and whether the standard construct of a

40 Insurance companies typically do not use these self-reported mileage estimates as they are considered unreli-able While companies could verify these estimates at thetime of a claim such reports are hard to enforce Anindividual can always claim that her ex ante estimate waslower than it turned out to be Indeed the estimated elas-ticity of risk with respect to estimated mileage is 006(0012) which seems low suggesting a bias downwardpotentially due to ldquoerrors in variablesrdquo bias

41 The full results from this specification are providedin the online Appendix Other results from this specifi-cation are similar to those we report for the benchmarkspecification

766 THE AMERICAN ECONOMIC REVIEW JUNE 2007

single risk aversion parameter that appliesacross many types of decisions is the right wayto think about consumer behavior To the bestof our knowledge there is no direct empiricalevidence on this issue While we defer much ofthis discussion to the concluding section weverify below that at least over a limited set ofchoices in which we observe the same individ-uals the estimated risk preferences help to ex-plain choices over time and across differentcontexts

Stability across ContextsmdashIdeally onewould like to show that the estimated risk pref-erences are stable across multiple lines of insur-ance and perhaps even across other riskydecisions individuals face Absent data on suchchoices we provide evidence that the estimatedrisk aversion coefficients help in predictingother related risky choices Individuals in ourdata had to make three additional coveragechoices in addition to the main policy choicethat we have analyzed so far They had tochoose whether to purchase coverage for caraudio equipment (bought by 498 percent of thesample of new policyholders) for towing ser-vices (bought by 929 percent) and for wind-shield damage (bought by 954 percent) Thesetypes of coverage are sold by the same companybut are administered by a third party so we donot have information about their realizationTherefore we cannot perform a similar exerciseto the one we perform for the deductible choiceOf course one should not expect these addi-tional coverage choices to be perfectly corre-lated with the deductible choice Even if thesame risk preferences are an important factorfor all of these decisions variation in risk acrosscoverage is also important For example own-ership of an expensive car audio system is likelyto affect the purchase of audio coverage inde-pendent of risk preferences Similarly ownersof old cars may value more coverage for towingservices Risk preferences however should en-ter into all of these coverage choices and there-fore we expect these additional types ofcoverages to be positively correlated with thechoice of a low deductible

We coded these three additional coveragechoices as three dummy variables and use themas additional covariates to verify that they canhelp explain unobserved risk aversion Firstwe add these variables to the probit regression

reported in column 3 of Table 3 They all havea positive statistically and economicallysignificant power in predicting the deductiblechoice42 To verify that these reported correla-tions are not driven by correlation in risks wealso estimate the benchmark model with theseadditional variables as covariates The esti-mated coefficients on the three additional cov-erage choices are positive and two of the threeare significant in both the risk and risk aversionequations they are 0053 (0019) 00002 (0021)and 0057 (0023) in the risk equation and0354 (0064) 0123 (0122) and 0813 (0174)in the risk aversion equation for audio towingand windshield coverage respectively (standarddeviations in parentheses)43 This suggests thatthe estimated risk preferences help in explainingmultiple coverage choices across these relatedcontexts

Stability over TimemdashWe now provide evi-dence that the estimated risk preferences arealso stable over time All the individuals whodecide to renew their policy with the companyafter it expires are at least in principle free tochange their deductible choice However anoverwhelming majority of individuals (morethan 97 percent) do not change their deductiblechoices when they renew Even individuals whoinitially chose the rarely chosen ldquohighrdquo andldquovery highrdquo deductible levels typically do notchange their choices Therefore it is also notsurprising that estimating the model on the firstdeductible choice of individuals and the seconddeductible choice of the same individuals yieldssimilar results as shown at the end of the nextsection Of course changes in deductiblechoices do not necessarily imply changes inrisk preferences Risk may also change overtime and may drive such changes At thesame time while persistence in deductiblechoices over time is consistent with stablerisk preferences it may also be driven bymany other factors For example it is reason-able to think that individuals do not devote

42 They are estimated to increase on average the prob-ability of choosing a low deductible by 0019 (819) 0023(527) and 0041 (758) for audio towing and windshieldcoverage respectively (z stat in parentheses)

43 The full results from this specification are provided in theonline Appendix Other results from this specification aresimilar to those we report for the benchmark specification

767VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

the same amount of thought to their renewaldecision as to their initial choice leading tothe observed persistence This caveat is themain reason why we focus most of the analysisonly on new policies

D Robustness

This section discusses the robustness of themain results Results from all specifications wemention in this section are summarized in Table6 The full results from these specifications arerelegated to the online Appendix

The von NeumannndashMorgenstern Utility Func-tionmdashTo derive the benchmark model we as-sume a negligible third derivative of the vNMutility function This is attractive as it allowsus to summarize risk preferences by a one-dimensional parameter r [u13(w)u(w)]There is however a large literature (eg MilesS Kimball and N Gregory Mankiw 1989) em-phasizing the importance of a (positive) third

derivative of the utility function which leads toprecautionary saving It is therefore importantto address the sensitivity of the results to thisassumption

By allowing a third-order Taylor approxima-tion of equation (4) we obtain

(15) r u13w

uw

p

d 1

d

uw

uw

dh2 dh dl dl

2

6d

which reduces to equation (7) when u(w) 0This illustrates that a positive third derivativeprovides an additional (precautionary) incentiveto insure It also shows that in order to fullydescribe preferences in the presence of a thirdderivative a one-dimensional parameter doesnot suffice one needs to know both r [u13(w)u(w)] and [u(w)u(w)] This makesit less attractive for estimation

TABLE 6mdashROBUSTNESS

Specificationa Sample Obs

Claim risk () Absolute risk aversion (r)

Corr(r ) Mean Median Std dev Mean Median Std dev

Baseline estimatesBenchmark model All new customers 105800 0220 0217 0048 19 103 73 106 0020 0207 0839

The vNM utility functionCARA utility All new customers 105800 0219 0217 0048 87 104 b 98 106 b 0009b 0201 0826

The claim generating processBenchmark model No multiple claims 103260 0182 0180 0040 20 103 28 105 0018 0135 0547Thinner-tail risk distribution All new customers 105800 0205c 0171c 0155c 17 103 19 106 0020 0076 0916

The distribution of risk aversionLower-bound procedure All new customers 105800 mdash mdash mdash 37 104 0 0002 mdash mdash

Incomplete information about riskBenchmark model Experienced drivers 82966 0214 0211 0051 21 103 83 106 0021 0200 0761Benchmark model Inexperienced drivers 22834 0230 0220 0073 30 103 12 107 0032 0186 0572Learning model All new customers 105800 0204 0191 0084 12 103 16 106 0016 0200 0772

Sample SelectionBenchmark model First two years 45739 0244 0235 0066 31 103 26 105 0026 0225 0699Benchmark model Last three years 60061 0203 0201 0043 16 103 34 107 0021 0113 0611Benchmark model Referred by a friend 26434 0213 0205 0065 30 103 84 107 0031 0155 0480Benchmark model Referred by advertising 79366 0219 0216 0051 21 103 76 106 0022 0212 0806Benchmark model Non-stayers 48387 0226 0240 0057 23 103 77 107 0026 0149 0848Benchmark model Stayers 1st choice 57413 0190 0182 0057 29 103 29 105 0024 0152 0463Benchmark model Stayers 2nd choice 57413 0208 0200 0065 30 103 16 105 0026 0211 0637

Note The table presents the key figures from various specifications and subsamples tracing the order they are presented in Section IIID Full results (in the format

of Table 4) from all these specifications are available in the online Appendixa ldquoBenchmark modelrdquo refers to the benchmark specification estimated on various subsamples (the first row replicates the estimates from Table 4) The other

specifications are slightly different and are all described in more detail in the corresponding parts of Section IIIDb The interpretation of r in the CARA model takes a slightly different quantitative meaning when applied to noninfinitesimal lotteries (such as the approximately

$100 stakes we analyze) This is due to the positive third derivative of the CARA utility function compared to the benchmark model in which we assume a small

third derivative Thus these numbers are not fully comparable to the corresponding figures in the other specificationsc The interpretation of in the thinner-tail distribution we estimate is slightly different from the standard Poisson rate which is assumed in the other specifications

Thus these numbers are not fully comparable to the corresponding figures in the other specifications

768 THE AMERICAN ECONOMIC REVIEW JUNE 2007

We can allow for a positive third derivativewithout expanding the dimensionality of pref-erences by imposing a parametric functionalform on the utility function Essentially suchparametric form imposes a relationship between[u13(w)u(w)] and [u(w)u(w)] Two standardforms are those that exhibit CARA and thosethat exhibit constant relative risk aversion(CRRA) CRRA requires us to make additionalassumptions about the relevant (and unob-served) wealth level of each individual makingit less attractive The CARA case is imple-mented below

With CARA utility we substitute u(w) exp(rw) in equation (4) and rearrange toobtain

(16) rp

exprdh exprdl

This equation defines the indifference set Unlikethe benchmark model there is no closed-formrepresentation for ri() a property that makesestimation significantly slower Due to the precau-tionary saving incentive arising from the thirdderivative we can use a lower level of absoluterisk aversion to rationalize the low deductiblechoice given In other words the CARA indif-ference set is flatter in comparison to the bench-mark case depicted in Figure 2 Thus the CARAspecification will generally lead to lower estimatesof the coefficient of absolute risk aversion asshown in Table 6 The mean and dispersion ofestimated absolute risk aversion (but not the me-dian) are smaller by about a factor of two Thegeneral qualitative results however remain thesame As shown in Table 5 the mean individual isquite risk averse with respect to lotteries of $100magnitude the median is close to risk neutral withrespect to such lotteries and heterogeneity in riskaversion is important

The Claim-Generating ProcessmdashIn thebenchmark model we assume that claims aregenerated by a Poisson model This assumptionis both analytically attractive (it is important inderiving equation (4)) and in our view a rea-sonable approximation of the data-generatingprocess for claims The Poisson distribution hasbeen widely used in economics to model acci-dent counts (eg Nancy L Rose 1990 Dionneand Vanasse 1992 Ronald Michener and CarlaTighe 1992) In using this distribution re-

searchers have followed the long history of theuse of Poisson by actuaries and insurance spe-cialists going back to Ladislaus Bortkiewicz(1898)

An important restriction of the Poisson dis-tribution is that its mean and variance are thesame Although some economic studies con-firmed this assumption for particular accidentdata (eg Christopher M Auld et al 2001) it isoften the case that this restriction is falsifiedThe most common deviation from the Poissonrestriction is that of fat tails ie variance that ishigher than the mean This led researchers touse a negative binomial distribution to accom-modate this regularity by introducing a secondparameter which delinks the relationship be-tween the mean and the variance One naturalinterpretation of the fat tails is that of unob-served heterogeneity and the negative binomialdistribution can be viewed as a Gamma mixtureof Poisson processes Consistent with this viewwe assume that the claim-generating processfollows a Poisson process at the individuallevel but allows unobserved heterogeneity inrisk and estimates a lognormal mixture of Pois-son processes which is similar to a negativebinomial The dispersion parameter we esti-mate is a free parameter which is identi-fied by the fatness of the tails of the claimdistribution

While one would like to allow fat tails of theaggregate claim distribution one may criticizethe assumption of a Poisson distribution at theindividual level for having tails that may be ldquotoofatrdquo Recently Jaap H Abbring et al (2003) andMark Israel (2004) provided evidence for neg-ative state dependence in data from auto insur-ance claims similar to the ones we useControlling for heterogeneity across individu-als these papers show that a second claim isless likely than a first This may happen due toexperience rating or perhaps to more carefuldriving and less intense use of a car after anaccident The Poisson distribution assumes thatthe second accident is just as likely so negativestate dependence may suggest thinner tails atthe individual level

To verify that the main results are not sensi-tive to this restriction of the Poisson model weperform two tests First we estimate the bench-mark model on a sample that includes onlyindividuals with one or no claims The model isstill identified using variation in policy durations

769VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

The estimates of the level of risk aversion andits dispersion across individuals remain similarto those of the benchmark model (Table 6) Theestimates of risk and the correlation with riskaversion are significantly lower But this isexpected we selected out of the sample thehigh-risk individuals

As a further test for the sensitivity of theresults to possible negative state dependencewe estimated the model with a different distri-bution at the individual level which gives riseto much thinner tails We could not find anyknown distribution of count variables that givesrise to tails thinner than Poissonrsquos so we madeone up In particular for a given Poisson ratelet pn be the probability of observing n occur-rences The distribution we take to the data issuch that

(17) Prn pn2

yenm0 pm2

Thus this distribution makes the probability ofmultiple claims much lower probably muchmore so than any realistic negative state depen-dence in the data Essentially such a distribu-tion makes the econometrician view individualswith multiple claims as high-risk individualswith much more certainty as it requires muchmore ldquobad luckrdquo for low-risk individuals tohave multiple claims The estimates for thelevel of risk aversion and its dispersion arealmost the same as in the benchmark model(Table 6) The rest of the summary figures aredifferent The difference in the level of risk isnot informative the interpretation of isslightly different given the change in the un-derlying distribution The dispersion in ismuch higher This is a direct result of the thin-tail assumption The dispersion of is identifiedby the tails of the claim distribution at theaggregate level With thinner tails imposed onthe individual distribution more heterogeneityis needed to match the observed tails at theaggregate Finally the correlation coefficient inthis specification changes signs and is now neg-ative and close to 1 This is consistent withour discussion in Section IIC which emphasizesthat the identification of the correlation coeffi-cient is closely tied to the structural assump-tions Once these are changed the estimatedcorrelation coefficient would change too Most

importantly however the mean and dispersionof risk aversion are stable

The Distribution of Absolute Risk Aver-sionmdashIn the benchmark model we assume thatthe coefficient of absolute risk aversion is lognor-mally distributed across individuals Since onlyfew previous studies focused on heterogeneity inrisk preferences there is not much existing evi-dence regarding the distribution of risk prefer-ences The only evidence we are aware of is theexperimental results presented by Steffen Andersenet al (2005) which show a skewed distributionwith a fat right tail which is qualitatively consis-tent with the lognormal distribution we assume

An additional advantage of the normality as-sumption is computational It provides a closed-form conditional distribution allowing us to useonly univariate (rather than bivariate) draws inthe estimation procedure significantly reducingthe computational burden One may be con-cerned about the sensitivity of the results to thisdistributional assumption For example it maydrive the result that the median level of riskaversion is much lower than the mean

Incorporating alternative distributional as-sumptions significantly complicates and slowsthe estimation procedure44 As an alternativewe develop a procedure that we believe pro-vides some guidance as to the sensitivity of theresults to this distributional assumption Thedisadvantage of the procedure is that it cannotaccount for adverse selection Since we foundthat adverse selection is not that important thisexercise is informative The exercise conveysthat the main qualitative results are not drivenby the (fat) tails of the lognormal distributionwe impose Rather they are driven by the highfraction of individuals who chose a low deduct-ible despite being of low risk The model im-plies that such individuals must have a fairlyhigh level of risk aversion

The exercise uses a Gibbs sampler to estimatethe lognormal distribution of i given the co-variates and the observed number of claims Ineach iteration of this Gibbs sampler conditional

44 A simple discrete-type distribution of risk aversion isnot well identified with the given data This is due to thenature of the exercise The distribution is identified bythe binary deductible choice and by the ldquolocalrdquo variation inthe pricing menu This variation is not enough to pin downthe discrete types accurately

770 THE AMERICAN ECONOMIC REVIEW JUNE 2007

on the data and the most recent draws for theindividual irsquos we compute lower bounds forthe level and dispersion of risk aversion (inaddition to generating the draws for ri from theconditional lognormal distribution) To deter-mine the lower bound for the level of risk aver-sion we compute

(18) r i i 0 if choicei 0

max0 pi

idi 113d i

if choicei 1

Namely we make ri as low as possible giventhe assumptions of the model We assume thatindividuals who chose a regular deductible arerisk neutral while individuals who chose a lowdeductible are just indifferent between the twodeductible levels (unless i is high enough inwhich case they are also assumed to be riskneutral) We then compute the average over therirsquos To compute the lower bound for the dis-persion we search for values of rirsquos that areconsistent with the assumptions of the modeland the observed data and that minimize thevariance of ri This turns out to be a simplesearch procedure that is linear in the number ofindividuals

The lower bound of the level of risk aversionthat we obtain from this exercise is 368 104

NIS1 (with standard deviation of 638 106)This level is five times lower than the analogouspoint estimate reported in Table 4 This trans-lates into indifference about participating in a50ndash50 lottery of gaining $100 and losing $885Thus it still suggests a significant level of riskaversion for lotteries of $100 magnitude Simi-larly the conversion to relative risk aversion asin Table 5 implies a relative risk aversion co-efficient of 1883 The result for the lowerbound of the dispersion is 189 103 (withstandard deviation of 816 105) which is tentimes lower than the benchmark estimate Thusthe coefficient of variation for absolute riskaversion declines only by two suggesting ahigh degree of heterogeneity Finally oneshould note that these bounds involve extremeassumptions and are computed separately andtherefore cannot be binding at the same timeThus the ldquocorrectrdquo estimates for both quantities

are likely to be higher closer to the results wereport for the benchmark model

Incomplete Information about RiskmdashThroughout we assume that individuals haveperfect information about their individual-specific objective risk type i This is a strongerassumption than we need Because expectedutility is linear in probabilities it suffices thatindividualsrsquo expected risk rate is equal to theirobjective risk rate ie i E(iIi) i wherei is a random variable representing individualirsquos perceived risk rate and Ii is individual irsquosinformation at the time of the coverage choiceNamely individuals could be uncertain abouttheir risk type but their point estimate has to becorrect

There are several channels through whichincomplete information may operate Let usconsider two such cases First suppose thatindividuals are correct but only on average iethat i i i where E(i) 0 The intuitionfor this case is similar to an ldquoerrors in variablesrdquomodel and in principle will result in an evenless important role for adverse selection Giventhat we find a relatively small effect of adverseselection this bias will not change this conclu-sion This may be even more pronounced ifcorr(i i) 0 which reflects a reasonableassumption of ldquoreversion to the meanrdquo ie thatan individualrsquos estimate of his risk type is someweighted average between his true risk type andthe average risk type of individuals who aresimilar (on observables) to him The conclusionmay go in the other way only if the mistakes goin the other direction according to which indi-viduals who are riskier than average believe thatthey are even more risky than they truly areThis we believe is less plausible

As a test for the sensitivity of the results tothe complete information assumption we per-form two exercises The first exercise estimatesthe benchmark model separately for experi-enced drivers and inexperienced drivers wherewe define an individual to be an experienceddriver if he has ten years or more driving expe-rience This is a conservative definition com-pared to those used in the literature (Chiapporiand Salanie 2000 Cohen 2005) The underlyingassumption consistent with evidence providedin Cohen (2005) is that drivers learn about theirown risk types as they gain more driving expe-rience but that most of this learning occurs

771VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

within the first ten years of driving Thus ourcomplete information model is a more appeal-ing description of experienced drivers Theresults for experienced drivers are almost iden-tical to the results from the benchmark specifi-cation (Table 6) suggesting that the mainresults are not driven by the fraction of individ-uals who are less likely to know their risk typesThe results for inexperienced drivers show asimilar pattern and magnitude Consistent withthe learning hypothesis however they show alarger dispersion in risk preferences which maybe due to incomplete information and there-fore to more random choices of deductiblelevels

The second exercise we perform is a morestructural version of the learning story whichallows even experienced drivers to have incom-plete information regarding their risk types Wedo so by estimating a different specification ofthe model that assumes that individuals areBayesian and update their information abouttheir own (stable) risk types over time usingonly information about the number of claimsthey make each year While we do not observecomplete claims histories of individuals wecan simulate such histories and integrate overthese simulations Thus individualsrsquo informa-tion would be related to their true type andwould be more precise with longer driving his-tories The full modeling and estimation detailsof this specification are provided in the Appen-dix We view this model as an extreme versionof incomplete information as there are manyother sources through which individuals maylearn about their own types and thereby havebetter information about their types than whatwe estimate them to have While the resultsimply that the levels of risk aversion and heter-ogeneity are lower than the benchmark esti-mates (Table 6) the order of magnitude andqualitative pattern are quite similar suggestingthat the main qualitative findings are robust tothe information structure

Sample SelectionmdashThere are two differentways to think about the contract selection pro-cess One possibility is that individuals firstselect an insurer based on advertisement wordof mouth or access to an agent Then individ-uals select the insurance contract from amongseveral contracts the selected insurer offersThis selection process is consistent with the way

the company executives view their business45

Another possibility is that individuals first col-lect information about all available insurancecontracts and then choose their most preferredone According to the industryrsquos conventionalwisdom individuals do not shop much acrossinsurers and therefore this selection processseems less important

The unique characteristics of direct insurersmay attract individuals who are more likely toexperiment with new ways of doing businessand may therefore be less risk averse than thegeneral population In Table 7 we compare thedemographics of our sample of policyholderswith those of the general Israeli populationThis comparison reflects a similar intuitioncompared with the general population our av-erage policyholder is slightly younger moreeducated more likely to be male and less likelyto be married or an immigrant This direction ofselection may also apply to unobserved riskpreferences thereby making our policyholderson average less risk averse than a representa-tive individual This suggests that the level ofrisk aversion that we find may be viewed as alower bound on the level of risk aversion in thegeneral Israeli population

One potential way to model sample selectionis to allow for an additional outside option to beselected For the vast majority of individuals weobserve the outside option is to purchase sim-ilar insurance from competing insurance agen-cies Unfortunately data on the structure ofcompeting contracts their prices and the waythey vary with individual characteristics are un-available This makes us uncomfortable to try tomodel sample selection as results from anysuch model will be driven by our assumptionsrather than by meaningful variation in the dataThe results are still meaningful for two reasonsFirst this is a large population accounting forabout 7 percent of all drivers in Israel Secondto the extent that our estimates suggest higherlevels of risk aversion than previously esti-mated and that the sample selection is likely to

45 As discussed in Section I the company was the firstdirect insurance provider and it offered significantly lowerpremiums than those offered by competitors due to signif-icant cost advantage In Section I we also discuss theliterature which emphasizes that choice of direct insurers isdriven primarily by nonmonetary ldquoamenitiesrdquo

772 THE AMERICAN ECONOMIC REVIEW JUNE 2007

bias these estimates downward the results arestill informative

To assess the magnitude of sample selectionin driving the main results we perform severaltests First we estimate the benchmark modelseparately for the first two years of the compa-nyrsquos operation during which the company wasthe only company to sell insurance directly andfor the remaining three years after additionaldirect insurers entered the market Second weestimate the benchmark model separately forindividuals who were referred to the companyby word of mouth and for those who heardabout the company through advertisement (pri-marily on television) The latter may havesearched more and might be more price sensi-tive Third we estimate the benchmark modelseparately for individuals who renewed theirpolicy with the company and for those who didnot renew It seems likely that individuals whodid not renew are less selected as most of themswitch back to regular insurers who insure themajority of the population Fourth we estimatethe model for the second deductible choicemade by those individuals who renew It couldbe argued that switching to other companies is

costly so outside options are not as important indriving deductible choices for those who renewAll these results are summarized in Table 6 andshow some important differences between thetwo groups within each pair of subsamples prob-ably reflecting selection For all groups howeverthe qualitative pattern of the results and the orderof magnitude and dispersion of risk aversion aresimilar to those of the full sample This is sugges-tive that correcting for sample selection is unlikelyto change the qualitative results

E Caveats

Moral HazardmdashThroughout our analysiswe abstract from moral hazard ie we assumethat i can vary across individuals but is invari-ant to the coverage choice There are two typesof moral hazard that may play a role in thiscontext First individuals with less coveragemay take greater precaution and drive morecarefully thereby reducing their claim risk rateSecond conditional on a claim event peoplewith higher deductibles are less likely to file aclaim there exists a range of claims for which

TABLE 7mdashREPRESENTATIVENESS

Variable Sampleb Populationc Car ownersd

Agea 4114 (1237) 4255 (1801) 4511 (1413)Female 0316 0518 0367Family Single 0143 0233 0067

Married 0780 0651 0838Divorced 0057 0043 0043Widower 0020 0074 0052

Education Elementary 0029 0329 0266High school 0433 0384 0334Technical 0100 0131 0165College 0438 0155 0234

Emigrant 0335 0445 0447

Obs 105800 723615 255435

a For the age variable the only continuous variable in the table we provide both the mean and the standard deviation (inparentheses)

b The figures are derived from Table 1 The family and education variables are renormalized so they add up to one weignore those individuals for whom we do not have family status or education level This is particularly relevant for theeducation variable which is not available for about half of the sample it seems likely that unreported education levels arenot random but associated with lower levels of education This may help in explaining at least some of the gap in educationlevels across the columns

c This column is based on a random sample of the Israeli population as of 1995 We use only adult population ieindividuals who are 18 years of age or older

d This column is based on a subsample of the population sample The data provide information about car ownership onlyat the household level not at the individual level Thus we define an individual as a car owner if (a) the household owns atleast one car and the individual is the head of the household or (b) the household owns at least two cars and the individualis the spouse of the head of the household

773VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

filing is profitable only under a low deductible(the literature often refers to this second effectas ldquoex post moral hazardrdquo)

To the extent that moral hazard exists abstract-ing from it will likely bias our estimates of riskaversion downward To see this note that adjust-ing behavior will help individuals to self-insureagainst uninsured costs Similarly ex post moralhazard would reduce the value of a low deduct-ible as in the event of a claim the gain from alower deductible would sometimes be less thanthe difference between the two deductible levelsBoth these effects will make a low deductible lessattractive requiring individuals to be even morerisk averse than we estimate in order to purchasemore coverage Below we discuss why in ourview abstracting from moral hazard is a reason-able approximation in this setting

It seems reasonable to conjecture that ceterisparibus insured individuals will drive less care-fully than uninsured ones It may also seemreasonable that the existence of a deductiblemay make individuals more careful about smalldamages to their car However when all choicesinclude a deductible and deductibles are similarin their magnitude it seems less likely thatdriving or care behavior will be affected46 Fi-nally to separately identify moral hazard onewould need another dimension of the data overwhich risk types remain fixed but coveragechoices vary exogenously (Israel 2004 Jaap HAbbring et al forthcoming)

We rely on the data to justify why we abstractfrom the second potential effect that of ex postmoral hazard Figure 3 presents data on theclaim amounts and shows that about 99 percentof the claims filed by policyholders with low-deductible policies were for amounts greaterthan the higher deductible level If the distribu-tion of amounts of potential claims does notvary with the deductible choice and if individ-uals file a claim for any loss that exceeds their

deductible level this suggests that 99 percent ofthe claims would have been filed under eitherdeductible choice making the assumption toabstract from moral hazard not very restrictive

Individuals however may choose not to filea claim even when the claim amount exceedsthe deductible level This may happen due toexperience rating which increases future insur-ance premiums These dynamic effects do notdepend on the deductible level at the time of theclaim so they simply enter in an additive wayUsing our data on individuals who renew theirpolicies with the company we can assess howbig the dynamic effects are These data showthat the price effect of a claim lasts for threeyears and is highest when an individual files hissecond claim within a year In such a case hewould face about a 20 percent increase in hisinsurance premium in the subsequent year 10percent in the year after and 5 percent in thethird year after the claim The regular premiumis about twice the regular deductible amount soan upper bound for the dynamic costs is 70percent of the regular deductible In most casesthe actual dynamic costs are much lower thanthis upper bound (the dynamic costs of say thefirst claim within a year are close to zero) Inaddition an individual can opt out of the con-tract and switch to a different insurance pro-vider This is likely to reduce his dynamic costsbecause in Israel unlike in the United Statesand many other countries there is no publicrecord of past claims Therefore insurance pro-viders can take full advantage of past recordsonly for their past customers For this reasonnew customers will of course face higher pre-miums than existing ones but the premium in-crease would not be as high as it would havebeen with the old insurance provider This isdue to the presence of ldquoinnocentrdquo new custom-ers who are pooled together with the switchers(Cohen 2003)47 Using 70 percent as a conser-vative upper bound Figure 3 shows that about93 percent of those claims filed by individualswith a low deductible would have also been46 This assumption is consistent with Cohen and Einav

(2003) who find no evidence for behavioral response tochanges in seat belt laws The following anecdotal obser-vation also supports this In an informal survey we con-ducted among our colleagues all of them were aware of adeductible in their auto insurance policy but fewer than 20percent knew its level This does not imply that 80 percentof our colleagues did not pay attention to their deductiblechoice at the time the choice was made It does implyhowever that their driving or care behavior cannot dependon the deductible level

47 New customers may voluntarily report their claimshistory to their new insurance provider Voluntary disclo-sure of past claims is as may be expected not truthful Ourdata suggest an unconditional claim rate of 02453 in oursample population Our data on claims history as voluntar-ily disclosed by the same individuals suggest a claim rate of00604 which is four times lower

774 THE AMERICAN ECONOMIC REVIEW JUNE 2007

filed with a regular deductible While this is notnegligible it applies for only a small fraction ofindividuals For the vast majority of them the99 percent discussed above is a more appropri-ate benchmark Therefore ex post moral hazardis unlikely to play a major role in this settingand one can abstract from the loss distributionand focus on claim rates as we do in thispaper48

Additional Cost of an AccidentmdashOur modelassumes that in the event of an accident theonly incurred costs are those associated with thedeductible payment In practice however other

transaction costs may be incurred such as thetime spent for appraisal of the damage the costsassociated with renting a replacement car for theduration of a repair etc Such costs could bereadily incorporated into the model To illus-trate we assume that these costs are known inadvance and are given by a constant c (whichcould in principle vary with each individual)Since c will not vary with the chosen level ofdeductible it will not affect the value d andwill enter the empirical model only through itseffect on d In particular equation (7) willchange to

(19) r

p

d 1

d c

and everything else will remain the same

48 We would not be as comfortable with this statementfor the choice of high and very high deductibles which areat much higher levels This is one additional reason to focusonly on the choice between low and regular deductibles

FIGURE 3 CLAIM DISTRIBUTIONS

Notes This figure plots kernel densities of the claim amounts estimated separately depending on the deductible choice Forease of comparison we normalize the claim amounts by the level of the regular deductible (ie the normalization is invariantto the deductible choice) and truncate the distribution at ten (the truncated part which includes a fat tail outside of the figureaccounts for about 25 percent of the distribution and is roughly similar for both deductible choices) The thick line presentsthe distribution of the claim amounts for individuals who chose a low deductible while the thin line does the same for thosewho chose a regular deductible Clearly both distributions are truncated from below at the deductible level The figure showsthat the distributions are fairly similar Assuming that the claim amount distribution is the same the area below the thickerline between 06 and 1 is the fraction of claims that would fall between the two deductible levels and therefore (absentdynamic incentives) would be filed only if a low deductible were chosen This area (between the two dotted vertical lines)amounts to 13 percent implying that the potential bias arising from restricting attention to claim rate (and abstracting fromthe claim distribution) is quite limited As we discuss in the text dynamic incentives due to experience rating may increasethe costs of filing a claim shifting the region in which the deductible choice matters to the right an upper bound to these costsis about 70 percent of the regular deductible covering an area (between the two dashed vertical lines) that integrates to morethan 7 percent Note however that these dynamic incentives are a very conservative upper bound they apply to less than 15percent of the individuals and do not account for the exit option which significantly reduces these dynamic costs

775VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

This implies that in principle such costs willhave no effect on the results of the counterfac-tual exercise we present later The costs willhowever affect the interpretation of the esti-mates of risk aversion In particular instead ofthe distribution of r we will now be estimatingthe distribution of r(d c)d so the reportedestimates of the coefficient of absolute riskaversion will be biased upward The magnitudeof the bias depends on the size of this transac-tion cost c compared to the average deductibled If c is relatively small the bias is negligibleIf however c is as big as the (average) deduct-ible level all our reported estimates of the levelof risk aversion should be divided by two (butthe coefficients on observables which are semi-elasticities will not change) The intuitionwould be similar but more involved if c variesacross individuals but not proportionally to d

Data about such transaction costs are ofcourse not available The following back-of-the-envelope exercise may provide some guidanceas to the magnitude of such costs We collecteddata from the Israeli police about the annual num-bers of accidents49 accidents with fatalities andaccidents with severe injuries in Israel for theyears 1996ndash1999 We then divided these numbersby our estimate of the total number of auto insur-ance claims in Israel (161859)50 We obtain that159 percent of the claims involve reported acci-dents 22 percent involve accidents with severeinjuries and 03 percent involve fatal accidentsThus the majority of claims reflect small unre-ported accidents perhaps suggesting that theseadditional costs of a claim are often not too large

Deviations from Expected Utility TheorymdashThroughout the paper we restrict attention toexpected utility maximizers Despite much ev-idence in the literature against some of the pre-dictions of expected utility theory it still seemsto us the most natural benchmark to specify andone that facilitates comparison to previous stud-ies We note that expected utility theory is as-sumed it is not and cannot be tested within our

framework Given our cross-sectional analysiswhich in principle allows flexible forms ofunobserved heterogeneity in risk preferencesthere are no testable restrictions imposed byexpected utility theory We should also note thatmuch (but not all) of the documented evidenceagainst expected utility theory arises with ex-treme risk probabilities which are close to zeroor one Our data (and our estimates) are basedon risk probabilities that are all in the range of010 to 035 Over this range expected utilityseems to perform better Finally it is importantto stress two points First at the conceptuallevel it is straightforward to use an alternativetheory of decisions under uncertainty If con-ditional on objective risk individuals vary in asingle dimension the same conceptual modeland empirical strategy can be applied All oneneeds to do is to specify the parameter overwhich decisions vary and construct an indiffer-ence set in the space of the specified parameterand (objective) risk types similar to the onepresented in Figure 2 Second any alternativemodel of decisions under uncertainty would re-quire us to take an even stronger view regardingthe parameterized objective function For exam-ple prospect theory (Daniel Kahneman andAmos Tversky 1979) would require us to pa-rameterize not only the curvature of individu-alsrsquo utility functions but also their referencepoints for which there is no natural choice inour context Similar issues arise if we try toapply decision weights (Tversky and PeterWakker 1995) or measures of overconfidencewith respect to driving ability

F Implications for Profits and Pricing

We now look at how firm profits vary withalternative pricing schemes This exercise isinteresting for several reasons First althoughwe do not use supply-side information for esti-mation it shows how one may incorporate suchinformation in estimation Second it illustratesthe conceptual trade-off faced by a monopolistthat operates in a market with adverse selectionAlthough the conceptual trade-off betweenhigher demand and worse selection is wellknown and has been extensively analyzed in thetheoretical literature quantifying it is importantto understand its empirical relevance Finallywe argued earlier that unobserved heterogeneityin risk aversion seems more important than un-

49 An accident is counted in this measure if it was re-ported to the police and a police officer arrived at the scene

50 This is estimated by taking the number of activepolicies at the end of our sample (45871) dividing it by ourbest guess for the share of the market the company had atthe time (7 percent) and multiplying it by the estimatedclaim rate in our data as computed in Table 2 (0245)

776 THE AMERICAN ECONOMIC REVIEW JUNE 2007

observed heterogeneity in risk The current ex-ercise shows that this conclusion also translatesto pricing and profits

Throughout this section we hold fixed thedistribution of risk and risk aversion the firmfaces Since we have little information about thedeterminants of overall demand faced by thefirm and hence how restrictive it is to hold thedistribution fixed we make the simplifying as-sumption that individuals make their choicessequentially They first choose the insuranceprovider by observing only the price of theregular deductible Once they decide to buy apolicy from the insurer they choose the deduct-ible level This seems a reasonable approxima-tion as the regular deductible is the one alwaysadvertised and initially quoted while the otheroptions are revealed only once the potentialcustomer and the insurance sales person ldquogetinto detailsrdquo Consistent with this assumptionwe assume that the regular premium and de-ductible are dictated by competitive conditionsand we focus on the choice of the low deduct-ible and its associated premium

From the companyrsquos standpoint each indi-vidual can be represented by a random draw of(i ri) from the conditional (on observables)distribution of risk and risk aversion

(20) ln i

ln ri Nxi

xi

2 r

r r2

When analyzing the optimal menu to offer suchan individual the company is assumed to berisk neutral and to maximize expected profitsSuppose the company offered only the regulardeductible and premium (dh ph) Let the ex-pected profits from this strategy be 0 Givenour assumptions we proceed by analyzing howthe firmrsquos profits are affected by a choice of anadditional low deductible option (dl pl) withdl dh and pl ph It is easy to use a changein variables and analyze the choice of d dh dl and p pl ph Expected profits arenow given by

(21) maxdp

0 Prri rii d pp

d Eiri rii d p

The trade-off in the companyrsquos decision isstraightforward Each new customer who chooses

the low combination pays an additional p up-front but saves d for each accident she is in-volved in This translates into two effects thatenter the companyrsquos decision problem The first issimilar to a standard pricing problem higher(lower) price difference (deductible difference)p (d) leads to a higher markup (on those indi-viduals who select the low deductible) but tolower quantity (or probability of purchase) asfewer individuals elect to choose the low deduct-ible This effect enters the profit function throughD(d p) Pr(ri ri(i d p)) The secondcomposition effect arises because of adverse se-lection As the price of the low deductible in-creases those individuals who still elect to choosethe low combination are ceteris paribus thosewith higher risk This effect enters through (dp) E(iri ri(i d p)) Its magnitude andsign depend on the relative heterogeneity of i andri and on the correlation between them Sinceneither D(d p) nor (d p) has a closed-form solution we analyze this decision problemgraphically where D(d p) and (d p) arenumerically computed using simulations from thejoint distribution

We illustrate our analysis by using the meanindividual in the data whose expected ln(i)and ln(ri) are 154 and 1181 respectively(based on Table 4) Such an individual has tochoose between a regular contract of ( phdh) (3189 1595) (in NIS) and a lowcontract of ( pl dl) (3381 957) ie (pd) (191 638) Below we discuss severaladditional counterfactual cases First we con-sider a case with a negative correlation be-tween risk and risk aversion (with the samemagnitude ie 084) Second we con-sider cases when the company ignores unob-served heterogeneity in one of the dimensionsie it views individuals as a draw from theestimated marginal distribution on one dimen-sion with the other dimension known and fixedat its estimated mean We do this exercise foreach dimension separately

To get intuition for the different effects Figure2 presents the estimated distribution in the spaceof (i ri) A small increase (decrease) in p (d)shifts the indifference setup to the right therebymaking some marginal individuals who were pre-viously just to the right of it switch to choosingthe regular deductible The demand trade-off isjust the comparison between the marginal loss ofthe company from all the marginal individuals

777VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

who no longer buy higher coverage with thehigher profits made from the inframarginal indi-viduals who still elect to choose higher coverageFigure 2 also helps in illustrating the effect ofadverse selection and the importance of the cor-relation coefficient As the menu shifts to the rightthe positive correlation implies that the marginalindividuals have higher risk than the average Thismeans that ldquolosingrdquo them (namely havingthem buy less coverage) is not as costly forthe insurance company as such individualsare on average more adversely selected Anegative correlation for example would havemade these marginal individuals more valuablethereby decreasing the incentive to increase pricesor deductibles from the current levels

Figure 4 presents the effects of pricing onprofits by varying the low-deductible levelkeeping the premium charged for it fixed at theobserved price (of p 191) It implies that the

current low deductible benefit of 638 NIS re-sults in additional annual profits of about 368NIS per customer This is about 037 percent oftotal operating profits per customer which isabout 1000 NIS Note however that after sub-tracting the administrative and claim-handlingcosts associated with each customer and claimthe relative magnitude of this effect will behigher Note also that the estimates imply thatthe current low-deductible level is suboptimalBy setting a smaller low-deductible benefit ofd 355 NIS (ie increasing the current lowdeductible by 283 NIS) additional profits canbe increased to 659 NIS51 Of course the find-

51 There is no reason of course to limit the choice of thecompany to only one additional deductible level Moredegrees of freedom in choosing the pricing menu will leadto higher profits

FIGURE 4 COUNTERFACTUALSmdashPROFITS

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) We plot the additional profits fromoffering a low deductible as a function of the attractiveness of a low deductible d dh dl fixing its price at the observedlevel The thick solid line presents the counterfactual profits as implied by the estimated benchmark model The other threecurves illustrate how the profits change in response to changes in the assumptions when the correlation between riskand risk aversion is negative (thin solid line) when there is no heterogeneity in risk aversion (dot-dashed line) and whenthere is no heterogeneity in risk (dashed line) The maxima (argmax) of the four curves respectively are 659 (355)714 (583) 0 and 704 (500) The dotted vertical line represents the observed level of d (638) which implies that theadditional profits from the observed low deductible are 368 NIS per policy

778 THE AMERICAN ECONOMIC REVIEW JUNE 2007

ing that the current pricing is suboptimal mayalso be due to a limitation of the model

Consistent with the intuition discussed aboveFigure 4 also shows that the incentive to in-crease prices (or lower the deductible) is higherwith positive correlation When the correlationbetween risk and risk aversion is negative theoptimal low deductible is lower It also showsthat ignoring either dimension of unobservedheterogeneity has an important effect on pric-ing decisions However while ignoring het-erogeneity in risk does not change thequalitative pattern by much ignoring hetero-geneity in risk aversion completely changesthe pricing analysis In fact given the esti-mated coefficients and the observed prices ifindividuals vary only in risk offering a lowdeductible does not increase profits due to ad-verse selection

Figure 5 breaks down the effect on profits bypresenting the pricing effect on the demand forlow deductible D(d p) and on the compo-sition effect (d p) The former is simplygenerated by the distribution of certainty equiv-alents implied by the joint distribution of i andri (Landsberger and Meilijson 1999) It is S-shaped due to the assumption of lognormal dis-tribution The shape of the composition effect isdriven by the relative variance of i and ri andby the correlation coefficient As the estimatesimply that most of the variation in certaintyequivalents is driven by variation in ri thestrong positive correlation implies that the com-position effect is monotonically decreasing inthe deductible level As the low-deductible op-tion becomes more favorable more peoplechoose it with the most risky individuals choos-ing it first The effect of the deductible level onthe composition effect is dramatically differentwhen the correlation between risk and risk aver-sion is negative When this is the case theobserved relationship between the deductiblelevel and the composition effect is mostly re-versed This is because the effect of risk aver-sion dominates that of adverse selection due toits higher variance

IV Concluding Remarks

The paper makes two separate contributionsFirst from a methodological standpoint we layout a conceptual framework through which one

can formulate and estimate a demand system forindividually customized contracts The key datarequirements for this approach are contractchoices individual choice sets and ex post riskrealizations Since such data may be available inmany other contexts the methodological frame-work may be useful to uncover structural pa-rameters in such settings As an example onecould consider annuity data and use guaranteeperiod choices and mortality data to identifyheterogeneity in risk (mortality) and in prefer-ences for wealth after death (Einav Finkelsteinand Paul Schrimpf 2006) Similarly one couldconsider loan data and use down-paymentchoices and default data to identify heterogene-ity in risk (default) and in liquidity (Will Ad-ams Einav and Jonathan D Levin 2006)Second from an economic standpoint we pro-vide a new set of estimates for the degree andheterogeneity of (absolute) risk aversion and itsrelationship with risk We discuss these below

While our estimates of risk aversion help topredict other related insurance decisions it isnatural to ask to what extent these parametersare relevant in other contexts This is essentiallyan empirical question which can be answeredonly by estimating risk aversion parameters fora variety of bet sizes and in a variety of con-texts Since isolating risk preferences in manycontexts is hard such exercises are rare leavingus with no definite answer for the scope ofmarkets for which our estimates may be rele-vant On one hand Rabin (2000) and Rabin andThaler (2001) argue that different decisions inlife are taken in different contexts and thereforemay be subject to different parameters in theutility function On the other hand classicaltheory suggests that each individual has onevalue function over her lifetime wealth so allrisky decisions take into account the same valuefunction and are therefore subject to the samerisk preferences Our view is somewhere inbetween We are more comfortable with extrap-olation of our risk aversion estimates to setupsthat are closer to the auto insurance marketcontext in which these parameters are esti-mated To assess ldquoclosenessrdquo it is important toconsider various factors over which contextsmay differ Such factors may include bet size aswell as informational and behavioral effectssuch as default options framing and rarity of theevents Since we use claim data to identify risktype we leave everything else to be interpreted as

779VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

FIGURE 5 COUNTERFACTUALSmdashSELECTION

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) Parallel to Figure 4 we breakdown the effects on profits to the share of consumers who choose a low deductible (bottom panel) and to the expected riskof this group (top panel) This is presented by the thick solid line for the estimates of the benchmark model As in Figure 4 wealso present these effects for three additional cases when the correlation between risk and risk aversion is negative (thin solidline) when there is no heterogeneity in risk aversion (dot-dashed line) and when there is no heterogeneity in risk (dashedline) The dotted vertical lines represent the observed level of d (638) for which the share of low deductible is 16 percentand their expected risk is 0264 This may be compared with the corresponding figures in Table 2A of 178 and 0309respectively Note however that the figure presents estimated quantities for the average individual in the data while Table2A presents the average quantities in the data so one should not expect the numbers to fit perfectly

780 THE AMERICAN ECONOMIC REVIEW JUNE 2007

risk aversion As an example overconfidence willbe captured by a lower level of estimated riskaversion so if overconfidence is more importantin auto insurance than in health insurance whenextrapolated to health insurance individuals maybehave as if they are more risk averse than weestimate them to be

With this caveat in mind let us discuss ourfour main findings First we find large hetero-geneity in risk preferences across individualsThis heterogeneity is important in various con-texts (a) it cautions against using pure cross-sectional variation to test expected utilitytheory (b) it may create strong selection ofparticipants into particular markets this maymake participants in voluntary markets quitedifferent in their risk preferences from those inmarkets in which participation is either manda-tory or driven by other factors and (c) it maymake the marginal individual quite differentfrom the average one Models in macroeconom-ics and finance which often use a representativeindividual framework may not be able to cap-ture and account for such differences

Our second set of findings concerns the wayrisk aversion relates to observable characteris-tics Our finding that women are more riskaverse than men has been documented in othersettings The finding that risk preferences ex-hibit a U-shaped pattern over the life cycle maybe interesting to explore further in other con-texts Other findings suggest that the estimatedcoefficient of risk aversion increases with ob-servables that are related to income and wealthAs we agree with the widely held belief of thedecreasing absolute risk aversion property ourpreferred interpretation for this finding is thatwealth and income may be endogenous gener-ating the estimated cross-sectional relationshipWhile lower risk aversion may be associatedwith higher propensity to become an entrepre-neur and thereby have higher wealth it mayalso be associated with lower propensity to saveor invest in education affecting wealth the otherway Therefore one important message of thisfinding is that accounting for heterogeneity inpreferences may be important as representativeconsumer models may provide misleading in-terpretations for otherwise natural results

The last two sets of findings concern therelationship between risk and preferences Sincerisk is particular to the context in which it ismeasured these findings may be sensitive to the

market context and may change once risk takesother forms Even within the auto insurancemarket it is important how risk is measuredMoreover risk in the auto insurance marketmay be conceptually different from risk in othermarkets In many markets risk is independentacross individuals In auto insurance howevermuch of the risk depends on coordinationamong drivers and therefore may be more re-lated to relative not absolute characteristicsFor this reason our finding of positive correla-tion between risk and risk aversion can coexistwith findings of negative correlation in othercontexts (Israel 2005 Finkelstein and McGarry2006) The positive correlation we find is alsoconsistent with the fact that the bivariate probittest in our data provides evidence for adverseselection (Cohen 2005) while similar reduced-form tests in other contexts do not

Finally we find that unobserved heterogene-ity in risk preferences is more important thanheterogeneity in risk This may be driven by thecasual evidence that insurance companies exertmuch effort and resources in collecting con-sumer data which are informative about riskclassification but not about preferences52 Weillustrate the empirical importance of our find-ings for the analysis of optimal contracts in autoinsurance The presence of more than one di-mension of unobserved heterogeneity may dra-matically change the nature of these contractsTheory is still not fully developed for suchmultidimensional screening problems as it typ-ically requires a small number of types (Lands-berger and Meilijson 1999) restricts the twodimensions to be independent of each other(Jean-Charles Rochet and Lars A Stole 2002)or assumes that the number of instrumentsavailable to the monopolist is not smaller thanthe dimension of unobserved heterogeneity(Steven Matthews and John Moore 1987 Rich-ard Arnott and Joseph E Stiglitz 1988)53 MarkArmstrong (1999) may be the closest theoreticalwork to the framework suggested here It cannot

52 This choice of data collection efforts may be justifiedif it is easier for such firms to price discriminate based onrisk but harder to price discriminate based on preferencesA common belief is that without cost-based (ie risk-based) justification for prices price discrimination may leadto consumer backlash

53 See also Smart (2000) Villeneuve (2003) and Jullienet al (2007) for related theoretical results

781VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

be directly applied however as it uses simpli-fying linearity assumptions which would behard to impose in the current context Our re-sults indicate that many applications can benefitfrom extending the theory to include the more

general case such as the one analyzed hereSuch a theory may also serve as a guide forusing supply-side restrictions in similar con-texts Our counterfactual analysis is a very pre-liminary start in this direction

APPENDIX A DESCRIPTION OF THE GIBBS SAMPLER

In this appendix we describe the setup of the Gibbs sampler that we use to estimate the modelOne of the main advantages of the Gibbs sampler is its ability to allow for data augmentation oflatent variables (Tanner and Wong 1987) In our context this amounts to augmenting the individual-specific risk aversion and risk type namely i rii1

n as additional parametersWe can write the model as follows

(22) ln i xi i

(23) ln ri xi vi

(24) choicei 1 if ri ri ()0 if ri ri ()

(25) claimsi Poissoni ti

(26) i

vi

iid

N00

2 r

r r2

Let

2 r

r r2 X

x1

xn

0

0 x1

xn

y r and ui i

vi

The set of parameters for which we want to have a posterior distribution is given by uii1

n The prior specifies that are independent of uii1n have a conventional

diffuse prior We adopt a hierarchical prior for uii1N

(27) uii 1n

iid

N0

(28) 1 Wishart2a Q

782 THE AMERICAN ECONOMIC REVIEW JUNE 2007

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 7: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

regular was used for this deductible level bothbecause it was relatively similar to the deductiblelevels offered by other insurers and because mostpolicyholders chose it The regular premium var-ied across individuals according to some deter-ministic function (unknown to us) pit ft(xi)which was quite stable over time The regulardeductible level was directly linked to the regularpremium according to

(1) dit min12

pit capt

That is it was set to one-half of the regular pre-mium subject to a deductible cap capt whichvaried over time but not across individuals Thepremiums associated with the other options werecomputed by multiplying pit by three differentconstants 106 for ldquolowrdquo deductible 0875 forldquohighrdquo deductible and 08 for ldquovery highrdquo de-ductible The regular deductible dit was con-verted to the other three deductible levels in asimilar way using multipliers of 06 for low 18for high and 26 for very high

There are two main sources of exogenous vari-ation in prices The first arises from companyexperimentation The multipliers described abovewere fixed across individuals and over time formost of the observation period but there was asix-month period during the insurerrsquos first year ofoperation (May 1995 to October 1995) in whichthe insurer experimented with slightly modifiedmultipliers16 This modified formula covers al-most 10 percent of the sample The secondsource of variation arises from discrete ad-justments to the uniform cap The cap variedover time due to inflation competitive condi-tions and as the company gained more expe-rience (Figure 1) The cap was binding for

16 For individuals with low levels of regular premiumsduring the specified period the regular deductible was set at53 percent (instead of 50 percent) of the regular premiumthe low deductible was set at 33 percent (instead of 30percent) of the regular premium and so on

FIGURE 1 VARIATION IN THE DEDUCTIBLE CAP OVER TIME

Notes This figure presents the variation in the deductible cap over time which is one of the main sources of pricing variation inthe data We do not observe the cap directly but it can be calculated from the observed menus The graph plots the maximalregular deductible offered to anyone who bought insurance from the company over a moving seven-day window Thelarge jumps in the graph reflect changes in the deductible cap There are three reasons why the graph is not perfectlysmooth First in a few holiday periods (eg October 1995) there are not enough sales within a seven-day window sonone of those sales hits the cap This gives rise to temporary jumps downward Second the pricing rule applies at thedate of the price quote given to the potential customer Our recorded date is the first date the policy becomes effectiveThe price quote is held for a period of two to four weeks so in periods in which the pricing changes we may still seenew policies sold using earlier quotes made according to a previous pricing regime Finally even within periods of constantcap the maximal deductible varies slightly (variation of less than 05 percent) We do not know the source of this variation

751VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

about a third of the policyholders in our dataAll these individuals would be affected by achange in the cap Much of the variation ofmenus in the data is driven by the exogenousshifts in the uniform deductible cap The un-derlying assumption is that conditional onobservables these sources of variation pri-marily affect the deductible choice of newcustomers but they do not have a significantimpact on the probability of purchasing insurancefrom the company Indeed this assumption holdsin the data with respect to observables there is nodistinguishable difference in the distribution ofobservable characteristics of consumers who buyinsurance just before and those who buy just aftera change in the deductible cap

C Summary Statistics

The top of Table 2A summarizes the deduct-ible menus all are calculated according to theformula described earlier Only 1 percent of thepolicyholders chose the high or very high de-ductible options Therefore for the rest of theanalysis we focus only on the choice betweenregular and low deductible options (chosen by811 and 178 percent of the individuals respec-tively)17 Focusing only on these options doesnot create any selection bias because we do notomit individuals who chose high or very highdeductibles For these individuals we assumethat they chose a regular deductible This as-sumption is consistent with the structural modelwe develop in the next section which impliesthat conditional on choosing high or very highdeductibles an individual would almost alwaysprefer the regular over the low deductible

The bottom of Table 2A as well as Table2B present summary statistics for the policy re-alizations We focus only on claim rates and noton the amounts of the claims This is because anyamount above the higher deductible level is cov-

ered irrespective of the deductible choice and thevast majority of the claims fit in this category (seeSection IIIE) For all these claims the gain fromchoosing a low deductible is the same in the eventof a claim and is equal to the difference betweenthe two deductible levels Therefore the claimamount is rarely relevant for the deductible choice(and likewise for the companyrsquos pricing decisionwe analyze in Section IIIF)

Averaging over all individuals the annualclaim rate was 0245 One can clearly observesome initial evidence of adverse selection Onaverage individuals who chose a low deductiblehad higher claim rates (0309) than those whochose the regular deductible (0232) Those whochose high and very high deductibles had muchlower claim rates (0128 and 0133 respectively)These figures can be interpreted in the context ofthe pricing formula A risk-neutral individual willchoose the low deductible if and only if her claimrate is higher than (pd) (plow pregular)(dregular dlow) When the deductible cap is notbinding which is the case for about two-thirds ofthe sample this ratio is given directly by thepricing formula and is equal to 03 Thus anyindividual with a claim rate higher than 03 willbenefit from buying the additional coverage pro-vided by a low deductible even without any riskaversion The claim data suggest that the offeredmenu is cheaper than an actuarially fair contractfor a nonnegligible part of the population (13percent according to the benchmark estimates re-ported below) This observation is in sharp con-trast to other types of insurance contracts such asappliance warranties which are much more ex-pensive than the actuarially fair price (Rabin andThaler 2001)

II The Empirical Model

A A Model of Deductible Choice

Let wi be individual irsquos wealth (pih di

h) theinsurance contract (premium and deductible re-spectively) with high deductible (pi

l dil) the

insurance contract with low deductible ti theduration of the policy and ui(w) individual irsquosvNM utility function We assume that the num-ber of insurance claims is drawn from a Poissondistribution with an annual claim rate iThrough most of the paper we assume that i isknown to the individual We also assume that iis independent of the deductible choice ie that

17 The small frequency of ldquohighrdquo and ldquovery highrdquochoices provides important information about the lowerends of the risk and risk aversion distributions but (for thatsame reason) makes the analysis sensitive to functionalform Considering these options or the option of not buyinginsurance creates a sharp lower bound on risk aversion forthe majority of the observations making the estimates muchhigher Given that these options are rarely selected how-ever it is not clear to us whether they were regularlymentioned during the insurance sales process renderingtheir use somewhat inappropriate

752 THE AMERICAN ECONOMIC REVIEW JUNE 2007

there is no moral hazard Finally we assumethat in the event of an accident the value of theclaim is greater than di

h We revisit all theseassumptions in Sections IIID and IIIE For therest of this section i subscripts are suppressedfor convenience

In the market we study insurance policies are

typically held for a full year after which theycan be automatically renewed with no commit-ment by either the company or the individualMoreover all auto-insurance policies sold inIsrael can be canceled without prior notice bythe policyholder with premium payments beinglinearly prorated Both the premium and the

TABLE 2AmdashSUMMARY STATISTICSmdashMENUS CHOICES AND OUTCOMES

Variable Obs Mean Std dev Min Max

Menu Deductible (current NIS)a Low 105800 87548 12101 37492 103911Regular 105800 145299 19779 62486 171543High 105800 260802 35291 112475 308778Very high 105800 376305 50853 162464 446013

Premium (current NIS)a Low 105800 338057 91404 132471 1923962Regular 105800 318922 8623 124972 1815058High 105800 279057 75451 109351 1588176Very high 105800 255137 68984 99978 1452046

pd 105800 0328 006 03 18

Realization Choice Low 105800 0178 038 0 1Regular 105800 0811 039 0 1High 105800 0006 008 0 1Very high 105800 0005 007 0 1

Policy termination Active 105800 0150 036 0 1Canceled 105800 0143 035 0 1Expired 105800 0707 046 0 1

Policy duration (years) 105800 0848 028 0005 108Claims All 105800 0208 048 0 5

Low 18799 0280 055 0 5Regular 85840 0194 046 0 5High 654 0109 034 0 3Very high 507 0107 032 0 2

Claims per yearb All 105800 0245 066 0 19882Low 18799 0309 066 0 9264Regular 85840 0232 066 0 19882High 654 0128 062 0 12636Very high 507 0133 050 0 3326

a The average exchange rate throughout the sample period was approximately $1 per 35 NIS starting at 13 in late 1994and reaching 14 in late 1999

b The mean and standard deviation of the claims per year are weighted by the observed policy duration to adjust forvariation in the exposure period These are the maximum likelihood estimates of a simple Poisson model with no covariates

TABLE 2BmdashSUMMARY STATISTICSmdashCONTRACT CHOICES AND REALIZATIONS

Claims Low Regular High Very high Total Share

0 11929 (0193) 49281 (0796) 412 (0007) 299 (0005) 61921 (100) 080341 3124 (0239) 9867 (0755) 47 (0004) 35 (0003) 13073 (100) 016962 565 (0308) 1261 (0688) 4 (0002) 2 (0001) 1832 (100) 002383 71 (0314) 154 (0681) 1 (0004) 0 (0000) 226 (100) 000294 6 (0353) 11 (0647) 0 (0000) 0 (0000) 17 (100) 000025 1 (0500) 1 (0500) 0 (0000) 0 (0000) 2 (100) 000003

Notes The table presents tabulation of choices and number of claims For comparability the figures are computed only forindividuals whose policies lasted at least 09 years (about 73 percent of the data) The bottom rows of Table 2A providedescriptive figures for the full dataset The numbers in parentheses in each cell represent percentages within each row Theright-hand-side column presents the marginal distribution of the number of claims

753VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

probability of a claim are proportional to thelength of the time interval taken into account soit is convenient to think of the contract choice asa commitment for only a short amount of timeThis approach has several advantages First ithelps to account for early cancellations andtruncated policies which together constitute 30percent of the policies in the data18 Second itmakes the deductible choice independent ofother longer-term uncertainties faced by the in-dividual so we can focus on static risk-takingbehavior Third this formulation helps to obtaina simple framework for analysis which is attrac-tive both analytically and computationally19

The expected utility that the individual obtainsfrom the choice of a contract (p d) is given by

(2) v p d

1 tuw pt tuw pt d

We characterize the set of parameters that willmake the individual indifferent between the twooffered contracts This set provides a lower (up-per) bound on the level of risk aversion for indi-viduals who choose the low (high) deductible (fora given ) Thus we analyze the equation v(phdh) v(pl dl) By taking limits with respect to t(and applying LrsquoHopitalrsquos rule) we obtain

(3)

limt30

1

t(u(w pht) u(w plt))

(u(w pht) u(w pht dh)) (u(w plt) u(w plt dl))

pl phuw

uw dl uw dh

or

(4)

pl phuw uw dl uw dh

The last expression has a simple intuition Theright-hand side is the expected gain (in utils) perunit of time from choosing a low deductibleThe left-hand side is the cost of such a choiceper unit of time For the individual to be indif-ferent the expected gains must equal the costs

In our benchmark specification we assumethat the third derivative of the vNM utilityfunction is not too large A Taylor expansion forboth terms on the right-hand side of equation(4) ie u(w d) u(w) du(w) (d22)u13(w) implies that

(5)pl ph

uw dh dluw

1

2dh dldh dlu13w

Let d dh dl 0 p pl ph 0 andd 1frasl2 (dh dl) to obtain

(6)p

duw uw d u13w

or

(7) r u13w

uw

p

d 1

d

where r is the coefficient of absolute risk aver-sion at wealth level w

18 As can be seen in Table 2A 70 percent of the policiesare observed through their full duration (one year) About15 percent are truncated by the end of our observationperiod and the remaining 15 percent are canceled for var-ious reasons such as change in car ownership total-lossaccident or a unilateral decision of the policyholder tochange insurance providers

19 This specification ignores the option value associatedwith not canceling a policy This is not very restrictiveSince experience rating is small and menus do not changeby much this option value is likely to be close to zero Asimple alternative is to assume that individuals behave as ifthey commit for a full year of coverage In such a case themodel will be similar to the one we estimate but willdepend on the functional form of the vNM utility functionand would generally require taking infinite sums (over thepotential realizations for the number of claims within the year)In the special case of quadratic expected utility maximizerswho care only about the mean and variance of the number ofclaims this is easy to solve The result is almost identical to theexpression we subsequently derive in equation (7)

754 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Equation (7) defines an indifference set in thespace of risk and risk aversion which we willrefer to by (r() ) and ((r) r) interchange-ably Both r() and (r) have a closed-formrepresentation a property that will be computa-tionally attractive for estimation20 Both termsare individual specific as they depend on thedeductible menu which varies across individu-als For the rest of the paper we regard eachindividual as associated with a two-dimensionaltype (ri i) An individual with a risk parameter

i who is offered a menu (pih di

h) (pil di

l)will choose the low-deductible contract if andonly if his coefficient of absolute risk aversionsatisfies ri ri(i) Figure 2 presents a graphi-cal illustration

B The Benchmark Econometric Model

The Econometric ModelmdashThe econometricmodel we estimate is fully described by the fiveequations presented in this section Our ob-jective is to estimate the joint distribution of(i ri)mdashthe claim rate and coefficient of abso-lute risk aversionmdashin the population of policy-holders conditional on observables xi The

20 For example estimating the CARA version of themodel (Section IIID) for which r() does not have aclosed-form representation takes almost ten times longer

FIGURE 2 THE INDIVIDUALrsquoS DECISIONmdashA GRAPHICAL ILLUSTRATION

Notes This graph illustrates the individualrsquos decision problem The solid line presents the indifference setmdashequation (7)mdashappliedfor the menu faced by the average individual in the sample Individuals are represented by points in the two-dimensional spaceabove In particular the scattered points are 10000 draws from the joint distribution of risk and risk aversion for the averageindividual (on observables) in the data based on the point estimates of the benchmark model (Table 4) If an individual is eitherto the right of the line (high risk) or above the line (high risk aversion) the low deductible is optimal Adverse selection is capturedby the fact that the line is downward sloping higher-risk individuals require lower levels of risk aversion to choose the lowdeductible Thus in the absence of correlation between risk and risk aversion higher-risk individuals are more likely to choosehigher levels of insurance An individual with i (pidi) will choose a lower deductible even if he is risk neutral ie withprobability one (we do not allow individuals to be risk loving) This does not create an estimation problem because i is notobserved only a posterior distribution for it Any such distribution will have a positive weight on values of i that are below(pidi) Second the indifference set is a function of the menu and in particular of (pidi) and d An increase in (pidi) willshift the set up and to the right and an increase in d will shift the set down and to the left Therefore exogenous shiftsof the menus that make both arguments change in the same direction can make the sets ldquocrossrdquo thereby allowing us toidentify the correlation between risk and risk aversion nonparametrically With positive correlation (shown in the figureby the ldquoright-bendingrdquo shape of the simulated draws) the marginal individuals are relatively high risk therefore creatinga stronger incentive for the insurer to raise the price of the low deductible

755VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

benchmark formulation assumes that (i ri) fol-lows a bivariate lognormal distribution Thuswe can write the model as

(8) ln i xi i

(9) ln ri xi vi

with

(10) i

vi

iid

N00

2 r

r r2

Neither i nor ri is directly observed Thereforewe treat both as latent variables Loosely speak-ing they can be thought of as random effects Weobserve two variables the number of claims andthe deductible choice which are related to thesetwo unobserved components Thus to completeour econometric model we have to specify therelationship between the observed variables andthe latent ones This is done by making two struc-tural assumptions First we assume that the num-ber of claims is drawn from a Poisson distributionnamely

(11) claimsi Poisson(i ti )

where ti is the observed duration of the policySecond we assume that when individuals maketheir deductible choices they follow the theoreti-cal model described in the previous section Themodel implies that individual i chooses the lowdeductible (choicei 1) if and only if ri ri(i)where ri is defined in equation (7) Thus theempirical model for deductible choice is given by

(12) Prchoicei 1 Prri

pi

idi 1

d i

Prexp(xi vi)

pi

exp(xi i)di 1

d i

With no unobserved heterogeneity in risk (i 0) equation (12) reduces to a simple probit In

such a case one can perfectly predict risk from thedata denote it by (xi) and construct an addi-tional covariate ln([pi((xi)di) 1]d i) Giventhe assumption that risk aversion is distributedlognormally running the probit regressionabove and renormalizing the coefficient onthe constructed covariate to 1 (instead ofthe typical normalization of the variance of theerror term to 1) has a structural interpretation withln (ri) as the dependent variable However Cohen(2005) provides evidence of adverse selection inthe data implying the existence of unobservedheterogeneity in risk This makes the simple probitregression misspecified Estimation of the fullmodel is more complicated Once we allow forunobserved heterogeneity in both unobserved riskaversion (vi) and claim rate (i) we have to inte-grate over the two-dimensional region depicted inFigure 2 for estimation

EstimationmdashA natural way to proceed is toestimate the model by maximum likelihoodwhere the likelihood of the data as a function ofthe parameters can be written by integrating outthe latent variables namely

(13) Lclaimsi choicei

Prclaimsi choiceii riPri ri

where is a vector of parameters to be esti-mated While formulating the empirical modelusing likelihood may help our thinking aboutthe data-generating process using maximumlikelihood (or generalized method of moments(GMM)) for estimation is computationally cum-bersome This is because in each iteration itrequires evaluating a separate complex integralfor each individual in the data In contrastMarkov Chain Monte Carlo (MCMC) Gibbssampling is quite attractive in such a case Us-ing data augmentation of latent variables (Mar-tin A Tanner and Wing Hung Wong 1987)according to which we simulate (i ri) and latertreat those simulations as if they are part ofthe data one can avoid evaluating the complexintegrals by sampling from truncated normaldistributions which is significantly less compu-tationally demanding (eg Luc Devroye 1986)This feature combined with the idea of aldquosliced samplerrdquo (Paul Damien John Wake-field and Stephen Walker 1999) to sample from

756 THE AMERICAN ECONOMIC REVIEW JUNE 2007

an unfamiliar posterior distribution makes theuse of a Gibbs sampler quite efficient for ourpurposes Finally the lognormality assumptionimplies that F(ln(i)ri) and F(ln(ri)i) follow a(conditional) normal distribution allowing us torestrict attention to univariate draws furtherreducing the computational burden

The Appendix provides a full description ofthe Gibbs sampler including the conditionaldistributions and the (flat) prior distributions weuse The basic intuition is that conditional onobserving (i ri) for each individual we have asimple linear regression model with two equa-tions The less standard part is to generate drawsfor (i ri) We do this iteratively Conditionalon i the posterior distribution for ln(ri) fol-lows a truncated normal distribution where thetruncation point depends on the menu individ-ual i faces and its direction (from above orbelow) depends on individual irsquos deductiblechoice The final step is to sample from theposterior distribution of ln(i) conditional on riThis is more complicated as we have bothtruncation which arises from adverse selection(just as we do when sampling for ri) and thenumber of claims which provides additionalinformation about the posterior of i Thus theposterior for i takes an unfamiliar form forwhich we use a ldquosliced samplerrdquo

We use 100000 iterations of the Gibbs sam-pler It seems to converge to the stationarydistribution after about 5000 iterations There-fore we drop the first 10000 draws and use thelast 90000 draws of each variable to report ourresults The results are based on the posteriormean and posterior standard deviation fromthese 90000 draws Note that each iterationinvolves generating separate draws of (i ri) foreach individual Performing 100000 iterationsof the benchmark specification (coded in Mat-lab) takes about 60 hours on a Dell Precision530 workstation

C Identification

The parametric version of the model is iden-tified mechanically There are more equationsthan unknowns and no linear dependenciesamong them so (as also verified using MonteCarlo exercises) the model parameters can bebacked out from simulated data Our goal in thissection is not to provide a formal identificationproof Rather we want to provide intuition for

which features of the data allow us to identifyparticular parameters of the model The discus-sion also highlights the assumptions that areessential for identification vis-a-vis those thatare made only for computational convenience(making them in principle testable)

Discussion of Nonparametric Identifica-tionmdashThe main difficulty in identifying themodel arises from the gap between the (exante) risk type i which individuals usewhen choosing a deductible and the (ex post)realization of the number of claims we ob-serve We identify between the variation inrisk types and the variation in risk realizationsusing our parametric distributional assump-tions The key is that the distribution of risktypes can be uniquely backed out from theclaim data alone This allows us to use thedeductible choice as an additional layer ofinformation which identifies unobserved het-erogeneity in risk aversion21 Any distribu-tional assumption that allows us to uniquelyback out the distribution of risk types fromclaim data would be sufficient to identify thedistribution of risk aversion As is customaryin the analysis of count processes we make aparametric assumption that claims are gener-ated by a lognormal mixture of Poisson dis-tributions (Section IIID discusses this furtherand explores an alternative) Using a mixtureenables us to account for adverse selectionthrough unobserved heterogeneity in risk Italso allows us to better fit the tails of theclaim distribution In principle a more flexi-ble mixture or a more flexible claim-generat-ing process could be identified as long as theclaims data are sufficiently rich22

21 Cardon and Hendel (2001) face a similar identificationproblem in the context of health insurance They use vari-ation in coverage choice (analogous to deductible choice) toidentify the variation in health-status signals (analogous torisk types) from the variation in health expenditure (analo-gous to number of claims) They can rely on the coveragechoice to identify this because they make an assumptionregarding unobserved heterogeneity in preferences (iidlogit) We take a different approach as our main goal is toestimate rather than assume the distribution of preferences

22 Although it may seem that the claim data are limited(as they take only integer values between 0 and 5 in ourdata) variation in policy duration generates continuousvariation in the observed claim propensity Of course thisvariation also introduces an additional selection into themodel due to policy cancellations which are potentially

757VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Once the distribution of risk types is iden-tified from claims data the marginal distribu-tion of risk aversion (and its relationship tothe distribution of risk types) is nonparametri-cally identified from the variation in theoffered menus discussed in Section I Thisvariation implies different deductible and pre-mium options to identical (on observables)individuals who purchased insurance at dif-ferent times Different menus lead to differentindifference sets (similar to the one depictedin Figure 2) These sets often cross each otherand nonparametrically identify the distribu-tion of risk aversion and the correlation struc-ture at least within the region in which theindifference sets vary For the tails of thedistribution as is typically the case we haveto rely on parametric assumptions or usebounds The parametric assumption of log-normality we use for most of the paper ismade only for computational convenience

Intuition for the Parametric IdentificationMechanismmdashVariation in the offered menus isimportant for the nonparametric identificationThe parametric assumptions could identify themodel without such variation Thus to keep theintuition simple let us take the bivariate log-normal distribution as given and contrary to thedata assume that all individuals are identical onobservables and face the same menu Supposealso that all individuals are observed for exactlyone year and have up to two claims23 In thissimplified case the model has five parametersto be estimated the mean and variance of risk and

2 the mean and variance of risk aver-sion r and r

2 and the correlation parameter The data can be summarized by five numbersLet 130 131 and 132 1 131 130 be thefraction of individuals with zero one and twoclaims respectively Let 0 1 and 2 be theproportion of individuals who chose a low de-ductible within each ldquoclaim grouprdquo Given ourdistributional assumption about the claim-

generating process we can use 130 and 131 touniquely identify and

2 Loosely isidentified by the average claim rate in the dataand

2 is identified by how fat the tail of theclaim distribution is ie by how large (132131)is compared to (131130) Given and

2 and thelognormality assumption we can (implicitly)construct a posterior distribution of risk typesfor each claim group F(r claims c) andintegrate over it when predicting the deductiblechoice This provides us with three additionalmoments each of the form

(14) Ec Prchoice 1r

dFr claims c dFr

for c 0 1 2 These moments identify thethree remaining parameters of the model rr

2 and Let us now provide more economic content to

the identification argument Using the same ex-ample and conditional on identifying and

2

from the claim data one can think about thedeductible choice data 0 1 2 as a graph(c) The absolute level of the graph identifiesr In the absence of correlation between riskand risk aversion the slope of the graph iden-tifies r

2 with no correlation the slope shouldalways be positive (due to adverse selection)but higher r

2 would imply a flatter graph be-cause more variation in the deductible choiceswill be attributed to variation in risk aversionFinally is identified by the curvature of thegraph The more convex (concave) the graph isthe more positive (negative) is the estimated For example if 0 05 1 051 and 2 099 it is likely that r

2 is high (explaining why0 and 1 are so close) and is highly positive(explaining why 2 is not also close to 1) Incontrast if 0 1 it must mean that thecorrelation between risk and risk aversion isnegative which is the only way the originalpositive correlation induced by adverse selec-tion can be offset This intuition also clarifiesthat the identification of relies on observingindividuals with multiple claims (or differentpolicy durations) and that it is likely to besensitive to the distributional assumptions Thedata (Table 2B) provide a direct (positive) cor-relation between deductible choice and claims

endogenous The results are similar when we use onlyexpired and truncated policies

23 This variation is sufficient (and necessary) to identifythe benchmark model The data provide more variation weobserve up to five claims per individual we observe con-tinuous variation in the policy durations we observe vari-ation in prices and we exploit distributional restrictionsacross individuals with different observable characteristics

758 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The structural assumptions allow us to explainhow much of this correlation can be attributedto adverse selection The remaining correlation(positive or negative) is therefore attributed tocorrelation in the underlying distribution of riskand risk aversion

III Results

A Reduced-Form Estimation

To get an initial sense for the levels of abso-lute risk aversion implied by the data we use asimple back-of-the-envelope exercise We com-pute unconditional averages of p d dh dland d (Table 2A)24 and substitute these valuesin equation (7) The implied coefficient of ab-solute risk aversion is 29 104 NIS125 Thisfigure could be thought of as the average indif-ference point implying that about 18 percent ofthe policyholders have coefficients of absoluterisk aversion exceeding it To convert to USdollar amounts one needs to multiply thesefigures by the average exchange rate (352)resulting in an average indifference point of102 103 $US1 This figure is less than halfof a similar estimate reported by Sydnor (2006)for buyers of homeownerrsquos insurance but about3 and 13 times higher than comparable fig-ures reported by Gertner (1993) and Metrick(1995) respectively for television game showparticipants

Table 3 provides reduced-form analysis ofthe relationship between the observables andour two left-hand-side variables the number ofclaims and the deductible choice26 Column 1reports the estimates from a Poisson regressionof the number of claims on observed character-istics This regression is closely related to therisk equation we estimate in the benchmarkmodel It shows that older people women andpeople with a college education are less likely to

have an accident Bigger more expensiveolder and noncommercial cars are more likelyto be involved in an accident Driving experi-ence and variables associated with less intenseuse of the car reduce accident rates As could beimagined claim propensity is highly correlatedover time past claims are a strong predictor offuture claims Young drivers are 50 to 70 per-cent more likely to be involved in an accidentwith young men significantly more likely thanyoung women Finally as indicated by the trendin the estimated year dummies the accident ratesignificantly declined over time Part of thisdecline is likely due to the decline in accidentrates in Israel in general27 This decline mightalso be partly due to the better selection ofindividuals the company obtained over time asit gained more experience (Cohen 2003)

Columns 2 and 3 of Table 3 present estimatesfrom probit regressions in which the dependentvariable is equal to one if the policyholder chosea low deductible and is equal to zero otherwiseColumn 3 shows the marginal effects of thecovariates on the propensity to choose a lowdeductible These marginal effects do not have astructural interpretation as the choice of lowdeductible depends on its price on risk aver-sion and on risk In this regression we againobserve a strong trend over time Fewer policy-holders chose the low deductible as time wentby One reason for this trend according to thecompany executives is that over time the com-panyrsquos sales persons were directed to focusmainly on the ldquodefaultrdquo regular deductible op-tion28 The effect of other covariates will bediscussed later in the context of the full modelIn unreported probit regressions we also test thequalitative assumptions of the structural model byadding three additional regressors the price ratio(pidi) the average deductible offered d i

24 The unconditional is computed by maximum likeli-hood using the data on claims and observed durations of thepolicies

25 Using the CARA specification as in equation (16) weobtain a slightly lower value of 25 104 NIS1

26 We find positive correlation in the errors of these tworegressions suggesting the existence of adverse selection inthe data and motivating a model with unobserved heteroge-neity in risk This test is similar to the bivariate probit testproposed by Chiappori and Salanie (2000) and replicatesearlier results reported in Cohen (2005)

27 In particular traffic fatalities and counts of trafficaccidents in Israel fell by 11 percent and 18 percent during1998 and 1999 respectively

28 Such biased marketing efforts will bias consumersagainst choosing the low deductible thus making them lookless risk averse This would make our estimate a lowerbound on the true level of risk aversion If only sophisti-cated consumers could see beyond the marketing effort andthis sophistication were related to observables (eg educa-tion) the coefficients on such observables would be biasedupward This is not a major concern given that the coefficientson most of the covariates are fairly stable when we estimatethe benchmark model separately for different years

759VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

TABLE 3mdashNO HETEROGENEITY IN RISK

Variable

Poisson regressiona Probit regressionsc

Dep var Number of claims Dep var 1 if low deductible

(1) (2) (3)

Coeff Std Err IRRb Coeff Std Err dPdX dPdXDemographics Constant 43535 02744 mdash 224994 23562 mdash

Age 00074 00052 09926 01690 00464 00034 00042Age2 989 105 536 105 10001 00017 00005 339 105 455 105Female 00456 00161 09555 08003 01316 00165 00134Family Single omitted omitted omitted

Married 01115 00217 08945 06417 01950 00128 00045Divorced 00717 00346 10743 03191 03127 00064 00044Widower 00155 00527 10156 03441 04573 00069 00025Other (NA) 01347 01755 11442 29518 19836 00511 00370

Education Elementary 00522 00550 09491 02927 04278 00061 00016High school omitted omitted omittedTechnical 00373 00308 10380 05037 02520 00105 00138College 00745 00197 09282 05279 01546 00109 00053Other (NA) 00116 00184 10116 00630 01540 00013 00025

Emigrant 00210 00163 10213 00497 01328 00010 00003

Car attributes Log(Value) 01227 00281 11306 13285 02342 00271 00299Car age 00187 00042 10189 01603 00357 00033 00017Commercial car 01394 00326 08699 09038 02781 00177 00294Log(engine size) 02972 00459 13461 06924 03952 00141 00075

Driving License years 00204 00034 09798 01043 00312 00021 00005License years2 00002 677 105 10002 00015 00005 299 105 167 105

Good driver 00176 00191 09825 07207 01574 00148 00152Any driver 00564 00169 10580 10923 01321 00217 00258Secondary car 00859 00209 09177 00038 01626 00001 00070Business use 01852 00293 12034 07951 02737 00156 00017History length 00527 00110 09486 11218 01450 00228 00171Claims history 06577 00390 19304 04654 05460 00095 00496

Young driver Young driver 05235 00361 16879 28847 06305 00524 00012Gender Male omitted omitted omitted

Female 01475 00288 08629 17959 03135 00396 00195Age 17ndash19 omitted omitted omitted

19ndash21 00701 00532 10726 07800 07509 00153 0014721ndash24 00267 00574 09737 05746 07773 00114 0015624 02082 00567 0812 17869 07328 00397 00136

Experience 1 omitted omitted omitted1ndash3 02416 00458 07854 10175 06577 00217 000093 02827 00532 07538 32513 07386 00762 00410

Company year First year omitted omitted omittedSecond year 00888 00198 09150 44513 01506 00792 00859Third year 00690 00222 09334 85888 01820 01303 01367Fourth year 01817 00232 08339 118277 02102 01616 01734Fifth year 05431 0028 05810 143206 02778 01920 02085

105586Obs 105800 94000 105800Pseudo R2 00162 01364 01296Log likelihood 577459 369595 430869

Significant at the 5 percent confidence levela Maximum likelihood estimates Variation in exposure (policy duration) is accounted forb IRR Incidence rate ratio Each figure should be interpreted as the increasedecrease in claim probability as a result of an increase of one unit in the

right-hand-side variablec There are two separate probit regressions reported in this table Column 2 relies on the deductible choice model and the lognormality assumption As discussed

in the text by including an additional regressor ln([pi((xi)di) 1]d i) (with (xi) predicted from column 1 above) and normalizing its coefficient to 1 we obtaina structural interpretation of this regression with ln (ri) as the dependent variable Thus the reported coefficients are comparable to those estimated for the benchmarkmodel One should be cautious however in interpreting these coefficients Unlike the benchmark model this regression does not allow unobserved heterogeneity inrisk and suffers from some selection bias because observations with high predicted risk rate are omitted (which is why the number of observations is 94000 ratherthan the full sample of 105800) For comparison column 3 reports the marginal effects from a comparable probit regression that uses the full sample and does notcontrol for pricing and predicted risk through the additional structural regressor Column 3 does not have a structural interpretation and its (unreported) coefficientscannot be compared to those estimated from the benchmark model

760 THE AMERICAN ECONOMIC REVIEW JUNE 2007

and the risk rate (xi) as predicted from thePoisson regression of column 1 All three addi-tional regressors enter with the predicted signand with large and highly significant marginaleffects29

Finally column 2 of Table 3 presents animportant specification of the probit regres-sion in which ln([pi(( xi)di) 1]d i) isadded as a regressor and its coefficient isnormalized to 130 As already mentioned ifwe assume that there is no unobserved heter-ogeneity in risk then column 2 is analogousto the ln(ri) equation of the benchmark modelThis restriction of the model is rejected by thedata About 10 percent of the individuals arepredicted to have (xi) (pi di) implying achoice of low deductible for any level of riskaversion Many of these individuals howeverstill choose higher deductible levels Column2 reports the results for the remaining indi-viduals (94000 out of 105800) for which theregression can be run While the signs of theestimated coefficients are similar to those inthe benchmark model presented below therestricted version of the model suggests muchhigher levels and dispersion of risk aversionwell above any reasonable level31 The fullestimation of the benchmark model clearly re-jects this restriction on the model

B Estimation of the Benchmark Model

The Effect of Individual Characteristics onRisk AversionmdashTable 4 presents the estimationresults from the benchmark model The secondcolumn shows how the level of absolute riskaversion is related to individual characteristicsAs the dependent variable is in natural loga-

rithm coefficients on dummy variables can bedirectly interpreted as approximate percentagechanges The vast majority of these coefficientsare quite stable across a wide range of specifi-cations which are mentioned later

The results indicate that women are more riskaverse than men and have a coefficient of ab-solute risk aversion about 20 percent greaterthan that of men These results are consistentwith those of Donkers et al (2001) and Hartoget al (2002) The estimated effect of age sug-gests a nonmonotone pattern of risk preferencesover the life cycle The estimated coefficientsimply that initially (that is at age 18 the youngestindividual in the data) individuals become less riskaverse with age but around the age of 48 indi-viduals start becoming more risk averse32 Mar-ried individuals are estimated to be significantlymore risk averse compared to singles while di-vorced individuals are less (although the coeffi-cient is statistically insignificant)

The analysis suggests that variables that arelikely to be correlated with income or wealthsuch as post-high-school education and thevalue of the car have a positive coefficientindicating that wealthier people have higherlevels of absolute risk aversion Although we donot have data on individualsrsquo income or wealthwe use other proxies for income in additionalspecifications and obtain mixed results Whenwe include as a covariate the average householdincome among those who live in the same zipcode we obtain a significant and negative co-efficient of 0333 (0154) (standard deviationin parentheses) When we match zip code in-come on demographics of the individuals thecoefficient is effectively zero 0036 (0047)and when we use a socioeconomic index ofthe locality in which the individual lives thecoefficient is positive 0127 (0053)33 Thus

29 The estimated marginal effect (z-statistic in parenthe-ses) is 0352 (1376) 16 104 (1481) and 0154(255) for (pidi) d i and (xi) respectively

30 The level is a normalization The sign is estimatedHad the sign on this regressor been positive this would haveimplied a rejection of the model

31 For the implied median level of risk aversion the re-stricted model produces a similar estimate to the estimate wereport below for the benchmark model However since someindividuals who chose the low deductible are estimated to havevery low claim rates the restricted model is ldquoforcedrdquo to estimatevery high risk aversion for these individuals (in contrast thebenchmark model can explain such choices by positive risk re-siduals) resulting in very high dispersion and (due to the lognor-mality assumption) very high average risk aversion which isabout 1025 higher than the benchmark estimates we report below

32 This nonmonotone pattern may explain why age enterswith different signs in the estimation results of Donkers et al(2001) and Hartog et al (2002) A somewhat similar U-shapepattern with respect to age is also reported by Sumit Agarwalet al (2006) in the context of consumer credit markets

33 The full results from these specifications are providedin the online Appendix (available at httpwwwe-aerorgdatajune0720050644_apppdf ) The reason we do not usethese income variables in the benchmark specification istheir imperfect coverage which would require us to omitalmost 20 percent of the individuals Other results fromthese regressions are similar to those we report for thebenchmark specification

761VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

overall we interpret our findings as suggestiveof a nonnegative and perhaps positive associ-ation between incomewealth and absolute riskaversion

At first glance these results may appear tobe inconsistent with the widely held beliefthat absolute risk aversion declines withwealth One should distinguish however be-tween two questions (a) whether for a givenindividual the vNM utility function exhibitsdecreasing absolute risk aversion and (b)how risk preferences vary across individuals

Our results do not speak to the first questionand should not be thought of as a test of thedecreasing absolute risk aversion propertyTesting this property would require observingthe same individual making multiple choicesat different wealth levels Rather our resultsindicate that individuals with greater wealthhave utility functions that involve a greaterdegree of risk aversion It might be thatwealth is endogenous and that risk aversion(or unobserved individual characteristics thatare correlated with it) leads individuals to

TABLE 4mdashTHE BENCHMARK MODEL

Variable Ln() equation Ln(r) equation Additional quantities

Demographics Constant 15406 (00073) 118118 (01032) Var-covar matrix ()

Age 00001 (00026) 00623 (00213) 01498 (00097)Age2 624 106 (263 105) 644 104 (211 104) r 31515 (00773)Female 00006 (00086) 02049 (00643) 08391 (00265)Family Single Omitted Omitted

Married 00198 (00115) 01927 (00974) Unconditional statisticsa

Divorced 00396 (00155) 01754 (01495) Mean 02196 (00013)Widower 00135 (00281) 01320 (02288) Median 02174 (00017)Other (NA) 00557 (00968) 04599 (07397) Std Dev 00483 (00019)

Education Elementary 00194 (00333) 01283 (02156) Mean r 00019 (00002)High school Omitted Omitted Median r 727 106 (756 107)Technical 00017 (00189) 02306 (01341) Std Dev r 00197 (00015)College 00277 (00124) 02177 (00840) Corr(r ) 02067 (00085)Other (NA) 00029 (00107) 00128 (00819)

Emigrant 00030 (00090) 00001 (00651) Obs 105800

Car attributes Log(value) 00794 (00177) 07244 (01272)Car age 00053 (00023) 00411 (00176)Commercial car 00719 (00187) 00313 (01239)Log(engine size) 01299 (00235) 03195 (01847)

Driving License years 00015 (00017) 00157 (00137)License years2 183 105 (351 105) 148 104 (254 104)Good driver 00635 (00112) 00317 (00822)Any driver 00360 (00105) 03000 (00722)Secondary car 00415 (00141) 01209 (00875)Business use 00614 (00134) 03790 (01124)History length 00012 (00052) 03092 (00518)Claims history 01295 (00154) 00459 (01670)

Young driver Young driver 00525 (00253) 02499 (02290)Gender Male Omitted mdash

Female 00355 (00061) mdashAge 17ndash19 Omitted mdash

19ndash21 00387 (00121) mdash21ndash24 00445 (00124) mdash24 00114 (00119) mdash

Experience 1 Omitted mdash1ndash3 00059 (00104) mdash3 00762 (00121) mdash

Company year First year Omitted OmittedSecond year 00771 (00122) 14334 (00853)Third year 00857 (00137) 28459 (01191)Fourth year 01515 (00160) 38089 (01343)Fifth year 04062 (00249) 39525 (01368)

Note Standard deviations based on the draws from the posterior distribution in parentheses Significant at the 5 percent confidence levela Unconditional statistics represent implied quantities for the sample population as a whole ie integrating over the distribution of covariates in the sample (as

well as over the unobserved components)

762 THE AMERICAN ECONOMIC REVIEW JUNE 2007

save more to obtain more education or totake other actions that lead to greaterwealth34

Let us make several additional observationsFirst while owners of more expensive cars ap-pear to have both higher risk exposure andhigher levels of risk aversion owners of biggercars have higher risk exposure but lower levelsof risk aversion This should indicate that thestructure of the model does not constrain therelationship between the coefficients in the twoequations Rather it is the data that ldquospeak uprdquoSecond individuals who are classified by theinsurer as ldquogood driversrdquo indeed have lowerrisk but also appear to have lower risk aversionThis result is somewhat similar to the positivecorrelation between unobserved risk and unob-served risk aversion which we report belowThird policyholders who tend to use the car forbusiness are less risk averse This could bebecause uninsured costs of accidents occurringto such policyholders are tax deductible Fourthpolicyholders who reported three full years ofclaims history are more risk averse but are notdifferent in their risk exposure The attitude thatleads such policyholders to comply with therequest to (voluntarily) report three full years ofclaims history is apparently and not surpris-ingly correlated with higher levels of risk aver-sion In contrast while past claims indicate highrisk they have no significant relationship withrisk aversion Finally we find a strong trendtoward lower levels of risk aversion over timeThis is a replication of the probit results dis-cussed earlier

The Effect of Individual Characteristics onClaim RiskmdashThe first column of Table 4 de-scribes the relationship between observablesand risk exposure Accident risk is higher fordivorced individuals and lower for people witha college education Bigger older more expen-sive and noncommercial cars are all morelikely to be involved in an accident Drivingexperience reduces accident rates as do mea-sures of less intense use of the car while youngdrivers are more exposed to risk Claim propen-sity is highly correlated over time the voluntaryreport of past claims is a strong predictor offuture claims This risk equation produces re-sults that are similar to those of the simplerPoisson regression reported in Table 3 Al-though some of the coefficients lose signifi-cance the magnitude of most coefficients isquite similar The similarity between these twosets of results is to be expected as the riskregression is identified primarily from the dataon claims so incorporating the information ondeductible choice does not qualitatively changethe conceptual identification strategy (see Sec-tion IIC) If the results were not similar thiscould have indicated a misspecification of themodel The slight differences between the riskregressions in Table 3 and Table 4 are drivenmainly by the structural assumptions First thebenchmark model estimates a lognormal mix-ture of Poisson rates rather than a single Pois-son model By incorporating the fatter tails ofthe claim distribution it slightly changes theresults increases the standard errors and de-creases the average predicted claim rate Sec-ond the information on deductible choiceslightly helps us in obtaining more precise es-timates through the correlation structure be-tween the error terms in the two equations

The Implied Level of Risk AversionmdashOne ofthe main goals of the estimation is to obtainmeasures of the level of risk aversion in thepopulation we study Since we use Gibbs sam-pling and augment the latent coefficients ofabsolute risk aversion we can directly obtainthe posterior distribution of various moments ofthis distribution At each iteration of the Gibbssampler we compute the mean standard devi-ation (across individuals) and various percen-tiles of the simulated draws of i and ri and thecorrelation between them The far-right columnof Table 4 reports the averages and standard

34 One may be tempted to interpret the positive wealtheffects as an indirect indication of credit constraints wealth-ier individuals are less credit constrained and therefore canafford to purchase more insurance We do not share thisinterpretation for two reasons First the insurance companyobserves these proxies for wealth and conditions on themwhen setting prices Since the willingness to pay for insur-ance is likely to be correlated with the willingness to pay forthe additional insurance provided by the low deductibleoption premiums already reflect this variation We condi-tion on the level of the premium Second paying less exante implies paying more ex post so applying the creditconstraint argument only to the ex ante payment but not tothe probabilistic ex post deductible payments has no theo-retical foundation Essentially the setup of the model linksthe ex ante decisions to the ex post losses which are bothdriven by the curvature of the vNM utility function

763VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

deviations of these computed quantities over theiterations of the Gibbs sampler35 The impliedrisk aversion of the mean individual is 00019which is about seven times greater than theback-of-the-envelope calculation presented inthe previous section As we assume a lognormal

distribution and estimate a relatively high dis-persion coefficient r the estimates also implysignificantly lower values for the median levelof risk aversion As shown later the qualitativepattern of these results is quite robust acrossspecifications

Table 5 summarizes our findings regardingthe level and dispersion of risk aversion Itpresents two ways to interpret the estimates weobtained from the benchmark model as well ascomparisons to specifications with constant ab-solute risk aversion (CARA) utility and with

35 Note that these estimated quantities are unconditionalIn computing these quantities we integrate over the distri-bution of observable characteristics in the data so onecannot compute these estimates from the estimated param-eter directly

TABLE 5mdashRISK-AVERSION ESTIMATES

Specificationa Absolute risk aversionb Interpretationc Relative risk aversiond

Back-of-the-envelope 10 103 9070 1484Benchmark model

Mean individual 67 103 5605 972225th percentile 23 106 9998 003Median individual 26 105 9974 03775th percentile 29 104 9714 42790th percentile 27 103 7834 390295th percentile 99 103 4937 14327

CARA utilityMean individual 31 103 7651 4436Median individual 34 105 9966 050

Learning modelMean individual 42 103 6886 6140Median individual 56 106 9995 008

Comparable estimatesGertner (1993) 31 104 9699 479Metrick (1995) 66 105 9934 102Holt and Laury (2002)e 32 102 2096 86575Sydnor (2006) 20 103 8329 5395

a The table summarizes the results with respect to the level of risk aversion ldquoBack-of-the-enveloperdquo refers to thecalculation we report in the beginning of Section III ldquobenchmark modelrdquo refers to the results from the benchmark model(Table 4) ldquoCARA utilityrdquo refers to a specification of a CARA utility function and ldquolearning modelrdquo refers to a specificationin which individuals do not know their risk types perfectly (see Section IIID) The last four rows are the closest comparableresults available in the literature

b The second column presents the point estimates for the coefficient of absolute risk aversion converted to $US1 unitsFor the comparable estimates this is given by their estimate of a representative CARA utility maximizer For all otherspecifications this is given by computing the unconditional mean and median in the population using the figures we reportin the mean and median columns of risk aversion in Table 6 (multiplied by 352 the average exchange rate during the periodto convert to US dollars)

c To interpret the absolute risk aversion estimates (ARA) we translate them into x u(w) 1frasl2 u(w 100) 1frasl2 u(w x) That is we report x such that an individual with the estimated ARA is indifferent about participating in a 50ndash50lottery of gaining 100 US dollars and losing x US dollars Note that since our estimate is of absolute risk aversion the quantityx is independent of w To be consistent with the specification we use a quadratic utility function for the back-of-the-envelopebenchmark and learning models and use a CARA utility function for the others

d The last column attempts to translate the ARA estimates into relative risk aversion We follow the literature and do soby multiplying the ARA estimate by average annual income We use the average annual income (after tax) in Israel in 1995(51168 NIS from the Israeli Census) for all our specifications and we use average disposable income in the United Statesin 1987 ($15437) for Gertner (1993) and Metrick (1995) For Holt and Laury (2002) and Sydnor (2006) we use a similarfigure for 2002 ($26974)

e Holt and Laury (2002) do not report a comparable estimate The estimate we provide above is based on estimating aCARA utility model for the 18 subjects in their experiment who participated in the ldquo90rdquo treatment which involved stakescomparable to our setting For these subjects we assume a CARA utility function and a lognormal distribution of theircoefficient of absolute risk aversion The table reports the point estimate of the mean from this distribution

764 THE AMERICAN ECONOMIC REVIEW JUNE 2007

incomplete-information (discussed later) and toother comparable figures in the literature Ourbenchmark estimate suggests that an averagequadratic utility maximizer36 will be indifferentabout participating in a 50ndash50 lottery in whichhe gains $100 or loses $5605 A CARA spec-ification suggests a significantly lower value foraverage risk aversion making the mean individ-ual indifferent about a 50ndash50 lottery of gaining$100 or losing $7651 The results from theincomplete information model for the meanlevel of risk aversion are in between these esti-mates All the results suggest that although themean individual exhibits a significant level ofrisk aversion heterogeneity in risk preferencesis important and the median individual is al-most risk neutral with respect to lotteries of$100 magnitude Although the mean is alwaysgreater than the median under the lognormaldistribution the large difference we find is notimposed In principle we could have obtained ahigh level of risk aversion with less heteroge-neity thereby leading to a smaller differencebetween the mean and the median (the esti-mated distribution of risk types is an example)

Let us briefly discuss the relevance of thecomparison to Gertner (1993) and Metrick(1995) There are two ways one can reconcilethe differences between the estimates Firstboth these papers measure risk aversion fortelevision game show participants these arehighly selected groups in a rather ldquorisk-friendlyrdquo environment37 Second the magni-tudes of the stakes are higher The showparticipants bet on several thousand dollars andmore while our average individual risks muchlower stakes in the range of $100 Thus thedifference in the results may be due to the issuesraised in Rabin (2000) regarding the compara-bility of behavior across different contexts andbet sizes We discuss this further in Section IV

A different way to quantify our estimate is byreporting them in relative terms There is noconsensus in the literature as to the relevantwealth that is taken into account in such deci-

sions Therefore for comparability we followthe closest papers in this respect (eg Gertner1993) and use annual income as the relevantwealth We multiply the estimated coefficient ofabsolute risk aversion by the average annualincome in Israel during the observation periodUnder the (questionable) assumption that an-nual income is a good proxy for the relevantwealth at the time of decision making thisproduct would be a proxy for the coefficient ofrelative risk aversion As Table 5 indicates ourbenchmark specification results in an impliedcoefficient of relative risk aversion of about 97A CARA specification results in a lower coef-ficient of 44 On the other hand the medianestimate for relative risk aversion is well belowone Thus the widely used estimate of a lowsingle-digit coefficient of relative risk aversionfalls somewhere between the median and themean and between the median and the seventy-fifth percentile of the risk aversion distribution

The Relationship between Unobserved Riskand Unobserved Risk AversionmdashTable 4 al-lows us to make observations about the relation-ship between risk and risk aversion We firstdiscuss the relative importance of unobservedheterogeneity of both dimensions In the popu-lation we study unobserved heterogeneity inrisk aversion (r) is much greater than unob-served heterogeneity in risk () This is trueboth in absolute terms (315 compared to 015respectively) and after normalizing by the cor-responding mean level38 using the coefficientof variation as a measure of dispersion (027compared to 01 respectively) It is also true forthe overall unconditional dispersion This couldindicate that selection on risk aversion is moreimportant in our data than adverse selection39 Theright metric to use for such statements is not

36 For such an individual the second-order Taylor ex-pansion we use in Section IIA is exact

37 We suspect that individuals who participate in televi-sion game shows are more adventuresome than the generalpopulation Moreover knowing that the audience mightwish to see them keep betting is likely to further encourageparticipants to take risks

38 All covariates are measured in deviations from theirsample mean so the estimated constant in each equation isthe estimated mean of the left-hand-side variable

39 An additional observation is that given our estimatesobservables account for slightly less than 50 percent of thevariation in ln(ri) but for almost 65 percent of the variation inln(i ) This may seem surprising given the finding that disper-sion in risk aversion is more important and thus should be thefocus of insurance companies However this finding is con-sistent with the conventional wisdom that insurance companiesspend much effort and resources on collecting information thathelps in risk classification but only little effort on informationthat predicts willingness to pay

765VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

entirely clear however as one should projectthese estimated variances onto the same scaleof say willingness to pay or profits Profits forexample are affected directly by risk but not byrisk aversion so the comparison above could bemisleading Therefore we relegate the discus-sion of this issue to Section IIIF where weshow that even when we look at pricing andprofits heterogeneity in risk aversion is moreimportant

Table 4 also indicates a strong and signif-icant positive correlation of 084 between un-observed risk aversion and unobserved riskThis result may be surprising because it isnatural to think that risk aversion with respectto financial decisions is likely to be associatedwith a greater tendency to take precautionsand therefore with lower risk Indeed a recentpaper by Finkelstein and McGarry (2006)supports such intuition by documenting anegative correlation between risk aversionand risk in the market for long-term careinsurance (see also Israel 2005 for the autoinsurance market in Illinois) Our marketmight however be special in ways that couldproduce a positive correlation First in con-trast to most insurance markets where apolicyholderrsquos risk depends on the policy-holderrsquos precautions but not on the precau-tions of others accident risk in the autoinsurance market is a result of an interactionbetween onersquos driving habits and those ofother drivers Second the correlation coeffi-cient may be highly sensitive to the particularway we measure risk and risk aversion Thereare many unobserved omitted factors that arelikely to be related to both dimensions Theintensity of vehicle use for example mightbe an important determinant of risk If indi-viduals who are more risk averse also drivemore miles per year a positive correlationbetween risk and risk aversion could emergeThus our results caution against assumingthat risk and risk aversion are always nega-tively correlated Whether this is the case maydepend on the characteristics of the particularmarket one studies and on the particular mea-sure for risk Indeed one can use estimatedannual mileage to control for one omittedvariable that may potentially work to producea positive correlation between risk aversionand risk Despite its partial coverage in thedata and being considered (by the company)

as unreliable40 controlling for annual mileagereported by policyholders reduces the esti-mated correlation coefficient to 06841 Weview this result as consistent with the possi-bility that underlying unobserved factors thataffect risk play an important role in generat-ing the estimated positive correlation betweenrisk and risk aversion A third explanation forthe positive estimated correlation is the dis-tributional assumption As discussed in Sec-tion IIC the correlation coefficient isprobably the parameter that is most sensitiveto these assumptions Indeed as discussedlater when we change our assumptions aboutthe Poisson process and use an alternativedistribution with extremely thin tails the es-timated correlation coefficient reverses signsWe of course view the assumptions of thebenchmark model as more appropriate andtherefore maintain the view that the data sug-gest a positive correlation Finally note thatwhile the correlation parameter we estimate ishigh the implied unconditional correlationbetween risk and risk aversion is less than025 across all reported specifications This isbecause the coefficients on the same covariate(for example the size of the car or whetherthe car is used for business) often affect riskand risk aversion in opposite directions andbecause of the logarithmic transformation

C Stability of the Risk Aversion Coefficients

Estimating risk preferences is motivated bythe idea that the same (or similar) risk aversionparameter may explain risky decisions acrossmultiple contexts This idea is at the heart of thecurrent debate we mention in the introductionregarding the empirical relevance of expectedutility and whether the standard construct of a

40 Insurance companies typically do not use these self-reported mileage estimates as they are considered unreli-able While companies could verify these estimates at thetime of a claim such reports are hard to enforce Anindividual can always claim that her ex ante estimate waslower than it turned out to be Indeed the estimated elas-ticity of risk with respect to estimated mileage is 006(0012) which seems low suggesting a bias downwardpotentially due to ldquoerrors in variablesrdquo bias

41 The full results from this specification are providedin the online Appendix Other results from this specifi-cation are similar to those we report for the benchmarkspecification

766 THE AMERICAN ECONOMIC REVIEW JUNE 2007

single risk aversion parameter that appliesacross many types of decisions is the right wayto think about consumer behavior To the bestof our knowledge there is no direct empiricalevidence on this issue While we defer much ofthis discussion to the concluding section weverify below that at least over a limited set ofchoices in which we observe the same individ-uals the estimated risk preferences help to ex-plain choices over time and across differentcontexts

Stability across ContextsmdashIdeally onewould like to show that the estimated risk pref-erences are stable across multiple lines of insur-ance and perhaps even across other riskydecisions individuals face Absent data on suchchoices we provide evidence that the estimatedrisk aversion coefficients help in predictingother related risky choices Individuals in ourdata had to make three additional coveragechoices in addition to the main policy choicethat we have analyzed so far They had tochoose whether to purchase coverage for caraudio equipment (bought by 498 percent of thesample of new policyholders) for towing ser-vices (bought by 929 percent) and for wind-shield damage (bought by 954 percent) Thesetypes of coverage are sold by the same companybut are administered by a third party so we donot have information about their realizationTherefore we cannot perform a similar exerciseto the one we perform for the deductible choiceOf course one should not expect these addi-tional coverage choices to be perfectly corre-lated with the deductible choice Even if thesame risk preferences are an important factorfor all of these decisions variation in risk acrosscoverage is also important For example own-ership of an expensive car audio system is likelyto affect the purchase of audio coverage inde-pendent of risk preferences Similarly ownersof old cars may value more coverage for towingservices Risk preferences however should en-ter into all of these coverage choices and there-fore we expect these additional types ofcoverages to be positively correlated with thechoice of a low deductible

We coded these three additional coveragechoices as three dummy variables and use themas additional covariates to verify that they canhelp explain unobserved risk aversion Firstwe add these variables to the probit regression

reported in column 3 of Table 3 They all havea positive statistically and economicallysignificant power in predicting the deductiblechoice42 To verify that these reported correla-tions are not driven by correlation in risks wealso estimate the benchmark model with theseadditional variables as covariates The esti-mated coefficients on the three additional cov-erage choices are positive and two of the threeare significant in both the risk and risk aversionequations they are 0053 (0019) 00002 (0021)and 0057 (0023) in the risk equation and0354 (0064) 0123 (0122) and 0813 (0174)in the risk aversion equation for audio towingand windshield coverage respectively (standarddeviations in parentheses)43 This suggests thatthe estimated risk preferences help in explainingmultiple coverage choices across these relatedcontexts

Stability over TimemdashWe now provide evi-dence that the estimated risk preferences arealso stable over time All the individuals whodecide to renew their policy with the companyafter it expires are at least in principle free tochange their deductible choice However anoverwhelming majority of individuals (morethan 97 percent) do not change their deductiblechoices when they renew Even individuals whoinitially chose the rarely chosen ldquohighrdquo andldquovery highrdquo deductible levels typically do notchange their choices Therefore it is also notsurprising that estimating the model on the firstdeductible choice of individuals and the seconddeductible choice of the same individuals yieldssimilar results as shown at the end of the nextsection Of course changes in deductiblechoices do not necessarily imply changes inrisk preferences Risk may also change overtime and may drive such changes At thesame time while persistence in deductiblechoices over time is consistent with stablerisk preferences it may also be driven bymany other factors For example it is reason-able to think that individuals do not devote

42 They are estimated to increase on average the prob-ability of choosing a low deductible by 0019 (819) 0023(527) and 0041 (758) for audio towing and windshieldcoverage respectively (z stat in parentheses)

43 The full results from this specification are provided in theonline Appendix Other results from this specification aresimilar to those we report for the benchmark specification

767VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

the same amount of thought to their renewaldecision as to their initial choice leading tothe observed persistence This caveat is themain reason why we focus most of the analysisonly on new policies

D Robustness

This section discusses the robustness of themain results Results from all specifications wemention in this section are summarized in Table6 The full results from these specifications arerelegated to the online Appendix

The von NeumannndashMorgenstern Utility Func-tionmdashTo derive the benchmark model we as-sume a negligible third derivative of the vNMutility function This is attractive as it allowsus to summarize risk preferences by a one-dimensional parameter r [u13(w)u(w)]There is however a large literature (eg MilesS Kimball and N Gregory Mankiw 1989) em-phasizing the importance of a (positive) third

derivative of the utility function which leads toprecautionary saving It is therefore importantto address the sensitivity of the results to thisassumption

By allowing a third-order Taylor approxima-tion of equation (4) we obtain

(15) r u13w

uw

p

d 1

d

uw

uw

dh2 dh dl dl

2

6d

which reduces to equation (7) when u(w) 0This illustrates that a positive third derivativeprovides an additional (precautionary) incentiveto insure It also shows that in order to fullydescribe preferences in the presence of a thirdderivative a one-dimensional parameter doesnot suffice one needs to know both r [u13(w)u(w)] and [u(w)u(w)] This makesit less attractive for estimation

TABLE 6mdashROBUSTNESS

Specificationa Sample Obs

Claim risk () Absolute risk aversion (r)

Corr(r ) Mean Median Std dev Mean Median Std dev

Baseline estimatesBenchmark model All new customers 105800 0220 0217 0048 19 103 73 106 0020 0207 0839

The vNM utility functionCARA utility All new customers 105800 0219 0217 0048 87 104 b 98 106 b 0009b 0201 0826

The claim generating processBenchmark model No multiple claims 103260 0182 0180 0040 20 103 28 105 0018 0135 0547Thinner-tail risk distribution All new customers 105800 0205c 0171c 0155c 17 103 19 106 0020 0076 0916

The distribution of risk aversionLower-bound procedure All new customers 105800 mdash mdash mdash 37 104 0 0002 mdash mdash

Incomplete information about riskBenchmark model Experienced drivers 82966 0214 0211 0051 21 103 83 106 0021 0200 0761Benchmark model Inexperienced drivers 22834 0230 0220 0073 30 103 12 107 0032 0186 0572Learning model All new customers 105800 0204 0191 0084 12 103 16 106 0016 0200 0772

Sample SelectionBenchmark model First two years 45739 0244 0235 0066 31 103 26 105 0026 0225 0699Benchmark model Last three years 60061 0203 0201 0043 16 103 34 107 0021 0113 0611Benchmark model Referred by a friend 26434 0213 0205 0065 30 103 84 107 0031 0155 0480Benchmark model Referred by advertising 79366 0219 0216 0051 21 103 76 106 0022 0212 0806Benchmark model Non-stayers 48387 0226 0240 0057 23 103 77 107 0026 0149 0848Benchmark model Stayers 1st choice 57413 0190 0182 0057 29 103 29 105 0024 0152 0463Benchmark model Stayers 2nd choice 57413 0208 0200 0065 30 103 16 105 0026 0211 0637

Note The table presents the key figures from various specifications and subsamples tracing the order they are presented in Section IIID Full results (in the format

of Table 4) from all these specifications are available in the online Appendixa ldquoBenchmark modelrdquo refers to the benchmark specification estimated on various subsamples (the first row replicates the estimates from Table 4) The other

specifications are slightly different and are all described in more detail in the corresponding parts of Section IIIDb The interpretation of r in the CARA model takes a slightly different quantitative meaning when applied to noninfinitesimal lotteries (such as the approximately

$100 stakes we analyze) This is due to the positive third derivative of the CARA utility function compared to the benchmark model in which we assume a small

third derivative Thus these numbers are not fully comparable to the corresponding figures in the other specificationsc The interpretation of in the thinner-tail distribution we estimate is slightly different from the standard Poisson rate which is assumed in the other specifications

Thus these numbers are not fully comparable to the corresponding figures in the other specifications

768 THE AMERICAN ECONOMIC REVIEW JUNE 2007

We can allow for a positive third derivativewithout expanding the dimensionality of pref-erences by imposing a parametric functionalform on the utility function Essentially suchparametric form imposes a relationship between[u13(w)u(w)] and [u(w)u(w)] Two standardforms are those that exhibit CARA and thosethat exhibit constant relative risk aversion(CRRA) CRRA requires us to make additionalassumptions about the relevant (and unob-served) wealth level of each individual makingit less attractive The CARA case is imple-mented below

With CARA utility we substitute u(w) exp(rw) in equation (4) and rearrange toobtain

(16) rp

exprdh exprdl

This equation defines the indifference set Unlikethe benchmark model there is no closed-formrepresentation for ri() a property that makesestimation significantly slower Due to the precau-tionary saving incentive arising from the thirdderivative we can use a lower level of absoluterisk aversion to rationalize the low deductiblechoice given In other words the CARA indif-ference set is flatter in comparison to the bench-mark case depicted in Figure 2 Thus the CARAspecification will generally lead to lower estimatesof the coefficient of absolute risk aversion asshown in Table 6 The mean and dispersion ofestimated absolute risk aversion (but not the me-dian) are smaller by about a factor of two Thegeneral qualitative results however remain thesame As shown in Table 5 the mean individual isquite risk averse with respect to lotteries of $100magnitude the median is close to risk neutral withrespect to such lotteries and heterogeneity in riskaversion is important

The Claim-Generating ProcessmdashIn thebenchmark model we assume that claims aregenerated by a Poisson model This assumptionis both analytically attractive (it is important inderiving equation (4)) and in our view a rea-sonable approximation of the data-generatingprocess for claims The Poisson distribution hasbeen widely used in economics to model acci-dent counts (eg Nancy L Rose 1990 Dionneand Vanasse 1992 Ronald Michener and CarlaTighe 1992) In using this distribution re-

searchers have followed the long history of theuse of Poisson by actuaries and insurance spe-cialists going back to Ladislaus Bortkiewicz(1898)

An important restriction of the Poisson dis-tribution is that its mean and variance are thesame Although some economic studies con-firmed this assumption for particular accidentdata (eg Christopher M Auld et al 2001) it isoften the case that this restriction is falsifiedThe most common deviation from the Poissonrestriction is that of fat tails ie variance that ishigher than the mean This led researchers touse a negative binomial distribution to accom-modate this regularity by introducing a secondparameter which delinks the relationship be-tween the mean and the variance One naturalinterpretation of the fat tails is that of unob-served heterogeneity and the negative binomialdistribution can be viewed as a Gamma mixtureof Poisson processes Consistent with this viewwe assume that the claim-generating processfollows a Poisson process at the individuallevel but allows unobserved heterogeneity inrisk and estimates a lognormal mixture of Pois-son processes which is similar to a negativebinomial The dispersion parameter we esti-mate is a free parameter which is identi-fied by the fatness of the tails of the claimdistribution

While one would like to allow fat tails of theaggregate claim distribution one may criticizethe assumption of a Poisson distribution at theindividual level for having tails that may be ldquotoofatrdquo Recently Jaap H Abbring et al (2003) andMark Israel (2004) provided evidence for neg-ative state dependence in data from auto insur-ance claims similar to the ones we useControlling for heterogeneity across individu-als these papers show that a second claim isless likely than a first This may happen due toexperience rating or perhaps to more carefuldriving and less intense use of a car after anaccident The Poisson distribution assumes thatthe second accident is just as likely so negativestate dependence may suggest thinner tails atthe individual level

To verify that the main results are not sensi-tive to this restriction of the Poisson model weperform two tests First we estimate the bench-mark model on a sample that includes onlyindividuals with one or no claims The model isstill identified using variation in policy durations

769VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

The estimates of the level of risk aversion andits dispersion across individuals remain similarto those of the benchmark model (Table 6) Theestimates of risk and the correlation with riskaversion are significantly lower But this isexpected we selected out of the sample thehigh-risk individuals

As a further test for the sensitivity of theresults to possible negative state dependencewe estimated the model with a different distri-bution at the individual level which gives riseto much thinner tails We could not find anyknown distribution of count variables that givesrise to tails thinner than Poissonrsquos so we madeone up In particular for a given Poisson ratelet pn be the probability of observing n occur-rences The distribution we take to the data issuch that

(17) Prn pn2

yenm0 pm2

Thus this distribution makes the probability ofmultiple claims much lower probably muchmore so than any realistic negative state depen-dence in the data Essentially such a distribu-tion makes the econometrician view individualswith multiple claims as high-risk individualswith much more certainty as it requires muchmore ldquobad luckrdquo for low-risk individuals tohave multiple claims The estimates for thelevel of risk aversion and its dispersion arealmost the same as in the benchmark model(Table 6) The rest of the summary figures aredifferent The difference in the level of risk isnot informative the interpretation of isslightly different given the change in the un-derlying distribution The dispersion in ismuch higher This is a direct result of the thin-tail assumption The dispersion of is identifiedby the tails of the claim distribution at theaggregate level With thinner tails imposed onthe individual distribution more heterogeneityis needed to match the observed tails at theaggregate Finally the correlation coefficient inthis specification changes signs and is now neg-ative and close to 1 This is consistent withour discussion in Section IIC which emphasizesthat the identification of the correlation coeffi-cient is closely tied to the structural assump-tions Once these are changed the estimatedcorrelation coefficient would change too Most

importantly however the mean and dispersionof risk aversion are stable

The Distribution of Absolute Risk Aver-sionmdashIn the benchmark model we assume thatthe coefficient of absolute risk aversion is lognor-mally distributed across individuals Since onlyfew previous studies focused on heterogeneity inrisk preferences there is not much existing evi-dence regarding the distribution of risk prefer-ences The only evidence we are aware of is theexperimental results presented by Steffen Andersenet al (2005) which show a skewed distributionwith a fat right tail which is qualitatively consis-tent with the lognormal distribution we assume

An additional advantage of the normality as-sumption is computational It provides a closed-form conditional distribution allowing us to useonly univariate (rather than bivariate) draws inthe estimation procedure significantly reducingthe computational burden One may be con-cerned about the sensitivity of the results to thisdistributional assumption For example it maydrive the result that the median level of riskaversion is much lower than the mean

Incorporating alternative distributional as-sumptions significantly complicates and slowsthe estimation procedure44 As an alternativewe develop a procedure that we believe pro-vides some guidance as to the sensitivity of theresults to this distributional assumption Thedisadvantage of the procedure is that it cannotaccount for adverse selection Since we foundthat adverse selection is not that important thisexercise is informative The exercise conveysthat the main qualitative results are not drivenby the (fat) tails of the lognormal distributionwe impose Rather they are driven by the highfraction of individuals who chose a low deduct-ible despite being of low risk The model im-plies that such individuals must have a fairlyhigh level of risk aversion

The exercise uses a Gibbs sampler to estimatethe lognormal distribution of i given the co-variates and the observed number of claims Ineach iteration of this Gibbs sampler conditional

44 A simple discrete-type distribution of risk aversion isnot well identified with the given data This is due to thenature of the exercise The distribution is identified bythe binary deductible choice and by the ldquolocalrdquo variation inthe pricing menu This variation is not enough to pin downthe discrete types accurately

770 THE AMERICAN ECONOMIC REVIEW JUNE 2007

on the data and the most recent draws for theindividual irsquos we compute lower bounds forthe level and dispersion of risk aversion (inaddition to generating the draws for ri from theconditional lognormal distribution) To deter-mine the lower bound for the level of risk aver-sion we compute

(18) r i i 0 if choicei 0

max0 pi

idi 113d i

if choicei 1

Namely we make ri as low as possible giventhe assumptions of the model We assume thatindividuals who chose a regular deductible arerisk neutral while individuals who chose a lowdeductible are just indifferent between the twodeductible levels (unless i is high enough inwhich case they are also assumed to be riskneutral) We then compute the average over therirsquos To compute the lower bound for the dis-persion we search for values of rirsquos that areconsistent with the assumptions of the modeland the observed data and that minimize thevariance of ri This turns out to be a simplesearch procedure that is linear in the number ofindividuals

The lower bound of the level of risk aversionthat we obtain from this exercise is 368 104

NIS1 (with standard deviation of 638 106)This level is five times lower than the analogouspoint estimate reported in Table 4 This trans-lates into indifference about participating in a50ndash50 lottery of gaining $100 and losing $885Thus it still suggests a significant level of riskaversion for lotteries of $100 magnitude Simi-larly the conversion to relative risk aversion asin Table 5 implies a relative risk aversion co-efficient of 1883 The result for the lowerbound of the dispersion is 189 103 (withstandard deviation of 816 105) which is tentimes lower than the benchmark estimate Thusthe coefficient of variation for absolute riskaversion declines only by two suggesting ahigh degree of heterogeneity Finally oneshould note that these bounds involve extremeassumptions and are computed separately andtherefore cannot be binding at the same timeThus the ldquocorrectrdquo estimates for both quantities

are likely to be higher closer to the results wereport for the benchmark model

Incomplete Information about RiskmdashThroughout we assume that individuals haveperfect information about their individual-specific objective risk type i This is a strongerassumption than we need Because expectedutility is linear in probabilities it suffices thatindividualsrsquo expected risk rate is equal to theirobjective risk rate ie i E(iIi) i wherei is a random variable representing individualirsquos perceived risk rate and Ii is individual irsquosinformation at the time of the coverage choiceNamely individuals could be uncertain abouttheir risk type but their point estimate has to becorrect

There are several channels through whichincomplete information may operate Let usconsider two such cases First suppose thatindividuals are correct but only on average iethat i i i where E(i) 0 The intuitionfor this case is similar to an ldquoerrors in variablesrdquomodel and in principle will result in an evenless important role for adverse selection Giventhat we find a relatively small effect of adverseselection this bias will not change this conclu-sion This may be even more pronounced ifcorr(i i) 0 which reflects a reasonableassumption of ldquoreversion to the meanrdquo ie thatan individualrsquos estimate of his risk type is someweighted average between his true risk type andthe average risk type of individuals who aresimilar (on observables) to him The conclusionmay go in the other way only if the mistakes goin the other direction according to which indi-viduals who are riskier than average believe thatthey are even more risky than they truly areThis we believe is less plausible

As a test for the sensitivity of the results tothe complete information assumption we per-form two exercises The first exercise estimatesthe benchmark model separately for experi-enced drivers and inexperienced drivers wherewe define an individual to be an experienceddriver if he has ten years or more driving expe-rience This is a conservative definition com-pared to those used in the literature (Chiapporiand Salanie 2000 Cohen 2005) The underlyingassumption consistent with evidence providedin Cohen (2005) is that drivers learn about theirown risk types as they gain more driving expe-rience but that most of this learning occurs

771VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

within the first ten years of driving Thus ourcomplete information model is a more appeal-ing description of experienced drivers Theresults for experienced drivers are almost iden-tical to the results from the benchmark specifi-cation (Table 6) suggesting that the mainresults are not driven by the fraction of individ-uals who are less likely to know their risk typesThe results for inexperienced drivers show asimilar pattern and magnitude Consistent withthe learning hypothesis however they show alarger dispersion in risk preferences which maybe due to incomplete information and there-fore to more random choices of deductiblelevels

The second exercise we perform is a morestructural version of the learning story whichallows even experienced drivers to have incom-plete information regarding their risk types Wedo so by estimating a different specification ofthe model that assumes that individuals areBayesian and update their information abouttheir own (stable) risk types over time usingonly information about the number of claimsthey make each year While we do not observecomplete claims histories of individuals wecan simulate such histories and integrate overthese simulations Thus individualsrsquo informa-tion would be related to their true type andwould be more precise with longer driving his-tories The full modeling and estimation detailsof this specification are provided in the Appen-dix We view this model as an extreme versionof incomplete information as there are manyother sources through which individuals maylearn about their own types and thereby havebetter information about their types than whatwe estimate them to have While the resultsimply that the levels of risk aversion and heter-ogeneity are lower than the benchmark esti-mates (Table 6) the order of magnitude andqualitative pattern are quite similar suggestingthat the main qualitative findings are robust tothe information structure

Sample SelectionmdashThere are two differentways to think about the contract selection pro-cess One possibility is that individuals firstselect an insurer based on advertisement wordof mouth or access to an agent Then individ-uals select the insurance contract from amongseveral contracts the selected insurer offersThis selection process is consistent with the way

the company executives view their business45

Another possibility is that individuals first col-lect information about all available insurancecontracts and then choose their most preferredone According to the industryrsquos conventionalwisdom individuals do not shop much acrossinsurers and therefore this selection processseems less important

The unique characteristics of direct insurersmay attract individuals who are more likely toexperiment with new ways of doing businessand may therefore be less risk averse than thegeneral population In Table 7 we compare thedemographics of our sample of policyholderswith those of the general Israeli populationThis comparison reflects a similar intuitioncompared with the general population our av-erage policyholder is slightly younger moreeducated more likely to be male and less likelyto be married or an immigrant This direction ofselection may also apply to unobserved riskpreferences thereby making our policyholderson average less risk averse than a representa-tive individual This suggests that the level ofrisk aversion that we find may be viewed as alower bound on the level of risk aversion in thegeneral Israeli population

One potential way to model sample selectionis to allow for an additional outside option to beselected For the vast majority of individuals weobserve the outside option is to purchase sim-ilar insurance from competing insurance agen-cies Unfortunately data on the structure ofcompeting contracts their prices and the waythey vary with individual characteristics are un-available This makes us uncomfortable to try tomodel sample selection as results from anysuch model will be driven by our assumptionsrather than by meaningful variation in the dataThe results are still meaningful for two reasonsFirst this is a large population accounting forabout 7 percent of all drivers in Israel Secondto the extent that our estimates suggest higherlevels of risk aversion than previously esti-mated and that the sample selection is likely to

45 As discussed in Section I the company was the firstdirect insurance provider and it offered significantly lowerpremiums than those offered by competitors due to signif-icant cost advantage In Section I we also discuss theliterature which emphasizes that choice of direct insurers isdriven primarily by nonmonetary ldquoamenitiesrdquo

772 THE AMERICAN ECONOMIC REVIEW JUNE 2007

bias these estimates downward the results arestill informative

To assess the magnitude of sample selectionin driving the main results we perform severaltests First we estimate the benchmark modelseparately for the first two years of the compa-nyrsquos operation during which the company wasthe only company to sell insurance directly andfor the remaining three years after additionaldirect insurers entered the market Second weestimate the benchmark model separately forindividuals who were referred to the companyby word of mouth and for those who heardabout the company through advertisement (pri-marily on television) The latter may havesearched more and might be more price sensi-tive Third we estimate the benchmark modelseparately for individuals who renewed theirpolicy with the company and for those who didnot renew It seems likely that individuals whodid not renew are less selected as most of themswitch back to regular insurers who insure themajority of the population Fourth we estimatethe model for the second deductible choicemade by those individuals who renew It couldbe argued that switching to other companies is

costly so outside options are not as important indriving deductible choices for those who renewAll these results are summarized in Table 6 andshow some important differences between thetwo groups within each pair of subsamples prob-ably reflecting selection For all groups howeverthe qualitative pattern of the results and the orderof magnitude and dispersion of risk aversion aresimilar to those of the full sample This is sugges-tive that correcting for sample selection is unlikelyto change the qualitative results

E Caveats

Moral HazardmdashThroughout our analysiswe abstract from moral hazard ie we assumethat i can vary across individuals but is invari-ant to the coverage choice There are two typesof moral hazard that may play a role in thiscontext First individuals with less coveragemay take greater precaution and drive morecarefully thereby reducing their claim risk rateSecond conditional on a claim event peoplewith higher deductibles are less likely to file aclaim there exists a range of claims for which

TABLE 7mdashREPRESENTATIVENESS

Variable Sampleb Populationc Car ownersd

Agea 4114 (1237) 4255 (1801) 4511 (1413)Female 0316 0518 0367Family Single 0143 0233 0067

Married 0780 0651 0838Divorced 0057 0043 0043Widower 0020 0074 0052

Education Elementary 0029 0329 0266High school 0433 0384 0334Technical 0100 0131 0165College 0438 0155 0234

Emigrant 0335 0445 0447

Obs 105800 723615 255435

a For the age variable the only continuous variable in the table we provide both the mean and the standard deviation (inparentheses)

b The figures are derived from Table 1 The family and education variables are renormalized so they add up to one weignore those individuals for whom we do not have family status or education level This is particularly relevant for theeducation variable which is not available for about half of the sample it seems likely that unreported education levels arenot random but associated with lower levels of education This may help in explaining at least some of the gap in educationlevels across the columns

c This column is based on a random sample of the Israeli population as of 1995 We use only adult population ieindividuals who are 18 years of age or older

d This column is based on a subsample of the population sample The data provide information about car ownership onlyat the household level not at the individual level Thus we define an individual as a car owner if (a) the household owns atleast one car and the individual is the head of the household or (b) the household owns at least two cars and the individualis the spouse of the head of the household

773VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

filing is profitable only under a low deductible(the literature often refers to this second effectas ldquoex post moral hazardrdquo)

To the extent that moral hazard exists abstract-ing from it will likely bias our estimates of riskaversion downward To see this note that adjust-ing behavior will help individuals to self-insureagainst uninsured costs Similarly ex post moralhazard would reduce the value of a low deduct-ible as in the event of a claim the gain from alower deductible would sometimes be less thanthe difference between the two deductible levelsBoth these effects will make a low deductible lessattractive requiring individuals to be even morerisk averse than we estimate in order to purchasemore coverage Below we discuss why in ourview abstracting from moral hazard is a reason-able approximation in this setting

It seems reasonable to conjecture that ceterisparibus insured individuals will drive less care-fully than uninsured ones It may also seemreasonable that the existence of a deductiblemay make individuals more careful about smalldamages to their car However when all choicesinclude a deductible and deductibles are similarin their magnitude it seems less likely thatdriving or care behavior will be affected46 Fi-nally to separately identify moral hazard onewould need another dimension of the data overwhich risk types remain fixed but coveragechoices vary exogenously (Israel 2004 Jaap HAbbring et al forthcoming)

We rely on the data to justify why we abstractfrom the second potential effect that of ex postmoral hazard Figure 3 presents data on theclaim amounts and shows that about 99 percentof the claims filed by policyholders with low-deductible policies were for amounts greaterthan the higher deductible level If the distribu-tion of amounts of potential claims does notvary with the deductible choice and if individ-uals file a claim for any loss that exceeds their

deductible level this suggests that 99 percent ofthe claims would have been filed under eitherdeductible choice making the assumption toabstract from moral hazard not very restrictive

Individuals however may choose not to filea claim even when the claim amount exceedsthe deductible level This may happen due toexperience rating which increases future insur-ance premiums These dynamic effects do notdepend on the deductible level at the time of theclaim so they simply enter in an additive wayUsing our data on individuals who renew theirpolicies with the company we can assess howbig the dynamic effects are These data showthat the price effect of a claim lasts for threeyears and is highest when an individual files hissecond claim within a year In such a case hewould face about a 20 percent increase in hisinsurance premium in the subsequent year 10percent in the year after and 5 percent in thethird year after the claim The regular premiumis about twice the regular deductible amount soan upper bound for the dynamic costs is 70percent of the regular deductible In most casesthe actual dynamic costs are much lower thanthis upper bound (the dynamic costs of say thefirst claim within a year are close to zero) Inaddition an individual can opt out of the con-tract and switch to a different insurance pro-vider This is likely to reduce his dynamic costsbecause in Israel unlike in the United Statesand many other countries there is no publicrecord of past claims Therefore insurance pro-viders can take full advantage of past recordsonly for their past customers For this reasonnew customers will of course face higher pre-miums than existing ones but the premium in-crease would not be as high as it would havebeen with the old insurance provider This isdue to the presence of ldquoinnocentrdquo new custom-ers who are pooled together with the switchers(Cohen 2003)47 Using 70 percent as a conser-vative upper bound Figure 3 shows that about93 percent of those claims filed by individualswith a low deductible would have also been46 This assumption is consistent with Cohen and Einav

(2003) who find no evidence for behavioral response tochanges in seat belt laws The following anecdotal obser-vation also supports this In an informal survey we con-ducted among our colleagues all of them were aware of adeductible in their auto insurance policy but fewer than 20percent knew its level This does not imply that 80 percentof our colleagues did not pay attention to their deductiblechoice at the time the choice was made It does implyhowever that their driving or care behavior cannot dependon the deductible level

47 New customers may voluntarily report their claimshistory to their new insurance provider Voluntary disclo-sure of past claims is as may be expected not truthful Ourdata suggest an unconditional claim rate of 02453 in oursample population Our data on claims history as voluntar-ily disclosed by the same individuals suggest a claim rate of00604 which is four times lower

774 THE AMERICAN ECONOMIC REVIEW JUNE 2007

filed with a regular deductible While this is notnegligible it applies for only a small fraction ofindividuals For the vast majority of them the99 percent discussed above is a more appropri-ate benchmark Therefore ex post moral hazardis unlikely to play a major role in this settingand one can abstract from the loss distributionand focus on claim rates as we do in thispaper48

Additional Cost of an AccidentmdashOur modelassumes that in the event of an accident theonly incurred costs are those associated with thedeductible payment In practice however other

transaction costs may be incurred such as thetime spent for appraisal of the damage the costsassociated with renting a replacement car for theduration of a repair etc Such costs could bereadily incorporated into the model To illus-trate we assume that these costs are known inadvance and are given by a constant c (whichcould in principle vary with each individual)Since c will not vary with the chosen level ofdeductible it will not affect the value d andwill enter the empirical model only through itseffect on d In particular equation (7) willchange to

(19) r

p

d 1

d c

and everything else will remain the same

48 We would not be as comfortable with this statementfor the choice of high and very high deductibles which areat much higher levels This is one additional reason to focusonly on the choice between low and regular deductibles

FIGURE 3 CLAIM DISTRIBUTIONS

Notes This figure plots kernel densities of the claim amounts estimated separately depending on the deductible choice Forease of comparison we normalize the claim amounts by the level of the regular deductible (ie the normalization is invariantto the deductible choice) and truncate the distribution at ten (the truncated part which includes a fat tail outside of the figureaccounts for about 25 percent of the distribution and is roughly similar for both deductible choices) The thick line presentsthe distribution of the claim amounts for individuals who chose a low deductible while the thin line does the same for thosewho chose a regular deductible Clearly both distributions are truncated from below at the deductible level The figure showsthat the distributions are fairly similar Assuming that the claim amount distribution is the same the area below the thickerline between 06 and 1 is the fraction of claims that would fall between the two deductible levels and therefore (absentdynamic incentives) would be filed only if a low deductible were chosen This area (between the two dotted vertical lines)amounts to 13 percent implying that the potential bias arising from restricting attention to claim rate (and abstracting fromthe claim distribution) is quite limited As we discuss in the text dynamic incentives due to experience rating may increasethe costs of filing a claim shifting the region in which the deductible choice matters to the right an upper bound to these costsis about 70 percent of the regular deductible covering an area (between the two dashed vertical lines) that integrates to morethan 7 percent Note however that these dynamic incentives are a very conservative upper bound they apply to less than 15percent of the individuals and do not account for the exit option which significantly reduces these dynamic costs

775VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

This implies that in principle such costs willhave no effect on the results of the counterfac-tual exercise we present later The costs willhowever affect the interpretation of the esti-mates of risk aversion In particular instead ofthe distribution of r we will now be estimatingthe distribution of r(d c)d so the reportedestimates of the coefficient of absolute riskaversion will be biased upward The magnitudeof the bias depends on the size of this transac-tion cost c compared to the average deductibled If c is relatively small the bias is negligibleIf however c is as big as the (average) deduct-ible level all our reported estimates of the levelof risk aversion should be divided by two (butthe coefficients on observables which are semi-elasticities will not change) The intuitionwould be similar but more involved if c variesacross individuals but not proportionally to d

Data about such transaction costs are ofcourse not available The following back-of-the-envelope exercise may provide some guidanceas to the magnitude of such costs We collecteddata from the Israeli police about the annual num-bers of accidents49 accidents with fatalities andaccidents with severe injuries in Israel for theyears 1996ndash1999 We then divided these numbersby our estimate of the total number of auto insur-ance claims in Israel (161859)50 We obtain that159 percent of the claims involve reported acci-dents 22 percent involve accidents with severeinjuries and 03 percent involve fatal accidentsThus the majority of claims reflect small unre-ported accidents perhaps suggesting that theseadditional costs of a claim are often not too large

Deviations from Expected Utility TheorymdashThroughout the paper we restrict attention toexpected utility maximizers Despite much ev-idence in the literature against some of the pre-dictions of expected utility theory it still seemsto us the most natural benchmark to specify andone that facilitates comparison to previous stud-ies We note that expected utility theory is as-sumed it is not and cannot be tested within our

framework Given our cross-sectional analysiswhich in principle allows flexible forms ofunobserved heterogeneity in risk preferencesthere are no testable restrictions imposed byexpected utility theory We should also note thatmuch (but not all) of the documented evidenceagainst expected utility theory arises with ex-treme risk probabilities which are close to zeroor one Our data (and our estimates) are basedon risk probabilities that are all in the range of010 to 035 Over this range expected utilityseems to perform better Finally it is importantto stress two points First at the conceptuallevel it is straightforward to use an alternativetheory of decisions under uncertainty If con-ditional on objective risk individuals vary in asingle dimension the same conceptual modeland empirical strategy can be applied All oneneeds to do is to specify the parameter overwhich decisions vary and construct an indiffer-ence set in the space of the specified parameterand (objective) risk types similar to the onepresented in Figure 2 Second any alternativemodel of decisions under uncertainty would re-quire us to take an even stronger view regardingthe parameterized objective function For exam-ple prospect theory (Daniel Kahneman andAmos Tversky 1979) would require us to pa-rameterize not only the curvature of individu-alsrsquo utility functions but also their referencepoints for which there is no natural choice inour context Similar issues arise if we try toapply decision weights (Tversky and PeterWakker 1995) or measures of overconfidencewith respect to driving ability

F Implications for Profits and Pricing

We now look at how firm profits vary withalternative pricing schemes This exercise isinteresting for several reasons First althoughwe do not use supply-side information for esti-mation it shows how one may incorporate suchinformation in estimation Second it illustratesthe conceptual trade-off faced by a monopolistthat operates in a market with adverse selectionAlthough the conceptual trade-off betweenhigher demand and worse selection is wellknown and has been extensively analyzed in thetheoretical literature quantifying it is importantto understand its empirical relevance Finallywe argued earlier that unobserved heterogeneityin risk aversion seems more important than un-

49 An accident is counted in this measure if it was re-ported to the police and a police officer arrived at the scene

50 This is estimated by taking the number of activepolicies at the end of our sample (45871) dividing it by ourbest guess for the share of the market the company had atthe time (7 percent) and multiplying it by the estimatedclaim rate in our data as computed in Table 2 (0245)

776 THE AMERICAN ECONOMIC REVIEW JUNE 2007

observed heterogeneity in risk The current ex-ercise shows that this conclusion also translatesto pricing and profits

Throughout this section we hold fixed thedistribution of risk and risk aversion the firmfaces Since we have little information about thedeterminants of overall demand faced by thefirm and hence how restrictive it is to hold thedistribution fixed we make the simplifying as-sumption that individuals make their choicessequentially They first choose the insuranceprovider by observing only the price of theregular deductible Once they decide to buy apolicy from the insurer they choose the deduct-ible level This seems a reasonable approxima-tion as the regular deductible is the one alwaysadvertised and initially quoted while the otheroptions are revealed only once the potentialcustomer and the insurance sales person ldquogetinto detailsrdquo Consistent with this assumptionwe assume that the regular premium and de-ductible are dictated by competitive conditionsand we focus on the choice of the low deduct-ible and its associated premium

From the companyrsquos standpoint each indi-vidual can be represented by a random draw of(i ri) from the conditional (on observables)distribution of risk and risk aversion

(20) ln i

ln ri Nxi

xi

2 r

r r2

When analyzing the optimal menu to offer suchan individual the company is assumed to berisk neutral and to maximize expected profitsSuppose the company offered only the regulardeductible and premium (dh ph) Let the ex-pected profits from this strategy be 0 Givenour assumptions we proceed by analyzing howthe firmrsquos profits are affected by a choice of anadditional low deductible option (dl pl) withdl dh and pl ph It is easy to use a changein variables and analyze the choice of d dh dl and p pl ph Expected profits arenow given by

(21) maxdp

0 Prri rii d pp

d Eiri rii d p

The trade-off in the companyrsquos decision isstraightforward Each new customer who chooses

the low combination pays an additional p up-front but saves d for each accident she is in-volved in This translates into two effects thatenter the companyrsquos decision problem The first issimilar to a standard pricing problem higher(lower) price difference (deductible difference)p (d) leads to a higher markup (on those indi-viduals who select the low deductible) but tolower quantity (or probability of purchase) asfewer individuals elect to choose the low deduct-ible This effect enters the profit function throughD(d p) Pr(ri ri(i d p)) The secondcomposition effect arises because of adverse se-lection As the price of the low deductible in-creases those individuals who still elect to choosethe low combination are ceteris paribus thosewith higher risk This effect enters through (dp) E(iri ri(i d p)) Its magnitude andsign depend on the relative heterogeneity of i andri and on the correlation between them Sinceneither D(d p) nor (d p) has a closed-form solution we analyze this decision problemgraphically where D(d p) and (d p) arenumerically computed using simulations from thejoint distribution

We illustrate our analysis by using the meanindividual in the data whose expected ln(i)and ln(ri) are 154 and 1181 respectively(based on Table 4) Such an individual has tochoose between a regular contract of ( phdh) (3189 1595) (in NIS) and a lowcontract of ( pl dl) (3381 957) ie (pd) (191 638) Below we discuss severaladditional counterfactual cases First we con-sider a case with a negative correlation be-tween risk and risk aversion (with the samemagnitude ie 084) Second we con-sider cases when the company ignores unob-served heterogeneity in one of the dimensionsie it views individuals as a draw from theestimated marginal distribution on one dimen-sion with the other dimension known and fixedat its estimated mean We do this exercise foreach dimension separately

To get intuition for the different effects Figure2 presents the estimated distribution in the spaceof (i ri) A small increase (decrease) in p (d)shifts the indifference setup to the right therebymaking some marginal individuals who were pre-viously just to the right of it switch to choosingthe regular deductible The demand trade-off isjust the comparison between the marginal loss ofthe company from all the marginal individuals

777VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

who no longer buy higher coverage with thehigher profits made from the inframarginal indi-viduals who still elect to choose higher coverageFigure 2 also helps in illustrating the effect ofadverse selection and the importance of the cor-relation coefficient As the menu shifts to the rightthe positive correlation implies that the marginalindividuals have higher risk than the average Thismeans that ldquolosingrdquo them (namely havingthem buy less coverage) is not as costly forthe insurance company as such individualsare on average more adversely selected Anegative correlation for example would havemade these marginal individuals more valuablethereby decreasing the incentive to increase pricesor deductibles from the current levels

Figure 4 presents the effects of pricing onprofits by varying the low-deductible levelkeeping the premium charged for it fixed at theobserved price (of p 191) It implies that the

current low deductible benefit of 638 NIS re-sults in additional annual profits of about 368NIS per customer This is about 037 percent oftotal operating profits per customer which isabout 1000 NIS Note however that after sub-tracting the administrative and claim-handlingcosts associated with each customer and claimthe relative magnitude of this effect will behigher Note also that the estimates imply thatthe current low-deductible level is suboptimalBy setting a smaller low-deductible benefit ofd 355 NIS (ie increasing the current lowdeductible by 283 NIS) additional profits canbe increased to 659 NIS51 Of course the find-

51 There is no reason of course to limit the choice of thecompany to only one additional deductible level Moredegrees of freedom in choosing the pricing menu will leadto higher profits

FIGURE 4 COUNTERFACTUALSmdashPROFITS

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) We plot the additional profits fromoffering a low deductible as a function of the attractiveness of a low deductible d dh dl fixing its price at the observedlevel The thick solid line presents the counterfactual profits as implied by the estimated benchmark model The other threecurves illustrate how the profits change in response to changes in the assumptions when the correlation between riskand risk aversion is negative (thin solid line) when there is no heterogeneity in risk aversion (dot-dashed line) and whenthere is no heterogeneity in risk (dashed line) The maxima (argmax) of the four curves respectively are 659 (355)714 (583) 0 and 704 (500) The dotted vertical line represents the observed level of d (638) which implies that theadditional profits from the observed low deductible are 368 NIS per policy

778 THE AMERICAN ECONOMIC REVIEW JUNE 2007

ing that the current pricing is suboptimal mayalso be due to a limitation of the model

Consistent with the intuition discussed aboveFigure 4 also shows that the incentive to in-crease prices (or lower the deductible) is higherwith positive correlation When the correlationbetween risk and risk aversion is negative theoptimal low deductible is lower It also showsthat ignoring either dimension of unobservedheterogeneity has an important effect on pric-ing decisions However while ignoring het-erogeneity in risk does not change thequalitative pattern by much ignoring hetero-geneity in risk aversion completely changesthe pricing analysis In fact given the esti-mated coefficients and the observed prices ifindividuals vary only in risk offering a lowdeductible does not increase profits due to ad-verse selection

Figure 5 breaks down the effect on profits bypresenting the pricing effect on the demand forlow deductible D(d p) and on the compo-sition effect (d p) The former is simplygenerated by the distribution of certainty equiv-alents implied by the joint distribution of i andri (Landsberger and Meilijson 1999) It is S-shaped due to the assumption of lognormal dis-tribution The shape of the composition effect isdriven by the relative variance of i and ri andby the correlation coefficient As the estimatesimply that most of the variation in certaintyequivalents is driven by variation in ri thestrong positive correlation implies that the com-position effect is monotonically decreasing inthe deductible level As the low-deductible op-tion becomes more favorable more peoplechoose it with the most risky individuals choos-ing it first The effect of the deductible level onthe composition effect is dramatically differentwhen the correlation between risk and risk aver-sion is negative When this is the case theobserved relationship between the deductiblelevel and the composition effect is mostly re-versed This is because the effect of risk aver-sion dominates that of adverse selection due toits higher variance

IV Concluding Remarks

The paper makes two separate contributionsFirst from a methodological standpoint we layout a conceptual framework through which one

can formulate and estimate a demand system forindividually customized contracts The key datarequirements for this approach are contractchoices individual choice sets and ex post riskrealizations Since such data may be available inmany other contexts the methodological frame-work may be useful to uncover structural pa-rameters in such settings As an example onecould consider annuity data and use guaranteeperiod choices and mortality data to identifyheterogeneity in risk (mortality) and in prefer-ences for wealth after death (Einav Finkelsteinand Paul Schrimpf 2006) Similarly one couldconsider loan data and use down-paymentchoices and default data to identify heterogene-ity in risk (default) and in liquidity (Will Ad-ams Einav and Jonathan D Levin 2006)Second from an economic standpoint we pro-vide a new set of estimates for the degree andheterogeneity of (absolute) risk aversion and itsrelationship with risk We discuss these below

While our estimates of risk aversion help topredict other related insurance decisions it isnatural to ask to what extent these parametersare relevant in other contexts This is essentiallyan empirical question which can be answeredonly by estimating risk aversion parameters fora variety of bet sizes and in a variety of con-texts Since isolating risk preferences in manycontexts is hard such exercises are rare leavingus with no definite answer for the scope ofmarkets for which our estimates may be rele-vant On one hand Rabin (2000) and Rabin andThaler (2001) argue that different decisions inlife are taken in different contexts and thereforemay be subject to different parameters in theutility function On the other hand classicaltheory suggests that each individual has onevalue function over her lifetime wealth so allrisky decisions take into account the same valuefunction and are therefore subject to the samerisk preferences Our view is somewhere inbetween We are more comfortable with extrap-olation of our risk aversion estimates to setupsthat are closer to the auto insurance marketcontext in which these parameters are esti-mated To assess ldquoclosenessrdquo it is important toconsider various factors over which contextsmay differ Such factors may include bet size aswell as informational and behavioral effectssuch as default options framing and rarity of theevents Since we use claim data to identify risktype we leave everything else to be interpreted as

779VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

FIGURE 5 COUNTERFACTUALSmdashSELECTION

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) Parallel to Figure 4 we breakdown the effects on profits to the share of consumers who choose a low deductible (bottom panel) and to the expected riskof this group (top panel) This is presented by the thick solid line for the estimates of the benchmark model As in Figure 4 wealso present these effects for three additional cases when the correlation between risk and risk aversion is negative (thin solidline) when there is no heterogeneity in risk aversion (dot-dashed line) and when there is no heterogeneity in risk (dashedline) The dotted vertical lines represent the observed level of d (638) for which the share of low deductible is 16 percentand their expected risk is 0264 This may be compared with the corresponding figures in Table 2A of 178 and 0309respectively Note however that the figure presents estimated quantities for the average individual in the data while Table2A presents the average quantities in the data so one should not expect the numbers to fit perfectly

780 THE AMERICAN ECONOMIC REVIEW JUNE 2007

risk aversion As an example overconfidence willbe captured by a lower level of estimated riskaversion so if overconfidence is more importantin auto insurance than in health insurance whenextrapolated to health insurance individuals maybehave as if they are more risk averse than weestimate them to be

With this caveat in mind let us discuss ourfour main findings First we find large hetero-geneity in risk preferences across individualsThis heterogeneity is important in various con-texts (a) it cautions against using pure cross-sectional variation to test expected utilitytheory (b) it may create strong selection ofparticipants into particular markets this maymake participants in voluntary markets quitedifferent in their risk preferences from those inmarkets in which participation is either manda-tory or driven by other factors and (c) it maymake the marginal individual quite differentfrom the average one Models in macroeconom-ics and finance which often use a representativeindividual framework may not be able to cap-ture and account for such differences

Our second set of findings concerns the wayrisk aversion relates to observable characteris-tics Our finding that women are more riskaverse than men has been documented in othersettings The finding that risk preferences ex-hibit a U-shaped pattern over the life cycle maybe interesting to explore further in other con-texts Other findings suggest that the estimatedcoefficient of risk aversion increases with ob-servables that are related to income and wealthAs we agree with the widely held belief of thedecreasing absolute risk aversion property ourpreferred interpretation for this finding is thatwealth and income may be endogenous gener-ating the estimated cross-sectional relationshipWhile lower risk aversion may be associatedwith higher propensity to become an entrepre-neur and thereby have higher wealth it mayalso be associated with lower propensity to saveor invest in education affecting wealth the otherway Therefore one important message of thisfinding is that accounting for heterogeneity inpreferences may be important as representativeconsumer models may provide misleading in-terpretations for otherwise natural results

The last two sets of findings concern therelationship between risk and preferences Sincerisk is particular to the context in which it ismeasured these findings may be sensitive to the

market context and may change once risk takesother forms Even within the auto insurancemarket it is important how risk is measuredMoreover risk in the auto insurance marketmay be conceptually different from risk in othermarkets In many markets risk is independentacross individuals In auto insurance howevermuch of the risk depends on coordinationamong drivers and therefore may be more re-lated to relative not absolute characteristicsFor this reason our finding of positive correla-tion between risk and risk aversion can coexistwith findings of negative correlation in othercontexts (Israel 2005 Finkelstein and McGarry2006) The positive correlation we find is alsoconsistent with the fact that the bivariate probittest in our data provides evidence for adverseselection (Cohen 2005) while similar reduced-form tests in other contexts do not

Finally we find that unobserved heterogene-ity in risk preferences is more important thanheterogeneity in risk This may be driven by thecasual evidence that insurance companies exertmuch effort and resources in collecting con-sumer data which are informative about riskclassification but not about preferences52 Weillustrate the empirical importance of our find-ings for the analysis of optimal contracts in autoinsurance The presence of more than one di-mension of unobserved heterogeneity may dra-matically change the nature of these contractsTheory is still not fully developed for suchmultidimensional screening problems as it typ-ically requires a small number of types (Lands-berger and Meilijson 1999) restricts the twodimensions to be independent of each other(Jean-Charles Rochet and Lars A Stole 2002)or assumes that the number of instrumentsavailable to the monopolist is not smaller thanthe dimension of unobserved heterogeneity(Steven Matthews and John Moore 1987 Rich-ard Arnott and Joseph E Stiglitz 1988)53 MarkArmstrong (1999) may be the closest theoreticalwork to the framework suggested here It cannot

52 This choice of data collection efforts may be justifiedif it is easier for such firms to price discriminate based onrisk but harder to price discriminate based on preferencesA common belief is that without cost-based (ie risk-based) justification for prices price discrimination may leadto consumer backlash

53 See also Smart (2000) Villeneuve (2003) and Jullienet al (2007) for related theoretical results

781VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

be directly applied however as it uses simpli-fying linearity assumptions which would behard to impose in the current context Our re-sults indicate that many applications can benefitfrom extending the theory to include the more

general case such as the one analyzed hereSuch a theory may also serve as a guide forusing supply-side restrictions in similar con-texts Our counterfactual analysis is a very pre-liminary start in this direction

APPENDIX A DESCRIPTION OF THE GIBBS SAMPLER

In this appendix we describe the setup of the Gibbs sampler that we use to estimate the modelOne of the main advantages of the Gibbs sampler is its ability to allow for data augmentation oflatent variables (Tanner and Wong 1987) In our context this amounts to augmenting the individual-specific risk aversion and risk type namely i rii1

n as additional parametersWe can write the model as follows

(22) ln i xi i

(23) ln ri xi vi

(24) choicei 1 if ri ri ()0 if ri ri ()

(25) claimsi Poissoni ti

(26) i

vi

iid

N00

2 r

r r2

Let

2 r

r r2 X

x1

xn

0

0 x1

xn

y r and ui i

vi

The set of parameters for which we want to have a posterior distribution is given by uii1

n The prior specifies that are independent of uii1n have a conventional

diffuse prior We adopt a hierarchical prior for uii1N

(27) uii 1n

iid

N0

(28) 1 Wishart2a Q

782 THE AMERICAN ECONOMIC REVIEW JUNE 2007

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 8: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

about a third of the policyholders in our dataAll these individuals would be affected by achange in the cap Much of the variation ofmenus in the data is driven by the exogenousshifts in the uniform deductible cap The un-derlying assumption is that conditional onobservables these sources of variation pri-marily affect the deductible choice of newcustomers but they do not have a significantimpact on the probability of purchasing insurancefrom the company Indeed this assumption holdsin the data with respect to observables there is nodistinguishable difference in the distribution ofobservable characteristics of consumers who buyinsurance just before and those who buy just aftera change in the deductible cap

C Summary Statistics

The top of Table 2A summarizes the deduct-ible menus all are calculated according to theformula described earlier Only 1 percent of thepolicyholders chose the high or very high de-ductible options Therefore for the rest of theanalysis we focus only on the choice betweenregular and low deductible options (chosen by811 and 178 percent of the individuals respec-tively)17 Focusing only on these options doesnot create any selection bias because we do notomit individuals who chose high or very highdeductibles For these individuals we assumethat they chose a regular deductible This as-sumption is consistent with the structural modelwe develop in the next section which impliesthat conditional on choosing high or very highdeductibles an individual would almost alwaysprefer the regular over the low deductible

The bottom of Table 2A as well as Table2B present summary statistics for the policy re-alizations We focus only on claim rates and noton the amounts of the claims This is because anyamount above the higher deductible level is cov-

ered irrespective of the deductible choice and thevast majority of the claims fit in this category (seeSection IIIE) For all these claims the gain fromchoosing a low deductible is the same in the eventof a claim and is equal to the difference betweenthe two deductible levels Therefore the claimamount is rarely relevant for the deductible choice(and likewise for the companyrsquos pricing decisionwe analyze in Section IIIF)

Averaging over all individuals the annualclaim rate was 0245 One can clearly observesome initial evidence of adverse selection Onaverage individuals who chose a low deductiblehad higher claim rates (0309) than those whochose the regular deductible (0232) Those whochose high and very high deductibles had muchlower claim rates (0128 and 0133 respectively)These figures can be interpreted in the context ofthe pricing formula A risk-neutral individual willchoose the low deductible if and only if her claimrate is higher than (pd) (plow pregular)(dregular dlow) When the deductible cap is notbinding which is the case for about two-thirds ofthe sample this ratio is given directly by thepricing formula and is equal to 03 Thus anyindividual with a claim rate higher than 03 willbenefit from buying the additional coverage pro-vided by a low deductible even without any riskaversion The claim data suggest that the offeredmenu is cheaper than an actuarially fair contractfor a nonnegligible part of the population (13percent according to the benchmark estimates re-ported below) This observation is in sharp con-trast to other types of insurance contracts such asappliance warranties which are much more ex-pensive than the actuarially fair price (Rabin andThaler 2001)

II The Empirical Model

A A Model of Deductible Choice

Let wi be individual irsquos wealth (pih di

h) theinsurance contract (premium and deductible re-spectively) with high deductible (pi

l dil) the

insurance contract with low deductible ti theduration of the policy and ui(w) individual irsquosvNM utility function We assume that the num-ber of insurance claims is drawn from a Poissondistribution with an annual claim rate iThrough most of the paper we assume that i isknown to the individual We also assume that iis independent of the deductible choice ie that

17 The small frequency of ldquohighrdquo and ldquovery highrdquochoices provides important information about the lowerends of the risk and risk aversion distributions but (for thatsame reason) makes the analysis sensitive to functionalform Considering these options or the option of not buyinginsurance creates a sharp lower bound on risk aversion forthe majority of the observations making the estimates muchhigher Given that these options are rarely selected how-ever it is not clear to us whether they were regularlymentioned during the insurance sales process renderingtheir use somewhat inappropriate

752 THE AMERICAN ECONOMIC REVIEW JUNE 2007

there is no moral hazard Finally we assumethat in the event of an accident the value of theclaim is greater than di

h We revisit all theseassumptions in Sections IIID and IIIE For therest of this section i subscripts are suppressedfor convenience

In the market we study insurance policies are

typically held for a full year after which theycan be automatically renewed with no commit-ment by either the company or the individualMoreover all auto-insurance policies sold inIsrael can be canceled without prior notice bythe policyholder with premium payments beinglinearly prorated Both the premium and the

TABLE 2AmdashSUMMARY STATISTICSmdashMENUS CHOICES AND OUTCOMES

Variable Obs Mean Std dev Min Max

Menu Deductible (current NIS)a Low 105800 87548 12101 37492 103911Regular 105800 145299 19779 62486 171543High 105800 260802 35291 112475 308778Very high 105800 376305 50853 162464 446013

Premium (current NIS)a Low 105800 338057 91404 132471 1923962Regular 105800 318922 8623 124972 1815058High 105800 279057 75451 109351 1588176Very high 105800 255137 68984 99978 1452046

pd 105800 0328 006 03 18

Realization Choice Low 105800 0178 038 0 1Regular 105800 0811 039 0 1High 105800 0006 008 0 1Very high 105800 0005 007 0 1

Policy termination Active 105800 0150 036 0 1Canceled 105800 0143 035 0 1Expired 105800 0707 046 0 1

Policy duration (years) 105800 0848 028 0005 108Claims All 105800 0208 048 0 5

Low 18799 0280 055 0 5Regular 85840 0194 046 0 5High 654 0109 034 0 3Very high 507 0107 032 0 2

Claims per yearb All 105800 0245 066 0 19882Low 18799 0309 066 0 9264Regular 85840 0232 066 0 19882High 654 0128 062 0 12636Very high 507 0133 050 0 3326

a The average exchange rate throughout the sample period was approximately $1 per 35 NIS starting at 13 in late 1994and reaching 14 in late 1999

b The mean and standard deviation of the claims per year are weighted by the observed policy duration to adjust forvariation in the exposure period These are the maximum likelihood estimates of a simple Poisson model with no covariates

TABLE 2BmdashSUMMARY STATISTICSmdashCONTRACT CHOICES AND REALIZATIONS

Claims Low Regular High Very high Total Share

0 11929 (0193) 49281 (0796) 412 (0007) 299 (0005) 61921 (100) 080341 3124 (0239) 9867 (0755) 47 (0004) 35 (0003) 13073 (100) 016962 565 (0308) 1261 (0688) 4 (0002) 2 (0001) 1832 (100) 002383 71 (0314) 154 (0681) 1 (0004) 0 (0000) 226 (100) 000294 6 (0353) 11 (0647) 0 (0000) 0 (0000) 17 (100) 000025 1 (0500) 1 (0500) 0 (0000) 0 (0000) 2 (100) 000003

Notes The table presents tabulation of choices and number of claims For comparability the figures are computed only forindividuals whose policies lasted at least 09 years (about 73 percent of the data) The bottom rows of Table 2A providedescriptive figures for the full dataset The numbers in parentheses in each cell represent percentages within each row Theright-hand-side column presents the marginal distribution of the number of claims

753VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

probability of a claim are proportional to thelength of the time interval taken into account soit is convenient to think of the contract choice asa commitment for only a short amount of timeThis approach has several advantages First ithelps to account for early cancellations andtruncated policies which together constitute 30percent of the policies in the data18 Second itmakes the deductible choice independent ofother longer-term uncertainties faced by the in-dividual so we can focus on static risk-takingbehavior Third this formulation helps to obtaina simple framework for analysis which is attrac-tive both analytically and computationally19

The expected utility that the individual obtainsfrom the choice of a contract (p d) is given by

(2) v p d

1 tuw pt tuw pt d

We characterize the set of parameters that willmake the individual indifferent between the twooffered contracts This set provides a lower (up-per) bound on the level of risk aversion for indi-viduals who choose the low (high) deductible (fora given ) Thus we analyze the equation v(phdh) v(pl dl) By taking limits with respect to t(and applying LrsquoHopitalrsquos rule) we obtain

(3)

limt30

1

t(u(w pht) u(w plt))

(u(w pht) u(w pht dh)) (u(w plt) u(w plt dl))

pl phuw

uw dl uw dh

or

(4)

pl phuw uw dl uw dh

The last expression has a simple intuition Theright-hand side is the expected gain (in utils) perunit of time from choosing a low deductibleThe left-hand side is the cost of such a choiceper unit of time For the individual to be indif-ferent the expected gains must equal the costs

In our benchmark specification we assumethat the third derivative of the vNM utilityfunction is not too large A Taylor expansion forboth terms on the right-hand side of equation(4) ie u(w d) u(w) du(w) (d22)u13(w) implies that

(5)pl ph

uw dh dluw

1

2dh dldh dlu13w

Let d dh dl 0 p pl ph 0 andd 1frasl2 (dh dl) to obtain

(6)p

duw uw d u13w

or

(7) r u13w

uw

p

d 1

d

where r is the coefficient of absolute risk aver-sion at wealth level w

18 As can be seen in Table 2A 70 percent of the policiesare observed through their full duration (one year) About15 percent are truncated by the end of our observationperiod and the remaining 15 percent are canceled for var-ious reasons such as change in car ownership total-lossaccident or a unilateral decision of the policyholder tochange insurance providers

19 This specification ignores the option value associatedwith not canceling a policy This is not very restrictiveSince experience rating is small and menus do not changeby much this option value is likely to be close to zero Asimple alternative is to assume that individuals behave as ifthey commit for a full year of coverage In such a case themodel will be similar to the one we estimate but willdepend on the functional form of the vNM utility functionand would generally require taking infinite sums (over thepotential realizations for the number of claims within the year)In the special case of quadratic expected utility maximizerswho care only about the mean and variance of the number ofclaims this is easy to solve The result is almost identical to theexpression we subsequently derive in equation (7)

754 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Equation (7) defines an indifference set in thespace of risk and risk aversion which we willrefer to by (r() ) and ((r) r) interchange-ably Both r() and (r) have a closed-formrepresentation a property that will be computa-tionally attractive for estimation20 Both termsare individual specific as they depend on thedeductible menu which varies across individu-als For the rest of the paper we regard eachindividual as associated with a two-dimensionaltype (ri i) An individual with a risk parameter

i who is offered a menu (pih di

h) (pil di

l)will choose the low-deductible contract if andonly if his coefficient of absolute risk aversionsatisfies ri ri(i) Figure 2 presents a graphi-cal illustration

B The Benchmark Econometric Model

The Econometric ModelmdashThe econometricmodel we estimate is fully described by the fiveequations presented in this section Our ob-jective is to estimate the joint distribution of(i ri)mdashthe claim rate and coefficient of abso-lute risk aversionmdashin the population of policy-holders conditional on observables xi The

20 For example estimating the CARA version of themodel (Section IIID) for which r() does not have aclosed-form representation takes almost ten times longer

FIGURE 2 THE INDIVIDUALrsquoS DECISIONmdashA GRAPHICAL ILLUSTRATION

Notes This graph illustrates the individualrsquos decision problem The solid line presents the indifference setmdashequation (7)mdashappliedfor the menu faced by the average individual in the sample Individuals are represented by points in the two-dimensional spaceabove In particular the scattered points are 10000 draws from the joint distribution of risk and risk aversion for the averageindividual (on observables) in the data based on the point estimates of the benchmark model (Table 4) If an individual is eitherto the right of the line (high risk) or above the line (high risk aversion) the low deductible is optimal Adverse selection is capturedby the fact that the line is downward sloping higher-risk individuals require lower levels of risk aversion to choose the lowdeductible Thus in the absence of correlation between risk and risk aversion higher-risk individuals are more likely to choosehigher levels of insurance An individual with i (pidi) will choose a lower deductible even if he is risk neutral ie withprobability one (we do not allow individuals to be risk loving) This does not create an estimation problem because i is notobserved only a posterior distribution for it Any such distribution will have a positive weight on values of i that are below(pidi) Second the indifference set is a function of the menu and in particular of (pidi) and d An increase in (pidi) willshift the set up and to the right and an increase in d will shift the set down and to the left Therefore exogenous shiftsof the menus that make both arguments change in the same direction can make the sets ldquocrossrdquo thereby allowing us toidentify the correlation between risk and risk aversion nonparametrically With positive correlation (shown in the figureby the ldquoright-bendingrdquo shape of the simulated draws) the marginal individuals are relatively high risk therefore creatinga stronger incentive for the insurer to raise the price of the low deductible

755VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

benchmark formulation assumes that (i ri) fol-lows a bivariate lognormal distribution Thuswe can write the model as

(8) ln i xi i

(9) ln ri xi vi

with

(10) i

vi

iid

N00

2 r

r r2

Neither i nor ri is directly observed Thereforewe treat both as latent variables Loosely speak-ing they can be thought of as random effects Weobserve two variables the number of claims andthe deductible choice which are related to thesetwo unobserved components Thus to completeour econometric model we have to specify therelationship between the observed variables andthe latent ones This is done by making two struc-tural assumptions First we assume that the num-ber of claims is drawn from a Poisson distributionnamely

(11) claimsi Poisson(i ti )

where ti is the observed duration of the policySecond we assume that when individuals maketheir deductible choices they follow the theoreti-cal model described in the previous section Themodel implies that individual i chooses the lowdeductible (choicei 1) if and only if ri ri(i)where ri is defined in equation (7) Thus theempirical model for deductible choice is given by

(12) Prchoicei 1 Prri

pi

idi 1

d i

Prexp(xi vi)

pi

exp(xi i)di 1

d i

With no unobserved heterogeneity in risk (i 0) equation (12) reduces to a simple probit In

such a case one can perfectly predict risk from thedata denote it by (xi) and construct an addi-tional covariate ln([pi((xi)di) 1]d i) Giventhe assumption that risk aversion is distributedlognormally running the probit regressionabove and renormalizing the coefficient onthe constructed covariate to 1 (instead ofthe typical normalization of the variance of theerror term to 1) has a structural interpretation withln (ri) as the dependent variable However Cohen(2005) provides evidence of adverse selection inthe data implying the existence of unobservedheterogeneity in risk This makes the simple probitregression misspecified Estimation of the fullmodel is more complicated Once we allow forunobserved heterogeneity in both unobserved riskaversion (vi) and claim rate (i) we have to inte-grate over the two-dimensional region depicted inFigure 2 for estimation

EstimationmdashA natural way to proceed is toestimate the model by maximum likelihoodwhere the likelihood of the data as a function ofthe parameters can be written by integrating outthe latent variables namely

(13) Lclaimsi choicei

Prclaimsi choiceii riPri ri

where is a vector of parameters to be esti-mated While formulating the empirical modelusing likelihood may help our thinking aboutthe data-generating process using maximumlikelihood (or generalized method of moments(GMM)) for estimation is computationally cum-bersome This is because in each iteration itrequires evaluating a separate complex integralfor each individual in the data In contrastMarkov Chain Monte Carlo (MCMC) Gibbssampling is quite attractive in such a case Us-ing data augmentation of latent variables (Mar-tin A Tanner and Wing Hung Wong 1987)according to which we simulate (i ri) and latertreat those simulations as if they are part ofthe data one can avoid evaluating the complexintegrals by sampling from truncated normaldistributions which is significantly less compu-tationally demanding (eg Luc Devroye 1986)This feature combined with the idea of aldquosliced samplerrdquo (Paul Damien John Wake-field and Stephen Walker 1999) to sample from

756 THE AMERICAN ECONOMIC REVIEW JUNE 2007

an unfamiliar posterior distribution makes theuse of a Gibbs sampler quite efficient for ourpurposes Finally the lognormality assumptionimplies that F(ln(i)ri) and F(ln(ri)i) follow a(conditional) normal distribution allowing us torestrict attention to univariate draws furtherreducing the computational burden

The Appendix provides a full description ofthe Gibbs sampler including the conditionaldistributions and the (flat) prior distributions weuse The basic intuition is that conditional onobserving (i ri) for each individual we have asimple linear regression model with two equa-tions The less standard part is to generate drawsfor (i ri) We do this iteratively Conditionalon i the posterior distribution for ln(ri) fol-lows a truncated normal distribution where thetruncation point depends on the menu individ-ual i faces and its direction (from above orbelow) depends on individual irsquos deductiblechoice The final step is to sample from theposterior distribution of ln(i) conditional on riThis is more complicated as we have bothtruncation which arises from adverse selection(just as we do when sampling for ri) and thenumber of claims which provides additionalinformation about the posterior of i Thus theposterior for i takes an unfamiliar form forwhich we use a ldquosliced samplerrdquo

We use 100000 iterations of the Gibbs sam-pler It seems to converge to the stationarydistribution after about 5000 iterations There-fore we drop the first 10000 draws and use thelast 90000 draws of each variable to report ourresults The results are based on the posteriormean and posterior standard deviation fromthese 90000 draws Note that each iterationinvolves generating separate draws of (i ri) foreach individual Performing 100000 iterationsof the benchmark specification (coded in Mat-lab) takes about 60 hours on a Dell Precision530 workstation

C Identification

The parametric version of the model is iden-tified mechanically There are more equationsthan unknowns and no linear dependenciesamong them so (as also verified using MonteCarlo exercises) the model parameters can bebacked out from simulated data Our goal in thissection is not to provide a formal identificationproof Rather we want to provide intuition for

which features of the data allow us to identifyparticular parameters of the model The discus-sion also highlights the assumptions that areessential for identification vis-a-vis those thatare made only for computational convenience(making them in principle testable)

Discussion of Nonparametric Identifica-tionmdashThe main difficulty in identifying themodel arises from the gap between the (exante) risk type i which individuals usewhen choosing a deductible and the (ex post)realization of the number of claims we ob-serve We identify between the variation inrisk types and the variation in risk realizationsusing our parametric distributional assump-tions The key is that the distribution of risktypes can be uniquely backed out from theclaim data alone This allows us to use thedeductible choice as an additional layer ofinformation which identifies unobserved het-erogeneity in risk aversion21 Any distribu-tional assumption that allows us to uniquelyback out the distribution of risk types fromclaim data would be sufficient to identify thedistribution of risk aversion As is customaryin the analysis of count processes we make aparametric assumption that claims are gener-ated by a lognormal mixture of Poisson dis-tributions (Section IIID discusses this furtherand explores an alternative) Using a mixtureenables us to account for adverse selectionthrough unobserved heterogeneity in risk Italso allows us to better fit the tails of theclaim distribution In principle a more flexi-ble mixture or a more flexible claim-generat-ing process could be identified as long as theclaims data are sufficiently rich22

21 Cardon and Hendel (2001) face a similar identificationproblem in the context of health insurance They use vari-ation in coverage choice (analogous to deductible choice) toidentify the variation in health-status signals (analogous torisk types) from the variation in health expenditure (analo-gous to number of claims) They can rely on the coveragechoice to identify this because they make an assumptionregarding unobserved heterogeneity in preferences (iidlogit) We take a different approach as our main goal is toestimate rather than assume the distribution of preferences

22 Although it may seem that the claim data are limited(as they take only integer values between 0 and 5 in ourdata) variation in policy duration generates continuousvariation in the observed claim propensity Of course thisvariation also introduces an additional selection into themodel due to policy cancellations which are potentially

757VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Once the distribution of risk types is iden-tified from claims data the marginal distribu-tion of risk aversion (and its relationship tothe distribution of risk types) is nonparametri-cally identified from the variation in theoffered menus discussed in Section I Thisvariation implies different deductible and pre-mium options to identical (on observables)individuals who purchased insurance at dif-ferent times Different menus lead to differentindifference sets (similar to the one depictedin Figure 2) These sets often cross each otherand nonparametrically identify the distribu-tion of risk aversion and the correlation struc-ture at least within the region in which theindifference sets vary For the tails of thedistribution as is typically the case we haveto rely on parametric assumptions or usebounds The parametric assumption of log-normality we use for most of the paper ismade only for computational convenience

Intuition for the Parametric IdentificationMechanismmdashVariation in the offered menus isimportant for the nonparametric identificationThe parametric assumptions could identify themodel without such variation Thus to keep theintuition simple let us take the bivariate log-normal distribution as given and contrary to thedata assume that all individuals are identical onobservables and face the same menu Supposealso that all individuals are observed for exactlyone year and have up to two claims23 In thissimplified case the model has five parametersto be estimated the mean and variance of risk and

2 the mean and variance of risk aver-sion r and r

2 and the correlation parameter The data can be summarized by five numbersLet 130 131 and 132 1 131 130 be thefraction of individuals with zero one and twoclaims respectively Let 0 1 and 2 be theproportion of individuals who chose a low de-ductible within each ldquoclaim grouprdquo Given ourdistributional assumption about the claim-

generating process we can use 130 and 131 touniquely identify and

2 Loosely isidentified by the average claim rate in the dataand

2 is identified by how fat the tail of theclaim distribution is ie by how large (132131)is compared to (131130) Given and

2 and thelognormality assumption we can (implicitly)construct a posterior distribution of risk typesfor each claim group F(r claims c) andintegrate over it when predicting the deductiblechoice This provides us with three additionalmoments each of the form

(14) Ec Prchoice 1r

dFr claims c dFr

for c 0 1 2 These moments identify thethree remaining parameters of the model rr

2 and Let us now provide more economic content to

the identification argument Using the same ex-ample and conditional on identifying and

2

from the claim data one can think about thedeductible choice data 0 1 2 as a graph(c) The absolute level of the graph identifiesr In the absence of correlation between riskand risk aversion the slope of the graph iden-tifies r

2 with no correlation the slope shouldalways be positive (due to adverse selection)but higher r

2 would imply a flatter graph be-cause more variation in the deductible choiceswill be attributed to variation in risk aversionFinally is identified by the curvature of thegraph The more convex (concave) the graph isthe more positive (negative) is the estimated For example if 0 05 1 051 and 2 099 it is likely that r

2 is high (explaining why0 and 1 are so close) and is highly positive(explaining why 2 is not also close to 1) Incontrast if 0 1 it must mean that thecorrelation between risk and risk aversion isnegative which is the only way the originalpositive correlation induced by adverse selec-tion can be offset This intuition also clarifiesthat the identification of relies on observingindividuals with multiple claims (or differentpolicy durations) and that it is likely to besensitive to the distributional assumptions Thedata (Table 2B) provide a direct (positive) cor-relation between deductible choice and claims

endogenous The results are similar when we use onlyexpired and truncated policies

23 This variation is sufficient (and necessary) to identifythe benchmark model The data provide more variation weobserve up to five claims per individual we observe con-tinuous variation in the policy durations we observe vari-ation in prices and we exploit distributional restrictionsacross individuals with different observable characteristics

758 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The structural assumptions allow us to explainhow much of this correlation can be attributedto adverse selection The remaining correlation(positive or negative) is therefore attributed tocorrelation in the underlying distribution of riskand risk aversion

III Results

A Reduced-Form Estimation

To get an initial sense for the levels of abso-lute risk aversion implied by the data we use asimple back-of-the-envelope exercise We com-pute unconditional averages of p d dh dland d (Table 2A)24 and substitute these valuesin equation (7) The implied coefficient of ab-solute risk aversion is 29 104 NIS125 Thisfigure could be thought of as the average indif-ference point implying that about 18 percent ofthe policyholders have coefficients of absoluterisk aversion exceeding it To convert to USdollar amounts one needs to multiply thesefigures by the average exchange rate (352)resulting in an average indifference point of102 103 $US1 This figure is less than halfof a similar estimate reported by Sydnor (2006)for buyers of homeownerrsquos insurance but about3 and 13 times higher than comparable fig-ures reported by Gertner (1993) and Metrick(1995) respectively for television game showparticipants

Table 3 provides reduced-form analysis ofthe relationship between the observables andour two left-hand-side variables the number ofclaims and the deductible choice26 Column 1reports the estimates from a Poisson regressionof the number of claims on observed character-istics This regression is closely related to therisk equation we estimate in the benchmarkmodel It shows that older people women andpeople with a college education are less likely to

have an accident Bigger more expensiveolder and noncommercial cars are more likelyto be involved in an accident Driving experi-ence and variables associated with less intenseuse of the car reduce accident rates As could beimagined claim propensity is highly correlatedover time past claims are a strong predictor offuture claims Young drivers are 50 to 70 per-cent more likely to be involved in an accidentwith young men significantly more likely thanyoung women Finally as indicated by the trendin the estimated year dummies the accident ratesignificantly declined over time Part of thisdecline is likely due to the decline in accidentrates in Israel in general27 This decline mightalso be partly due to the better selection ofindividuals the company obtained over time asit gained more experience (Cohen 2003)

Columns 2 and 3 of Table 3 present estimatesfrom probit regressions in which the dependentvariable is equal to one if the policyholder chosea low deductible and is equal to zero otherwiseColumn 3 shows the marginal effects of thecovariates on the propensity to choose a lowdeductible These marginal effects do not have astructural interpretation as the choice of lowdeductible depends on its price on risk aver-sion and on risk In this regression we againobserve a strong trend over time Fewer policy-holders chose the low deductible as time wentby One reason for this trend according to thecompany executives is that over time the com-panyrsquos sales persons were directed to focusmainly on the ldquodefaultrdquo regular deductible op-tion28 The effect of other covariates will bediscussed later in the context of the full modelIn unreported probit regressions we also test thequalitative assumptions of the structural model byadding three additional regressors the price ratio(pidi) the average deductible offered d i

24 The unconditional is computed by maximum likeli-hood using the data on claims and observed durations of thepolicies

25 Using the CARA specification as in equation (16) weobtain a slightly lower value of 25 104 NIS1

26 We find positive correlation in the errors of these tworegressions suggesting the existence of adverse selection inthe data and motivating a model with unobserved heteroge-neity in risk This test is similar to the bivariate probit testproposed by Chiappori and Salanie (2000) and replicatesearlier results reported in Cohen (2005)

27 In particular traffic fatalities and counts of trafficaccidents in Israel fell by 11 percent and 18 percent during1998 and 1999 respectively

28 Such biased marketing efforts will bias consumersagainst choosing the low deductible thus making them lookless risk averse This would make our estimate a lowerbound on the true level of risk aversion If only sophisti-cated consumers could see beyond the marketing effort andthis sophistication were related to observables (eg educa-tion) the coefficients on such observables would be biasedupward This is not a major concern given that the coefficientson most of the covariates are fairly stable when we estimatethe benchmark model separately for different years

759VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

TABLE 3mdashNO HETEROGENEITY IN RISK

Variable

Poisson regressiona Probit regressionsc

Dep var Number of claims Dep var 1 if low deductible

(1) (2) (3)

Coeff Std Err IRRb Coeff Std Err dPdX dPdXDemographics Constant 43535 02744 mdash 224994 23562 mdash

Age 00074 00052 09926 01690 00464 00034 00042Age2 989 105 536 105 10001 00017 00005 339 105 455 105Female 00456 00161 09555 08003 01316 00165 00134Family Single omitted omitted omitted

Married 01115 00217 08945 06417 01950 00128 00045Divorced 00717 00346 10743 03191 03127 00064 00044Widower 00155 00527 10156 03441 04573 00069 00025Other (NA) 01347 01755 11442 29518 19836 00511 00370

Education Elementary 00522 00550 09491 02927 04278 00061 00016High school omitted omitted omittedTechnical 00373 00308 10380 05037 02520 00105 00138College 00745 00197 09282 05279 01546 00109 00053Other (NA) 00116 00184 10116 00630 01540 00013 00025

Emigrant 00210 00163 10213 00497 01328 00010 00003

Car attributes Log(Value) 01227 00281 11306 13285 02342 00271 00299Car age 00187 00042 10189 01603 00357 00033 00017Commercial car 01394 00326 08699 09038 02781 00177 00294Log(engine size) 02972 00459 13461 06924 03952 00141 00075

Driving License years 00204 00034 09798 01043 00312 00021 00005License years2 00002 677 105 10002 00015 00005 299 105 167 105

Good driver 00176 00191 09825 07207 01574 00148 00152Any driver 00564 00169 10580 10923 01321 00217 00258Secondary car 00859 00209 09177 00038 01626 00001 00070Business use 01852 00293 12034 07951 02737 00156 00017History length 00527 00110 09486 11218 01450 00228 00171Claims history 06577 00390 19304 04654 05460 00095 00496

Young driver Young driver 05235 00361 16879 28847 06305 00524 00012Gender Male omitted omitted omitted

Female 01475 00288 08629 17959 03135 00396 00195Age 17ndash19 omitted omitted omitted

19ndash21 00701 00532 10726 07800 07509 00153 0014721ndash24 00267 00574 09737 05746 07773 00114 0015624 02082 00567 0812 17869 07328 00397 00136

Experience 1 omitted omitted omitted1ndash3 02416 00458 07854 10175 06577 00217 000093 02827 00532 07538 32513 07386 00762 00410

Company year First year omitted omitted omittedSecond year 00888 00198 09150 44513 01506 00792 00859Third year 00690 00222 09334 85888 01820 01303 01367Fourth year 01817 00232 08339 118277 02102 01616 01734Fifth year 05431 0028 05810 143206 02778 01920 02085

105586Obs 105800 94000 105800Pseudo R2 00162 01364 01296Log likelihood 577459 369595 430869

Significant at the 5 percent confidence levela Maximum likelihood estimates Variation in exposure (policy duration) is accounted forb IRR Incidence rate ratio Each figure should be interpreted as the increasedecrease in claim probability as a result of an increase of one unit in the

right-hand-side variablec There are two separate probit regressions reported in this table Column 2 relies on the deductible choice model and the lognormality assumption As discussed

in the text by including an additional regressor ln([pi((xi)di) 1]d i) (with (xi) predicted from column 1 above) and normalizing its coefficient to 1 we obtaina structural interpretation of this regression with ln (ri) as the dependent variable Thus the reported coefficients are comparable to those estimated for the benchmarkmodel One should be cautious however in interpreting these coefficients Unlike the benchmark model this regression does not allow unobserved heterogeneity inrisk and suffers from some selection bias because observations with high predicted risk rate are omitted (which is why the number of observations is 94000 ratherthan the full sample of 105800) For comparison column 3 reports the marginal effects from a comparable probit regression that uses the full sample and does notcontrol for pricing and predicted risk through the additional structural regressor Column 3 does not have a structural interpretation and its (unreported) coefficientscannot be compared to those estimated from the benchmark model

760 THE AMERICAN ECONOMIC REVIEW JUNE 2007

and the risk rate (xi) as predicted from thePoisson regression of column 1 All three addi-tional regressors enter with the predicted signand with large and highly significant marginaleffects29

Finally column 2 of Table 3 presents animportant specification of the probit regres-sion in which ln([pi(( xi)di) 1]d i) isadded as a regressor and its coefficient isnormalized to 130 As already mentioned ifwe assume that there is no unobserved heter-ogeneity in risk then column 2 is analogousto the ln(ri) equation of the benchmark modelThis restriction of the model is rejected by thedata About 10 percent of the individuals arepredicted to have (xi) (pi di) implying achoice of low deductible for any level of riskaversion Many of these individuals howeverstill choose higher deductible levels Column2 reports the results for the remaining indi-viduals (94000 out of 105800) for which theregression can be run While the signs of theestimated coefficients are similar to those inthe benchmark model presented below therestricted version of the model suggests muchhigher levels and dispersion of risk aversionwell above any reasonable level31 The fullestimation of the benchmark model clearly re-jects this restriction on the model

B Estimation of the Benchmark Model

The Effect of Individual Characteristics onRisk AversionmdashTable 4 presents the estimationresults from the benchmark model The secondcolumn shows how the level of absolute riskaversion is related to individual characteristicsAs the dependent variable is in natural loga-

rithm coefficients on dummy variables can bedirectly interpreted as approximate percentagechanges The vast majority of these coefficientsare quite stable across a wide range of specifi-cations which are mentioned later

The results indicate that women are more riskaverse than men and have a coefficient of ab-solute risk aversion about 20 percent greaterthan that of men These results are consistentwith those of Donkers et al (2001) and Hartoget al (2002) The estimated effect of age sug-gests a nonmonotone pattern of risk preferencesover the life cycle The estimated coefficientsimply that initially (that is at age 18 the youngestindividual in the data) individuals become less riskaverse with age but around the age of 48 indi-viduals start becoming more risk averse32 Mar-ried individuals are estimated to be significantlymore risk averse compared to singles while di-vorced individuals are less (although the coeffi-cient is statistically insignificant)

The analysis suggests that variables that arelikely to be correlated with income or wealthsuch as post-high-school education and thevalue of the car have a positive coefficientindicating that wealthier people have higherlevels of absolute risk aversion Although we donot have data on individualsrsquo income or wealthwe use other proxies for income in additionalspecifications and obtain mixed results Whenwe include as a covariate the average householdincome among those who live in the same zipcode we obtain a significant and negative co-efficient of 0333 (0154) (standard deviationin parentheses) When we match zip code in-come on demographics of the individuals thecoefficient is effectively zero 0036 (0047)and when we use a socioeconomic index ofthe locality in which the individual lives thecoefficient is positive 0127 (0053)33 Thus

29 The estimated marginal effect (z-statistic in parenthe-ses) is 0352 (1376) 16 104 (1481) and 0154(255) for (pidi) d i and (xi) respectively

30 The level is a normalization The sign is estimatedHad the sign on this regressor been positive this would haveimplied a rejection of the model

31 For the implied median level of risk aversion the re-stricted model produces a similar estimate to the estimate wereport below for the benchmark model However since someindividuals who chose the low deductible are estimated to havevery low claim rates the restricted model is ldquoforcedrdquo to estimatevery high risk aversion for these individuals (in contrast thebenchmark model can explain such choices by positive risk re-siduals) resulting in very high dispersion and (due to the lognor-mality assumption) very high average risk aversion which isabout 1025 higher than the benchmark estimates we report below

32 This nonmonotone pattern may explain why age enterswith different signs in the estimation results of Donkers et al(2001) and Hartog et al (2002) A somewhat similar U-shapepattern with respect to age is also reported by Sumit Agarwalet al (2006) in the context of consumer credit markets

33 The full results from these specifications are providedin the online Appendix (available at httpwwwe-aerorgdatajune0720050644_apppdf ) The reason we do not usethese income variables in the benchmark specification istheir imperfect coverage which would require us to omitalmost 20 percent of the individuals Other results fromthese regressions are similar to those we report for thebenchmark specification

761VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

overall we interpret our findings as suggestiveof a nonnegative and perhaps positive associ-ation between incomewealth and absolute riskaversion

At first glance these results may appear tobe inconsistent with the widely held beliefthat absolute risk aversion declines withwealth One should distinguish however be-tween two questions (a) whether for a givenindividual the vNM utility function exhibitsdecreasing absolute risk aversion and (b)how risk preferences vary across individuals

Our results do not speak to the first questionand should not be thought of as a test of thedecreasing absolute risk aversion propertyTesting this property would require observingthe same individual making multiple choicesat different wealth levels Rather our resultsindicate that individuals with greater wealthhave utility functions that involve a greaterdegree of risk aversion It might be thatwealth is endogenous and that risk aversion(or unobserved individual characteristics thatare correlated with it) leads individuals to

TABLE 4mdashTHE BENCHMARK MODEL

Variable Ln() equation Ln(r) equation Additional quantities

Demographics Constant 15406 (00073) 118118 (01032) Var-covar matrix ()

Age 00001 (00026) 00623 (00213) 01498 (00097)Age2 624 106 (263 105) 644 104 (211 104) r 31515 (00773)Female 00006 (00086) 02049 (00643) 08391 (00265)Family Single Omitted Omitted

Married 00198 (00115) 01927 (00974) Unconditional statisticsa

Divorced 00396 (00155) 01754 (01495) Mean 02196 (00013)Widower 00135 (00281) 01320 (02288) Median 02174 (00017)Other (NA) 00557 (00968) 04599 (07397) Std Dev 00483 (00019)

Education Elementary 00194 (00333) 01283 (02156) Mean r 00019 (00002)High school Omitted Omitted Median r 727 106 (756 107)Technical 00017 (00189) 02306 (01341) Std Dev r 00197 (00015)College 00277 (00124) 02177 (00840) Corr(r ) 02067 (00085)Other (NA) 00029 (00107) 00128 (00819)

Emigrant 00030 (00090) 00001 (00651) Obs 105800

Car attributes Log(value) 00794 (00177) 07244 (01272)Car age 00053 (00023) 00411 (00176)Commercial car 00719 (00187) 00313 (01239)Log(engine size) 01299 (00235) 03195 (01847)

Driving License years 00015 (00017) 00157 (00137)License years2 183 105 (351 105) 148 104 (254 104)Good driver 00635 (00112) 00317 (00822)Any driver 00360 (00105) 03000 (00722)Secondary car 00415 (00141) 01209 (00875)Business use 00614 (00134) 03790 (01124)History length 00012 (00052) 03092 (00518)Claims history 01295 (00154) 00459 (01670)

Young driver Young driver 00525 (00253) 02499 (02290)Gender Male Omitted mdash

Female 00355 (00061) mdashAge 17ndash19 Omitted mdash

19ndash21 00387 (00121) mdash21ndash24 00445 (00124) mdash24 00114 (00119) mdash

Experience 1 Omitted mdash1ndash3 00059 (00104) mdash3 00762 (00121) mdash

Company year First year Omitted OmittedSecond year 00771 (00122) 14334 (00853)Third year 00857 (00137) 28459 (01191)Fourth year 01515 (00160) 38089 (01343)Fifth year 04062 (00249) 39525 (01368)

Note Standard deviations based on the draws from the posterior distribution in parentheses Significant at the 5 percent confidence levela Unconditional statistics represent implied quantities for the sample population as a whole ie integrating over the distribution of covariates in the sample (as

well as over the unobserved components)

762 THE AMERICAN ECONOMIC REVIEW JUNE 2007

save more to obtain more education or totake other actions that lead to greaterwealth34

Let us make several additional observationsFirst while owners of more expensive cars ap-pear to have both higher risk exposure andhigher levels of risk aversion owners of biggercars have higher risk exposure but lower levelsof risk aversion This should indicate that thestructure of the model does not constrain therelationship between the coefficients in the twoequations Rather it is the data that ldquospeak uprdquoSecond individuals who are classified by theinsurer as ldquogood driversrdquo indeed have lowerrisk but also appear to have lower risk aversionThis result is somewhat similar to the positivecorrelation between unobserved risk and unob-served risk aversion which we report belowThird policyholders who tend to use the car forbusiness are less risk averse This could bebecause uninsured costs of accidents occurringto such policyholders are tax deductible Fourthpolicyholders who reported three full years ofclaims history are more risk averse but are notdifferent in their risk exposure The attitude thatleads such policyholders to comply with therequest to (voluntarily) report three full years ofclaims history is apparently and not surpris-ingly correlated with higher levels of risk aver-sion In contrast while past claims indicate highrisk they have no significant relationship withrisk aversion Finally we find a strong trendtoward lower levels of risk aversion over timeThis is a replication of the probit results dis-cussed earlier

The Effect of Individual Characteristics onClaim RiskmdashThe first column of Table 4 de-scribes the relationship between observablesand risk exposure Accident risk is higher fordivorced individuals and lower for people witha college education Bigger older more expen-sive and noncommercial cars are all morelikely to be involved in an accident Drivingexperience reduces accident rates as do mea-sures of less intense use of the car while youngdrivers are more exposed to risk Claim propen-sity is highly correlated over time the voluntaryreport of past claims is a strong predictor offuture claims This risk equation produces re-sults that are similar to those of the simplerPoisson regression reported in Table 3 Al-though some of the coefficients lose signifi-cance the magnitude of most coefficients isquite similar The similarity between these twosets of results is to be expected as the riskregression is identified primarily from the dataon claims so incorporating the information ondeductible choice does not qualitatively changethe conceptual identification strategy (see Sec-tion IIC) If the results were not similar thiscould have indicated a misspecification of themodel The slight differences between the riskregressions in Table 3 and Table 4 are drivenmainly by the structural assumptions First thebenchmark model estimates a lognormal mix-ture of Poisson rates rather than a single Pois-son model By incorporating the fatter tails ofthe claim distribution it slightly changes theresults increases the standard errors and de-creases the average predicted claim rate Sec-ond the information on deductible choiceslightly helps us in obtaining more precise es-timates through the correlation structure be-tween the error terms in the two equations

The Implied Level of Risk AversionmdashOne ofthe main goals of the estimation is to obtainmeasures of the level of risk aversion in thepopulation we study Since we use Gibbs sam-pling and augment the latent coefficients ofabsolute risk aversion we can directly obtainthe posterior distribution of various moments ofthis distribution At each iteration of the Gibbssampler we compute the mean standard devi-ation (across individuals) and various percen-tiles of the simulated draws of i and ri and thecorrelation between them The far-right columnof Table 4 reports the averages and standard

34 One may be tempted to interpret the positive wealtheffects as an indirect indication of credit constraints wealth-ier individuals are less credit constrained and therefore canafford to purchase more insurance We do not share thisinterpretation for two reasons First the insurance companyobserves these proxies for wealth and conditions on themwhen setting prices Since the willingness to pay for insur-ance is likely to be correlated with the willingness to pay forthe additional insurance provided by the low deductibleoption premiums already reflect this variation We condi-tion on the level of the premium Second paying less exante implies paying more ex post so applying the creditconstraint argument only to the ex ante payment but not tothe probabilistic ex post deductible payments has no theo-retical foundation Essentially the setup of the model linksthe ex ante decisions to the ex post losses which are bothdriven by the curvature of the vNM utility function

763VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

deviations of these computed quantities over theiterations of the Gibbs sampler35 The impliedrisk aversion of the mean individual is 00019which is about seven times greater than theback-of-the-envelope calculation presented inthe previous section As we assume a lognormal

distribution and estimate a relatively high dis-persion coefficient r the estimates also implysignificantly lower values for the median levelof risk aversion As shown later the qualitativepattern of these results is quite robust acrossspecifications

Table 5 summarizes our findings regardingthe level and dispersion of risk aversion Itpresents two ways to interpret the estimates weobtained from the benchmark model as well ascomparisons to specifications with constant ab-solute risk aversion (CARA) utility and with

35 Note that these estimated quantities are unconditionalIn computing these quantities we integrate over the distri-bution of observable characteristics in the data so onecannot compute these estimates from the estimated param-eter directly

TABLE 5mdashRISK-AVERSION ESTIMATES

Specificationa Absolute risk aversionb Interpretationc Relative risk aversiond

Back-of-the-envelope 10 103 9070 1484Benchmark model

Mean individual 67 103 5605 972225th percentile 23 106 9998 003Median individual 26 105 9974 03775th percentile 29 104 9714 42790th percentile 27 103 7834 390295th percentile 99 103 4937 14327

CARA utilityMean individual 31 103 7651 4436Median individual 34 105 9966 050

Learning modelMean individual 42 103 6886 6140Median individual 56 106 9995 008

Comparable estimatesGertner (1993) 31 104 9699 479Metrick (1995) 66 105 9934 102Holt and Laury (2002)e 32 102 2096 86575Sydnor (2006) 20 103 8329 5395

a The table summarizes the results with respect to the level of risk aversion ldquoBack-of-the-enveloperdquo refers to thecalculation we report in the beginning of Section III ldquobenchmark modelrdquo refers to the results from the benchmark model(Table 4) ldquoCARA utilityrdquo refers to a specification of a CARA utility function and ldquolearning modelrdquo refers to a specificationin which individuals do not know their risk types perfectly (see Section IIID) The last four rows are the closest comparableresults available in the literature

b The second column presents the point estimates for the coefficient of absolute risk aversion converted to $US1 unitsFor the comparable estimates this is given by their estimate of a representative CARA utility maximizer For all otherspecifications this is given by computing the unconditional mean and median in the population using the figures we reportin the mean and median columns of risk aversion in Table 6 (multiplied by 352 the average exchange rate during the periodto convert to US dollars)

c To interpret the absolute risk aversion estimates (ARA) we translate them into x u(w) 1frasl2 u(w 100) 1frasl2 u(w x) That is we report x such that an individual with the estimated ARA is indifferent about participating in a 50ndash50lottery of gaining 100 US dollars and losing x US dollars Note that since our estimate is of absolute risk aversion the quantityx is independent of w To be consistent with the specification we use a quadratic utility function for the back-of-the-envelopebenchmark and learning models and use a CARA utility function for the others

d The last column attempts to translate the ARA estimates into relative risk aversion We follow the literature and do soby multiplying the ARA estimate by average annual income We use the average annual income (after tax) in Israel in 1995(51168 NIS from the Israeli Census) for all our specifications and we use average disposable income in the United Statesin 1987 ($15437) for Gertner (1993) and Metrick (1995) For Holt and Laury (2002) and Sydnor (2006) we use a similarfigure for 2002 ($26974)

e Holt and Laury (2002) do not report a comparable estimate The estimate we provide above is based on estimating aCARA utility model for the 18 subjects in their experiment who participated in the ldquo90rdquo treatment which involved stakescomparable to our setting For these subjects we assume a CARA utility function and a lognormal distribution of theircoefficient of absolute risk aversion The table reports the point estimate of the mean from this distribution

764 THE AMERICAN ECONOMIC REVIEW JUNE 2007

incomplete-information (discussed later) and toother comparable figures in the literature Ourbenchmark estimate suggests that an averagequadratic utility maximizer36 will be indifferentabout participating in a 50ndash50 lottery in whichhe gains $100 or loses $5605 A CARA spec-ification suggests a significantly lower value foraverage risk aversion making the mean individ-ual indifferent about a 50ndash50 lottery of gaining$100 or losing $7651 The results from theincomplete information model for the meanlevel of risk aversion are in between these esti-mates All the results suggest that although themean individual exhibits a significant level ofrisk aversion heterogeneity in risk preferencesis important and the median individual is al-most risk neutral with respect to lotteries of$100 magnitude Although the mean is alwaysgreater than the median under the lognormaldistribution the large difference we find is notimposed In principle we could have obtained ahigh level of risk aversion with less heteroge-neity thereby leading to a smaller differencebetween the mean and the median (the esti-mated distribution of risk types is an example)

Let us briefly discuss the relevance of thecomparison to Gertner (1993) and Metrick(1995) There are two ways one can reconcilethe differences between the estimates Firstboth these papers measure risk aversion fortelevision game show participants these arehighly selected groups in a rather ldquorisk-friendlyrdquo environment37 Second the magni-tudes of the stakes are higher The showparticipants bet on several thousand dollars andmore while our average individual risks muchlower stakes in the range of $100 Thus thedifference in the results may be due to the issuesraised in Rabin (2000) regarding the compara-bility of behavior across different contexts andbet sizes We discuss this further in Section IV

A different way to quantify our estimate is byreporting them in relative terms There is noconsensus in the literature as to the relevantwealth that is taken into account in such deci-

sions Therefore for comparability we followthe closest papers in this respect (eg Gertner1993) and use annual income as the relevantwealth We multiply the estimated coefficient ofabsolute risk aversion by the average annualincome in Israel during the observation periodUnder the (questionable) assumption that an-nual income is a good proxy for the relevantwealth at the time of decision making thisproduct would be a proxy for the coefficient ofrelative risk aversion As Table 5 indicates ourbenchmark specification results in an impliedcoefficient of relative risk aversion of about 97A CARA specification results in a lower coef-ficient of 44 On the other hand the medianestimate for relative risk aversion is well belowone Thus the widely used estimate of a lowsingle-digit coefficient of relative risk aversionfalls somewhere between the median and themean and between the median and the seventy-fifth percentile of the risk aversion distribution

The Relationship between Unobserved Riskand Unobserved Risk AversionmdashTable 4 al-lows us to make observations about the relation-ship between risk and risk aversion We firstdiscuss the relative importance of unobservedheterogeneity of both dimensions In the popu-lation we study unobserved heterogeneity inrisk aversion (r) is much greater than unob-served heterogeneity in risk () This is trueboth in absolute terms (315 compared to 015respectively) and after normalizing by the cor-responding mean level38 using the coefficientof variation as a measure of dispersion (027compared to 01 respectively) It is also true forthe overall unconditional dispersion This couldindicate that selection on risk aversion is moreimportant in our data than adverse selection39 Theright metric to use for such statements is not

36 For such an individual the second-order Taylor ex-pansion we use in Section IIA is exact

37 We suspect that individuals who participate in televi-sion game shows are more adventuresome than the generalpopulation Moreover knowing that the audience mightwish to see them keep betting is likely to further encourageparticipants to take risks

38 All covariates are measured in deviations from theirsample mean so the estimated constant in each equation isthe estimated mean of the left-hand-side variable

39 An additional observation is that given our estimatesobservables account for slightly less than 50 percent of thevariation in ln(ri) but for almost 65 percent of the variation inln(i ) This may seem surprising given the finding that disper-sion in risk aversion is more important and thus should be thefocus of insurance companies However this finding is con-sistent with the conventional wisdom that insurance companiesspend much effort and resources on collecting information thathelps in risk classification but only little effort on informationthat predicts willingness to pay

765VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

entirely clear however as one should projectthese estimated variances onto the same scaleof say willingness to pay or profits Profits forexample are affected directly by risk but not byrisk aversion so the comparison above could bemisleading Therefore we relegate the discus-sion of this issue to Section IIIF where weshow that even when we look at pricing andprofits heterogeneity in risk aversion is moreimportant

Table 4 also indicates a strong and signif-icant positive correlation of 084 between un-observed risk aversion and unobserved riskThis result may be surprising because it isnatural to think that risk aversion with respectto financial decisions is likely to be associatedwith a greater tendency to take precautionsand therefore with lower risk Indeed a recentpaper by Finkelstein and McGarry (2006)supports such intuition by documenting anegative correlation between risk aversionand risk in the market for long-term careinsurance (see also Israel 2005 for the autoinsurance market in Illinois) Our marketmight however be special in ways that couldproduce a positive correlation First in con-trast to most insurance markets where apolicyholderrsquos risk depends on the policy-holderrsquos precautions but not on the precau-tions of others accident risk in the autoinsurance market is a result of an interactionbetween onersquos driving habits and those ofother drivers Second the correlation coeffi-cient may be highly sensitive to the particularway we measure risk and risk aversion Thereare many unobserved omitted factors that arelikely to be related to both dimensions Theintensity of vehicle use for example mightbe an important determinant of risk If indi-viduals who are more risk averse also drivemore miles per year a positive correlationbetween risk and risk aversion could emergeThus our results caution against assumingthat risk and risk aversion are always nega-tively correlated Whether this is the case maydepend on the characteristics of the particularmarket one studies and on the particular mea-sure for risk Indeed one can use estimatedannual mileage to control for one omittedvariable that may potentially work to producea positive correlation between risk aversionand risk Despite its partial coverage in thedata and being considered (by the company)

as unreliable40 controlling for annual mileagereported by policyholders reduces the esti-mated correlation coefficient to 06841 Weview this result as consistent with the possi-bility that underlying unobserved factors thataffect risk play an important role in generat-ing the estimated positive correlation betweenrisk and risk aversion A third explanation forthe positive estimated correlation is the dis-tributional assumption As discussed in Sec-tion IIC the correlation coefficient isprobably the parameter that is most sensitiveto these assumptions Indeed as discussedlater when we change our assumptions aboutthe Poisson process and use an alternativedistribution with extremely thin tails the es-timated correlation coefficient reverses signsWe of course view the assumptions of thebenchmark model as more appropriate andtherefore maintain the view that the data sug-gest a positive correlation Finally note thatwhile the correlation parameter we estimate ishigh the implied unconditional correlationbetween risk and risk aversion is less than025 across all reported specifications This isbecause the coefficients on the same covariate(for example the size of the car or whetherthe car is used for business) often affect riskand risk aversion in opposite directions andbecause of the logarithmic transformation

C Stability of the Risk Aversion Coefficients

Estimating risk preferences is motivated bythe idea that the same (or similar) risk aversionparameter may explain risky decisions acrossmultiple contexts This idea is at the heart of thecurrent debate we mention in the introductionregarding the empirical relevance of expectedutility and whether the standard construct of a

40 Insurance companies typically do not use these self-reported mileage estimates as they are considered unreli-able While companies could verify these estimates at thetime of a claim such reports are hard to enforce Anindividual can always claim that her ex ante estimate waslower than it turned out to be Indeed the estimated elas-ticity of risk with respect to estimated mileage is 006(0012) which seems low suggesting a bias downwardpotentially due to ldquoerrors in variablesrdquo bias

41 The full results from this specification are providedin the online Appendix Other results from this specifi-cation are similar to those we report for the benchmarkspecification

766 THE AMERICAN ECONOMIC REVIEW JUNE 2007

single risk aversion parameter that appliesacross many types of decisions is the right wayto think about consumer behavior To the bestof our knowledge there is no direct empiricalevidence on this issue While we defer much ofthis discussion to the concluding section weverify below that at least over a limited set ofchoices in which we observe the same individ-uals the estimated risk preferences help to ex-plain choices over time and across differentcontexts

Stability across ContextsmdashIdeally onewould like to show that the estimated risk pref-erences are stable across multiple lines of insur-ance and perhaps even across other riskydecisions individuals face Absent data on suchchoices we provide evidence that the estimatedrisk aversion coefficients help in predictingother related risky choices Individuals in ourdata had to make three additional coveragechoices in addition to the main policy choicethat we have analyzed so far They had tochoose whether to purchase coverage for caraudio equipment (bought by 498 percent of thesample of new policyholders) for towing ser-vices (bought by 929 percent) and for wind-shield damage (bought by 954 percent) Thesetypes of coverage are sold by the same companybut are administered by a third party so we donot have information about their realizationTherefore we cannot perform a similar exerciseto the one we perform for the deductible choiceOf course one should not expect these addi-tional coverage choices to be perfectly corre-lated with the deductible choice Even if thesame risk preferences are an important factorfor all of these decisions variation in risk acrosscoverage is also important For example own-ership of an expensive car audio system is likelyto affect the purchase of audio coverage inde-pendent of risk preferences Similarly ownersof old cars may value more coverage for towingservices Risk preferences however should en-ter into all of these coverage choices and there-fore we expect these additional types ofcoverages to be positively correlated with thechoice of a low deductible

We coded these three additional coveragechoices as three dummy variables and use themas additional covariates to verify that they canhelp explain unobserved risk aversion Firstwe add these variables to the probit regression

reported in column 3 of Table 3 They all havea positive statistically and economicallysignificant power in predicting the deductiblechoice42 To verify that these reported correla-tions are not driven by correlation in risks wealso estimate the benchmark model with theseadditional variables as covariates The esti-mated coefficients on the three additional cov-erage choices are positive and two of the threeare significant in both the risk and risk aversionequations they are 0053 (0019) 00002 (0021)and 0057 (0023) in the risk equation and0354 (0064) 0123 (0122) and 0813 (0174)in the risk aversion equation for audio towingand windshield coverage respectively (standarddeviations in parentheses)43 This suggests thatthe estimated risk preferences help in explainingmultiple coverage choices across these relatedcontexts

Stability over TimemdashWe now provide evi-dence that the estimated risk preferences arealso stable over time All the individuals whodecide to renew their policy with the companyafter it expires are at least in principle free tochange their deductible choice However anoverwhelming majority of individuals (morethan 97 percent) do not change their deductiblechoices when they renew Even individuals whoinitially chose the rarely chosen ldquohighrdquo andldquovery highrdquo deductible levels typically do notchange their choices Therefore it is also notsurprising that estimating the model on the firstdeductible choice of individuals and the seconddeductible choice of the same individuals yieldssimilar results as shown at the end of the nextsection Of course changes in deductiblechoices do not necessarily imply changes inrisk preferences Risk may also change overtime and may drive such changes At thesame time while persistence in deductiblechoices over time is consistent with stablerisk preferences it may also be driven bymany other factors For example it is reason-able to think that individuals do not devote

42 They are estimated to increase on average the prob-ability of choosing a low deductible by 0019 (819) 0023(527) and 0041 (758) for audio towing and windshieldcoverage respectively (z stat in parentheses)

43 The full results from this specification are provided in theonline Appendix Other results from this specification aresimilar to those we report for the benchmark specification

767VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

the same amount of thought to their renewaldecision as to their initial choice leading tothe observed persistence This caveat is themain reason why we focus most of the analysisonly on new policies

D Robustness

This section discusses the robustness of themain results Results from all specifications wemention in this section are summarized in Table6 The full results from these specifications arerelegated to the online Appendix

The von NeumannndashMorgenstern Utility Func-tionmdashTo derive the benchmark model we as-sume a negligible third derivative of the vNMutility function This is attractive as it allowsus to summarize risk preferences by a one-dimensional parameter r [u13(w)u(w)]There is however a large literature (eg MilesS Kimball and N Gregory Mankiw 1989) em-phasizing the importance of a (positive) third

derivative of the utility function which leads toprecautionary saving It is therefore importantto address the sensitivity of the results to thisassumption

By allowing a third-order Taylor approxima-tion of equation (4) we obtain

(15) r u13w

uw

p

d 1

d

uw

uw

dh2 dh dl dl

2

6d

which reduces to equation (7) when u(w) 0This illustrates that a positive third derivativeprovides an additional (precautionary) incentiveto insure It also shows that in order to fullydescribe preferences in the presence of a thirdderivative a one-dimensional parameter doesnot suffice one needs to know both r [u13(w)u(w)] and [u(w)u(w)] This makesit less attractive for estimation

TABLE 6mdashROBUSTNESS

Specificationa Sample Obs

Claim risk () Absolute risk aversion (r)

Corr(r ) Mean Median Std dev Mean Median Std dev

Baseline estimatesBenchmark model All new customers 105800 0220 0217 0048 19 103 73 106 0020 0207 0839

The vNM utility functionCARA utility All new customers 105800 0219 0217 0048 87 104 b 98 106 b 0009b 0201 0826

The claim generating processBenchmark model No multiple claims 103260 0182 0180 0040 20 103 28 105 0018 0135 0547Thinner-tail risk distribution All new customers 105800 0205c 0171c 0155c 17 103 19 106 0020 0076 0916

The distribution of risk aversionLower-bound procedure All new customers 105800 mdash mdash mdash 37 104 0 0002 mdash mdash

Incomplete information about riskBenchmark model Experienced drivers 82966 0214 0211 0051 21 103 83 106 0021 0200 0761Benchmark model Inexperienced drivers 22834 0230 0220 0073 30 103 12 107 0032 0186 0572Learning model All new customers 105800 0204 0191 0084 12 103 16 106 0016 0200 0772

Sample SelectionBenchmark model First two years 45739 0244 0235 0066 31 103 26 105 0026 0225 0699Benchmark model Last three years 60061 0203 0201 0043 16 103 34 107 0021 0113 0611Benchmark model Referred by a friend 26434 0213 0205 0065 30 103 84 107 0031 0155 0480Benchmark model Referred by advertising 79366 0219 0216 0051 21 103 76 106 0022 0212 0806Benchmark model Non-stayers 48387 0226 0240 0057 23 103 77 107 0026 0149 0848Benchmark model Stayers 1st choice 57413 0190 0182 0057 29 103 29 105 0024 0152 0463Benchmark model Stayers 2nd choice 57413 0208 0200 0065 30 103 16 105 0026 0211 0637

Note The table presents the key figures from various specifications and subsamples tracing the order they are presented in Section IIID Full results (in the format

of Table 4) from all these specifications are available in the online Appendixa ldquoBenchmark modelrdquo refers to the benchmark specification estimated on various subsamples (the first row replicates the estimates from Table 4) The other

specifications are slightly different and are all described in more detail in the corresponding parts of Section IIIDb The interpretation of r in the CARA model takes a slightly different quantitative meaning when applied to noninfinitesimal lotteries (such as the approximately

$100 stakes we analyze) This is due to the positive third derivative of the CARA utility function compared to the benchmark model in which we assume a small

third derivative Thus these numbers are not fully comparable to the corresponding figures in the other specificationsc The interpretation of in the thinner-tail distribution we estimate is slightly different from the standard Poisson rate which is assumed in the other specifications

Thus these numbers are not fully comparable to the corresponding figures in the other specifications

768 THE AMERICAN ECONOMIC REVIEW JUNE 2007

We can allow for a positive third derivativewithout expanding the dimensionality of pref-erences by imposing a parametric functionalform on the utility function Essentially suchparametric form imposes a relationship between[u13(w)u(w)] and [u(w)u(w)] Two standardforms are those that exhibit CARA and thosethat exhibit constant relative risk aversion(CRRA) CRRA requires us to make additionalassumptions about the relevant (and unob-served) wealth level of each individual makingit less attractive The CARA case is imple-mented below

With CARA utility we substitute u(w) exp(rw) in equation (4) and rearrange toobtain

(16) rp

exprdh exprdl

This equation defines the indifference set Unlikethe benchmark model there is no closed-formrepresentation for ri() a property that makesestimation significantly slower Due to the precau-tionary saving incentive arising from the thirdderivative we can use a lower level of absoluterisk aversion to rationalize the low deductiblechoice given In other words the CARA indif-ference set is flatter in comparison to the bench-mark case depicted in Figure 2 Thus the CARAspecification will generally lead to lower estimatesof the coefficient of absolute risk aversion asshown in Table 6 The mean and dispersion ofestimated absolute risk aversion (but not the me-dian) are smaller by about a factor of two Thegeneral qualitative results however remain thesame As shown in Table 5 the mean individual isquite risk averse with respect to lotteries of $100magnitude the median is close to risk neutral withrespect to such lotteries and heterogeneity in riskaversion is important

The Claim-Generating ProcessmdashIn thebenchmark model we assume that claims aregenerated by a Poisson model This assumptionis both analytically attractive (it is important inderiving equation (4)) and in our view a rea-sonable approximation of the data-generatingprocess for claims The Poisson distribution hasbeen widely used in economics to model acci-dent counts (eg Nancy L Rose 1990 Dionneand Vanasse 1992 Ronald Michener and CarlaTighe 1992) In using this distribution re-

searchers have followed the long history of theuse of Poisson by actuaries and insurance spe-cialists going back to Ladislaus Bortkiewicz(1898)

An important restriction of the Poisson dis-tribution is that its mean and variance are thesame Although some economic studies con-firmed this assumption for particular accidentdata (eg Christopher M Auld et al 2001) it isoften the case that this restriction is falsifiedThe most common deviation from the Poissonrestriction is that of fat tails ie variance that ishigher than the mean This led researchers touse a negative binomial distribution to accom-modate this regularity by introducing a secondparameter which delinks the relationship be-tween the mean and the variance One naturalinterpretation of the fat tails is that of unob-served heterogeneity and the negative binomialdistribution can be viewed as a Gamma mixtureof Poisson processes Consistent with this viewwe assume that the claim-generating processfollows a Poisson process at the individuallevel but allows unobserved heterogeneity inrisk and estimates a lognormal mixture of Pois-son processes which is similar to a negativebinomial The dispersion parameter we esti-mate is a free parameter which is identi-fied by the fatness of the tails of the claimdistribution

While one would like to allow fat tails of theaggregate claim distribution one may criticizethe assumption of a Poisson distribution at theindividual level for having tails that may be ldquotoofatrdquo Recently Jaap H Abbring et al (2003) andMark Israel (2004) provided evidence for neg-ative state dependence in data from auto insur-ance claims similar to the ones we useControlling for heterogeneity across individu-als these papers show that a second claim isless likely than a first This may happen due toexperience rating or perhaps to more carefuldriving and less intense use of a car after anaccident The Poisson distribution assumes thatthe second accident is just as likely so negativestate dependence may suggest thinner tails atthe individual level

To verify that the main results are not sensi-tive to this restriction of the Poisson model weperform two tests First we estimate the bench-mark model on a sample that includes onlyindividuals with one or no claims The model isstill identified using variation in policy durations

769VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

The estimates of the level of risk aversion andits dispersion across individuals remain similarto those of the benchmark model (Table 6) Theestimates of risk and the correlation with riskaversion are significantly lower But this isexpected we selected out of the sample thehigh-risk individuals

As a further test for the sensitivity of theresults to possible negative state dependencewe estimated the model with a different distri-bution at the individual level which gives riseto much thinner tails We could not find anyknown distribution of count variables that givesrise to tails thinner than Poissonrsquos so we madeone up In particular for a given Poisson ratelet pn be the probability of observing n occur-rences The distribution we take to the data issuch that

(17) Prn pn2

yenm0 pm2

Thus this distribution makes the probability ofmultiple claims much lower probably muchmore so than any realistic negative state depen-dence in the data Essentially such a distribu-tion makes the econometrician view individualswith multiple claims as high-risk individualswith much more certainty as it requires muchmore ldquobad luckrdquo for low-risk individuals tohave multiple claims The estimates for thelevel of risk aversion and its dispersion arealmost the same as in the benchmark model(Table 6) The rest of the summary figures aredifferent The difference in the level of risk isnot informative the interpretation of isslightly different given the change in the un-derlying distribution The dispersion in ismuch higher This is a direct result of the thin-tail assumption The dispersion of is identifiedby the tails of the claim distribution at theaggregate level With thinner tails imposed onthe individual distribution more heterogeneityis needed to match the observed tails at theaggregate Finally the correlation coefficient inthis specification changes signs and is now neg-ative and close to 1 This is consistent withour discussion in Section IIC which emphasizesthat the identification of the correlation coeffi-cient is closely tied to the structural assump-tions Once these are changed the estimatedcorrelation coefficient would change too Most

importantly however the mean and dispersionof risk aversion are stable

The Distribution of Absolute Risk Aver-sionmdashIn the benchmark model we assume thatthe coefficient of absolute risk aversion is lognor-mally distributed across individuals Since onlyfew previous studies focused on heterogeneity inrisk preferences there is not much existing evi-dence regarding the distribution of risk prefer-ences The only evidence we are aware of is theexperimental results presented by Steffen Andersenet al (2005) which show a skewed distributionwith a fat right tail which is qualitatively consis-tent with the lognormal distribution we assume

An additional advantage of the normality as-sumption is computational It provides a closed-form conditional distribution allowing us to useonly univariate (rather than bivariate) draws inthe estimation procedure significantly reducingthe computational burden One may be con-cerned about the sensitivity of the results to thisdistributional assumption For example it maydrive the result that the median level of riskaversion is much lower than the mean

Incorporating alternative distributional as-sumptions significantly complicates and slowsthe estimation procedure44 As an alternativewe develop a procedure that we believe pro-vides some guidance as to the sensitivity of theresults to this distributional assumption Thedisadvantage of the procedure is that it cannotaccount for adverse selection Since we foundthat adverse selection is not that important thisexercise is informative The exercise conveysthat the main qualitative results are not drivenby the (fat) tails of the lognormal distributionwe impose Rather they are driven by the highfraction of individuals who chose a low deduct-ible despite being of low risk The model im-plies that such individuals must have a fairlyhigh level of risk aversion

The exercise uses a Gibbs sampler to estimatethe lognormal distribution of i given the co-variates and the observed number of claims Ineach iteration of this Gibbs sampler conditional

44 A simple discrete-type distribution of risk aversion isnot well identified with the given data This is due to thenature of the exercise The distribution is identified bythe binary deductible choice and by the ldquolocalrdquo variation inthe pricing menu This variation is not enough to pin downthe discrete types accurately

770 THE AMERICAN ECONOMIC REVIEW JUNE 2007

on the data and the most recent draws for theindividual irsquos we compute lower bounds forthe level and dispersion of risk aversion (inaddition to generating the draws for ri from theconditional lognormal distribution) To deter-mine the lower bound for the level of risk aver-sion we compute

(18) r i i 0 if choicei 0

max0 pi

idi 113d i

if choicei 1

Namely we make ri as low as possible giventhe assumptions of the model We assume thatindividuals who chose a regular deductible arerisk neutral while individuals who chose a lowdeductible are just indifferent between the twodeductible levels (unless i is high enough inwhich case they are also assumed to be riskneutral) We then compute the average over therirsquos To compute the lower bound for the dis-persion we search for values of rirsquos that areconsistent with the assumptions of the modeland the observed data and that minimize thevariance of ri This turns out to be a simplesearch procedure that is linear in the number ofindividuals

The lower bound of the level of risk aversionthat we obtain from this exercise is 368 104

NIS1 (with standard deviation of 638 106)This level is five times lower than the analogouspoint estimate reported in Table 4 This trans-lates into indifference about participating in a50ndash50 lottery of gaining $100 and losing $885Thus it still suggests a significant level of riskaversion for lotteries of $100 magnitude Simi-larly the conversion to relative risk aversion asin Table 5 implies a relative risk aversion co-efficient of 1883 The result for the lowerbound of the dispersion is 189 103 (withstandard deviation of 816 105) which is tentimes lower than the benchmark estimate Thusthe coefficient of variation for absolute riskaversion declines only by two suggesting ahigh degree of heterogeneity Finally oneshould note that these bounds involve extremeassumptions and are computed separately andtherefore cannot be binding at the same timeThus the ldquocorrectrdquo estimates for both quantities

are likely to be higher closer to the results wereport for the benchmark model

Incomplete Information about RiskmdashThroughout we assume that individuals haveperfect information about their individual-specific objective risk type i This is a strongerassumption than we need Because expectedutility is linear in probabilities it suffices thatindividualsrsquo expected risk rate is equal to theirobjective risk rate ie i E(iIi) i wherei is a random variable representing individualirsquos perceived risk rate and Ii is individual irsquosinformation at the time of the coverage choiceNamely individuals could be uncertain abouttheir risk type but their point estimate has to becorrect

There are several channels through whichincomplete information may operate Let usconsider two such cases First suppose thatindividuals are correct but only on average iethat i i i where E(i) 0 The intuitionfor this case is similar to an ldquoerrors in variablesrdquomodel and in principle will result in an evenless important role for adverse selection Giventhat we find a relatively small effect of adverseselection this bias will not change this conclu-sion This may be even more pronounced ifcorr(i i) 0 which reflects a reasonableassumption of ldquoreversion to the meanrdquo ie thatan individualrsquos estimate of his risk type is someweighted average between his true risk type andthe average risk type of individuals who aresimilar (on observables) to him The conclusionmay go in the other way only if the mistakes goin the other direction according to which indi-viduals who are riskier than average believe thatthey are even more risky than they truly areThis we believe is less plausible

As a test for the sensitivity of the results tothe complete information assumption we per-form two exercises The first exercise estimatesthe benchmark model separately for experi-enced drivers and inexperienced drivers wherewe define an individual to be an experienceddriver if he has ten years or more driving expe-rience This is a conservative definition com-pared to those used in the literature (Chiapporiand Salanie 2000 Cohen 2005) The underlyingassumption consistent with evidence providedin Cohen (2005) is that drivers learn about theirown risk types as they gain more driving expe-rience but that most of this learning occurs

771VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

within the first ten years of driving Thus ourcomplete information model is a more appeal-ing description of experienced drivers Theresults for experienced drivers are almost iden-tical to the results from the benchmark specifi-cation (Table 6) suggesting that the mainresults are not driven by the fraction of individ-uals who are less likely to know their risk typesThe results for inexperienced drivers show asimilar pattern and magnitude Consistent withthe learning hypothesis however they show alarger dispersion in risk preferences which maybe due to incomplete information and there-fore to more random choices of deductiblelevels

The second exercise we perform is a morestructural version of the learning story whichallows even experienced drivers to have incom-plete information regarding their risk types Wedo so by estimating a different specification ofthe model that assumes that individuals areBayesian and update their information abouttheir own (stable) risk types over time usingonly information about the number of claimsthey make each year While we do not observecomplete claims histories of individuals wecan simulate such histories and integrate overthese simulations Thus individualsrsquo informa-tion would be related to their true type andwould be more precise with longer driving his-tories The full modeling and estimation detailsof this specification are provided in the Appen-dix We view this model as an extreme versionof incomplete information as there are manyother sources through which individuals maylearn about their own types and thereby havebetter information about their types than whatwe estimate them to have While the resultsimply that the levels of risk aversion and heter-ogeneity are lower than the benchmark esti-mates (Table 6) the order of magnitude andqualitative pattern are quite similar suggestingthat the main qualitative findings are robust tothe information structure

Sample SelectionmdashThere are two differentways to think about the contract selection pro-cess One possibility is that individuals firstselect an insurer based on advertisement wordof mouth or access to an agent Then individ-uals select the insurance contract from amongseveral contracts the selected insurer offersThis selection process is consistent with the way

the company executives view their business45

Another possibility is that individuals first col-lect information about all available insurancecontracts and then choose their most preferredone According to the industryrsquos conventionalwisdom individuals do not shop much acrossinsurers and therefore this selection processseems less important

The unique characteristics of direct insurersmay attract individuals who are more likely toexperiment with new ways of doing businessand may therefore be less risk averse than thegeneral population In Table 7 we compare thedemographics of our sample of policyholderswith those of the general Israeli populationThis comparison reflects a similar intuitioncompared with the general population our av-erage policyholder is slightly younger moreeducated more likely to be male and less likelyto be married or an immigrant This direction ofselection may also apply to unobserved riskpreferences thereby making our policyholderson average less risk averse than a representa-tive individual This suggests that the level ofrisk aversion that we find may be viewed as alower bound on the level of risk aversion in thegeneral Israeli population

One potential way to model sample selectionis to allow for an additional outside option to beselected For the vast majority of individuals weobserve the outside option is to purchase sim-ilar insurance from competing insurance agen-cies Unfortunately data on the structure ofcompeting contracts their prices and the waythey vary with individual characteristics are un-available This makes us uncomfortable to try tomodel sample selection as results from anysuch model will be driven by our assumptionsrather than by meaningful variation in the dataThe results are still meaningful for two reasonsFirst this is a large population accounting forabout 7 percent of all drivers in Israel Secondto the extent that our estimates suggest higherlevels of risk aversion than previously esti-mated and that the sample selection is likely to

45 As discussed in Section I the company was the firstdirect insurance provider and it offered significantly lowerpremiums than those offered by competitors due to signif-icant cost advantage In Section I we also discuss theliterature which emphasizes that choice of direct insurers isdriven primarily by nonmonetary ldquoamenitiesrdquo

772 THE AMERICAN ECONOMIC REVIEW JUNE 2007

bias these estimates downward the results arestill informative

To assess the magnitude of sample selectionin driving the main results we perform severaltests First we estimate the benchmark modelseparately for the first two years of the compa-nyrsquos operation during which the company wasthe only company to sell insurance directly andfor the remaining three years after additionaldirect insurers entered the market Second weestimate the benchmark model separately forindividuals who were referred to the companyby word of mouth and for those who heardabout the company through advertisement (pri-marily on television) The latter may havesearched more and might be more price sensi-tive Third we estimate the benchmark modelseparately for individuals who renewed theirpolicy with the company and for those who didnot renew It seems likely that individuals whodid not renew are less selected as most of themswitch back to regular insurers who insure themajority of the population Fourth we estimatethe model for the second deductible choicemade by those individuals who renew It couldbe argued that switching to other companies is

costly so outside options are not as important indriving deductible choices for those who renewAll these results are summarized in Table 6 andshow some important differences between thetwo groups within each pair of subsamples prob-ably reflecting selection For all groups howeverthe qualitative pattern of the results and the orderof magnitude and dispersion of risk aversion aresimilar to those of the full sample This is sugges-tive that correcting for sample selection is unlikelyto change the qualitative results

E Caveats

Moral HazardmdashThroughout our analysiswe abstract from moral hazard ie we assumethat i can vary across individuals but is invari-ant to the coverage choice There are two typesof moral hazard that may play a role in thiscontext First individuals with less coveragemay take greater precaution and drive morecarefully thereby reducing their claim risk rateSecond conditional on a claim event peoplewith higher deductibles are less likely to file aclaim there exists a range of claims for which

TABLE 7mdashREPRESENTATIVENESS

Variable Sampleb Populationc Car ownersd

Agea 4114 (1237) 4255 (1801) 4511 (1413)Female 0316 0518 0367Family Single 0143 0233 0067

Married 0780 0651 0838Divorced 0057 0043 0043Widower 0020 0074 0052

Education Elementary 0029 0329 0266High school 0433 0384 0334Technical 0100 0131 0165College 0438 0155 0234

Emigrant 0335 0445 0447

Obs 105800 723615 255435

a For the age variable the only continuous variable in the table we provide both the mean and the standard deviation (inparentheses)

b The figures are derived from Table 1 The family and education variables are renormalized so they add up to one weignore those individuals for whom we do not have family status or education level This is particularly relevant for theeducation variable which is not available for about half of the sample it seems likely that unreported education levels arenot random but associated with lower levels of education This may help in explaining at least some of the gap in educationlevels across the columns

c This column is based on a random sample of the Israeli population as of 1995 We use only adult population ieindividuals who are 18 years of age or older

d This column is based on a subsample of the population sample The data provide information about car ownership onlyat the household level not at the individual level Thus we define an individual as a car owner if (a) the household owns atleast one car and the individual is the head of the household or (b) the household owns at least two cars and the individualis the spouse of the head of the household

773VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

filing is profitable only under a low deductible(the literature often refers to this second effectas ldquoex post moral hazardrdquo)

To the extent that moral hazard exists abstract-ing from it will likely bias our estimates of riskaversion downward To see this note that adjust-ing behavior will help individuals to self-insureagainst uninsured costs Similarly ex post moralhazard would reduce the value of a low deduct-ible as in the event of a claim the gain from alower deductible would sometimes be less thanthe difference between the two deductible levelsBoth these effects will make a low deductible lessattractive requiring individuals to be even morerisk averse than we estimate in order to purchasemore coverage Below we discuss why in ourview abstracting from moral hazard is a reason-able approximation in this setting

It seems reasonable to conjecture that ceterisparibus insured individuals will drive less care-fully than uninsured ones It may also seemreasonable that the existence of a deductiblemay make individuals more careful about smalldamages to their car However when all choicesinclude a deductible and deductibles are similarin their magnitude it seems less likely thatdriving or care behavior will be affected46 Fi-nally to separately identify moral hazard onewould need another dimension of the data overwhich risk types remain fixed but coveragechoices vary exogenously (Israel 2004 Jaap HAbbring et al forthcoming)

We rely on the data to justify why we abstractfrom the second potential effect that of ex postmoral hazard Figure 3 presents data on theclaim amounts and shows that about 99 percentof the claims filed by policyholders with low-deductible policies were for amounts greaterthan the higher deductible level If the distribu-tion of amounts of potential claims does notvary with the deductible choice and if individ-uals file a claim for any loss that exceeds their

deductible level this suggests that 99 percent ofthe claims would have been filed under eitherdeductible choice making the assumption toabstract from moral hazard not very restrictive

Individuals however may choose not to filea claim even when the claim amount exceedsthe deductible level This may happen due toexperience rating which increases future insur-ance premiums These dynamic effects do notdepend on the deductible level at the time of theclaim so they simply enter in an additive wayUsing our data on individuals who renew theirpolicies with the company we can assess howbig the dynamic effects are These data showthat the price effect of a claim lasts for threeyears and is highest when an individual files hissecond claim within a year In such a case hewould face about a 20 percent increase in hisinsurance premium in the subsequent year 10percent in the year after and 5 percent in thethird year after the claim The regular premiumis about twice the regular deductible amount soan upper bound for the dynamic costs is 70percent of the regular deductible In most casesthe actual dynamic costs are much lower thanthis upper bound (the dynamic costs of say thefirst claim within a year are close to zero) Inaddition an individual can opt out of the con-tract and switch to a different insurance pro-vider This is likely to reduce his dynamic costsbecause in Israel unlike in the United Statesand many other countries there is no publicrecord of past claims Therefore insurance pro-viders can take full advantage of past recordsonly for their past customers For this reasonnew customers will of course face higher pre-miums than existing ones but the premium in-crease would not be as high as it would havebeen with the old insurance provider This isdue to the presence of ldquoinnocentrdquo new custom-ers who are pooled together with the switchers(Cohen 2003)47 Using 70 percent as a conser-vative upper bound Figure 3 shows that about93 percent of those claims filed by individualswith a low deductible would have also been46 This assumption is consistent with Cohen and Einav

(2003) who find no evidence for behavioral response tochanges in seat belt laws The following anecdotal obser-vation also supports this In an informal survey we con-ducted among our colleagues all of them were aware of adeductible in their auto insurance policy but fewer than 20percent knew its level This does not imply that 80 percentof our colleagues did not pay attention to their deductiblechoice at the time the choice was made It does implyhowever that their driving or care behavior cannot dependon the deductible level

47 New customers may voluntarily report their claimshistory to their new insurance provider Voluntary disclo-sure of past claims is as may be expected not truthful Ourdata suggest an unconditional claim rate of 02453 in oursample population Our data on claims history as voluntar-ily disclosed by the same individuals suggest a claim rate of00604 which is four times lower

774 THE AMERICAN ECONOMIC REVIEW JUNE 2007

filed with a regular deductible While this is notnegligible it applies for only a small fraction ofindividuals For the vast majority of them the99 percent discussed above is a more appropri-ate benchmark Therefore ex post moral hazardis unlikely to play a major role in this settingand one can abstract from the loss distributionand focus on claim rates as we do in thispaper48

Additional Cost of an AccidentmdashOur modelassumes that in the event of an accident theonly incurred costs are those associated with thedeductible payment In practice however other

transaction costs may be incurred such as thetime spent for appraisal of the damage the costsassociated with renting a replacement car for theduration of a repair etc Such costs could bereadily incorporated into the model To illus-trate we assume that these costs are known inadvance and are given by a constant c (whichcould in principle vary with each individual)Since c will not vary with the chosen level ofdeductible it will not affect the value d andwill enter the empirical model only through itseffect on d In particular equation (7) willchange to

(19) r

p

d 1

d c

and everything else will remain the same

48 We would not be as comfortable with this statementfor the choice of high and very high deductibles which areat much higher levels This is one additional reason to focusonly on the choice between low and regular deductibles

FIGURE 3 CLAIM DISTRIBUTIONS

Notes This figure plots kernel densities of the claim amounts estimated separately depending on the deductible choice Forease of comparison we normalize the claim amounts by the level of the regular deductible (ie the normalization is invariantto the deductible choice) and truncate the distribution at ten (the truncated part which includes a fat tail outside of the figureaccounts for about 25 percent of the distribution and is roughly similar for both deductible choices) The thick line presentsthe distribution of the claim amounts for individuals who chose a low deductible while the thin line does the same for thosewho chose a regular deductible Clearly both distributions are truncated from below at the deductible level The figure showsthat the distributions are fairly similar Assuming that the claim amount distribution is the same the area below the thickerline between 06 and 1 is the fraction of claims that would fall between the two deductible levels and therefore (absentdynamic incentives) would be filed only if a low deductible were chosen This area (between the two dotted vertical lines)amounts to 13 percent implying that the potential bias arising from restricting attention to claim rate (and abstracting fromthe claim distribution) is quite limited As we discuss in the text dynamic incentives due to experience rating may increasethe costs of filing a claim shifting the region in which the deductible choice matters to the right an upper bound to these costsis about 70 percent of the regular deductible covering an area (between the two dashed vertical lines) that integrates to morethan 7 percent Note however that these dynamic incentives are a very conservative upper bound they apply to less than 15percent of the individuals and do not account for the exit option which significantly reduces these dynamic costs

775VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

This implies that in principle such costs willhave no effect on the results of the counterfac-tual exercise we present later The costs willhowever affect the interpretation of the esti-mates of risk aversion In particular instead ofthe distribution of r we will now be estimatingthe distribution of r(d c)d so the reportedestimates of the coefficient of absolute riskaversion will be biased upward The magnitudeof the bias depends on the size of this transac-tion cost c compared to the average deductibled If c is relatively small the bias is negligibleIf however c is as big as the (average) deduct-ible level all our reported estimates of the levelof risk aversion should be divided by two (butthe coefficients on observables which are semi-elasticities will not change) The intuitionwould be similar but more involved if c variesacross individuals but not proportionally to d

Data about such transaction costs are ofcourse not available The following back-of-the-envelope exercise may provide some guidanceas to the magnitude of such costs We collecteddata from the Israeli police about the annual num-bers of accidents49 accidents with fatalities andaccidents with severe injuries in Israel for theyears 1996ndash1999 We then divided these numbersby our estimate of the total number of auto insur-ance claims in Israel (161859)50 We obtain that159 percent of the claims involve reported acci-dents 22 percent involve accidents with severeinjuries and 03 percent involve fatal accidentsThus the majority of claims reflect small unre-ported accidents perhaps suggesting that theseadditional costs of a claim are often not too large

Deviations from Expected Utility TheorymdashThroughout the paper we restrict attention toexpected utility maximizers Despite much ev-idence in the literature against some of the pre-dictions of expected utility theory it still seemsto us the most natural benchmark to specify andone that facilitates comparison to previous stud-ies We note that expected utility theory is as-sumed it is not and cannot be tested within our

framework Given our cross-sectional analysiswhich in principle allows flexible forms ofunobserved heterogeneity in risk preferencesthere are no testable restrictions imposed byexpected utility theory We should also note thatmuch (but not all) of the documented evidenceagainst expected utility theory arises with ex-treme risk probabilities which are close to zeroor one Our data (and our estimates) are basedon risk probabilities that are all in the range of010 to 035 Over this range expected utilityseems to perform better Finally it is importantto stress two points First at the conceptuallevel it is straightforward to use an alternativetheory of decisions under uncertainty If con-ditional on objective risk individuals vary in asingle dimension the same conceptual modeland empirical strategy can be applied All oneneeds to do is to specify the parameter overwhich decisions vary and construct an indiffer-ence set in the space of the specified parameterand (objective) risk types similar to the onepresented in Figure 2 Second any alternativemodel of decisions under uncertainty would re-quire us to take an even stronger view regardingthe parameterized objective function For exam-ple prospect theory (Daniel Kahneman andAmos Tversky 1979) would require us to pa-rameterize not only the curvature of individu-alsrsquo utility functions but also their referencepoints for which there is no natural choice inour context Similar issues arise if we try toapply decision weights (Tversky and PeterWakker 1995) or measures of overconfidencewith respect to driving ability

F Implications for Profits and Pricing

We now look at how firm profits vary withalternative pricing schemes This exercise isinteresting for several reasons First althoughwe do not use supply-side information for esti-mation it shows how one may incorporate suchinformation in estimation Second it illustratesthe conceptual trade-off faced by a monopolistthat operates in a market with adverse selectionAlthough the conceptual trade-off betweenhigher demand and worse selection is wellknown and has been extensively analyzed in thetheoretical literature quantifying it is importantto understand its empirical relevance Finallywe argued earlier that unobserved heterogeneityin risk aversion seems more important than un-

49 An accident is counted in this measure if it was re-ported to the police and a police officer arrived at the scene

50 This is estimated by taking the number of activepolicies at the end of our sample (45871) dividing it by ourbest guess for the share of the market the company had atthe time (7 percent) and multiplying it by the estimatedclaim rate in our data as computed in Table 2 (0245)

776 THE AMERICAN ECONOMIC REVIEW JUNE 2007

observed heterogeneity in risk The current ex-ercise shows that this conclusion also translatesto pricing and profits

Throughout this section we hold fixed thedistribution of risk and risk aversion the firmfaces Since we have little information about thedeterminants of overall demand faced by thefirm and hence how restrictive it is to hold thedistribution fixed we make the simplifying as-sumption that individuals make their choicessequentially They first choose the insuranceprovider by observing only the price of theregular deductible Once they decide to buy apolicy from the insurer they choose the deduct-ible level This seems a reasonable approxima-tion as the regular deductible is the one alwaysadvertised and initially quoted while the otheroptions are revealed only once the potentialcustomer and the insurance sales person ldquogetinto detailsrdquo Consistent with this assumptionwe assume that the regular premium and de-ductible are dictated by competitive conditionsand we focus on the choice of the low deduct-ible and its associated premium

From the companyrsquos standpoint each indi-vidual can be represented by a random draw of(i ri) from the conditional (on observables)distribution of risk and risk aversion

(20) ln i

ln ri Nxi

xi

2 r

r r2

When analyzing the optimal menu to offer suchan individual the company is assumed to berisk neutral and to maximize expected profitsSuppose the company offered only the regulardeductible and premium (dh ph) Let the ex-pected profits from this strategy be 0 Givenour assumptions we proceed by analyzing howthe firmrsquos profits are affected by a choice of anadditional low deductible option (dl pl) withdl dh and pl ph It is easy to use a changein variables and analyze the choice of d dh dl and p pl ph Expected profits arenow given by

(21) maxdp

0 Prri rii d pp

d Eiri rii d p

The trade-off in the companyrsquos decision isstraightforward Each new customer who chooses

the low combination pays an additional p up-front but saves d for each accident she is in-volved in This translates into two effects thatenter the companyrsquos decision problem The first issimilar to a standard pricing problem higher(lower) price difference (deductible difference)p (d) leads to a higher markup (on those indi-viduals who select the low deductible) but tolower quantity (or probability of purchase) asfewer individuals elect to choose the low deduct-ible This effect enters the profit function throughD(d p) Pr(ri ri(i d p)) The secondcomposition effect arises because of adverse se-lection As the price of the low deductible in-creases those individuals who still elect to choosethe low combination are ceteris paribus thosewith higher risk This effect enters through (dp) E(iri ri(i d p)) Its magnitude andsign depend on the relative heterogeneity of i andri and on the correlation between them Sinceneither D(d p) nor (d p) has a closed-form solution we analyze this decision problemgraphically where D(d p) and (d p) arenumerically computed using simulations from thejoint distribution

We illustrate our analysis by using the meanindividual in the data whose expected ln(i)and ln(ri) are 154 and 1181 respectively(based on Table 4) Such an individual has tochoose between a regular contract of ( phdh) (3189 1595) (in NIS) and a lowcontract of ( pl dl) (3381 957) ie (pd) (191 638) Below we discuss severaladditional counterfactual cases First we con-sider a case with a negative correlation be-tween risk and risk aversion (with the samemagnitude ie 084) Second we con-sider cases when the company ignores unob-served heterogeneity in one of the dimensionsie it views individuals as a draw from theestimated marginal distribution on one dimen-sion with the other dimension known and fixedat its estimated mean We do this exercise foreach dimension separately

To get intuition for the different effects Figure2 presents the estimated distribution in the spaceof (i ri) A small increase (decrease) in p (d)shifts the indifference setup to the right therebymaking some marginal individuals who were pre-viously just to the right of it switch to choosingthe regular deductible The demand trade-off isjust the comparison between the marginal loss ofthe company from all the marginal individuals

777VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

who no longer buy higher coverage with thehigher profits made from the inframarginal indi-viduals who still elect to choose higher coverageFigure 2 also helps in illustrating the effect ofadverse selection and the importance of the cor-relation coefficient As the menu shifts to the rightthe positive correlation implies that the marginalindividuals have higher risk than the average Thismeans that ldquolosingrdquo them (namely havingthem buy less coverage) is not as costly forthe insurance company as such individualsare on average more adversely selected Anegative correlation for example would havemade these marginal individuals more valuablethereby decreasing the incentive to increase pricesor deductibles from the current levels

Figure 4 presents the effects of pricing onprofits by varying the low-deductible levelkeeping the premium charged for it fixed at theobserved price (of p 191) It implies that the

current low deductible benefit of 638 NIS re-sults in additional annual profits of about 368NIS per customer This is about 037 percent oftotal operating profits per customer which isabout 1000 NIS Note however that after sub-tracting the administrative and claim-handlingcosts associated with each customer and claimthe relative magnitude of this effect will behigher Note also that the estimates imply thatthe current low-deductible level is suboptimalBy setting a smaller low-deductible benefit ofd 355 NIS (ie increasing the current lowdeductible by 283 NIS) additional profits canbe increased to 659 NIS51 Of course the find-

51 There is no reason of course to limit the choice of thecompany to only one additional deductible level Moredegrees of freedom in choosing the pricing menu will leadto higher profits

FIGURE 4 COUNTERFACTUALSmdashPROFITS

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) We plot the additional profits fromoffering a low deductible as a function of the attractiveness of a low deductible d dh dl fixing its price at the observedlevel The thick solid line presents the counterfactual profits as implied by the estimated benchmark model The other threecurves illustrate how the profits change in response to changes in the assumptions when the correlation between riskand risk aversion is negative (thin solid line) when there is no heterogeneity in risk aversion (dot-dashed line) and whenthere is no heterogeneity in risk (dashed line) The maxima (argmax) of the four curves respectively are 659 (355)714 (583) 0 and 704 (500) The dotted vertical line represents the observed level of d (638) which implies that theadditional profits from the observed low deductible are 368 NIS per policy

778 THE AMERICAN ECONOMIC REVIEW JUNE 2007

ing that the current pricing is suboptimal mayalso be due to a limitation of the model

Consistent with the intuition discussed aboveFigure 4 also shows that the incentive to in-crease prices (or lower the deductible) is higherwith positive correlation When the correlationbetween risk and risk aversion is negative theoptimal low deductible is lower It also showsthat ignoring either dimension of unobservedheterogeneity has an important effect on pric-ing decisions However while ignoring het-erogeneity in risk does not change thequalitative pattern by much ignoring hetero-geneity in risk aversion completely changesthe pricing analysis In fact given the esti-mated coefficients and the observed prices ifindividuals vary only in risk offering a lowdeductible does not increase profits due to ad-verse selection

Figure 5 breaks down the effect on profits bypresenting the pricing effect on the demand forlow deductible D(d p) and on the compo-sition effect (d p) The former is simplygenerated by the distribution of certainty equiv-alents implied by the joint distribution of i andri (Landsberger and Meilijson 1999) It is S-shaped due to the assumption of lognormal dis-tribution The shape of the composition effect isdriven by the relative variance of i and ri andby the correlation coefficient As the estimatesimply that most of the variation in certaintyequivalents is driven by variation in ri thestrong positive correlation implies that the com-position effect is monotonically decreasing inthe deductible level As the low-deductible op-tion becomes more favorable more peoplechoose it with the most risky individuals choos-ing it first The effect of the deductible level onthe composition effect is dramatically differentwhen the correlation between risk and risk aver-sion is negative When this is the case theobserved relationship between the deductiblelevel and the composition effect is mostly re-versed This is because the effect of risk aver-sion dominates that of adverse selection due toits higher variance

IV Concluding Remarks

The paper makes two separate contributionsFirst from a methodological standpoint we layout a conceptual framework through which one

can formulate and estimate a demand system forindividually customized contracts The key datarequirements for this approach are contractchoices individual choice sets and ex post riskrealizations Since such data may be available inmany other contexts the methodological frame-work may be useful to uncover structural pa-rameters in such settings As an example onecould consider annuity data and use guaranteeperiod choices and mortality data to identifyheterogeneity in risk (mortality) and in prefer-ences for wealth after death (Einav Finkelsteinand Paul Schrimpf 2006) Similarly one couldconsider loan data and use down-paymentchoices and default data to identify heterogene-ity in risk (default) and in liquidity (Will Ad-ams Einav and Jonathan D Levin 2006)Second from an economic standpoint we pro-vide a new set of estimates for the degree andheterogeneity of (absolute) risk aversion and itsrelationship with risk We discuss these below

While our estimates of risk aversion help topredict other related insurance decisions it isnatural to ask to what extent these parametersare relevant in other contexts This is essentiallyan empirical question which can be answeredonly by estimating risk aversion parameters fora variety of bet sizes and in a variety of con-texts Since isolating risk preferences in manycontexts is hard such exercises are rare leavingus with no definite answer for the scope ofmarkets for which our estimates may be rele-vant On one hand Rabin (2000) and Rabin andThaler (2001) argue that different decisions inlife are taken in different contexts and thereforemay be subject to different parameters in theutility function On the other hand classicaltheory suggests that each individual has onevalue function over her lifetime wealth so allrisky decisions take into account the same valuefunction and are therefore subject to the samerisk preferences Our view is somewhere inbetween We are more comfortable with extrap-olation of our risk aversion estimates to setupsthat are closer to the auto insurance marketcontext in which these parameters are esti-mated To assess ldquoclosenessrdquo it is important toconsider various factors over which contextsmay differ Such factors may include bet size aswell as informational and behavioral effectssuch as default options framing and rarity of theevents Since we use claim data to identify risktype we leave everything else to be interpreted as

779VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

FIGURE 5 COUNTERFACTUALSmdashSELECTION

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) Parallel to Figure 4 we breakdown the effects on profits to the share of consumers who choose a low deductible (bottom panel) and to the expected riskof this group (top panel) This is presented by the thick solid line for the estimates of the benchmark model As in Figure 4 wealso present these effects for three additional cases when the correlation between risk and risk aversion is negative (thin solidline) when there is no heterogeneity in risk aversion (dot-dashed line) and when there is no heterogeneity in risk (dashedline) The dotted vertical lines represent the observed level of d (638) for which the share of low deductible is 16 percentand their expected risk is 0264 This may be compared with the corresponding figures in Table 2A of 178 and 0309respectively Note however that the figure presents estimated quantities for the average individual in the data while Table2A presents the average quantities in the data so one should not expect the numbers to fit perfectly

780 THE AMERICAN ECONOMIC REVIEW JUNE 2007

risk aversion As an example overconfidence willbe captured by a lower level of estimated riskaversion so if overconfidence is more importantin auto insurance than in health insurance whenextrapolated to health insurance individuals maybehave as if they are more risk averse than weestimate them to be

With this caveat in mind let us discuss ourfour main findings First we find large hetero-geneity in risk preferences across individualsThis heterogeneity is important in various con-texts (a) it cautions against using pure cross-sectional variation to test expected utilitytheory (b) it may create strong selection ofparticipants into particular markets this maymake participants in voluntary markets quitedifferent in their risk preferences from those inmarkets in which participation is either manda-tory or driven by other factors and (c) it maymake the marginal individual quite differentfrom the average one Models in macroeconom-ics and finance which often use a representativeindividual framework may not be able to cap-ture and account for such differences

Our second set of findings concerns the wayrisk aversion relates to observable characteris-tics Our finding that women are more riskaverse than men has been documented in othersettings The finding that risk preferences ex-hibit a U-shaped pattern over the life cycle maybe interesting to explore further in other con-texts Other findings suggest that the estimatedcoefficient of risk aversion increases with ob-servables that are related to income and wealthAs we agree with the widely held belief of thedecreasing absolute risk aversion property ourpreferred interpretation for this finding is thatwealth and income may be endogenous gener-ating the estimated cross-sectional relationshipWhile lower risk aversion may be associatedwith higher propensity to become an entrepre-neur and thereby have higher wealth it mayalso be associated with lower propensity to saveor invest in education affecting wealth the otherway Therefore one important message of thisfinding is that accounting for heterogeneity inpreferences may be important as representativeconsumer models may provide misleading in-terpretations for otherwise natural results

The last two sets of findings concern therelationship between risk and preferences Sincerisk is particular to the context in which it ismeasured these findings may be sensitive to the

market context and may change once risk takesother forms Even within the auto insurancemarket it is important how risk is measuredMoreover risk in the auto insurance marketmay be conceptually different from risk in othermarkets In many markets risk is independentacross individuals In auto insurance howevermuch of the risk depends on coordinationamong drivers and therefore may be more re-lated to relative not absolute characteristicsFor this reason our finding of positive correla-tion between risk and risk aversion can coexistwith findings of negative correlation in othercontexts (Israel 2005 Finkelstein and McGarry2006) The positive correlation we find is alsoconsistent with the fact that the bivariate probittest in our data provides evidence for adverseselection (Cohen 2005) while similar reduced-form tests in other contexts do not

Finally we find that unobserved heterogene-ity in risk preferences is more important thanheterogeneity in risk This may be driven by thecasual evidence that insurance companies exertmuch effort and resources in collecting con-sumer data which are informative about riskclassification but not about preferences52 Weillustrate the empirical importance of our find-ings for the analysis of optimal contracts in autoinsurance The presence of more than one di-mension of unobserved heterogeneity may dra-matically change the nature of these contractsTheory is still not fully developed for suchmultidimensional screening problems as it typ-ically requires a small number of types (Lands-berger and Meilijson 1999) restricts the twodimensions to be independent of each other(Jean-Charles Rochet and Lars A Stole 2002)or assumes that the number of instrumentsavailable to the monopolist is not smaller thanthe dimension of unobserved heterogeneity(Steven Matthews and John Moore 1987 Rich-ard Arnott and Joseph E Stiglitz 1988)53 MarkArmstrong (1999) may be the closest theoreticalwork to the framework suggested here It cannot

52 This choice of data collection efforts may be justifiedif it is easier for such firms to price discriminate based onrisk but harder to price discriminate based on preferencesA common belief is that without cost-based (ie risk-based) justification for prices price discrimination may leadto consumer backlash

53 See also Smart (2000) Villeneuve (2003) and Jullienet al (2007) for related theoretical results

781VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

be directly applied however as it uses simpli-fying linearity assumptions which would behard to impose in the current context Our re-sults indicate that many applications can benefitfrom extending the theory to include the more

general case such as the one analyzed hereSuch a theory may also serve as a guide forusing supply-side restrictions in similar con-texts Our counterfactual analysis is a very pre-liminary start in this direction

APPENDIX A DESCRIPTION OF THE GIBBS SAMPLER

In this appendix we describe the setup of the Gibbs sampler that we use to estimate the modelOne of the main advantages of the Gibbs sampler is its ability to allow for data augmentation oflatent variables (Tanner and Wong 1987) In our context this amounts to augmenting the individual-specific risk aversion and risk type namely i rii1

n as additional parametersWe can write the model as follows

(22) ln i xi i

(23) ln ri xi vi

(24) choicei 1 if ri ri ()0 if ri ri ()

(25) claimsi Poissoni ti

(26) i

vi

iid

N00

2 r

r r2

Let

2 r

r r2 X

x1

xn

0

0 x1

xn

y r and ui i

vi

The set of parameters for which we want to have a posterior distribution is given by uii1

n The prior specifies that are independent of uii1n have a conventional

diffuse prior We adopt a hierarchical prior for uii1N

(27) uii 1n

iid

N0

(28) 1 Wishart2a Q

782 THE AMERICAN ECONOMIC REVIEW JUNE 2007

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 9: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

there is no moral hazard Finally we assumethat in the event of an accident the value of theclaim is greater than di

h We revisit all theseassumptions in Sections IIID and IIIE For therest of this section i subscripts are suppressedfor convenience

In the market we study insurance policies are

typically held for a full year after which theycan be automatically renewed with no commit-ment by either the company or the individualMoreover all auto-insurance policies sold inIsrael can be canceled without prior notice bythe policyholder with premium payments beinglinearly prorated Both the premium and the

TABLE 2AmdashSUMMARY STATISTICSmdashMENUS CHOICES AND OUTCOMES

Variable Obs Mean Std dev Min Max

Menu Deductible (current NIS)a Low 105800 87548 12101 37492 103911Regular 105800 145299 19779 62486 171543High 105800 260802 35291 112475 308778Very high 105800 376305 50853 162464 446013

Premium (current NIS)a Low 105800 338057 91404 132471 1923962Regular 105800 318922 8623 124972 1815058High 105800 279057 75451 109351 1588176Very high 105800 255137 68984 99978 1452046

pd 105800 0328 006 03 18

Realization Choice Low 105800 0178 038 0 1Regular 105800 0811 039 0 1High 105800 0006 008 0 1Very high 105800 0005 007 0 1

Policy termination Active 105800 0150 036 0 1Canceled 105800 0143 035 0 1Expired 105800 0707 046 0 1

Policy duration (years) 105800 0848 028 0005 108Claims All 105800 0208 048 0 5

Low 18799 0280 055 0 5Regular 85840 0194 046 0 5High 654 0109 034 0 3Very high 507 0107 032 0 2

Claims per yearb All 105800 0245 066 0 19882Low 18799 0309 066 0 9264Regular 85840 0232 066 0 19882High 654 0128 062 0 12636Very high 507 0133 050 0 3326

a The average exchange rate throughout the sample period was approximately $1 per 35 NIS starting at 13 in late 1994and reaching 14 in late 1999

b The mean and standard deviation of the claims per year are weighted by the observed policy duration to adjust forvariation in the exposure period These are the maximum likelihood estimates of a simple Poisson model with no covariates

TABLE 2BmdashSUMMARY STATISTICSmdashCONTRACT CHOICES AND REALIZATIONS

Claims Low Regular High Very high Total Share

0 11929 (0193) 49281 (0796) 412 (0007) 299 (0005) 61921 (100) 080341 3124 (0239) 9867 (0755) 47 (0004) 35 (0003) 13073 (100) 016962 565 (0308) 1261 (0688) 4 (0002) 2 (0001) 1832 (100) 002383 71 (0314) 154 (0681) 1 (0004) 0 (0000) 226 (100) 000294 6 (0353) 11 (0647) 0 (0000) 0 (0000) 17 (100) 000025 1 (0500) 1 (0500) 0 (0000) 0 (0000) 2 (100) 000003

Notes The table presents tabulation of choices and number of claims For comparability the figures are computed only forindividuals whose policies lasted at least 09 years (about 73 percent of the data) The bottom rows of Table 2A providedescriptive figures for the full dataset The numbers in parentheses in each cell represent percentages within each row Theright-hand-side column presents the marginal distribution of the number of claims

753VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

probability of a claim are proportional to thelength of the time interval taken into account soit is convenient to think of the contract choice asa commitment for only a short amount of timeThis approach has several advantages First ithelps to account for early cancellations andtruncated policies which together constitute 30percent of the policies in the data18 Second itmakes the deductible choice independent ofother longer-term uncertainties faced by the in-dividual so we can focus on static risk-takingbehavior Third this formulation helps to obtaina simple framework for analysis which is attrac-tive both analytically and computationally19

The expected utility that the individual obtainsfrom the choice of a contract (p d) is given by

(2) v p d

1 tuw pt tuw pt d

We characterize the set of parameters that willmake the individual indifferent between the twooffered contracts This set provides a lower (up-per) bound on the level of risk aversion for indi-viduals who choose the low (high) deductible (fora given ) Thus we analyze the equation v(phdh) v(pl dl) By taking limits with respect to t(and applying LrsquoHopitalrsquos rule) we obtain

(3)

limt30

1

t(u(w pht) u(w plt))

(u(w pht) u(w pht dh)) (u(w plt) u(w plt dl))

pl phuw

uw dl uw dh

or

(4)

pl phuw uw dl uw dh

The last expression has a simple intuition Theright-hand side is the expected gain (in utils) perunit of time from choosing a low deductibleThe left-hand side is the cost of such a choiceper unit of time For the individual to be indif-ferent the expected gains must equal the costs

In our benchmark specification we assumethat the third derivative of the vNM utilityfunction is not too large A Taylor expansion forboth terms on the right-hand side of equation(4) ie u(w d) u(w) du(w) (d22)u13(w) implies that

(5)pl ph

uw dh dluw

1

2dh dldh dlu13w

Let d dh dl 0 p pl ph 0 andd 1frasl2 (dh dl) to obtain

(6)p

duw uw d u13w

or

(7) r u13w

uw

p

d 1

d

where r is the coefficient of absolute risk aver-sion at wealth level w

18 As can be seen in Table 2A 70 percent of the policiesare observed through their full duration (one year) About15 percent are truncated by the end of our observationperiod and the remaining 15 percent are canceled for var-ious reasons such as change in car ownership total-lossaccident or a unilateral decision of the policyholder tochange insurance providers

19 This specification ignores the option value associatedwith not canceling a policy This is not very restrictiveSince experience rating is small and menus do not changeby much this option value is likely to be close to zero Asimple alternative is to assume that individuals behave as ifthey commit for a full year of coverage In such a case themodel will be similar to the one we estimate but willdepend on the functional form of the vNM utility functionand would generally require taking infinite sums (over thepotential realizations for the number of claims within the year)In the special case of quadratic expected utility maximizerswho care only about the mean and variance of the number ofclaims this is easy to solve The result is almost identical to theexpression we subsequently derive in equation (7)

754 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Equation (7) defines an indifference set in thespace of risk and risk aversion which we willrefer to by (r() ) and ((r) r) interchange-ably Both r() and (r) have a closed-formrepresentation a property that will be computa-tionally attractive for estimation20 Both termsare individual specific as they depend on thedeductible menu which varies across individu-als For the rest of the paper we regard eachindividual as associated with a two-dimensionaltype (ri i) An individual with a risk parameter

i who is offered a menu (pih di

h) (pil di

l)will choose the low-deductible contract if andonly if his coefficient of absolute risk aversionsatisfies ri ri(i) Figure 2 presents a graphi-cal illustration

B The Benchmark Econometric Model

The Econometric ModelmdashThe econometricmodel we estimate is fully described by the fiveequations presented in this section Our ob-jective is to estimate the joint distribution of(i ri)mdashthe claim rate and coefficient of abso-lute risk aversionmdashin the population of policy-holders conditional on observables xi The

20 For example estimating the CARA version of themodel (Section IIID) for which r() does not have aclosed-form representation takes almost ten times longer

FIGURE 2 THE INDIVIDUALrsquoS DECISIONmdashA GRAPHICAL ILLUSTRATION

Notes This graph illustrates the individualrsquos decision problem The solid line presents the indifference setmdashequation (7)mdashappliedfor the menu faced by the average individual in the sample Individuals are represented by points in the two-dimensional spaceabove In particular the scattered points are 10000 draws from the joint distribution of risk and risk aversion for the averageindividual (on observables) in the data based on the point estimates of the benchmark model (Table 4) If an individual is eitherto the right of the line (high risk) or above the line (high risk aversion) the low deductible is optimal Adverse selection is capturedby the fact that the line is downward sloping higher-risk individuals require lower levels of risk aversion to choose the lowdeductible Thus in the absence of correlation between risk and risk aversion higher-risk individuals are more likely to choosehigher levels of insurance An individual with i (pidi) will choose a lower deductible even if he is risk neutral ie withprobability one (we do not allow individuals to be risk loving) This does not create an estimation problem because i is notobserved only a posterior distribution for it Any such distribution will have a positive weight on values of i that are below(pidi) Second the indifference set is a function of the menu and in particular of (pidi) and d An increase in (pidi) willshift the set up and to the right and an increase in d will shift the set down and to the left Therefore exogenous shiftsof the menus that make both arguments change in the same direction can make the sets ldquocrossrdquo thereby allowing us toidentify the correlation between risk and risk aversion nonparametrically With positive correlation (shown in the figureby the ldquoright-bendingrdquo shape of the simulated draws) the marginal individuals are relatively high risk therefore creatinga stronger incentive for the insurer to raise the price of the low deductible

755VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

benchmark formulation assumes that (i ri) fol-lows a bivariate lognormal distribution Thuswe can write the model as

(8) ln i xi i

(9) ln ri xi vi

with

(10) i

vi

iid

N00

2 r

r r2

Neither i nor ri is directly observed Thereforewe treat both as latent variables Loosely speak-ing they can be thought of as random effects Weobserve two variables the number of claims andthe deductible choice which are related to thesetwo unobserved components Thus to completeour econometric model we have to specify therelationship between the observed variables andthe latent ones This is done by making two struc-tural assumptions First we assume that the num-ber of claims is drawn from a Poisson distributionnamely

(11) claimsi Poisson(i ti )

where ti is the observed duration of the policySecond we assume that when individuals maketheir deductible choices they follow the theoreti-cal model described in the previous section Themodel implies that individual i chooses the lowdeductible (choicei 1) if and only if ri ri(i)where ri is defined in equation (7) Thus theempirical model for deductible choice is given by

(12) Prchoicei 1 Prri

pi

idi 1

d i

Prexp(xi vi)

pi

exp(xi i)di 1

d i

With no unobserved heterogeneity in risk (i 0) equation (12) reduces to a simple probit In

such a case one can perfectly predict risk from thedata denote it by (xi) and construct an addi-tional covariate ln([pi((xi)di) 1]d i) Giventhe assumption that risk aversion is distributedlognormally running the probit regressionabove and renormalizing the coefficient onthe constructed covariate to 1 (instead ofthe typical normalization of the variance of theerror term to 1) has a structural interpretation withln (ri) as the dependent variable However Cohen(2005) provides evidence of adverse selection inthe data implying the existence of unobservedheterogeneity in risk This makes the simple probitregression misspecified Estimation of the fullmodel is more complicated Once we allow forunobserved heterogeneity in both unobserved riskaversion (vi) and claim rate (i) we have to inte-grate over the two-dimensional region depicted inFigure 2 for estimation

EstimationmdashA natural way to proceed is toestimate the model by maximum likelihoodwhere the likelihood of the data as a function ofthe parameters can be written by integrating outthe latent variables namely

(13) Lclaimsi choicei

Prclaimsi choiceii riPri ri

where is a vector of parameters to be esti-mated While formulating the empirical modelusing likelihood may help our thinking aboutthe data-generating process using maximumlikelihood (or generalized method of moments(GMM)) for estimation is computationally cum-bersome This is because in each iteration itrequires evaluating a separate complex integralfor each individual in the data In contrastMarkov Chain Monte Carlo (MCMC) Gibbssampling is quite attractive in such a case Us-ing data augmentation of latent variables (Mar-tin A Tanner and Wing Hung Wong 1987)according to which we simulate (i ri) and latertreat those simulations as if they are part ofthe data one can avoid evaluating the complexintegrals by sampling from truncated normaldistributions which is significantly less compu-tationally demanding (eg Luc Devroye 1986)This feature combined with the idea of aldquosliced samplerrdquo (Paul Damien John Wake-field and Stephen Walker 1999) to sample from

756 THE AMERICAN ECONOMIC REVIEW JUNE 2007

an unfamiliar posterior distribution makes theuse of a Gibbs sampler quite efficient for ourpurposes Finally the lognormality assumptionimplies that F(ln(i)ri) and F(ln(ri)i) follow a(conditional) normal distribution allowing us torestrict attention to univariate draws furtherreducing the computational burden

The Appendix provides a full description ofthe Gibbs sampler including the conditionaldistributions and the (flat) prior distributions weuse The basic intuition is that conditional onobserving (i ri) for each individual we have asimple linear regression model with two equa-tions The less standard part is to generate drawsfor (i ri) We do this iteratively Conditionalon i the posterior distribution for ln(ri) fol-lows a truncated normal distribution where thetruncation point depends on the menu individ-ual i faces and its direction (from above orbelow) depends on individual irsquos deductiblechoice The final step is to sample from theposterior distribution of ln(i) conditional on riThis is more complicated as we have bothtruncation which arises from adverse selection(just as we do when sampling for ri) and thenumber of claims which provides additionalinformation about the posterior of i Thus theposterior for i takes an unfamiliar form forwhich we use a ldquosliced samplerrdquo

We use 100000 iterations of the Gibbs sam-pler It seems to converge to the stationarydistribution after about 5000 iterations There-fore we drop the first 10000 draws and use thelast 90000 draws of each variable to report ourresults The results are based on the posteriormean and posterior standard deviation fromthese 90000 draws Note that each iterationinvolves generating separate draws of (i ri) foreach individual Performing 100000 iterationsof the benchmark specification (coded in Mat-lab) takes about 60 hours on a Dell Precision530 workstation

C Identification

The parametric version of the model is iden-tified mechanically There are more equationsthan unknowns and no linear dependenciesamong them so (as also verified using MonteCarlo exercises) the model parameters can bebacked out from simulated data Our goal in thissection is not to provide a formal identificationproof Rather we want to provide intuition for

which features of the data allow us to identifyparticular parameters of the model The discus-sion also highlights the assumptions that areessential for identification vis-a-vis those thatare made only for computational convenience(making them in principle testable)

Discussion of Nonparametric Identifica-tionmdashThe main difficulty in identifying themodel arises from the gap between the (exante) risk type i which individuals usewhen choosing a deductible and the (ex post)realization of the number of claims we ob-serve We identify between the variation inrisk types and the variation in risk realizationsusing our parametric distributional assump-tions The key is that the distribution of risktypes can be uniquely backed out from theclaim data alone This allows us to use thedeductible choice as an additional layer ofinformation which identifies unobserved het-erogeneity in risk aversion21 Any distribu-tional assumption that allows us to uniquelyback out the distribution of risk types fromclaim data would be sufficient to identify thedistribution of risk aversion As is customaryin the analysis of count processes we make aparametric assumption that claims are gener-ated by a lognormal mixture of Poisson dis-tributions (Section IIID discusses this furtherand explores an alternative) Using a mixtureenables us to account for adverse selectionthrough unobserved heterogeneity in risk Italso allows us to better fit the tails of theclaim distribution In principle a more flexi-ble mixture or a more flexible claim-generat-ing process could be identified as long as theclaims data are sufficiently rich22

21 Cardon and Hendel (2001) face a similar identificationproblem in the context of health insurance They use vari-ation in coverage choice (analogous to deductible choice) toidentify the variation in health-status signals (analogous torisk types) from the variation in health expenditure (analo-gous to number of claims) They can rely on the coveragechoice to identify this because they make an assumptionregarding unobserved heterogeneity in preferences (iidlogit) We take a different approach as our main goal is toestimate rather than assume the distribution of preferences

22 Although it may seem that the claim data are limited(as they take only integer values between 0 and 5 in ourdata) variation in policy duration generates continuousvariation in the observed claim propensity Of course thisvariation also introduces an additional selection into themodel due to policy cancellations which are potentially

757VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Once the distribution of risk types is iden-tified from claims data the marginal distribu-tion of risk aversion (and its relationship tothe distribution of risk types) is nonparametri-cally identified from the variation in theoffered menus discussed in Section I Thisvariation implies different deductible and pre-mium options to identical (on observables)individuals who purchased insurance at dif-ferent times Different menus lead to differentindifference sets (similar to the one depictedin Figure 2) These sets often cross each otherand nonparametrically identify the distribu-tion of risk aversion and the correlation struc-ture at least within the region in which theindifference sets vary For the tails of thedistribution as is typically the case we haveto rely on parametric assumptions or usebounds The parametric assumption of log-normality we use for most of the paper ismade only for computational convenience

Intuition for the Parametric IdentificationMechanismmdashVariation in the offered menus isimportant for the nonparametric identificationThe parametric assumptions could identify themodel without such variation Thus to keep theintuition simple let us take the bivariate log-normal distribution as given and contrary to thedata assume that all individuals are identical onobservables and face the same menu Supposealso that all individuals are observed for exactlyone year and have up to two claims23 In thissimplified case the model has five parametersto be estimated the mean and variance of risk and

2 the mean and variance of risk aver-sion r and r

2 and the correlation parameter The data can be summarized by five numbersLet 130 131 and 132 1 131 130 be thefraction of individuals with zero one and twoclaims respectively Let 0 1 and 2 be theproportion of individuals who chose a low de-ductible within each ldquoclaim grouprdquo Given ourdistributional assumption about the claim-

generating process we can use 130 and 131 touniquely identify and

2 Loosely isidentified by the average claim rate in the dataand

2 is identified by how fat the tail of theclaim distribution is ie by how large (132131)is compared to (131130) Given and

2 and thelognormality assumption we can (implicitly)construct a posterior distribution of risk typesfor each claim group F(r claims c) andintegrate over it when predicting the deductiblechoice This provides us with three additionalmoments each of the form

(14) Ec Prchoice 1r

dFr claims c dFr

for c 0 1 2 These moments identify thethree remaining parameters of the model rr

2 and Let us now provide more economic content to

the identification argument Using the same ex-ample and conditional on identifying and

2

from the claim data one can think about thedeductible choice data 0 1 2 as a graph(c) The absolute level of the graph identifiesr In the absence of correlation between riskand risk aversion the slope of the graph iden-tifies r

2 with no correlation the slope shouldalways be positive (due to adverse selection)but higher r

2 would imply a flatter graph be-cause more variation in the deductible choiceswill be attributed to variation in risk aversionFinally is identified by the curvature of thegraph The more convex (concave) the graph isthe more positive (negative) is the estimated For example if 0 05 1 051 and 2 099 it is likely that r

2 is high (explaining why0 and 1 are so close) and is highly positive(explaining why 2 is not also close to 1) Incontrast if 0 1 it must mean that thecorrelation between risk and risk aversion isnegative which is the only way the originalpositive correlation induced by adverse selec-tion can be offset This intuition also clarifiesthat the identification of relies on observingindividuals with multiple claims (or differentpolicy durations) and that it is likely to besensitive to the distributional assumptions Thedata (Table 2B) provide a direct (positive) cor-relation between deductible choice and claims

endogenous The results are similar when we use onlyexpired and truncated policies

23 This variation is sufficient (and necessary) to identifythe benchmark model The data provide more variation weobserve up to five claims per individual we observe con-tinuous variation in the policy durations we observe vari-ation in prices and we exploit distributional restrictionsacross individuals with different observable characteristics

758 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The structural assumptions allow us to explainhow much of this correlation can be attributedto adverse selection The remaining correlation(positive or negative) is therefore attributed tocorrelation in the underlying distribution of riskand risk aversion

III Results

A Reduced-Form Estimation

To get an initial sense for the levels of abso-lute risk aversion implied by the data we use asimple back-of-the-envelope exercise We com-pute unconditional averages of p d dh dland d (Table 2A)24 and substitute these valuesin equation (7) The implied coefficient of ab-solute risk aversion is 29 104 NIS125 Thisfigure could be thought of as the average indif-ference point implying that about 18 percent ofthe policyholders have coefficients of absoluterisk aversion exceeding it To convert to USdollar amounts one needs to multiply thesefigures by the average exchange rate (352)resulting in an average indifference point of102 103 $US1 This figure is less than halfof a similar estimate reported by Sydnor (2006)for buyers of homeownerrsquos insurance but about3 and 13 times higher than comparable fig-ures reported by Gertner (1993) and Metrick(1995) respectively for television game showparticipants

Table 3 provides reduced-form analysis ofthe relationship between the observables andour two left-hand-side variables the number ofclaims and the deductible choice26 Column 1reports the estimates from a Poisson regressionof the number of claims on observed character-istics This regression is closely related to therisk equation we estimate in the benchmarkmodel It shows that older people women andpeople with a college education are less likely to

have an accident Bigger more expensiveolder and noncommercial cars are more likelyto be involved in an accident Driving experi-ence and variables associated with less intenseuse of the car reduce accident rates As could beimagined claim propensity is highly correlatedover time past claims are a strong predictor offuture claims Young drivers are 50 to 70 per-cent more likely to be involved in an accidentwith young men significantly more likely thanyoung women Finally as indicated by the trendin the estimated year dummies the accident ratesignificantly declined over time Part of thisdecline is likely due to the decline in accidentrates in Israel in general27 This decline mightalso be partly due to the better selection ofindividuals the company obtained over time asit gained more experience (Cohen 2003)

Columns 2 and 3 of Table 3 present estimatesfrom probit regressions in which the dependentvariable is equal to one if the policyholder chosea low deductible and is equal to zero otherwiseColumn 3 shows the marginal effects of thecovariates on the propensity to choose a lowdeductible These marginal effects do not have astructural interpretation as the choice of lowdeductible depends on its price on risk aver-sion and on risk In this regression we againobserve a strong trend over time Fewer policy-holders chose the low deductible as time wentby One reason for this trend according to thecompany executives is that over time the com-panyrsquos sales persons were directed to focusmainly on the ldquodefaultrdquo regular deductible op-tion28 The effect of other covariates will bediscussed later in the context of the full modelIn unreported probit regressions we also test thequalitative assumptions of the structural model byadding three additional regressors the price ratio(pidi) the average deductible offered d i

24 The unconditional is computed by maximum likeli-hood using the data on claims and observed durations of thepolicies

25 Using the CARA specification as in equation (16) weobtain a slightly lower value of 25 104 NIS1

26 We find positive correlation in the errors of these tworegressions suggesting the existence of adverse selection inthe data and motivating a model with unobserved heteroge-neity in risk This test is similar to the bivariate probit testproposed by Chiappori and Salanie (2000) and replicatesearlier results reported in Cohen (2005)

27 In particular traffic fatalities and counts of trafficaccidents in Israel fell by 11 percent and 18 percent during1998 and 1999 respectively

28 Such biased marketing efforts will bias consumersagainst choosing the low deductible thus making them lookless risk averse This would make our estimate a lowerbound on the true level of risk aversion If only sophisti-cated consumers could see beyond the marketing effort andthis sophistication were related to observables (eg educa-tion) the coefficients on such observables would be biasedupward This is not a major concern given that the coefficientson most of the covariates are fairly stable when we estimatethe benchmark model separately for different years

759VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

TABLE 3mdashNO HETEROGENEITY IN RISK

Variable

Poisson regressiona Probit regressionsc

Dep var Number of claims Dep var 1 if low deductible

(1) (2) (3)

Coeff Std Err IRRb Coeff Std Err dPdX dPdXDemographics Constant 43535 02744 mdash 224994 23562 mdash

Age 00074 00052 09926 01690 00464 00034 00042Age2 989 105 536 105 10001 00017 00005 339 105 455 105Female 00456 00161 09555 08003 01316 00165 00134Family Single omitted omitted omitted

Married 01115 00217 08945 06417 01950 00128 00045Divorced 00717 00346 10743 03191 03127 00064 00044Widower 00155 00527 10156 03441 04573 00069 00025Other (NA) 01347 01755 11442 29518 19836 00511 00370

Education Elementary 00522 00550 09491 02927 04278 00061 00016High school omitted omitted omittedTechnical 00373 00308 10380 05037 02520 00105 00138College 00745 00197 09282 05279 01546 00109 00053Other (NA) 00116 00184 10116 00630 01540 00013 00025

Emigrant 00210 00163 10213 00497 01328 00010 00003

Car attributes Log(Value) 01227 00281 11306 13285 02342 00271 00299Car age 00187 00042 10189 01603 00357 00033 00017Commercial car 01394 00326 08699 09038 02781 00177 00294Log(engine size) 02972 00459 13461 06924 03952 00141 00075

Driving License years 00204 00034 09798 01043 00312 00021 00005License years2 00002 677 105 10002 00015 00005 299 105 167 105

Good driver 00176 00191 09825 07207 01574 00148 00152Any driver 00564 00169 10580 10923 01321 00217 00258Secondary car 00859 00209 09177 00038 01626 00001 00070Business use 01852 00293 12034 07951 02737 00156 00017History length 00527 00110 09486 11218 01450 00228 00171Claims history 06577 00390 19304 04654 05460 00095 00496

Young driver Young driver 05235 00361 16879 28847 06305 00524 00012Gender Male omitted omitted omitted

Female 01475 00288 08629 17959 03135 00396 00195Age 17ndash19 omitted omitted omitted

19ndash21 00701 00532 10726 07800 07509 00153 0014721ndash24 00267 00574 09737 05746 07773 00114 0015624 02082 00567 0812 17869 07328 00397 00136

Experience 1 omitted omitted omitted1ndash3 02416 00458 07854 10175 06577 00217 000093 02827 00532 07538 32513 07386 00762 00410

Company year First year omitted omitted omittedSecond year 00888 00198 09150 44513 01506 00792 00859Third year 00690 00222 09334 85888 01820 01303 01367Fourth year 01817 00232 08339 118277 02102 01616 01734Fifth year 05431 0028 05810 143206 02778 01920 02085

105586Obs 105800 94000 105800Pseudo R2 00162 01364 01296Log likelihood 577459 369595 430869

Significant at the 5 percent confidence levela Maximum likelihood estimates Variation in exposure (policy duration) is accounted forb IRR Incidence rate ratio Each figure should be interpreted as the increasedecrease in claim probability as a result of an increase of one unit in the

right-hand-side variablec There are two separate probit regressions reported in this table Column 2 relies on the deductible choice model and the lognormality assumption As discussed

in the text by including an additional regressor ln([pi((xi)di) 1]d i) (with (xi) predicted from column 1 above) and normalizing its coefficient to 1 we obtaina structural interpretation of this regression with ln (ri) as the dependent variable Thus the reported coefficients are comparable to those estimated for the benchmarkmodel One should be cautious however in interpreting these coefficients Unlike the benchmark model this regression does not allow unobserved heterogeneity inrisk and suffers from some selection bias because observations with high predicted risk rate are omitted (which is why the number of observations is 94000 ratherthan the full sample of 105800) For comparison column 3 reports the marginal effects from a comparable probit regression that uses the full sample and does notcontrol for pricing and predicted risk through the additional structural regressor Column 3 does not have a structural interpretation and its (unreported) coefficientscannot be compared to those estimated from the benchmark model

760 THE AMERICAN ECONOMIC REVIEW JUNE 2007

and the risk rate (xi) as predicted from thePoisson regression of column 1 All three addi-tional regressors enter with the predicted signand with large and highly significant marginaleffects29

Finally column 2 of Table 3 presents animportant specification of the probit regres-sion in which ln([pi(( xi)di) 1]d i) isadded as a regressor and its coefficient isnormalized to 130 As already mentioned ifwe assume that there is no unobserved heter-ogeneity in risk then column 2 is analogousto the ln(ri) equation of the benchmark modelThis restriction of the model is rejected by thedata About 10 percent of the individuals arepredicted to have (xi) (pi di) implying achoice of low deductible for any level of riskaversion Many of these individuals howeverstill choose higher deductible levels Column2 reports the results for the remaining indi-viduals (94000 out of 105800) for which theregression can be run While the signs of theestimated coefficients are similar to those inthe benchmark model presented below therestricted version of the model suggests muchhigher levels and dispersion of risk aversionwell above any reasonable level31 The fullestimation of the benchmark model clearly re-jects this restriction on the model

B Estimation of the Benchmark Model

The Effect of Individual Characteristics onRisk AversionmdashTable 4 presents the estimationresults from the benchmark model The secondcolumn shows how the level of absolute riskaversion is related to individual characteristicsAs the dependent variable is in natural loga-

rithm coefficients on dummy variables can bedirectly interpreted as approximate percentagechanges The vast majority of these coefficientsare quite stable across a wide range of specifi-cations which are mentioned later

The results indicate that women are more riskaverse than men and have a coefficient of ab-solute risk aversion about 20 percent greaterthan that of men These results are consistentwith those of Donkers et al (2001) and Hartoget al (2002) The estimated effect of age sug-gests a nonmonotone pattern of risk preferencesover the life cycle The estimated coefficientsimply that initially (that is at age 18 the youngestindividual in the data) individuals become less riskaverse with age but around the age of 48 indi-viduals start becoming more risk averse32 Mar-ried individuals are estimated to be significantlymore risk averse compared to singles while di-vorced individuals are less (although the coeffi-cient is statistically insignificant)

The analysis suggests that variables that arelikely to be correlated with income or wealthsuch as post-high-school education and thevalue of the car have a positive coefficientindicating that wealthier people have higherlevels of absolute risk aversion Although we donot have data on individualsrsquo income or wealthwe use other proxies for income in additionalspecifications and obtain mixed results Whenwe include as a covariate the average householdincome among those who live in the same zipcode we obtain a significant and negative co-efficient of 0333 (0154) (standard deviationin parentheses) When we match zip code in-come on demographics of the individuals thecoefficient is effectively zero 0036 (0047)and when we use a socioeconomic index ofthe locality in which the individual lives thecoefficient is positive 0127 (0053)33 Thus

29 The estimated marginal effect (z-statistic in parenthe-ses) is 0352 (1376) 16 104 (1481) and 0154(255) for (pidi) d i and (xi) respectively

30 The level is a normalization The sign is estimatedHad the sign on this regressor been positive this would haveimplied a rejection of the model

31 For the implied median level of risk aversion the re-stricted model produces a similar estimate to the estimate wereport below for the benchmark model However since someindividuals who chose the low deductible are estimated to havevery low claim rates the restricted model is ldquoforcedrdquo to estimatevery high risk aversion for these individuals (in contrast thebenchmark model can explain such choices by positive risk re-siduals) resulting in very high dispersion and (due to the lognor-mality assumption) very high average risk aversion which isabout 1025 higher than the benchmark estimates we report below

32 This nonmonotone pattern may explain why age enterswith different signs in the estimation results of Donkers et al(2001) and Hartog et al (2002) A somewhat similar U-shapepattern with respect to age is also reported by Sumit Agarwalet al (2006) in the context of consumer credit markets

33 The full results from these specifications are providedin the online Appendix (available at httpwwwe-aerorgdatajune0720050644_apppdf ) The reason we do not usethese income variables in the benchmark specification istheir imperfect coverage which would require us to omitalmost 20 percent of the individuals Other results fromthese regressions are similar to those we report for thebenchmark specification

761VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

overall we interpret our findings as suggestiveof a nonnegative and perhaps positive associ-ation between incomewealth and absolute riskaversion

At first glance these results may appear tobe inconsistent with the widely held beliefthat absolute risk aversion declines withwealth One should distinguish however be-tween two questions (a) whether for a givenindividual the vNM utility function exhibitsdecreasing absolute risk aversion and (b)how risk preferences vary across individuals

Our results do not speak to the first questionand should not be thought of as a test of thedecreasing absolute risk aversion propertyTesting this property would require observingthe same individual making multiple choicesat different wealth levels Rather our resultsindicate that individuals with greater wealthhave utility functions that involve a greaterdegree of risk aversion It might be thatwealth is endogenous and that risk aversion(or unobserved individual characteristics thatare correlated with it) leads individuals to

TABLE 4mdashTHE BENCHMARK MODEL

Variable Ln() equation Ln(r) equation Additional quantities

Demographics Constant 15406 (00073) 118118 (01032) Var-covar matrix ()

Age 00001 (00026) 00623 (00213) 01498 (00097)Age2 624 106 (263 105) 644 104 (211 104) r 31515 (00773)Female 00006 (00086) 02049 (00643) 08391 (00265)Family Single Omitted Omitted

Married 00198 (00115) 01927 (00974) Unconditional statisticsa

Divorced 00396 (00155) 01754 (01495) Mean 02196 (00013)Widower 00135 (00281) 01320 (02288) Median 02174 (00017)Other (NA) 00557 (00968) 04599 (07397) Std Dev 00483 (00019)

Education Elementary 00194 (00333) 01283 (02156) Mean r 00019 (00002)High school Omitted Omitted Median r 727 106 (756 107)Technical 00017 (00189) 02306 (01341) Std Dev r 00197 (00015)College 00277 (00124) 02177 (00840) Corr(r ) 02067 (00085)Other (NA) 00029 (00107) 00128 (00819)

Emigrant 00030 (00090) 00001 (00651) Obs 105800

Car attributes Log(value) 00794 (00177) 07244 (01272)Car age 00053 (00023) 00411 (00176)Commercial car 00719 (00187) 00313 (01239)Log(engine size) 01299 (00235) 03195 (01847)

Driving License years 00015 (00017) 00157 (00137)License years2 183 105 (351 105) 148 104 (254 104)Good driver 00635 (00112) 00317 (00822)Any driver 00360 (00105) 03000 (00722)Secondary car 00415 (00141) 01209 (00875)Business use 00614 (00134) 03790 (01124)History length 00012 (00052) 03092 (00518)Claims history 01295 (00154) 00459 (01670)

Young driver Young driver 00525 (00253) 02499 (02290)Gender Male Omitted mdash

Female 00355 (00061) mdashAge 17ndash19 Omitted mdash

19ndash21 00387 (00121) mdash21ndash24 00445 (00124) mdash24 00114 (00119) mdash

Experience 1 Omitted mdash1ndash3 00059 (00104) mdash3 00762 (00121) mdash

Company year First year Omitted OmittedSecond year 00771 (00122) 14334 (00853)Third year 00857 (00137) 28459 (01191)Fourth year 01515 (00160) 38089 (01343)Fifth year 04062 (00249) 39525 (01368)

Note Standard deviations based on the draws from the posterior distribution in parentheses Significant at the 5 percent confidence levela Unconditional statistics represent implied quantities for the sample population as a whole ie integrating over the distribution of covariates in the sample (as

well as over the unobserved components)

762 THE AMERICAN ECONOMIC REVIEW JUNE 2007

save more to obtain more education or totake other actions that lead to greaterwealth34

Let us make several additional observationsFirst while owners of more expensive cars ap-pear to have both higher risk exposure andhigher levels of risk aversion owners of biggercars have higher risk exposure but lower levelsof risk aversion This should indicate that thestructure of the model does not constrain therelationship between the coefficients in the twoequations Rather it is the data that ldquospeak uprdquoSecond individuals who are classified by theinsurer as ldquogood driversrdquo indeed have lowerrisk but also appear to have lower risk aversionThis result is somewhat similar to the positivecorrelation between unobserved risk and unob-served risk aversion which we report belowThird policyholders who tend to use the car forbusiness are less risk averse This could bebecause uninsured costs of accidents occurringto such policyholders are tax deductible Fourthpolicyholders who reported three full years ofclaims history are more risk averse but are notdifferent in their risk exposure The attitude thatleads such policyholders to comply with therequest to (voluntarily) report three full years ofclaims history is apparently and not surpris-ingly correlated with higher levels of risk aver-sion In contrast while past claims indicate highrisk they have no significant relationship withrisk aversion Finally we find a strong trendtoward lower levels of risk aversion over timeThis is a replication of the probit results dis-cussed earlier

The Effect of Individual Characteristics onClaim RiskmdashThe first column of Table 4 de-scribes the relationship between observablesand risk exposure Accident risk is higher fordivorced individuals and lower for people witha college education Bigger older more expen-sive and noncommercial cars are all morelikely to be involved in an accident Drivingexperience reduces accident rates as do mea-sures of less intense use of the car while youngdrivers are more exposed to risk Claim propen-sity is highly correlated over time the voluntaryreport of past claims is a strong predictor offuture claims This risk equation produces re-sults that are similar to those of the simplerPoisson regression reported in Table 3 Al-though some of the coefficients lose signifi-cance the magnitude of most coefficients isquite similar The similarity between these twosets of results is to be expected as the riskregression is identified primarily from the dataon claims so incorporating the information ondeductible choice does not qualitatively changethe conceptual identification strategy (see Sec-tion IIC) If the results were not similar thiscould have indicated a misspecification of themodel The slight differences between the riskregressions in Table 3 and Table 4 are drivenmainly by the structural assumptions First thebenchmark model estimates a lognormal mix-ture of Poisson rates rather than a single Pois-son model By incorporating the fatter tails ofthe claim distribution it slightly changes theresults increases the standard errors and de-creases the average predicted claim rate Sec-ond the information on deductible choiceslightly helps us in obtaining more precise es-timates through the correlation structure be-tween the error terms in the two equations

The Implied Level of Risk AversionmdashOne ofthe main goals of the estimation is to obtainmeasures of the level of risk aversion in thepopulation we study Since we use Gibbs sam-pling and augment the latent coefficients ofabsolute risk aversion we can directly obtainthe posterior distribution of various moments ofthis distribution At each iteration of the Gibbssampler we compute the mean standard devi-ation (across individuals) and various percen-tiles of the simulated draws of i and ri and thecorrelation between them The far-right columnof Table 4 reports the averages and standard

34 One may be tempted to interpret the positive wealtheffects as an indirect indication of credit constraints wealth-ier individuals are less credit constrained and therefore canafford to purchase more insurance We do not share thisinterpretation for two reasons First the insurance companyobserves these proxies for wealth and conditions on themwhen setting prices Since the willingness to pay for insur-ance is likely to be correlated with the willingness to pay forthe additional insurance provided by the low deductibleoption premiums already reflect this variation We condi-tion on the level of the premium Second paying less exante implies paying more ex post so applying the creditconstraint argument only to the ex ante payment but not tothe probabilistic ex post deductible payments has no theo-retical foundation Essentially the setup of the model linksthe ex ante decisions to the ex post losses which are bothdriven by the curvature of the vNM utility function

763VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

deviations of these computed quantities over theiterations of the Gibbs sampler35 The impliedrisk aversion of the mean individual is 00019which is about seven times greater than theback-of-the-envelope calculation presented inthe previous section As we assume a lognormal

distribution and estimate a relatively high dis-persion coefficient r the estimates also implysignificantly lower values for the median levelof risk aversion As shown later the qualitativepattern of these results is quite robust acrossspecifications

Table 5 summarizes our findings regardingthe level and dispersion of risk aversion Itpresents two ways to interpret the estimates weobtained from the benchmark model as well ascomparisons to specifications with constant ab-solute risk aversion (CARA) utility and with

35 Note that these estimated quantities are unconditionalIn computing these quantities we integrate over the distri-bution of observable characteristics in the data so onecannot compute these estimates from the estimated param-eter directly

TABLE 5mdashRISK-AVERSION ESTIMATES

Specificationa Absolute risk aversionb Interpretationc Relative risk aversiond

Back-of-the-envelope 10 103 9070 1484Benchmark model

Mean individual 67 103 5605 972225th percentile 23 106 9998 003Median individual 26 105 9974 03775th percentile 29 104 9714 42790th percentile 27 103 7834 390295th percentile 99 103 4937 14327

CARA utilityMean individual 31 103 7651 4436Median individual 34 105 9966 050

Learning modelMean individual 42 103 6886 6140Median individual 56 106 9995 008

Comparable estimatesGertner (1993) 31 104 9699 479Metrick (1995) 66 105 9934 102Holt and Laury (2002)e 32 102 2096 86575Sydnor (2006) 20 103 8329 5395

a The table summarizes the results with respect to the level of risk aversion ldquoBack-of-the-enveloperdquo refers to thecalculation we report in the beginning of Section III ldquobenchmark modelrdquo refers to the results from the benchmark model(Table 4) ldquoCARA utilityrdquo refers to a specification of a CARA utility function and ldquolearning modelrdquo refers to a specificationin which individuals do not know their risk types perfectly (see Section IIID) The last four rows are the closest comparableresults available in the literature

b The second column presents the point estimates for the coefficient of absolute risk aversion converted to $US1 unitsFor the comparable estimates this is given by their estimate of a representative CARA utility maximizer For all otherspecifications this is given by computing the unconditional mean and median in the population using the figures we reportin the mean and median columns of risk aversion in Table 6 (multiplied by 352 the average exchange rate during the periodto convert to US dollars)

c To interpret the absolute risk aversion estimates (ARA) we translate them into x u(w) 1frasl2 u(w 100) 1frasl2 u(w x) That is we report x such that an individual with the estimated ARA is indifferent about participating in a 50ndash50lottery of gaining 100 US dollars and losing x US dollars Note that since our estimate is of absolute risk aversion the quantityx is independent of w To be consistent with the specification we use a quadratic utility function for the back-of-the-envelopebenchmark and learning models and use a CARA utility function for the others

d The last column attempts to translate the ARA estimates into relative risk aversion We follow the literature and do soby multiplying the ARA estimate by average annual income We use the average annual income (after tax) in Israel in 1995(51168 NIS from the Israeli Census) for all our specifications and we use average disposable income in the United Statesin 1987 ($15437) for Gertner (1993) and Metrick (1995) For Holt and Laury (2002) and Sydnor (2006) we use a similarfigure for 2002 ($26974)

e Holt and Laury (2002) do not report a comparable estimate The estimate we provide above is based on estimating aCARA utility model for the 18 subjects in their experiment who participated in the ldquo90rdquo treatment which involved stakescomparable to our setting For these subjects we assume a CARA utility function and a lognormal distribution of theircoefficient of absolute risk aversion The table reports the point estimate of the mean from this distribution

764 THE AMERICAN ECONOMIC REVIEW JUNE 2007

incomplete-information (discussed later) and toother comparable figures in the literature Ourbenchmark estimate suggests that an averagequadratic utility maximizer36 will be indifferentabout participating in a 50ndash50 lottery in whichhe gains $100 or loses $5605 A CARA spec-ification suggests a significantly lower value foraverage risk aversion making the mean individ-ual indifferent about a 50ndash50 lottery of gaining$100 or losing $7651 The results from theincomplete information model for the meanlevel of risk aversion are in between these esti-mates All the results suggest that although themean individual exhibits a significant level ofrisk aversion heterogeneity in risk preferencesis important and the median individual is al-most risk neutral with respect to lotteries of$100 magnitude Although the mean is alwaysgreater than the median under the lognormaldistribution the large difference we find is notimposed In principle we could have obtained ahigh level of risk aversion with less heteroge-neity thereby leading to a smaller differencebetween the mean and the median (the esti-mated distribution of risk types is an example)

Let us briefly discuss the relevance of thecomparison to Gertner (1993) and Metrick(1995) There are two ways one can reconcilethe differences between the estimates Firstboth these papers measure risk aversion fortelevision game show participants these arehighly selected groups in a rather ldquorisk-friendlyrdquo environment37 Second the magni-tudes of the stakes are higher The showparticipants bet on several thousand dollars andmore while our average individual risks muchlower stakes in the range of $100 Thus thedifference in the results may be due to the issuesraised in Rabin (2000) regarding the compara-bility of behavior across different contexts andbet sizes We discuss this further in Section IV

A different way to quantify our estimate is byreporting them in relative terms There is noconsensus in the literature as to the relevantwealth that is taken into account in such deci-

sions Therefore for comparability we followthe closest papers in this respect (eg Gertner1993) and use annual income as the relevantwealth We multiply the estimated coefficient ofabsolute risk aversion by the average annualincome in Israel during the observation periodUnder the (questionable) assumption that an-nual income is a good proxy for the relevantwealth at the time of decision making thisproduct would be a proxy for the coefficient ofrelative risk aversion As Table 5 indicates ourbenchmark specification results in an impliedcoefficient of relative risk aversion of about 97A CARA specification results in a lower coef-ficient of 44 On the other hand the medianestimate for relative risk aversion is well belowone Thus the widely used estimate of a lowsingle-digit coefficient of relative risk aversionfalls somewhere between the median and themean and between the median and the seventy-fifth percentile of the risk aversion distribution

The Relationship between Unobserved Riskand Unobserved Risk AversionmdashTable 4 al-lows us to make observations about the relation-ship between risk and risk aversion We firstdiscuss the relative importance of unobservedheterogeneity of both dimensions In the popu-lation we study unobserved heterogeneity inrisk aversion (r) is much greater than unob-served heterogeneity in risk () This is trueboth in absolute terms (315 compared to 015respectively) and after normalizing by the cor-responding mean level38 using the coefficientof variation as a measure of dispersion (027compared to 01 respectively) It is also true forthe overall unconditional dispersion This couldindicate that selection on risk aversion is moreimportant in our data than adverse selection39 Theright metric to use for such statements is not

36 For such an individual the second-order Taylor ex-pansion we use in Section IIA is exact

37 We suspect that individuals who participate in televi-sion game shows are more adventuresome than the generalpopulation Moreover knowing that the audience mightwish to see them keep betting is likely to further encourageparticipants to take risks

38 All covariates are measured in deviations from theirsample mean so the estimated constant in each equation isthe estimated mean of the left-hand-side variable

39 An additional observation is that given our estimatesobservables account for slightly less than 50 percent of thevariation in ln(ri) but for almost 65 percent of the variation inln(i ) This may seem surprising given the finding that disper-sion in risk aversion is more important and thus should be thefocus of insurance companies However this finding is con-sistent with the conventional wisdom that insurance companiesspend much effort and resources on collecting information thathelps in risk classification but only little effort on informationthat predicts willingness to pay

765VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

entirely clear however as one should projectthese estimated variances onto the same scaleof say willingness to pay or profits Profits forexample are affected directly by risk but not byrisk aversion so the comparison above could bemisleading Therefore we relegate the discus-sion of this issue to Section IIIF where weshow that even when we look at pricing andprofits heterogeneity in risk aversion is moreimportant

Table 4 also indicates a strong and signif-icant positive correlation of 084 between un-observed risk aversion and unobserved riskThis result may be surprising because it isnatural to think that risk aversion with respectto financial decisions is likely to be associatedwith a greater tendency to take precautionsand therefore with lower risk Indeed a recentpaper by Finkelstein and McGarry (2006)supports such intuition by documenting anegative correlation between risk aversionand risk in the market for long-term careinsurance (see also Israel 2005 for the autoinsurance market in Illinois) Our marketmight however be special in ways that couldproduce a positive correlation First in con-trast to most insurance markets where apolicyholderrsquos risk depends on the policy-holderrsquos precautions but not on the precau-tions of others accident risk in the autoinsurance market is a result of an interactionbetween onersquos driving habits and those ofother drivers Second the correlation coeffi-cient may be highly sensitive to the particularway we measure risk and risk aversion Thereare many unobserved omitted factors that arelikely to be related to both dimensions Theintensity of vehicle use for example mightbe an important determinant of risk If indi-viduals who are more risk averse also drivemore miles per year a positive correlationbetween risk and risk aversion could emergeThus our results caution against assumingthat risk and risk aversion are always nega-tively correlated Whether this is the case maydepend on the characteristics of the particularmarket one studies and on the particular mea-sure for risk Indeed one can use estimatedannual mileage to control for one omittedvariable that may potentially work to producea positive correlation between risk aversionand risk Despite its partial coverage in thedata and being considered (by the company)

as unreliable40 controlling for annual mileagereported by policyholders reduces the esti-mated correlation coefficient to 06841 Weview this result as consistent with the possi-bility that underlying unobserved factors thataffect risk play an important role in generat-ing the estimated positive correlation betweenrisk and risk aversion A third explanation forthe positive estimated correlation is the dis-tributional assumption As discussed in Sec-tion IIC the correlation coefficient isprobably the parameter that is most sensitiveto these assumptions Indeed as discussedlater when we change our assumptions aboutthe Poisson process and use an alternativedistribution with extremely thin tails the es-timated correlation coefficient reverses signsWe of course view the assumptions of thebenchmark model as more appropriate andtherefore maintain the view that the data sug-gest a positive correlation Finally note thatwhile the correlation parameter we estimate ishigh the implied unconditional correlationbetween risk and risk aversion is less than025 across all reported specifications This isbecause the coefficients on the same covariate(for example the size of the car or whetherthe car is used for business) often affect riskand risk aversion in opposite directions andbecause of the logarithmic transformation

C Stability of the Risk Aversion Coefficients

Estimating risk preferences is motivated bythe idea that the same (or similar) risk aversionparameter may explain risky decisions acrossmultiple contexts This idea is at the heart of thecurrent debate we mention in the introductionregarding the empirical relevance of expectedutility and whether the standard construct of a

40 Insurance companies typically do not use these self-reported mileage estimates as they are considered unreli-able While companies could verify these estimates at thetime of a claim such reports are hard to enforce Anindividual can always claim that her ex ante estimate waslower than it turned out to be Indeed the estimated elas-ticity of risk with respect to estimated mileage is 006(0012) which seems low suggesting a bias downwardpotentially due to ldquoerrors in variablesrdquo bias

41 The full results from this specification are providedin the online Appendix Other results from this specifi-cation are similar to those we report for the benchmarkspecification

766 THE AMERICAN ECONOMIC REVIEW JUNE 2007

single risk aversion parameter that appliesacross many types of decisions is the right wayto think about consumer behavior To the bestof our knowledge there is no direct empiricalevidence on this issue While we defer much ofthis discussion to the concluding section weverify below that at least over a limited set ofchoices in which we observe the same individ-uals the estimated risk preferences help to ex-plain choices over time and across differentcontexts

Stability across ContextsmdashIdeally onewould like to show that the estimated risk pref-erences are stable across multiple lines of insur-ance and perhaps even across other riskydecisions individuals face Absent data on suchchoices we provide evidence that the estimatedrisk aversion coefficients help in predictingother related risky choices Individuals in ourdata had to make three additional coveragechoices in addition to the main policy choicethat we have analyzed so far They had tochoose whether to purchase coverage for caraudio equipment (bought by 498 percent of thesample of new policyholders) for towing ser-vices (bought by 929 percent) and for wind-shield damage (bought by 954 percent) Thesetypes of coverage are sold by the same companybut are administered by a third party so we donot have information about their realizationTherefore we cannot perform a similar exerciseto the one we perform for the deductible choiceOf course one should not expect these addi-tional coverage choices to be perfectly corre-lated with the deductible choice Even if thesame risk preferences are an important factorfor all of these decisions variation in risk acrosscoverage is also important For example own-ership of an expensive car audio system is likelyto affect the purchase of audio coverage inde-pendent of risk preferences Similarly ownersof old cars may value more coverage for towingservices Risk preferences however should en-ter into all of these coverage choices and there-fore we expect these additional types ofcoverages to be positively correlated with thechoice of a low deductible

We coded these three additional coveragechoices as three dummy variables and use themas additional covariates to verify that they canhelp explain unobserved risk aversion Firstwe add these variables to the probit regression

reported in column 3 of Table 3 They all havea positive statistically and economicallysignificant power in predicting the deductiblechoice42 To verify that these reported correla-tions are not driven by correlation in risks wealso estimate the benchmark model with theseadditional variables as covariates The esti-mated coefficients on the three additional cov-erage choices are positive and two of the threeare significant in both the risk and risk aversionequations they are 0053 (0019) 00002 (0021)and 0057 (0023) in the risk equation and0354 (0064) 0123 (0122) and 0813 (0174)in the risk aversion equation for audio towingand windshield coverage respectively (standarddeviations in parentheses)43 This suggests thatthe estimated risk preferences help in explainingmultiple coverage choices across these relatedcontexts

Stability over TimemdashWe now provide evi-dence that the estimated risk preferences arealso stable over time All the individuals whodecide to renew their policy with the companyafter it expires are at least in principle free tochange their deductible choice However anoverwhelming majority of individuals (morethan 97 percent) do not change their deductiblechoices when they renew Even individuals whoinitially chose the rarely chosen ldquohighrdquo andldquovery highrdquo deductible levels typically do notchange their choices Therefore it is also notsurprising that estimating the model on the firstdeductible choice of individuals and the seconddeductible choice of the same individuals yieldssimilar results as shown at the end of the nextsection Of course changes in deductiblechoices do not necessarily imply changes inrisk preferences Risk may also change overtime and may drive such changes At thesame time while persistence in deductiblechoices over time is consistent with stablerisk preferences it may also be driven bymany other factors For example it is reason-able to think that individuals do not devote

42 They are estimated to increase on average the prob-ability of choosing a low deductible by 0019 (819) 0023(527) and 0041 (758) for audio towing and windshieldcoverage respectively (z stat in parentheses)

43 The full results from this specification are provided in theonline Appendix Other results from this specification aresimilar to those we report for the benchmark specification

767VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

the same amount of thought to their renewaldecision as to their initial choice leading tothe observed persistence This caveat is themain reason why we focus most of the analysisonly on new policies

D Robustness

This section discusses the robustness of themain results Results from all specifications wemention in this section are summarized in Table6 The full results from these specifications arerelegated to the online Appendix

The von NeumannndashMorgenstern Utility Func-tionmdashTo derive the benchmark model we as-sume a negligible third derivative of the vNMutility function This is attractive as it allowsus to summarize risk preferences by a one-dimensional parameter r [u13(w)u(w)]There is however a large literature (eg MilesS Kimball and N Gregory Mankiw 1989) em-phasizing the importance of a (positive) third

derivative of the utility function which leads toprecautionary saving It is therefore importantto address the sensitivity of the results to thisassumption

By allowing a third-order Taylor approxima-tion of equation (4) we obtain

(15) r u13w

uw

p

d 1

d

uw

uw

dh2 dh dl dl

2

6d

which reduces to equation (7) when u(w) 0This illustrates that a positive third derivativeprovides an additional (precautionary) incentiveto insure It also shows that in order to fullydescribe preferences in the presence of a thirdderivative a one-dimensional parameter doesnot suffice one needs to know both r [u13(w)u(w)] and [u(w)u(w)] This makesit less attractive for estimation

TABLE 6mdashROBUSTNESS

Specificationa Sample Obs

Claim risk () Absolute risk aversion (r)

Corr(r ) Mean Median Std dev Mean Median Std dev

Baseline estimatesBenchmark model All new customers 105800 0220 0217 0048 19 103 73 106 0020 0207 0839

The vNM utility functionCARA utility All new customers 105800 0219 0217 0048 87 104 b 98 106 b 0009b 0201 0826

The claim generating processBenchmark model No multiple claims 103260 0182 0180 0040 20 103 28 105 0018 0135 0547Thinner-tail risk distribution All new customers 105800 0205c 0171c 0155c 17 103 19 106 0020 0076 0916

The distribution of risk aversionLower-bound procedure All new customers 105800 mdash mdash mdash 37 104 0 0002 mdash mdash

Incomplete information about riskBenchmark model Experienced drivers 82966 0214 0211 0051 21 103 83 106 0021 0200 0761Benchmark model Inexperienced drivers 22834 0230 0220 0073 30 103 12 107 0032 0186 0572Learning model All new customers 105800 0204 0191 0084 12 103 16 106 0016 0200 0772

Sample SelectionBenchmark model First two years 45739 0244 0235 0066 31 103 26 105 0026 0225 0699Benchmark model Last three years 60061 0203 0201 0043 16 103 34 107 0021 0113 0611Benchmark model Referred by a friend 26434 0213 0205 0065 30 103 84 107 0031 0155 0480Benchmark model Referred by advertising 79366 0219 0216 0051 21 103 76 106 0022 0212 0806Benchmark model Non-stayers 48387 0226 0240 0057 23 103 77 107 0026 0149 0848Benchmark model Stayers 1st choice 57413 0190 0182 0057 29 103 29 105 0024 0152 0463Benchmark model Stayers 2nd choice 57413 0208 0200 0065 30 103 16 105 0026 0211 0637

Note The table presents the key figures from various specifications and subsamples tracing the order they are presented in Section IIID Full results (in the format

of Table 4) from all these specifications are available in the online Appendixa ldquoBenchmark modelrdquo refers to the benchmark specification estimated on various subsamples (the first row replicates the estimates from Table 4) The other

specifications are slightly different and are all described in more detail in the corresponding parts of Section IIIDb The interpretation of r in the CARA model takes a slightly different quantitative meaning when applied to noninfinitesimal lotteries (such as the approximately

$100 stakes we analyze) This is due to the positive third derivative of the CARA utility function compared to the benchmark model in which we assume a small

third derivative Thus these numbers are not fully comparable to the corresponding figures in the other specificationsc The interpretation of in the thinner-tail distribution we estimate is slightly different from the standard Poisson rate which is assumed in the other specifications

Thus these numbers are not fully comparable to the corresponding figures in the other specifications

768 THE AMERICAN ECONOMIC REVIEW JUNE 2007

We can allow for a positive third derivativewithout expanding the dimensionality of pref-erences by imposing a parametric functionalform on the utility function Essentially suchparametric form imposes a relationship between[u13(w)u(w)] and [u(w)u(w)] Two standardforms are those that exhibit CARA and thosethat exhibit constant relative risk aversion(CRRA) CRRA requires us to make additionalassumptions about the relevant (and unob-served) wealth level of each individual makingit less attractive The CARA case is imple-mented below

With CARA utility we substitute u(w) exp(rw) in equation (4) and rearrange toobtain

(16) rp

exprdh exprdl

This equation defines the indifference set Unlikethe benchmark model there is no closed-formrepresentation for ri() a property that makesestimation significantly slower Due to the precau-tionary saving incentive arising from the thirdderivative we can use a lower level of absoluterisk aversion to rationalize the low deductiblechoice given In other words the CARA indif-ference set is flatter in comparison to the bench-mark case depicted in Figure 2 Thus the CARAspecification will generally lead to lower estimatesof the coefficient of absolute risk aversion asshown in Table 6 The mean and dispersion ofestimated absolute risk aversion (but not the me-dian) are smaller by about a factor of two Thegeneral qualitative results however remain thesame As shown in Table 5 the mean individual isquite risk averse with respect to lotteries of $100magnitude the median is close to risk neutral withrespect to such lotteries and heterogeneity in riskaversion is important

The Claim-Generating ProcessmdashIn thebenchmark model we assume that claims aregenerated by a Poisson model This assumptionis both analytically attractive (it is important inderiving equation (4)) and in our view a rea-sonable approximation of the data-generatingprocess for claims The Poisson distribution hasbeen widely used in economics to model acci-dent counts (eg Nancy L Rose 1990 Dionneand Vanasse 1992 Ronald Michener and CarlaTighe 1992) In using this distribution re-

searchers have followed the long history of theuse of Poisson by actuaries and insurance spe-cialists going back to Ladislaus Bortkiewicz(1898)

An important restriction of the Poisson dis-tribution is that its mean and variance are thesame Although some economic studies con-firmed this assumption for particular accidentdata (eg Christopher M Auld et al 2001) it isoften the case that this restriction is falsifiedThe most common deviation from the Poissonrestriction is that of fat tails ie variance that ishigher than the mean This led researchers touse a negative binomial distribution to accom-modate this regularity by introducing a secondparameter which delinks the relationship be-tween the mean and the variance One naturalinterpretation of the fat tails is that of unob-served heterogeneity and the negative binomialdistribution can be viewed as a Gamma mixtureof Poisson processes Consistent with this viewwe assume that the claim-generating processfollows a Poisson process at the individuallevel but allows unobserved heterogeneity inrisk and estimates a lognormal mixture of Pois-son processes which is similar to a negativebinomial The dispersion parameter we esti-mate is a free parameter which is identi-fied by the fatness of the tails of the claimdistribution

While one would like to allow fat tails of theaggregate claim distribution one may criticizethe assumption of a Poisson distribution at theindividual level for having tails that may be ldquotoofatrdquo Recently Jaap H Abbring et al (2003) andMark Israel (2004) provided evidence for neg-ative state dependence in data from auto insur-ance claims similar to the ones we useControlling for heterogeneity across individu-als these papers show that a second claim isless likely than a first This may happen due toexperience rating or perhaps to more carefuldriving and less intense use of a car after anaccident The Poisson distribution assumes thatthe second accident is just as likely so negativestate dependence may suggest thinner tails atthe individual level

To verify that the main results are not sensi-tive to this restriction of the Poisson model weperform two tests First we estimate the bench-mark model on a sample that includes onlyindividuals with one or no claims The model isstill identified using variation in policy durations

769VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

The estimates of the level of risk aversion andits dispersion across individuals remain similarto those of the benchmark model (Table 6) Theestimates of risk and the correlation with riskaversion are significantly lower But this isexpected we selected out of the sample thehigh-risk individuals

As a further test for the sensitivity of theresults to possible negative state dependencewe estimated the model with a different distri-bution at the individual level which gives riseto much thinner tails We could not find anyknown distribution of count variables that givesrise to tails thinner than Poissonrsquos so we madeone up In particular for a given Poisson ratelet pn be the probability of observing n occur-rences The distribution we take to the data issuch that

(17) Prn pn2

yenm0 pm2

Thus this distribution makes the probability ofmultiple claims much lower probably muchmore so than any realistic negative state depen-dence in the data Essentially such a distribu-tion makes the econometrician view individualswith multiple claims as high-risk individualswith much more certainty as it requires muchmore ldquobad luckrdquo for low-risk individuals tohave multiple claims The estimates for thelevel of risk aversion and its dispersion arealmost the same as in the benchmark model(Table 6) The rest of the summary figures aredifferent The difference in the level of risk isnot informative the interpretation of isslightly different given the change in the un-derlying distribution The dispersion in ismuch higher This is a direct result of the thin-tail assumption The dispersion of is identifiedby the tails of the claim distribution at theaggregate level With thinner tails imposed onthe individual distribution more heterogeneityis needed to match the observed tails at theaggregate Finally the correlation coefficient inthis specification changes signs and is now neg-ative and close to 1 This is consistent withour discussion in Section IIC which emphasizesthat the identification of the correlation coeffi-cient is closely tied to the structural assump-tions Once these are changed the estimatedcorrelation coefficient would change too Most

importantly however the mean and dispersionof risk aversion are stable

The Distribution of Absolute Risk Aver-sionmdashIn the benchmark model we assume thatthe coefficient of absolute risk aversion is lognor-mally distributed across individuals Since onlyfew previous studies focused on heterogeneity inrisk preferences there is not much existing evi-dence regarding the distribution of risk prefer-ences The only evidence we are aware of is theexperimental results presented by Steffen Andersenet al (2005) which show a skewed distributionwith a fat right tail which is qualitatively consis-tent with the lognormal distribution we assume

An additional advantage of the normality as-sumption is computational It provides a closed-form conditional distribution allowing us to useonly univariate (rather than bivariate) draws inthe estimation procedure significantly reducingthe computational burden One may be con-cerned about the sensitivity of the results to thisdistributional assumption For example it maydrive the result that the median level of riskaversion is much lower than the mean

Incorporating alternative distributional as-sumptions significantly complicates and slowsthe estimation procedure44 As an alternativewe develop a procedure that we believe pro-vides some guidance as to the sensitivity of theresults to this distributional assumption Thedisadvantage of the procedure is that it cannotaccount for adverse selection Since we foundthat adverse selection is not that important thisexercise is informative The exercise conveysthat the main qualitative results are not drivenby the (fat) tails of the lognormal distributionwe impose Rather they are driven by the highfraction of individuals who chose a low deduct-ible despite being of low risk The model im-plies that such individuals must have a fairlyhigh level of risk aversion

The exercise uses a Gibbs sampler to estimatethe lognormal distribution of i given the co-variates and the observed number of claims Ineach iteration of this Gibbs sampler conditional

44 A simple discrete-type distribution of risk aversion isnot well identified with the given data This is due to thenature of the exercise The distribution is identified bythe binary deductible choice and by the ldquolocalrdquo variation inthe pricing menu This variation is not enough to pin downthe discrete types accurately

770 THE AMERICAN ECONOMIC REVIEW JUNE 2007

on the data and the most recent draws for theindividual irsquos we compute lower bounds forthe level and dispersion of risk aversion (inaddition to generating the draws for ri from theconditional lognormal distribution) To deter-mine the lower bound for the level of risk aver-sion we compute

(18) r i i 0 if choicei 0

max0 pi

idi 113d i

if choicei 1

Namely we make ri as low as possible giventhe assumptions of the model We assume thatindividuals who chose a regular deductible arerisk neutral while individuals who chose a lowdeductible are just indifferent between the twodeductible levels (unless i is high enough inwhich case they are also assumed to be riskneutral) We then compute the average over therirsquos To compute the lower bound for the dis-persion we search for values of rirsquos that areconsistent with the assumptions of the modeland the observed data and that minimize thevariance of ri This turns out to be a simplesearch procedure that is linear in the number ofindividuals

The lower bound of the level of risk aversionthat we obtain from this exercise is 368 104

NIS1 (with standard deviation of 638 106)This level is five times lower than the analogouspoint estimate reported in Table 4 This trans-lates into indifference about participating in a50ndash50 lottery of gaining $100 and losing $885Thus it still suggests a significant level of riskaversion for lotteries of $100 magnitude Simi-larly the conversion to relative risk aversion asin Table 5 implies a relative risk aversion co-efficient of 1883 The result for the lowerbound of the dispersion is 189 103 (withstandard deviation of 816 105) which is tentimes lower than the benchmark estimate Thusthe coefficient of variation for absolute riskaversion declines only by two suggesting ahigh degree of heterogeneity Finally oneshould note that these bounds involve extremeassumptions and are computed separately andtherefore cannot be binding at the same timeThus the ldquocorrectrdquo estimates for both quantities

are likely to be higher closer to the results wereport for the benchmark model

Incomplete Information about RiskmdashThroughout we assume that individuals haveperfect information about their individual-specific objective risk type i This is a strongerassumption than we need Because expectedutility is linear in probabilities it suffices thatindividualsrsquo expected risk rate is equal to theirobjective risk rate ie i E(iIi) i wherei is a random variable representing individualirsquos perceived risk rate and Ii is individual irsquosinformation at the time of the coverage choiceNamely individuals could be uncertain abouttheir risk type but their point estimate has to becorrect

There are several channels through whichincomplete information may operate Let usconsider two such cases First suppose thatindividuals are correct but only on average iethat i i i where E(i) 0 The intuitionfor this case is similar to an ldquoerrors in variablesrdquomodel and in principle will result in an evenless important role for adverse selection Giventhat we find a relatively small effect of adverseselection this bias will not change this conclu-sion This may be even more pronounced ifcorr(i i) 0 which reflects a reasonableassumption of ldquoreversion to the meanrdquo ie thatan individualrsquos estimate of his risk type is someweighted average between his true risk type andthe average risk type of individuals who aresimilar (on observables) to him The conclusionmay go in the other way only if the mistakes goin the other direction according to which indi-viduals who are riskier than average believe thatthey are even more risky than they truly areThis we believe is less plausible

As a test for the sensitivity of the results tothe complete information assumption we per-form two exercises The first exercise estimatesthe benchmark model separately for experi-enced drivers and inexperienced drivers wherewe define an individual to be an experienceddriver if he has ten years or more driving expe-rience This is a conservative definition com-pared to those used in the literature (Chiapporiand Salanie 2000 Cohen 2005) The underlyingassumption consistent with evidence providedin Cohen (2005) is that drivers learn about theirown risk types as they gain more driving expe-rience but that most of this learning occurs

771VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

within the first ten years of driving Thus ourcomplete information model is a more appeal-ing description of experienced drivers Theresults for experienced drivers are almost iden-tical to the results from the benchmark specifi-cation (Table 6) suggesting that the mainresults are not driven by the fraction of individ-uals who are less likely to know their risk typesThe results for inexperienced drivers show asimilar pattern and magnitude Consistent withthe learning hypothesis however they show alarger dispersion in risk preferences which maybe due to incomplete information and there-fore to more random choices of deductiblelevels

The second exercise we perform is a morestructural version of the learning story whichallows even experienced drivers to have incom-plete information regarding their risk types Wedo so by estimating a different specification ofthe model that assumes that individuals areBayesian and update their information abouttheir own (stable) risk types over time usingonly information about the number of claimsthey make each year While we do not observecomplete claims histories of individuals wecan simulate such histories and integrate overthese simulations Thus individualsrsquo informa-tion would be related to their true type andwould be more precise with longer driving his-tories The full modeling and estimation detailsof this specification are provided in the Appen-dix We view this model as an extreme versionof incomplete information as there are manyother sources through which individuals maylearn about their own types and thereby havebetter information about their types than whatwe estimate them to have While the resultsimply that the levels of risk aversion and heter-ogeneity are lower than the benchmark esti-mates (Table 6) the order of magnitude andqualitative pattern are quite similar suggestingthat the main qualitative findings are robust tothe information structure

Sample SelectionmdashThere are two differentways to think about the contract selection pro-cess One possibility is that individuals firstselect an insurer based on advertisement wordof mouth or access to an agent Then individ-uals select the insurance contract from amongseveral contracts the selected insurer offersThis selection process is consistent with the way

the company executives view their business45

Another possibility is that individuals first col-lect information about all available insurancecontracts and then choose their most preferredone According to the industryrsquos conventionalwisdom individuals do not shop much acrossinsurers and therefore this selection processseems less important

The unique characteristics of direct insurersmay attract individuals who are more likely toexperiment with new ways of doing businessand may therefore be less risk averse than thegeneral population In Table 7 we compare thedemographics of our sample of policyholderswith those of the general Israeli populationThis comparison reflects a similar intuitioncompared with the general population our av-erage policyholder is slightly younger moreeducated more likely to be male and less likelyto be married or an immigrant This direction ofselection may also apply to unobserved riskpreferences thereby making our policyholderson average less risk averse than a representa-tive individual This suggests that the level ofrisk aversion that we find may be viewed as alower bound on the level of risk aversion in thegeneral Israeli population

One potential way to model sample selectionis to allow for an additional outside option to beselected For the vast majority of individuals weobserve the outside option is to purchase sim-ilar insurance from competing insurance agen-cies Unfortunately data on the structure ofcompeting contracts their prices and the waythey vary with individual characteristics are un-available This makes us uncomfortable to try tomodel sample selection as results from anysuch model will be driven by our assumptionsrather than by meaningful variation in the dataThe results are still meaningful for two reasonsFirst this is a large population accounting forabout 7 percent of all drivers in Israel Secondto the extent that our estimates suggest higherlevels of risk aversion than previously esti-mated and that the sample selection is likely to

45 As discussed in Section I the company was the firstdirect insurance provider and it offered significantly lowerpremiums than those offered by competitors due to signif-icant cost advantage In Section I we also discuss theliterature which emphasizes that choice of direct insurers isdriven primarily by nonmonetary ldquoamenitiesrdquo

772 THE AMERICAN ECONOMIC REVIEW JUNE 2007

bias these estimates downward the results arestill informative

To assess the magnitude of sample selectionin driving the main results we perform severaltests First we estimate the benchmark modelseparately for the first two years of the compa-nyrsquos operation during which the company wasthe only company to sell insurance directly andfor the remaining three years after additionaldirect insurers entered the market Second weestimate the benchmark model separately forindividuals who were referred to the companyby word of mouth and for those who heardabout the company through advertisement (pri-marily on television) The latter may havesearched more and might be more price sensi-tive Third we estimate the benchmark modelseparately for individuals who renewed theirpolicy with the company and for those who didnot renew It seems likely that individuals whodid not renew are less selected as most of themswitch back to regular insurers who insure themajority of the population Fourth we estimatethe model for the second deductible choicemade by those individuals who renew It couldbe argued that switching to other companies is

costly so outside options are not as important indriving deductible choices for those who renewAll these results are summarized in Table 6 andshow some important differences between thetwo groups within each pair of subsamples prob-ably reflecting selection For all groups howeverthe qualitative pattern of the results and the orderof magnitude and dispersion of risk aversion aresimilar to those of the full sample This is sugges-tive that correcting for sample selection is unlikelyto change the qualitative results

E Caveats

Moral HazardmdashThroughout our analysiswe abstract from moral hazard ie we assumethat i can vary across individuals but is invari-ant to the coverage choice There are two typesof moral hazard that may play a role in thiscontext First individuals with less coveragemay take greater precaution and drive morecarefully thereby reducing their claim risk rateSecond conditional on a claim event peoplewith higher deductibles are less likely to file aclaim there exists a range of claims for which

TABLE 7mdashREPRESENTATIVENESS

Variable Sampleb Populationc Car ownersd

Agea 4114 (1237) 4255 (1801) 4511 (1413)Female 0316 0518 0367Family Single 0143 0233 0067

Married 0780 0651 0838Divorced 0057 0043 0043Widower 0020 0074 0052

Education Elementary 0029 0329 0266High school 0433 0384 0334Technical 0100 0131 0165College 0438 0155 0234

Emigrant 0335 0445 0447

Obs 105800 723615 255435

a For the age variable the only continuous variable in the table we provide both the mean and the standard deviation (inparentheses)

b The figures are derived from Table 1 The family and education variables are renormalized so they add up to one weignore those individuals for whom we do not have family status or education level This is particularly relevant for theeducation variable which is not available for about half of the sample it seems likely that unreported education levels arenot random but associated with lower levels of education This may help in explaining at least some of the gap in educationlevels across the columns

c This column is based on a random sample of the Israeli population as of 1995 We use only adult population ieindividuals who are 18 years of age or older

d This column is based on a subsample of the population sample The data provide information about car ownership onlyat the household level not at the individual level Thus we define an individual as a car owner if (a) the household owns atleast one car and the individual is the head of the household or (b) the household owns at least two cars and the individualis the spouse of the head of the household

773VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

filing is profitable only under a low deductible(the literature often refers to this second effectas ldquoex post moral hazardrdquo)

To the extent that moral hazard exists abstract-ing from it will likely bias our estimates of riskaversion downward To see this note that adjust-ing behavior will help individuals to self-insureagainst uninsured costs Similarly ex post moralhazard would reduce the value of a low deduct-ible as in the event of a claim the gain from alower deductible would sometimes be less thanthe difference between the two deductible levelsBoth these effects will make a low deductible lessattractive requiring individuals to be even morerisk averse than we estimate in order to purchasemore coverage Below we discuss why in ourview abstracting from moral hazard is a reason-able approximation in this setting

It seems reasonable to conjecture that ceterisparibus insured individuals will drive less care-fully than uninsured ones It may also seemreasonable that the existence of a deductiblemay make individuals more careful about smalldamages to their car However when all choicesinclude a deductible and deductibles are similarin their magnitude it seems less likely thatdriving or care behavior will be affected46 Fi-nally to separately identify moral hazard onewould need another dimension of the data overwhich risk types remain fixed but coveragechoices vary exogenously (Israel 2004 Jaap HAbbring et al forthcoming)

We rely on the data to justify why we abstractfrom the second potential effect that of ex postmoral hazard Figure 3 presents data on theclaim amounts and shows that about 99 percentof the claims filed by policyholders with low-deductible policies were for amounts greaterthan the higher deductible level If the distribu-tion of amounts of potential claims does notvary with the deductible choice and if individ-uals file a claim for any loss that exceeds their

deductible level this suggests that 99 percent ofthe claims would have been filed under eitherdeductible choice making the assumption toabstract from moral hazard not very restrictive

Individuals however may choose not to filea claim even when the claim amount exceedsthe deductible level This may happen due toexperience rating which increases future insur-ance premiums These dynamic effects do notdepend on the deductible level at the time of theclaim so they simply enter in an additive wayUsing our data on individuals who renew theirpolicies with the company we can assess howbig the dynamic effects are These data showthat the price effect of a claim lasts for threeyears and is highest when an individual files hissecond claim within a year In such a case hewould face about a 20 percent increase in hisinsurance premium in the subsequent year 10percent in the year after and 5 percent in thethird year after the claim The regular premiumis about twice the regular deductible amount soan upper bound for the dynamic costs is 70percent of the regular deductible In most casesthe actual dynamic costs are much lower thanthis upper bound (the dynamic costs of say thefirst claim within a year are close to zero) Inaddition an individual can opt out of the con-tract and switch to a different insurance pro-vider This is likely to reduce his dynamic costsbecause in Israel unlike in the United Statesand many other countries there is no publicrecord of past claims Therefore insurance pro-viders can take full advantage of past recordsonly for their past customers For this reasonnew customers will of course face higher pre-miums than existing ones but the premium in-crease would not be as high as it would havebeen with the old insurance provider This isdue to the presence of ldquoinnocentrdquo new custom-ers who are pooled together with the switchers(Cohen 2003)47 Using 70 percent as a conser-vative upper bound Figure 3 shows that about93 percent of those claims filed by individualswith a low deductible would have also been46 This assumption is consistent with Cohen and Einav

(2003) who find no evidence for behavioral response tochanges in seat belt laws The following anecdotal obser-vation also supports this In an informal survey we con-ducted among our colleagues all of them were aware of adeductible in their auto insurance policy but fewer than 20percent knew its level This does not imply that 80 percentof our colleagues did not pay attention to their deductiblechoice at the time the choice was made It does implyhowever that their driving or care behavior cannot dependon the deductible level

47 New customers may voluntarily report their claimshistory to their new insurance provider Voluntary disclo-sure of past claims is as may be expected not truthful Ourdata suggest an unconditional claim rate of 02453 in oursample population Our data on claims history as voluntar-ily disclosed by the same individuals suggest a claim rate of00604 which is four times lower

774 THE AMERICAN ECONOMIC REVIEW JUNE 2007

filed with a regular deductible While this is notnegligible it applies for only a small fraction ofindividuals For the vast majority of them the99 percent discussed above is a more appropri-ate benchmark Therefore ex post moral hazardis unlikely to play a major role in this settingand one can abstract from the loss distributionand focus on claim rates as we do in thispaper48

Additional Cost of an AccidentmdashOur modelassumes that in the event of an accident theonly incurred costs are those associated with thedeductible payment In practice however other

transaction costs may be incurred such as thetime spent for appraisal of the damage the costsassociated with renting a replacement car for theduration of a repair etc Such costs could bereadily incorporated into the model To illus-trate we assume that these costs are known inadvance and are given by a constant c (whichcould in principle vary with each individual)Since c will not vary with the chosen level ofdeductible it will not affect the value d andwill enter the empirical model only through itseffect on d In particular equation (7) willchange to

(19) r

p

d 1

d c

and everything else will remain the same

48 We would not be as comfortable with this statementfor the choice of high and very high deductibles which areat much higher levels This is one additional reason to focusonly on the choice between low and regular deductibles

FIGURE 3 CLAIM DISTRIBUTIONS

Notes This figure plots kernel densities of the claim amounts estimated separately depending on the deductible choice Forease of comparison we normalize the claim amounts by the level of the regular deductible (ie the normalization is invariantto the deductible choice) and truncate the distribution at ten (the truncated part which includes a fat tail outside of the figureaccounts for about 25 percent of the distribution and is roughly similar for both deductible choices) The thick line presentsthe distribution of the claim amounts for individuals who chose a low deductible while the thin line does the same for thosewho chose a regular deductible Clearly both distributions are truncated from below at the deductible level The figure showsthat the distributions are fairly similar Assuming that the claim amount distribution is the same the area below the thickerline between 06 and 1 is the fraction of claims that would fall between the two deductible levels and therefore (absentdynamic incentives) would be filed only if a low deductible were chosen This area (between the two dotted vertical lines)amounts to 13 percent implying that the potential bias arising from restricting attention to claim rate (and abstracting fromthe claim distribution) is quite limited As we discuss in the text dynamic incentives due to experience rating may increasethe costs of filing a claim shifting the region in which the deductible choice matters to the right an upper bound to these costsis about 70 percent of the regular deductible covering an area (between the two dashed vertical lines) that integrates to morethan 7 percent Note however that these dynamic incentives are a very conservative upper bound they apply to less than 15percent of the individuals and do not account for the exit option which significantly reduces these dynamic costs

775VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

This implies that in principle such costs willhave no effect on the results of the counterfac-tual exercise we present later The costs willhowever affect the interpretation of the esti-mates of risk aversion In particular instead ofthe distribution of r we will now be estimatingthe distribution of r(d c)d so the reportedestimates of the coefficient of absolute riskaversion will be biased upward The magnitudeof the bias depends on the size of this transac-tion cost c compared to the average deductibled If c is relatively small the bias is negligibleIf however c is as big as the (average) deduct-ible level all our reported estimates of the levelof risk aversion should be divided by two (butthe coefficients on observables which are semi-elasticities will not change) The intuitionwould be similar but more involved if c variesacross individuals but not proportionally to d

Data about such transaction costs are ofcourse not available The following back-of-the-envelope exercise may provide some guidanceas to the magnitude of such costs We collecteddata from the Israeli police about the annual num-bers of accidents49 accidents with fatalities andaccidents with severe injuries in Israel for theyears 1996ndash1999 We then divided these numbersby our estimate of the total number of auto insur-ance claims in Israel (161859)50 We obtain that159 percent of the claims involve reported acci-dents 22 percent involve accidents with severeinjuries and 03 percent involve fatal accidentsThus the majority of claims reflect small unre-ported accidents perhaps suggesting that theseadditional costs of a claim are often not too large

Deviations from Expected Utility TheorymdashThroughout the paper we restrict attention toexpected utility maximizers Despite much ev-idence in the literature against some of the pre-dictions of expected utility theory it still seemsto us the most natural benchmark to specify andone that facilitates comparison to previous stud-ies We note that expected utility theory is as-sumed it is not and cannot be tested within our

framework Given our cross-sectional analysiswhich in principle allows flexible forms ofunobserved heterogeneity in risk preferencesthere are no testable restrictions imposed byexpected utility theory We should also note thatmuch (but not all) of the documented evidenceagainst expected utility theory arises with ex-treme risk probabilities which are close to zeroor one Our data (and our estimates) are basedon risk probabilities that are all in the range of010 to 035 Over this range expected utilityseems to perform better Finally it is importantto stress two points First at the conceptuallevel it is straightforward to use an alternativetheory of decisions under uncertainty If con-ditional on objective risk individuals vary in asingle dimension the same conceptual modeland empirical strategy can be applied All oneneeds to do is to specify the parameter overwhich decisions vary and construct an indiffer-ence set in the space of the specified parameterand (objective) risk types similar to the onepresented in Figure 2 Second any alternativemodel of decisions under uncertainty would re-quire us to take an even stronger view regardingthe parameterized objective function For exam-ple prospect theory (Daniel Kahneman andAmos Tversky 1979) would require us to pa-rameterize not only the curvature of individu-alsrsquo utility functions but also their referencepoints for which there is no natural choice inour context Similar issues arise if we try toapply decision weights (Tversky and PeterWakker 1995) or measures of overconfidencewith respect to driving ability

F Implications for Profits and Pricing

We now look at how firm profits vary withalternative pricing schemes This exercise isinteresting for several reasons First althoughwe do not use supply-side information for esti-mation it shows how one may incorporate suchinformation in estimation Second it illustratesthe conceptual trade-off faced by a monopolistthat operates in a market with adverse selectionAlthough the conceptual trade-off betweenhigher demand and worse selection is wellknown and has been extensively analyzed in thetheoretical literature quantifying it is importantto understand its empirical relevance Finallywe argued earlier that unobserved heterogeneityin risk aversion seems more important than un-

49 An accident is counted in this measure if it was re-ported to the police and a police officer arrived at the scene

50 This is estimated by taking the number of activepolicies at the end of our sample (45871) dividing it by ourbest guess for the share of the market the company had atthe time (7 percent) and multiplying it by the estimatedclaim rate in our data as computed in Table 2 (0245)

776 THE AMERICAN ECONOMIC REVIEW JUNE 2007

observed heterogeneity in risk The current ex-ercise shows that this conclusion also translatesto pricing and profits

Throughout this section we hold fixed thedistribution of risk and risk aversion the firmfaces Since we have little information about thedeterminants of overall demand faced by thefirm and hence how restrictive it is to hold thedistribution fixed we make the simplifying as-sumption that individuals make their choicessequentially They first choose the insuranceprovider by observing only the price of theregular deductible Once they decide to buy apolicy from the insurer they choose the deduct-ible level This seems a reasonable approxima-tion as the regular deductible is the one alwaysadvertised and initially quoted while the otheroptions are revealed only once the potentialcustomer and the insurance sales person ldquogetinto detailsrdquo Consistent with this assumptionwe assume that the regular premium and de-ductible are dictated by competitive conditionsand we focus on the choice of the low deduct-ible and its associated premium

From the companyrsquos standpoint each indi-vidual can be represented by a random draw of(i ri) from the conditional (on observables)distribution of risk and risk aversion

(20) ln i

ln ri Nxi

xi

2 r

r r2

When analyzing the optimal menu to offer suchan individual the company is assumed to berisk neutral and to maximize expected profitsSuppose the company offered only the regulardeductible and premium (dh ph) Let the ex-pected profits from this strategy be 0 Givenour assumptions we proceed by analyzing howthe firmrsquos profits are affected by a choice of anadditional low deductible option (dl pl) withdl dh and pl ph It is easy to use a changein variables and analyze the choice of d dh dl and p pl ph Expected profits arenow given by

(21) maxdp

0 Prri rii d pp

d Eiri rii d p

The trade-off in the companyrsquos decision isstraightforward Each new customer who chooses

the low combination pays an additional p up-front but saves d for each accident she is in-volved in This translates into two effects thatenter the companyrsquos decision problem The first issimilar to a standard pricing problem higher(lower) price difference (deductible difference)p (d) leads to a higher markup (on those indi-viduals who select the low deductible) but tolower quantity (or probability of purchase) asfewer individuals elect to choose the low deduct-ible This effect enters the profit function throughD(d p) Pr(ri ri(i d p)) The secondcomposition effect arises because of adverse se-lection As the price of the low deductible in-creases those individuals who still elect to choosethe low combination are ceteris paribus thosewith higher risk This effect enters through (dp) E(iri ri(i d p)) Its magnitude andsign depend on the relative heterogeneity of i andri and on the correlation between them Sinceneither D(d p) nor (d p) has a closed-form solution we analyze this decision problemgraphically where D(d p) and (d p) arenumerically computed using simulations from thejoint distribution

We illustrate our analysis by using the meanindividual in the data whose expected ln(i)and ln(ri) are 154 and 1181 respectively(based on Table 4) Such an individual has tochoose between a regular contract of ( phdh) (3189 1595) (in NIS) and a lowcontract of ( pl dl) (3381 957) ie (pd) (191 638) Below we discuss severaladditional counterfactual cases First we con-sider a case with a negative correlation be-tween risk and risk aversion (with the samemagnitude ie 084) Second we con-sider cases when the company ignores unob-served heterogeneity in one of the dimensionsie it views individuals as a draw from theestimated marginal distribution on one dimen-sion with the other dimension known and fixedat its estimated mean We do this exercise foreach dimension separately

To get intuition for the different effects Figure2 presents the estimated distribution in the spaceof (i ri) A small increase (decrease) in p (d)shifts the indifference setup to the right therebymaking some marginal individuals who were pre-viously just to the right of it switch to choosingthe regular deductible The demand trade-off isjust the comparison between the marginal loss ofthe company from all the marginal individuals

777VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

who no longer buy higher coverage with thehigher profits made from the inframarginal indi-viduals who still elect to choose higher coverageFigure 2 also helps in illustrating the effect ofadverse selection and the importance of the cor-relation coefficient As the menu shifts to the rightthe positive correlation implies that the marginalindividuals have higher risk than the average Thismeans that ldquolosingrdquo them (namely havingthem buy less coverage) is not as costly forthe insurance company as such individualsare on average more adversely selected Anegative correlation for example would havemade these marginal individuals more valuablethereby decreasing the incentive to increase pricesor deductibles from the current levels

Figure 4 presents the effects of pricing onprofits by varying the low-deductible levelkeeping the premium charged for it fixed at theobserved price (of p 191) It implies that the

current low deductible benefit of 638 NIS re-sults in additional annual profits of about 368NIS per customer This is about 037 percent oftotal operating profits per customer which isabout 1000 NIS Note however that after sub-tracting the administrative and claim-handlingcosts associated with each customer and claimthe relative magnitude of this effect will behigher Note also that the estimates imply thatthe current low-deductible level is suboptimalBy setting a smaller low-deductible benefit ofd 355 NIS (ie increasing the current lowdeductible by 283 NIS) additional profits canbe increased to 659 NIS51 Of course the find-

51 There is no reason of course to limit the choice of thecompany to only one additional deductible level Moredegrees of freedom in choosing the pricing menu will leadto higher profits

FIGURE 4 COUNTERFACTUALSmdashPROFITS

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) We plot the additional profits fromoffering a low deductible as a function of the attractiveness of a low deductible d dh dl fixing its price at the observedlevel The thick solid line presents the counterfactual profits as implied by the estimated benchmark model The other threecurves illustrate how the profits change in response to changes in the assumptions when the correlation between riskand risk aversion is negative (thin solid line) when there is no heterogeneity in risk aversion (dot-dashed line) and whenthere is no heterogeneity in risk (dashed line) The maxima (argmax) of the four curves respectively are 659 (355)714 (583) 0 and 704 (500) The dotted vertical line represents the observed level of d (638) which implies that theadditional profits from the observed low deductible are 368 NIS per policy

778 THE AMERICAN ECONOMIC REVIEW JUNE 2007

ing that the current pricing is suboptimal mayalso be due to a limitation of the model

Consistent with the intuition discussed aboveFigure 4 also shows that the incentive to in-crease prices (or lower the deductible) is higherwith positive correlation When the correlationbetween risk and risk aversion is negative theoptimal low deductible is lower It also showsthat ignoring either dimension of unobservedheterogeneity has an important effect on pric-ing decisions However while ignoring het-erogeneity in risk does not change thequalitative pattern by much ignoring hetero-geneity in risk aversion completely changesthe pricing analysis In fact given the esti-mated coefficients and the observed prices ifindividuals vary only in risk offering a lowdeductible does not increase profits due to ad-verse selection

Figure 5 breaks down the effect on profits bypresenting the pricing effect on the demand forlow deductible D(d p) and on the compo-sition effect (d p) The former is simplygenerated by the distribution of certainty equiv-alents implied by the joint distribution of i andri (Landsberger and Meilijson 1999) It is S-shaped due to the assumption of lognormal dis-tribution The shape of the composition effect isdriven by the relative variance of i and ri andby the correlation coefficient As the estimatesimply that most of the variation in certaintyequivalents is driven by variation in ri thestrong positive correlation implies that the com-position effect is monotonically decreasing inthe deductible level As the low-deductible op-tion becomes more favorable more peoplechoose it with the most risky individuals choos-ing it first The effect of the deductible level onthe composition effect is dramatically differentwhen the correlation between risk and risk aver-sion is negative When this is the case theobserved relationship between the deductiblelevel and the composition effect is mostly re-versed This is because the effect of risk aver-sion dominates that of adverse selection due toits higher variance

IV Concluding Remarks

The paper makes two separate contributionsFirst from a methodological standpoint we layout a conceptual framework through which one

can formulate and estimate a demand system forindividually customized contracts The key datarequirements for this approach are contractchoices individual choice sets and ex post riskrealizations Since such data may be available inmany other contexts the methodological frame-work may be useful to uncover structural pa-rameters in such settings As an example onecould consider annuity data and use guaranteeperiod choices and mortality data to identifyheterogeneity in risk (mortality) and in prefer-ences for wealth after death (Einav Finkelsteinand Paul Schrimpf 2006) Similarly one couldconsider loan data and use down-paymentchoices and default data to identify heterogene-ity in risk (default) and in liquidity (Will Ad-ams Einav and Jonathan D Levin 2006)Second from an economic standpoint we pro-vide a new set of estimates for the degree andheterogeneity of (absolute) risk aversion and itsrelationship with risk We discuss these below

While our estimates of risk aversion help topredict other related insurance decisions it isnatural to ask to what extent these parametersare relevant in other contexts This is essentiallyan empirical question which can be answeredonly by estimating risk aversion parameters fora variety of bet sizes and in a variety of con-texts Since isolating risk preferences in manycontexts is hard such exercises are rare leavingus with no definite answer for the scope ofmarkets for which our estimates may be rele-vant On one hand Rabin (2000) and Rabin andThaler (2001) argue that different decisions inlife are taken in different contexts and thereforemay be subject to different parameters in theutility function On the other hand classicaltheory suggests that each individual has onevalue function over her lifetime wealth so allrisky decisions take into account the same valuefunction and are therefore subject to the samerisk preferences Our view is somewhere inbetween We are more comfortable with extrap-olation of our risk aversion estimates to setupsthat are closer to the auto insurance marketcontext in which these parameters are esti-mated To assess ldquoclosenessrdquo it is important toconsider various factors over which contextsmay differ Such factors may include bet size aswell as informational and behavioral effectssuch as default options framing and rarity of theevents Since we use claim data to identify risktype we leave everything else to be interpreted as

779VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

FIGURE 5 COUNTERFACTUALSmdashSELECTION

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) Parallel to Figure 4 we breakdown the effects on profits to the share of consumers who choose a low deductible (bottom panel) and to the expected riskof this group (top panel) This is presented by the thick solid line for the estimates of the benchmark model As in Figure 4 wealso present these effects for three additional cases when the correlation between risk and risk aversion is negative (thin solidline) when there is no heterogeneity in risk aversion (dot-dashed line) and when there is no heterogeneity in risk (dashedline) The dotted vertical lines represent the observed level of d (638) for which the share of low deductible is 16 percentand their expected risk is 0264 This may be compared with the corresponding figures in Table 2A of 178 and 0309respectively Note however that the figure presents estimated quantities for the average individual in the data while Table2A presents the average quantities in the data so one should not expect the numbers to fit perfectly

780 THE AMERICAN ECONOMIC REVIEW JUNE 2007

risk aversion As an example overconfidence willbe captured by a lower level of estimated riskaversion so if overconfidence is more importantin auto insurance than in health insurance whenextrapolated to health insurance individuals maybehave as if they are more risk averse than weestimate them to be

With this caveat in mind let us discuss ourfour main findings First we find large hetero-geneity in risk preferences across individualsThis heterogeneity is important in various con-texts (a) it cautions against using pure cross-sectional variation to test expected utilitytheory (b) it may create strong selection ofparticipants into particular markets this maymake participants in voluntary markets quitedifferent in their risk preferences from those inmarkets in which participation is either manda-tory or driven by other factors and (c) it maymake the marginal individual quite differentfrom the average one Models in macroeconom-ics and finance which often use a representativeindividual framework may not be able to cap-ture and account for such differences

Our second set of findings concerns the wayrisk aversion relates to observable characteris-tics Our finding that women are more riskaverse than men has been documented in othersettings The finding that risk preferences ex-hibit a U-shaped pattern over the life cycle maybe interesting to explore further in other con-texts Other findings suggest that the estimatedcoefficient of risk aversion increases with ob-servables that are related to income and wealthAs we agree with the widely held belief of thedecreasing absolute risk aversion property ourpreferred interpretation for this finding is thatwealth and income may be endogenous gener-ating the estimated cross-sectional relationshipWhile lower risk aversion may be associatedwith higher propensity to become an entrepre-neur and thereby have higher wealth it mayalso be associated with lower propensity to saveor invest in education affecting wealth the otherway Therefore one important message of thisfinding is that accounting for heterogeneity inpreferences may be important as representativeconsumer models may provide misleading in-terpretations for otherwise natural results

The last two sets of findings concern therelationship between risk and preferences Sincerisk is particular to the context in which it ismeasured these findings may be sensitive to the

market context and may change once risk takesother forms Even within the auto insurancemarket it is important how risk is measuredMoreover risk in the auto insurance marketmay be conceptually different from risk in othermarkets In many markets risk is independentacross individuals In auto insurance howevermuch of the risk depends on coordinationamong drivers and therefore may be more re-lated to relative not absolute characteristicsFor this reason our finding of positive correla-tion between risk and risk aversion can coexistwith findings of negative correlation in othercontexts (Israel 2005 Finkelstein and McGarry2006) The positive correlation we find is alsoconsistent with the fact that the bivariate probittest in our data provides evidence for adverseselection (Cohen 2005) while similar reduced-form tests in other contexts do not

Finally we find that unobserved heterogene-ity in risk preferences is more important thanheterogeneity in risk This may be driven by thecasual evidence that insurance companies exertmuch effort and resources in collecting con-sumer data which are informative about riskclassification but not about preferences52 Weillustrate the empirical importance of our find-ings for the analysis of optimal contracts in autoinsurance The presence of more than one di-mension of unobserved heterogeneity may dra-matically change the nature of these contractsTheory is still not fully developed for suchmultidimensional screening problems as it typ-ically requires a small number of types (Lands-berger and Meilijson 1999) restricts the twodimensions to be independent of each other(Jean-Charles Rochet and Lars A Stole 2002)or assumes that the number of instrumentsavailable to the monopolist is not smaller thanthe dimension of unobserved heterogeneity(Steven Matthews and John Moore 1987 Rich-ard Arnott and Joseph E Stiglitz 1988)53 MarkArmstrong (1999) may be the closest theoreticalwork to the framework suggested here It cannot

52 This choice of data collection efforts may be justifiedif it is easier for such firms to price discriminate based onrisk but harder to price discriminate based on preferencesA common belief is that without cost-based (ie risk-based) justification for prices price discrimination may leadto consumer backlash

53 See also Smart (2000) Villeneuve (2003) and Jullienet al (2007) for related theoretical results

781VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

be directly applied however as it uses simpli-fying linearity assumptions which would behard to impose in the current context Our re-sults indicate that many applications can benefitfrom extending the theory to include the more

general case such as the one analyzed hereSuch a theory may also serve as a guide forusing supply-side restrictions in similar con-texts Our counterfactual analysis is a very pre-liminary start in this direction

APPENDIX A DESCRIPTION OF THE GIBBS SAMPLER

In this appendix we describe the setup of the Gibbs sampler that we use to estimate the modelOne of the main advantages of the Gibbs sampler is its ability to allow for data augmentation oflatent variables (Tanner and Wong 1987) In our context this amounts to augmenting the individual-specific risk aversion and risk type namely i rii1

n as additional parametersWe can write the model as follows

(22) ln i xi i

(23) ln ri xi vi

(24) choicei 1 if ri ri ()0 if ri ri ()

(25) claimsi Poissoni ti

(26) i

vi

iid

N00

2 r

r r2

Let

2 r

r r2 X

x1

xn

0

0 x1

xn

y r and ui i

vi

The set of parameters for which we want to have a posterior distribution is given by uii1

n The prior specifies that are independent of uii1n have a conventional

diffuse prior We adopt a hierarchical prior for uii1N

(27) uii 1n

iid

N0

(28) 1 Wishart2a Q

782 THE AMERICAN ECONOMIC REVIEW JUNE 2007

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 10: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

probability of a claim are proportional to thelength of the time interval taken into account soit is convenient to think of the contract choice asa commitment for only a short amount of timeThis approach has several advantages First ithelps to account for early cancellations andtruncated policies which together constitute 30percent of the policies in the data18 Second itmakes the deductible choice independent ofother longer-term uncertainties faced by the in-dividual so we can focus on static risk-takingbehavior Third this formulation helps to obtaina simple framework for analysis which is attrac-tive both analytically and computationally19

The expected utility that the individual obtainsfrom the choice of a contract (p d) is given by

(2) v p d

1 tuw pt tuw pt d

We characterize the set of parameters that willmake the individual indifferent between the twooffered contracts This set provides a lower (up-per) bound on the level of risk aversion for indi-viduals who choose the low (high) deductible (fora given ) Thus we analyze the equation v(phdh) v(pl dl) By taking limits with respect to t(and applying LrsquoHopitalrsquos rule) we obtain

(3)

limt30

1

t(u(w pht) u(w plt))

(u(w pht) u(w pht dh)) (u(w plt) u(w plt dl))

pl phuw

uw dl uw dh

or

(4)

pl phuw uw dl uw dh

The last expression has a simple intuition Theright-hand side is the expected gain (in utils) perunit of time from choosing a low deductibleThe left-hand side is the cost of such a choiceper unit of time For the individual to be indif-ferent the expected gains must equal the costs

In our benchmark specification we assumethat the third derivative of the vNM utilityfunction is not too large A Taylor expansion forboth terms on the right-hand side of equation(4) ie u(w d) u(w) du(w) (d22)u13(w) implies that

(5)pl ph

uw dh dluw

1

2dh dldh dlu13w

Let d dh dl 0 p pl ph 0 andd 1frasl2 (dh dl) to obtain

(6)p

duw uw d u13w

or

(7) r u13w

uw

p

d 1

d

where r is the coefficient of absolute risk aver-sion at wealth level w

18 As can be seen in Table 2A 70 percent of the policiesare observed through their full duration (one year) About15 percent are truncated by the end of our observationperiod and the remaining 15 percent are canceled for var-ious reasons such as change in car ownership total-lossaccident or a unilateral decision of the policyholder tochange insurance providers

19 This specification ignores the option value associatedwith not canceling a policy This is not very restrictiveSince experience rating is small and menus do not changeby much this option value is likely to be close to zero Asimple alternative is to assume that individuals behave as ifthey commit for a full year of coverage In such a case themodel will be similar to the one we estimate but willdepend on the functional form of the vNM utility functionand would generally require taking infinite sums (over thepotential realizations for the number of claims within the year)In the special case of quadratic expected utility maximizerswho care only about the mean and variance of the number ofclaims this is easy to solve The result is almost identical to theexpression we subsequently derive in equation (7)

754 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Equation (7) defines an indifference set in thespace of risk and risk aversion which we willrefer to by (r() ) and ((r) r) interchange-ably Both r() and (r) have a closed-formrepresentation a property that will be computa-tionally attractive for estimation20 Both termsare individual specific as they depend on thedeductible menu which varies across individu-als For the rest of the paper we regard eachindividual as associated with a two-dimensionaltype (ri i) An individual with a risk parameter

i who is offered a menu (pih di

h) (pil di

l)will choose the low-deductible contract if andonly if his coefficient of absolute risk aversionsatisfies ri ri(i) Figure 2 presents a graphi-cal illustration

B The Benchmark Econometric Model

The Econometric ModelmdashThe econometricmodel we estimate is fully described by the fiveequations presented in this section Our ob-jective is to estimate the joint distribution of(i ri)mdashthe claim rate and coefficient of abso-lute risk aversionmdashin the population of policy-holders conditional on observables xi The

20 For example estimating the CARA version of themodel (Section IIID) for which r() does not have aclosed-form representation takes almost ten times longer

FIGURE 2 THE INDIVIDUALrsquoS DECISIONmdashA GRAPHICAL ILLUSTRATION

Notes This graph illustrates the individualrsquos decision problem The solid line presents the indifference setmdashequation (7)mdashappliedfor the menu faced by the average individual in the sample Individuals are represented by points in the two-dimensional spaceabove In particular the scattered points are 10000 draws from the joint distribution of risk and risk aversion for the averageindividual (on observables) in the data based on the point estimates of the benchmark model (Table 4) If an individual is eitherto the right of the line (high risk) or above the line (high risk aversion) the low deductible is optimal Adverse selection is capturedby the fact that the line is downward sloping higher-risk individuals require lower levels of risk aversion to choose the lowdeductible Thus in the absence of correlation between risk and risk aversion higher-risk individuals are more likely to choosehigher levels of insurance An individual with i (pidi) will choose a lower deductible even if he is risk neutral ie withprobability one (we do not allow individuals to be risk loving) This does not create an estimation problem because i is notobserved only a posterior distribution for it Any such distribution will have a positive weight on values of i that are below(pidi) Second the indifference set is a function of the menu and in particular of (pidi) and d An increase in (pidi) willshift the set up and to the right and an increase in d will shift the set down and to the left Therefore exogenous shiftsof the menus that make both arguments change in the same direction can make the sets ldquocrossrdquo thereby allowing us toidentify the correlation between risk and risk aversion nonparametrically With positive correlation (shown in the figureby the ldquoright-bendingrdquo shape of the simulated draws) the marginal individuals are relatively high risk therefore creatinga stronger incentive for the insurer to raise the price of the low deductible

755VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

benchmark formulation assumes that (i ri) fol-lows a bivariate lognormal distribution Thuswe can write the model as

(8) ln i xi i

(9) ln ri xi vi

with

(10) i

vi

iid

N00

2 r

r r2

Neither i nor ri is directly observed Thereforewe treat both as latent variables Loosely speak-ing they can be thought of as random effects Weobserve two variables the number of claims andthe deductible choice which are related to thesetwo unobserved components Thus to completeour econometric model we have to specify therelationship between the observed variables andthe latent ones This is done by making two struc-tural assumptions First we assume that the num-ber of claims is drawn from a Poisson distributionnamely

(11) claimsi Poisson(i ti )

where ti is the observed duration of the policySecond we assume that when individuals maketheir deductible choices they follow the theoreti-cal model described in the previous section Themodel implies that individual i chooses the lowdeductible (choicei 1) if and only if ri ri(i)where ri is defined in equation (7) Thus theempirical model for deductible choice is given by

(12) Prchoicei 1 Prri

pi

idi 1

d i

Prexp(xi vi)

pi

exp(xi i)di 1

d i

With no unobserved heterogeneity in risk (i 0) equation (12) reduces to a simple probit In

such a case one can perfectly predict risk from thedata denote it by (xi) and construct an addi-tional covariate ln([pi((xi)di) 1]d i) Giventhe assumption that risk aversion is distributedlognormally running the probit regressionabove and renormalizing the coefficient onthe constructed covariate to 1 (instead ofthe typical normalization of the variance of theerror term to 1) has a structural interpretation withln (ri) as the dependent variable However Cohen(2005) provides evidence of adverse selection inthe data implying the existence of unobservedheterogeneity in risk This makes the simple probitregression misspecified Estimation of the fullmodel is more complicated Once we allow forunobserved heterogeneity in both unobserved riskaversion (vi) and claim rate (i) we have to inte-grate over the two-dimensional region depicted inFigure 2 for estimation

EstimationmdashA natural way to proceed is toestimate the model by maximum likelihoodwhere the likelihood of the data as a function ofthe parameters can be written by integrating outthe latent variables namely

(13) Lclaimsi choicei

Prclaimsi choiceii riPri ri

where is a vector of parameters to be esti-mated While formulating the empirical modelusing likelihood may help our thinking aboutthe data-generating process using maximumlikelihood (or generalized method of moments(GMM)) for estimation is computationally cum-bersome This is because in each iteration itrequires evaluating a separate complex integralfor each individual in the data In contrastMarkov Chain Monte Carlo (MCMC) Gibbssampling is quite attractive in such a case Us-ing data augmentation of latent variables (Mar-tin A Tanner and Wing Hung Wong 1987)according to which we simulate (i ri) and latertreat those simulations as if they are part ofthe data one can avoid evaluating the complexintegrals by sampling from truncated normaldistributions which is significantly less compu-tationally demanding (eg Luc Devroye 1986)This feature combined with the idea of aldquosliced samplerrdquo (Paul Damien John Wake-field and Stephen Walker 1999) to sample from

756 THE AMERICAN ECONOMIC REVIEW JUNE 2007

an unfamiliar posterior distribution makes theuse of a Gibbs sampler quite efficient for ourpurposes Finally the lognormality assumptionimplies that F(ln(i)ri) and F(ln(ri)i) follow a(conditional) normal distribution allowing us torestrict attention to univariate draws furtherreducing the computational burden

The Appendix provides a full description ofthe Gibbs sampler including the conditionaldistributions and the (flat) prior distributions weuse The basic intuition is that conditional onobserving (i ri) for each individual we have asimple linear regression model with two equa-tions The less standard part is to generate drawsfor (i ri) We do this iteratively Conditionalon i the posterior distribution for ln(ri) fol-lows a truncated normal distribution where thetruncation point depends on the menu individ-ual i faces and its direction (from above orbelow) depends on individual irsquos deductiblechoice The final step is to sample from theposterior distribution of ln(i) conditional on riThis is more complicated as we have bothtruncation which arises from adverse selection(just as we do when sampling for ri) and thenumber of claims which provides additionalinformation about the posterior of i Thus theposterior for i takes an unfamiliar form forwhich we use a ldquosliced samplerrdquo

We use 100000 iterations of the Gibbs sam-pler It seems to converge to the stationarydistribution after about 5000 iterations There-fore we drop the first 10000 draws and use thelast 90000 draws of each variable to report ourresults The results are based on the posteriormean and posterior standard deviation fromthese 90000 draws Note that each iterationinvolves generating separate draws of (i ri) foreach individual Performing 100000 iterationsof the benchmark specification (coded in Mat-lab) takes about 60 hours on a Dell Precision530 workstation

C Identification

The parametric version of the model is iden-tified mechanically There are more equationsthan unknowns and no linear dependenciesamong them so (as also verified using MonteCarlo exercises) the model parameters can bebacked out from simulated data Our goal in thissection is not to provide a formal identificationproof Rather we want to provide intuition for

which features of the data allow us to identifyparticular parameters of the model The discus-sion also highlights the assumptions that areessential for identification vis-a-vis those thatare made only for computational convenience(making them in principle testable)

Discussion of Nonparametric Identifica-tionmdashThe main difficulty in identifying themodel arises from the gap between the (exante) risk type i which individuals usewhen choosing a deductible and the (ex post)realization of the number of claims we ob-serve We identify between the variation inrisk types and the variation in risk realizationsusing our parametric distributional assump-tions The key is that the distribution of risktypes can be uniquely backed out from theclaim data alone This allows us to use thedeductible choice as an additional layer ofinformation which identifies unobserved het-erogeneity in risk aversion21 Any distribu-tional assumption that allows us to uniquelyback out the distribution of risk types fromclaim data would be sufficient to identify thedistribution of risk aversion As is customaryin the analysis of count processes we make aparametric assumption that claims are gener-ated by a lognormal mixture of Poisson dis-tributions (Section IIID discusses this furtherand explores an alternative) Using a mixtureenables us to account for adverse selectionthrough unobserved heterogeneity in risk Italso allows us to better fit the tails of theclaim distribution In principle a more flexi-ble mixture or a more flexible claim-generat-ing process could be identified as long as theclaims data are sufficiently rich22

21 Cardon and Hendel (2001) face a similar identificationproblem in the context of health insurance They use vari-ation in coverage choice (analogous to deductible choice) toidentify the variation in health-status signals (analogous torisk types) from the variation in health expenditure (analo-gous to number of claims) They can rely on the coveragechoice to identify this because they make an assumptionregarding unobserved heterogeneity in preferences (iidlogit) We take a different approach as our main goal is toestimate rather than assume the distribution of preferences

22 Although it may seem that the claim data are limited(as they take only integer values between 0 and 5 in ourdata) variation in policy duration generates continuousvariation in the observed claim propensity Of course thisvariation also introduces an additional selection into themodel due to policy cancellations which are potentially

757VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Once the distribution of risk types is iden-tified from claims data the marginal distribu-tion of risk aversion (and its relationship tothe distribution of risk types) is nonparametri-cally identified from the variation in theoffered menus discussed in Section I Thisvariation implies different deductible and pre-mium options to identical (on observables)individuals who purchased insurance at dif-ferent times Different menus lead to differentindifference sets (similar to the one depictedin Figure 2) These sets often cross each otherand nonparametrically identify the distribu-tion of risk aversion and the correlation struc-ture at least within the region in which theindifference sets vary For the tails of thedistribution as is typically the case we haveto rely on parametric assumptions or usebounds The parametric assumption of log-normality we use for most of the paper ismade only for computational convenience

Intuition for the Parametric IdentificationMechanismmdashVariation in the offered menus isimportant for the nonparametric identificationThe parametric assumptions could identify themodel without such variation Thus to keep theintuition simple let us take the bivariate log-normal distribution as given and contrary to thedata assume that all individuals are identical onobservables and face the same menu Supposealso that all individuals are observed for exactlyone year and have up to two claims23 In thissimplified case the model has five parametersto be estimated the mean and variance of risk and

2 the mean and variance of risk aver-sion r and r

2 and the correlation parameter The data can be summarized by five numbersLet 130 131 and 132 1 131 130 be thefraction of individuals with zero one and twoclaims respectively Let 0 1 and 2 be theproportion of individuals who chose a low de-ductible within each ldquoclaim grouprdquo Given ourdistributional assumption about the claim-

generating process we can use 130 and 131 touniquely identify and

2 Loosely isidentified by the average claim rate in the dataand

2 is identified by how fat the tail of theclaim distribution is ie by how large (132131)is compared to (131130) Given and

2 and thelognormality assumption we can (implicitly)construct a posterior distribution of risk typesfor each claim group F(r claims c) andintegrate over it when predicting the deductiblechoice This provides us with three additionalmoments each of the form

(14) Ec Prchoice 1r

dFr claims c dFr

for c 0 1 2 These moments identify thethree remaining parameters of the model rr

2 and Let us now provide more economic content to

the identification argument Using the same ex-ample and conditional on identifying and

2

from the claim data one can think about thedeductible choice data 0 1 2 as a graph(c) The absolute level of the graph identifiesr In the absence of correlation between riskand risk aversion the slope of the graph iden-tifies r

2 with no correlation the slope shouldalways be positive (due to adverse selection)but higher r

2 would imply a flatter graph be-cause more variation in the deductible choiceswill be attributed to variation in risk aversionFinally is identified by the curvature of thegraph The more convex (concave) the graph isthe more positive (negative) is the estimated For example if 0 05 1 051 and 2 099 it is likely that r

2 is high (explaining why0 and 1 are so close) and is highly positive(explaining why 2 is not also close to 1) Incontrast if 0 1 it must mean that thecorrelation between risk and risk aversion isnegative which is the only way the originalpositive correlation induced by adverse selec-tion can be offset This intuition also clarifiesthat the identification of relies on observingindividuals with multiple claims (or differentpolicy durations) and that it is likely to besensitive to the distributional assumptions Thedata (Table 2B) provide a direct (positive) cor-relation between deductible choice and claims

endogenous The results are similar when we use onlyexpired and truncated policies

23 This variation is sufficient (and necessary) to identifythe benchmark model The data provide more variation weobserve up to five claims per individual we observe con-tinuous variation in the policy durations we observe vari-ation in prices and we exploit distributional restrictionsacross individuals with different observable characteristics

758 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The structural assumptions allow us to explainhow much of this correlation can be attributedto adverse selection The remaining correlation(positive or negative) is therefore attributed tocorrelation in the underlying distribution of riskand risk aversion

III Results

A Reduced-Form Estimation

To get an initial sense for the levels of abso-lute risk aversion implied by the data we use asimple back-of-the-envelope exercise We com-pute unconditional averages of p d dh dland d (Table 2A)24 and substitute these valuesin equation (7) The implied coefficient of ab-solute risk aversion is 29 104 NIS125 Thisfigure could be thought of as the average indif-ference point implying that about 18 percent ofthe policyholders have coefficients of absoluterisk aversion exceeding it To convert to USdollar amounts one needs to multiply thesefigures by the average exchange rate (352)resulting in an average indifference point of102 103 $US1 This figure is less than halfof a similar estimate reported by Sydnor (2006)for buyers of homeownerrsquos insurance but about3 and 13 times higher than comparable fig-ures reported by Gertner (1993) and Metrick(1995) respectively for television game showparticipants

Table 3 provides reduced-form analysis ofthe relationship between the observables andour two left-hand-side variables the number ofclaims and the deductible choice26 Column 1reports the estimates from a Poisson regressionof the number of claims on observed character-istics This regression is closely related to therisk equation we estimate in the benchmarkmodel It shows that older people women andpeople with a college education are less likely to

have an accident Bigger more expensiveolder and noncommercial cars are more likelyto be involved in an accident Driving experi-ence and variables associated with less intenseuse of the car reduce accident rates As could beimagined claim propensity is highly correlatedover time past claims are a strong predictor offuture claims Young drivers are 50 to 70 per-cent more likely to be involved in an accidentwith young men significantly more likely thanyoung women Finally as indicated by the trendin the estimated year dummies the accident ratesignificantly declined over time Part of thisdecline is likely due to the decline in accidentrates in Israel in general27 This decline mightalso be partly due to the better selection ofindividuals the company obtained over time asit gained more experience (Cohen 2003)

Columns 2 and 3 of Table 3 present estimatesfrom probit regressions in which the dependentvariable is equal to one if the policyholder chosea low deductible and is equal to zero otherwiseColumn 3 shows the marginal effects of thecovariates on the propensity to choose a lowdeductible These marginal effects do not have astructural interpretation as the choice of lowdeductible depends on its price on risk aver-sion and on risk In this regression we againobserve a strong trend over time Fewer policy-holders chose the low deductible as time wentby One reason for this trend according to thecompany executives is that over time the com-panyrsquos sales persons were directed to focusmainly on the ldquodefaultrdquo regular deductible op-tion28 The effect of other covariates will bediscussed later in the context of the full modelIn unreported probit regressions we also test thequalitative assumptions of the structural model byadding three additional regressors the price ratio(pidi) the average deductible offered d i

24 The unconditional is computed by maximum likeli-hood using the data on claims and observed durations of thepolicies

25 Using the CARA specification as in equation (16) weobtain a slightly lower value of 25 104 NIS1

26 We find positive correlation in the errors of these tworegressions suggesting the existence of adverse selection inthe data and motivating a model with unobserved heteroge-neity in risk This test is similar to the bivariate probit testproposed by Chiappori and Salanie (2000) and replicatesearlier results reported in Cohen (2005)

27 In particular traffic fatalities and counts of trafficaccidents in Israel fell by 11 percent and 18 percent during1998 and 1999 respectively

28 Such biased marketing efforts will bias consumersagainst choosing the low deductible thus making them lookless risk averse This would make our estimate a lowerbound on the true level of risk aversion If only sophisti-cated consumers could see beyond the marketing effort andthis sophistication were related to observables (eg educa-tion) the coefficients on such observables would be biasedupward This is not a major concern given that the coefficientson most of the covariates are fairly stable when we estimatethe benchmark model separately for different years

759VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

TABLE 3mdashNO HETEROGENEITY IN RISK

Variable

Poisson regressiona Probit regressionsc

Dep var Number of claims Dep var 1 if low deductible

(1) (2) (3)

Coeff Std Err IRRb Coeff Std Err dPdX dPdXDemographics Constant 43535 02744 mdash 224994 23562 mdash

Age 00074 00052 09926 01690 00464 00034 00042Age2 989 105 536 105 10001 00017 00005 339 105 455 105Female 00456 00161 09555 08003 01316 00165 00134Family Single omitted omitted omitted

Married 01115 00217 08945 06417 01950 00128 00045Divorced 00717 00346 10743 03191 03127 00064 00044Widower 00155 00527 10156 03441 04573 00069 00025Other (NA) 01347 01755 11442 29518 19836 00511 00370

Education Elementary 00522 00550 09491 02927 04278 00061 00016High school omitted omitted omittedTechnical 00373 00308 10380 05037 02520 00105 00138College 00745 00197 09282 05279 01546 00109 00053Other (NA) 00116 00184 10116 00630 01540 00013 00025

Emigrant 00210 00163 10213 00497 01328 00010 00003

Car attributes Log(Value) 01227 00281 11306 13285 02342 00271 00299Car age 00187 00042 10189 01603 00357 00033 00017Commercial car 01394 00326 08699 09038 02781 00177 00294Log(engine size) 02972 00459 13461 06924 03952 00141 00075

Driving License years 00204 00034 09798 01043 00312 00021 00005License years2 00002 677 105 10002 00015 00005 299 105 167 105

Good driver 00176 00191 09825 07207 01574 00148 00152Any driver 00564 00169 10580 10923 01321 00217 00258Secondary car 00859 00209 09177 00038 01626 00001 00070Business use 01852 00293 12034 07951 02737 00156 00017History length 00527 00110 09486 11218 01450 00228 00171Claims history 06577 00390 19304 04654 05460 00095 00496

Young driver Young driver 05235 00361 16879 28847 06305 00524 00012Gender Male omitted omitted omitted

Female 01475 00288 08629 17959 03135 00396 00195Age 17ndash19 omitted omitted omitted

19ndash21 00701 00532 10726 07800 07509 00153 0014721ndash24 00267 00574 09737 05746 07773 00114 0015624 02082 00567 0812 17869 07328 00397 00136

Experience 1 omitted omitted omitted1ndash3 02416 00458 07854 10175 06577 00217 000093 02827 00532 07538 32513 07386 00762 00410

Company year First year omitted omitted omittedSecond year 00888 00198 09150 44513 01506 00792 00859Third year 00690 00222 09334 85888 01820 01303 01367Fourth year 01817 00232 08339 118277 02102 01616 01734Fifth year 05431 0028 05810 143206 02778 01920 02085

105586Obs 105800 94000 105800Pseudo R2 00162 01364 01296Log likelihood 577459 369595 430869

Significant at the 5 percent confidence levela Maximum likelihood estimates Variation in exposure (policy duration) is accounted forb IRR Incidence rate ratio Each figure should be interpreted as the increasedecrease in claim probability as a result of an increase of one unit in the

right-hand-side variablec There are two separate probit regressions reported in this table Column 2 relies on the deductible choice model and the lognormality assumption As discussed

in the text by including an additional regressor ln([pi((xi)di) 1]d i) (with (xi) predicted from column 1 above) and normalizing its coefficient to 1 we obtaina structural interpretation of this regression with ln (ri) as the dependent variable Thus the reported coefficients are comparable to those estimated for the benchmarkmodel One should be cautious however in interpreting these coefficients Unlike the benchmark model this regression does not allow unobserved heterogeneity inrisk and suffers from some selection bias because observations with high predicted risk rate are omitted (which is why the number of observations is 94000 ratherthan the full sample of 105800) For comparison column 3 reports the marginal effects from a comparable probit regression that uses the full sample and does notcontrol for pricing and predicted risk through the additional structural regressor Column 3 does not have a structural interpretation and its (unreported) coefficientscannot be compared to those estimated from the benchmark model

760 THE AMERICAN ECONOMIC REVIEW JUNE 2007

and the risk rate (xi) as predicted from thePoisson regression of column 1 All three addi-tional regressors enter with the predicted signand with large and highly significant marginaleffects29

Finally column 2 of Table 3 presents animportant specification of the probit regres-sion in which ln([pi(( xi)di) 1]d i) isadded as a regressor and its coefficient isnormalized to 130 As already mentioned ifwe assume that there is no unobserved heter-ogeneity in risk then column 2 is analogousto the ln(ri) equation of the benchmark modelThis restriction of the model is rejected by thedata About 10 percent of the individuals arepredicted to have (xi) (pi di) implying achoice of low deductible for any level of riskaversion Many of these individuals howeverstill choose higher deductible levels Column2 reports the results for the remaining indi-viduals (94000 out of 105800) for which theregression can be run While the signs of theestimated coefficients are similar to those inthe benchmark model presented below therestricted version of the model suggests muchhigher levels and dispersion of risk aversionwell above any reasonable level31 The fullestimation of the benchmark model clearly re-jects this restriction on the model

B Estimation of the Benchmark Model

The Effect of Individual Characteristics onRisk AversionmdashTable 4 presents the estimationresults from the benchmark model The secondcolumn shows how the level of absolute riskaversion is related to individual characteristicsAs the dependent variable is in natural loga-

rithm coefficients on dummy variables can bedirectly interpreted as approximate percentagechanges The vast majority of these coefficientsare quite stable across a wide range of specifi-cations which are mentioned later

The results indicate that women are more riskaverse than men and have a coefficient of ab-solute risk aversion about 20 percent greaterthan that of men These results are consistentwith those of Donkers et al (2001) and Hartoget al (2002) The estimated effect of age sug-gests a nonmonotone pattern of risk preferencesover the life cycle The estimated coefficientsimply that initially (that is at age 18 the youngestindividual in the data) individuals become less riskaverse with age but around the age of 48 indi-viduals start becoming more risk averse32 Mar-ried individuals are estimated to be significantlymore risk averse compared to singles while di-vorced individuals are less (although the coeffi-cient is statistically insignificant)

The analysis suggests that variables that arelikely to be correlated with income or wealthsuch as post-high-school education and thevalue of the car have a positive coefficientindicating that wealthier people have higherlevels of absolute risk aversion Although we donot have data on individualsrsquo income or wealthwe use other proxies for income in additionalspecifications and obtain mixed results Whenwe include as a covariate the average householdincome among those who live in the same zipcode we obtain a significant and negative co-efficient of 0333 (0154) (standard deviationin parentheses) When we match zip code in-come on demographics of the individuals thecoefficient is effectively zero 0036 (0047)and when we use a socioeconomic index ofthe locality in which the individual lives thecoefficient is positive 0127 (0053)33 Thus

29 The estimated marginal effect (z-statistic in parenthe-ses) is 0352 (1376) 16 104 (1481) and 0154(255) for (pidi) d i and (xi) respectively

30 The level is a normalization The sign is estimatedHad the sign on this regressor been positive this would haveimplied a rejection of the model

31 For the implied median level of risk aversion the re-stricted model produces a similar estimate to the estimate wereport below for the benchmark model However since someindividuals who chose the low deductible are estimated to havevery low claim rates the restricted model is ldquoforcedrdquo to estimatevery high risk aversion for these individuals (in contrast thebenchmark model can explain such choices by positive risk re-siduals) resulting in very high dispersion and (due to the lognor-mality assumption) very high average risk aversion which isabout 1025 higher than the benchmark estimates we report below

32 This nonmonotone pattern may explain why age enterswith different signs in the estimation results of Donkers et al(2001) and Hartog et al (2002) A somewhat similar U-shapepattern with respect to age is also reported by Sumit Agarwalet al (2006) in the context of consumer credit markets

33 The full results from these specifications are providedin the online Appendix (available at httpwwwe-aerorgdatajune0720050644_apppdf ) The reason we do not usethese income variables in the benchmark specification istheir imperfect coverage which would require us to omitalmost 20 percent of the individuals Other results fromthese regressions are similar to those we report for thebenchmark specification

761VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

overall we interpret our findings as suggestiveof a nonnegative and perhaps positive associ-ation between incomewealth and absolute riskaversion

At first glance these results may appear tobe inconsistent with the widely held beliefthat absolute risk aversion declines withwealth One should distinguish however be-tween two questions (a) whether for a givenindividual the vNM utility function exhibitsdecreasing absolute risk aversion and (b)how risk preferences vary across individuals

Our results do not speak to the first questionand should not be thought of as a test of thedecreasing absolute risk aversion propertyTesting this property would require observingthe same individual making multiple choicesat different wealth levels Rather our resultsindicate that individuals with greater wealthhave utility functions that involve a greaterdegree of risk aversion It might be thatwealth is endogenous and that risk aversion(or unobserved individual characteristics thatare correlated with it) leads individuals to

TABLE 4mdashTHE BENCHMARK MODEL

Variable Ln() equation Ln(r) equation Additional quantities

Demographics Constant 15406 (00073) 118118 (01032) Var-covar matrix ()

Age 00001 (00026) 00623 (00213) 01498 (00097)Age2 624 106 (263 105) 644 104 (211 104) r 31515 (00773)Female 00006 (00086) 02049 (00643) 08391 (00265)Family Single Omitted Omitted

Married 00198 (00115) 01927 (00974) Unconditional statisticsa

Divorced 00396 (00155) 01754 (01495) Mean 02196 (00013)Widower 00135 (00281) 01320 (02288) Median 02174 (00017)Other (NA) 00557 (00968) 04599 (07397) Std Dev 00483 (00019)

Education Elementary 00194 (00333) 01283 (02156) Mean r 00019 (00002)High school Omitted Omitted Median r 727 106 (756 107)Technical 00017 (00189) 02306 (01341) Std Dev r 00197 (00015)College 00277 (00124) 02177 (00840) Corr(r ) 02067 (00085)Other (NA) 00029 (00107) 00128 (00819)

Emigrant 00030 (00090) 00001 (00651) Obs 105800

Car attributes Log(value) 00794 (00177) 07244 (01272)Car age 00053 (00023) 00411 (00176)Commercial car 00719 (00187) 00313 (01239)Log(engine size) 01299 (00235) 03195 (01847)

Driving License years 00015 (00017) 00157 (00137)License years2 183 105 (351 105) 148 104 (254 104)Good driver 00635 (00112) 00317 (00822)Any driver 00360 (00105) 03000 (00722)Secondary car 00415 (00141) 01209 (00875)Business use 00614 (00134) 03790 (01124)History length 00012 (00052) 03092 (00518)Claims history 01295 (00154) 00459 (01670)

Young driver Young driver 00525 (00253) 02499 (02290)Gender Male Omitted mdash

Female 00355 (00061) mdashAge 17ndash19 Omitted mdash

19ndash21 00387 (00121) mdash21ndash24 00445 (00124) mdash24 00114 (00119) mdash

Experience 1 Omitted mdash1ndash3 00059 (00104) mdash3 00762 (00121) mdash

Company year First year Omitted OmittedSecond year 00771 (00122) 14334 (00853)Third year 00857 (00137) 28459 (01191)Fourth year 01515 (00160) 38089 (01343)Fifth year 04062 (00249) 39525 (01368)

Note Standard deviations based on the draws from the posterior distribution in parentheses Significant at the 5 percent confidence levela Unconditional statistics represent implied quantities for the sample population as a whole ie integrating over the distribution of covariates in the sample (as

well as over the unobserved components)

762 THE AMERICAN ECONOMIC REVIEW JUNE 2007

save more to obtain more education or totake other actions that lead to greaterwealth34

Let us make several additional observationsFirst while owners of more expensive cars ap-pear to have both higher risk exposure andhigher levels of risk aversion owners of biggercars have higher risk exposure but lower levelsof risk aversion This should indicate that thestructure of the model does not constrain therelationship between the coefficients in the twoequations Rather it is the data that ldquospeak uprdquoSecond individuals who are classified by theinsurer as ldquogood driversrdquo indeed have lowerrisk but also appear to have lower risk aversionThis result is somewhat similar to the positivecorrelation between unobserved risk and unob-served risk aversion which we report belowThird policyholders who tend to use the car forbusiness are less risk averse This could bebecause uninsured costs of accidents occurringto such policyholders are tax deductible Fourthpolicyholders who reported three full years ofclaims history are more risk averse but are notdifferent in their risk exposure The attitude thatleads such policyholders to comply with therequest to (voluntarily) report three full years ofclaims history is apparently and not surpris-ingly correlated with higher levels of risk aver-sion In contrast while past claims indicate highrisk they have no significant relationship withrisk aversion Finally we find a strong trendtoward lower levels of risk aversion over timeThis is a replication of the probit results dis-cussed earlier

The Effect of Individual Characteristics onClaim RiskmdashThe first column of Table 4 de-scribes the relationship between observablesand risk exposure Accident risk is higher fordivorced individuals and lower for people witha college education Bigger older more expen-sive and noncommercial cars are all morelikely to be involved in an accident Drivingexperience reduces accident rates as do mea-sures of less intense use of the car while youngdrivers are more exposed to risk Claim propen-sity is highly correlated over time the voluntaryreport of past claims is a strong predictor offuture claims This risk equation produces re-sults that are similar to those of the simplerPoisson regression reported in Table 3 Al-though some of the coefficients lose signifi-cance the magnitude of most coefficients isquite similar The similarity between these twosets of results is to be expected as the riskregression is identified primarily from the dataon claims so incorporating the information ondeductible choice does not qualitatively changethe conceptual identification strategy (see Sec-tion IIC) If the results were not similar thiscould have indicated a misspecification of themodel The slight differences between the riskregressions in Table 3 and Table 4 are drivenmainly by the structural assumptions First thebenchmark model estimates a lognormal mix-ture of Poisson rates rather than a single Pois-son model By incorporating the fatter tails ofthe claim distribution it slightly changes theresults increases the standard errors and de-creases the average predicted claim rate Sec-ond the information on deductible choiceslightly helps us in obtaining more precise es-timates through the correlation structure be-tween the error terms in the two equations

The Implied Level of Risk AversionmdashOne ofthe main goals of the estimation is to obtainmeasures of the level of risk aversion in thepopulation we study Since we use Gibbs sam-pling and augment the latent coefficients ofabsolute risk aversion we can directly obtainthe posterior distribution of various moments ofthis distribution At each iteration of the Gibbssampler we compute the mean standard devi-ation (across individuals) and various percen-tiles of the simulated draws of i and ri and thecorrelation between them The far-right columnof Table 4 reports the averages and standard

34 One may be tempted to interpret the positive wealtheffects as an indirect indication of credit constraints wealth-ier individuals are less credit constrained and therefore canafford to purchase more insurance We do not share thisinterpretation for two reasons First the insurance companyobserves these proxies for wealth and conditions on themwhen setting prices Since the willingness to pay for insur-ance is likely to be correlated with the willingness to pay forthe additional insurance provided by the low deductibleoption premiums already reflect this variation We condi-tion on the level of the premium Second paying less exante implies paying more ex post so applying the creditconstraint argument only to the ex ante payment but not tothe probabilistic ex post deductible payments has no theo-retical foundation Essentially the setup of the model linksthe ex ante decisions to the ex post losses which are bothdriven by the curvature of the vNM utility function

763VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

deviations of these computed quantities over theiterations of the Gibbs sampler35 The impliedrisk aversion of the mean individual is 00019which is about seven times greater than theback-of-the-envelope calculation presented inthe previous section As we assume a lognormal

distribution and estimate a relatively high dis-persion coefficient r the estimates also implysignificantly lower values for the median levelof risk aversion As shown later the qualitativepattern of these results is quite robust acrossspecifications

Table 5 summarizes our findings regardingthe level and dispersion of risk aversion Itpresents two ways to interpret the estimates weobtained from the benchmark model as well ascomparisons to specifications with constant ab-solute risk aversion (CARA) utility and with

35 Note that these estimated quantities are unconditionalIn computing these quantities we integrate over the distri-bution of observable characteristics in the data so onecannot compute these estimates from the estimated param-eter directly

TABLE 5mdashRISK-AVERSION ESTIMATES

Specificationa Absolute risk aversionb Interpretationc Relative risk aversiond

Back-of-the-envelope 10 103 9070 1484Benchmark model

Mean individual 67 103 5605 972225th percentile 23 106 9998 003Median individual 26 105 9974 03775th percentile 29 104 9714 42790th percentile 27 103 7834 390295th percentile 99 103 4937 14327

CARA utilityMean individual 31 103 7651 4436Median individual 34 105 9966 050

Learning modelMean individual 42 103 6886 6140Median individual 56 106 9995 008

Comparable estimatesGertner (1993) 31 104 9699 479Metrick (1995) 66 105 9934 102Holt and Laury (2002)e 32 102 2096 86575Sydnor (2006) 20 103 8329 5395

a The table summarizes the results with respect to the level of risk aversion ldquoBack-of-the-enveloperdquo refers to thecalculation we report in the beginning of Section III ldquobenchmark modelrdquo refers to the results from the benchmark model(Table 4) ldquoCARA utilityrdquo refers to a specification of a CARA utility function and ldquolearning modelrdquo refers to a specificationin which individuals do not know their risk types perfectly (see Section IIID) The last four rows are the closest comparableresults available in the literature

b The second column presents the point estimates for the coefficient of absolute risk aversion converted to $US1 unitsFor the comparable estimates this is given by their estimate of a representative CARA utility maximizer For all otherspecifications this is given by computing the unconditional mean and median in the population using the figures we reportin the mean and median columns of risk aversion in Table 6 (multiplied by 352 the average exchange rate during the periodto convert to US dollars)

c To interpret the absolute risk aversion estimates (ARA) we translate them into x u(w) 1frasl2 u(w 100) 1frasl2 u(w x) That is we report x such that an individual with the estimated ARA is indifferent about participating in a 50ndash50lottery of gaining 100 US dollars and losing x US dollars Note that since our estimate is of absolute risk aversion the quantityx is independent of w To be consistent with the specification we use a quadratic utility function for the back-of-the-envelopebenchmark and learning models and use a CARA utility function for the others

d The last column attempts to translate the ARA estimates into relative risk aversion We follow the literature and do soby multiplying the ARA estimate by average annual income We use the average annual income (after tax) in Israel in 1995(51168 NIS from the Israeli Census) for all our specifications and we use average disposable income in the United Statesin 1987 ($15437) for Gertner (1993) and Metrick (1995) For Holt and Laury (2002) and Sydnor (2006) we use a similarfigure for 2002 ($26974)

e Holt and Laury (2002) do not report a comparable estimate The estimate we provide above is based on estimating aCARA utility model for the 18 subjects in their experiment who participated in the ldquo90rdquo treatment which involved stakescomparable to our setting For these subjects we assume a CARA utility function and a lognormal distribution of theircoefficient of absolute risk aversion The table reports the point estimate of the mean from this distribution

764 THE AMERICAN ECONOMIC REVIEW JUNE 2007

incomplete-information (discussed later) and toother comparable figures in the literature Ourbenchmark estimate suggests that an averagequadratic utility maximizer36 will be indifferentabout participating in a 50ndash50 lottery in whichhe gains $100 or loses $5605 A CARA spec-ification suggests a significantly lower value foraverage risk aversion making the mean individ-ual indifferent about a 50ndash50 lottery of gaining$100 or losing $7651 The results from theincomplete information model for the meanlevel of risk aversion are in between these esti-mates All the results suggest that although themean individual exhibits a significant level ofrisk aversion heterogeneity in risk preferencesis important and the median individual is al-most risk neutral with respect to lotteries of$100 magnitude Although the mean is alwaysgreater than the median under the lognormaldistribution the large difference we find is notimposed In principle we could have obtained ahigh level of risk aversion with less heteroge-neity thereby leading to a smaller differencebetween the mean and the median (the esti-mated distribution of risk types is an example)

Let us briefly discuss the relevance of thecomparison to Gertner (1993) and Metrick(1995) There are two ways one can reconcilethe differences between the estimates Firstboth these papers measure risk aversion fortelevision game show participants these arehighly selected groups in a rather ldquorisk-friendlyrdquo environment37 Second the magni-tudes of the stakes are higher The showparticipants bet on several thousand dollars andmore while our average individual risks muchlower stakes in the range of $100 Thus thedifference in the results may be due to the issuesraised in Rabin (2000) regarding the compara-bility of behavior across different contexts andbet sizes We discuss this further in Section IV

A different way to quantify our estimate is byreporting them in relative terms There is noconsensus in the literature as to the relevantwealth that is taken into account in such deci-

sions Therefore for comparability we followthe closest papers in this respect (eg Gertner1993) and use annual income as the relevantwealth We multiply the estimated coefficient ofabsolute risk aversion by the average annualincome in Israel during the observation periodUnder the (questionable) assumption that an-nual income is a good proxy for the relevantwealth at the time of decision making thisproduct would be a proxy for the coefficient ofrelative risk aversion As Table 5 indicates ourbenchmark specification results in an impliedcoefficient of relative risk aversion of about 97A CARA specification results in a lower coef-ficient of 44 On the other hand the medianestimate for relative risk aversion is well belowone Thus the widely used estimate of a lowsingle-digit coefficient of relative risk aversionfalls somewhere between the median and themean and between the median and the seventy-fifth percentile of the risk aversion distribution

The Relationship between Unobserved Riskand Unobserved Risk AversionmdashTable 4 al-lows us to make observations about the relation-ship between risk and risk aversion We firstdiscuss the relative importance of unobservedheterogeneity of both dimensions In the popu-lation we study unobserved heterogeneity inrisk aversion (r) is much greater than unob-served heterogeneity in risk () This is trueboth in absolute terms (315 compared to 015respectively) and after normalizing by the cor-responding mean level38 using the coefficientof variation as a measure of dispersion (027compared to 01 respectively) It is also true forthe overall unconditional dispersion This couldindicate that selection on risk aversion is moreimportant in our data than adverse selection39 Theright metric to use for such statements is not

36 For such an individual the second-order Taylor ex-pansion we use in Section IIA is exact

37 We suspect that individuals who participate in televi-sion game shows are more adventuresome than the generalpopulation Moreover knowing that the audience mightwish to see them keep betting is likely to further encourageparticipants to take risks

38 All covariates are measured in deviations from theirsample mean so the estimated constant in each equation isthe estimated mean of the left-hand-side variable

39 An additional observation is that given our estimatesobservables account for slightly less than 50 percent of thevariation in ln(ri) but for almost 65 percent of the variation inln(i ) This may seem surprising given the finding that disper-sion in risk aversion is more important and thus should be thefocus of insurance companies However this finding is con-sistent with the conventional wisdom that insurance companiesspend much effort and resources on collecting information thathelps in risk classification but only little effort on informationthat predicts willingness to pay

765VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

entirely clear however as one should projectthese estimated variances onto the same scaleof say willingness to pay or profits Profits forexample are affected directly by risk but not byrisk aversion so the comparison above could bemisleading Therefore we relegate the discus-sion of this issue to Section IIIF where weshow that even when we look at pricing andprofits heterogeneity in risk aversion is moreimportant

Table 4 also indicates a strong and signif-icant positive correlation of 084 between un-observed risk aversion and unobserved riskThis result may be surprising because it isnatural to think that risk aversion with respectto financial decisions is likely to be associatedwith a greater tendency to take precautionsand therefore with lower risk Indeed a recentpaper by Finkelstein and McGarry (2006)supports such intuition by documenting anegative correlation between risk aversionand risk in the market for long-term careinsurance (see also Israel 2005 for the autoinsurance market in Illinois) Our marketmight however be special in ways that couldproduce a positive correlation First in con-trast to most insurance markets where apolicyholderrsquos risk depends on the policy-holderrsquos precautions but not on the precau-tions of others accident risk in the autoinsurance market is a result of an interactionbetween onersquos driving habits and those ofother drivers Second the correlation coeffi-cient may be highly sensitive to the particularway we measure risk and risk aversion Thereare many unobserved omitted factors that arelikely to be related to both dimensions Theintensity of vehicle use for example mightbe an important determinant of risk If indi-viduals who are more risk averse also drivemore miles per year a positive correlationbetween risk and risk aversion could emergeThus our results caution against assumingthat risk and risk aversion are always nega-tively correlated Whether this is the case maydepend on the characteristics of the particularmarket one studies and on the particular mea-sure for risk Indeed one can use estimatedannual mileage to control for one omittedvariable that may potentially work to producea positive correlation between risk aversionand risk Despite its partial coverage in thedata and being considered (by the company)

as unreliable40 controlling for annual mileagereported by policyholders reduces the esti-mated correlation coefficient to 06841 Weview this result as consistent with the possi-bility that underlying unobserved factors thataffect risk play an important role in generat-ing the estimated positive correlation betweenrisk and risk aversion A third explanation forthe positive estimated correlation is the dis-tributional assumption As discussed in Sec-tion IIC the correlation coefficient isprobably the parameter that is most sensitiveto these assumptions Indeed as discussedlater when we change our assumptions aboutthe Poisson process and use an alternativedistribution with extremely thin tails the es-timated correlation coefficient reverses signsWe of course view the assumptions of thebenchmark model as more appropriate andtherefore maintain the view that the data sug-gest a positive correlation Finally note thatwhile the correlation parameter we estimate ishigh the implied unconditional correlationbetween risk and risk aversion is less than025 across all reported specifications This isbecause the coefficients on the same covariate(for example the size of the car or whetherthe car is used for business) often affect riskand risk aversion in opposite directions andbecause of the logarithmic transformation

C Stability of the Risk Aversion Coefficients

Estimating risk preferences is motivated bythe idea that the same (or similar) risk aversionparameter may explain risky decisions acrossmultiple contexts This idea is at the heart of thecurrent debate we mention in the introductionregarding the empirical relevance of expectedutility and whether the standard construct of a

40 Insurance companies typically do not use these self-reported mileage estimates as they are considered unreli-able While companies could verify these estimates at thetime of a claim such reports are hard to enforce Anindividual can always claim that her ex ante estimate waslower than it turned out to be Indeed the estimated elas-ticity of risk with respect to estimated mileage is 006(0012) which seems low suggesting a bias downwardpotentially due to ldquoerrors in variablesrdquo bias

41 The full results from this specification are providedin the online Appendix Other results from this specifi-cation are similar to those we report for the benchmarkspecification

766 THE AMERICAN ECONOMIC REVIEW JUNE 2007

single risk aversion parameter that appliesacross many types of decisions is the right wayto think about consumer behavior To the bestof our knowledge there is no direct empiricalevidence on this issue While we defer much ofthis discussion to the concluding section weverify below that at least over a limited set ofchoices in which we observe the same individ-uals the estimated risk preferences help to ex-plain choices over time and across differentcontexts

Stability across ContextsmdashIdeally onewould like to show that the estimated risk pref-erences are stable across multiple lines of insur-ance and perhaps even across other riskydecisions individuals face Absent data on suchchoices we provide evidence that the estimatedrisk aversion coefficients help in predictingother related risky choices Individuals in ourdata had to make three additional coveragechoices in addition to the main policy choicethat we have analyzed so far They had tochoose whether to purchase coverage for caraudio equipment (bought by 498 percent of thesample of new policyholders) for towing ser-vices (bought by 929 percent) and for wind-shield damage (bought by 954 percent) Thesetypes of coverage are sold by the same companybut are administered by a third party so we donot have information about their realizationTherefore we cannot perform a similar exerciseto the one we perform for the deductible choiceOf course one should not expect these addi-tional coverage choices to be perfectly corre-lated with the deductible choice Even if thesame risk preferences are an important factorfor all of these decisions variation in risk acrosscoverage is also important For example own-ership of an expensive car audio system is likelyto affect the purchase of audio coverage inde-pendent of risk preferences Similarly ownersof old cars may value more coverage for towingservices Risk preferences however should en-ter into all of these coverage choices and there-fore we expect these additional types ofcoverages to be positively correlated with thechoice of a low deductible

We coded these three additional coveragechoices as three dummy variables and use themas additional covariates to verify that they canhelp explain unobserved risk aversion Firstwe add these variables to the probit regression

reported in column 3 of Table 3 They all havea positive statistically and economicallysignificant power in predicting the deductiblechoice42 To verify that these reported correla-tions are not driven by correlation in risks wealso estimate the benchmark model with theseadditional variables as covariates The esti-mated coefficients on the three additional cov-erage choices are positive and two of the threeare significant in both the risk and risk aversionequations they are 0053 (0019) 00002 (0021)and 0057 (0023) in the risk equation and0354 (0064) 0123 (0122) and 0813 (0174)in the risk aversion equation for audio towingand windshield coverage respectively (standarddeviations in parentheses)43 This suggests thatthe estimated risk preferences help in explainingmultiple coverage choices across these relatedcontexts

Stability over TimemdashWe now provide evi-dence that the estimated risk preferences arealso stable over time All the individuals whodecide to renew their policy with the companyafter it expires are at least in principle free tochange their deductible choice However anoverwhelming majority of individuals (morethan 97 percent) do not change their deductiblechoices when they renew Even individuals whoinitially chose the rarely chosen ldquohighrdquo andldquovery highrdquo deductible levels typically do notchange their choices Therefore it is also notsurprising that estimating the model on the firstdeductible choice of individuals and the seconddeductible choice of the same individuals yieldssimilar results as shown at the end of the nextsection Of course changes in deductiblechoices do not necessarily imply changes inrisk preferences Risk may also change overtime and may drive such changes At thesame time while persistence in deductiblechoices over time is consistent with stablerisk preferences it may also be driven bymany other factors For example it is reason-able to think that individuals do not devote

42 They are estimated to increase on average the prob-ability of choosing a low deductible by 0019 (819) 0023(527) and 0041 (758) for audio towing and windshieldcoverage respectively (z stat in parentheses)

43 The full results from this specification are provided in theonline Appendix Other results from this specification aresimilar to those we report for the benchmark specification

767VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

the same amount of thought to their renewaldecision as to their initial choice leading tothe observed persistence This caveat is themain reason why we focus most of the analysisonly on new policies

D Robustness

This section discusses the robustness of themain results Results from all specifications wemention in this section are summarized in Table6 The full results from these specifications arerelegated to the online Appendix

The von NeumannndashMorgenstern Utility Func-tionmdashTo derive the benchmark model we as-sume a negligible third derivative of the vNMutility function This is attractive as it allowsus to summarize risk preferences by a one-dimensional parameter r [u13(w)u(w)]There is however a large literature (eg MilesS Kimball and N Gregory Mankiw 1989) em-phasizing the importance of a (positive) third

derivative of the utility function which leads toprecautionary saving It is therefore importantto address the sensitivity of the results to thisassumption

By allowing a third-order Taylor approxima-tion of equation (4) we obtain

(15) r u13w

uw

p

d 1

d

uw

uw

dh2 dh dl dl

2

6d

which reduces to equation (7) when u(w) 0This illustrates that a positive third derivativeprovides an additional (precautionary) incentiveto insure It also shows that in order to fullydescribe preferences in the presence of a thirdderivative a one-dimensional parameter doesnot suffice one needs to know both r [u13(w)u(w)] and [u(w)u(w)] This makesit less attractive for estimation

TABLE 6mdashROBUSTNESS

Specificationa Sample Obs

Claim risk () Absolute risk aversion (r)

Corr(r ) Mean Median Std dev Mean Median Std dev

Baseline estimatesBenchmark model All new customers 105800 0220 0217 0048 19 103 73 106 0020 0207 0839

The vNM utility functionCARA utility All new customers 105800 0219 0217 0048 87 104 b 98 106 b 0009b 0201 0826

The claim generating processBenchmark model No multiple claims 103260 0182 0180 0040 20 103 28 105 0018 0135 0547Thinner-tail risk distribution All new customers 105800 0205c 0171c 0155c 17 103 19 106 0020 0076 0916

The distribution of risk aversionLower-bound procedure All new customers 105800 mdash mdash mdash 37 104 0 0002 mdash mdash

Incomplete information about riskBenchmark model Experienced drivers 82966 0214 0211 0051 21 103 83 106 0021 0200 0761Benchmark model Inexperienced drivers 22834 0230 0220 0073 30 103 12 107 0032 0186 0572Learning model All new customers 105800 0204 0191 0084 12 103 16 106 0016 0200 0772

Sample SelectionBenchmark model First two years 45739 0244 0235 0066 31 103 26 105 0026 0225 0699Benchmark model Last three years 60061 0203 0201 0043 16 103 34 107 0021 0113 0611Benchmark model Referred by a friend 26434 0213 0205 0065 30 103 84 107 0031 0155 0480Benchmark model Referred by advertising 79366 0219 0216 0051 21 103 76 106 0022 0212 0806Benchmark model Non-stayers 48387 0226 0240 0057 23 103 77 107 0026 0149 0848Benchmark model Stayers 1st choice 57413 0190 0182 0057 29 103 29 105 0024 0152 0463Benchmark model Stayers 2nd choice 57413 0208 0200 0065 30 103 16 105 0026 0211 0637

Note The table presents the key figures from various specifications and subsamples tracing the order they are presented in Section IIID Full results (in the format

of Table 4) from all these specifications are available in the online Appendixa ldquoBenchmark modelrdquo refers to the benchmark specification estimated on various subsamples (the first row replicates the estimates from Table 4) The other

specifications are slightly different and are all described in more detail in the corresponding parts of Section IIIDb The interpretation of r in the CARA model takes a slightly different quantitative meaning when applied to noninfinitesimal lotteries (such as the approximately

$100 stakes we analyze) This is due to the positive third derivative of the CARA utility function compared to the benchmark model in which we assume a small

third derivative Thus these numbers are not fully comparable to the corresponding figures in the other specificationsc The interpretation of in the thinner-tail distribution we estimate is slightly different from the standard Poisson rate which is assumed in the other specifications

Thus these numbers are not fully comparable to the corresponding figures in the other specifications

768 THE AMERICAN ECONOMIC REVIEW JUNE 2007

We can allow for a positive third derivativewithout expanding the dimensionality of pref-erences by imposing a parametric functionalform on the utility function Essentially suchparametric form imposes a relationship between[u13(w)u(w)] and [u(w)u(w)] Two standardforms are those that exhibit CARA and thosethat exhibit constant relative risk aversion(CRRA) CRRA requires us to make additionalassumptions about the relevant (and unob-served) wealth level of each individual makingit less attractive The CARA case is imple-mented below

With CARA utility we substitute u(w) exp(rw) in equation (4) and rearrange toobtain

(16) rp

exprdh exprdl

This equation defines the indifference set Unlikethe benchmark model there is no closed-formrepresentation for ri() a property that makesestimation significantly slower Due to the precau-tionary saving incentive arising from the thirdderivative we can use a lower level of absoluterisk aversion to rationalize the low deductiblechoice given In other words the CARA indif-ference set is flatter in comparison to the bench-mark case depicted in Figure 2 Thus the CARAspecification will generally lead to lower estimatesof the coefficient of absolute risk aversion asshown in Table 6 The mean and dispersion ofestimated absolute risk aversion (but not the me-dian) are smaller by about a factor of two Thegeneral qualitative results however remain thesame As shown in Table 5 the mean individual isquite risk averse with respect to lotteries of $100magnitude the median is close to risk neutral withrespect to such lotteries and heterogeneity in riskaversion is important

The Claim-Generating ProcessmdashIn thebenchmark model we assume that claims aregenerated by a Poisson model This assumptionis both analytically attractive (it is important inderiving equation (4)) and in our view a rea-sonable approximation of the data-generatingprocess for claims The Poisson distribution hasbeen widely used in economics to model acci-dent counts (eg Nancy L Rose 1990 Dionneand Vanasse 1992 Ronald Michener and CarlaTighe 1992) In using this distribution re-

searchers have followed the long history of theuse of Poisson by actuaries and insurance spe-cialists going back to Ladislaus Bortkiewicz(1898)

An important restriction of the Poisson dis-tribution is that its mean and variance are thesame Although some economic studies con-firmed this assumption for particular accidentdata (eg Christopher M Auld et al 2001) it isoften the case that this restriction is falsifiedThe most common deviation from the Poissonrestriction is that of fat tails ie variance that ishigher than the mean This led researchers touse a negative binomial distribution to accom-modate this regularity by introducing a secondparameter which delinks the relationship be-tween the mean and the variance One naturalinterpretation of the fat tails is that of unob-served heterogeneity and the negative binomialdistribution can be viewed as a Gamma mixtureof Poisson processes Consistent with this viewwe assume that the claim-generating processfollows a Poisson process at the individuallevel but allows unobserved heterogeneity inrisk and estimates a lognormal mixture of Pois-son processes which is similar to a negativebinomial The dispersion parameter we esti-mate is a free parameter which is identi-fied by the fatness of the tails of the claimdistribution

While one would like to allow fat tails of theaggregate claim distribution one may criticizethe assumption of a Poisson distribution at theindividual level for having tails that may be ldquotoofatrdquo Recently Jaap H Abbring et al (2003) andMark Israel (2004) provided evidence for neg-ative state dependence in data from auto insur-ance claims similar to the ones we useControlling for heterogeneity across individu-als these papers show that a second claim isless likely than a first This may happen due toexperience rating or perhaps to more carefuldriving and less intense use of a car after anaccident The Poisson distribution assumes thatthe second accident is just as likely so negativestate dependence may suggest thinner tails atthe individual level

To verify that the main results are not sensi-tive to this restriction of the Poisson model weperform two tests First we estimate the bench-mark model on a sample that includes onlyindividuals with one or no claims The model isstill identified using variation in policy durations

769VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

The estimates of the level of risk aversion andits dispersion across individuals remain similarto those of the benchmark model (Table 6) Theestimates of risk and the correlation with riskaversion are significantly lower But this isexpected we selected out of the sample thehigh-risk individuals

As a further test for the sensitivity of theresults to possible negative state dependencewe estimated the model with a different distri-bution at the individual level which gives riseto much thinner tails We could not find anyknown distribution of count variables that givesrise to tails thinner than Poissonrsquos so we madeone up In particular for a given Poisson ratelet pn be the probability of observing n occur-rences The distribution we take to the data issuch that

(17) Prn pn2

yenm0 pm2

Thus this distribution makes the probability ofmultiple claims much lower probably muchmore so than any realistic negative state depen-dence in the data Essentially such a distribu-tion makes the econometrician view individualswith multiple claims as high-risk individualswith much more certainty as it requires muchmore ldquobad luckrdquo for low-risk individuals tohave multiple claims The estimates for thelevel of risk aversion and its dispersion arealmost the same as in the benchmark model(Table 6) The rest of the summary figures aredifferent The difference in the level of risk isnot informative the interpretation of isslightly different given the change in the un-derlying distribution The dispersion in ismuch higher This is a direct result of the thin-tail assumption The dispersion of is identifiedby the tails of the claim distribution at theaggregate level With thinner tails imposed onthe individual distribution more heterogeneityis needed to match the observed tails at theaggregate Finally the correlation coefficient inthis specification changes signs and is now neg-ative and close to 1 This is consistent withour discussion in Section IIC which emphasizesthat the identification of the correlation coeffi-cient is closely tied to the structural assump-tions Once these are changed the estimatedcorrelation coefficient would change too Most

importantly however the mean and dispersionof risk aversion are stable

The Distribution of Absolute Risk Aver-sionmdashIn the benchmark model we assume thatthe coefficient of absolute risk aversion is lognor-mally distributed across individuals Since onlyfew previous studies focused on heterogeneity inrisk preferences there is not much existing evi-dence regarding the distribution of risk prefer-ences The only evidence we are aware of is theexperimental results presented by Steffen Andersenet al (2005) which show a skewed distributionwith a fat right tail which is qualitatively consis-tent with the lognormal distribution we assume

An additional advantage of the normality as-sumption is computational It provides a closed-form conditional distribution allowing us to useonly univariate (rather than bivariate) draws inthe estimation procedure significantly reducingthe computational burden One may be con-cerned about the sensitivity of the results to thisdistributional assumption For example it maydrive the result that the median level of riskaversion is much lower than the mean

Incorporating alternative distributional as-sumptions significantly complicates and slowsthe estimation procedure44 As an alternativewe develop a procedure that we believe pro-vides some guidance as to the sensitivity of theresults to this distributional assumption Thedisadvantage of the procedure is that it cannotaccount for adverse selection Since we foundthat adverse selection is not that important thisexercise is informative The exercise conveysthat the main qualitative results are not drivenby the (fat) tails of the lognormal distributionwe impose Rather they are driven by the highfraction of individuals who chose a low deduct-ible despite being of low risk The model im-plies that such individuals must have a fairlyhigh level of risk aversion

The exercise uses a Gibbs sampler to estimatethe lognormal distribution of i given the co-variates and the observed number of claims Ineach iteration of this Gibbs sampler conditional

44 A simple discrete-type distribution of risk aversion isnot well identified with the given data This is due to thenature of the exercise The distribution is identified bythe binary deductible choice and by the ldquolocalrdquo variation inthe pricing menu This variation is not enough to pin downthe discrete types accurately

770 THE AMERICAN ECONOMIC REVIEW JUNE 2007

on the data and the most recent draws for theindividual irsquos we compute lower bounds forthe level and dispersion of risk aversion (inaddition to generating the draws for ri from theconditional lognormal distribution) To deter-mine the lower bound for the level of risk aver-sion we compute

(18) r i i 0 if choicei 0

max0 pi

idi 113d i

if choicei 1

Namely we make ri as low as possible giventhe assumptions of the model We assume thatindividuals who chose a regular deductible arerisk neutral while individuals who chose a lowdeductible are just indifferent between the twodeductible levels (unless i is high enough inwhich case they are also assumed to be riskneutral) We then compute the average over therirsquos To compute the lower bound for the dis-persion we search for values of rirsquos that areconsistent with the assumptions of the modeland the observed data and that minimize thevariance of ri This turns out to be a simplesearch procedure that is linear in the number ofindividuals

The lower bound of the level of risk aversionthat we obtain from this exercise is 368 104

NIS1 (with standard deviation of 638 106)This level is five times lower than the analogouspoint estimate reported in Table 4 This trans-lates into indifference about participating in a50ndash50 lottery of gaining $100 and losing $885Thus it still suggests a significant level of riskaversion for lotteries of $100 magnitude Simi-larly the conversion to relative risk aversion asin Table 5 implies a relative risk aversion co-efficient of 1883 The result for the lowerbound of the dispersion is 189 103 (withstandard deviation of 816 105) which is tentimes lower than the benchmark estimate Thusthe coefficient of variation for absolute riskaversion declines only by two suggesting ahigh degree of heterogeneity Finally oneshould note that these bounds involve extremeassumptions and are computed separately andtherefore cannot be binding at the same timeThus the ldquocorrectrdquo estimates for both quantities

are likely to be higher closer to the results wereport for the benchmark model

Incomplete Information about RiskmdashThroughout we assume that individuals haveperfect information about their individual-specific objective risk type i This is a strongerassumption than we need Because expectedutility is linear in probabilities it suffices thatindividualsrsquo expected risk rate is equal to theirobjective risk rate ie i E(iIi) i wherei is a random variable representing individualirsquos perceived risk rate and Ii is individual irsquosinformation at the time of the coverage choiceNamely individuals could be uncertain abouttheir risk type but their point estimate has to becorrect

There are several channels through whichincomplete information may operate Let usconsider two such cases First suppose thatindividuals are correct but only on average iethat i i i where E(i) 0 The intuitionfor this case is similar to an ldquoerrors in variablesrdquomodel and in principle will result in an evenless important role for adverse selection Giventhat we find a relatively small effect of adverseselection this bias will not change this conclu-sion This may be even more pronounced ifcorr(i i) 0 which reflects a reasonableassumption of ldquoreversion to the meanrdquo ie thatan individualrsquos estimate of his risk type is someweighted average between his true risk type andthe average risk type of individuals who aresimilar (on observables) to him The conclusionmay go in the other way only if the mistakes goin the other direction according to which indi-viduals who are riskier than average believe thatthey are even more risky than they truly areThis we believe is less plausible

As a test for the sensitivity of the results tothe complete information assumption we per-form two exercises The first exercise estimatesthe benchmark model separately for experi-enced drivers and inexperienced drivers wherewe define an individual to be an experienceddriver if he has ten years or more driving expe-rience This is a conservative definition com-pared to those used in the literature (Chiapporiand Salanie 2000 Cohen 2005) The underlyingassumption consistent with evidence providedin Cohen (2005) is that drivers learn about theirown risk types as they gain more driving expe-rience but that most of this learning occurs

771VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

within the first ten years of driving Thus ourcomplete information model is a more appeal-ing description of experienced drivers Theresults for experienced drivers are almost iden-tical to the results from the benchmark specifi-cation (Table 6) suggesting that the mainresults are not driven by the fraction of individ-uals who are less likely to know their risk typesThe results for inexperienced drivers show asimilar pattern and magnitude Consistent withthe learning hypothesis however they show alarger dispersion in risk preferences which maybe due to incomplete information and there-fore to more random choices of deductiblelevels

The second exercise we perform is a morestructural version of the learning story whichallows even experienced drivers to have incom-plete information regarding their risk types Wedo so by estimating a different specification ofthe model that assumes that individuals areBayesian and update their information abouttheir own (stable) risk types over time usingonly information about the number of claimsthey make each year While we do not observecomplete claims histories of individuals wecan simulate such histories and integrate overthese simulations Thus individualsrsquo informa-tion would be related to their true type andwould be more precise with longer driving his-tories The full modeling and estimation detailsof this specification are provided in the Appen-dix We view this model as an extreme versionof incomplete information as there are manyother sources through which individuals maylearn about their own types and thereby havebetter information about their types than whatwe estimate them to have While the resultsimply that the levels of risk aversion and heter-ogeneity are lower than the benchmark esti-mates (Table 6) the order of magnitude andqualitative pattern are quite similar suggestingthat the main qualitative findings are robust tothe information structure

Sample SelectionmdashThere are two differentways to think about the contract selection pro-cess One possibility is that individuals firstselect an insurer based on advertisement wordof mouth or access to an agent Then individ-uals select the insurance contract from amongseveral contracts the selected insurer offersThis selection process is consistent with the way

the company executives view their business45

Another possibility is that individuals first col-lect information about all available insurancecontracts and then choose their most preferredone According to the industryrsquos conventionalwisdom individuals do not shop much acrossinsurers and therefore this selection processseems less important

The unique characteristics of direct insurersmay attract individuals who are more likely toexperiment with new ways of doing businessand may therefore be less risk averse than thegeneral population In Table 7 we compare thedemographics of our sample of policyholderswith those of the general Israeli populationThis comparison reflects a similar intuitioncompared with the general population our av-erage policyholder is slightly younger moreeducated more likely to be male and less likelyto be married or an immigrant This direction ofselection may also apply to unobserved riskpreferences thereby making our policyholderson average less risk averse than a representa-tive individual This suggests that the level ofrisk aversion that we find may be viewed as alower bound on the level of risk aversion in thegeneral Israeli population

One potential way to model sample selectionis to allow for an additional outside option to beselected For the vast majority of individuals weobserve the outside option is to purchase sim-ilar insurance from competing insurance agen-cies Unfortunately data on the structure ofcompeting contracts their prices and the waythey vary with individual characteristics are un-available This makes us uncomfortable to try tomodel sample selection as results from anysuch model will be driven by our assumptionsrather than by meaningful variation in the dataThe results are still meaningful for two reasonsFirst this is a large population accounting forabout 7 percent of all drivers in Israel Secondto the extent that our estimates suggest higherlevels of risk aversion than previously esti-mated and that the sample selection is likely to

45 As discussed in Section I the company was the firstdirect insurance provider and it offered significantly lowerpremiums than those offered by competitors due to signif-icant cost advantage In Section I we also discuss theliterature which emphasizes that choice of direct insurers isdriven primarily by nonmonetary ldquoamenitiesrdquo

772 THE AMERICAN ECONOMIC REVIEW JUNE 2007

bias these estimates downward the results arestill informative

To assess the magnitude of sample selectionin driving the main results we perform severaltests First we estimate the benchmark modelseparately for the first two years of the compa-nyrsquos operation during which the company wasthe only company to sell insurance directly andfor the remaining three years after additionaldirect insurers entered the market Second weestimate the benchmark model separately forindividuals who were referred to the companyby word of mouth and for those who heardabout the company through advertisement (pri-marily on television) The latter may havesearched more and might be more price sensi-tive Third we estimate the benchmark modelseparately for individuals who renewed theirpolicy with the company and for those who didnot renew It seems likely that individuals whodid not renew are less selected as most of themswitch back to regular insurers who insure themajority of the population Fourth we estimatethe model for the second deductible choicemade by those individuals who renew It couldbe argued that switching to other companies is

costly so outside options are not as important indriving deductible choices for those who renewAll these results are summarized in Table 6 andshow some important differences between thetwo groups within each pair of subsamples prob-ably reflecting selection For all groups howeverthe qualitative pattern of the results and the orderof magnitude and dispersion of risk aversion aresimilar to those of the full sample This is sugges-tive that correcting for sample selection is unlikelyto change the qualitative results

E Caveats

Moral HazardmdashThroughout our analysiswe abstract from moral hazard ie we assumethat i can vary across individuals but is invari-ant to the coverage choice There are two typesof moral hazard that may play a role in thiscontext First individuals with less coveragemay take greater precaution and drive morecarefully thereby reducing their claim risk rateSecond conditional on a claim event peoplewith higher deductibles are less likely to file aclaim there exists a range of claims for which

TABLE 7mdashREPRESENTATIVENESS

Variable Sampleb Populationc Car ownersd

Agea 4114 (1237) 4255 (1801) 4511 (1413)Female 0316 0518 0367Family Single 0143 0233 0067

Married 0780 0651 0838Divorced 0057 0043 0043Widower 0020 0074 0052

Education Elementary 0029 0329 0266High school 0433 0384 0334Technical 0100 0131 0165College 0438 0155 0234

Emigrant 0335 0445 0447

Obs 105800 723615 255435

a For the age variable the only continuous variable in the table we provide both the mean and the standard deviation (inparentheses)

b The figures are derived from Table 1 The family and education variables are renormalized so they add up to one weignore those individuals for whom we do not have family status or education level This is particularly relevant for theeducation variable which is not available for about half of the sample it seems likely that unreported education levels arenot random but associated with lower levels of education This may help in explaining at least some of the gap in educationlevels across the columns

c This column is based on a random sample of the Israeli population as of 1995 We use only adult population ieindividuals who are 18 years of age or older

d This column is based on a subsample of the population sample The data provide information about car ownership onlyat the household level not at the individual level Thus we define an individual as a car owner if (a) the household owns atleast one car and the individual is the head of the household or (b) the household owns at least two cars and the individualis the spouse of the head of the household

773VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

filing is profitable only under a low deductible(the literature often refers to this second effectas ldquoex post moral hazardrdquo)

To the extent that moral hazard exists abstract-ing from it will likely bias our estimates of riskaversion downward To see this note that adjust-ing behavior will help individuals to self-insureagainst uninsured costs Similarly ex post moralhazard would reduce the value of a low deduct-ible as in the event of a claim the gain from alower deductible would sometimes be less thanthe difference between the two deductible levelsBoth these effects will make a low deductible lessattractive requiring individuals to be even morerisk averse than we estimate in order to purchasemore coverage Below we discuss why in ourview abstracting from moral hazard is a reason-able approximation in this setting

It seems reasonable to conjecture that ceterisparibus insured individuals will drive less care-fully than uninsured ones It may also seemreasonable that the existence of a deductiblemay make individuals more careful about smalldamages to their car However when all choicesinclude a deductible and deductibles are similarin their magnitude it seems less likely thatdriving or care behavior will be affected46 Fi-nally to separately identify moral hazard onewould need another dimension of the data overwhich risk types remain fixed but coveragechoices vary exogenously (Israel 2004 Jaap HAbbring et al forthcoming)

We rely on the data to justify why we abstractfrom the second potential effect that of ex postmoral hazard Figure 3 presents data on theclaim amounts and shows that about 99 percentof the claims filed by policyholders with low-deductible policies were for amounts greaterthan the higher deductible level If the distribu-tion of amounts of potential claims does notvary with the deductible choice and if individ-uals file a claim for any loss that exceeds their

deductible level this suggests that 99 percent ofthe claims would have been filed under eitherdeductible choice making the assumption toabstract from moral hazard not very restrictive

Individuals however may choose not to filea claim even when the claim amount exceedsthe deductible level This may happen due toexperience rating which increases future insur-ance premiums These dynamic effects do notdepend on the deductible level at the time of theclaim so they simply enter in an additive wayUsing our data on individuals who renew theirpolicies with the company we can assess howbig the dynamic effects are These data showthat the price effect of a claim lasts for threeyears and is highest when an individual files hissecond claim within a year In such a case hewould face about a 20 percent increase in hisinsurance premium in the subsequent year 10percent in the year after and 5 percent in thethird year after the claim The regular premiumis about twice the regular deductible amount soan upper bound for the dynamic costs is 70percent of the regular deductible In most casesthe actual dynamic costs are much lower thanthis upper bound (the dynamic costs of say thefirst claim within a year are close to zero) Inaddition an individual can opt out of the con-tract and switch to a different insurance pro-vider This is likely to reduce his dynamic costsbecause in Israel unlike in the United Statesand many other countries there is no publicrecord of past claims Therefore insurance pro-viders can take full advantage of past recordsonly for their past customers For this reasonnew customers will of course face higher pre-miums than existing ones but the premium in-crease would not be as high as it would havebeen with the old insurance provider This isdue to the presence of ldquoinnocentrdquo new custom-ers who are pooled together with the switchers(Cohen 2003)47 Using 70 percent as a conser-vative upper bound Figure 3 shows that about93 percent of those claims filed by individualswith a low deductible would have also been46 This assumption is consistent with Cohen and Einav

(2003) who find no evidence for behavioral response tochanges in seat belt laws The following anecdotal obser-vation also supports this In an informal survey we con-ducted among our colleagues all of them were aware of adeductible in their auto insurance policy but fewer than 20percent knew its level This does not imply that 80 percentof our colleagues did not pay attention to their deductiblechoice at the time the choice was made It does implyhowever that their driving or care behavior cannot dependon the deductible level

47 New customers may voluntarily report their claimshistory to their new insurance provider Voluntary disclo-sure of past claims is as may be expected not truthful Ourdata suggest an unconditional claim rate of 02453 in oursample population Our data on claims history as voluntar-ily disclosed by the same individuals suggest a claim rate of00604 which is four times lower

774 THE AMERICAN ECONOMIC REVIEW JUNE 2007

filed with a regular deductible While this is notnegligible it applies for only a small fraction ofindividuals For the vast majority of them the99 percent discussed above is a more appropri-ate benchmark Therefore ex post moral hazardis unlikely to play a major role in this settingand one can abstract from the loss distributionand focus on claim rates as we do in thispaper48

Additional Cost of an AccidentmdashOur modelassumes that in the event of an accident theonly incurred costs are those associated with thedeductible payment In practice however other

transaction costs may be incurred such as thetime spent for appraisal of the damage the costsassociated with renting a replacement car for theduration of a repair etc Such costs could bereadily incorporated into the model To illus-trate we assume that these costs are known inadvance and are given by a constant c (whichcould in principle vary with each individual)Since c will not vary with the chosen level ofdeductible it will not affect the value d andwill enter the empirical model only through itseffect on d In particular equation (7) willchange to

(19) r

p

d 1

d c

and everything else will remain the same

48 We would not be as comfortable with this statementfor the choice of high and very high deductibles which areat much higher levels This is one additional reason to focusonly on the choice between low and regular deductibles

FIGURE 3 CLAIM DISTRIBUTIONS

Notes This figure plots kernel densities of the claim amounts estimated separately depending on the deductible choice Forease of comparison we normalize the claim amounts by the level of the regular deductible (ie the normalization is invariantto the deductible choice) and truncate the distribution at ten (the truncated part which includes a fat tail outside of the figureaccounts for about 25 percent of the distribution and is roughly similar for both deductible choices) The thick line presentsthe distribution of the claim amounts for individuals who chose a low deductible while the thin line does the same for thosewho chose a regular deductible Clearly both distributions are truncated from below at the deductible level The figure showsthat the distributions are fairly similar Assuming that the claim amount distribution is the same the area below the thickerline between 06 and 1 is the fraction of claims that would fall between the two deductible levels and therefore (absentdynamic incentives) would be filed only if a low deductible were chosen This area (between the two dotted vertical lines)amounts to 13 percent implying that the potential bias arising from restricting attention to claim rate (and abstracting fromthe claim distribution) is quite limited As we discuss in the text dynamic incentives due to experience rating may increasethe costs of filing a claim shifting the region in which the deductible choice matters to the right an upper bound to these costsis about 70 percent of the regular deductible covering an area (between the two dashed vertical lines) that integrates to morethan 7 percent Note however that these dynamic incentives are a very conservative upper bound they apply to less than 15percent of the individuals and do not account for the exit option which significantly reduces these dynamic costs

775VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

This implies that in principle such costs willhave no effect on the results of the counterfac-tual exercise we present later The costs willhowever affect the interpretation of the esti-mates of risk aversion In particular instead ofthe distribution of r we will now be estimatingthe distribution of r(d c)d so the reportedestimates of the coefficient of absolute riskaversion will be biased upward The magnitudeof the bias depends on the size of this transac-tion cost c compared to the average deductibled If c is relatively small the bias is negligibleIf however c is as big as the (average) deduct-ible level all our reported estimates of the levelof risk aversion should be divided by two (butthe coefficients on observables which are semi-elasticities will not change) The intuitionwould be similar but more involved if c variesacross individuals but not proportionally to d

Data about such transaction costs are ofcourse not available The following back-of-the-envelope exercise may provide some guidanceas to the magnitude of such costs We collecteddata from the Israeli police about the annual num-bers of accidents49 accidents with fatalities andaccidents with severe injuries in Israel for theyears 1996ndash1999 We then divided these numbersby our estimate of the total number of auto insur-ance claims in Israel (161859)50 We obtain that159 percent of the claims involve reported acci-dents 22 percent involve accidents with severeinjuries and 03 percent involve fatal accidentsThus the majority of claims reflect small unre-ported accidents perhaps suggesting that theseadditional costs of a claim are often not too large

Deviations from Expected Utility TheorymdashThroughout the paper we restrict attention toexpected utility maximizers Despite much ev-idence in the literature against some of the pre-dictions of expected utility theory it still seemsto us the most natural benchmark to specify andone that facilitates comparison to previous stud-ies We note that expected utility theory is as-sumed it is not and cannot be tested within our

framework Given our cross-sectional analysiswhich in principle allows flexible forms ofunobserved heterogeneity in risk preferencesthere are no testable restrictions imposed byexpected utility theory We should also note thatmuch (but not all) of the documented evidenceagainst expected utility theory arises with ex-treme risk probabilities which are close to zeroor one Our data (and our estimates) are basedon risk probabilities that are all in the range of010 to 035 Over this range expected utilityseems to perform better Finally it is importantto stress two points First at the conceptuallevel it is straightforward to use an alternativetheory of decisions under uncertainty If con-ditional on objective risk individuals vary in asingle dimension the same conceptual modeland empirical strategy can be applied All oneneeds to do is to specify the parameter overwhich decisions vary and construct an indiffer-ence set in the space of the specified parameterand (objective) risk types similar to the onepresented in Figure 2 Second any alternativemodel of decisions under uncertainty would re-quire us to take an even stronger view regardingthe parameterized objective function For exam-ple prospect theory (Daniel Kahneman andAmos Tversky 1979) would require us to pa-rameterize not only the curvature of individu-alsrsquo utility functions but also their referencepoints for which there is no natural choice inour context Similar issues arise if we try toapply decision weights (Tversky and PeterWakker 1995) or measures of overconfidencewith respect to driving ability

F Implications for Profits and Pricing

We now look at how firm profits vary withalternative pricing schemes This exercise isinteresting for several reasons First althoughwe do not use supply-side information for esti-mation it shows how one may incorporate suchinformation in estimation Second it illustratesthe conceptual trade-off faced by a monopolistthat operates in a market with adverse selectionAlthough the conceptual trade-off betweenhigher demand and worse selection is wellknown and has been extensively analyzed in thetheoretical literature quantifying it is importantto understand its empirical relevance Finallywe argued earlier that unobserved heterogeneityin risk aversion seems more important than un-

49 An accident is counted in this measure if it was re-ported to the police and a police officer arrived at the scene

50 This is estimated by taking the number of activepolicies at the end of our sample (45871) dividing it by ourbest guess for the share of the market the company had atthe time (7 percent) and multiplying it by the estimatedclaim rate in our data as computed in Table 2 (0245)

776 THE AMERICAN ECONOMIC REVIEW JUNE 2007

observed heterogeneity in risk The current ex-ercise shows that this conclusion also translatesto pricing and profits

Throughout this section we hold fixed thedistribution of risk and risk aversion the firmfaces Since we have little information about thedeterminants of overall demand faced by thefirm and hence how restrictive it is to hold thedistribution fixed we make the simplifying as-sumption that individuals make their choicessequentially They first choose the insuranceprovider by observing only the price of theregular deductible Once they decide to buy apolicy from the insurer they choose the deduct-ible level This seems a reasonable approxima-tion as the regular deductible is the one alwaysadvertised and initially quoted while the otheroptions are revealed only once the potentialcustomer and the insurance sales person ldquogetinto detailsrdquo Consistent with this assumptionwe assume that the regular premium and de-ductible are dictated by competitive conditionsand we focus on the choice of the low deduct-ible and its associated premium

From the companyrsquos standpoint each indi-vidual can be represented by a random draw of(i ri) from the conditional (on observables)distribution of risk and risk aversion

(20) ln i

ln ri Nxi

xi

2 r

r r2

When analyzing the optimal menu to offer suchan individual the company is assumed to berisk neutral and to maximize expected profitsSuppose the company offered only the regulardeductible and premium (dh ph) Let the ex-pected profits from this strategy be 0 Givenour assumptions we proceed by analyzing howthe firmrsquos profits are affected by a choice of anadditional low deductible option (dl pl) withdl dh and pl ph It is easy to use a changein variables and analyze the choice of d dh dl and p pl ph Expected profits arenow given by

(21) maxdp

0 Prri rii d pp

d Eiri rii d p

The trade-off in the companyrsquos decision isstraightforward Each new customer who chooses

the low combination pays an additional p up-front but saves d for each accident she is in-volved in This translates into two effects thatenter the companyrsquos decision problem The first issimilar to a standard pricing problem higher(lower) price difference (deductible difference)p (d) leads to a higher markup (on those indi-viduals who select the low deductible) but tolower quantity (or probability of purchase) asfewer individuals elect to choose the low deduct-ible This effect enters the profit function throughD(d p) Pr(ri ri(i d p)) The secondcomposition effect arises because of adverse se-lection As the price of the low deductible in-creases those individuals who still elect to choosethe low combination are ceteris paribus thosewith higher risk This effect enters through (dp) E(iri ri(i d p)) Its magnitude andsign depend on the relative heterogeneity of i andri and on the correlation between them Sinceneither D(d p) nor (d p) has a closed-form solution we analyze this decision problemgraphically where D(d p) and (d p) arenumerically computed using simulations from thejoint distribution

We illustrate our analysis by using the meanindividual in the data whose expected ln(i)and ln(ri) are 154 and 1181 respectively(based on Table 4) Such an individual has tochoose between a regular contract of ( phdh) (3189 1595) (in NIS) and a lowcontract of ( pl dl) (3381 957) ie (pd) (191 638) Below we discuss severaladditional counterfactual cases First we con-sider a case with a negative correlation be-tween risk and risk aversion (with the samemagnitude ie 084) Second we con-sider cases when the company ignores unob-served heterogeneity in one of the dimensionsie it views individuals as a draw from theestimated marginal distribution on one dimen-sion with the other dimension known and fixedat its estimated mean We do this exercise foreach dimension separately

To get intuition for the different effects Figure2 presents the estimated distribution in the spaceof (i ri) A small increase (decrease) in p (d)shifts the indifference setup to the right therebymaking some marginal individuals who were pre-viously just to the right of it switch to choosingthe regular deductible The demand trade-off isjust the comparison between the marginal loss ofthe company from all the marginal individuals

777VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

who no longer buy higher coverage with thehigher profits made from the inframarginal indi-viduals who still elect to choose higher coverageFigure 2 also helps in illustrating the effect ofadverse selection and the importance of the cor-relation coefficient As the menu shifts to the rightthe positive correlation implies that the marginalindividuals have higher risk than the average Thismeans that ldquolosingrdquo them (namely havingthem buy less coverage) is not as costly forthe insurance company as such individualsare on average more adversely selected Anegative correlation for example would havemade these marginal individuals more valuablethereby decreasing the incentive to increase pricesor deductibles from the current levels

Figure 4 presents the effects of pricing onprofits by varying the low-deductible levelkeeping the premium charged for it fixed at theobserved price (of p 191) It implies that the

current low deductible benefit of 638 NIS re-sults in additional annual profits of about 368NIS per customer This is about 037 percent oftotal operating profits per customer which isabout 1000 NIS Note however that after sub-tracting the administrative and claim-handlingcosts associated with each customer and claimthe relative magnitude of this effect will behigher Note also that the estimates imply thatthe current low-deductible level is suboptimalBy setting a smaller low-deductible benefit ofd 355 NIS (ie increasing the current lowdeductible by 283 NIS) additional profits canbe increased to 659 NIS51 Of course the find-

51 There is no reason of course to limit the choice of thecompany to only one additional deductible level Moredegrees of freedom in choosing the pricing menu will leadto higher profits

FIGURE 4 COUNTERFACTUALSmdashPROFITS

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) We plot the additional profits fromoffering a low deductible as a function of the attractiveness of a low deductible d dh dl fixing its price at the observedlevel The thick solid line presents the counterfactual profits as implied by the estimated benchmark model The other threecurves illustrate how the profits change in response to changes in the assumptions when the correlation between riskand risk aversion is negative (thin solid line) when there is no heterogeneity in risk aversion (dot-dashed line) and whenthere is no heterogeneity in risk (dashed line) The maxima (argmax) of the four curves respectively are 659 (355)714 (583) 0 and 704 (500) The dotted vertical line represents the observed level of d (638) which implies that theadditional profits from the observed low deductible are 368 NIS per policy

778 THE AMERICAN ECONOMIC REVIEW JUNE 2007

ing that the current pricing is suboptimal mayalso be due to a limitation of the model

Consistent with the intuition discussed aboveFigure 4 also shows that the incentive to in-crease prices (or lower the deductible) is higherwith positive correlation When the correlationbetween risk and risk aversion is negative theoptimal low deductible is lower It also showsthat ignoring either dimension of unobservedheterogeneity has an important effect on pric-ing decisions However while ignoring het-erogeneity in risk does not change thequalitative pattern by much ignoring hetero-geneity in risk aversion completely changesthe pricing analysis In fact given the esti-mated coefficients and the observed prices ifindividuals vary only in risk offering a lowdeductible does not increase profits due to ad-verse selection

Figure 5 breaks down the effect on profits bypresenting the pricing effect on the demand forlow deductible D(d p) and on the compo-sition effect (d p) The former is simplygenerated by the distribution of certainty equiv-alents implied by the joint distribution of i andri (Landsberger and Meilijson 1999) It is S-shaped due to the assumption of lognormal dis-tribution The shape of the composition effect isdriven by the relative variance of i and ri andby the correlation coefficient As the estimatesimply that most of the variation in certaintyequivalents is driven by variation in ri thestrong positive correlation implies that the com-position effect is monotonically decreasing inthe deductible level As the low-deductible op-tion becomes more favorable more peoplechoose it with the most risky individuals choos-ing it first The effect of the deductible level onthe composition effect is dramatically differentwhen the correlation between risk and risk aver-sion is negative When this is the case theobserved relationship between the deductiblelevel and the composition effect is mostly re-versed This is because the effect of risk aver-sion dominates that of adverse selection due toits higher variance

IV Concluding Remarks

The paper makes two separate contributionsFirst from a methodological standpoint we layout a conceptual framework through which one

can formulate and estimate a demand system forindividually customized contracts The key datarequirements for this approach are contractchoices individual choice sets and ex post riskrealizations Since such data may be available inmany other contexts the methodological frame-work may be useful to uncover structural pa-rameters in such settings As an example onecould consider annuity data and use guaranteeperiod choices and mortality data to identifyheterogeneity in risk (mortality) and in prefer-ences for wealth after death (Einav Finkelsteinand Paul Schrimpf 2006) Similarly one couldconsider loan data and use down-paymentchoices and default data to identify heterogene-ity in risk (default) and in liquidity (Will Ad-ams Einav and Jonathan D Levin 2006)Second from an economic standpoint we pro-vide a new set of estimates for the degree andheterogeneity of (absolute) risk aversion and itsrelationship with risk We discuss these below

While our estimates of risk aversion help topredict other related insurance decisions it isnatural to ask to what extent these parametersare relevant in other contexts This is essentiallyan empirical question which can be answeredonly by estimating risk aversion parameters fora variety of bet sizes and in a variety of con-texts Since isolating risk preferences in manycontexts is hard such exercises are rare leavingus with no definite answer for the scope ofmarkets for which our estimates may be rele-vant On one hand Rabin (2000) and Rabin andThaler (2001) argue that different decisions inlife are taken in different contexts and thereforemay be subject to different parameters in theutility function On the other hand classicaltheory suggests that each individual has onevalue function over her lifetime wealth so allrisky decisions take into account the same valuefunction and are therefore subject to the samerisk preferences Our view is somewhere inbetween We are more comfortable with extrap-olation of our risk aversion estimates to setupsthat are closer to the auto insurance marketcontext in which these parameters are esti-mated To assess ldquoclosenessrdquo it is important toconsider various factors over which contextsmay differ Such factors may include bet size aswell as informational and behavioral effectssuch as default options framing and rarity of theevents Since we use claim data to identify risktype we leave everything else to be interpreted as

779VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

FIGURE 5 COUNTERFACTUALSmdashSELECTION

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) Parallel to Figure 4 we breakdown the effects on profits to the share of consumers who choose a low deductible (bottom panel) and to the expected riskof this group (top panel) This is presented by the thick solid line for the estimates of the benchmark model As in Figure 4 wealso present these effects for three additional cases when the correlation between risk and risk aversion is negative (thin solidline) when there is no heterogeneity in risk aversion (dot-dashed line) and when there is no heterogeneity in risk (dashedline) The dotted vertical lines represent the observed level of d (638) for which the share of low deductible is 16 percentand their expected risk is 0264 This may be compared with the corresponding figures in Table 2A of 178 and 0309respectively Note however that the figure presents estimated quantities for the average individual in the data while Table2A presents the average quantities in the data so one should not expect the numbers to fit perfectly

780 THE AMERICAN ECONOMIC REVIEW JUNE 2007

risk aversion As an example overconfidence willbe captured by a lower level of estimated riskaversion so if overconfidence is more importantin auto insurance than in health insurance whenextrapolated to health insurance individuals maybehave as if they are more risk averse than weestimate them to be

With this caveat in mind let us discuss ourfour main findings First we find large hetero-geneity in risk preferences across individualsThis heterogeneity is important in various con-texts (a) it cautions against using pure cross-sectional variation to test expected utilitytheory (b) it may create strong selection ofparticipants into particular markets this maymake participants in voluntary markets quitedifferent in their risk preferences from those inmarkets in which participation is either manda-tory or driven by other factors and (c) it maymake the marginal individual quite differentfrom the average one Models in macroeconom-ics and finance which often use a representativeindividual framework may not be able to cap-ture and account for such differences

Our second set of findings concerns the wayrisk aversion relates to observable characteris-tics Our finding that women are more riskaverse than men has been documented in othersettings The finding that risk preferences ex-hibit a U-shaped pattern over the life cycle maybe interesting to explore further in other con-texts Other findings suggest that the estimatedcoefficient of risk aversion increases with ob-servables that are related to income and wealthAs we agree with the widely held belief of thedecreasing absolute risk aversion property ourpreferred interpretation for this finding is thatwealth and income may be endogenous gener-ating the estimated cross-sectional relationshipWhile lower risk aversion may be associatedwith higher propensity to become an entrepre-neur and thereby have higher wealth it mayalso be associated with lower propensity to saveor invest in education affecting wealth the otherway Therefore one important message of thisfinding is that accounting for heterogeneity inpreferences may be important as representativeconsumer models may provide misleading in-terpretations for otherwise natural results

The last two sets of findings concern therelationship between risk and preferences Sincerisk is particular to the context in which it ismeasured these findings may be sensitive to the

market context and may change once risk takesother forms Even within the auto insurancemarket it is important how risk is measuredMoreover risk in the auto insurance marketmay be conceptually different from risk in othermarkets In many markets risk is independentacross individuals In auto insurance howevermuch of the risk depends on coordinationamong drivers and therefore may be more re-lated to relative not absolute characteristicsFor this reason our finding of positive correla-tion between risk and risk aversion can coexistwith findings of negative correlation in othercontexts (Israel 2005 Finkelstein and McGarry2006) The positive correlation we find is alsoconsistent with the fact that the bivariate probittest in our data provides evidence for adverseselection (Cohen 2005) while similar reduced-form tests in other contexts do not

Finally we find that unobserved heterogene-ity in risk preferences is more important thanheterogeneity in risk This may be driven by thecasual evidence that insurance companies exertmuch effort and resources in collecting con-sumer data which are informative about riskclassification but not about preferences52 Weillustrate the empirical importance of our find-ings for the analysis of optimal contracts in autoinsurance The presence of more than one di-mension of unobserved heterogeneity may dra-matically change the nature of these contractsTheory is still not fully developed for suchmultidimensional screening problems as it typ-ically requires a small number of types (Lands-berger and Meilijson 1999) restricts the twodimensions to be independent of each other(Jean-Charles Rochet and Lars A Stole 2002)or assumes that the number of instrumentsavailable to the monopolist is not smaller thanthe dimension of unobserved heterogeneity(Steven Matthews and John Moore 1987 Rich-ard Arnott and Joseph E Stiglitz 1988)53 MarkArmstrong (1999) may be the closest theoreticalwork to the framework suggested here It cannot

52 This choice of data collection efforts may be justifiedif it is easier for such firms to price discriminate based onrisk but harder to price discriminate based on preferencesA common belief is that without cost-based (ie risk-based) justification for prices price discrimination may leadto consumer backlash

53 See also Smart (2000) Villeneuve (2003) and Jullienet al (2007) for related theoretical results

781VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

be directly applied however as it uses simpli-fying linearity assumptions which would behard to impose in the current context Our re-sults indicate that many applications can benefitfrom extending the theory to include the more

general case such as the one analyzed hereSuch a theory may also serve as a guide forusing supply-side restrictions in similar con-texts Our counterfactual analysis is a very pre-liminary start in this direction

APPENDIX A DESCRIPTION OF THE GIBBS SAMPLER

In this appendix we describe the setup of the Gibbs sampler that we use to estimate the modelOne of the main advantages of the Gibbs sampler is its ability to allow for data augmentation oflatent variables (Tanner and Wong 1987) In our context this amounts to augmenting the individual-specific risk aversion and risk type namely i rii1

n as additional parametersWe can write the model as follows

(22) ln i xi i

(23) ln ri xi vi

(24) choicei 1 if ri ri ()0 if ri ri ()

(25) claimsi Poissoni ti

(26) i

vi

iid

N00

2 r

r r2

Let

2 r

r r2 X

x1

xn

0

0 x1

xn

y r and ui i

vi

The set of parameters for which we want to have a posterior distribution is given by uii1

n The prior specifies that are independent of uii1n have a conventional

diffuse prior We adopt a hierarchical prior for uii1N

(27) uii 1n

iid

N0

(28) 1 Wishart2a Q

782 THE AMERICAN ECONOMIC REVIEW JUNE 2007

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 11: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

Equation (7) defines an indifference set in thespace of risk and risk aversion which we willrefer to by (r() ) and ((r) r) interchange-ably Both r() and (r) have a closed-formrepresentation a property that will be computa-tionally attractive for estimation20 Both termsare individual specific as they depend on thedeductible menu which varies across individu-als For the rest of the paper we regard eachindividual as associated with a two-dimensionaltype (ri i) An individual with a risk parameter

i who is offered a menu (pih di

h) (pil di

l)will choose the low-deductible contract if andonly if his coefficient of absolute risk aversionsatisfies ri ri(i) Figure 2 presents a graphi-cal illustration

B The Benchmark Econometric Model

The Econometric ModelmdashThe econometricmodel we estimate is fully described by the fiveequations presented in this section Our ob-jective is to estimate the joint distribution of(i ri)mdashthe claim rate and coefficient of abso-lute risk aversionmdashin the population of policy-holders conditional on observables xi The

20 For example estimating the CARA version of themodel (Section IIID) for which r() does not have aclosed-form representation takes almost ten times longer

FIGURE 2 THE INDIVIDUALrsquoS DECISIONmdashA GRAPHICAL ILLUSTRATION

Notes This graph illustrates the individualrsquos decision problem The solid line presents the indifference setmdashequation (7)mdashappliedfor the menu faced by the average individual in the sample Individuals are represented by points in the two-dimensional spaceabove In particular the scattered points are 10000 draws from the joint distribution of risk and risk aversion for the averageindividual (on observables) in the data based on the point estimates of the benchmark model (Table 4) If an individual is eitherto the right of the line (high risk) or above the line (high risk aversion) the low deductible is optimal Adverse selection is capturedby the fact that the line is downward sloping higher-risk individuals require lower levels of risk aversion to choose the lowdeductible Thus in the absence of correlation between risk and risk aversion higher-risk individuals are more likely to choosehigher levels of insurance An individual with i (pidi) will choose a lower deductible even if he is risk neutral ie withprobability one (we do not allow individuals to be risk loving) This does not create an estimation problem because i is notobserved only a posterior distribution for it Any such distribution will have a positive weight on values of i that are below(pidi) Second the indifference set is a function of the menu and in particular of (pidi) and d An increase in (pidi) willshift the set up and to the right and an increase in d will shift the set down and to the left Therefore exogenous shiftsof the menus that make both arguments change in the same direction can make the sets ldquocrossrdquo thereby allowing us toidentify the correlation between risk and risk aversion nonparametrically With positive correlation (shown in the figureby the ldquoright-bendingrdquo shape of the simulated draws) the marginal individuals are relatively high risk therefore creatinga stronger incentive for the insurer to raise the price of the low deductible

755VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

benchmark formulation assumes that (i ri) fol-lows a bivariate lognormal distribution Thuswe can write the model as

(8) ln i xi i

(9) ln ri xi vi

with

(10) i

vi

iid

N00

2 r

r r2

Neither i nor ri is directly observed Thereforewe treat both as latent variables Loosely speak-ing they can be thought of as random effects Weobserve two variables the number of claims andthe deductible choice which are related to thesetwo unobserved components Thus to completeour econometric model we have to specify therelationship between the observed variables andthe latent ones This is done by making two struc-tural assumptions First we assume that the num-ber of claims is drawn from a Poisson distributionnamely

(11) claimsi Poisson(i ti )

where ti is the observed duration of the policySecond we assume that when individuals maketheir deductible choices they follow the theoreti-cal model described in the previous section Themodel implies that individual i chooses the lowdeductible (choicei 1) if and only if ri ri(i)where ri is defined in equation (7) Thus theempirical model for deductible choice is given by

(12) Prchoicei 1 Prri

pi

idi 1

d i

Prexp(xi vi)

pi

exp(xi i)di 1

d i

With no unobserved heterogeneity in risk (i 0) equation (12) reduces to a simple probit In

such a case one can perfectly predict risk from thedata denote it by (xi) and construct an addi-tional covariate ln([pi((xi)di) 1]d i) Giventhe assumption that risk aversion is distributedlognormally running the probit regressionabove and renormalizing the coefficient onthe constructed covariate to 1 (instead ofthe typical normalization of the variance of theerror term to 1) has a structural interpretation withln (ri) as the dependent variable However Cohen(2005) provides evidence of adverse selection inthe data implying the existence of unobservedheterogeneity in risk This makes the simple probitregression misspecified Estimation of the fullmodel is more complicated Once we allow forunobserved heterogeneity in both unobserved riskaversion (vi) and claim rate (i) we have to inte-grate over the two-dimensional region depicted inFigure 2 for estimation

EstimationmdashA natural way to proceed is toestimate the model by maximum likelihoodwhere the likelihood of the data as a function ofthe parameters can be written by integrating outthe latent variables namely

(13) Lclaimsi choicei

Prclaimsi choiceii riPri ri

where is a vector of parameters to be esti-mated While formulating the empirical modelusing likelihood may help our thinking aboutthe data-generating process using maximumlikelihood (or generalized method of moments(GMM)) for estimation is computationally cum-bersome This is because in each iteration itrequires evaluating a separate complex integralfor each individual in the data In contrastMarkov Chain Monte Carlo (MCMC) Gibbssampling is quite attractive in such a case Us-ing data augmentation of latent variables (Mar-tin A Tanner and Wing Hung Wong 1987)according to which we simulate (i ri) and latertreat those simulations as if they are part ofthe data one can avoid evaluating the complexintegrals by sampling from truncated normaldistributions which is significantly less compu-tationally demanding (eg Luc Devroye 1986)This feature combined with the idea of aldquosliced samplerrdquo (Paul Damien John Wake-field and Stephen Walker 1999) to sample from

756 THE AMERICAN ECONOMIC REVIEW JUNE 2007

an unfamiliar posterior distribution makes theuse of a Gibbs sampler quite efficient for ourpurposes Finally the lognormality assumptionimplies that F(ln(i)ri) and F(ln(ri)i) follow a(conditional) normal distribution allowing us torestrict attention to univariate draws furtherreducing the computational burden

The Appendix provides a full description ofthe Gibbs sampler including the conditionaldistributions and the (flat) prior distributions weuse The basic intuition is that conditional onobserving (i ri) for each individual we have asimple linear regression model with two equa-tions The less standard part is to generate drawsfor (i ri) We do this iteratively Conditionalon i the posterior distribution for ln(ri) fol-lows a truncated normal distribution where thetruncation point depends on the menu individ-ual i faces and its direction (from above orbelow) depends on individual irsquos deductiblechoice The final step is to sample from theposterior distribution of ln(i) conditional on riThis is more complicated as we have bothtruncation which arises from adverse selection(just as we do when sampling for ri) and thenumber of claims which provides additionalinformation about the posterior of i Thus theposterior for i takes an unfamiliar form forwhich we use a ldquosliced samplerrdquo

We use 100000 iterations of the Gibbs sam-pler It seems to converge to the stationarydistribution after about 5000 iterations There-fore we drop the first 10000 draws and use thelast 90000 draws of each variable to report ourresults The results are based on the posteriormean and posterior standard deviation fromthese 90000 draws Note that each iterationinvolves generating separate draws of (i ri) foreach individual Performing 100000 iterationsof the benchmark specification (coded in Mat-lab) takes about 60 hours on a Dell Precision530 workstation

C Identification

The parametric version of the model is iden-tified mechanically There are more equationsthan unknowns and no linear dependenciesamong them so (as also verified using MonteCarlo exercises) the model parameters can bebacked out from simulated data Our goal in thissection is not to provide a formal identificationproof Rather we want to provide intuition for

which features of the data allow us to identifyparticular parameters of the model The discus-sion also highlights the assumptions that areessential for identification vis-a-vis those thatare made only for computational convenience(making them in principle testable)

Discussion of Nonparametric Identifica-tionmdashThe main difficulty in identifying themodel arises from the gap between the (exante) risk type i which individuals usewhen choosing a deductible and the (ex post)realization of the number of claims we ob-serve We identify between the variation inrisk types and the variation in risk realizationsusing our parametric distributional assump-tions The key is that the distribution of risktypes can be uniquely backed out from theclaim data alone This allows us to use thedeductible choice as an additional layer ofinformation which identifies unobserved het-erogeneity in risk aversion21 Any distribu-tional assumption that allows us to uniquelyback out the distribution of risk types fromclaim data would be sufficient to identify thedistribution of risk aversion As is customaryin the analysis of count processes we make aparametric assumption that claims are gener-ated by a lognormal mixture of Poisson dis-tributions (Section IIID discusses this furtherand explores an alternative) Using a mixtureenables us to account for adverse selectionthrough unobserved heterogeneity in risk Italso allows us to better fit the tails of theclaim distribution In principle a more flexi-ble mixture or a more flexible claim-generat-ing process could be identified as long as theclaims data are sufficiently rich22

21 Cardon and Hendel (2001) face a similar identificationproblem in the context of health insurance They use vari-ation in coverage choice (analogous to deductible choice) toidentify the variation in health-status signals (analogous torisk types) from the variation in health expenditure (analo-gous to number of claims) They can rely on the coveragechoice to identify this because they make an assumptionregarding unobserved heterogeneity in preferences (iidlogit) We take a different approach as our main goal is toestimate rather than assume the distribution of preferences

22 Although it may seem that the claim data are limited(as they take only integer values between 0 and 5 in ourdata) variation in policy duration generates continuousvariation in the observed claim propensity Of course thisvariation also introduces an additional selection into themodel due to policy cancellations which are potentially

757VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Once the distribution of risk types is iden-tified from claims data the marginal distribu-tion of risk aversion (and its relationship tothe distribution of risk types) is nonparametri-cally identified from the variation in theoffered menus discussed in Section I Thisvariation implies different deductible and pre-mium options to identical (on observables)individuals who purchased insurance at dif-ferent times Different menus lead to differentindifference sets (similar to the one depictedin Figure 2) These sets often cross each otherand nonparametrically identify the distribu-tion of risk aversion and the correlation struc-ture at least within the region in which theindifference sets vary For the tails of thedistribution as is typically the case we haveto rely on parametric assumptions or usebounds The parametric assumption of log-normality we use for most of the paper ismade only for computational convenience

Intuition for the Parametric IdentificationMechanismmdashVariation in the offered menus isimportant for the nonparametric identificationThe parametric assumptions could identify themodel without such variation Thus to keep theintuition simple let us take the bivariate log-normal distribution as given and contrary to thedata assume that all individuals are identical onobservables and face the same menu Supposealso that all individuals are observed for exactlyone year and have up to two claims23 In thissimplified case the model has five parametersto be estimated the mean and variance of risk and

2 the mean and variance of risk aver-sion r and r

2 and the correlation parameter The data can be summarized by five numbersLet 130 131 and 132 1 131 130 be thefraction of individuals with zero one and twoclaims respectively Let 0 1 and 2 be theproportion of individuals who chose a low de-ductible within each ldquoclaim grouprdquo Given ourdistributional assumption about the claim-

generating process we can use 130 and 131 touniquely identify and

2 Loosely isidentified by the average claim rate in the dataand

2 is identified by how fat the tail of theclaim distribution is ie by how large (132131)is compared to (131130) Given and

2 and thelognormality assumption we can (implicitly)construct a posterior distribution of risk typesfor each claim group F(r claims c) andintegrate over it when predicting the deductiblechoice This provides us with three additionalmoments each of the form

(14) Ec Prchoice 1r

dFr claims c dFr

for c 0 1 2 These moments identify thethree remaining parameters of the model rr

2 and Let us now provide more economic content to

the identification argument Using the same ex-ample and conditional on identifying and

2

from the claim data one can think about thedeductible choice data 0 1 2 as a graph(c) The absolute level of the graph identifiesr In the absence of correlation between riskand risk aversion the slope of the graph iden-tifies r

2 with no correlation the slope shouldalways be positive (due to adverse selection)but higher r

2 would imply a flatter graph be-cause more variation in the deductible choiceswill be attributed to variation in risk aversionFinally is identified by the curvature of thegraph The more convex (concave) the graph isthe more positive (negative) is the estimated For example if 0 05 1 051 and 2 099 it is likely that r

2 is high (explaining why0 and 1 are so close) and is highly positive(explaining why 2 is not also close to 1) Incontrast if 0 1 it must mean that thecorrelation between risk and risk aversion isnegative which is the only way the originalpositive correlation induced by adverse selec-tion can be offset This intuition also clarifiesthat the identification of relies on observingindividuals with multiple claims (or differentpolicy durations) and that it is likely to besensitive to the distributional assumptions Thedata (Table 2B) provide a direct (positive) cor-relation between deductible choice and claims

endogenous The results are similar when we use onlyexpired and truncated policies

23 This variation is sufficient (and necessary) to identifythe benchmark model The data provide more variation weobserve up to five claims per individual we observe con-tinuous variation in the policy durations we observe vari-ation in prices and we exploit distributional restrictionsacross individuals with different observable characteristics

758 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The structural assumptions allow us to explainhow much of this correlation can be attributedto adverse selection The remaining correlation(positive or negative) is therefore attributed tocorrelation in the underlying distribution of riskand risk aversion

III Results

A Reduced-Form Estimation

To get an initial sense for the levels of abso-lute risk aversion implied by the data we use asimple back-of-the-envelope exercise We com-pute unconditional averages of p d dh dland d (Table 2A)24 and substitute these valuesin equation (7) The implied coefficient of ab-solute risk aversion is 29 104 NIS125 Thisfigure could be thought of as the average indif-ference point implying that about 18 percent ofthe policyholders have coefficients of absoluterisk aversion exceeding it To convert to USdollar amounts one needs to multiply thesefigures by the average exchange rate (352)resulting in an average indifference point of102 103 $US1 This figure is less than halfof a similar estimate reported by Sydnor (2006)for buyers of homeownerrsquos insurance but about3 and 13 times higher than comparable fig-ures reported by Gertner (1993) and Metrick(1995) respectively for television game showparticipants

Table 3 provides reduced-form analysis ofthe relationship between the observables andour two left-hand-side variables the number ofclaims and the deductible choice26 Column 1reports the estimates from a Poisson regressionof the number of claims on observed character-istics This regression is closely related to therisk equation we estimate in the benchmarkmodel It shows that older people women andpeople with a college education are less likely to

have an accident Bigger more expensiveolder and noncommercial cars are more likelyto be involved in an accident Driving experi-ence and variables associated with less intenseuse of the car reduce accident rates As could beimagined claim propensity is highly correlatedover time past claims are a strong predictor offuture claims Young drivers are 50 to 70 per-cent more likely to be involved in an accidentwith young men significantly more likely thanyoung women Finally as indicated by the trendin the estimated year dummies the accident ratesignificantly declined over time Part of thisdecline is likely due to the decline in accidentrates in Israel in general27 This decline mightalso be partly due to the better selection ofindividuals the company obtained over time asit gained more experience (Cohen 2003)

Columns 2 and 3 of Table 3 present estimatesfrom probit regressions in which the dependentvariable is equal to one if the policyholder chosea low deductible and is equal to zero otherwiseColumn 3 shows the marginal effects of thecovariates on the propensity to choose a lowdeductible These marginal effects do not have astructural interpretation as the choice of lowdeductible depends on its price on risk aver-sion and on risk In this regression we againobserve a strong trend over time Fewer policy-holders chose the low deductible as time wentby One reason for this trend according to thecompany executives is that over time the com-panyrsquos sales persons were directed to focusmainly on the ldquodefaultrdquo regular deductible op-tion28 The effect of other covariates will bediscussed later in the context of the full modelIn unreported probit regressions we also test thequalitative assumptions of the structural model byadding three additional regressors the price ratio(pidi) the average deductible offered d i

24 The unconditional is computed by maximum likeli-hood using the data on claims and observed durations of thepolicies

25 Using the CARA specification as in equation (16) weobtain a slightly lower value of 25 104 NIS1

26 We find positive correlation in the errors of these tworegressions suggesting the existence of adverse selection inthe data and motivating a model with unobserved heteroge-neity in risk This test is similar to the bivariate probit testproposed by Chiappori and Salanie (2000) and replicatesearlier results reported in Cohen (2005)

27 In particular traffic fatalities and counts of trafficaccidents in Israel fell by 11 percent and 18 percent during1998 and 1999 respectively

28 Such biased marketing efforts will bias consumersagainst choosing the low deductible thus making them lookless risk averse This would make our estimate a lowerbound on the true level of risk aversion If only sophisti-cated consumers could see beyond the marketing effort andthis sophistication were related to observables (eg educa-tion) the coefficients on such observables would be biasedupward This is not a major concern given that the coefficientson most of the covariates are fairly stable when we estimatethe benchmark model separately for different years

759VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

TABLE 3mdashNO HETEROGENEITY IN RISK

Variable

Poisson regressiona Probit regressionsc

Dep var Number of claims Dep var 1 if low deductible

(1) (2) (3)

Coeff Std Err IRRb Coeff Std Err dPdX dPdXDemographics Constant 43535 02744 mdash 224994 23562 mdash

Age 00074 00052 09926 01690 00464 00034 00042Age2 989 105 536 105 10001 00017 00005 339 105 455 105Female 00456 00161 09555 08003 01316 00165 00134Family Single omitted omitted omitted

Married 01115 00217 08945 06417 01950 00128 00045Divorced 00717 00346 10743 03191 03127 00064 00044Widower 00155 00527 10156 03441 04573 00069 00025Other (NA) 01347 01755 11442 29518 19836 00511 00370

Education Elementary 00522 00550 09491 02927 04278 00061 00016High school omitted omitted omittedTechnical 00373 00308 10380 05037 02520 00105 00138College 00745 00197 09282 05279 01546 00109 00053Other (NA) 00116 00184 10116 00630 01540 00013 00025

Emigrant 00210 00163 10213 00497 01328 00010 00003

Car attributes Log(Value) 01227 00281 11306 13285 02342 00271 00299Car age 00187 00042 10189 01603 00357 00033 00017Commercial car 01394 00326 08699 09038 02781 00177 00294Log(engine size) 02972 00459 13461 06924 03952 00141 00075

Driving License years 00204 00034 09798 01043 00312 00021 00005License years2 00002 677 105 10002 00015 00005 299 105 167 105

Good driver 00176 00191 09825 07207 01574 00148 00152Any driver 00564 00169 10580 10923 01321 00217 00258Secondary car 00859 00209 09177 00038 01626 00001 00070Business use 01852 00293 12034 07951 02737 00156 00017History length 00527 00110 09486 11218 01450 00228 00171Claims history 06577 00390 19304 04654 05460 00095 00496

Young driver Young driver 05235 00361 16879 28847 06305 00524 00012Gender Male omitted omitted omitted

Female 01475 00288 08629 17959 03135 00396 00195Age 17ndash19 omitted omitted omitted

19ndash21 00701 00532 10726 07800 07509 00153 0014721ndash24 00267 00574 09737 05746 07773 00114 0015624 02082 00567 0812 17869 07328 00397 00136

Experience 1 omitted omitted omitted1ndash3 02416 00458 07854 10175 06577 00217 000093 02827 00532 07538 32513 07386 00762 00410

Company year First year omitted omitted omittedSecond year 00888 00198 09150 44513 01506 00792 00859Third year 00690 00222 09334 85888 01820 01303 01367Fourth year 01817 00232 08339 118277 02102 01616 01734Fifth year 05431 0028 05810 143206 02778 01920 02085

105586Obs 105800 94000 105800Pseudo R2 00162 01364 01296Log likelihood 577459 369595 430869

Significant at the 5 percent confidence levela Maximum likelihood estimates Variation in exposure (policy duration) is accounted forb IRR Incidence rate ratio Each figure should be interpreted as the increasedecrease in claim probability as a result of an increase of one unit in the

right-hand-side variablec There are two separate probit regressions reported in this table Column 2 relies on the deductible choice model and the lognormality assumption As discussed

in the text by including an additional regressor ln([pi((xi)di) 1]d i) (with (xi) predicted from column 1 above) and normalizing its coefficient to 1 we obtaina structural interpretation of this regression with ln (ri) as the dependent variable Thus the reported coefficients are comparable to those estimated for the benchmarkmodel One should be cautious however in interpreting these coefficients Unlike the benchmark model this regression does not allow unobserved heterogeneity inrisk and suffers from some selection bias because observations with high predicted risk rate are omitted (which is why the number of observations is 94000 ratherthan the full sample of 105800) For comparison column 3 reports the marginal effects from a comparable probit regression that uses the full sample and does notcontrol for pricing and predicted risk through the additional structural regressor Column 3 does not have a structural interpretation and its (unreported) coefficientscannot be compared to those estimated from the benchmark model

760 THE AMERICAN ECONOMIC REVIEW JUNE 2007

and the risk rate (xi) as predicted from thePoisson regression of column 1 All three addi-tional regressors enter with the predicted signand with large and highly significant marginaleffects29

Finally column 2 of Table 3 presents animportant specification of the probit regres-sion in which ln([pi(( xi)di) 1]d i) isadded as a regressor and its coefficient isnormalized to 130 As already mentioned ifwe assume that there is no unobserved heter-ogeneity in risk then column 2 is analogousto the ln(ri) equation of the benchmark modelThis restriction of the model is rejected by thedata About 10 percent of the individuals arepredicted to have (xi) (pi di) implying achoice of low deductible for any level of riskaversion Many of these individuals howeverstill choose higher deductible levels Column2 reports the results for the remaining indi-viduals (94000 out of 105800) for which theregression can be run While the signs of theestimated coefficients are similar to those inthe benchmark model presented below therestricted version of the model suggests muchhigher levels and dispersion of risk aversionwell above any reasonable level31 The fullestimation of the benchmark model clearly re-jects this restriction on the model

B Estimation of the Benchmark Model

The Effect of Individual Characteristics onRisk AversionmdashTable 4 presents the estimationresults from the benchmark model The secondcolumn shows how the level of absolute riskaversion is related to individual characteristicsAs the dependent variable is in natural loga-

rithm coefficients on dummy variables can bedirectly interpreted as approximate percentagechanges The vast majority of these coefficientsare quite stable across a wide range of specifi-cations which are mentioned later

The results indicate that women are more riskaverse than men and have a coefficient of ab-solute risk aversion about 20 percent greaterthan that of men These results are consistentwith those of Donkers et al (2001) and Hartoget al (2002) The estimated effect of age sug-gests a nonmonotone pattern of risk preferencesover the life cycle The estimated coefficientsimply that initially (that is at age 18 the youngestindividual in the data) individuals become less riskaverse with age but around the age of 48 indi-viduals start becoming more risk averse32 Mar-ried individuals are estimated to be significantlymore risk averse compared to singles while di-vorced individuals are less (although the coeffi-cient is statistically insignificant)

The analysis suggests that variables that arelikely to be correlated with income or wealthsuch as post-high-school education and thevalue of the car have a positive coefficientindicating that wealthier people have higherlevels of absolute risk aversion Although we donot have data on individualsrsquo income or wealthwe use other proxies for income in additionalspecifications and obtain mixed results Whenwe include as a covariate the average householdincome among those who live in the same zipcode we obtain a significant and negative co-efficient of 0333 (0154) (standard deviationin parentheses) When we match zip code in-come on demographics of the individuals thecoefficient is effectively zero 0036 (0047)and when we use a socioeconomic index ofthe locality in which the individual lives thecoefficient is positive 0127 (0053)33 Thus

29 The estimated marginal effect (z-statistic in parenthe-ses) is 0352 (1376) 16 104 (1481) and 0154(255) for (pidi) d i and (xi) respectively

30 The level is a normalization The sign is estimatedHad the sign on this regressor been positive this would haveimplied a rejection of the model

31 For the implied median level of risk aversion the re-stricted model produces a similar estimate to the estimate wereport below for the benchmark model However since someindividuals who chose the low deductible are estimated to havevery low claim rates the restricted model is ldquoforcedrdquo to estimatevery high risk aversion for these individuals (in contrast thebenchmark model can explain such choices by positive risk re-siduals) resulting in very high dispersion and (due to the lognor-mality assumption) very high average risk aversion which isabout 1025 higher than the benchmark estimates we report below

32 This nonmonotone pattern may explain why age enterswith different signs in the estimation results of Donkers et al(2001) and Hartog et al (2002) A somewhat similar U-shapepattern with respect to age is also reported by Sumit Agarwalet al (2006) in the context of consumer credit markets

33 The full results from these specifications are providedin the online Appendix (available at httpwwwe-aerorgdatajune0720050644_apppdf ) The reason we do not usethese income variables in the benchmark specification istheir imperfect coverage which would require us to omitalmost 20 percent of the individuals Other results fromthese regressions are similar to those we report for thebenchmark specification

761VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

overall we interpret our findings as suggestiveof a nonnegative and perhaps positive associ-ation between incomewealth and absolute riskaversion

At first glance these results may appear tobe inconsistent with the widely held beliefthat absolute risk aversion declines withwealth One should distinguish however be-tween two questions (a) whether for a givenindividual the vNM utility function exhibitsdecreasing absolute risk aversion and (b)how risk preferences vary across individuals

Our results do not speak to the first questionand should not be thought of as a test of thedecreasing absolute risk aversion propertyTesting this property would require observingthe same individual making multiple choicesat different wealth levels Rather our resultsindicate that individuals with greater wealthhave utility functions that involve a greaterdegree of risk aversion It might be thatwealth is endogenous and that risk aversion(or unobserved individual characteristics thatare correlated with it) leads individuals to

TABLE 4mdashTHE BENCHMARK MODEL

Variable Ln() equation Ln(r) equation Additional quantities

Demographics Constant 15406 (00073) 118118 (01032) Var-covar matrix ()

Age 00001 (00026) 00623 (00213) 01498 (00097)Age2 624 106 (263 105) 644 104 (211 104) r 31515 (00773)Female 00006 (00086) 02049 (00643) 08391 (00265)Family Single Omitted Omitted

Married 00198 (00115) 01927 (00974) Unconditional statisticsa

Divorced 00396 (00155) 01754 (01495) Mean 02196 (00013)Widower 00135 (00281) 01320 (02288) Median 02174 (00017)Other (NA) 00557 (00968) 04599 (07397) Std Dev 00483 (00019)

Education Elementary 00194 (00333) 01283 (02156) Mean r 00019 (00002)High school Omitted Omitted Median r 727 106 (756 107)Technical 00017 (00189) 02306 (01341) Std Dev r 00197 (00015)College 00277 (00124) 02177 (00840) Corr(r ) 02067 (00085)Other (NA) 00029 (00107) 00128 (00819)

Emigrant 00030 (00090) 00001 (00651) Obs 105800

Car attributes Log(value) 00794 (00177) 07244 (01272)Car age 00053 (00023) 00411 (00176)Commercial car 00719 (00187) 00313 (01239)Log(engine size) 01299 (00235) 03195 (01847)

Driving License years 00015 (00017) 00157 (00137)License years2 183 105 (351 105) 148 104 (254 104)Good driver 00635 (00112) 00317 (00822)Any driver 00360 (00105) 03000 (00722)Secondary car 00415 (00141) 01209 (00875)Business use 00614 (00134) 03790 (01124)History length 00012 (00052) 03092 (00518)Claims history 01295 (00154) 00459 (01670)

Young driver Young driver 00525 (00253) 02499 (02290)Gender Male Omitted mdash

Female 00355 (00061) mdashAge 17ndash19 Omitted mdash

19ndash21 00387 (00121) mdash21ndash24 00445 (00124) mdash24 00114 (00119) mdash

Experience 1 Omitted mdash1ndash3 00059 (00104) mdash3 00762 (00121) mdash

Company year First year Omitted OmittedSecond year 00771 (00122) 14334 (00853)Third year 00857 (00137) 28459 (01191)Fourth year 01515 (00160) 38089 (01343)Fifth year 04062 (00249) 39525 (01368)

Note Standard deviations based on the draws from the posterior distribution in parentheses Significant at the 5 percent confidence levela Unconditional statistics represent implied quantities for the sample population as a whole ie integrating over the distribution of covariates in the sample (as

well as over the unobserved components)

762 THE AMERICAN ECONOMIC REVIEW JUNE 2007

save more to obtain more education or totake other actions that lead to greaterwealth34

Let us make several additional observationsFirst while owners of more expensive cars ap-pear to have both higher risk exposure andhigher levels of risk aversion owners of biggercars have higher risk exposure but lower levelsof risk aversion This should indicate that thestructure of the model does not constrain therelationship between the coefficients in the twoequations Rather it is the data that ldquospeak uprdquoSecond individuals who are classified by theinsurer as ldquogood driversrdquo indeed have lowerrisk but also appear to have lower risk aversionThis result is somewhat similar to the positivecorrelation between unobserved risk and unob-served risk aversion which we report belowThird policyholders who tend to use the car forbusiness are less risk averse This could bebecause uninsured costs of accidents occurringto such policyholders are tax deductible Fourthpolicyholders who reported three full years ofclaims history are more risk averse but are notdifferent in their risk exposure The attitude thatleads such policyholders to comply with therequest to (voluntarily) report three full years ofclaims history is apparently and not surpris-ingly correlated with higher levels of risk aver-sion In contrast while past claims indicate highrisk they have no significant relationship withrisk aversion Finally we find a strong trendtoward lower levels of risk aversion over timeThis is a replication of the probit results dis-cussed earlier

The Effect of Individual Characteristics onClaim RiskmdashThe first column of Table 4 de-scribes the relationship between observablesand risk exposure Accident risk is higher fordivorced individuals and lower for people witha college education Bigger older more expen-sive and noncommercial cars are all morelikely to be involved in an accident Drivingexperience reduces accident rates as do mea-sures of less intense use of the car while youngdrivers are more exposed to risk Claim propen-sity is highly correlated over time the voluntaryreport of past claims is a strong predictor offuture claims This risk equation produces re-sults that are similar to those of the simplerPoisson regression reported in Table 3 Al-though some of the coefficients lose signifi-cance the magnitude of most coefficients isquite similar The similarity between these twosets of results is to be expected as the riskregression is identified primarily from the dataon claims so incorporating the information ondeductible choice does not qualitatively changethe conceptual identification strategy (see Sec-tion IIC) If the results were not similar thiscould have indicated a misspecification of themodel The slight differences between the riskregressions in Table 3 and Table 4 are drivenmainly by the structural assumptions First thebenchmark model estimates a lognormal mix-ture of Poisson rates rather than a single Pois-son model By incorporating the fatter tails ofthe claim distribution it slightly changes theresults increases the standard errors and de-creases the average predicted claim rate Sec-ond the information on deductible choiceslightly helps us in obtaining more precise es-timates through the correlation structure be-tween the error terms in the two equations

The Implied Level of Risk AversionmdashOne ofthe main goals of the estimation is to obtainmeasures of the level of risk aversion in thepopulation we study Since we use Gibbs sam-pling and augment the latent coefficients ofabsolute risk aversion we can directly obtainthe posterior distribution of various moments ofthis distribution At each iteration of the Gibbssampler we compute the mean standard devi-ation (across individuals) and various percen-tiles of the simulated draws of i and ri and thecorrelation between them The far-right columnof Table 4 reports the averages and standard

34 One may be tempted to interpret the positive wealtheffects as an indirect indication of credit constraints wealth-ier individuals are less credit constrained and therefore canafford to purchase more insurance We do not share thisinterpretation for two reasons First the insurance companyobserves these proxies for wealth and conditions on themwhen setting prices Since the willingness to pay for insur-ance is likely to be correlated with the willingness to pay forthe additional insurance provided by the low deductibleoption premiums already reflect this variation We condi-tion on the level of the premium Second paying less exante implies paying more ex post so applying the creditconstraint argument only to the ex ante payment but not tothe probabilistic ex post deductible payments has no theo-retical foundation Essentially the setup of the model linksthe ex ante decisions to the ex post losses which are bothdriven by the curvature of the vNM utility function

763VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

deviations of these computed quantities over theiterations of the Gibbs sampler35 The impliedrisk aversion of the mean individual is 00019which is about seven times greater than theback-of-the-envelope calculation presented inthe previous section As we assume a lognormal

distribution and estimate a relatively high dis-persion coefficient r the estimates also implysignificantly lower values for the median levelof risk aversion As shown later the qualitativepattern of these results is quite robust acrossspecifications

Table 5 summarizes our findings regardingthe level and dispersion of risk aversion Itpresents two ways to interpret the estimates weobtained from the benchmark model as well ascomparisons to specifications with constant ab-solute risk aversion (CARA) utility and with

35 Note that these estimated quantities are unconditionalIn computing these quantities we integrate over the distri-bution of observable characteristics in the data so onecannot compute these estimates from the estimated param-eter directly

TABLE 5mdashRISK-AVERSION ESTIMATES

Specificationa Absolute risk aversionb Interpretationc Relative risk aversiond

Back-of-the-envelope 10 103 9070 1484Benchmark model

Mean individual 67 103 5605 972225th percentile 23 106 9998 003Median individual 26 105 9974 03775th percentile 29 104 9714 42790th percentile 27 103 7834 390295th percentile 99 103 4937 14327

CARA utilityMean individual 31 103 7651 4436Median individual 34 105 9966 050

Learning modelMean individual 42 103 6886 6140Median individual 56 106 9995 008

Comparable estimatesGertner (1993) 31 104 9699 479Metrick (1995) 66 105 9934 102Holt and Laury (2002)e 32 102 2096 86575Sydnor (2006) 20 103 8329 5395

a The table summarizes the results with respect to the level of risk aversion ldquoBack-of-the-enveloperdquo refers to thecalculation we report in the beginning of Section III ldquobenchmark modelrdquo refers to the results from the benchmark model(Table 4) ldquoCARA utilityrdquo refers to a specification of a CARA utility function and ldquolearning modelrdquo refers to a specificationin which individuals do not know their risk types perfectly (see Section IIID) The last four rows are the closest comparableresults available in the literature

b The second column presents the point estimates for the coefficient of absolute risk aversion converted to $US1 unitsFor the comparable estimates this is given by their estimate of a representative CARA utility maximizer For all otherspecifications this is given by computing the unconditional mean and median in the population using the figures we reportin the mean and median columns of risk aversion in Table 6 (multiplied by 352 the average exchange rate during the periodto convert to US dollars)

c To interpret the absolute risk aversion estimates (ARA) we translate them into x u(w) 1frasl2 u(w 100) 1frasl2 u(w x) That is we report x such that an individual with the estimated ARA is indifferent about participating in a 50ndash50lottery of gaining 100 US dollars and losing x US dollars Note that since our estimate is of absolute risk aversion the quantityx is independent of w To be consistent with the specification we use a quadratic utility function for the back-of-the-envelopebenchmark and learning models and use a CARA utility function for the others

d The last column attempts to translate the ARA estimates into relative risk aversion We follow the literature and do soby multiplying the ARA estimate by average annual income We use the average annual income (after tax) in Israel in 1995(51168 NIS from the Israeli Census) for all our specifications and we use average disposable income in the United Statesin 1987 ($15437) for Gertner (1993) and Metrick (1995) For Holt and Laury (2002) and Sydnor (2006) we use a similarfigure for 2002 ($26974)

e Holt and Laury (2002) do not report a comparable estimate The estimate we provide above is based on estimating aCARA utility model for the 18 subjects in their experiment who participated in the ldquo90rdquo treatment which involved stakescomparable to our setting For these subjects we assume a CARA utility function and a lognormal distribution of theircoefficient of absolute risk aversion The table reports the point estimate of the mean from this distribution

764 THE AMERICAN ECONOMIC REVIEW JUNE 2007

incomplete-information (discussed later) and toother comparable figures in the literature Ourbenchmark estimate suggests that an averagequadratic utility maximizer36 will be indifferentabout participating in a 50ndash50 lottery in whichhe gains $100 or loses $5605 A CARA spec-ification suggests a significantly lower value foraverage risk aversion making the mean individ-ual indifferent about a 50ndash50 lottery of gaining$100 or losing $7651 The results from theincomplete information model for the meanlevel of risk aversion are in between these esti-mates All the results suggest that although themean individual exhibits a significant level ofrisk aversion heterogeneity in risk preferencesis important and the median individual is al-most risk neutral with respect to lotteries of$100 magnitude Although the mean is alwaysgreater than the median under the lognormaldistribution the large difference we find is notimposed In principle we could have obtained ahigh level of risk aversion with less heteroge-neity thereby leading to a smaller differencebetween the mean and the median (the esti-mated distribution of risk types is an example)

Let us briefly discuss the relevance of thecomparison to Gertner (1993) and Metrick(1995) There are two ways one can reconcilethe differences between the estimates Firstboth these papers measure risk aversion fortelevision game show participants these arehighly selected groups in a rather ldquorisk-friendlyrdquo environment37 Second the magni-tudes of the stakes are higher The showparticipants bet on several thousand dollars andmore while our average individual risks muchlower stakes in the range of $100 Thus thedifference in the results may be due to the issuesraised in Rabin (2000) regarding the compara-bility of behavior across different contexts andbet sizes We discuss this further in Section IV

A different way to quantify our estimate is byreporting them in relative terms There is noconsensus in the literature as to the relevantwealth that is taken into account in such deci-

sions Therefore for comparability we followthe closest papers in this respect (eg Gertner1993) and use annual income as the relevantwealth We multiply the estimated coefficient ofabsolute risk aversion by the average annualincome in Israel during the observation periodUnder the (questionable) assumption that an-nual income is a good proxy for the relevantwealth at the time of decision making thisproduct would be a proxy for the coefficient ofrelative risk aversion As Table 5 indicates ourbenchmark specification results in an impliedcoefficient of relative risk aversion of about 97A CARA specification results in a lower coef-ficient of 44 On the other hand the medianestimate for relative risk aversion is well belowone Thus the widely used estimate of a lowsingle-digit coefficient of relative risk aversionfalls somewhere between the median and themean and between the median and the seventy-fifth percentile of the risk aversion distribution

The Relationship between Unobserved Riskand Unobserved Risk AversionmdashTable 4 al-lows us to make observations about the relation-ship between risk and risk aversion We firstdiscuss the relative importance of unobservedheterogeneity of both dimensions In the popu-lation we study unobserved heterogeneity inrisk aversion (r) is much greater than unob-served heterogeneity in risk () This is trueboth in absolute terms (315 compared to 015respectively) and after normalizing by the cor-responding mean level38 using the coefficientof variation as a measure of dispersion (027compared to 01 respectively) It is also true forthe overall unconditional dispersion This couldindicate that selection on risk aversion is moreimportant in our data than adverse selection39 Theright metric to use for such statements is not

36 For such an individual the second-order Taylor ex-pansion we use in Section IIA is exact

37 We suspect that individuals who participate in televi-sion game shows are more adventuresome than the generalpopulation Moreover knowing that the audience mightwish to see them keep betting is likely to further encourageparticipants to take risks

38 All covariates are measured in deviations from theirsample mean so the estimated constant in each equation isthe estimated mean of the left-hand-side variable

39 An additional observation is that given our estimatesobservables account for slightly less than 50 percent of thevariation in ln(ri) but for almost 65 percent of the variation inln(i ) This may seem surprising given the finding that disper-sion in risk aversion is more important and thus should be thefocus of insurance companies However this finding is con-sistent with the conventional wisdom that insurance companiesspend much effort and resources on collecting information thathelps in risk classification but only little effort on informationthat predicts willingness to pay

765VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

entirely clear however as one should projectthese estimated variances onto the same scaleof say willingness to pay or profits Profits forexample are affected directly by risk but not byrisk aversion so the comparison above could bemisleading Therefore we relegate the discus-sion of this issue to Section IIIF where weshow that even when we look at pricing andprofits heterogeneity in risk aversion is moreimportant

Table 4 also indicates a strong and signif-icant positive correlation of 084 between un-observed risk aversion and unobserved riskThis result may be surprising because it isnatural to think that risk aversion with respectto financial decisions is likely to be associatedwith a greater tendency to take precautionsand therefore with lower risk Indeed a recentpaper by Finkelstein and McGarry (2006)supports such intuition by documenting anegative correlation between risk aversionand risk in the market for long-term careinsurance (see also Israel 2005 for the autoinsurance market in Illinois) Our marketmight however be special in ways that couldproduce a positive correlation First in con-trast to most insurance markets where apolicyholderrsquos risk depends on the policy-holderrsquos precautions but not on the precau-tions of others accident risk in the autoinsurance market is a result of an interactionbetween onersquos driving habits and those ofother drivers Second the correlation coeffi-cient may be highly sensitive to the particularway we measure risk and risk aversion Thereare many unobserved omitted factors that arelikely to be related to both dimensions Theintensity of vehicle use for example mightbe an important determinant of risk If indi-viduals who are more risk averse also drivemore miles per year a positive correlationbetween risk and risk aversion could emergeThus our results caution against assumingthat risk and risk aversion are always nega-tively correlated Whether this is the case maydepend on the characteristics of the particularmarket one studies and on the particular mea-sure for risk Indeed one can use estimatedannual mileage to control for one omittedvariable that may potentially work to producea positive correlation between risk aversionand risk Despite its partial coverage in thedata and being considered (by the company)

as unreliable40 controlling for annual mileagereported by policyholders reduces the esti-mated correlation coefficient to 06841 Weview this result as consistent with the possi-bility that underlying unobserved factors thataffect risk play an important role in generat-ing the estimated positive correlation betweenrisk and risk aversion A third explanation forthe positive estimated correlation is the dis-tributional assumption As discussed in Sec-tion IIC the correlation coefficient isprobably the parameter that is most sensitiveto these assumptions Indeed as discussedlater when we change our assumptions aboutthe Poisson process and use an alternativedistribution with extremely thin tails the es-timated correlation coefficient reverses signsWe of course view the assumptions of thebenchmark model as more appropriate andtherefore maintain the view that the data sug-gest a positive correlation Finally note thatwhile the correlation parameter we estimate ishigh the implied unconditional correlationbetween risk and risk aversion is less than025 across all reported specifications This isbecause the coefficients on the same covariate(for example the size of the car or whetherthe car is used for business) often affect riskand risk aversion in opposite directions andbecause of the logarithmic transformation

C Stability of the Risk Aversion Coefficients

Estimating risk preferences is motivated bythe idea that the same (or similar) risk aversionparameter may explain risky decisions acrossmultiple contexts This idea is at the heart of thecurrent debate we mention in the introductionregarding the empirical relevance of expectedutility and whether the standard construct of a

40 Insurance companies typically do not use these self-reported mileage estimates as they are considered unreli-able While companies could verify these estimates at thetime of a claim such reports are hard to enforce Anindividual can always claim that her ex ante estimate waslower than it turned out to be Indeed the estimated elas-ticity of risk with respect to estimated mileage is 006(0012) which seems low suggesting a bias downwardpotentially due to ldquoerrors in variablesrdquo bias

41 The full results from this specification are providedin the online Appendix Other results from this specifi-cation are similar to those we report for the benchmarkspecification

766 THE AMERICAN ECONOMIC REVIEW JUNE 2007

single risk aversion parameter that appliesacross many types of decisions is the right wayto think about consumer behavior To the bestof our knowledge there is no direct empiricalevidence on this issue While we defer much ofthis discussion to the concluding section weverify below that at least over a limited set ofchoices in which we observe the same individ-uals the estimated risk preferences help to ex-plain choices over time and across differentcontexts

Stability across ContextsmdashIdeally onewould like to show that the estimated risk pref-erences are stable across multiple lines of insur-ance and perhaps even across other riskydecisions individuals face Absent data on suchchoices we provide evidence that the estimatedrisk aversion coefficients help in predictingother related risky choices Individuals in ourdata had to make three additional coveragechoices in addition to the main policy choicethat we have analyzed so far They had tochoose whether to purchase coverage for caraudio equipment (bought by 498 percent of thesample of new policyholders) for towing ser-vices (bought by 929 percent) and for wind-shield damage (bought by 954 percent) Thesetypes of coverage are sold by the same companybut are administered by a third party so we donot have information about their realizationTherefore we cannot perform a similar exerciseto the one we perform for the deductible choiceOf course one should not expect these addi-tional coverage choices to be perfectly corre-lated with the deductible choice Even if thesame risk preferences are an important factorfor all of these decisions variation in risk acrosscoverage is also important For example own-ership of an expensive car audio system is likelyto affect the purchase of audio coverage inde-pendent of risk preferences Similarly ownersof old cars may value more coverage for towingservices Risk preferences however should en-ter into all of these coverage choices and there-fore we expect these additional types ofcoverages to be positively correlated with thechoice of a low deductible

We coded these three additional coveragechoices as three dummy variables and use themas additional covariates to verify that they canhelp explain unobserved risk aversion Firstwe add these variables to the probit regression

reported in column 3 of Table 3 They all havea positive statistically and economicallysignificant power in predicting the deductiblechoice42 To verify that these reported correla-tions are not driven by correlation in risks wealso estimate the benchmark model with theseadditional variables as covariates The esti-mated coefficients on the three additional cov-erage choices are positive and two of the threeare significant in both the risk and risk aversionequations they are 0053 (0019) 00002 (0021)and 0057 (0023) in the risk equation and0354 (0064) 0123 (0122) and 0813 (0174)in the risk aversion equation for audio towingand windshield coverage respectively (standarddeviations in parentheses)43 This suggests thatthe estimated risk preferences help in explainingmultiple coverage choices across these relatedcontexts

Stability over TimemdashWe now provide evi-dence that the estimated risk preferences arealso stable over time All the individuals whodecide to renew their policy with the companyafter it expires are at least in principle free tochange their deductible choice However anoverwhelming majority of individuals (morethan 97 percent) do not change their deductiblechoices when they renew Even individuals whoinitially chose the rarely chosen ldquohighrdquo andldquovery highrdquo deductible levels typically do notchange their choices Therefore it is also notsurprising that estimating the model on the firstdeductible choice of individuals and the seconddeductible choice of the same individuals yieldssimilar results as shown at the end of the nextsection Of course changes in deductiblechoices do not necessarily imply changes inrisk preferences Risk may also change overtime and may drive such changes At thesame time while persistence in deductiblechoices over time is consistent with stablerisk preferences it may also be driven bymany other factors For example it is reason-able to think that individuals do not devote

42 They are estimated to increase on average the prob-ability of choosing a low deductible by 0019 (819) 0023(527) and 0041 (758) for audio towing and windshieldcoverage respectively (z stat in parentheses)

43 The full results from this specification are provided in theonline Appendix Other results from this specification aresimilar to those we report for the benchmark specification

767VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

the same amount of thought to their renewaldecision as to their initial choice leading tothe observed persistence This caveat is themain reason why we focus most of the analysisonly on new policies

D Robustness

This section discusses the robustness of themain results Results from all specifications wemention in this section are summarized in Table6 The full results from these specifications arerelegated to the online Appendix

The von NeumannndashMorgenstern Utility Func-tionmdashTo derive the benchmark model we as-sume a negligible third derivative of the vNMutility function This is attractive as it allowsus to summarize risk preferences by a one-dimensional parameter r [u13(w)u(w)]There is however a large literature (eg MilesS Kimball and N Gregory Mankiw 1989) em-phasizing the importance of a (positive) third

derivative of the utility function which leads toprecautionary saving It is therefore importantto address the sensitivity of the results to thisassumption

By allowing a third-order Taylor approxima-tion of equation (4) we obtain

(15) r u13w

uw

p

d 1

d

uw

uw

dh2 dh dl dl

2

6d

which reduces to equation (7) when u(w) 0This illustrates that a positive third derivativeprovides an additional (precautionary) incentiveto insure It also shows that in order to fullydescribe preferences in the presence of a thirdderivative a one-dimensional parameter doesnot suffice one needs to know both r [u13(w)u(w)] and [u(w)u(w)] This makesit less attractive for estimation

TABLE 6mdashROBUSTNESS

Specificationa Sample Obs

Claim risk () Absolute risk aversion (r)

Corr(r ) Mean Median Std dev Mean Median Std dev

Baseline estimatesBenchmark model All new customers 105800 0220 0217 0048 19 103 73 106 0020 0207 0839

The vNM utility functionCARA utility All new customers 105800 0219 0217 0048 87 104 b 98 106 b 0009b 0201 0826

The claim generating processBenchmark model No multiple claims 103260 0182 0180 0040 20 103 28 105 0018 0135 0547Thinner-tail risk distribution All new customers 105800 0205c 0171c 0155c 17 103 19 106 0020 0076 0916

The distribution of risk aversionLower-bound procedure All new customers 105800 mdash mdash mdash 37 104 0 0002 mdash mdash

Incomplete information about riskBenchmark model Experienced drivers 82966 0214 0211 0051 21 103 83 106 0021 0200 0761Benchmark model Inexperienced drivers 22834 0230 0220 0073 30 103 12 107 0032 0186 0572Learning model All new customers 105800 0204 0191 0084 12 103 16 106 0016 0200 0772

Sample SelectionBenchmark model First two years 45739 0244 0235 0066 31 103 26 105 0026 0225 0699Benchmark model Last three years 60061 0203 0201 0043 16 103 34 107 0021 0113 0611Benchmark model Referred by a friend 26434 0213 0205 0065 30 103 84 107 0031 0155 0480Benchmark model Referred by advertising 79366 0219 0216 0051 21 103 76 106 0022 0212 0806Benchmark model Non-stayers 48387 0226 0240 0057 23 103 77 107 0026 0149 0848Benchmark model Stayers 1st choice 57413 0190 0182 0057 29 103 29 105 0024 0152 0463Benchmark model Stayers 2nd choice 57413 0208 0200 0065 30 103 16 105 0026 0211 0637

Note The table presents the key figures from various specifications and subsamples tracing the order they are presented in Section IIID Full results (in the format

of Table 4) from all these specifications are available in the online Appendixa ldquoBenchmark modelrdquo refers to the benchmark specification estimated on various subsamples (the first row replicates the estimates from Table 4) The other

specifications are slightly different and are all described in more detail in the corresponding parts of Section IIIDb The interpretation of r in the CARA model takes a slightly different quantitative meaning when applied to noninfinitesimal lotteries (such as the approximately

$100 stakes we analyze) This is due to the positive third derivative of the CARA utility function compared to the benchmark model in which we assume a small

third derivative Thus these numbers are not fully comparable to the corresponding figures in the other specificationsc The interpretation of in the thinner-tail distribution we estimate is slightly different from the standard Poisson rate which is assumed in the other specifications

Thus these numbers are not fully comparable to the corresponding figures in the other specifications

768 THE AMERICAN ECONOMIC REVIEW JUNE 2007

We can allow for a positive third derivativewithout expanding the dimensionality of pref-erences by imposing a parametric functionalform on the utility function Essentially suchparametric form imposes a relationship between[u13(w)u(w)] and [u(w)u(w)] Two standardforms are those that exhibit CARA and thosethat exhibit constant relative risk aversion(CRRA) CRRA requires us to make additionalassumptions about the relevant (and unob-served) wealth level of each individual makingit less attractive The CARA case is imple-mented below

With CARA utility we substitute u(w) exp(rw) in equation (4) and rearrange toobtain

(16) rp

exprdh exprdl

This equation defines the indifference set Unlikethe benchmark model there is no closed-formrepresentation for ri() a property that makesestimation significantly slower Due to the precau-tionary saving incentive arising from the thirdderivative we can use a lower level of absoluterisk aversion to rationalize the low deductiblechoice given In other words the CARA indif-ference set is flatter in comparison to the bench-mark case depicted in Figure 2 Thus the CARAspecification will generally lead to lower estimatesof the coefficient of absolute risk aversion asshown in Table 6 The mean and dispersion ofestimated absolute risk aversion (but not the me-dian) are smaller by about a factor of two Thegeneral qualitative results however remain thesame As shown in Table 5 the mean individual isquite risk averse with respect to lotteries of $100magnitude the median is close to risk neutral withrespect to such lotteries and heterogeneity in riskaversion is important

The Claim-Generating ProcessmdashIn thebenchmark model we assume that claims aregenerated by a Poisson model This assumptionis both analytically attractive (it is important inderiving equation (4)) and in our view a rea-sonable approximation of the data-generatingprocess for claims The Poisson distribution hasbeen widely used in economics to model acci-dent counts (eg Nancy L Rose 1990 Dionneand Vanasse 1992 Ronald Michener and CarlaTighe 1992) In using this distribution re-

searchers have followed the long history of theuse of Poisson by actuaries and insurance spe-cialists going back to Ladislaus Bortkiewicz(1898)

An important restriction of the Poisson dis-tribution is that its mean and variance are thesame Although some economic studies con-firmed this assumption for particular accidentdata (eg Christopher M Auld et al 2001) it isoften the case that this restriction is falsifiedThe most common deviation from the Poissonrestriction is that of fat tails ie variance that ishigher than the mean This led researchers touse a negative binomial distribution to accom-modate this regularity by introducing a secondparameter which delinks the relationship be-tween the mean and the variance One naturalinterpretation of the fat tails is that of unob-served heterogeneity and the negative binomialdistribution can be viewed as a Gamma mixtureof Poisson processes Consistent with this viewwe assume that the claim-generating processfollows a Poisson process at the individuallevel but allows unobserved heterogeneity inrisk and estimates a lognormal mixture of Pois-son processes which is similar to a negativebinomial The dispersion parameter we esti-mate is a free parameter which is identi-fied by the fatness of the tails of the claimdistribution

While one would like to allow fat tails of theaggregate claim distribution one may criticizethe assumption of a Poisson distribution at theindividual level for having tails that may be ldquotoofatrdquo Recently Jaap H Abbring et al (2003) andMark Israel (2004) provided evidence for neg-ative state dependence in data from auto insur-ance claims similar to the ones we useControlling for heterogeneity across individu-als these papers show that a second claim isless likely than a first This may happen due toexperience rating or perhaps to more carefuldriving and less intense use of a car after anaccident The Poisson distribution assumes thatthe second accident is just as likely so negativestate dependence may suggest thinner tails atthe individual level

To verify that the main results are not sensi-tive to this restriction of the Poisson model weperform two tests First we estimate the bench-mark model on a sample that includes onlyindividuals with one or no claims The model isstill identified using variation in policy durations

769VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

The estimates of the level of risk aversion andits dispersion across individuals remain similarto those of the benchmark model (Table 6) Theestimates of risk and the correlation with riskaversion are significantly lower But this isexpected we selected out of the sample thehigh-risk individuals

As a further test for the sensitivity of theresults to possible negative state dependencewe estimated the model with a different distri-bution at the individual level which gives riseto much thinner tails We could not find anyknown distribution of count variables that givesrise to tails thinner than Poissonrsquos so we madeone up In particular for a given Poisson ratelet pn be the probability of observing n occur-rences The distribution we take to the data issuch that

(17) Prn pn2

yenm0 pm2

Thus this distribution makes the probability ofmultiple claims much lower probably muchmore so than any realistic negative state depen-dence in the data Essentially such a distribu-tion makes the econometrician view individualswith multiple claims as high-risk individualswith much more certainty as it requires muchmore ldquobad luckrdquo for low-risk individuals tohave multiple claims The estimates for thelevel of risk aversion and its dispersion arealmost the same as in the benchmark model(Table 6) The rest of the summary figures aredifferent The difference in the level of risk isnot informative the interpretation of isslightly different given the change in the un-derlying distribution The dispersion in ismuch higher This is a direct result of the thin-tail assumption The dispersion of is identifiedby the tails of the claim distribution at theaggregate level With thinner tails imposed onthe individual distribution more heterogeneityis needed to match the observed tails at theaggregate Finally the correlation coefficient inthis specification changes signs and is now neg-ative and close to 1 This is consistent withour discussion in Section IIC which emphasizesthat the identification of the correlation coeffi-cient is closely tied to the structural assump-tions Once these are changed the estimatedcorrelation coefficient would change too Most

importantly however the mean and dispersionof risk aversion are stable

The Distribution of Absolute Risk Aver-sionmdashIn the benchmark model we assume thatthe coefficient of absolute risk aversion is lognor-mally distributed across individuals Since onlyfew previous studies focused on heterogeneity inrisk preferences there is not much existing evi-dence regarding the distribution of risk prefer-ences The only evidence we are aware of is theexperimental results presented by Steffen Andersenet al (2005) which show a skewed distributionwith a fat right tail which is qualitatively consis-tent with the lognormal distribution we assume

An additional advantage of the normality as-sumption is computational It provides a closed-form conditional distribution allowing us to useonly univariate (rather than bivariate) draws inthe estimation procedure significantly reducingthe computational burden One may be con-cerned about the sensitivity of the results to thisdistributional assumption For example it maydrive the result that the median level of riskaversion is much lower than the mean

Incorporating alternative distributional as-sumptions significantly complicates and slowsthe estimation procedure44 As an alternativewe develop a procedure that we believe pro-vides some guidance as to the sensitivity of theresults to this distributional assumption Thedisadvantage of the procedure is that it cannotaccount for adverse selection Since we foundthat adverse selection is not that important thisexercise is informative The exercise conveysthat the main qualitative results are not drivenby the (fat) tails of the lognormal distributionwe impose Rather they are driven by the highfraction of individuals who chose a low deduct-ible despite being of low risk The model im-plies that such individuals must have a fairlyhigh level of risk aversion

The exercise uses a Gibbs sampler to estimatethe lognormal distribution of i given the co-variates and the observed number of claims Ineach iteration of this Gibbs sampler conditional

44 A simple discrete-type distribution of risk aversion isnot well identified with the given data This is due to thenature of the exercise The distribution is identified bythe binary deductible choice and by the ldquolocalrdquo variation inthe pricing menu This variation is not enough to pin downthe discrete types accurately

770 THE AMERICAN ECONOMIC REVIEW JUNE 2007

on the data and the most recent draws for theindividual irsquos we compute lower bounds forthe level and dispersion of risk aversion (inaddition to generating the draws for ri from theconditional lognormal distribution) To deter-mine the lower bound for the level of risk aver-sion we compute

(18) r i i 0 if choicei 0

max0 pi

idi 113d i

if choicei 1

Namely we make ri as low as possible giventhe assumptions of the model We assume thatindividuals who chose a regular deductible arerisk neutral while individuals who chose a lowdeductible are just indifferent between the twodeductible levels (unless i is high enough inwhich case they are also assumed to be riskneutral) We then compute the average over therirsquos To compute the lower bound for the dis-persion we search for values of rirsquos that areconsistent with the assumptions of the modeland the observed data and that minimize thevariance of ri This turns out to be a simplesearch procedure that is linear in the number ofindividuals

The lower bound of the level of risk aversionthat we obtain from this exercise is 368 104

NIS1 (with standard deviation of 638 106)This level is five times lower than the analogouspoint estimate reported in Table 4 This trans-lates into indifference about participating in a50ndash50 lottery of gaining $100 and losing $885Thus it still suggests a significant level of riskaversion for lotteries of $100 magnitude Simi-larly the conversion to relative risk aversion asin Table 5 implies a relative risk aversion co-efficient of 1883 The result for the lowerbound of the dispersion is 189 103 (withstandard deviation of 816 105) which is tentimes lower than the benchmark estimate Thusthe coefficient of variation for absolute riskaversion declines only by two suggesting ahigh degree of heterogeneity Finally oneshould note that these bounds involve extremeassumptions and are computed separately andtherefore cannot be binding at the same timeThus the ldquocorrectrdquo estimates for both quantities

are likely to be higher closer to the results wereport for the benchmark model

Incomplete Information about RiskmdashThroughout we assume that individuals haveperfect information about their individual-specific objective risk type i This is a strongerassumption than we need Because expectedutility is linear in probabilities it suffices thatindividualsrsquo expected risk rate is equal to theirobjective risk rate ie i E(iIi) i wherei is a random variable representing individualirsquos perceived risk rate and Ii is individual irsquosinformation at the time of the coverage choiceNamely individuals could be uncertain abouttheir risk type but their point estimate has to becorrect

There are several channels through whichincomplete information may operate Let usconsider two such cases First suppose thatindividuals are correct but only on average iethat i i i where E(i) 0 The intuitionfor this case is similar to an ldquoerrors in variablesrdquomodel and in principle will result in an evenless important role for adverse selection Giventhat we find a relatively small effect of adverseselection this bias will not change this conclu-sion This may be even more pronounced ifcorr(i i) 0 which reflects a reasonableassumption of ldquoreversion to the meanrdquo ie thatan individualrsquos estimate of his risk type is someweighted average between his true risk type andthe average risk type of individuals who aresimilar (on observables) to him The conclusionmay go in the other way only if the mistakes goin the other direction according to which indi-viduals who are riskier than average believe thatthey are even more risky than they truly areThis we believe is less plausible

As a test for the sensitivity of the results tothe complete information assumption we per-form two exercises The first exercise estimatesthe benchmark model separately for experi-enced drivers and inexperienced drivers wherewe define an individual to be an experienceddriver if he has ten years or more driving expe-rience This is a conservative definition com-pared to those used in the literature (Chiapporiand Salanie 2000 Cohen 2005) The underlyingassumption consistent with evidence providedin Cohen (2005) is that drivers learn about theirown risk types as they gain more driving expe-rience but that most of this learning occurs

771VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

within the first ten years of driving Thus ourcomplete information model is a more appeal-ing description of experienced drivers Theresults for experienced drivers are almost iden-tical to the results from the benchmark specifi-cation (Table 6) suggesting that the mainresults are not driven by the fraction of individ-uals who are less likely to know their risk typesThe results for inexperienced drivers show asimilar pattern and magnitude Consistent withthe learning hypothesis however they show alarger dispersion in risk preferences which maybe due to incomplete information and there-fore to more random choices of deductiblelevels

The second exercise we perform is a morestructural version of the learning story whichallows even experienced drivers to have incom-plete information regarding their risk types Wedo so by estimating a different specification ofthe model that assumes that individuals areBayesian and update their information abouttheir own (stable) risk types over time usingonly information about the number of claimsthey make each year While we do not observecomplete claims histories of individuals wecan simulate such histories and integrate overthese simulations Thus individualsrsquo informa-tion would be related to their true type andwould be more precise with longer driving his-tories The full modeling and estimation detailsof this specification are provided in the Appen-dix We view this model as an extreme versionof incomplete information as there are manyother sources through which individuals maylearn about their own types and thereby havebetter information about their types than whatwe estimate them to have While the resultsimply that the levels of risk aversion and heter-ogeneity are lower than the benchmark esti-mates (Table 6) the order of magnitude andqualitative pattern are quite similar suggestingthat the main qualitative findings are robust tothe information structure

Sample SelectionmdashThere are two differentways to think about the contract selection pro-cess One possibility is that individuals firstselect an insurer based on advertisement wordof mouth or access to an agent Then individ-uals select the insurance contract from amongseveral contracts the selected insurer offersThis selection process is consistent with the way

the company executives view their business45

Another possibility is that individuals first col-lect information about all available insurancecontracts and then choose their most preferredone According to the industryrsquos conventionalwisdom individuals do not shop much acrossinsurers and therefore this selection processseems less important

The unique characteristics of direct insurersmay attract individuals who are more likely toexperiment with new ways of doing businessand may therefore be less risk averse than thegeneral population In Table 7 we compare thedemographics of our sample of policyholderswith those of the general Israeli populationThis comparison reflects a similar intuitioncompared with the general population our av-erage policyholder is slightly younger moreeducated more likely to be male and less likelyto be married or an immigrant This direction ofselection may also apply to unobserved riskpreferences thereby making our policyholderson average less risk averse than a representa-tive individual This suggests that the level ofrisk aversion that we find may be viewed as alower bound on the level of risk aversion in thegeneral Israeli population

One potential way to model sample selectionis to allow for an additional outside option to beselected For the vast majority of individuals weobserve the outside option is to purchase sim-ilar insurance from competing insurance agen-cies Unfortunately data on the structure ofcompeting contracts their prices and the waythey vary with individual characteristics are un-available This makes us uncomfortable to try tomodel sample selection as results from anysuch model will be driven by our assumptionsrather than by meaningful variation in the dataThe results are still meaningful for two reasonsFirst this is a large population accounting forabout 7 percent of all drivers in Israel Secondto the extent that our estimates suggest higherlevels of risk aversion than previously esti-mated and that the sample selection is likely to

45 As discussed in Section I the company was the firstdirect insurance provider and it offered significantly lowerpremiums than those offered by competitors due to signif-icant cost advantage In Section I we also discuss theliterature which emphasizes that choice of direct insurers isdriven primarily by nonmonetary ldquoamenitiesrdquo

772 THE AMERICAN ECONOMIC REVIEW JUNE 2007

bias these estimates downward the results arestill informative

To assess the magnitude of sample selectionin driving the main results we perform severaltests First we estimate the benchmark modelseparately for the first two years of the compa-nyrsquos operation during which the company wasthe only company to sell insurance directly andfor the remaining three years after additionaldirect insurers entered the market Second weestimate the benchmark model separately forindividuals who were referred to the companyby word of mouth and for those who heardabout the company through advertisement (pri-marily on television) The latter may havesearched more and might be more price sensi-tive Third we estimate the benchmark modelseparately for individuals who renewed theirpolicy with the company and for those who didnot renew It seems likely that individuals whodid not renew are less selected as most of themswitch back to regular insurers who insure themajority of the population Fourth we estimatethe model for the second deductible choicemade by those individuals who renew It couldbe argued that switching to other companies is

costly so outside options are not as important indriving deductible choices for those who renewAll these results are summarized in Table 6 andshow some important differences between thetwo groups within each pair of subsamples prob-ably reflecting selection For all groups howeverthe qualitative pattern of the results and the orderof magnitude and dispersion of risk aversion aresimilar to those of the full sample This is sugges-tive that correcting for sample selection is unlikelyto change the qualitative results

E Caveats

Moral HazardmdashThroughout our analysiswe abstract from moral hazard ie we assumethat i can vary across individuals but is invari-ant to the coverage choice There are two typesof moral hazard that may play a role in thiscontext First individuals with less coveragemay take greater precaution and drive morecarefully thereby reducing their claim risk rateSecond conditional on a claim event peoplewith higher deductibles are less likely to file aclaim there exists a range of claims for which

TABLE 7mdashREPRESENTATIVENESS

Variable Sampleb Populationc Car ownersd

Agea 4114 (1237) 4255 (1801) 4511 (1413)Female 0316 0518 0367Family Single 0143 0233 0067

Married 0780 0651 0838Divorced 0057 0043 0043Widower 0020 0074 0052

Education Elementary 0029 0329 0266High school 0433 0384 0334Technical 0100 0131 0165College 0438 0155 0234

Emigrant 0335 0445 0447

Obs 105800 723615 255435

a For the age variable the only continuous variable in the table we provide both the mean and the standard deviation (inparentheses)

b The figures are derived from Table 1 The family and education variables are renormalized so they add up to one weignore those individuals for whom we do not have family status or education level This is particularly relevant for theeducation variable which is not available for about half of the sample it seems likely that unreported education levels arenot random but associated with lower levels of education This may help in explaining at least some of the gap in educationlevels across the columns

c This column is based on a random sample of the Israeli population as of 1995 We use only adult population ieindividuals who are 18 years of age or older

d This column is based on a subsample of the population sample The data provide information about car ownership onlyat the household level not at the individual level Thus we define an individual as a car owner if (a) the household owns atleast one car and the individual is the head of the household or (b) the household owns at least two cars and the individualis the spouse of the head of the household

773VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

filing is profitable only under a low deductible(the literature often refers to this second effectas ldquoex post moral hazardrdquo)

To the extent that moral hazard exists abstract-ing from it will likely bias our estimates of riskaversion downward To see this note that adjust-ing behavior will help individuals to self-insureagainst uninsured costs Similarly ex post moralhazard would reduce the value of a low deduct-ible as in the event of a claim the gain from alower deductible would sometimes be less thanthe difference between the two deductible levelsBoth these effects will make a low deductible lessattractive requiring individuals to be even morerisk averse than we estimate in order to purchasemore coverage Below we discuss why in ourview abstracting from moral hazard is a reason-able approximation in this setting

It seems reasonable to conjecture that ceterisparibus insured individuals will drive less care-fully than uninsured ones It may also seemreasonable that the existence of a deductiblemay make individuals more careful about smalldamages to their car However when all choicesinclude a deductible and deductibles are similarin their magnitude it seems less likely thatdriving or care behavior will be affected46 Fi-nally to separately identify moral hazard onewould need another dimension of the data overwhich risk types remain fixed but coveragechoices vary exogenously (Israel 2004 Jaap HAbbring et al forthcoming)

We rely on the data to justify why we abstractfrom the second potential effect that of ex postmoral hazard Figure 3 presents data on theclaim amounts and shows that about 99 percentof the claims filed by policyholders with low-deductible policies were for amounts greaterthan the higher deductible level If the distribu-tion of amounts of potential claims does notvary with the deductible choice and if individ-uals file a claim for any loss that exceeds their

deductible level this suggests that 99 percent ofthe claims would have been filed under eitherdeductible choice making the assumption toabstract from moral hazard not very restrictive

Individuals however may choose not to filea claim even when the claim amount exceedsthe deductible level This may happen due toexperience rating which increases future insur-ance premiums These dynamic effects do notdepend on the deductible level at the time of theclaim so they simply enter in an additive wayUsing our data on individuals who renew theirpolicies with the company we can assess howbig the dynamic effects are These data showthat the price effect of a claim lasts for threeyears and is highest when an individual files hissecond claim within a year In such a case hewould face about a 20 percent increase in hisinsurance premium in the subsequent year 10percent in the year after and 5 percent in thethird year after the claim The regular premiumis about twice the regular deductible amount soan upper bound for the dynamic costs is 70percent of the regular deductible In most casesthe actual dynamic costs are much lower thanthis upper bound (the dynamic costs of say thefirst claim within a year are close to zero) Inaddition an individual can opt out of the con-tract and switch to a different insurance pro-vider This is likely to reduce his dynamic costsbecause in Israel unlike in the United Statesand many other countries there is no publicrecord of past claims Therefore insurance pro-viders can take full advantage of past recordsonly for their past customers For this reasonnew customers will of course face higher pre-miums than existing ones but the premium in-crease would not be as high as it would havebeen with the old insurance provider This isdue to the presence of ldquoinnocentrdquo new custom-ers who are pooled together with the switchers(Cohen 2003)47 Using 70 percent as a conser-vative upper bound Figure 3 shows that about93 percent of those claims filed by individualswith a low deductible would have also been46 This assumption is consistent with Cohen and Einav

(2003) who find no evidence for behavioral response tochanges in seat belt laws The following anecdotal obser-vation also supports this In an informal survey we con-ducted among our colleagues all of them were aware of adeductible in their auto insurance policy but fewer than 20percent knew its level This does not imply that 80 percentof our colleagues did not pay attention to their deductiblechoice at the time the choice was made It does implyhowever that their driving or care behavior cannot dependon the deductible level

47 New customers may voluntarily report their claimshistory to their new insurance provider Voluntary disclo-sure of past claims is as may be expected not truthful Ourdata suggest an unconditional claim rate of 02453 in oursample population Our data on claims history as voluntar-ily disclosed by the same individuals suggest a claim rate of00604 which is four times lower

774 THE AMERICAN ECONOMIC REVIEW JUNE 2007

filed with a regular deductible While this is notnegligible it applies for only a small fraction ofindividuals For the vast majority of them the99 percent discussed above is a more appropri-ate benchmark Therefore ex post moral hazardis unlikely to play a major role in this settingand one can abstract from the loss distributionand focus on claim rates as we do in thispaper48

Additional Cost of an AccidentmdashOur modelassumes that in the event of an accident theonly incurred costs are those associated with thedeductible payment In practice however other

transaction costs may be incurred such as thetime spent for appraisal of the damage the costsassociated with renting a replacement car for theduration of a repair etc Such costs could bereadily incorporated into the model To illus-trate we assume that these costs are known inadvance and are given by a constant c (whichcould in principle vary with each individual)Since c will not vary with the chosen level ofdeductible it will not affect the value d andwill enter the empirical model only through itseffect on d In particular equation (7) willchange to

(19) r

p

d 1

d c

and everything else will remain the same

48 We would not be as comfortable with this statementfor the choice of high and very high deductibles which areat much higher levels This is one additional reason to focusonly on the choice between low and regular deductibles

FIGURE 3 CLAIM DISTRIBUTIONS

Notes This figure plots kernel densities of the claim amounts estimated separately depending on the deductible choice Forease of comparison we normalize the claim amounts by the level of the regular deductible (ie the normalization is invariantto the deductible choice) and truncate the distribution at ten (the truncated part which includes a fat tail outside of the figureaccounts for about 25 percent of the distribution and is roughly similar for both deductible choices) The thick line presentsthe distribution of the claim amounts for individuals who chose a low deductible while the thin line does the same for thosewho chose a regular deductible Clearly both distributions are truncated from below at the deductible level The figure showsthat the distributions are fairly similar Assuming that the claim amount distribution is the same the area below the thickerline between 06 and 1 is the fraction of claims that would fall between the two deductible levels and therefore (absentdynamic incentives) would be filed only if a low deductible were chosen This area (between the two dotted vertical lines)amounts to 13 percent implying that the potential bias arising from restricting attention to claim rate (and abstracting fromthe claim distribution) is quite limited As we discuss in the text dynamic incentives due to experience rating may increasethe costs of filing a claim shifting the region in which the deductible choice matters to the right an upper bound to these costsis about 70 percent of the regular deductible covering an area (between the two dashed vertical lines) that integrates to morethan 7 percent Note however that these dynamic incentives are a very conservative upper bound they apply to less than 15percent of the individuals and do not account for the exit option which significantly reduces these dynamic costs

775VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

This implies that in principle such costs willhave no effect on the results of the counterfac-tual exercise we present later The costs willhowever affect the interpretation of the esti-mates of risk aversion In particular instead ofthe distribution of r we will now be estimatingthe distribution of r(d c)d so the reportedestimates of the coefficient of absolute riskaversion will be biased upward The magnitudeof the bias depends on the size of this transac-tion cost c compared to the average deductibled If c is relatively small the bias is negligibleIf however c is as big as the (average) deduct-ible level all our reported estimates of the levelof risk aversion should be divided by two (butthe coefficients on observables which are semi-elasticities will not change) The intuitionwould be similar but more involved if c variesacross individuals but not proportionally to d

Data about such transaction costs are ofcourse not available The following back-of-the-envelope exercise may provide some guidanceas to the magnitude of such costs We collecteddata from the Israeli police about the annual num-bers of accidents49 accidents with fatalities andaccidents with severe injuries in Israel for theyears 1996ndash1999 We then divided these numbersby our estimate of the total number of auto insur-ance claims in Israel (161859)50 We obtain that159 percent of the claims involve reported acci-dents 22 percent involve accidents with severeinjuries and 03 percent involve fatal accidentsThus the majority of claims reflect small unre-ported accidents perhaps suggesting that theseadditional costs of a claim are often not too large

Deviations from Expected Utility TheorymdashThroughout the paper we restrict attention toexpected utility maximizers Despite much ev-idence in the literature against some of the pre-dictions of expected utility theory it still seemsto us the most natural benchmark to specify andone that facilitates comparison to previous stud-ies We note that expected utility theory is as-sumed it is not and cannot be tested within our

framework Given our cross-sectional analysiswhich in principle allows flexible forms ofunobserved heterogeneity in risk preferencesthere are no testable restrictions imposed byexpected utility theory We should also note thatmuch (but not all) of the documented evidenceagainst expected utility theory arises with ex-treme risk probabilities which are close to zeroor one Our data (and our estimates) are basedon risk probabilities that are all in the range of010 to 035 Over this range expected utilityseems to perform better Finally it is importantto stress two points First at the conceptuallevel it is straightforward to use an alternativetheory of decisions under uncertainty If con-ditional on objective risk individuals vary in asingle dimension the same conceptual modeland empirical strategy can be applied All oneneeds to do is to specify the parameter overwhich decisions vary and construct an indiffer-ence set in the space of the specified parameterand (objective) risk types similar to the onepresented in Figure 2 Second any alternativemodel of decisions under uncertainty would re-quire us to take an even stronger view regardingthe parameterized objective function For exam-ple prospect theory (Daniel Kahneman andAmos Tversky 1979) would require us to pa-rameterize not only the curvature of individu-alsrsquo utility functions but also their referencepoints for which there is no natural choice inour context Similar issues arise if we try toapply decision weights (Tversky and PeterWakker 1995) or measures of overconfidencewith respect to driving ability

F Implications for Profits and Pricing

We now look at how firm profits vary withalternative pricing schemes This exercise isinteresting for several reasons First althoughwe do not use supply-side information for esti-mation it shows how one may incorporate suchinformation in estimation Second it illustratesthe conceptual trade-off faced by a monopolistthat operates in a market with adverse selectionAlthough the conceptual trade-off betweenhigher demand and worse selection is wellknown and has been extensively analyzed in thetheoretical literature quantifying it is importantto understand its empirical relevance Finallywe argued earlier that unobserved heterogeneityin risk aversion seems more important than un-

49 An accident is counted in this measure if it was re-ported to the police and a police officer arrived at the scene

50 This is estimated by taking the number of activepolicies at the end of our sample (45871) dividing it by ourbest guess for the share of the market the company had atthe time (7 percent) and multiplying it by the estimatedclaim rate in our data as computed in Table 2 (0245)

776 THE AMERICAN ECONOMIC REVIEW JUNE 2007

observed heterogeneity in risk The current ex-ercise shows that this conclusion also translatesto pricing and profits

Throughout this section we hold fixed thedistribution of risk and risk aversion the firmfaces Since we have little information about thedeterminants of overall demand faced by thefirm and hence how restrictive it is to hold thedistribution fixed we make the simplifying as-sumption that individuals make their choicessequentially They first choose the insuranceprovider by observing only the price of theregular deductible Once they decide to buy apolicy from the insurer they choose the deduct-ible level This seems a reasonable approxima-tion as the regular deductible is the one alwaysadvertised and initially quoted while the otheroptions are revealed only once the potentialcustomer and the insurance sales person ldquogetinto detailsrdquo Consistent with this assumptionwe assume that the regular premium and de-ductible are dictated by competitive conditionsand we focus on the choice of the low deduct-ible and its associated premium

From the companyrsquos standpoint each indi-vidual can be represented by a random draw of(i ri) from the conditional (on observables)distribution of risk and risk aversion

(20) ln i

ln ri Nxi

xi

2 r

r r2

When analyzing the optimal menu to offer suchan individual the company is assumed to berisk neutral and to maximize expected profitsSuppose the company offered only the regulardeductible and premium (dh ph) Let the ex-pected profits from this strategy be 0 Givenour assumptions we proceed by analyzing howthe firmrsquos profits are affected by a choice of anadditional low deductible option (dl pl) withdl dh and pl ph It is easy to use a changein variables and analyze the choice of d dh dl and p pl ph Expected profits arenow given by

(21) maxdp

0 Prri rii d pp

d Eiri rii d p

The trade-off in the companyrsquos decision isstraightforward Each new customer who chooses

the low combination pays an additional p up-front but saves d for each accident she is in-volved in This translates into two effects thatenter the companyrsquos decision problem The first issimilar to a standard pricing problem higher(lower) price difference (deductible difference)p (d) leads to a higher markup (on those indi-viduals who select the low deductible) but tolower quantity (or probability of purchase) asfewer individuals elect to choose the low deduct-ible This effect enters the profit function throughD(d p) Pr(ri ri(i d p)) The secondcomposition effect arises because of adverse se-lection As the price of the low deductible in-creases those individuals who still elect to choosethe low combination are ceteris paribus thosewith higher risk This effect enters through (dp) E(iri ri(i d p)) Its magnitude andsign depend on the relative heterogeneity of i andri and on the correlation between them Sinceneither D(d p) nor (d p) has a closed-form solution we analyze this decision problemgraphically where D(d p) and (d p) arenumerically computed using simulations from thejoint distribution

We illustrate our analysis by using the meanindividual in the data whose expected ln(i)and ln(ri) are 154 and 1181 respectively(based on Table 4) Such an individual has tochoose between a regular contract of ( phdh) (3189 1595) (in NIS) and a lowcontract of ( pl dl) (3381 957) ie (pd) (191 638) Below we discuss severaladditional counterfactual cases First we con-sider a case with a negative correlation be-tween risk and risk aversion (with the samemagnitude ie 084) Second we con-sider cases when the company ignores unob-served heterogeneity in one of the dimensionsie it views individuals as a draw from theestimated marginal distribution on one dimen-sion with the other dimension known and fixedat its estimated mean We do this exercise foreach dimension separately

To get intuition for the different effects Figure2 presents the estimated distribution in the spaceof (i ri) A small increase (decrease) in p (d)shifts the indifference setup to the right therebymaking some marginal individuals who were pre-viously just to the right of it switch to choosingthe regular deductible The demand trade-off isjust the comparison between the marginal loss ofthe company from all the marginal individuals

777VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

who no longer buy higher coverage with thehigher profits made from the inframarginal indi-viduals who still elect to choose higher coverageFigure 2 also helps in illustrating the effect ofadverse selection and the importance of the cor-relation coefficient As the menu shifts to the rightthe positive correlation implies that the marginalindividuals have higher risk than the average Thismeans that ldquolosingrdquo them (namely havingthem buy less coverage) is not as costly forthe insurance company as such individualsare on average more adversely selected Anegative correlation for example would havemade these marginal individuals more valuablethereby decreasing the incentive to increase pricesor deductibles from the current levels

Figure 4 presents the effects of pricing onprofits by varying the low-deductible levelkeeping the premium charged for it fixed at theobserved price (of p 191) It implies that the

current low deductible benefit of 638 NIS re-sults in additional annual profits of about 368NIS per customer This is about 037 percent oftotal operating profits per customer which isabout 1000 NIS Note however that after sub-tracting the administrative and claim-handlingcosts associated with each customer and claimthe relative magnitude of this effect will behigher Note also that the estimates imply thatthe current low-deductible level is suboptimalBy setting a smaller low-deductible benefit ofd 355 NIS (ie increasing the current lowdeductible by 283 NIS) additional profits canbe increased to 659 NIS51 Of course the find-

51 There is no reason of course to limit the choice of thecompany to only one additional deductible level Moredegrees of freedom in choosing the pricing menu will leadto higher profits

FIGURE 4 COUNTERFACTUALSmdashPROFITS

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) We plot the additional profits fromoffering a low deductible as a function of the attractiveness of a low deductible d dh dl fixing its price at the observedlevel The thick solid line presents the counterfactual profits as implied by the estimated benchmark model The other threecurves illustrate how the profits change in response to changes in the assumptions when the correlation between riskand risk aversion is negative (thin solid line) when there is no heterogeneity in risk aversion (dot-dashed line) and whenthere is no heterogeneity in risk (dashed line) The maxima (argmax) of the four curves respectively are 659 (355)714 (583) 0 and 704 (500) The dotted vertical line represents the observed level of d (638) which implies that theadditional profits from the observed low deductible are 368 NIS per policy

778 THE AMERICAN ECONOMIC REVIEW JUNE 2007

ing that the current pricing is suboptimal mayalso be due to a limitation of the model

Consistent with the intuition discussed aboveFigure 4 also shows that the incentive to in-crease prices (or lower the deductible) is higherwith positive correlation When the correlationbetween risk and risk aversion is negative theoptimal low deductible is lower It also showsthat ignoring either dimension of unobservedheterogeneity has an important effect on pric-ing decisions However while ignoring het-erogeneity in risk does not change thequalitative pattern by much ignoring hetero-geneity in risk aversion completely changesthe pricing analysis In fact given the esti-mated coefficients and the observed prices ifindividuals vary only in risk offering a lowdeductible does not increase profits due to ad-verse selection

Figure 5 breaks down the effect on profits bypresenting the pricing effect on the demand forlow deductible D(d p) and on the compo-sition effect (d p) The former is simplygenerated by the distribution of certainty equiv-alents implied by the joint distribution of i andri (Landsberger and Meilijson 1999) It is S-shaped due to the assumption of lognormal dis-tribution The shape of the composition effect isdriven by the relative variance of i and ri andby the correlation coefficient As the estimatesimply that most of the variation in certaintyequivalents is driven by variation in ri thestrong positive correlation implies that the com-position effect is monotonically decreasing inthe deductible level As the low-deductible op-tion becomes more favorable more peoplechoose it with the most risky individuals choos-ing it first The effect of the deductible level onthe composition effect is dramatically differentwhen the correlation between risk and risk aver-sion is negative When this is the case theobserved relationship between the deductiblelevel and the composition effect is mostly re-versed This is because the effect of risk aver-sion dominates that of adverse selection due toits higher variance

IV Concluding Remarks

The paper makes two separate contributionsFirst from a methodological standpoint we layout a conceptual framework through which one

can formulate and estimate a demand system forindividually customized contracts The key datarequirements for this approach are contractchoices individual choice sets and ex post riskrealizations Since such data may be available inmany other contexts the methodological frame-work may be useful to uncover structural pa-rameters in such settings As an example onecould consider annuity data and use guaranteeperiod choices and mortality data to identifyheterogeneity in risk (mortality) and in prefer-ences for wealth after death (Einav Finkelsteinand Paul Schrimpf 2006) Similarly one couldconsider loan data and use down-paymentchoices and default data to identify heterogene-ity in risk (default) and in liquidity (Will Ad-ams Einav and Jonathan D Levin 2006)Second from an economic standpoint we pro-vide a new set of estimates for the degree andheterogeneity of (absolute) risk aversion and itsrelationship with risk We discuss these below

While our estimates of risk aversion help topredict other related insurance decisions it isnatural to ask to what extent these parametersare relevant in other contexts This is essentiallyan empirical question which can be answeredonly by estimating risk aversion parameters fora variety of bet sizes and in a variety of con-texts Since isolating risk preferences in manycontexts is hard such exercises are rare leavingus with no definite answer for the scope ofmarkets for which our estimates may be rele-vant On one hand Rabin (2000) and Rabin andThaler (2001) argue that different decisions inlife are taken in different contexts and thereforemay be subject to different parameters in theutility function On the other hand classicaltheory suggests that each individual has onevalue function over her lifetime wealth so allrisky decisions take into account the same valuefunction and are therefore subject to the samerisk preferences Our view is somewhere inbetween We are more comfortable with extrap-olation of our risk aversion estimates to setupsthat are closer to the auto insurance marketcontext in which these parameters are esti-mated To assess ldquoclosenessrdquo it is important toconsider various factors over which contextsmay differ Such factors may include bet size aswell as informational and behavioral effectssuch as default options framing and rarity of theevents Since we use claim data to identify risktype we leave everything else to be interpreted as

779VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

FIGURE 5 COUNTERFACTUALSmdashSELECTION

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) Parallel to Figure 4 we breakdown the effects on profits to the share of consumers who choose a low deductible (bottom panel) and to the expected riskof this group (top panel) This is presented by the thick solid line for the estimates of the benchmark model As in Figure 4 wealso present these effects for three additional cases when the correlation between risk and risk aversion is negative (thin solidline) when there is no heterogeneity in risk aversion (dot-dashed line) and when there is no heterogeneity in risk (dashedline) The dotted vertical lines represent the observed level of d (638) for which the share of low deductible is 16 percentand their expected risk is 0264 This may be compared with the corresponding figures in Table 2A of 178 and 0309respectively Note however that the figure presents estimated quantities for the average individual in the data while Table2A presents the average quantities in the data so one should not expect the numbers to fit perfectly

780 THE AMERICAN ECONOMIC REVIEW JUNE 2007

risk aversion As an example overconfidence willbe captured by a lower level of estimated riskaversion so if overconfidence is more importantin auto insurance than in health insurance whenextrapolated to health insurance individuals maybehave as if they are more risk averse than weestimate them to be

With this caveat in mind let us discuss ourfour main findings First we find large hetero-geneity in risk preferences across individualsThis heterogeneity is important in various con-texts (a) it cautions against using pure cross-sectional variation to test expected utilitytheory (b) it may create strong selection ofparticipants into particular markets this maymake participants in voluntary markets quitedifferent in their risk preferences from those inmarkets in which participation is either manda-tory or driven by other factors and (c) it maymake the marginal individual quite differentfrom the average one Models in macroeconom-ics and finance which often use a representativeindividual framework may not be able to cap-ture and account for such differences

Our second set of findings concerns the wayrisk aversion relates to observable characteris-tics Our finding that women are more riskaverse than men has been documented in othersettings The finding that risk preferences ex-hibit a U-shaped pattern over the life cycle maybe interesting to explore further in other con-texts Other findings suggest that the estimatedcoefficient of risk aversion increases with ob-servables that are related to income and wealthAs we agree with the widely held belief of thedecreasing absolute risk aversion property ourpreferred interpretation for this finding is thatwealth and income may be endogenous gener-ating the estimated cross-sectional relationshipWhile lower risk aversion may be associatedwith higher propensity to become an entrepre-neur and thereby have higher wealth it mayalso be associated with lower propensity to saveor invest in education affecting wealth the otherway Therefore one important message of thisfinding is that accounting for heterogeneity inpreferences may be important as representativeconsumer models may provide misleading in-terpretations for otherwise natural results

The last two sets of findings concern therelationship between risk and preferences Sincerisk is particular to the context in which it ismeasured these findings may be sensitive to the

market context and may change once risk takesother forms Even within the auto insurancemarket it is important how risk is measuredMoreover risk in the auto insurance marketmay be conceptually different from risk in othermarkets In many markets risk is independentacross individuals In auto insurance howevermuch of the risk depends on coordinationamong drivers and therefore may be more re-lated to relative not absolute characteristicsFor this reason our finding of positive correla-tion between risk and risk aversion can coexistwith findings of negative correlation in othercontexts (Israel 2005 Finkelstein and McGarry2006) The positive correlation we find is alsoconsistent with the fact that the bivariate probittest in our data provides evidence for adverseselection (Cohen 2005) while similar reduced-form tests in other contexts do not

Finally we find that unobserved heterogene-ity in risk preferences is more important thanheterogeneity in risk This may be driven by thecasual evidence that insurance companies exertmuch effort and resources in collecting con-sumer data which are informative about riskclassification but not about preferences52 Weillustrate the empirical importance of our find-ings for the analysis of optimal contracts in autoinsurance The presence of more than one di-mension of unobserved heterogeneity may dra-matically change the nature of these contractsTheory is still not fully developed for suchmultidimensional screening problems as it typ-ically requires a small number of types (Lands-berger and Meilijson 1999) restricts the twodimensions to be independent of each other(Jean-Charles Rochet and Lars A Stole 2002)or assumes that the number of instrumentsavailable to the monopolist is not smaller thanthe dimension of unobserved heterogeneity(Steven Matthews and John Moore 1987 Rich-ard Arnott and Joseph E Stiglitz 1988)53 MarkArmstrong (1999) may be the closest theoreticalwork to the framework suggested here It cannot

52 This choice of data collection efforts may be justifiedif it is easier for such firms to price discriminate based onrisk but harder to price discriminate based on preferencesA common belief is that without cost-based (ie risk-based) justification for prices price discrimination may leadto consumer backlash

53 See also Smart (2000) Villeneuve (2003) and Jullienet al (2007) for related theoretical results

781VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

be directly applied however as it uses simpli-fying linearity assumptions which would behard to impose in the current context Our re-sults indicate that many applications can benefitfrom extending the theory to include the more

general case such as the one analyzed hereSuch a theory may also serve as a guide forusing supply-side restrictions in similar con-texts Our counterfactual analysis is a very pre-liminary start in this direction

APPENDIX A DESCRIPTION OF THE GIBBS SAMPLER

In this appendix we describe the setup of the Gibbs sampler that we use to estimate the modelOne of the main advantages of the Gibbs sampler is its ability to allow for data augmentation oflatent variables (Tanner and Wong 1987) In our context this amounts to augmenting the individual-specific risk aversion and risk type namely i rii1

n as additional parametersWe can write the model as follows

(22) ln i xi i

(23) ln ri xi vi

(24) choicei 1 if ri ri ()0 if ri ri ()

(25) claimsi Poissoni ti

(26) i

vi

iid

N00

2 r

r r2

Let

2 r

r r2 X

x1

xn

0

0 x1

xn

y r and ui i

vi

The set of parameters for which we want to have a posterior distribution is given by uii1

n The prior specifies that are independent of uii1n have a conventional

diffuse prior We adopt a hierarchical prior for uii1N

(27) uii 1n

iid

N0

(28) 1 Wishart2a Q

782 THE AMERICAN ECONOMIC REVIEW JUNE 2007

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 12: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

benchmark formulation assumes that (i ri) fol-lows a bivariate lognormal distribution Thuswe can write the model as

(8) ln i xi i

(9) ln ri xi vi

with

(10) i

vi

iid

N00

2 r

r r2

Neither i nor ri is directly observed Thereforewe treat both as latent variables Loosely speak-ing they can be thought of as random effects Weobserve two variables the number of claims andthe deductible choice which are related to thesetwo unobserved components Thus to completeour econometric model we have to specify therelationship between the observed variables andthe latent ones This is done by making two struc-tural assumptions First we assume that the num-ber of claims is drawn from a Poisson distributionnamely

(11) claimsi Poisson(i ti )

where ti is the observed duration of the policySecond we assume that when individuals maketheir deductible choices they follow the theoreti-cal model described in the previous section Themodel implies that individual i chooses the lowdeductible (choicei 1) if and only if ri ri(i)where ri is defined in equation (7) Thus theempirical model for deductible choice is given by

(12) Prchoicei 1 Prri

pi

idi 1

d i

Prexp(xi vi)

pi

exp(xi i)di 1

d i

With no unobserved heterogeneity in risk (i 0) equation (12) reduces to a simple probit In

such a case one can perfectly predict risk from thedata denote it by (xi) and construct an addi-tional covariate ln([pi((xi)di) 1]d i) Giventhe assumption that risk aversion is distributedlognormally running the probit regressionabove and renormalizing the coefficient onthe constructed covariate to 1 (instead ofthe typical normalization of the variance of theerror term to 1) has a structural interpretation withln (ri) as the dependent variable However Cohen(2005) provides evidence of adverse selection inthe data implying the existence of unobservedheterogeneity in risk This makes the simple probitregression misspecified Estimation of the fullmodel is more complicated Once we allow forunobserved heterogeneity in both unobserved riskaversion (vi) and claim rate (i) we have to inte-grate over the two-dimensional region depicted inFigure 2 for estimation

EstimationmdashA natural way to proceed is toestimate the model by maximum likelihoodwhere the likelihood of the data as a function ofthe parameters can be written by integrating outthe latent variables namely

(13) Lclaimsi choicei

Prclaimsi choiceii riPri ri

where is a vector of parameters to be esti-mated While formulating the empirical modelusing likelihood may help our thinking aboutthe data-generating process using maximumlikelihood (or generalized method of moments(GMM)) for estimation is computationally cum-bersome This is because in each iteration itrequires evaluating a separate complex integralfor each individual in the data In contrastMarkov Chain Monte Carlo (MCMC) Gibbssampling is quite attractive in such a case Us-ing data augmentation of latent variables (Mar-tin A Tanner and Wing Hung Wong 1987)according to which we simulate (i ri) and latertreat those simulations as if they are part ofthe data one can avoid evaluating the complexintegrals by sampling from truncated normaldistributions which is significantly less compu-tationally demanding (eg Luc Devroye 1986)This feature combined with the idea of aldquosliced samplerrdquo (Paul Damien John Wake-field and Stephen Walker 1999) to sample from

756 THE AMERICAN ECONOMIC REVIEW JUNE 2007

an unfamiliar posterior distribution makes theuse of a Gibbs sampler quite efficient for ourpurposes Finally the lognormality assumptionimplies that F(ln(i)ri) and F(ln(ri)i) follow a(conditional) normal distribution allowing us torestrict attention to univariate draws furtherreducing the computational burden

The Appendix provides a full description ofthe Gibbs sampler including the conditionaldistributions and the (flat) prior distributions weuse The basic intuition is that conditional onobserving (i ri) for each individual we have asimple linear regression model with two equa-tions The less standard part is to generate drawsfor (i ri) We do this iteratively Conditionalon i the posterior distribution for ln(ri) fol-lows a truncated normal distribution where thetruncation point depends on the menu individ-ual i faces and its direction (from above orbelow) depends on individual irsquos deductiblechoice The final step is to sample from theposterior distribution of ln(i) conditional on riThis is more complicated as we have bothtruncation which arises from adverse selection(just as we do when sampling for ri) and thenumber of claims which provides additionalinformation about the posterior of i Thus theposterior for i takes an unfamiliar form forwhich we use a ldquosliced samplerrdquo

We use 100000 iterations of the Gibbs sam-pler It seems to converge to the stationarydistribution after about 5000 iterations There-fore we drop the first 10000 draws and use thelast 90000 draws of each variable to report ourresults The results are based on the posteriormean and posterior standard deviation fromthese 90000 draws Note that each iterationinvolves generating separate draws of (i ri) foreach individual Performing 100000 iterationsof the benchmark specification (coded in Mat-lab) takes about 60 hours on a Dell Precision530 workstation

C Identification

The parametric version of the model is iden-tified mechanically There are more equationsthan unknowns and no linear dependenciesamong them so (as also verified using MonteCarlo exercises) the model parameters can bebacked out from simulated data Our goal in thissection is not to provide a formal identificationproof Rather we want to provide intuition for

which features of the data allow us to identifyparticular parameters of the model The discus-sion also highlights the assumptions that areessential for identification vis-a-vis those thatare made only for computational convenience(making them in principle testable)

Discussion of Nonparametric Identifica-tionmdashThe main difficulty in identifying themodel arises from the gap between the (exante) risk type i which individuals usewhen choosing a deductible and the (ex post)realization of the number of claims we ob-serve We identify between the variation inrisk types and the variation in risk realizationsusing our parametric distributional assump-tions The key is that the distribution of risktypes can be uniquely backed out from theclaim data alone This allows us to use thedeductible choice as an additional layer ofinformation which identifies unobserved het-erogeneity in risk aversion21 Any distribu-tional assumption that allows us to uniquelyback out the distribution of risk types fromclaim data would be sufficient to identify thedistribution of risk aversion As is customaryin the analysis of count processes we make aparametric assumption that claims are gener-ated by a lognormal mixture of Poisson dis-tributions (Section IIID discusses this furtherand explores an alternative) Using a mixtureenables us to account for adverse selectionthrough unobserved heterogeneity in risk Italso allows us to better fit the tails of theclaim distribution In principle a more flexi-ble mixture or a more flexible claim-generat-ing process could be identified as long as theclaims data are sufficiently rich22

21 Cardon and Hendel (2001) face a similar identificationproblem in the context of health insurance They use vari-ation in coverage choice (analogous to deductible choice) toidentify the variation in health-status signals (analogous torisk types) from the variation in health expenditure (analo-gous to number of claims) They can rely on the coveragechoice to identify this because they make an assumptionregarding unobserved heterogeneity in preferences (iidlogit) We take a different approach as our main goal is toestimate rather than assume the distribution of preferences

22 Although it may seem that the claim data are limited(as they take only integer values between 0 and 5 in ourdata) variation in policy duration generates continuousvariation in the observed claim propensity Of course thisvariation also introduces an additional selection into themodel due to policy cancellations which are potentially

757VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Once the distribution of risk types is iden-tified from claims data the marginal distribu-tion of risk aversion (and its relationship tothe distribution of risk types) is nonparametri-cally identified from the variation in theoffered menus discussed in Section I Thisvariation implies different deductible and pre-mium options to identical (on observables)individuals who purchased insurance at dif-ferent times Different menus lead to differentindifference sets (similar to the one depictedin Figure 2) These sets often cross each otherand nonparametrically identify the distribu-tion of risk aversion and the correlation struc-ture at least within the region in which theindifference sets vary For the tails of thedistribution as is typically the case we haveto rely on parametric assumptions or usebounds The parametric assumption of log-normality we use for most of the paper ismade only for computational convenience

Intuition for the Parametric IdentificationMechanismmdashVariation in the offered menus isimportant for the nonparametric identificationThe parametric assumptions could identify themodel without such variation Thus to keep theintuition simple let us take the bivariate log-normal distribution as given and contrary to thedata assume that all individuals are identical onobservables and face the same menu Supposealso that all individuals are observed for exactlyone year and have up to two claims23 In thissimplified case the model has five parametersto be estimated the mean and variance of risk and

2 the mean and variance of risk aver-sion r and r

2 and the correlation parameter The data can be summarized by five numbersLet 130 131 and 132 1 131 130 be thefraction of individuals with zero one and twoclaims respectively Let 0 1 and 2 be theproportion of individuals who chose a low de-ductible within each ldquoclaim grouprdquo Given ourdistributional assumption about the claim-

generating process we can use 130 and 131 touniquely identify and

2 Loosely isidentified by the average claim rate in the dataand

2 is identified by how fat the tail of theclaim distribution is ie by how large (132131)is compared to (131130) Given and

2 and thelognormality assumption we can (implicitly)construct a posterior distribution of risk typesfor each claim group F(r claims c) andintegrate over it when predicting the deductiblechoice This provides us with three additionalmoments each of the form

(14) Ec Prchoice 1r

dFr claims c dFr

for c 0 1 2 These moments identify thethree remaining parameters of the model rr

2 and Let us now provide more economic content to

the identification argument Using the same ex-ample and conditional on identifying and

2

from the claim data one can think about thedeductible choice data 0 1 2 as a graph(c) The absolute level of the graph identifiesr In the absence of correlation between riskand risk aversion the slope of the graph iden-tifies r

2 with no correlation the slope shouldalways be positive (due to adverse selection)but higher r

2 would imply a flatter graph be-cause more variation in the deductible choiceswill be attributed to variation in risk aversionFinally is identified by the curvature of thegraph The more convex (concave) the graph isthe more positive (negative) is the estimated For example if 0 05 1 051 and 2 099 it is likely that r

2 is high (explaining why0 and 1 are so close) and is highly positive(explaining why 2 is not also close to 1) Incontrast if 0 1 it must mean that thecorrelation between risk and risk aversion isnegative which is the only way the originalpositive correlation induced by adverse selec-tion can be offset This intuition also clarifiesthat the identification of relies on observingindividuals with multiple claims (or differentpolicy durations) and that it is likely to besensitive to the distributional assumptions Thedata (Table 2B) provide a direct (positive) cor-relation between deductible choice and claims

endogenous The results are similar when we use onlyexpired and truncated policies

23 This variation is sufficient (and necessary) to identifythe benchmark model The data provide more variation weobserve up to five claims per individual we observe con-tinuous variation in the policy durations we observe vari-ation in prices and we exploit distributional restrictionsacross individuals with different observable characteristics

758 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The structural assumptions allow us to explainhow much of this correlation can be attributedto adverse selection The remaining correlation(positive or negative) is therefore attributed tocorrelation in the underlying distribution of riskand risk aversion

III Results

A Reduced-Form Estimation

To get an initial sense for the levels of abso-lute risk aversion implied by the data we use asimple back-of-the-envelope exercise We com-pute unconditional averages of p d dh dland d (Table 2A)24 and substitute these valuesin equation (7) The implied coefficient of ab-solute risk aversion is 29 104 NIS125 Thisfigure could be thought of as the average indif-ference point implying that about 18 percent ofthe policyholders have coefficients of absoluterisk aversion exceeding it To convert to USdollar amounts one needs to multiply thesefigures by the average exchange rate (352)resulting in an average indifference point of102 103 $US1 This figure is less than halfof a similar estimate reported by Sydnor (2006)for buyers of homeownerrsquos insurance but about3 and 13 times higher than comparable fig-ures reported by Gertner (1993) and Metrick(1995) respectively for television game showparticipants

Table 3 provides reduced-form analysis ofthe relationship between the observables andour two left-hand-side variables the number ofclaims and the deductible choice26 Column 1reports the estimates from a Poisson regressionof the number of claims on observed character-istics This regression is closely related to therisk equation we estimate in the benchmarkmodel It shows that older people women andpeople with a college education are less likely to

have an accident Bigger more expensiveolder and noncommercial cars are more likelyto be involved in an accident Driving experi-ence and variables associated with less intenseuse of the car reduce accident rates As could beimagined claim propensity is highly correlatedover time past claims are a strong predictor offuture claims Young drivers are 50 to 70 per-cent more likely to be involved in an accidentwith young men significantly more likely thanyoung women Finally as indicated by the trendin the estimated year dummies the accident ratesignificantly declined over time Part of thisdecline is likely due to the decline in accidentrates in Israel in general27 This decline mightalso be partly due to the better selection ofindividuals the company obtained over time asit gained more experience (Cohen 2003)

Columns 2 and 3 of Table 3 present estimatesfrom probit regressions in which the dependentvariable is equal to one if the policyholder chosea low deductible and is equal to zero otherwiseColumn 3 shows the marginal effects of thecovariates on the propensity to choose a lowdeductible These marginal effects do not have astructural interpretation as the choice of lowdeductible depends on its price on risk aver-sion and on risk In this regression we againobserve a strong trend over time Fewer policy-holders chose the low deductible as time wentby One reason for this trend according to thecompany executives is that over time the com-panyrsquos sales persons were directed to focusmainly on the ldquodefaultrdquo regular deductible op-tion28 The effect of other covariates will bediscussed later in the context of the full modelIn unreported probit regressions we also test thequalitative assumptions of the structural model byadding three additional regressors the price ratio(pidi) the average deductible offered d i

24 The unconditional is computed by maximum likeli-hood using the data on claims and observed durations of thepolicies

25 Using the CARA specification as in equation (16) weobtain a slightly lower value of 25 104 NIS1

26 We find positive correlation in the errors of these tworegressions suggesting the existence of adverse selection inthe data and motivating a model with unobserved heteroge-neity in risk This test is similar to the bivariate probit testproposed by Chiappori and Salanie (2000) and replicatesearlier results reported in Cohen (2005)

27 In particular traffic fatalities and counts of trafficaccidents in Israel fell by 11 percent and 18 percent during1998 and 1999 respectively

28 Such biased marketing efforts will bias consumersagainst choosing the low deductible thus making them lookless risk averse This would make our estimate a lowerbound on the true level of risk aversion If only sophisti-cated consumers could see beyond the marketing effort andthis sophistication were related to observables (eg educa-tion) the coefficients on such observables would be biasedupward This is not a major concern given that the coefficientson most of the covariates are fairly stable when we estimatethe benchmark model separately for different years

759VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

TABLE 3mdashNO HETEROGENEITY IN RISK

Variable

Poisson regressiona Probit regressionsc

Dep var Number of claims Dep var 1 if low deductible

(1) (2) (3)

Coeff Std Err IRRb Coeff Std Err dPdX dPdXDemographics Constant 43535 02744 mdash 224994 23562 mdash

Age 00074 00052 09926 01690 00464 00034 00042Age2 989 105 536 105 10001 00017 00005 339 105 455 105Female 00456 00161 09555 08003 01316 00165 00134Family Single omitted omitted omitted

Married 01115 00217 08945 06417 01950 00128 00045Divorced 00717 00346 10743 03191 03127 00064 00044Widower 00155 00527 10156 03441 04573 00069 00025Other (NA) 01347 01755 11442 29518 19836 00511 00370

Education Elementary 00522 00550 09491 02927 04278 00061 00016High school omitted omitted omittedTechnical 00373 00308 10380 05037 02520 00105 00138College 00745 00197 09282 05279 01546 00109 00053Other (NA) 00116 00184 10116 00630 01540 00013 00025

Emigrant 00210 00163 10213 00497 01328 00010 00003

Car attributes Log(Value) 01227 00281 11306 13285 02342 00271 00299Car age 00187 00042 10189 01603 00357 00033 00017Commercial car 01394 00326 08699 09038 02781 00177 00294Log(engine size) 02972 00459 13461 06924 03952 00141 00075

Driving License years 00204 00034 09798 01043 00312 00021 00005License years2 00002 677 105 10002 00015 00005 299 105 167 105

Good driver 00176 00191 09825 07207 01574 00148 00152Any driver 00564 00169 10580 10923 01321 00217 00258Secondary car 00859 00209 09177 00038 01626 00001 00070Business use 01852 00293 12034 07951 02737 00156 00017History length 00527 00110 09486 11218 01450 00228 00171Claims history 06577 00390 19304 04654 05460 00095 00496

Young driver Young driver 05235 00361 16879 28847 06305 00524 00012Gender Male omitted omitted omitted

Female 01475 00288 08629 17959 03135 00396 00195Age 17ndash19 omitted omitted omitted

19ndash21 00701 00532 10726 07800 07509 00153 0014721ndash24 00267 00574 09737 05746 07773 00114 0015624 02082 00567 0812 17869 07328 00397 00136

Experience 1 omitted omitted omitted1ndash3 02416 00458 07854 10175 06577 00217 000093 02827 00532 07538 32513 07386 00762 00410

Company year First year omitted omitted omittedSecond year 00888 00198 09150 44513 01506 00792 00859Third year 00690 00222 09334 85888 01820 01303 01367Fourth year 01817 00232 08339 118277 02102 01616 01734Fifth year 05431 0028 05810 143206 02778 01920 02085

105586Obs 105800 94000 105800Pseudo R2 00162 01364 01296Log likelihood 577459 369595 430869

Significant at the 5 percent confidence levela Maximum likelihood estimates Variation in exposure (policy duration) is accounted forb IRR Incidence rate ratio Each figure should be interpreted as the increasedecrease in claim probability as a result of an increase of one unit in the

right-hand-side variablec There are two separate probit regressions reported in this table Column 2 relies on the deductible choice model and the lognormality assumption As discussed

in the text by including an additional regressor ln([pi((xi)di) 1]d i) (with (xi) predicted from column 1 above) and normalizing its coefficient to 1 we obtaina structural interpretation of this regression with ln (ri) as the dependent variable Thus the reported coefficients are comparable to those estimated for the benchmarkmodel One should be cautious however in interpreting these coefficients Unlike the benchmark model this regression does not allow unobserved heterogeneity inrisk and suffers from some selection bias because observations with high predicted risk rate are omitted (which is why the number of observations is 94000 ratherthan the full sample of 105800) For comparison column 3 reports the marginal effects from a comparable probit regression that uses the full sample and does notcontrol for pricing and predicted risk through the additional structural regressor Column 3 does not have a structural interpretation and its (unreported) coefficientscannot be compared to those estimated from the benchmark model

760 THE AMERICAN ECONOMIC REVIEW JUNE 2007

and the risk rate (xi) as predicted from thePoisson regression of column 1 All three addi-tional regressors enter with the predicted signand with large and highly significant marginaleffects29

Finally column 2 of Table 3 presents animportant specification of the probit regres-sion in which ln([pi(( xi)di) 1]d i) isadded as a regressor and its coefficient isnormalized to 130 As already mentioned ifwe assume that there is no unobserved heter-ogeneity in risk then column 2 is analogousto the ln(ri) equation of the benchmark modelThis restriction of the model is rejected by thedata About 10 percent of the individuals arepredicted to have (xi) (pi di) implying achoice of low deductible for any level of riskaversion Many of these individuals howeverstill choose higher deductible levels Column2 reports the results for the remaining indi-viduals (94000 out of 105800) for which theregression can be run While the signs of theestimated coefficients are similar to those inthe benchmark model presented below therestricted version of the model suggests muchhigher levels and dispersion of risk aversionwell above any reasonable level31 The fullestimation of the benchmark model clearly re-jects this restriction on the model

B Estimation of the Benchmark Model

The Effect of Individual Characteristics onRisk AversionmdashTable 4 presents the estimationresults from the benchmark model The secondcolumn shows how the level of absolute riskaversion is related to individual characteristicsAs the dependent variable is in natural loga-

rithm coefficients on dummy variables can bedirectly interpreted as approximate percentagechanges The vast majority of these coefficientsare quite stable across a wide range of specifi-cations which are mentioned later

The results indicate that women are more riskaverse than men and have a coefficient of ab-solute risk aversion about 20 percent greaterthan that of men These results are consistentwith those of Donkers et al (2001) and Hartoget al (2002) The estimated effect of age sug-gests a nonmonotone pattern of risk preferencesover the life cycle The estimated coefficientsimply that initially (that is at age 18 the youngestindividual in the data) individuals become less riskaverse with age but around the age of 48 indi-viduals start becoming more risk averse32 Mar-ried individuals are estimated to be significantlymore risk averse compared to singles while di-vorced individuals are less (although the coeffi-cient is statistically insignificant)

The analysis suggests that variables that arelikely to be correlated with income or wealthsuch as post-high-school education and thevalue of the car have a positive coefficientindicating that wealthier people have higherlevels of absolute risk aversion Although we donot have data on individualsrsquo income or wealthwe use other proxies for income in additionalspecifications and obtain mixed results Whenwe include as a covariate the average householdincome among those who live in the same zipcode we obtain a significant and negative co-efficient of 0333 (0154) (standard deviationin parentheses) When we match zip code in-come on demographics of the individuals thecoefficient is effectively zero 0036 (0047)and when we use a socioeconomic index ofthe locality in which the individual lives thecoefficient is positive 0127 (0053)33 Thus

29 The estimated marginal effect (z-statistic in parenthe-ses) is 0352 (1376) 16 104 (1481) and 0154(255) for (pidi) d i and (xi) respectively

30 The level is a normalization The sign is estimatedHad the sign on this regressor been positive this would haveimplied a rejection of the model

31 For the implied median level of risk aversion the re-stricted model produces a similar estimate to the estimate wereport below for the benchmark model However since someindividuals who chose the low deductible are estimated to havevery low claim rates the restricted model is ldquoforcedrdquo to estimatevery high risk aversion for these individuals (in contrast thebenchmark model can explain such choices by positive risk re-siduals) resulting in very high dispersion and (due to the lognor-mality assumption) very high average risk aversion which isabout 1025 higher than the benchmark estimates we report below

32 This nonmonotone pattern may explain why age enterswith different signs in the estimation results of Donkers et al(2001) and Hartog et al (2002) A somewhat similar U-shapepattern with respect to age is also reported by Sumit Agarwalet al (2006) in the context of consumer credit markets

33 The full results from these specifications are providedin the online Appendix (available at httpwwwe-aerorgdatajune0720050644_apppdf ) The reason we do not usethese income variables in the benchmark specification istheir imperfect coverage which would require us to omitalmost 20 percent of the individuals Other results fromthese regressions are similar to those we report for thebenchmark specification

761VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

overall we interpret our findings as suggestiveof a nonnegative and perhaps positive associ-ation between incomewealth and absolute riskaversion

At first glance these results may appear tobe inconsistent with the widely held beliefthat absolute risk aversion declines withwealth One should distinguish however be-tween two questions (a) whether for a givenindividual the vNM utility function exhibitsdecreasing absolute risk aversion and (b)how risk preferences vary across individuals

Our results do not speak to the first questionand should not be thought of as a test of thedecreasing absolute risk aversion propertyTesting this property would require observingthe same individual making multiple choicesat different wealth levels Rather our resultsindicate that individuals with greater wealthhave utility functions that involve a greaterdegree of risk aversion It might be thatwealth is endogenous and that risk aversion(or unobserved individual characteristics thatare correlated with it) leads individuals to

TABLE 4mdashTHE BENCHMARK MODEL

Variable Ln() equation Ln(r) equation Additional quantities

Demographics Constant 15406 (00073) 118118 (01032) Var-covar matrix ()

Age 00001 (00026) 00623 (00213) 01498 (00097)Age2 624 106 (263 105) 644 104 (211 104) r 31515 (00773)Female 00006 (00086) 02049 (00643) 08391 (00265)Family Single Omitted Omitted

Married 00198 (00115) 01927 (00974) Unconditional statisticsa

Divorced 00396 (00155) 01754 (01495) Mean 02196 (00013)Widower 00135 (00281) 01320 (02288) Median 02174 (00017)Other (NA) 00557 (00968) 04599 (07397) Std Dev 00483 (00019)

Education Elementary 00194 (00333) 01283 (02156) Mean r 00019 (00002)High school Omitted Omitted Median r 727 106 (756 107)Technical 00017 (00189) 02306 (01341) Std Dev r 00197 (00015)College 00277 (00124) 02177 (00840) Corr(r ) 02067 (00085)Other (NA) 00029 (00107) 00128 (00819)

Emigrant 00030 (00090) 00001 (00651) Obs 105800

Car attributes Log(value) 00794 (00177) 07244 (01272)Car age 00053 (00023) 00411 (00176)Commercial car 00719 (00187) 00313 (01239)Log(engine size) 01299 (00235) 03195 (01847)

Driving License years 00015 (00017) 00157 (00137)License years2 183 105 (351 105) 148 104 (254 104)Good driver 00635 (00112) 00317 (00822)Any driver 00360 (00105) 03000 (00722)Secondary car 00415 (00141) 01209 (00875)Business use 00614 (00134) 03790 (01124)History length 00012 (00052) 03092 (00518)Claims history 01295 (00154) 00459 (01670)

Young driver Young driver 00525 (00253) 02499 (02290)Gender Male Omitted mdash

Female 00355 (00061) mdashAge 17ndash19 Omitted mdash

19ndash21 00387 (00121) mdash21ndash24 00445 (00124) mdash24 00114 (00119) mdash

Experience 1 Omitted mdash1ndash3 00059 (00104) mdash3 00762 (00121) mdash

Company year First year Omitted OmittedSecond year 00771 (00122) 14334 (00853)Third year 00857 (00137) 28459 (01191)Fourth year 01515 (00160) 38089 (01343)Fifth year 04062 (00249) 39525 (01368)

Note Standard deviations based on the draws from the posterior distribution in parentheses Significant at the 5 percent confidence levela Unconditional statistics represent implied quantities for the sample population as a whole ie integrating over the distribution of covariates in the sample (as

well as over the unobserved components)

762 THE AMERICAN ECONOMIC REVIEW JUNE 2007

save more to obtain more education or totake other actions that lead to greaterwealth34

Let us make several additional observationsFirst while owners of more expensive cars ap-pear to have both higher risk exposure andhigher levels of risk aversion owners of biggercars have higher risk exposure but lower levelsof risk aversion This should indicate that thestructure of the model does not constrain therelationship between the coefficients in the twoequations Rather it is the data that ldquospeak uprdquoSecond individuals who are classified by theinsurer as ldquogood driversrdquo indeed have lowerrisk but also appear to have lower risk aversionThis result is somewhat similar to the positivecorrelation between unobserved risk and unob-served risk aversion which we report belowThird policyholders who tend to use the car forbusiness are less risk averse This could bebecause uninsured costs of accidents occurringto such policyholders are tax deductible Fourthpolicyholders who reported three full years ofclaims history are more risk averse but are notdifferent in their risk exposure The attitude thatleads such policyholders to comply with therequest to (voluntarily) report three full years ofclaims history is apparently and not surpris-ingly correlated with higher levels of risk aver-sion In contrast while past claims indicate highrisk they have no significant relationship withrisk aversion Finally we find a strong trendtoward lower levels of risk aversion over timeThis is a replication of the probit results dis-cussed earlier

The Effect of Individual Characteristics onClaim RiskmdashThe first column of Table 4 de-scribes the relationship between observablesand risk exposure Accident risk is higher fordivorced individuals and lower for people witha college education Bigger older more expen-sive and noncommercial cars are all morelikely to be involved in an accident Drivingexperience reduces accident rates as do mea-sures of less intense use of the car while youngdrivers are more exposed to risk Claim propen-sity is highly correlated over time the voluntaryreport of past claims is a strong predictor offuture claims This risk equation produces re-sults that are similar to those of the simplerPoisson regression reported in Table 3 Al-though some of the coefficients lose signifi-cance the magnitude of most coefficients isquite similar The similarity between these twosets of results is to be expected as the riskregression is identified primarily from the dataon claims so incorporating the information ondeductible choice does not qualitatively changethe conceptual identification strategy (see Sec-tion IIC) If the results were not similar thiscould have indicated a misspecification of themodel The slight differences between the riskregressions in Table 3 and Table 4 are drivenmainly by the structural assumptions First thebenchmark model estimates a lognormal mix-ture of Poisson rates rather than a single Pois-son model By incorporating the fatter tails ofthe claim distribution it slightly changes theresults increases the standard errors and de-creases the average predicted claim rate Sec-ond the information on deductible choiceslightly helps us in obtaining more precise es-timates through the correlation structure be-tween the error terms in the two equations

The Implied Level of Risk AversionmdashOne ofthe main goals of the estimation is to obtainmeasures of the level of risk aversion in thepopulation we study Since we use Gibbs sam-pling and augment the latent coefficients ofabsolute risk aversion we can directly obtainthe posterior distribution of various moments ofthis distribution At each iteration of the Gibbssampler we compute the mean standard devi-ation (across individuals) and various percen-tiles of the simulated draws of i and ri and thecorrelation between them The far-right columnof Table 4 reports the averages and standard

34 One may be tempted to interpret the positive wealtheffects as an indirect indication of credit constraints wealth-ier individuals are less credit constrained and therefore canafford to purchase more insurance We do not share thisinterpretation for two reasons First the insurance companyobserves these proxies for wealth and conditions on themwhen setting prices Since the willingness to pay for insur-ance is likely to be correlated with the willingness to pay forthe additional insurance provided by the low deductibleoption premiums already reflect this variation We condi-tion on the level of the premium Second paying less exante implies paying more ex post so applying the creditconstraint argument only to the ex ante payment but not tothe probabilistic ex post deductible payments has no theo-retical foundation Essentially the setup of the model linksthe ex ante decisions to the ex post losses which are bothdriven by the curvature of the vNM utility function

763VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

deviations of these computed quantities over theiterations of the Gibbs sampler35 The impliedrisk aversion of the mean individual is 00019which is about seven times greater than theback-of-the-envelope calculation presented inthe previous section As we assume a lognormal

distribution and estimate a relatively high dis-persion coefficient r the estimates also implysignificantly lower values for the median levelof risk aversion As shown later the qualitativepattern of these results is quite robust acrossspecifications

Table 5 summarizes our findings regardingthe level and dispersion of risk aversion Itpresents two ways to interpret the estimates weobtained from the benchmark model as well ascomparisons to specifications with constant ab-solute risk aversion (CARA) utility and with

35 Note that these estimated quantities are unconditionalIn computing these quantities we integrate over the distri-bution of observable characteristics in the data so onecannot compute these estimates from the estimated param-eter directly

TABLE 5mdashRISK-AVERSION ESTIMATES

Specificationa Absolute risk aversionb Interpretationc Relative risk aversiond

Back-of-the-envelope 10 103 9070 1484Benchmark model

Mean individual 67 103 5605 972225th percentile 23 106 9998 003Median individual 26 105 9974 03775th percentile 29 104 9714 42790th percentile 27 103 7834 390295th percentile 99 103 4937 14327

CARA utilityMean individual 31 103 7651 4436Median individual 34 105 9966 050

Learning modelMean individual 42 103 6886 6140Median individual 56 106 9995 008

Comparable estimatesGertner (1993) 31 104 9699 479Metrick (1995) 66 105 9934 102Holt and Laury (2002)e 32 102 2096 86575Sydnor (2006) 20 103 8329 5395

a The table summarizes the results with respect to the level of risk aversion ldquoBack-of-the-enveloperdquo refers to thecalculation we report in the beginning of Section III ldquobenchmark modelrdquo refers to the results from the benchmark model(Table 4) ldquoCARA utilityrdquo refers to a specification of a CARA utility function and ldquolearning modelrdquo refers to a specificationin which individuals do not know their risk types perfectly (see Section IIID) The last four rows are the closest comparableresults available in the literature

b The second column presents the point estimates for the coefficient of absolute risk aversion converted to $US1 unitsFor the comparable estimates this is given by their estimate of a representative CARA utility maximizer For all otherspecifications this is given by computing the unconditional mean and median in the population using the figures we reportin the mean and median columns of risk aversion in Table 6 (multiplied by 352 the average exchange rate during the periodto convert to US dollars)

c To interpret the absolute risk aversion estimates (ARA) we translate them into x u(w) 1frasl2 u(w 100) 1frasl2 u(w x) That is we report x such that an individual with the estimated ARA is indifferent about participating in a 50ndash50lottery of gaining 100 US dollars and losing x US dollars Note that since our estimate is of absolute risk aversion the quantityx is independent of w To be consistent with the specification we use a quadratic utility function for the back-of-the-envelopebenchmark and learning models and use a CARA utility function for the others

d The last column attempts to translate the ARA estimates into relative risk aversion We follow the literature and do soby multiplying the ARA estimate by average annual income We use the average annual income (after tax) in Israel in 1995(51168 NIS from the Israeli Census) for all our specifications and we use average disposable income in the United Statesin 1987 ($15437) for Gertner (1993) and Metrick (1995) For Holt and Laury (2002) and Sydnor (2006) we use a similarfigure for 2002 ($26974)

e Holt and Laury (2002) do not report a comparable estimate The estimate we provide above is based on estimating aCARA utility model for the 18 subjects in their experiment who participated in the ldquo90rdquo treatment which involved stakescomparable to our setting For these subjects we assume a CARA utility function and a lognormal distribution of theircoefficient of absolute risk aversion The table reports the point estimate of the mean from this distribution

764 THE AMERICAN ECONOMIC REVIEW JUNE 2007

incomplete-information (discussed later) and toother comparable figures in the literature Ourbenchmark estimate suggests that an averagequadratic utility maximizer36 will be indifferentabout participating in a 50ndash50 lottery in whichhe gains $100 or loses $5605 A CARA spec-ification suggests a significantly lower value foraverage risk aversion making the mean individ-ual indifferent about a 50ndash50 lottery of gaining$100 or losing $7651 The results from theincomplete information model for the meanlevel of risk aversion are in between these esti-mates All the results suggest that although themean individual exhibits a significant level ofrisk aversion heterogeneity in risk preferencesis important and the median individual is al-most risk neutral with respect to lotteries of$100 magnitude Although the mean is alwaysgreater than the median under the lognormaldistribution the large difference we find is notimposed In principle we could have obtained ahigh level of risk aversion with less heteroge-neity thereby leading to a smaller differencebetween the mean and the median (the esti-mated distribution of risk types is an example)

Let us briefly discuss the relevance of thecomparison to Gertner (1993) and Metrick(1995) There are two ways one can reconcilethe differences between the estimates Firstboth these papers measure risk aversion fortelevision game show participants these arehighly selected groups in a rather ldquorisk-friendlyrdquo environment37 Second the magni-tudes of the stakes are higher The showparticipants bet on several thousand dollars andmore while our average individual risks muchlower stakes in the range of $100 Thus thedifference in the results may be due to the issuesraised in Rabin (2000) regarding the compara-bility of behavior across different contexts andbet sizes We discuss this further in Section IV

A different way to quantify our estimate is byreporting them in relative terms There is noconsensus in the literature as to the relevantwealth that is taken into account in such deci-

sions Therefore for comparability we followthe closest papers in this respect (eg Gertner1993) and use annual income as the relevantwealth We multiply the estimated coefficient ofabsolute risk aversion by the average annualincome in Israel during the observation periodUnder the (questionable) assumption that an-nual income is a good proxy for the relevantwealth at the time of decision making thisproduct would be a proxy for the coefficient ofrelative risk aversion As Table 5 indicates ourbenchmark specification results in an impliedcoefficient of relative risk aversion of about 97A CARA specification results in a lower coef-ficient of 44 On the other hand the medianestimate for relative risk aversion is well belowone Thus the widely used estimate of a lowsingle-digit coefficient of relative risk aversionfalls somewhere between the median and themean and between the median and the seventy-fifth percentile of the risk aversion distribution

The Relationship between Unobserved Riskand Unobserved Risk AversionmdashTable 4 al-lows us to make observations about the relation-ship between risk and risk aversion We firstdiscuss the relative importance of unobservedheterogeneity of both dimensions In the popu-lation we study unobserved heterogeneity inrisk aversion (r) is much greater than unob-served heterogeneity in risk () This is trueboth in absolute terms (315 compared to 015respectively) and after normalizing by the cor-responding mean level38 using the coefficientof variation as a measure of dispersion (027compared to 01 respectively) It is also true forthe overall unconditional dispersion This couldindicate that selection on risk aversion is moreimportant in our data than adverse selection39 Theright metric to use for such statements is not

36 For such an individual the second-order Taylor ex-pansion we use in Section IIA is exact

37 We suspect that individuals who participate in televi-sion game shows are more adventuresome than the generalpopulation Moreover knowing that the audience mightwish to see them keep betting is likely to further encourageparticipants to take risks

38 All covariates are measured in deviations from theirsample mean so the estimated constant in each equation isthe estimated mean of the left-hand-side variable

39 An additional observation is that given our estimatesobservables account for slightly less than 50 percent of thevariation in ln(ri) but for almost 65 percent of the variation inln(i ) This may seem surprising given the finding that disper-sion in risk aversion is more important and thus should be thefocus of insurance companies However this finding is con-sistent with the conventional wisdom that insurance companiesspend much effort and resources on collecting information thathelps in risk classification but only little effort on informationthat predicts willingness to pay

765VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

entirely clear however as one should projectthese estimated variances onto the same scaleof say willingness to pay or profits Profits forexample are affected directly by risk but not byrisk aversion so the comparison above could bemisleading Therefore we relegate the discus-sion of this issue to Section IIIF where weshow that even when we look at pricing andprofits heterogeneity in risk aversion is moreimportant

Table 4 also indicates a strong and signif-icant positive correlation of 084 between un-observed risk aversion and unobserved riskThis result may be surprising because it isnatural to think that risk aversion with respectto financial decisions is likely to be associatedwith a greater tendency to take precautionsand therefore with lower risk Indeed a recentpaper by Finkelstein and McGarry (2006)supports such intuition by documenting anegative correlation between risk aversionand risk in the market for long-term careinsurance (see also Israel 2005 for the autoinsurance market in Illinois) Our marketmight however be special in ways that couldproduce a positive correlation First in con-trast to most insurance markets where apolicyholderrsquos risk depends on the policy-holderrsquos precautions but not on the precau-tions of others accident risk in the autoinsurance market is a result of an interactionbetween onersquos driving habits and those ofother drivers Second the correlation coeffi-cient may be highly sensitive to the particularway we measure risk and risk aversion Thereare many unobserved omitted factors that arelikely to be related to both dimensions Theintensity of vehicle use for example mightbe an important determinant of risk If indi-viduals who are more risk averse also drivemore miles per year a positive correlationbetween risk and risk aversion could emergeThus our results caution against assumingthat risk and risk aversion are always nega-tively correlated Whether this is the case maydepend on the characteristics of the particularmarket one studies and on the particular mea-sure for risk Indeed one can use estimatedannual mileage to control for one omittedvariable that may potentially work to producea positive correlation between risk aversionand risk Despite its partial coverage in thedata and being considered (by the company)

as unreliable40 controlling for annual mileagereported by policyholders reduces the esti-mated correlation coefficient to 06841 Weview this result as consistent with the possi-bility that underlying unobserved factors thataffect risk play an important role in generat-ing the estimated positive correlation betweenrisk and risk aversion A third explanation forthe positive estimated correlation is the dis-tributional assumption As discussed in Sec-tion IIC the correlation coefficient isprobably the parameter that is most sensitiveto these assumptions Indeed as discussedlater when we change our assumptions aboutthe Poisson process and use an alternativedistribution with extremely thin tails the es-timated correlation coefficient reverses signsWe of course view the assumptions of thebenchmark model as more appropriate andtherefore maintain the view that the data sug-gest a positive correlation Finally note thatwhile the correlation parameter we estimate ishigh the implied unconditional correlationbetween risk and risk aversion is less than025 across all reported specifications This isbecause the coefficients on the same covariate(for example the size of the car or whetherthe car is used for business) often affect riskand risk aversion in opposite directions andbecause of the logarithmic transformation

C Stability of the Risk Aversion Coefficients

Estimating risk preferences is motivated bythe idea that the same (or similar) risk aversionparameter may explain risky decisions acrossmultiple contexts This idea is at the heart of thecurrent debate we mention in the introductionregarding the empirical relevance of expectedutility and whether the standard construct of a

40 Insurance companies typically do not use these self-reported mileage estimates as they are considered unreli-able While companies could verify these estimates at thetime of a claim such reports are hard to enforce Anindividual can always claim that her ex ante estimate waslower than it turned out to be Indeed the estimated elas-ticity of risk with respect to estimated mileage is 006(0012) which seems low suggesting a bias downwardpotentially due to ldquoerrors in variablesrdquo bias

41 The full results from this specification are providedin the online Appendix Other results from this specifi-cation are similar to those we report for the benchmarkspecification

766 THE AMERICAN ECONOMIC REVIEW JUNE 2007

single risk aversion parameter that appliesacross many types of decisions is the right wayto think about consumer behavior To the bestof our knowledge there is no direct empiricalevidence on this issue While we defer much ofthis discussion to the concluding section weverify below that at least over a limited set ofchoices in which we observe the same individ-uals the estimated risk preferences help to ex-plain choices over time and across differentcontexts

Stability across ContextsmdashIdeally onewould like to show that the estimated risk pref-erences are stable across multiple lines of insur-ance and perhaps even across other riskydecisions individuals face Absent data on suchchoices we provide evidence that the estimatedrisk aversion coefficients help in predictingother related risky choices Individuals in ourdata had to make three additional coveragechoices in addition to the main policy choicethat we have analyzed so far They had tochoose whether to purchase coverage for caraudio equipment (bought by 498 percent of thesample of new policyholders) for towing ser-vices (bought by 929 percent) and for wind-shield damage (bought by 954 percent) Thesetypes of coverage are sold by the same companybut are administered by a third party so we donot have information about their realizationTherefore we cannot perform a similar exerciseto the one we perform for the deductible choiceOf course one should not expect these addi-tional coverage choices to be perfectly corre-lated with the deductible choice Even if thesame risk preferences are an important factorfor all of these decisions variation in risk acrosscoverage is also important For example own-ership of an expensive car audio system is likelyto affect the purchase of audio coverage inde-pendent of risk preferences Similarly ownersof old cars may value more coverage for towingservices Risk preferences however should en-ter into all of these coverage choices and there-fore we expect these additional types ofcoverages to be positively correlated with thechoice of a low deductible

We coded these three additional coveragechoices as three dummy variables and use themas additional covariates to verify that they canhelp explain unobserved risk aversion Firstwe add these variables to the probit regression

reported in column 3 of Table 3 They all havea positive statistically and economicallysignificant power in predicting the deductiblechoice42 To verify that these reported correla-tions are not driven by correlation in risks wealso estimate the benchmark model with theseadditional variables as covariates The esti-mated coefficients on the three additional cov-erage choices are positive and two of the threeare significant in both the risk and risk aversionequations they are 0053 (0019) 00002 (0021)and 0057 (0023) in the risk equation and0354 (0064) 0123 (0122) and 0813 (0174)in the risk aversion equation for audio towingand windshield coverage respectively (standarddeviations in parentheses)43 This suggests thatthe estimated risk preferences help in explainingmultiple coverage choices across these relatedcontexts

Stability over TimemdashWe now provide evi-dence that the estimated risk preferences arealso stable over time All the individuals whodecide to renew their policy with the companyafter it expires are at least in principle free tochange their deductible choice However anoverwhelming majority of individuals (morethan 97 percent) do not change their deductiblechoices when they renew Even individuals whoinitially chose the rarely chosen ldquohighrdquo andldquovery highrdquo deductible levels typically do notchange their choices Therefore it is also notsurprising that estimating the model on the firstdeductible choice of individuals and the seconddeductible choice of the same individuals yieldssimilar results as shown at the end of the nextsection Of course changes in deductiblechoices do not necessarily imply changes inrisk preferences Risk may also change overtime and may drive such changes At thesame time while persistence in deductiblechoices over time is consistent with stablerisk preferences it may also be driven bymany other factors For example it is reason-able to think that individuals do not devote

42 They are estimated to increase on average the prob-ability of choosing a low deductible by 0019 (819) 0023(527) and 0041 (758) for audio towing and windshieldcoverage respectively (z stat in parentheses)

43 The full results from this specification are provided in theonline Appendix Other results from this specification aresimilar to those we report for the benchmark specification

767VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

the same amount of thought to their renewaldecision as to their initial choice leading tothe observed persistence This caveat is themain reason why we focus most of the analysisonly on new policies

D Robustness

This section discusses the robustness of themain results Results from all specifications wemention in this section are summarized in Table6 The full results from these specifications arerelegated to the online Appendix

The von NeumannndashMorgenstern Utility Func-tionmdashTo derive the benchmark model we as-sume a negligible third derivative of the vNMutility function This is attractive as it allowsus to summarize risk preferences by a one-dimensional parameter r [u13(w)u(w)]There is however a large literature (eg MilesS Kimball and N Gregory Mankiw 1989) em-phasizing the importance of a (positive) third

derivative of the utility function which leads toprecautionary saving It is therefore importantto address the sensitivity of the results to thisassumption

By allowing a third-order Taylor approxima-tion of equation (4) we obtain

(15) r u13w

uw

p

d 1

d

uw

uw

dh2 dh dl dl

2

6d

which reduces to equation (7) when u(w) 0This illustrates that a positive third derivativeprovides an additional (precautionary) incentiveto insure It also shows that in order to fullydescribe preferences in the presence of a thirdderivative a one-dimensional parameter doesnot suffice one needs to know both r [u13(w)u(w)] and [u(w)u(w)] This makesit less attractive for estimation

TABLE 6mdashROBUSTNESS

Specificationa Sample Obs

Claim risk () Absolute risk aversion (r)

Corr(r ) Mean Median Std dev Mean Median Std dev

Baseline estimatesBenchmark model All new customers 105800 0220 0217 0048 19 103 73 106 0020 0207 0839

The vNM utility functionCARA utility All new customers 105800 0219 0217 0048 87 104 b 98 106 b 0009b 0201 0826

The claim generating processBenchmark model No multiple claims 103260 0182 0180 0040 20 103 28 105 0018 0135 0547Thinner-tail risk distribution All new customers 105800 0205c 0171c 0155c 17 103 19 106 0020 0076 0916

The distribution of risk aversionLower-bound procedure All new customers 105800 mdash mdash mdash 37 104 0 0002 mdash mdash

Incomplete information about riskBenchmark model Experienced drivers 82966 0214 0211 0051 21 103 83 106 0021 0200 0761Benchmark model Inexperienced drivers 22834 0230 0220 0073 30 103 12 107 0032 0186 0572Learning model All new customers 105800 0204 0191 0084 12 103 16 106 0016 0200 0772

Sample SelectionBenchmark model First two years 45739 0244 0235 0066 31 103 26 105 0026 0225 0699Benchmark model Last three years 60061 0203 0201 0043 16 103 34 107 0021 0113 0611Benchmark model Referred by a friend 26434 0213 0205 0065 30 103 84 107 0031 0155 0480Benchmark model Referred by advertising 79366 0219 0216 0051 21 103 76 106 0022 0212 0806Benchmark model Non-stayers 48387 0226 0240 0057 23 103 77 107 0026 0149 0848Benchmark model Stayers 1st choice 57413 0190 0182 0057 29 103 29 105 0024 0152 0463Benchmark model Stayers 2nd choice 57413 0208 0200 0065 30 103 16 105 0026 0211 0637

Note The table presents the key figures from various specifications and subsamples tracing the order they are presented in Section IIID Full results (in the format

of Table 4) from all these specifications are available in the online Appendixa ldquoBenchmark modelrdquo refers to the benchmark specification estimated on various subsamples (the first row replicates the estimates from Table 4) The other

specifications are slightly different and are all described in more detail in the corresponding parts of Section IIIDb The interpretation of r in the CARA model takes a slightly different quantitative meaning when applied to noninfinitesimal lotteries (such as the approximately

$100 stakes we analyze) This is due to the positive third derivative of the CARA utility function compared to the benchmark model in which we assume a small

third derivative Thus these numbers are not fully comparable to the corresponding figures in the other specificationsc The interpretation of in the thinner-tail distribution we estimate is slightly different from the standard Poisson rate which is assumed in the other specifications

Thus these numbers are not fully comparable to the corresponding figures in the other specifications

768 THE AMERICAN ECONOMIC REVIEW JUNE 2007

We can allow for a positive third derivativewithout expanding the dimensionality of pref-erences by imposing a parametric functionalform on the utility function Essentially suchparametric form imposes a relationship between[u13(w)u(w)] and [u(w)u(w)] Two standardforms are those that exhibit CARA and thosethat exhibit constant relative risk aversion(CRRA) CRRA requires us to make additionalassumptions about the relevant (and unob-served) wealth level of each individual makingit less attractive The CARA case is imple-mented below

With CARA utility we substitute u(w) exp(rw) in equation (4) and rearrange toobtain

(16) rp

exprdh exprdl

This equation defines the indifference set Unlikethe benchmark model there is no closed-formrepresentation for ri() a property that makesestimation significantly slower Due to the precau-tionary saving incentive arising from the thirdderivative we can use a lower level of absoluterisk aversion to rationalize the low deductiblechoice given In other words the CARA indif-ference set is flatter in comparison to the bench-mark case depicted in Figure 2 Thus the CARAspecification will generally lead to lower estimatesof the coefficient of absolute risk aversion asshown in Table 6 The mean and dispersion ofestimated absolute risk aversion (but not the me-dian) are smaller by about a factor of two Thegeneral qualitative results however remain thesame As shown in Table 5 the mean individual isquite risk averse with respect to lotteries of $100magnitude the median is close to risk neutral withrespect to such lotteries and heterogeneity in riskaversion is important

The Claim-Generating ProcessmdashIn thebenchmark model we assume that claims aregenerated by a Poisson model This assumptionis both analytically attractive (it is important inderiving equation (4)) and in our view a rea-sonable approximation of the data-generatingprocess for claims The Poisson distribution hasbeen widely used in economics to model acci-dent counts (eg Nancy L Rose 1990 Dionneand Vanasse 1992 Ronald Michener and CarlaTighe 1992) In using this distribution re-

searchers have followed the long history of theuse of Poisson by actuaries and insurance spe-cialists going back to Ladislaus Bortkiewicz(1898)

An important restriction of the Poisson dis-tribution is that its mean and variance are thesame Although some economic studies con-firmed this assumption for particular accidentdata (eg Christopher M Auld et al 2001) it isoften the case that this restriction is falsifiedThe most common deviation from the Poissonrestriction is that of fat tails ie variance that ishigher than the mean This led researchers touse a negative binomial distribution to accom-modate this regularity by introducing a secondparameter which delinks the relationship be-tween the mean and the variance One naturalinterpretation of the fat tails is that of unob-served heterogeneity and the negative binomialdistribution can be viewed as a Gamma mixtureof Poisson processes Consistent with this viewwe assume that the claim-generating processfollows a Poisson process at the individuallevel but allows unobserved heterogeneity inrisk and estimates a lognormal mixture of Pois-son processes which is similar to a negativebinomial The dispersion parameter we esti-mate is a free parameter which is identi-fied by the fatness of the tails of the claimdistribution

While one would like to allow fat tails of theaggregate claim distribution one may criticizethe assumption of a Poisson distribution at theindividual level for having tails that may be ldquotoofatrdquo Recently Jaap H Abbring et al (2003) andMark Israel (2004) provided evidence for neg-ative state dependence in data from auto insur-ance claims similar to the ones we useControlling for heterogeneity across individu-als these papers show that a second claim isless likely than a first This may happen due toexperience rating or perhaps to more carefuldriving and less intense use of a car after anaccident The Poisson distribution assumes thatthe second accident is just as likely so negativestate dependence may suggest thinner tails atthe individual level

To verify that the main results are not sensi-tive to this restriction of the Poisson model weperform two tests First we estimate the bench-mark model on a sample that includes onlyindividuals with one or no claims The model isstill identified using variation in policy durations

769VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

The estimates of the level of risk aversion andits dispersion across individuals remain similarto those of the benchmark model (Table 6) Theestimates of risk and the correlation with riskaversion are significantly lower But this isexpected we selected out of the sample thehigh-risk individuals

As a further test for the sensitivity of theresults to possible negative state dependencewe estimated the model with a different distri-bution at the individual level which gives riseto much thinner tails We could not find anyknown distribution of count variables that givesrise to tails thinner than Poissonrsquos so we madeone up In particular for a given Poisson ratelet pn be the probability of observing n occur-rences The distribution we take to the data issuch that

(17) Prn pn2

yenm0 pm2

Thus this distribution makes the probability ofmultiple claims much lower probably muchmore so than any realistic negative state depen-dence in the data Essentially such a distribu-tion makes the econometrician view individualswith multiple claims as high-risk individualswith much more certainty as it requires muchmore ldquobad luckrdquo for low-risk individuals tohave multiple claims The estimates for thelevel of risk aversion and its dispersion arealmost the same as in the benchmark model(Table 6) The rest of the summary figures aredifferent The difference in the level of risk isnot informative the interpretation of isslightly different given the change in the un-derlying distribution The dispersion in ismuch higher This is a direct result of the thin-tail assumption The dispersion of is identifiedby the tails of the claim distribution at theaggregate level With thinner tails imposed onthe individual distribution more heterogeneityis needed to match the observed tails at theaggregate Finally the correlation coefficient inthis specification changes signs and is now neg-ative and close to 1 This is consistent withour discussion in Section IIC which emphasizesthat the identification of the correlation coeffi-cient is closely tied to the structural assump-tions Once these are changed the estimatedcorrelation coefficient would change too Most

importantly however the mean and dispersionof risk aversion are stable

The Distribution of Absolute Risk Aver-sionmdashIn the benchmark model we assume thatthe coefficient of absolute risk aversion is lognor-mally distributed across individuals Since onlyfew previous studies focused on heterogeneity inrisk preferences there is not much existing evi-dence regarding the distribution of risk prefer-ences The only evidence we are aware of is theexperimental results presented by Steffen Andersenet al (2005) which show a skewed distributionwith a fat right tail which is qualitatively consis-tent with the lognormal distribution we assume

An additional advantage of the normality as-sumption is computational It provides a closed-form conditional distribution allowing us to useonly univariate (rather than bivariate) draws inthe estimation procedure significantly reducingthe computational burden One may be con-cerned about the sensitivity of the results to thisdistributional assumption For example it maydrive the result that the median level of riskaversion is much lower than the mean

Incorporating alternative distributional as-sumptions significantly complicates and slowsthe estimation procedure44 As an alternativewe develop a procedure that we believe pro-vides some guidance as to the sensitivity of theresults to this distributional assumption Thedisadvantage of the procedure is that it cannotaccount for adverse selection Since we foundthat adverse selection is not that important thisexercise is informative The exercise conveysthat the main qualitative results are not drivenby the (fat) tails of the lognormal distributionwe impose Rather they are driven by the highfraction of individuals who chose a low deduct-ible despite being of low risk The model im-plies that such individuals must have a fairlyhigh level of risk aversion

The exercise uses a Gibbs sampler to estimatethe lognormal distribution of i given the co-variates and the observed number of claims Ineach iteration of this Gibbs sampler conditional

44 A simple discrete-type distribution of risk aversion isnot well identified with the given data This is due to thenature of the exercise The distribution is identified bythe binary deductible choice and by the ldquolocalrdquo variation inthe pricing menu This variation is not enough to pin downthe discrete types accurately

770 THE AMERICAN ECONOMIC REVIEW JUNE 2007

on the data and the most recent draws for theindividual irsquos we compute lower bounds forthe level and dispersion of risk aversion (inaddition to generating the draws for ri from theconditional lognormal distribution) To deter-mine the lower bound for the level of risk aver-sion we compute

(18) r i i 0 if choicei 0

max0 pi

idi 113d i

if choicei 1

Namely we make ri as low as possible giventhe assumptions of the model We assume thatindividuals who chose a regular deductible arerisk neutral while individuals who chose a lowdeductible are just indifferent between the twodeductible levels (unless i is high enough inwhich case they are also assumed to be riskneutral) We then compute the average over therirsquos To compute the lower bound for the dis-persion we search for values of rirsquos that areconsistent with the assumptions of the modeland the observed data and that minimize thevariance of ri This turns out to be a simplesearch procedure that is linear in the number ofindividuals

The lower bound of the level of risk aversionthat we obtain from this exercise is 368 104

NIS1 (with standard deviation of 638 106)This level is five times lower than the analogouspoint estimate reported in Table 4 This trans-lates into indifference about participating in a50ndash50 lottery of gaining $100 and losing $885Thus it still suggests a significant level of riskaversion for lotteries of $100 magnitude Simi-larly the conversion to relative risk aversion asin Table 5 implies a relative risk aversion co-efficient of 1883 The result for the lowerbound of the dispersion is 189 103 (withstandard deviation of 816 105) which is tentimes lower than the benchmark estimate Thusthe coefficient of variation for absolute riskaversion declines only by two suggesting ahigh degree of heterogeneity Finally oneshould note that these bounds involve extremeassumptions and are computed separately andtherefore cannot be binding at the same timeThus the ldquocorrectrdquo estimates for both quantities

are likely to be higher closer to the results wereport for the benchmark model

Incomplete Information about RiskmdashThroughout we assume that individuals haveperfect information about their individual-specific objective risk type i This is a strongerassumption than we need Because expectedutility is linear in probabilities it suffices thatindividualsrsquo expected risk rate is equal to theirobjective risk rate ie i E(iIi) i wherei is a random variable representing individualirsquos perceived risk rate and Ii is individual irsquosinformation at the time of the coverage choiceNamely individuals could be uncertain abouttheir risk type but their point estimate has to becorrect

There are several channels through whichincomplete information may operate Let usconsider two such cases First suppose thatindividuals are correct but only on average iethat i i i where E(i) 0 The intuitionfor this case is similar to an ldquoerrors in variablesrdquomodel and in principle will result in an evenless important role for adverse selection Giventhat we find a relatively small effect of adverseselection this bias will not change this conclu-sion This may be even more pronounced ifcorr(i i) 0 which reflects a reasonableassumption of ldquoreversion to the meanrdquo ie thatan individualrsquos estimate of his risk type is someweighted average between his true risk type andthe average risk type of individuals who aresimilar (on observables) to him The conclusionmay go in the other way only if the mistakes goin the other direction according to which indi-viduals who are riskier than average believe thatthey are even more risky than they truly areThis we believe is less plausible

As a test for the sensitivity of the results tothe complete information assumption we per-form two exercises The first exercise estimatesthe benchmark model separately for experi-enced drivers and inexperienced drivers wherewe define an individual to be an experienceddriver if he has ten years or more driving expe-rience This is a conservative definition com-pared to those used in the literature (Chiapporiand Salanie 2000 Cohen 2005) The underlyingassumption consistent with evidence providedin Cohen (2005) is that drivers learn about theirown risk types as they gain more driving expe-rience but that most of this learning occurs

771VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

within the first ten years of driving Thus ourcomplete information model is a more appeal-ing description of experienced drivers Theresults for experienced drivers are almost iden-tical to the results from the benchmark specifi-cation (Table 6) suggesting that the mainresults are not driven by the fraction of individ-uals who are less likely to know their risk typesThe results for inexperienced drivers show asimilar pattern and magnitude Consistent withthe learning hypothesis however they show alarger dispersion in risk preferences which maybe due to incomplete information and there-fore to more random choices of deductiblelevels

The second exercise we perform is a morestructural version of the learning story whichallows even experienced drivers to have incom-plete information regarding their risk types Wedo so by estimating a different specification ofthe model that assumes that individuals areBayesian and update their information abouttheir own (stable) risk types over time usingonly information about the number of claimsthey make each year While we do not observecomplete claims histories of individuals wecan simulate such histories and integrate overthese simulations Thus individualsrsquo informa-tion would be related to their true type andwould be more precise with longer driving his-tories The full modeling and estimation detailsof this specification are provided in the Appen-dix We view this model as an extreme versionof incomplete information as there are manyother sources through which individuals maylearn about their own types and thereby havebetter information about their types than whatwe estimate them to have While the resultsimply that the levels of risk aversion and heter-ogeneity are lower than the benchmark esti-mates (Table 6) the order of magnitude andqualitative pattern are quite similar suggestingthat the main qualitative findings are robust tothe information structure

Sample SelectionmdashThere are two differentways to think about the contract selection pro-cess One possibility is that individuals firstselect an insurer based on advertisement wordof mouth or access to an agent Then individ-uals select the insurance contract from amongseveral contracts the selected insurer offersThis selection process is consistent with the way

the company executives view their business45

Another possibility is that individuals first col-lect information about all available insurancecontracts and then choose their most preferredone According to the industryrsquos conventionalwisdom individuals do not shop much acrossinsurers and therefore this selection processseems less important

The unique characteristics of direct insurersmay attract individuals who are more likely toexperiment with new ways of doing businessand may therefore be less risk averse than thegeneral population In Table 7 we compare thedemographics of our sample of policyholderswith those of the general Israeli populationThis comparison reflects a similar intuitioncompared with the general population our av-erage policyholder is slightly younger moreeducated more likely to be male and less likelyto be married or an immigrant This direction ofselection may also apply to unobserved riskpreferences thereby making our policyholderson average less risk averse than a representa-tive individual This suggests that the level ofrisk aversion that we find may be viewed as alower bound on the level of risk aversion in thegeneral Israeli population

One potential way to model sample selectionis to allow for an additional outside option to beselected For the vast majority of individuals weobserve the outside option is to purchase sim-ilar insurance from competing insurance agen-cies Unfortunately data on the structure ofcompeting contracts their prices and the waythey vary with individual characteristics are un-available This makes us uncomfortable to try tomodel sample selection as results from anysuch model will be driven by our assumptionsrather than by meaningful variation in the dataThe results are still meaningful for two reasonsFirst this is a large population accounting forabout 7 percent of all drivers in Israel Secondto the extent that our estimates suggest higherlevels of risk aversion than previously esti-mated and that the sample selection is likely to

45 As discussed in Section I the company was the firstdirect insurance provider and it offered significantly lowerpremiums than those offered by competitors due to signif-icant cost advantage In Section I we also discuss theliterature which emphasizes that choice of direct insurers isdriven primarily by nonmonetary ldquoamenitiesrdquo

772 THE AMERICAN ECONOMIC REVIEW JUNE 2007

bias these estimates downward the results arestill informative

To assess the magnitude of sample selectionin driving the main results we perform severaltests First we estimate the benchmark modelseparately for the first two years of the compa-nyrsquos operation during which the company wasthe only company to sell insurance directly andfor the remaining three years after additionaldirect insurers entered the market Second weestimate the benchmark model separately forindividuals who were referred to the companyby word of mouth and for those who heardabout the company through advertisement (pri-marily on television) The latter may havesearched more and might be more price sensi-tive Third we estimate the benchmark modelseparately for individuals who renewed theirpolicy with the company and for those who didnot renew It seems likely that individuals whodid not renew are less selected as most of themswitch back to regular insurers who insure themajority of the population Fourth we estimatethe model for the second deductible choicemade by those individuals who renew It couldbe argued that switching to other companies is

costly so outside options are not as important indriving deductible choices for those who renewAll these results are summarized in Table 6 andshow some important differences between thetwo groups within each pair of subsamples prob-ably reflecting selection For all groups howeverthe qualitative pattern of the results and the orderof magnitude and dispersion of risk aversion aresimilar to those of the full sample This is sugges-tive that correcting for sample selection is unlikelyto change the qualitative results

E Caveats

Moral HazardmdashThroughout our analysiswe abstract from moral hazard ie we assumethat i can vary across individuals but is invari-ant to the coverage choice There are two typesof moral hazard that may play a role in thiscontext First individuals with less coveragemay take greater precaution and drive morecarefully thereby reducing their claim risk rateSecond conditional on a claim event peoplewith higher deductibles are less likely to file aclaim there exists a range of claims for which

TABLE 7mdashREPRESENTATIVENESS

Variable Sampleb Populationc Car ownersd

Agea 4114 (1237) 4255 (1801) 4511 (1413)Female 0316 0518 0367Family Single 0143 0233 0067

Married 0780 0651 0838Divorced 0057 0043 0043Widower 0020 0074 0052

Education Elementary 0029 0329 0266High school 0433 0384 0334Technical 0100 0131 0165College 0438 0155 0234

Emigrant 0335 0445 0447

Obs 105800 723615 255435

a For the age variable the only continuous variable in the table we provide both the mean and the standard deviation (inparentheses)

b The figures are derived from Table 1 The family and education variables are renormalized so they add up to one weignore those individuals for whom we do not have family status or education level This is particularly relevant for theeducation variable which is not available for about half of the sample it seems likely that unreported education levels arenot random but associated with lower levels of education This may help in explaining at least some of the gap in educationlevels across the columns

c This column is based on a random sample of the Israeli population as of 1995 We use only adult population ieindividuals who are 18 years of age or older

d This column is based on a subsample of the population sample The data provide information about car ownership onlyat the household level not at the individual level Thus we define an individual as a car owner if (a) the household owns atleast one car and the individual is the head of the household or (b) the household owns at least two cars and the individualis the spouse of the head of the household

773VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

filing is profitable only under a low deductible(the literature often refers to this second effectas ldquoex post moral hazardrdquo)

To the extent that moral hazard exists abstract-ing from it will likely bias our estimates of riskaversion downward To see this note that adjust-ing behavior will help individuals to self-insureagainst uninsured costs Similarly ex post moralhazard would reduce the value of a low deduct-ible as in the event of a claim the gain from alower deductible would sometimes be less thanthe difference between the two deductible levelsBoth these effects will make a low deductible lessattractive requiring individuals to be even morerisk averse than we estimate in order to purchasemore coverage Below we discuss why in ourview abstracting from moral hazard is a reason-able approximation in this setting

It seems reasonable to conjecture that ceterisparibus insured individuals will drive less care-fully than uninsured ones It may also seemreasonable that the existence of a deductiblemay make individuals more careful about smalldamages to their car However when all choicesinclude a deductible and deductibles are similarin their magnitude it seems less likely thatdriving or care behavior will be affected46 Fi-nally to separately identify moral hazard onewould need another dimension of the data overwhich risk types remain fixed but coveragechoices vary exogenously (Israel 2004 Jaap HAbbring et al forthcoming)

We rely on the data to justify why we abstractfrom the second potential effect that of ex postmoral hazard Figure 3 presents data on theclaim amounts and shows that about 99 percentof the claims filed by policyholders with low-deductible policies were for amounts greaterthan the higher deductible level If the distribu-tion of amounts of potential claims does notvary with the deductible choice and if individ-uals file a claim for any loss that exceeds their

deductible level this suggests that 99 percent ofthe claims would have been filed under eitherdeductible choice making the assumption toabstract from moral hazard not very restrictive

Individuals however may choose not to filea claim even when the claim amount exceedsthe deductible level This may happen due toexperience rating which increases future insur-ance premiums These dynamic effects do notdepend on the deductible level at the time of theclaim so they simply enter in an additive wayUsing our data on individuals who renew theirpolicies with the company we can assess howbig the dynamic effects are These data showthat the price effect of a claim lasts for threeyears and is highest when an individual files hissecond claim within a year In such a case hewould face about a 20 percent increase in hisinsurance premium in the subsequent year 10percent in the year after and 5 percent in thethird year after the claim The regular premiumis about twice the regular deductible amount soan upper bound for the dynamic costs is 70percent of the regular deductible In most casesthe actual dynamic costs are much lower thanthis upper bound (the dynamic costs of say thefirst claim within a year are close to zero) Inaddition an individual can opt out of the con-tract and switch to a different insurance pro-vider This is likely to reduce his dynamic costsbecause in Israel unlike in the United Statesand many other countries there is no publicrecord of past claims Therefore insurance pro-viders can take full advantage of past recordsonly for their past customers For this reasonnew customers will of course face higher pre-miums than existing ones but the premium in-crease would not be as high as it would havebeen with the old insurance provider This isdue to the presence of ldquoinnocentrdquo new custom-ers who are pooled together with the switchers(Cohen 2003)47 Using 70 percent as a conser-vative upper bound Figure 3 shows that about93 percent of those claims filed by individualswith a low deductible would have also been46 This assumption is consistent with Cohen and Einav

(2003) who find no evidence for behavioral response tochanges in seat belt laws The following anecdotal obser-vation also supports this In an informal survey we con-ducted among our colleagues all of them were aware of adeductible in their auto insurance policy but fewer than 20percent knew its level This does not imply that 80 percentof our colleagues did not pay attention to their deductiblechoice at the time the choice was made It does implyhowever that their driving or care behavior cannot dependon the deductible level

47 New customers may voluntarily report their claimshistory to their new insurance provider Voluntary disclo-sure of past claims is as may be expected not truthful Ourdata suggest an unconditional claim rate of 02453 in oursample population Our data on claims history as voluntar-ily disclosed by the same individuals suggest a claim rate of00604 which is four times lower

774 THE AMERICAN ECONOMIC REVIEW JUNE 2007

filed with a regular deductible While this is notnegligible it applies for only a small fraction ofindividuals For the vast majority of them the99 percent discussed above is a more appropri-ate benchmark Therefore ex post moral hazardis unlikely to play a major role in this settingand one can abstract from the loss distributionand focus on claim rates as we do in thispaper48

Additional Cost of an AccidentmdashOur modelassumes that in the event of an accident theonly incurred costs are those associated with thedeductible payment In practice however other

transaction costs may be incurred such as thetime spent for appraisal of the damage the costsassociated with renting a replacement car for theduration of a repair etc Such costs could bereadily incorporated into the model To illus-trate we assume that these costs are known inadvance and are given by a constant c (whichcould in principle vary with each individual)Since c will not vary with the chosen level ofdeductible it will not affect the value d andwill enter the empirical model only through itseffect on d In particular equation (7) willchange to

(19) r

p

d 1

d c

and everything else will remain the same

48 We would not be as comfortable with this statementfor the choice of high and very high deductibles which areat much higher levels This is one additional reason to focusonly on the choice between low and regular deductibles

FIGURE 3 CLAIM DISTRIBUTIONS

Notes This figure plots kernel densities of the claim amounts estimated separately depending on the deductible choice Forease of comparison we normalize the claim amounts by the level of the regular deductible (ie the normalization is invariantto the deductible choice) and truncate the distribution at ten (the truncated part which includes a fat tail outside of the figureaccounts for about 25 percent of the distribution and is roughly similar for both deductible choices) The thick line presentsthe distribution of the claim amounts for individuals who chose a low deductible while the thin line does the same for thosewho chose a regular deductible Clearly both distributions are truncated from below at the deductible level The figure showsthat the distributions are fairly similar Assuming that the claim amount distribution is the same the area below the thickerline between 06 and 1 is the fraction of claims that would fall between the two deductible levels and therefore (absentdynamic incentives) would be filed only if a low deductible were chosen This area (between the two dotted vertical lines)amounts to 13 percent implying that the potential bias arising from restricting attention to claim rate (and abstracting fromthe claim distribution) is quite limited As we discuss in the text dynamic incentives due to experience rating may increasethe costs of filing a claim shifting the region in which the deductible choice matters to the right an upper bound to these costsis about 70 percent of the regular deductible covering an area (between the two dashed vertical lines) that integrates to morethan 7 percent Note however that these dynamic incentives are a very conservative upper bound they apply to less than 15percent of the individuals and do not account for the exit option which significantly reduces these dynamic costs

775VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

This implies that in principle such costs willhave no effect on the results of the counterfac-tual exercise we present later The costs willhowever affect the interpretation of the esti-mates of risk aversion In particular instead ofthe distribution of r we will now be estimatingthe distribution of r(d c)d so the reportedestimates of the coefficient of absolute riskaversion will be biased upward The magnitudeof the bias depends on the size of this transac-tion cost c compared to the average deductibled If c is relatively small the bias is negligibleIf however c is as big as the (average) deduct-ible level all our reported estimates of the levelof risk aversion should be divided by two (butthe coefficients on observables which are semi-elasticities will not change) The intuitionwould be similar but more involved if c variesacross individuals but not proportionally to d

Data about such transaction costs are ofcourse not available The following back-of-the-envelope exercise may provide some guidanceas to the magnitude of such costs We collecteddata from the Israeli police about the annual num-bers of accidents49 accidents with fatalities andaccidents with severe injuries in Israel for theyears 1996ndash1999 We then divided these numbersby our estimate of the total number of auto insur-ance claims in Israel (161859)50 We obtain that159 percent of the claims involve reported acci-dents 22 percent involve accidents with severeinjuries and 03 percent involve fatal accidentsThus the majority of claims reflect small unre-ported accidents perhaps suggesting that theseadditional costs of a claim are often not too large

Deviations from Expected Utility TheorymdashThroughout the paper we restrict attention toexpected utility maximizers Despite much ev-idence in the literature against some of the pre-dictions of expected utility theory it still seemsto us the most natural benchmark to specify andone that facilitates comparison to previous stud-ies We note that expected utility theory is as-sumed it is not and cannot be tested within our

framework Given our cross-sectional analysiswhich in principle allows flexible forms ofunobserved heterogeneity in risk preferencesthere are no testable restrictions imposed byexpected utility theory We should also note thatmuch (but not all) of the documented evidenceagainst expected utility theory arises with ex-treme risk probabilities which are close to zeroor one Our data (and our estimates) are basedon risk probabilities that are all in the range of010 to 035 Over this range expected utilityseems to perform better Finally it is importantto stress two points First at the conceptuallevel it is straightforward to use an alternativetheory of decisions under uncertainty If con-ditional on objective risk individuals vary in asingle dimension the same conceptual modeland empirical strategy can be applied All oneneeds to do is to specify the parameter overwhich decisions vary and construct an indiffer-ence set in the space of the specified parameterand (objective) risk types similar to the onepresented in Figure 2 Second any alternativemodel of decisions under uncertainty would re-quire us to take an even stronger view regardingthe parameterized objective function For exam-ple prospect theory (Daniel Kahneman andAmos Tversky 1979) would require us to pa-rameterize not only the curvature of individu-alsrsquo utility functions but also their referencepoints for which there is no natural choice inour context Similar issues arise if we try toapply decision weights (Tversky and PeterWakker 1995) or measures of overconfidencewith respect to driving ability

F Implications for Profits and Pricing

We now look at how firm profits vary withalternative pricing schemes This exercise isinteresting for several reasons First althoughwe do not use supply-side information for esti-mation it shows how one may incorporate suchinformation in estimation Second it illustratesthe conceptual trade-off faced by a monopolistthat operates in a market with adverse selectionAlthough the conceptual trade-off betweenhigher demand and worse selection is wellknown and has been extensively analyzed in thetheoretical literature quantifying it is importantto understand its empirical relevance Finallywe argued earlier that unobserved heterogeneityin risk aversion seems more important than un-

49 An accident is counted in this measure if it was re-ported to the police and a police officer arrived at the scene

50 This is estimated by taking the number of activepolicies at the end of our sample (45871) dividing it by ourbest guess for the share of the market the company had atthe time (7 percent) and multiplying it by the estimatedclaim rate in our data as computed in Table 2 (0245)

776 THE AMERICAN ECONOMIC REVIEW JUNE 2007

observed heterogeneity in risk The current ex-ercise shows that this conclusion also translatesto pricing and profits

Throughout this section we hold fixed thedistribution of risk and risk aversion the firmfaces Since we have little information about thedeterminants of overall demand faced by thefirm and hence how restrictive it is to hold thedistribution fixed we make the simplifying as-sumption that individuals make their choicessequentially They first choose the insuranceprovider by observing only the price of theregular deductible Once they decide to buy apolicy from the insurer they choose the deduct-ible level This seems a reasonable approxima-tion as the regular deductible is the one alwaysadvertised and initially quoted while the otheroptions are revealed only once the potentialcustomer and the insurance sales person ldquogetinto detailsrdquo Consistent with this assumptionwe assume that the regular premium and de-ductible are dictated by competitive conditionsand we focus on the choice of the low deduct-ible and its associated premium

From the companyrsquos standpoint each indi-vidual can be represented by a random draw of(i ri) from the conditional (on observables)distribution of risk and risk aversion

(20) ln i

ln ri Nxi

xi

2 r

r r2

When analyzing the optimal menu to offer suchan individual the company is assumed to berisk neutral and to maximize expected profitsSuppose the company offered only the regulardeductible and premium (dh ph) Let the ex-pected profits from this strategy be 0 Givenour assumptions we proceed by analyzing howthe firmrsquos profits are affected by a choice of anadditional low deductible option (dl pl) withdl dh and pl ph It is easy to use a changein variables and analyze the choice of d dh dl and p pl ph Expected profits arenow given by

(21) maxdp

0 Prri rii d pp

d Eiri rii d p

The trade-off in the companyrsquos decision isstraightforward Each new customer who chooses

the low combination pays an additional p up-front but saves d for each accident she is in-volved in This translates into two effects thatenter the companyrsquos decision problem The first issimilar to a standard pricing problem higher(lower) price difference (deductible difference)p (d) leads to a higher markup (on those indi-viduals who select the low deductible) but tolower quantity (or probability of purchase) asfewer individuals elect to choose the low deduct-ible This effect enters the profit function throughD(d p) Pr(ri ri(i d p)) The secondcomposition effect arises because of adverse se-lection As the price of the low deductible in-creases those individuals who still elect to choosethe low combination are ceteris paribus thosewith higher risk This effect enters through (dp) E(iri ri(i d p)) Its magnitude andsign depend on the relative heterogeneity of i andri and on the correlation between them Sinceneither D(d p) nor (d p) has a closed-form solution we analyze this decision problemgraphically where D(d p) and (d p) arenumerically computed using simulations from thejoint distribution

We illustrate our analysis by using the meanindividual in the data whose expected ln(i)and ln(ri) are 154 and 1181 respectively(based on Table 4) Such an individual has tochoose between a regular contract of ( phdh) (3189 1595) (in NIS) and a lowcontract of ( pl dl) (3381 957) ie (pd) (191 638) Below we discuss severaladditional counterfactual cases First we con-sider a case with a negative correlation be-tween risk and risk aversion (with the samemagnitude ie 084) Second we con-sider cases when the company ignores unob-served heterogeneity in one of the dimensionsie it views individuals as a draw from theestimated marginal distribution on one dimen-sion with the other dimension known and fixedat its estimated mean We do this exercise foreach dimension separately

To get intuition for the different effects Figure2 presents the estimated distribution in the spaceof (i ri) A small increase (decrease) in p (d)shifts the indifference setup to the right therebymaking some marginal individuals who were pre-viously just to the right of it switch to choosingthe regular deductible The demand trade-off isjust the comparison between the marginal loss ofthe company from all the marginal individuals

777VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

who no longer buy higher coverage with thehigher profits made from the inframarginal indi-viduals who still elect to choose higher coverageFigure 2 also helps in illustrating the effect ofadverse selection and the importance of the cor-relation coefficient As the menu shifts to the rightthe positive correlation implies that the marginalindividuals have higher risk than the average Thismeans that ldquolosingrdquo them (namely havingthem buy less coverage) is not as costly forthe insurance company as such individualsare on average more adversely selected Anegative correlation for example would havemade these marginal individuals more valuablethereby decreasing the incentive to increase pricesor deductibles from the current levels

Figure 4 presents the effects of pricing onprofits by varying the low-deductible levelkeeping the premium charged for it fixed at theobserved price (of p 191) It implies that the

current low deductible benefit of 638 NIS re-sults in additional annual profits of about 368NIS per customer This is about 037 percent oftotal operating profits per customer which isabout 1000 NIS Note however that after sub-tracting the administrative and claim-handlingcosts associated with each customer and claimthe relative magnitude of this effect will behigher Note also that the estimates imply thatthe current low-deductible level is suboptimalBy setting a smaller low-deductible benefit ofd 355 NIS (ie increasing the current lowdeductible by 283 NIS) additional profits canbe increased to 659 NIS51 Of course the find-

51 There is no reason of course to limit the choice of thecompany to only one additional deductible level Moredegrees of freedom in choosing the pricing menu will leadto higher profits

FIGURE 4 COUNTERFACTUALSmdashPROFITS

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) We plot the additional profits fromoffering a low deductible as a function of the attractiveness of a low deductible d dh dl fixing its price at the observedlevel The thick solid line presents the counterfactual profits as implied by the estimated benchmark model The other threecurves illustrate how the profits change in response to changes in the assumptions when the correlation between riskand risk aversion is negative (thin solid line) when there is no heterogeneity in risk aversion (dot-dashed line) and whenthere is no heterogeneity in risk (dashed line) The maxima (argmax) of the four curves respectively are 659 (355)714 (583) 0 and 704 (500) The dotted vertical line represents the observed level of d (638) which implies that theadditional profits from the observed low deductible are 368 NIS per policy

778 THE AMERICAN ECONOMIC REVIEW JUNE 2007

ing that the current pricing is suboptimal mayalso be due to a limitation of the model

Consistent with the intuition discussed aboveFigure 4 also shows that the incentive to in-crease prices (or lower the deductible) is higherwith positive correlation When the correlationbetween risk and risk aversion is negative theoptimal low deductible is lower It also showsthat ignoring either dimension of unobservedheterogeneity has an important effect on pric-ing decisions However while ignoring het-erogeneity in risk does not change thequalitative pattern by much ignoring hetero-geneity in risk aversion completely changesthe pricing analysis In fact given the esti-mated coefficients and the observed prices ifindividuals vary only in risk offering a lowdeductible does not increase profits due to ad-verse selection

Figure 5 breaks down the effect on profits bypresenting the pricing effect on the demand forlow deductible D(d p) and on the compo-sition effect (d p) The former is simplygenerated by the distribution of certainty equiv-alents implied by the joint distribution of i andri (Landsberger and Meilijson 1999) It is S-shaped due to the assumption of lognormal dis-tribution The shape of the composition effect isdriven by the relative variance of i and ri andby the correlation coefficient As the estimatesimply that most of the variation in certaintyequivalents is driven by variation in ri thestrong positive correlation implies that the com-position effect is monotonically decreasing inthe deductible level As the low-deductible op-tion becomes more favorable more peoplechoose it with the most risky individuals choos-ing it first The effect of the deductible level onthe composition effect is dramatically differentwhen the correlation between risk and risk aver-sion is negative When this is the case theobserved relationship between the deductiblelevel and the composition effect is mostly re-versed This is because the effect of risk aver-sion dominates that of adverse selection due toits higher variance

IV Concluding Remarks

The paper makes two separate contributionsFirst from a methodological standpoint we layout a conceptual framework through which one

can formulate and estimate a demand system forindividually customized contracts The key datarequirements for this approach are contractchoices individual choice sets and ex post riskrealizations Since such data may be available inmany other contexts the methodological frame-work may be useful to uncover structural pa-rameters in such settings As an example onecould consider annuity data and use guaranteeperiod choices and mortality data to identifyheterogeneity in risk (mortality) and in prefer-ences for wealth after death (Einav Finkelsteinand Paul Schrimpf 2006) Similarly one couldconsider loan data and use down-paymentchoices and default data to identify heterogene-ity in risk (default) and in liquidity (Will Ad-ams Einav and Jonathan D Levin 2006)Second from an economic standpoint we pro-vide a new set of estimates for the degree andheterogeneity of (absolute) risk aversion and itsrelationship with risk We discuss these below

While our estimates of risk aversion help topredict other related insurance decisions it isnatural to ask to what extent these parametersare relevant in other contexts This is essentiallyan empirical question which can be answeredonly by estimating risk aversion parameters fora variety of bet sizes and in a variety of con-texts Since isolating risk preferences in manycontexts is hard such exercises are rare leavingus with no definite answer for the scope ofmarkets for which our estimates may be rele-vant On one hand Rabin (2000) and Rabin andThaler (2001) argue that different decisions inlife are taken in different contexts and thereforemay be subject to different parameters in theutility function On the other hand classicaltheory suggests that each individual has onevalue function over her lifetime wealth so allrisky decisions take into account the same valuefunction and are therefore subject to the samerisk preferences Our view is somewhere inbetween We are more comfortable with extrap-olation of our risk aversion estimates to setupsthat are closer to the auto insurance marketcontext in which these parameters are esti-mated To assess ldquoclosenessrdquo it is important toconsider various factors over which contextsmay differ Such factors may include bet size aswell as informational and behavioral effectssuch as default options framing and rarity of theevents Since we use claim data to identify risktype we leave everything else to be interpreted as

779VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

FIGURE 5 COUNTERFACTUALSmdashSELECTION

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) Parallel to Figure 4 we breakdown the effects on profits to the share of consumers who choose a low deductible (bottom panel) and to the expected riskof this group (top panel) This is presented by the thick solid line for the estimates of the benchmark model As in Figure 4 wealso present these effects for three additional cases when the correlation between risk and risk aversion is negative (thin solidline) when there is no heterogeneity in risk aversion (dot-dashed line) and when there is no heterogeneity in risk (dashedline) The dotted vertical lines represent the observed level of d (638) for which the share of low deductible is 16 percentand their expected risk is 0264 This may be compared with the corresponding figures in Table 2A of 178 and 0309respectively Note however that the figure presents estimated quantities for the average individual in the data while Table2A presents the average quantities in the data so one should not expect the numbers to fit perfectly

780 THE AMERICAN ECONOMIC REVIEW JUNE 2007

risk aversion As an example overconfidence willbe captured by a lower level of estimated riskaversion so if overconfidence is more importantin auto insurance than in health insurance whenextrapolated to health insurance individuals maybehave as if they are more risk averse than weestimate them to be

With this caveat in mind let us discuss ourfour main findings First we find large hetero-geneity in risk preferences across individualsThis heterogeneity is important in various con-texts (a) it cautions against using pure cross-sectional variation to test expected utilitytheory (b) it may create strong selection ofparticipants into particular markets this maymake participants in voluntary markets quitedifferent in their risk preferences from those inmarkets in which participation is either manda-tory or driven by other factors and (c) it maymake the marginal individual quite differentfrom the average one Models in macroeconom-ics and finance which often use a representativeindividual framework may not be able to cap-ture and account for such differences

Our second set of findings concerns the wayrisk aversion relates to observable characteris-tics Our finding that women are more riskaverse than men has been documented in othersettings The finding that risk preferences ex-hibit a U-shaped pattern over the life cycle maybe interesting to explore further in other con-texts Other findings suggest that the estimatedcoefficient of risk aversion increases with ob-servables that are related to income and wealthAs we agree with the widely held belief of thedecreasing absolute risk aversion property ourpreferred interpretation for this finding is thatwealth and income may be endogenous gener-ating the estimated cross-sectional relationshipWhile lower risk aversion may be associatedwith higher propensity to become an entrepre-neur and thereby have higher wealth it mayalso be associated with lower propensity to saveor invest in education affecting wealth the otherway Therefore one important message of thisfinding is that accounting for heterogeneity inpreferences may be important as representativeconsumer models may provide misleading in-terpretations for otherwise natural results

The last two sets of findings concern therelationship between risk and preferences Sincerisk is particular to the context in which it ismeasured these findings may be sensitive to the

market context and may change once risk takesother forms Even within the auto insurancemarket it is important how risk is measuredMoreover risk in the auto insurance marketmay be conceptually different from risk in othermarkets In many markets risk is independentacross individuals In auto insurance howevermuch of the risk depends on coordinationamong drivers and therefore may be more re-lated to relative not absolute characteristicsFor this reason our finding of positive correla-tion between risk and risk aversion can coexistwith findings of negative correlation in othercontexts (Israel 2005 Finkelstein and McGarry2006) The positive correlation we find is alsoconsistent with the fact that the bivariate probittest in our data provides evidence for adverseselection (Cohen 2005) while similar reduced-form tests in other contexts do not

Finally we find that unobserved heterogene-ity in risk preferences is more important thanheterogeneity in risk This may be driven by thecasual evidence that insurance companies exertmuch effort and resources in collecting con-sumer data which are informative about riskclassification but not about preferences52 Weillustrate the empirical importance of our find-ings for the analysis of optimal contracts in autoinsurance The presence of more than one di-mension of unobserved heterogeneity may dra-matically change the nature of these contractsTheory is still not fully developed for suchmultidimensional screening problems as it typ-ically requires a small number of types (Lands-berger and Meilijson 1999) restricts the twodimensions to be independent of each other(Jean-Charles Rochet and Lars A Stole 2002)or assumes that the number of instrumentsavailable to the monopolist is not smaller thanthe dimension of unobserved heterogeneity(Steven Matthews and John Moore 1987 Rich-ard Arnott and Joseph E Stiglitz 1988)53 MarkArmstrong (1999) may be the closest theoreticalwork to the framework suggested here It cannot

52 This choice of data collection efforts may be justifiedif it is easier for such firms to price discriminate based onrisk but harder to price discriminate based on preferencesA common belief is that without cost-based (ie risk-based) justification for prices price discrimination may leadto consumer backlash

53 See also Smart (2000) Villeneuve (2003) and Jullienet al (2007) for related theoretical results

781VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

be directly applied however as it uses simpli-fying linearity assumptions which would behard to impose in the current context Our re-sults indicate that many applications can benefitfrom extending the theory to include the more

general case such as the one analyzed hereSuch a theory may also serve as a guide forusing supply-side restrictions in similar con-texts Our counterfactual analysis is a very pre-liminary start in this direction

APPENDIX A DESCRIPTION OF THE GIBBS SAMPLER

In this appendix we describe the setup of the Gibbs sampler that we use to estimate the modelOne of the main advantages of the Gibbs sampler is its ability to allow for data augmentation oflatent variables (Tanner and Wong 1987) In our context this amounts to augmenting the individual-specific risk aversion and risk type namely i rii1

n as additional parametersWe can write the model as follows

(22) ln i xi i

(23) ln ri xi vi

(24) choicei 1 if ri ri ()0 if ri ri ()

(25) claimsi Poissoni ti

(26) i

vi

iid

N00

2 r

r r2

Let

2 r

r r2 X

x1

xn

0

0 x1

xn

y r and ui i

vi

The set of parameters for which we want to have a posterior distribution is given by uii1

n The prior specifies that are independent of uii1n have a conventional

diffuse prior We adopt a hierarchical prior for uii1N

(27) uii 1n

iid

N0

(28) 1 Wishart2a Q

782 THE AMERICAN ECONOMIC REVIEW JUNE 2007

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 13: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

an unfamiliar posterior distribution makes theuse of a Gibbs sampler quite efficient for ourpurposes Finally the lognormality assumptionimplies that F(ln(i)ri) and F(ln(ri)i) follow a(conditional) normal distribution allowing us torestrict attention to univariate draws furtherreducing the computational burden

The Appendix provides a full description ofthe Gibbs sampler including the conditionaldistributions and the (flat) prior distributions weuse The basic intuition is that conditional onobserving (i ri) for each individual we have asimple linear regression model with two equa-tions The less standard part is to generate drawsfor (i ri) We do this iteratively Conditionalon i the posterior distribution for ln(ri) fol-lows a truncated normal distribution where thetruncation point depends on the menu individ-ual i faces and its direction (from above orbelow) depends on individual irsquos deductiblechoice The final step is to sample from theposterior distribution of ln(i) conditional on riThis is more complicated as we have bothtruncation which arises from adverse selection(just as we do when sampling for ri) and thenumber of claims which provides additionalinformation about the posterior of i Thus theposterior for i takes an unfamiliar form forwhich we use a ldquosliced samplerrdquo

We use 100000 iterations of the Gibbs sam-pler It seems to converge to the stationarydistribution after about 5000 iterations There-fore we drop the first 10000 draws and use thelast 90000 draws of each variable to report ourresults The results are based on the posteriormean and posterior standard deviation fromthese 90000 draws Note that each iterationinvolves generating separate draws of (i ri) foreach individual Performing 100000 iterationsof the benchmark specification (coded in Mat-lab) takes about 60 hours on a Dell Precision530 workstation

C Identification

The parametric version of the model is iden-tified mechanically There are more equationsthan unknowns and no linear dependenciesamong them so (as also verified using MonteCarlo exercises) the model parameters can bebacked out from simulated data Our goal in thissection is not to provide a formal identificationproof Rather we want to provide intuition for

which features of the data allow us to identifyparticular parameters of the model The discus-sion also highlights the assumptions that areessential for identification vis-a-vis those thatare made only for computational convenience(making them in principle testable)

Discussion of Nonparametric Identifica-tionmdashThe main difficulty in identifying themodel arises from the gap between the (exante) risk type i which individuals usewhen choosing a deductible and the (ex post)realization of the number of claims we ob-serve We identify between the variation inrisk types and the variation in risk realizationsusing our parametric distributional assump-tions The key is that the distribution of risktypes can be uniquely backed out from theclaim data alone This allows us to use thedeductible choice as an additional layer ofinformation which identifies unobserved het-erogeneity in risk aversion21 Any distribu-tional assumption that allows us to uniquelyback out the distribution of risk types fromclaim data would be sufficient to identify thedistribution of risk aversion As is customaryin the analysis of count processes we make aparametric assumption that claims are gener-ated by a lognormal mixture of Poisson dis-tributions (Section IIID discusses this furtherand explores an alternative) Using a mixtureenables us to account for adverse selectionthrough unobserved heterogeneity in risk Italso allows us to better fit the tails of theclaim distribution In principle a more flexi-ble mixture or a more flexible claim-generat-ing process could be identified as long as theclaims data are sufficiently rich22

21 Cardon and Hendel (2001) face a similar identificationproblem in the context of health insurance They use vari-ation in coverage choice (analogous to deductible choice) toidentify the variation in health-status signals (analogous torisk types) from the variation in health expenditure (analo-gous to number of claims) They can rely on the coveragechoice to identify this because they make an assumptionregarding unobserved heterogeneity in preferences (iidlogit) We take a different approach as our main goal is toestimate rather than assume the distribution of preferences

22 Although it may seem that the claim data are limited(as they take only integer values between 0 and 5 in ourdata) variation in policy duration generates continuousvariation in the observed claim propensity Of course thisvariation also introduces an additional selection into themodel due to policy cancellations which are potentially

757VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Once the distribution of risk types is iden-tified from claims data the marginal distribu-tion of risk aversion (and its relationship tothe distribution of risk types) is nonparametri-cally identified from the variation in theoffered menus discussed in Section I Thisvariation implies different deductible and pre-mium options to identical (on observables)individuals who purchased insurance at dif-ferent times Different menus lead to differentindifference sets (similar to the one depictedin Figure 2) These sets often cross each otherand nonparametrically identify the distribu-tion of risk aversion and the correlation struc-ture at least within the region in which theindifference sets vary For the tails of thedistribution as is typically the case we haveto rely on parametric assumptions or usebounds The parametric assumption of log-normality we use for most of the paper ismade only for computational convenience

Intuition for the Parametric IdentificationMechanismmdashVariation in the offered menus isimportant for the nonparametric identificationThe parametric assumptions could identify themodel without such variation Thus to keep theintuition simple let us take the bivariate log-normal distribution as given and contrary to thedata assume that all individuals are identical onobservables and face the same menu Supposealso that all individuals are observed for exactlyone year and have up to two claims23 In thissimplified case the model has five parametersto be estimated the mean and variance of risk and

2 the mean and variance of risk aver-sion r and r

2 and the correlation parameter The data can be summarized by five numbersLet 130 131 and 132 1 131 130 be thefraction of individuals with zero one and twoclaims respectively Let 0 1 and 2 be theproportion of individuals who chose a low de-ductible within each ldquoclaim grouprdquo Given ourdistributional assumption about the claim-

generating process we can use 130 and 131 touniquely identify and

2 Loosely isidentified by the average claim rate in the dataand

2 is identified by how fat the tail of theclaim distribution is ie by how large (132131)is compared to (131130) Given and

2 and thelognormality assumption we can (implicitly)construct a posterior distribution of risk typesfor each claim group F(r claims c) andintegrate over it when predicting the deductiblechoice This provides us with three additionalmoments each of the form

(14) Ec Prchoice 1r

dFr claims c dFr

for c 0 1 2 These moments identify thethree remaining parameters of the model rr

2 and Let us now provide more economic content to

the identification argument Using the same ex-ample and conditional on identifying and

2

from the claim data one can think about thedeductible choice data 0 1 2 as a graph(c) The absolute level of the graph identifiesr In the absence of correlation between riskand risk aversion the slope of the graph iden-tifies r

2 with no correlation the slope shouldalways be positive (due to adverse selection)but higher r

2 would imply a flatter graph be-cause more variation in the deductible choiceswill be attributed to variation in risk aversionFinally is identified by the curvature of thegraph The more convex (concave) the graph isthe more positive (negative) is the estimated For example if 0 05 1 051 and 2 099 it is likely that r

2 is high (explaining why0 and 1 are so close) and is highly positive(explaining why 2 is not also close to 1) Incontrast if 0 1 it must mean that thecorrelation between risk and risk aversion isnegative which is the only way the originalpositive correlation induced by adverse selec-tion can be offset This intuition also clarifiesthat the identification of relies on observingindividuals with multiple claims (or differentpolicy durations) and that it is likely to besensitive to the distributional assumptions Thedata (Table 2B) provide a direct (positive) cor-relation between deductible choice and claims

endogenous The results are similar when we use onlyexpired and truncated policies

23 This variation is sufficient (and necessary) to identifythe benchmark model The data provide more variation weobserve up to five claims per individual we observe con-tinuous variation in the policy durations we observe vari-ation in prices and we exploit distributional restrictionsacross individuals with different observable characteristics

758 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The structural assumptions allow us to explainhow much of this correlation can be attributedto adverse selection The remaining correlation(positive or negative) is therefore attributed tocorrelation in the underlying distribution of riskand risk aversion

III Results

A Reduced-Form Estimation

To get an initial sense for the levels of abso-lute risk aversion implied by the data we use asimple back-of-the-envelope exercise We com-pute unconditional averages of p d dh dland d (Table 2A)24 and substitute these valuesin equation (7) The implied coefficient of ab-solute risk aversion is 29 104 NIS125 Thisfigure could be thought of as the average indif-ference point implying that about 18 percent ofthe policyholders have coefficients of absoluterisk aversion exceeding it To convert to USdollar amounts one needs to multiply thesefigures by the average exchange rate (352)resulting in an average indifference point of102 103 $US1 This figure is less than halfof a similar estimate reported by Sydnor (2006)for buyers of homeownerrsquos insurance but about3 and 13 times higher than comparable fig-ures reported by Gertner (1993) and Metrick(1995) respectively for television game showparticipants

Table 3 provides reduced-form analysis ofthe relationship between the observables andour two left-hand-side variables the number ofclaims and the deductible choice26 Column 1reports the estimates from a Poisson regressionof the number of claims on observed character-istics This regression is closely related to therisk equation we estimate in the benchmarkmodel It shows that older people women andpeople with a college education are less likely to

have an accident Bigger more expensiveolder and noncommercial cars are more likelyto be involved in an accident Driving experi-ence and variables associated with less intenseuse of the car reduce accident rates As could beimagined claim propensity is highly correlatedover time past claims are a strong predictor offuture claims Young drivers are 50 to 70 per-cent more likely to be involved in an accidentwith young men significantly more likely thanyoung women Finally as indicated by the trendin the estimated year dummies the accident ratesignificantly declined over time Part of thisdecline is likely due to the decline in accidentrates in Israel in general27 This decline mightalso be partly due to the better selection ofindividuals the company obtained over time asit gained more experience (Cohen 2003)

Columns 2 and 3 of Table 3 present estimatesfrom probit regressions in which the dependentvariable is equal to one if the policyholder chosea low deductible and is equal to zero otherwiseColumn 3 shows the marginal effects of thecovariates on the propensity to choose a lowdeductible These marginal effects do not have astructural interpretation as the choice of lowdeductible depends on its price on risk aver-sion and on risk In this regression we againobserve a strong trend over time Fewer policy-holders chose the low deductible as time wentby One reason for this trend according to thecompany executives is that over time the com-panyrsquos sales persons were directed to focusmainly on the ldquodefaultrdquo regular deductible op-tion28 The effect of other covariates will bediscussed later in the context of the full modelIn unreported probit regressions we also test thequalitative assumptions of the structural model byadding three additional regressors the price ratio(pidi) the average deductible offered d i

24 The unconditional is computed by maximum likeli-hood using the data on claims and observed durations of thepolicies

25 Using the CARA specification as in equation (16) weobtain a slightly lower value of 25 104 NIS1

26 We find positive correlation in the errors of these tworegressions suggesting the existence of adverse selection inthe data and motivating a model with unobserved heteroge-neity in risk This test is similar to the bivariate probit testproposed by Chiappori and Salanie (2000) and replicatesearlier results reported in Cohen (2005)

27 In particular traffic fatalities and counts of trafficaccidents in Israel fell by 11 percent and 18 percent during1998 and 1999 respectively

28 Such biased marketing efforts will bias consumersagainst choosing the low deductible thus making them lookless risk averse This would make our estimate a lowerbound on the true level of risk aversion If only sophisti-cated consumers could see beyond the marketing effort andthis sophistication were related to observables (eg educa-tion) the coefficients on such observables would be biasedupward This is not a major concern given that the coefficientson most of the covariates are fairly stable when we estimatethe benchmark model separately for different years

759VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

TABLE 3mdashNO HETEROGENEITY IN RISK

Variable

Poisson regressiona Probit regressionsc

Dep var Number of claims Dep var 1 if low deductible

(1) (2) (3)

Coeff Std Err IRRb Coeff Std Err dPdX dPdXDemographics Constant 43535 02744 mdash 224994 23562 mdash

Age 00074 00052 09926 01690 00464 00034 00042Age2 989 105 536 105 10001 00017 00005 339 105 455 105Female 00456 00161 09555 08003 01316 00165 00134Family Single omitted omitted omitted

Married 01115 00217 08945 06417 01950 00128 00045Divorced 00717 00346 10743 03191 03127 00064 00044Widower 00155 00527 10156 03441 04573 00069 00025Other (NA) 01347 01755 11442 29518 19836 00511 00370

Education Elementary 00522 00550 09491 02927 04278 00061 00016High school omitted omitted omittedTechnical 00373 00308 10380 05037 02520 00105 00138College 00745 00197 09282 05279 01546 00109 00053Other (NA) 00116 00184 10116 00630 01540 00013 00025

Emigrant 00210 00163 10213 00497 01328 00010 00003

Car attributes Log(Value) 01227 00281 11306 13285 02342 00271 00299Car age 00187 00042 10189 01603 00357 00033 00017Commercial car 01394 00326 08699 09038 02781 00177 00294Log(engine size) 02972 00459 13461 06924 03952 00141 00075

Driving License years 00204 00034 09798 01043 00312 00021 00005License years2 00002 677 105 10002 00015 00005 299 105 167 105

Good driver 00176 00191 09825 07207 01574 00148 00152Any driver 00564 00169 10580 10923 01321 00217 00258Secondary car 00859 00209 09177 00038 01626 00001 00070Business use 01852 00293 12034 07951 02737 00156 00017History length 00527 00110 09486 11218 01450 00228 00171Claims history 06577 00390 19304 04654 05460 00095 00496

Young driver Young driver 05235 00361 16879 28847 06305 00524 00012Gender Male omitted omitted omitted

Female 01475 00288 08629 17959 03135 00396 00195Age 17ndash19 omitted omitted omitted

19ndash21 00701 00532 10726 07800 07509 00153 0014721ndash24 00267 00574 09737 05746 07773 00114 0015624 02082 00567 0812 17869 07328 00397 00136

Experience 1 omitted omitted omitted1ndash3 02416 00458 07854 10175 06577 00217 000093 02827 00532 07538 32513 07386 00762 00410

Company year First year omitted omitted omittedSecond year 00888 00198 09150 44513 01506 00792 00859Third year 00690 00222 09334 85888 01820 01303 01367Fourth year 01817 00232 08339 118277 02102 01616 01734Fifth year 05431 0028 05810 143206 02778 01920 02085

105586Obs 105800 94000 105800Pseudo R2 00162 01364 01296Log likelihood 577459 369595 430869

Significant at the 5 percent confidence levela Maximum likelihood estimates Variation in exposure (policy duration) is accounted forb IRR Incidence rate ratio Each figure should be interpreted as the increasedecrease in claim probability as a result of an increase of one unit in the

right-hand-side variablec There are two separate probit regressions reported in this table Column 2 relies on the deductible choice model and the lognormality assumption As discussed

in the text by including an additional regressor ln([pi((xi)di) 1]d i) (with (xi) predicted from column 1 above) and normalizing its coefficient to 1 we obtaina structural interpretation of this regression with ln (ri) as the dependent variable Thus the reported coefficients are comparable to those estimated for the benchmarkmodel One should be cautious however in interpreting these coefficients Unlike the benchmark model this regression does not allow unobserved heterogeneity inrisk and suffers from some selection bias because observations with high predicted risk rate are omitted (which is why the number of observations is 94000 ratherthan the full sample of 105800) For comparison column 3 reports the marginal effects from a comparable probit regression that uses the full sample and does notcontrol for pricing and predicted risk through the additional structural regressor Column 3 does not have a structural interpretation and its (unreported) coefficientscannot be compared to those estimated from the benchmark model

760 THE AMERICAN ECONOMIC REVIEW JUNE 2007

and the risk rate (xi) as predicted from thePoisson regression of column 1 All three addi-tional regressors enter with the predicted signand with large and highly significant marginaleffects29

Finally column 2 of Table 3 presents animportant specification of the probit regres-sion in which ln([pi(( xi)di) 1]d i) isadded as a regressor and its coefficient isnormalized to 130 As already mentioned ifwe assume that there is no unobserved heter-ogeneity in risk then column 2 is analogousto the ln(ri) equation of the benchmark modelThis restriction of the model is rejected by thedata About 10 percent of the individuals arepredicted to have (xi) (pi di) implying achoice of low deductible for any level of riskaversion Many of these individuals howeverstill choose higher deductible levels Column2 reports the results for the remaining indi-viduals (94000 out of 105800) for which theregression can be run While the signs of theestimated coefficients are similar to those inthe benchmark model presented below therestricted version of the model suggests muchhigher levels and dispersion of risk aversionwell above any reasonable level31 The fullestimation of the benchmark model clearly re-jects this restriction on the model

B Estimation of the Benchmark Model

The Effect of Individual Characteristics onRisk AversionmdashTable 4 presents the estimationresults from the benchmark model The secondcolumn shows how the level of absolute riskaversion is related to individual characteristicsAs the dependent variable is in natural loga-

rithm coefficients on dummy variables can bedirectly interpreted as approximate percentagechanges The vast majority of these coefficientsare quite stable across a wide range of specifi-cations which are mentioned later

The results indicate that women are more riskaverse than men and have a coefficient of ab-solute risk aversion about 20 percent greaterthan that of men These results are consistentwith those of Donkers et al (2001) and Hartoget al (2002) The estimated effect of age sug-gests a nonmonotone pattern of risk preferencesover the life cycle The estimated coefficientsimply that initially (that is at age 18 the youngestindividual in the data) individuals become less riskaverse with age but around the age of 48 indi-viduals start becoming more risk averse32 Mar-ried individuals are estimated to be significantlymore risk averse compared to singles while di-vorced individuals are less (although the coeffi-cient is statistically insignificant)

The analysis suggests that variables that arelikely to be correlated with income or wealthsuch as post-high-school education and thevalue of the car have a positive coefficientindicating that wealthier people have higherlevels of absolute risk aversion Although we donot have data on individualsrsquo income or wealthwe use other proxies for income in additionalspecifications and obtain mixed results Whenwe include as a covariate the average householdincome among those who live in the same zipcode we obtain a significant and negative co-efficient of 0333 (0154) (standard deviationin parentheses) When we match zip code in-come on demographics of the individuals thecoefficient is effectively zero 0036 (0047)and when we use a socioeconomic index ofthe locality in which the individual lives thecoefficient is positive 0127 (0053)33 Thus

29 The estimated marginal effect (z-statistic in parenthe-ses) is 0352 (1376) 16 104 (1481) and 0154(255) for (pidi) d i and (xi) respectively

30 The level is a normalization The sign is estimatedHad the sign on this regressor been positive this would haveimplied a rejection of the model

31 For the implied median level of risk aversion the re-stricted model produces a similar estimate to the estimate wereport below for the benchmark model However since someindividuals who chose the low deductible are estimated to havevery low claim rates the restricted model is ldquoforcedrdquo to estimatevery high risk aversion for these individuals (in contrast thebenchmark model can explain such choices by positive risk re-siduals) resulting in very high dispersion and (due to the lognor-mality assumption) very high average risk aversion which isabout 1025 higher than the benchmark estimates we report below

32 This nonmonotone pattern may explain why age enterswith different signs in the estimation results of Donkers et al(2001) and Hartog et al (2002) A somewhat similar U-shapepattern with respect to age is also reported by Sumit Agarwalet al (2006) in the context of consumer credit markets

33 The full results from these specifications are providedin the online Appendix (available at httpwwwe-aerorgdatajune0720050644_apppdf ) The reason we do not usethese income variables in the benchmark specification istheir imperfect coverage which would require us to omitalmost 20 percent of the individuals Other results fromthese regressions are similar to those we report for thebenchmark specification

761VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

overall we interpret our findings as suggestiveof a nonnegative and perhaps positive associ-ation between incomewealth and absolute riskaversion

At first glance these results may appear tobe inconsistent with the widely held beliefthat absolute risk aversion declines withwealth One should distinguish however be-tween two questions (a) whether for a givenindividual the vNM utility function exhibitsdecreasing absolute risk aversion and (b)how risk preferences vary across individuals

Our results do not speak to the first questionand should not be thought of as a test of thedecreasing absolute risk aversion propertyTesting this property would require observingthe same individual making multiple choicesat different wealth levels Rather our resultsindicate that individuals with greater wealthhave utility functions that involve a greaterdegree of risk aversion It might be thatwealth is endogenous and that risk aversion(or unobserved individual characteristics thatare correlated with it) leads individuals to

TABLE 4mdashTHE BENCHMARK MODEL

Variable Ln() equation Ln(r) equation Additional quantities

Demographics Constant 15406 (00073) 118118 (01032) Var-covar matrix ()

Age 00001 (00026) 00623 (00213) 01498 (00097)Age2 624 106 (263 105) 644 104 (211 104) r 31515 (00773)Female 00006 (00086) 02049 (00643) 08391 (00265)Family Single Omitted Omitted

Married 00198 (00115) 01927 (00974) Unconditional statisticsa

Divorced 00396 (00155) 01754 (01495) Mean 02196 (00013)Widower 00135 (00281) 01320 (02288) Median 02174 (00017)Other (NA) 00557 (00968) 04599 (07397) Std Dev 00483 (00019)

Education Elementary 00194 (00333) 01283 (02156) Mean r 00019 (00002)High school Omitted Omitted Median r 727 106 (756 107)Technical 00017 (00189) 02306 (01341) Std Dev r 00197 (00015)College 00277 (00124) 02177 (00840) Corr(r ) 02067 (00085)Other (NA) 00029 (00107) 00128 (00819)

Emigrant 00030 (00090) 00001 (00651) Obs 105800

Car attributes Log(value) 00794 (00177) 07244 (01272)Car age 00053 (00023) 00411 (00176)Commercial car 00719 (00187) 00313 (01239)Log(engine size) 01299 (00235) 03195 (01847)

Driving License years 00015 (00017) 00157 (00137)License years2 183 105 (351 105) 148 104 (254 104)Good driver 00635 (00112) 00317 (00822)Any driver 00360 (00105) 03000 (00722)Secondary car 00415 (00141) 01209 (00875)Business use 00614 (00134) 03790 (01124)History length 00012 (00052) 03092 (00518)Claims history 01295 (00154) 00459 (01670)

Young driver Young driver 00525 (00253) 02499 (02290)Gender Male Omitted mdash

Female 00355 (00061) mdashAge 17ndash19 Omitted mdash

19ndash21 00387 (00121) mdash21ndash24 00445 (00124) mdash24 00114 (00119) mdash

Experience 1 Omitted mdash1ndash3 00059 (00104) mdash3 00762 (00121) mdash

Company year First year Omitted OmittedSecond year 00771 (00122) 14334 (00853)Third year 00857 (00137) 28459 (01191)Fourth year 01515 (00160) 38089 (01343)Fifth year 04062 (00249) 39525 (01368)

Note Standard deviations based on the draws from the posterior distribution in parentheses Significant at the 5 percent confidence levela Unconditional statistics represent implied quantities for the sample population as a whole ie integrating over the distribution of covariates in the sample (as

well as over the unobserved components)

762 THE AMERICAN ECONOMIC REVIEW JUNE 2007

save more to obtain more education or totake other actions that lead to greaterwealth34

Let us make several additional observationsFirst while owners of more expensive cars ap-pear to have both higher risk exposure andhigher levels of risk aversion owners of biggercars have higher risk exposure but lower levelsof risk aversion This should indicate that thestructure of the model does not constrain therelationship between the coefficients in the twoequations Rather it is the data that ldquospeak uprdquoSecond individuals who are classified by theinsurer as ldquogood driversrdquo indeed have lowerrisk but also appear to have lower risk aversionThis result is somewhat similar to the positivecorrelation between unobserved risk and unob-served risk aversion which we report belowThird policyholders who tend to use the car forbusiness are less risk averse This could bebecause uninsured costs of accidents occurringto such policyholders are tax deductible Fourthpolicyholders who reported three full years ofclaims history are more risk averse but are notdifferent in their risk exposure The attitude thatleads such policyholders to comply with therequest to (voluntarily) report three full years ofclaims history is apparently and not surpris-ingly correlated with higher levels of risk aver-sion In contrast while past claims indicate highrisk they have no significant relationship withrisk aversion Finally we find a strong trendtoward lower levels of risk aversion over timeThis is a replication of the probit results dis-cussed earlier

The Effect of Individual Characteristics onClaim RiskmdashThe first column of Table 4 de-scribes the relationship between observablesand risk exposure Accident risk is higher fordivorced individuals and lower for people witha college education Bigger older more expen-sive and noncommercial cars are all morelikely to be involved in an accident Drivingexperience reduces accident rates as do mea-sures of less intense use of the car while youngdrivers are more exposed to risk Claim propen-sity is highly correlated over time the voluntaryreport of past claims is a strong predictor offuture claims This risk equation produces re-sults that are similar to those of the simplerPoisson regression reported in Table 3 Al-though some of the coefficients lose signifi-cance the magnitude of most coefficients isquite similar The similarity between these twosets of results is to be expected as the riskregression is identified primarily from the dataon claims so incorporating the information ondeductible choice does not qualitatively changethe conceptual identification strategy (see Sec-tion IIC) If the results were not similar thiscould have indicated a misspecification of themodel The slight differences between the riskregressions in Table 3 and Table 4 are drivenmainly by the structural assumptions First thebenchmark model estimates a lognormal mix-ture of Poisson rates rather than a single Pois-son model By incorporating the fatter tails ofthe claim distribution it slightly changes theresults increases the standard errors and de-creases the average predicted claim rate Sec-ond the information on deductible choiceslightly helps us in obtaining more precise es-timates through the correlation structure be-tween the error terms in the two equations

The Implied Level of Risk AversionmdashOne ofthe main goals of the estimation is to obtainmeasures of the level of risk aversion in thepopulation we study Since we use Gibbs sam-pling and augment the latent coefficients ofabsolute risk aversion we can directly obtainthe posterior distribution of various moments ofthis distribution At each iteration of the Gibbssampler we compute the mean standard devi-ation (across individuals) and various percen-tiles of the simulated draws of i and ri and thecorrelation between them The far-right columnof Table 4 reports the averages and standard

34 One may be tempted to interpret the positive wealtheffects as an indirect indication of credit constraints wealth-ier individuals are less credit constrained and therefore canafford to purchase more insurance We do not share thisinterpretation for two reasons First the insurance companyobserves these proxies for wealth and conditions on themwhen setting prices Since the willingness to pay for insur-ance is likely to be correlated with the willingness to pay forthe additional insurance provided by the low deductibleoption premiums already reflect this variation We condi-tion on the level of the premium Second paying less exante implies paying more ex post so applying the creditconstraint argument only to the ex ante payment but not tothe probabilistic ex post deductible payments has no theo-retical foundation Essentially the setup of the model linksthe ex ante decisions to the ex post losses which are bothdriven by the curvature of the vNM utility function

763VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

deviations of these computed quantities over theiterations of the Gibbs sampler35 The impliedrisk aversion of the mean individual is 00019which is about seven times greater than theback-of-the-envelope calculation presented inthe previous section As we assume a lognormal

distribution and estimate a relatively high dis-persion coefficient r the estimates also implysignificantly lower values for the median levelof risk aversion As shown later the qualitativepattern of these results is quite robust acrossspecifications

Table 5 summarizes our findings regardingthe level and dispersion of risk aversion Itpresents two ways to interpret the estimates weobtained from the benchmark model as well ascomparisons to specifications with constant ab-solute risk aversion (CARA) utility and with

35 Note that these estimated quantities are unconditionalIn computing these quantities we integrate over the distri-bution of observable characteristics in the data so onecannot compute these estimates from the estimated param-eter directly

TABLE 5mdashRISK-AVERSION ESTIMATES

Specificationa Absolute risk aversionb Interpretationc Relative risk aversiond

Back-of-the-envelope 10 103 9070 1484Benchmark model

Mean individual 67 103 5605 972225th percentile 23 106 9998 003Median individual 26 105 9974 03775th percentile 29 104 9714 42790th percentile 27 103 7834 390295th percentile 99 103 4937 14327

CARA utilityMean individual 31 103 7651 4436Median individual 34 105 9966 050

Learning modelMean individual 42 103 6886 6140Median individual 56 106 9995 008

Comparable estimatesGertner (1993) 31 104 9699 479Metrick (1995) 66 105 9934 102Holt and Laury (2002)e 32 102 2096 86575Sydnor (2006) 20 103 8329 5395

a The table summarizes the results with respect to the level of risk aversion ldquoBack-of-the-enveloperdquo refers to thecalculation we report in the beginning of Section III ldquobenchmark modelrdquo refers to the results from the benchmark model(Table 4) ldquoCARA utilityrdquo refers to a specification of a CARA utility function and ldquolearning modelrdquo refers to a specificationin which individuals do not know their risk types perfectly (see Section IIID) The last four rows are the closest comparableresults available in the literature

b The second column presents the point estimates for the coefficient of absolute risk aversion converted to $US1 unitsFor the comparable estimates this is given by their estimate of a representative CARA utility maximizer For all otherspecifications this is given by computing the unconditional mean and median in the population using the figures we reportin the mean and median columns of risk aversion in Table 6 (multiplied by 352 the average exchange rate during the periodto convert to US dollars)

c To interpret the absolute risk aversion estimates (ARA) we translate them into x u(w) 1frasl2 u(w 100) 1frasl2 u(w x) That is we report x such that an individual with the estimated ARA is indifferent about participating in a 50ndash50lottery of gaining 100 US dollars and losing x US dollars Note that since our estimate is of absolute risk aversion the quantityx is independent of w To be consistent with the specification we use a quadratic utility function for the back-of-the-envelopebenchmark and learning models and use a CARA utility function for the others

d The last column attempts to translate the ARA estimates into relative risk aversion We follow the literature and do soby multiplying the ARA estimate by average annual income We use the average annual income (after tax) in Israel in 1995(51168 NIS from the Israeli Census) for all our specifications and we use average disposable income in the United Statesin 1987 ($15437) for Gertner (1993) and Metrick (1995) For Holt and Laury (2002) and Sydnor (2006) we use a similarfigure for 2002 ($26974)

e Holt and Laury (2002) do not report a comparable estimate The estimate we provide above is based on estimating aCARA utility model for the 18 subjects in their experiment who participated in the ldquo90rdquo treatment which involved stakescomparable to our setting For these subjects we assume a CARA utility function and a lognormal distribution of theircoefficient of absolute risk aversion The table reports the point estimate of the mean from this distribution

764 THE AMERICAN ECONOMIC REVIEW JUNE 2007

incomplete-information (discussed later) and toother comparable figures in the literature Ourbenchmark estimate suggests that an averagequadratic utility maximizer36 will be indifferentabout participating in a 50ndash50 lottery in whichhe gains $100 or loses $5605 A CARA spec-ification suggests a significantly lower value foraverage risk aversion making the mean individ-ual indifferent about a 50ndash50 lottery of gaining$100 or losing $7651 The results from theincomplete information model for the meanlevel of risk aversion are in between these esti-mates All the results suggest that although themean individual exhibits a significant level ofrisk aversion heterogeneity in risk preferencesis important and the median individual is al-most risk neutral with respect to lotteries of$100 magnitude Although the mean is alwaysgreater than the median under the lognormaldistribution the large difference we find is notimposed In principle we could have obtained ahigh level of risk aversion with less heteroge-neity thereby leading to a smaller differencebetween the mean and the median (the esti-mated distribution of risk types is an example)

Let us briefly discuss the relevance of thecomparison to Gertner (1993) and Metrick(1995) There are two ways one can reconcilethe differences between the estimates Firstboth these papers measure risk aversion fortelevision game show participants these arehighly selected groups in a rather ldquorisk-friendlyrdquo environment37 Second the magni-tudes of the stakes are higher The showparticipants bet on several thousand dollars andmore while our average individual risks muchlower stakes in the range of $100 Thus thedifference in the results may be due to the issuesraised in Rabin (2000) regarding the compara-bility of behavior across different contexts andbet sizes We discuss this further in Section IV

A different way to quantify our estimate is byreporting them in relative terms There is noconsensus in the literature as to the relevantwealth that is taken into account in such deci-

sions Therefore for comparability we followthe closest papers in this respect (eg Gertner1993) and use annual income as the relevantwealth We multiply the estimated coefficient ofabsolute risk aversion by the average annualincome in Israel during the observation periodUnder the (questionable) assumption that an-nual income is a good proxy for the relevantwealth at the time of decision making thisproduct would be a proxy for the coefficient ofrelative risk aversion As Table 5 indicates ourbenchmark specification results in an impliedcoefficient of relative risk aversion of about 97A CARA specification results in a lower coef-ficient of 44 On the other hand the medianestimate for relative risk aversion is well belowone Thus the widely used estimate of a lowsingle-digit coefficient of relative risk aversionfalls somewhere between the median and themean and between the median and the seventy-fifth percentile of the risk aversion distribution

The Relationship between Unobserved Riskand Unobserved Risk AversionmdashTable 4 al-lows us to make observations about the relation-ship between risk and risk aversion We firstdiscuss the relative importance of unobservedheterogeneity of both dimensions In the popu-lation we study unobserved heterogeneity inrisk aversion (r) is much greater than unob-served heterogeneity in risk () This is trueboth in absolute terms (315 compared to 015respectively) and after normalizing by the cor-responding mean level38 using the coefficientof variation as a measure of dispersion (027compared to 01 respectively) It is also true forthe overall unconditional dispersion This couldindicate that selection on risk aversion is moreimportant in our data than adverse selection39 Theright metric to use for such statements is not

36 For such an individual the second-order Taylor ex-pansion we use in Section IIA is exact

37 We suspect that individuals who participate in televi-sion game shows are more adventuresome than the generalpopulation Moreover knowing that the audience mightwish to see them keep betting is likely to further encourageparticipants to take risks

38 All covariates are measured in deviations from theirsample mean so the estimated constant in each equation isthe estimated mean of the left-hand-side variable

39 An additional observation is that given our estimatesobservables account for slightly less than 50 percent of thevariation in ln(ri) but for almost 65 percent of the variation inln(i ) This may seem surprising given the finding that disper-sion in risk aversion is more important and thus should be thefocus of insurance companies However this finding is con-sistent with the conventional wisdom that insurance companiesspend much effort and resources on collecting information thathelps in risk classification but only little effort on informationthat predicts willingness to pay

765VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

entirely clear however as one should projectthese estimated variances onto the same scaleof say willingness to pay or profits Profits forexample are affected directly by risk but not byrisk aversion so the comparison above could bemisleading Therefore we relegate the discus-sion of this issue to Section IIIF where weshow that even when we look at pricing andprofits heterogeneity in risk aversion is moreimportant

Table 4 also indicates a strong and signif-icant positive correlation of 084 between un-observed risk aversion and unobserved riskThis result may be surprising because it isnatural to think that risk aversion with respectto financial decisions is likely to be associatedwith a greater tendency to take precautionsand therefore with lower risk Indeed a recentpaper by Finkelstein and McGarry (2006)supports such intuition by documenting anegative correlation between risk aversionand risk in the market for long-term careinsurance (see also Israel 2005 for the autoinsurance market in Illinois) Our marketmight however be special in ways that couldproduce a positive correlation First in con-trast to most insurance markets where apolicyholderrsquos risk depends on the policy-holderrsquos precautions but not on the precau-tions of others accident risk in the autoinsurance market is a result of an interactionbetween onersquos driving habits and those ofother drivers Second the correlation coeffi-cient may be highly sensitive to the particularway we measure risk and risk aversion Thereare many unobserved omitted factors that arelikely to be related to both dimensions Theintensity of vehicle use for example mightbe an important determinant of risk If indi-viduals who are more risk averse also drivemore miles per year a positive correlationbetween risk and risk aversion could emergeThus our results caution against assumingthat risk and risk aversion are always nega-tively correlated Whether this is the case maydepend on the characteristics of the particularmarket one studies and on the particular mea-sure for risk Indeed one can use estimatedannual mileage to control for one omittedvariable that may potentially work to producea positive correlation between risk aversionand risk Despite its partial coverage in thedata and being considered (by the company)

as unreliable40 controlling for annual mileagereported by policyholders reduces the esti-mated correlation coefficient to 06841 Weview this result as consistent with the possi-bility that underlying unobserved factors thataffect risk play an important role in generat-ing the estimated positive correlation betweenrisk and risk aversion A third explanation forthe positive estimated correlation is the dis-tributional assumption As discussed in Sec-tion IIC the correlation coefficient isprobably the parameter that is most sensitiveto these assumptions Indeed as discussedlater when we change our assumptions aboutthe Poisson process and use an alternativedistribution with extremely thin tails the es-timated correlation coefficient reverses signsWe of course view the assumptions of thebenchmark model as more appropriate andtherefore maintain the view that the data sug-gest a positive correlation Finally note thatwhile the correlation parameter we estimate ishigh the implied unconditional correlationbetween risk and risk aversion is less than025 across all reported specifications This isbecause the coefficients on the same covariate(for example the size of the car or whetherthe car is used for business) often affect riskand risk aversion in opposite directions andbecause of the logarithmic transformation

C Stability of the Risk Aversion Coefficients

Estimating risk preferences is motivated bythe idea that the same (or similar) risk aversionparameter may explain risky decisions acrossmultiple contexts This idea is at the heart of thecurrent debate we mention in the introductionregarding the empirical relevance of expectedutility and whether the standard construct of a

40 Insurance companies typically do not use these self-reported mileage estimates as they are considered unreli-able While companies could verify these estimates at thetime of a claim such reports are hard to enforce Anindividual can always claim that her ex ante estimate waslower than it turned out to be Indeed the estimated elas-ticity of risk with respect to estimated mileage is 006(0012) which seems low suggesting a bias downwardpotentially due to ldquoerrors in variablesrdquo bias

41 The full results from this specification are providedin the online Appendix Other results from this specifi-cation are similar to those we report for the benchmarkspecification

766 THE AMERICAN ECONOMIC REVIEW JUNE 2007

single risk aversion parameter that appliesacross many types of decisions is the right wayto think about consumer behavior To the bestof our knowledge there is no direct empiricalevidence on this issue While we defer much ofthis discussion to the concluding section weverify below that at least over a limited set ofchoices in which we observe the same individ-uals the estimated risk preferences help to ex-plain choices over time and across differentcontexts

Stability across ContextsmdashIdeally onewould like to show that the estimated risk pref-erences are stable across multiple lines of insur-ance and perhaps even across other riskydecisions individuals face Absent data on suchchoices we provide evidence that the estimatedrisk aversion coefficients help in predictingother related risky choices Individuals in ourdata had to make three additional coveragechoices in addition to the main policy choicethat we have analyzed so far They had tochoose whether to purchase coverage for caraudio equipment (bought by 498 percent of thesample of new policyholders) for towing ser-vices (bought by 929 percent) and for wind-shield damage (bought by 954 percent) Thesetypes of coverage are sold by the same companybut are administered by a third party so we donot have information about their realizationTherefore we cannot perform a similar exerciseto the one we perform for the deductible choiceOf course one should not expect these addi-tional coverage choices to be perfectly corre-lated with the deductible choice Even if thesame risk preferences are an important factorfor all of these decisions variation in risk acrosscoverage is also important For example own-ership of an expensive car audio system is likelyto affect the purchase of audio coverage inde-pendent of risk preferences Similarly ownersof old cars may value more coverage for towingservices Risk preferences however should en-ter into all of these coverage choices and there-fore we expect these additional types ofcoverages to be positively correlated with thechoice of a low deductible

We coded these three additional coveragechoices as three dummy variables and use themas additional covariates to verify that they canhelp explain unobserved risk aversion Firstwe add these variables to the probit regression

reported in column 3 of Table 3 They all havea positive statistically and economicallysignificant power in predicting the deductiblechoice42 To verify that these reported correla-tions are not driven by correlation in risks wealso estimate the benchmark model with theseadditional variables as covariates The esti-mated coefficients on the three additional cov-erage choices are positive and two of the threeare significant in both the risk and risk aversionequations they are 0053 (0019) 00002 (0021)and 0057 (0023) in the risk equation and0354 (0064) 0123 (0122) and 0813 (0174)in the risk aversion equation for audio towingand windshield coverage respectively (standarddeviations in parentheses)43 This suggests thatthe estimated risk preferences help in explainingmultiple coverage choices across these relatedcontexts

Stability over TimemdashWe now provide evi-dence that the estimated risk preferences arealso stable over time All the individuals whodecide to renew their policy with the companyafter it expires are at least in principle free tochange their deductible choice However anoverwhelming majority of individuals (morethan 97 percent) do not change their deductiblechoices when they renew Even individuals whoinitially chose the rarely chosen ldquohighrdquo andldquovery highrdquo deductible levels typically do notchange their choices Therefore it is also notsurprising that estimating the model on the firstdeductible choice of individuals and the seconddeductible choice of the same individuals yieldssimilar results as shown at the end of the nextsection Of course changes in deductiblechoices do not necessarily imply changes inrisk preferences Risk may also change overtime and may drive such changes At thesame time while persistence in deductiblechoices over time is consistent with stablerisk preferences it may also be driven bymany other factors For example it is reason-able to think that individuals do not devote

42 They are estimated to increase on average the prob-ability of choosing a low deductible by 0019 (819) 0023(527) and 0041 (758) for audio towing and windshieldcoverage respectively (z stat in parentheses)

43 The full results from this specification are provided in theonline Appendix Other results from this specification aresimilar to those we report for the benchmark specification

767VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

the same amount of thought to their renewaldecision as to their initial choice leading tothe observed persistence This caveat is themain reason why we focus most of the analysisonly on new policies

D Robustness

This section discusses the robustness of themain results Results from all specifications wemention in this section are summarized in Table6 The full results from these specifications arerelegated to the online Appendix

The von NeumannndashMorgenstern Utility Func-tionmdashTo derive the benchmark model we as-sume a negligible third derivative of the vNMutility function This is attractive as it allowsus to summarize risk preferences by a one-dimensional parameter r [u13(w)u(w)]There is however a large literature (eg MilesS Kimball and N Gregory Mankiw 1989) em-phasizing the importance of a (positive) third

derivative of the utility function which leads toprecautionary saving It is therefore importantto address the sensitivity of the results to thisassumption

By allowing a third-order Taylor approxima-tion of equation (4) we obtain

(15) r u13w

uw

p

d 1

d

uw

uw

dh2 dh dl dl

2

6d

which reduces to equation (7) when u(w) 0This illustrates that a positive third derivativeprovides an additional (precautionary) incentiveto insure It also shows that in order to fullydescribe preferences in the presence of a thirdderivative a one-dimensional parameter doesnot suffice one needs to know both r [u13(w)u(w)] and [u(w)u(w)] This makesit less attractive for estimation

TABLE 6mdashROBUSTNESS

Specificationa Sample Obs

Claim risk () Absolute risk aversion (r)

Corr(r ) Mean Median Std dev Mean Median Std dev

Baseline estimatesBenchmark model All new customers 105800 0220 0217 0048 19 103 73 106 0020 0207 0839

The vNM utility functionCARA utility All new customers 105800 0219 0217 0048 87 104 b 98 106 b 0009b 0201 0826

The claim generating processBenchmark model No multiple claims 103260 0182 0180 0040 20 103 28 105 0018 0135 0547Thinner-tail risk distribution All new customers 105800 0205c 0171c 0155c 17 103 19 106 0020 0076 0916

The distribution of risk aversionLower-bound procedure All new customers 105800 mdash mdash mdash 37 104 0 0002 mdash mdash

Incomplete information about riskBenchmark model Experienced drivers 82966 0214 0211 0051 21 103 83 106 0021 0200 0761Benchmark model Inexperienced drivers 22834 0230 0220 0073 30 103 12 107 0032 0186 0572Learning model All new customers 105800 0204 0191 0084 12 103 16 106 0016 0200 0772

Sample SelectionBenchmark model First two years 45739 0244 0235 0066 31 103 26 105 0026 0225 0699Benchmark model Last three years 60061 0203 0201 0043 16 103 34 107 0021 0113 0611Benchmark model Referred by a friend 26434 0213 0205 0065 30 103 84 107 0031 0155 0480Benchmark model Referred by advertising 79366 0219 0216 0051 21 103 76 106 0022 0212 0806Benchmark model Non-stayers 48387 0226 0240 0057 23 103 77 107 0026 0149 0848Benchmark model Stayers 1st choice 57413 0190 0182 0057 29 103 29 105 0024 0152 0463Benchmark model Stayers 2nd choice 57413 0208 0200 0065 30 103 16 105 0026 0211 0637

Note The table presents the key figures from various specifications and subsamples tracing the order they are presented in Section IIID Full results (in the format

of Table 4) from all these specifications are available in the online Appendixa ldquoBenchmark modelrdquo refers to the benchmark specification estimated on various subsamples (the first row replicates the estimates from Table 4) The other

specifications are slightly different and are all described in more detail in the corresponding parts of Section IIIDb The interpretation of r in the CARA model takes a slightly different quantitative meaning when applied to noninfinitesimal lotteries (such as the approximately

$100 stakes we analyze) This is due to the positive third derivative of the CARA utility function compared to the benchmark model in which we assume a small

third derivative Thus these numbers are not fully comparable to the corresponding figures in the other specificationsc The interpretation of in the thinner-tail distribution we estimate is slightly different from the standard Poisson rate which is assumed in the other specifications

Thus these numbers are not fully comparable to the corresponding figures in the other specifications

768 THE AMERICAN ECONOMIC REVIEW JUNE 2007

We can allow for a positive third derivativewithout expanding the dimensionality of pref-erences by imposing a parametric functionalform on the utility function Essentially suchparametric form imposes a relationship between[u13(w)u(w)] and [u(w)u(w)] Two standardforms are those that exhibit CARA and thosethat exhibit constant relative risk aversion(CRRA) CRRA requires us to make additionalassumptions about the relevant (and unob-served) wealth level of each individual makingit less attractive The CARA case is imple-mented below

With CARA utility we substitute u(w) exp(rw) in equation (4) and rearrange toobtain

(16) rp

exprdh exprdl

This equation defines the indifference set Unlikethe benchmark model there is no closed-formrepresentation for ri() a property that makesestimation significantly slower Due to the precau-tionary saving incentive arising from the thirdderivative we can use a lower level of absoluterisk aversion to rationalize the low deductiblechoice given In other words the CARA indif-ference set is flatter in comparison to the bench-mark case depicted in Figure 2 Thus the CARAspecification will generally lead to lower estimatesof the coefficient of absolute risk aversion asshown in Table 6 The mean and dispersion ofestimated absolute risk aversion (but not the me-dian) are smaller by about a factor of two Thegeneral qualitative results however remain thesame As shown in Table 5 the mean individual isquite risk averse with respect to lotteries of $100magnitude the median is close to risk neutral withrespect to such lotteries and heterogeneity in riskaversion is important

The Claim-Generating ProcessmdashIn thebenchmark model we assume that claims aregenerated by a Poisson model This assumptionis both analytically attractive (it is important inderiving equation (4)) and in our view a rea-sonable approximation of the data-generatingprocess for claims The Poisson distribution hasbeen widely used in economics to model acci-dent counts (eg Nancy L Rose 1990 Dionneand Vanasse 1992 Ronald Michener and CarlaTighe 1992) In using this distribution re-

searchers have followed the long history of theuse of Poisson by actuaries and insurance spe-cialists going back to Ladislaus Bortkiewicz(1898)

An important restriction of the Poisson dis-tribution is that its mean and variance are thesame Although some economic studies con-firmed this assumption for particular accidentdata (eg Christopher M Auld et al 2001) it isoften the case that this restriction is falsifiedThe most common deviation from the Poissonrestriction is that of fat tails ie variance that ishigher than the mean This led researchers touse a negative binomial distribution to accom-modate this regularity by introducing a secondparameter which delinks the relationship be-tween the mean and the variance One naturalinterpretation of the fat tails is that of unob-served heterogeneity and the negative binomialdistribution can be viewed as a Gamma mixtureof Poisson processes Consistent with this viewwe assume that the claim-generating processfollows a Poisson process at the individuallevel but allows unobserved heterogeneity inrisk and estimates a lognormal mixture of Pois-son processes which is similar to a negativebinomial The dispersion parameter we esti-mate is a free parameter which is identi-fied by the fatness of the tails of the claimdistribution

While one would like to allow fat tails of theaggregate claim distribution one may criticizethe assumption of a Poisson distribution at theindividual level for having tails that may be ldquotoofatrdquo Recently Jaap H Abbring et al (2003) andMark Israel (2004) provided evidence for neg-ative state dependence in data from auto insur-ance claims similar to the ones we useControlling for heterogeneity across individu-als these papers show that a second claim isless likely than a first This may happen due toexperience rating or perhaps to more carefuldriving and less intense use of a car after anaccident The Poisson distribution assumes thatthe second accident is just as likely so negativestate dependence may suggest thinner tails atthe individual level

To verify that the main results are not sensi-tive to this restriction of the Poisson model weperform two tests First we estimate the bench-mark model on a sample that includes onlyindividuals with one or no claims The model isstill identified using variation in policy durations

769VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

The estimates of the level of risk aversion andits dispersion across individuals remain similarto those of the benchmark model (Table 6) Theestimates of risk and the correlation with riskaversion are significantly lower But this isexpected we selected out of the sample thehigh-risk individuals

As a further test for the sensitivity of theresults to possible negative state dependencewe estimated the model with a different distri-bution at the individual level which gives riseto much thinner tails We could not find anyknown distribution of count variables that givesrise to tails thinner than Poissonrsquos so we madeone up In particular for a given Poisson ratelet pn be the probability of observing n occur-rences The distribution we take to the data issuch that

(17) Prn pn2

yenm0 pm2

Thus this distribution makes the probability ofmultiple claims much lower probably muchmore so than any realistic negative state depen-dence in the data Essentially such a distribu-tion makes the econometrician view individualswith multiple claims as high-risk individualswith much more certainty as it requires muchmore ldquobad luckrdquo for low-risk individuals tohave multiple claims The estimates for thelevel of risk aversion and its dispersion arealmost the same as in the benchmark model(Table 6) The rest of the summary figures aredifferent The difference in the level of risk isnot informative the interpretation of isslightly different given the change in the un-derlying distribution The dispersion in ismuch higher This is a direct result of the thin-tail assumption The dispersion of is identifiedby the tails of the claim distribution at theaggregate level With thinner tails imposed onthe individual distribution more heterogeneityis needed to match the observed tails at theaggregate Finally the correlation coefficient inthis specification changes signs and is now neg-ative and close to 1 This is consistent withour discussion in Section IIC which emphasizesthat the identification of the correlation coeffi-cient is closely tied to the structural assump-tions Once these are changed the estimatedcorrelation coefficient would change too Most

importantly however the mean and dispersionof risk aversion are stable

The Distribution of Absolute Risk Aver-sionmdashIn the benchmark model we assume thatthe coefficient of absolute risk aversion is lognor-mally distributed across individuals Since onlyfew previous studies focused on heterogeneity inrisk preferences there is not much existing evi-dence regarding the distribution of risk prefer-ences The only evidence we are aware of is theexperimental results presented by Steffen Andersenet al (2005) which show a skewed distributionwith a fat right tail which is qualitatively consis-tent with the lognormal distribution we assume

An additional advantage of the normality as-sumption is computational It provides a closed-form conditional distribution allowing us to useonly univariate (rather than bivariate) draws inthe estimation procedure significantly reducingthe computational burden One may be con-cerned about the sensitivity of the results to thisdistributional assumption For example it maydrive the result that the median level of riskaversion is much lower than the mean

Incorporating alternative distributional as-sumptions significantly complicates and slowsthe estimation procedure44 As an alternativewe develop a procedure that we believe pro-vides some guidance as to the sensitivity of theresults to this distributional assumption Thedisadvantage of the procedure is that it cannotaccount for adverse selection Since we foundthat adverse selection is not that important thisexercise is informative The exercise conveysthat the main qualitative results are not drivenby the (fat) tails of the lognormal distributionwe impose Rather they are driven by the highfraction of individuals who chose a low deduct-ible despite being of low risk The model im-plies that such individuals must have a fairlyhigh level of risk aversion

The exercise uses a Gibbs sampler to estimatethe lognormal distribution of i given the co-variates and the observed number of claims Ineach iteration of this Gibbs sampler conditional

44 A simple discrete-type distribution of risk aversion isnot well identified with the given data This is due to thenature of the exercise The distribution is identified bythe binary deductible choice and by the ldquolocalrdquo variation inthe pricing menu This variation is not enough to pin downthe discrete types accurately

770 THE AMERICAN ECONOMIC REVIEW JUNE 2007

on the data and the most recent draws for theindividual irsquos we compute lower bounds forthe level and dispersion of risk aversion (inaddition to generating the draws for ri from theconditional lognormal distribution) To deter-mine the lower bound for the level of risk aver-sion we compute

(18) r i i 0 if choicei 0

max0 pi

idi 113d i

if choicei 1

Namely we make ri as low as possible giventhe assumptions of the model We assume thatindividuals who chose a regular deductible arerisk neutral while individuals who chose a lowdeductible are just indifferent between the twodeductible levels (unless i is high enough inwhich case they are also assumed to be riskneutral) We then compute the average over therirsquos To compute the lower bound for the dis-persion we search for values of rirsquos that areconsistent with the assumptions of the modeland the observed data and that minimize thevariance of ri This turns out to be a simplesearch procedure that is linear in the number ofindividuals

The lower bound of the level of risk aversionthat we obtain from this exercise is 368 104

NIS1 (with standard deviation of 638 106)This level is five times lower than the analogouspoint estimate reported in Table 4 This trans-lates into indifference about participating in a50ndash50 lottery of gaining $100 and losing $885Thus it still suggests a significant level of riskaversion for lotteries of $100 magnitude Simi-larly the conversion to relative risk aversion asin Table 5 implies a relative risk aversion co-efficient of 1883 The result for the lowerbound of the dispersion is 189 103 (withstandard deviation of 816 105) which is tentimes lower than the benchmark estimate Thusthe coefficient of variation for absolute riskaversion declines only by two suggesting ahigh degree of heterogeneity Finally oneshould note that these bounds involve extremeassumptions and are computed separately andtherefore cannot be binding at the same timeThus the ldquocorrectrdquo estimates for both quantities

are likely to be higher closer to the results wereport for the benchmark model

Incomplete Information about RiskmdashThroughout we assume that individuals haveperfect information about their individual-specific objective risk type i This is a strongerassumption than we need Because expectedutility is linear in probabilities it suffices thatindividualsrsquo expected risk rate is equal to theirobjective risk rate ie i E(iIi) i wherei is a random variable representing individualirsquos perceived risk rate and Ii is individual irsquosinformation at the time of the coverage choiceNamely individuals could be uncertain abouttheir risk type but their point estimate has to becorrect

There are several channels through whichincomplete information may operate Let usconsider two such cases First suppose thatindividuals are correct but only on average iethat i i i where E(i) 0 The intuitionfor this case is similar to an ldquoerrors in variablesrdquomodel and in principle will result in an evenless important role for adverse selection Giventhat we find a relatively small effect of adverseselection this bias will not change this conclu-sion This may be even more pronounced ifcorr(i i) 0 which reflects a reasonableassumption of ldquoreversion to the meanrdquo ie thatan individualrsquos estimate of his risk type is someweighted average between his true risk type andthe average risk type of individuals who aresimilar (on observables) to him The conclusionmay go in the other way only if the mistakes goin the other direction according to which indi-viduals who are riskier than average believe thatthey are even more risky than they truly areThis we believe is less plausible

As a test for the sensitivity of the results tothe complete information assumption we per-form two exercises The first exercise estimatesthe benchmark model separately for experi-enced drivers and inexperienced drivers wherewe define an individual to be an experienceddriver if he has ten years or more driving expe-rience This is a conservative definition com-pared to those used in the literature (Chiapporiand Salanie 2000 Cohen 2005) The underlyingassumption consistent with evidence providedin Cohen (2005) is that drivers learn about theirown risk types as they gain more driving expe-rience but that most of this learning occurs

771VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

within the first ten years of driving Thus ourcomplete information model is a more appeal-ing description of experienced drivers Theresults for experienced drivers are almost iden-tical to the results from the benchmark specifi-cation (Table 6) suggesting that the mainresults are not driven by the fraction of individ-uals who are less likely to know their risk typesThe results for inexperienced drivers show asimilar pattern and magnitude Consistent withthe learning hypothesis however they show alarger dispersion in risk preferences which maybe due to incomplete information and there-fore to more random choices of deductiblelevels

The second exercise we perform is a morestructural version of the learning story whichallows even experienced drivers to have incom-plete information regarding their risk types Wedo so by estimating a different specification ofthe model that assumes that individuals areBayesian and update their information abouttheir own (stable) risk types over time usingonly information about the number of claimsthey make each year While we do not observecomplete claims histories of individuals wecan simulate such histories and integrate overthese simulations Thus individualsrsquo informa-tion would be related to their true type andwould be more precise with longer driving his-tories The full modeling and estimation detailsof this specification are provided in the Appen-dix We view this model as an extreme versionof incomplete information as there are manyother sources through which individuals maylearn about their own types and thereby havebetter information about their types than whatwe estimate them to have While the resultsimply that the levels of risk aversion and heter-ogeneity are lower than the benchmark esti-mates (Table 6) the order of magnitude andqualitative pattern are quite similar suggestingthat the main qualitative findings are robust tothe information structure

Sample SelectionmdashThere are two differentways to think about the contract selection pro-cess One possibility is that individuals firstselect an insurer based on advertisement wordof mouth or access to an agent Then individ-uals select the insurance contract from amongseveral contracts the selected insurer offersThis selection process is consistent with the way

the company executives view their business45

Another possibility is that individuals first col-lect information about all available insurancecontracts and then choose their most preferredone According to the industryrsquos conventionalwisdom individuals do not shop much acrossinsurers and therefore this selection processseems less important

The unique characteristics of direct insurersmay attract individuals who are more likely toexperiment with new ways of doing businessand may therefore be less risk averse than thegeneral population In Table 7 we compare thedemographics of our sample of policyholderswith those of the general Israeli populationThis comparison reflects a similar intuitioncompared with the general population our av-erage policyholder is slightly younger moreeducated more likely to be male and less likelyto be married or an immigrant This direction ofselection may also apply to unobserved riskpreferences thereby making our policyholderson average less risk averse than a representa-tive individual This suggests that the level ofrisk aversion that we find may be viewed as alower bound on the level of risk aversion in thegeneral Israeli population

One potential way to model sample selectionis to allow for an additional outside option to beselected For the vast majority of individuals weobserve the outside option is to purchase sim-ilar insurance from competing insurance agen-cies Unfortunately data on the structure ofcompeting contracts their prices and the waythey vary with individual characteristics are un-available This makes us uncomfortable to try tomodel sample selection as results from anysuch model will be driven by our assumptionsrather than by meaningful variation in the dataThe results are still meaningful for two reasonsFirst this is a large population accounting forabout 7 percent of all drivers in Israel Secondto the extent that our estimates suggest higherlevels of risk aversion than previously esti-mated and that the sample selection is likely to

45 As discussed in Section I the company was the firstdirect insurance provider and it offered significantly lowerpremiums than those offered by competitors due to signif-icant cost advantage In Section I we also discuss theliterature which emphasizes that choice of direct insurers isdriven primarily by nonmonetary ldquoamenitiesrdquo

772 THE AMERICAN ECONOMIC REVIEW JUNE 2007

bias these estimates downward the results arestill informative

To assess the magnitude of sample selectionin driving the main results we perform severaltests First we estimate the benchmark modelseparately for the first two years of the compa-nyrsquos operation during which the company wasthe only company to sell insurance directly andfor the remaining three years after additionaldirect insurers entered the market Second weestimate the benchmark model separately forindividuals who were referred to the companyby word of mouth and for those who heardabout the company through advertisement (pri-marily on television) The latter may havesearched more and might be more price sensi-tive Third we estimate the benchmark modelseparately for individuals who renewed theirpolicy with the company and for those who didnot renew It seems likely that individuals whodid not renew are less selected as most of themswitch back to regular insurers who insure themajority of the population Fourth we estimatethe model for the second deductible choicemade by those individuals who renew It couldbe argued that switching to other companies is

costly so outside options are not as important indriving deductible choices for those who renewAll these results are summarized in Table 6 andshow some important differences between thetwo groups within each pair of subsamples prob-ably reflecting selection For all groups howeverthe qualitative pattern of the results and the orderof magnitude and dispersion of risk aversion aresimilar to those of the full sample This is sugges-tive that correcting for sample selection is unlikelyto change the qualitative results

E Caveats

Moral HazardmdashThroughout our analysiswe abstract from moral hazard ie we assumethat i can vary across individuals but is invari-ant to the coverage choice There are two typesof moral hazard that may play a role in thiscontext First individuals with less coveragemay take greater precaution and drive morecarefully thereby reducing their claim risk rateSecond conditional on a claim event peoplewith higher deductibles are less likely to file aclaim there exists a range of claims for which

TABLE 7mdashREPRESENTATIVENESS

Variable Sampleb Populationc Car ownersd

Agea 4114 (1237) 4255 (1801) 4511 (1413)Female 0316 0518 0367Family Single 0143 0233 0067

Married 0780 0651 0838Divorced 0057 0043 0043Widower 0020 0074 0052

Education Elementary 0029 0329 0266High school 0433 0384 0334Technical 0100 0131 0165College 0438 0155 0234

Emigrant 0335 0445 0447

Obs 105800 723615 255435

a For the age variable the only continuous variable in the table we provide both the mean and the standard deviation (inparentheses)

b The figures are derived from Table 1 The family and education variables are renormalized so they add up to one weignore those individuals for whom we do not have family status or education level This is particularly relevant for theeducation variable which is not available for about half of the sample it seems likely that unreported education levels arenot random but associated with lower levels of education This may help in explaining at least some of the gap in educationlevels across the columns

c This column is based on a random sample of the Israeli population as of 1995 We use only adult population ieindividuals who are 18 years of age or older

d This column is based on a subsample of the population sample The data provide information about car ownership onlyat the household level not at the individual level Thus we define an individual as a car owner if (a) the household owns atleast one car and the individual is the head of the household or (b) the household owns at least two cars and the individualis the spouse of the head of the household

773VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

filing is profitable only under a low deductible(the literature often refers to this second effectas ldquoex post moral hazardrdquo)

To the extent that moral hazard exists abstract-ing from it will likely bias our estimates of riskaversion downward To see this note that adjust-ing behavior will help individuals to self-insureagainst uninsured costs Similarly ex post moralhazard would reduce the value of a low deduct-ible as in the event of a claim the gain from alower deductible would sometimes be less thanthe difference between the two deductible levelsBoth these effects will make a low deductible lessattractive requiring individuals to be even morerisk averse than we estimate in order to purchasemore coverage Below we discuss why in ourview abstracting from moral hazard is a reason-able approximation in this setting

It seems reasonable to conjecture that ceterisparibus insured individuals will drive less care-fully than uninsured ones It may also seemreasonable that the existence of a deductiblemay make individuals more careful about smalldamages to their car However when all choicesinclude a deductible and deductibles are similarin their magnitude it seems less likely thatdriving or care behavior will be affected46 Fi-nally to separately identify moral hazard onewould need another dimension of the data overwhich risk types remain fixed but coveragechoices vary exogenously (Israel 2004 Jaap HAbbring et al forthcoming)

We rely on the data to justify why we abstractfrom the second potential effect that of ex postmoral hazard Figure 3 presents data on theclaim amounts and shows that about 99 percentof the claims filed by policyholders with low-deductible policies were for amounts greaterthan the higher deductible level If the distribu-tion of amounts of potential claims does notvary with the deductible choice and if individ-uals file a claim for any loss that exceeds their

deductible level this suggests that 99 percent ofthe claims would have been filed under eitherdeductible choice making the assumption toabstract from moral hazard not very restrictive

Individuals however may choose not to filea claim even when the claim amount exceedsthe deductible level This may happen due toexperience rating which increases future insur-ance premiums These dynamic effects do notdepend on the deductible level at the time of theclaim so they simply enter in an additive wayUsing our data on individuals who renew theirpolicies with the company we can assess howbig the dynamic effects are These data showthat the price effect of a claim lasts for threeyears and is highest when an individual files hissecond claim within a year In such a case hewould face about a 20 percent increase in hisinsurance premium in the subsequent year 10percent in the year after and 5 percent in thethird year after the claim The regular premiumis about twice the regular deductible amount soan upper bound for the dynamic costs is 70percent of the regular deductible In most casesthe actual dynamic costs are much lower thanthis upper bound (the dynamic costs of say thefirst claim within a year are close to zero) Inaddition an individual can opt out of the con-tract and switch to a different insurance pro-vider This is likely to reduce his dynamic costsbecause in Israel unlike in the United Statesand many other countries there is no publicrecord of past claims Therefore insurance pro-viders can take full advantage of past recordsonly for their past customers For this reasonnew customers will of course face higher pre-miums than existing ones but the premium in-crease would not be as high as it would havebeen with the old insurance provider This isdue to the presence of ldquoinnocentrdquo new custom-ers who are pooled together with the switchers(Cohen 2003)47 Using 70 percent as a conser-vative upper bound Figure 3 shows that about93 percent of those claims filed by individualswith a low deductible would have also been46 This assumption is consistent with Cohen and Einav

(2003) who find no evidence for behavioral response tochanges in seat belt laws The following anecdotal obser-vation also supports this In an informal survey we con-ducted among our colleagues all of them were aware of adeductible in their auto insurance policy but fewer than 20percent knew its level This does not imply that 80 percentof our colleagues did not pay attention to their deductiblechoice at the time the choice was made It does implyhowever that their driving or care behavior cannot dependon the deductible level

47 New customers may voluntarily report their claimshistory to their new insurance provider Voluntary disclo-sure of past claims is as may be expected not truthful Ourdata suggest an unconditional claim rate of 02453 in oursample population Our data on claims history as voluntar-ily disclosed by the same individuals suggest a claim rate of00604 which is four times lower

774 THE AMERICAN ECONOMIC REVIEW JUNE 2007

filed with a regular deductible While this is notnegligible it applies for only a small fraction ofindividuals For the vast majority of them the99 percent discussed above is a more appropri-ate benchmark Therefore ex post moral hazardis unlikely to play a major role in this settingand one can abstract from the loss distributionand focus on claim rates as we do in thispaper48

Additional Cost of an AccidentmdashOur modelassumes that in the event of an accident theonly incurred costs are those associated with thedeductible payment In practice however other

transaction costs may be incurred such as thetime spent for appraisal of the damage the costsassociated with renting a replacement car for theduration of a repair etc Such costs could bereadily incorporated into the model To illus-trate we assume that these costs are known inadvance and are given by a constant c (whichcould in principle vary with each individual)Since c will not vary with the chosen level ofdeductible it will not affect the value d andwill enter the empirical model only through itseffect on d In particular equation (7) willchange to

(19) r

p

d 1

d c

and everything else will remain the same

48 We would not be as comfortable with this statementfor the choice of high and very high deductibles which areat much higher levels This is one additional reason to focusonly on the choice between low and regular deductibles

FIGURE 3 CLAIM DISTRIBUTIONS

Notes This figure plots kernel densities of the claim amounts estimated separately depending on the deductible choice Forease of comparison we normalize the claim amounts by the level of the regular deductible (ie the normalization is invariantto the deductible choice) and truncate the distribution at ten (the truncated part which includes a fat tail outside of the figureaccounts for about 25 percent of the distribution and is roughly similar for both deductible choices) The thick line presentsthe distribution of the claim amounts for individuals who chose a low deductible while the thin line does the same for thosewho chose a regular deductible Clearly both distributions are truncated from below at the deductible level The figure showsthat the distributions are fairly similar Assuming that the claim amount distribution is the same the area below the thickerline between 06 and 1 is the fraction of claims that would fall between the two deductible levels and therefore (absentdynamic incentives) would be filed only if a low deductible were chosen This area (between the two dotted vertical lines)amounts to 13 percent implying that the potential bias arising from restricting attention to claim rate (and abstracting fromthe claim distribution) is quite limited As we discuss in the text dynamic incentives due to experience rating may increasethe costs of filing a claim shifting the region in which the deductible choice matters to the right an upper bound to these costsis about 70 percent of the regular deductible covering an area (between the two dashed vertical lines) that integrates to morethan 7 percent Note however that these dynamic incentives are a very conservative upper bound they apply to less than 15percent of the individuals and do not account for the exit option which significantly reduces these dynamic costs

775VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

This implies that in principle such costs willhave no effect on the results of the counterfac-tual exercise we present later The costs willhowever affect the interpretation of the esti-mates of risk aversion In particular instead ofthe distribution of r we will now be estimatingthe distribution of r(d c)d so the reportedestimates of the coefficient of absolute riskaversion will be biased upward The magnitudeof the bias depends on the size of this transac-tion cost c compared to the average deductibled If c is relatively small the bias is negligibleIf however c is as big as the (average) deduct-ible level all our reported estimates of the levelof risk aversion should be divided by two (butthe coefficients on observables which are semi-elasticities will not change) The intuitionwould be similar but more involved if c variesacross individuals but not proportionally to d

Data about such transaction costs are ofcourse not available The following back-of-the-envelope exercise may provide some guidanceas to the magnitude of such costs We collecteddata from the Israeli police about the annual num-bers of accidents49 accidents with fatalities andaccidents with severe injuries in Israel for theyears 1996ndash1999 We then divided these numbersby our estimate of the total number of auto insur-ance claims in Israel (161859)50 We obtain that159 percent of the claims involve reported acci-dents 22 percent involve accidents with severeinjuries and 03 percent involve fatal accidentsThus the majority of claims reflect small unre-ported accidents perhaps suggesting that theseadditional costs of a claim are often not too large

Deviations from Expected Utility TheorymdashThroughout the paper we restrict attention toexpected utility maximizers Despite much ev-idence in the literature against some of the pre-dictions of expected utility theory it still seemsto us the most natural benchmark to specify andone that facilitates comparison to previous stud-ies We note that expected utility theory is as-sumed it is not and cannot be tested within our

framework Given our cross-sectional analysiswhich in principle allows flexible forms ofunobserved heterogeneity in risk preferencesthere are no testable restrictions imposed byexpected utility theory We should also note thatmuch (but not all) of the documented evidenceagainst expected utility theory arises with ex-treme risk probabilities which are close to zeroor one Our data (and our estimates) are basedon risk probabilities that are all in the range of010 to 035 Over this range expected utilityseems to perform better Finally it is importantto stress two points First at the conceptuallevel it is straightforward to use an alternativetheory of decisions under uncertainty If con-ditional on objective risk individuals vary in asingle dimension the same conceptual modeland empirical strategy can be applied All oneneeds to do is to specify the parameter overwhich decisions vary and construct an indiffer-ence set in the space of the specified parameterand (objective) risk types similar to the onepresented in Figure 2 Second any alternativemodel of decisions under uncertainty would re-quire us to take an even stronger view regardingthe parameterized objective function For exam-ple prospect theory (Daniel Kahneman andAmos Tversky 1979) would require us to pa-rameterize not only the curvature of individu-alsrsquo utility functions but also their referencepoints for which there is no natural choice inour context Similar issues arise if we try toapply decision weights (Tversky and PeterWakker 1995) or measures of overconfidencewith respect to driving ability

F Implications for Profits and Pricing

We now look at how firm profits vary withalternative pricing schemes This exercise isinteresting for several reasons First althoughwe do not use supply-side information for esti-mation it shows how one may incorporate suchinformation in estimation Second it illustratesthe conceptual trade-off faced by a monopolistthat operates in a market with adverse selectionAlthough the conceptual trade-off betweenhigher demand and worse selection is wellknown and has been extensively analyzed in thetheoretical literature quantifying it is importantto understand its empirical relevance Finallywe argued earlier that unobserved heterogeneityin risk aversion seems more important than un-

49 An accident is counted in this measure if it was re-ported to the police and a police officer arrived at the scene

50 This is estimated by taking the number of activepolicies at the end of our sample (45871) dividing it by ourbest guess for the share of the market the company had atthe time (7 percent) and multiplying it by the estimatedclaim rate in our data as computed in Table 2 (0245)

776 THE AMERICAN ECONOMIC REVIEW JUNE 2007

observed heterogeneity in risk The current ex-ercise shows that this conclusion also translatesto pricing and profits

Throughout this section we hold fixed thedistribution of risk and risk aversion the firmfaces Since we have little information about thedeterminants of overall demand faced by thefirm and hence how restrictive it is to hold thedistribution fixed we make the simplifying as-sumption that individuals make their choicessequentially They first choose the insuranceprovider by observing only the price of theregular deductible Once they decide to buy apolicy from the insurer they choose the deduct-ible level This seems a reasonable approxima-tion as the regular deductible is the one alwaysadvertised and initially quoted while the otheroptions are revealed only once the potentialcustomer and the insurance sales person ldquogetinto detailsrdquo Consistent with this assumptionwe assume that the regular premium and de-ductible are dictated by competitive conditionsand we focus on the choice of the low deduct-ible and its associated premium

From the companyrsquos standpoint each indi-vidual can be represented by a random draw of(i ri) from the conditional (on observables)distribution of risk and risk aversion

(20) ln i

ln ri Nxi

xi

2 r

r r2

When analyzing the optimal menu to offer suchan individual the company is assumed to berisk neutral and to maximize expected profitsSuppose the company offered only the regulardeductible and premium (dh ph) Let the ex-pected profits from this strategy be 0 Givenour assumptions we proceed by analyzing howthe firmrsquos profits are affected by a choice of anadditional low deductible option (dl pl) withdl dh and pl ph It is easy to use a changein variables and analyze the choice of d dh dl and p pl ph Expected profits arenow given by

(21) maxdp

0 Prri rii d pp

d Eiri rii d p

The trade-off in the companyrsquos decision isstraightforward Each new customer who chooses

the low combination pays an additional p up-front but saves d for each accident she is in-volved in This translates into two effects thatenter the companyrsquos decision problem The first issimilar to a standard pricing problem higher(lower) price difference (deductible difference)p (d) leads to a higher markup (on those indi-viduals who select the low deductible) but tolower quantity (or probability of purchase) asfewer individuals elect to choose the low deduct-ible This effect enters the profit function throughD(d p) Pr(ri ri(i d p)) The secondcomposition effect arises because of adverse se-lection As the price of the low deductible in-creases those individuals who still elect to choosethe low combination are ceteris paribus thosewith higher risk This effect enters through (dp) E(iri ri(i d p)) Its magnitude andsign depend on the relative heterogeneity of i andri and on the correlation between them Sinceneither D(d p) nor (d p) has a closed-form solution we analyze this decision problemgraphically where D(d p) and (d p) arenumerically computed using simulations from thejoint distribution

We illustrate our analysis by using the meanindividual in the data whose expected ln(i)and ln(ri) are 154 and 1181 respectively(based on Table 4) Such an individual has tochoose between a regular contract of ( phdh) (3189 1595) (in NIS) and a lowcontract of ( pl dl) (3381 957) ie (pd) (191 638) Below we discuss severaladditional counterfactual cases First we con-sider a case with a negative correlation be-tween risk and risk aversion (with the samemagnitude ie 084) Second we con-sider cases when the company ignores unob-served heterogeneity in one of the dimensionsie it views individuals as a draw from theestimated marginal distribution on one dimen-sion with the other dimension known and fixedat its estimated mean We do this exercise foreach dimension separately

To get intuition for the different effects Figure2 presents the estimated distribution in the spaceof (i ri) A small increase (decrease) in p (d)shifts the indifference setup to the right therebymaking some marginal individuals who were pre-viously just to the right of it switch to choosingthe regular deductible The demand trade-off isjust the comparison between the marginal loss ofthe company from all the marginal individuals

777VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

who no longer buy higher coverage with thehigher profits made from the inframarginal indi-viduals who still elect to choose higher coverageFigure 2 also helps in illustrating the effect ofadverse selection and the importance of the cor-relation coefficient As the menu shifts to the rightthe positive correlation implies that the marginalindividuals have higher risk than the average Thismeans that ldquolosingrdquo them (namely havingthem buy less coverage) is not as costly forthe insurance company as such individualsare on average more adversely selected Anegative correlation for example would havemade these marginal individuals more valuablethereby decreasing the incentive to increase pricesor deductibles from the current levels

Figure 4 presents the effects of pricing onprofits by varying the low-deductible levelkeeping the premium charged for it fixed at theobserved price (of p 191) It implies that the

current low deductible benefit of 638 NIS re-sults in additional annual profits of about 368NIS per customer This is about 037 percent oftotal operating profits per customer which isabout 1000 NIS Note however that after sub-tracting the administrative and claim-handlingcosts associated with each customer and claimthe relative magnitude of this effect will behigher Note also that the estimates imply thatthe current low-deductible level is suboptimalBy setting a smaller low-deductible benefit ofd 355 NIS (ie increasing the current lowdeductible by 283 NIS) additional profits canbe increased to 659 NIS51 Of course the find-

51 There is no reason of course to limit the choice of thecompany to only one additional deductible level Moredegrees of freedom in choosing the pricing menu will leadto higher profits

FIGURE 4 COUNTERFACTUALSmdashPROFITS

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) We plot the additional profits fromoffering a low deductible as a function of the attractiveness of a low deductible d dh dl fixing its price at the observedlevel The thick solid line presents the counterfactual profits as implied by the estimated benchmark model The other threecurves illustrate how the profits change in response to changes in the assumptions when the correlation between riskand risk aversion is negative (thin solid line) when there is no heterogeneity in risk aversion (dot-dashed line) and whenthere is no heterogeneity in risk (dashed line) The maxima (argmax) of the four curves respectively are 659 (355)714 (583) 0 and 704 (500) The dotted vertical line represents the observed level of d (638) which implies that theadditional profits from the observed low deductible are 368 NIS per policy

778 THE AMERICAN ECONOMIC REVIEW JUNE 2007

ing that the current pricing is suboptimal mayalso be due to a limitation of the model

Consistent with the intuition discussed aboveFigure 4 also shows that the incentive to in-crease prices (or lower the deductible) is higherwith positive correlation When the correlationbetween risk and risk aversion is negative theoptimal low deductible is lower It also showsthat ignoring either dimension of unobservedheterogeneity has an important effect on pric-ing decisions However while ignoring het-erogeneity in risk does not change thequalitative pattern by much ignoring hetero-geneity in risk aversion completely changesthe pricing analysis In fact given the esti-mated coefficients and the observed prices ifindividuals vary only in risk offering a lowdeductible does not increase profits due to ad-verse selection

Figure 5 breaks down the effect on profits bypresenting the pricing effect on the demand forlow deductible D(d p) and on the compo-sition effect (d p) The former is simplygenerated by the distribution of certainty equiv-alents implied by the joint distribution of i andri (Landsberger and Meilijson 1999) It is S-shaped due to the assumption of lognormal dis-tribution The shape of the composition effect isdriven by the relative variance of i and ri andby the correlation coefficient As the estimatesimply that most of the variation in certaintyequivalents is driven by variation in ri thestrong positive correlation implies that the com-position effect is monotonically decreasing inthe deductible level As the low-deductible op-tion becomes more favorable more peoplechoose it with the most risky individuals choos-ing it first The effect of the deductible level onthe composition effect is dramatically differentwhen the correlation between risk and risk aver-sion is negative When this is the case theobserved relationship between the deductiblelevel and the composition effect is mostly re-versed This is because the effect of risk aver-sion dominates that of adverse selection due toits higher variance

IV Concluding Remarks

The paper makes two separate contributionsFirst from a methodological standpoint we layout a conceptual framework through which one

can formulate and estimate a demand system forindividually customized contracts The key datarequirements for this approach are contractchoices individual choice sets and ex post riskrealizations Since such data may be available inmany other contexts the methodological frame-work may be useful to uncover structural pa-rameters in such settings As an example onecould consider annuity data and use guaranteeperiod choices and mortality data to identifyheterogeneity in risk (mortality) and in prefer-ences for wealth after death (Einav Finkelsteinand Paul Schrimpf 2006) Similarly one couldconsider loan data and use down-paymentchoices and default data to identify heterogene-ity in risk (default) and in liquidity (Will Ad-ams Einav and Jonathan D Levin 2006)Second from an economic standpoint we pro-vide a new set of estimates for the degree andheterogeneity of (absolute) risk aversion and itsrelationship with risk We discuss these below

While our estimates of risk aversion help topredict other related insurance decisions it isnatural to ask to what extent these parametersare relevant in other contexts This is essentiallyan empirical question which can be answeredonly by estimating risk aversion parameters fora variety of bet sizes and in a variety of con-texts Since isolating risk preferences in manycontexts is hard such exercises are rare leavingus with no definite answer for the scope ofmarkets for which our estimates may be rele-vant On one hand Rabin (2000) and Rabin andThaler (2001) argue that different decisions inlife are taken in different contexts and thereforemay be subject to different parameters in theutility function On the other hand classicaltheory suggests that each individual has onevalue function over her lifetime wealth so allrisky decisions take into account the same valuefunction and are therefore subject to the samerisk preferences Our view is somewhere inbetween We are more comfortable with extrap-olation of our risk aversion estimates to setupsthat are closer to the auto insurance marketcontext in which these parameters are esti-mated To assess ldquoclosenessrdquo it is important toconsider various factors over which contextsmay differ Such factors may include bet size aswell as informational and behavioral effectssuch as default options framing and rarity of theevents Since we use claim data to identify risktype we leave everything else to be interpreted as

779VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

FIGURE 5 COUNTERFACTUALSmdashSELECTION

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) Parallel to Figure 4 we breakdown the effects on profits to the share of consumers who choose a low deductible (bottom panel) and to the expected riskof this group (top panel) This is presented by the thick solid line for the estimates of the benchmark model As in Figure 4 wealso present these effects for three additional cases when the correlation between risk and risk aversion is negative (thin solidline) when there is no heterogeneity in risk aversion (dot-dashed line) and when there is no heterogeneity in risk (dashedline) The dotted vertical lines represent the observed level of d (638) for which the share of low deductible is 16 percentand their expected risk is 0264 This may be compared with the corresponding figures in Table 2A of 178 and 0309respectively Note however that the figure presents estimated quantities for the average individual in the data while Table2A presents the average quantities in the data so one should not expect the numbers to fit perfectly

780 THE AMERICAN ECONOMIC REVIEW JUNE 2007

risk aversion As an example overconfidence willbe captured by a lower level of estimated riskaversion so if overconfidence is more importantin auto insurance than in health insurance whenextrapolated to health insurance individuals maybehave as if they are more risk averse than weestimate them to be

With this caveat in mind let us discuss ourfour main findings First we find large hetero-geneity in risk preferences across individualsThis heterogeneity is important in various con-texts (a) it cautions against using pure cross-sectional variation to test expected utilitytheory (b) it may create strong selection ofparticipants into particular markets this maymake participants in voluntary markets quitedifferent in their risk preferences from those inmarkets in which participation is either manda-tory or driven by other factors and (c) it maymake the marginal individual quite differentfrom the average one Models in macroeconom-ics and finance which often use a representativeindividual framework may not be able to cap-ture and account for such differences

Our second set of findings concerns the wayrisk aversion relates to observable characteris-tics Our finding that women are more riskaverse than men has been documented in othersettings The finding that risk preferences ex-hibit a U-shaped pattern over the life cycle maybe interesting to explore further in other con-texts Other findings suggest that the estimatedcoefficient of risk aversion increases with ob-servables that are related to income and wealthAs we agree with the widely held belief of thedecreasing absolute risk aversion property ourpreferred interpretation for this finding is thatwealth and income may be endogenous gener-ating the estimated cross-sectional relationshipWhile lower risk aversion may be associatedwith higher propensity to become an entrepre-neur and thereby have higher wealth it mayalso be associated with lower propensity to saveor invest in education affecting wealth the otherway Therefore one important message of thisfinding is that accounting for heterogeneity inpreferences may be important as representativeconsumer models may provide misleading in-terpretations for otherwise natural results

The last two sets of findings concern therelationship between risk and preferences Sincerisk is particular to the context in which it ismeasured these findings may be sensitive to the

market context and may change once risk takesother forms Even within the auto insurancemarket it is important how risk is measuredMoreover risk in the auto insurance marketmay be conceptually different from risk in othermarkets In many markets risk is independentacross individuals In auto insurance howevermuch of the risk depends on coordinationamong drivers and therefore may be more re-lated to relative not absolute characteristicsFor this reason our finding of positive correla-tion between risk and risk aversion can coexistwith findings of negative correlation in othercontexts (Israel 2005 Finkelstein and McGarry2006) The positive correlation we find is alsoconsistent with the fact that the bivariate probittest in our data provides evidence for adverseselection (Cohen 2005) while similar reduced-form tests in other contexts do not

Finally we find that unobserved heterogene-ity in risk preferences is more important thanheterogeneity in risk This may be driven by thecasual evidence that insurance companies exertmuch effort and resources in collecting con-sumer data which are informative about riskclassification but not about preferences52 Weillustrate the empirical importance of our find-ings for the analysis of optimal contracts in autoinsurance The presence of more than one di-mension of unobserved heterogeneity may dra-matically change the nature of these contractsTheory is still not fully developed for suchmultidimensional screening problems as it typ-ically requires a small number of types (Lands-berger and Meilijson 1999) restricts the twodimensions to be independent of each other(Jean-Charles Rochet and Lars A Stole 2002)or assumes that the number of instrumentsavailable to the monopolist is not smaller thanthe dimension of unobserved heterogeneity(Steven Matthews and John Moore 1987 Rich-ard Arnott and Joseph E Stiglitz 1988)53 MarkArmstrong (1999) may be the closest theoreticalwork to the framework suggested here It cannot

52 This choice of data collection efforts may be justifiedif it is easier for such firms to price discriminate based onrisk but harder to price discriminate based on preferencesA common belief is that without cost-based (ie risk-based) justification for prices price discrimination may leadto consumer backlash

53 See also Smart (2000) Villeneuve (2003) and Jullienet al (2007) for related theoretical results

781VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

be directly applied however as it uses simpli-fying linearity assumptions which would behard to impose in the current context Our re-sults indicate that many applications can benefitfrom extending the theory to include the more

general case such as the one analyzed hereSuch a theory may also serve as a guide forusing supply-side restrictions in similar con-texts Our counterfactual analysis is a very pre-liminary start in this direction

APPENDIX A DESCRIPTION OF THE GIBBS SAMPLER

In this appendix we describe the setup of the Gibbs sampler that we use to estimate the modelOne of the main advantages of the Gibbs sampler is its ability to allow for data augmentation oflatent variables (Tanner and Wong 1987) In our context this amounts to augmenting the individual-specific risk aversion and risk type namely i rii1

n as additional parametersWe can write the model as follows

(22) ln i xi i

(23) ln ri xi vi

(24) choicei 1 if ri ri ()0 if ri ri ()

(25) claimsi Poissoni ti

(26) i

vi

iid

N00

2 r

r r2

Let

2 r

r r2 X

x1

xn

0

0 x1

xn

y r and ui i

vi

The set of parameters for which we want to have a posterior distribution is given by uii1

n The prior specifies that are independent of uii1n have a conventional

diffuse prior We adopt a hierarchical prior for uii1N

(27) uii 1n

iid

N0

(28) 1 Wishart2a Q

782 THE AMERICAN ECONOMIC REVIEW JUNE 2007

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 14: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

Once the distribution of risk types is iden-tified from claims data the marginal distribu-tion of risk aversion (and its relationship tothe distribution of risk types) is nonparametri-cally identified from the variation in theoffered menus discussed in Section I Thisvariation implies different deductible and pre-mium options to identical (on observables)individuals who purchased insurance at dif-ferent times Different menus lead to differentindifference sets (similar to the one depictedin Figure 2) These sets often cross each otherand nonparametrically identify the distribu-tion of risk aversion and the correlation struc-ture at least within the region in which theindifference sets vary For the tails of thedistribution as is typically the case we haveto rely on parametric assumptions or usebounds The parametric assumption of log-normality we use for most of the paper ismade only for computational convenience

Intuition for the Parametric IdentificationMechanismmdashVariation in the offered menus isimportant for the nonparametric identificationThe parametric assumptions could identify themodel without such variation Thus to keep theintuition simple let us take the bivariate log-normal distribution as given and contrary to thedata assume that all individuals are identical onobservables and face the same menu Supposealso that all individuals are observed for exactlyone year and have up to two claims23 In thissimplified case the model has five parametersto be estimated the mean and variance of risk and

2 the mean and variance of risk aver-sion r and r

2 and the correlation parameter The data can be summarized by five numbersLet 130 131 and 132 1 131 130 be thefraction of individuals with zero one and twoclaims respectively Let 0 1 and 2 be theproportion of individuals who chose a low de-ductible within each ldquoclaim grouprdquo Given ourdistributional assumption about the claim-

generating process we can use 130 and 131 touniquely identify and

2 Loosely isidentified by the average claim rate in the dataand

2 is identified by how fat the tail of theclaim distribution is ie by how large (132131)is compared to (131130) Given and

2 and thelognormality assumption we can (implicitly)construct a posterior distribution of risk typesfor each claim group F(r claims c) andintegrate over it when predicting the deductiblechoice This provides us with three additionalmoments each of the form

(14) Ec Prchoice 1r

dFr claims c dFr

for c 0 1 2 These moments identify thethree remaining parameters of the model rr

2 and Let us now provide more economic content to

the identification argument Using the same ex-ample and conditional on identifying and

2

from the claim data one can think about thedeductible choice data 0 1 2 as a graph(c) The absolute level of the graph identifiesr In the absence of correlation between riskand risk aversion the slope of the graph iden-tifies r

2 with no correlation the slope shouldalways be positive (due to adverse selection)but higher r

2 would imply a flatter graph be-cause more variation in the deductible choiceswill be attributed to variation in risk aversionFinally is identified by the curvature of thegraph The more convex (concave) the graph isthe more positive (negative) is the estimated For example if 0 05 1 051 and 2 099 it is likely that r

2 is high (explaining why0 and 1 are so close) and is highly positive(explaining why 2 is not also close to 1) Incontrast if 0 1 it must mean that thecorrelation between risk and risk aversion isnegative which is the only way the originalpositive correlation induced by adverse selec-tion can be offset This intuition also clarifiesthat the identification of relies on observingindividuals with multiple claims (or differentpolicy durations) and that it is likely to besensitive to the distributional assumptions Thedata (Table 2B) provide a direct (positive) cor-relation between deductible choice and claims

endogenous The results are similar when we use onlyexpired and truncated policies

23 This variation is sufficient (and necessary) to identifythe benchmark model The data provide more variation weobserve up to five claims per individual we observe con-tinuous variation in the policy durations we observe vari-ation in prices and we exploit distributional restrictionsacross individuals with different observable characteristics

758 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The structural assumptions allow us to explainhow much of this correlation can be attributedto adverse selection The remaining correlation(positive or negative) is therefore attributed tocorrelation in the underlying distribution of riskand risk aversion

III Results

A Reduced-Form Estimation

To get an initial sense for the levels of abso-lute risk aversion implied by the data we use asimple back-of-the-envelope exercise We com-pute unconditional averages of p d dh dland d (Table 2A)24 and substitute these valuesin equation (7) The implied coefficient of ab-solute risk aversion is 29 104 NIS125 Thisfigure could be thought of as the average indif-ference point implying that about 18 percent ofthe policyholders have coefficients of absoluterisk aversion exceeding it To convert to USdollar amounts one needs to multiply thesefigures by the average exchange rate (352)resulting in an average indifference point of102 103 $US1 This figure is less than halfof a similar estimate reported by Sydnor (2006)for buyers of homeownerrsquos insurance but about3 and 13 times higher than comparable fig-ures reported by Gertner (1993) and Metrick(1995) respectively for television game showparticipants

Table 3 provides reduced-form analysis ofthe relationship between the observables andour two left-hand-side variables the number ofclaims and the deductible choice26 Column 1reports the estimates from a Poisson regressionof the number of claims on observed character-istics This regression is closely related to therisk equation we estimate in the benchmarkmodel It shows that older people women andpeople with a college education are less likely to

have an accident Bigger more expensiveolder and noncommercial cars are more likelyto be involved in an accident Driving experi-ence and variables associated with less intenseuse of the car reduce accident rates As could beimagined claim propensity is highly correlatedover time past claims are a strong predictor offuture claims Young drivers are 50 to 70 per-cent more likely to be involved in an accidentwith young men significantly more likely thanyoung women Finally as indicated by the trendin the estimated year dummies the accident ratesignificantly declined over time Part of thisdecline is likely due to the decline in accidentrates in Israel in general27 This decline mightalso be partly due to the better selection ofindividuals the company obtained over time asit gained more experience (Cohen 2003)

Columns 2 and 3 of Table 3 present estimatesfrom probit regressions in which the dependentvariable is equal to one if the policyholder chosea low deductible and is equal to zero otherwiseColumn 3 shows the marginal effects of thecovariates on the propensity to choose a lowdeductible These marginal effects do not have astructural interpretation as the choice of lowdeductible depends on its price on risk aver-sion and on risk In this regression we againobserve a strong trend over time Fewer policy-holders chose the low deductible as time wentby One reason for this trend according to thecompany executives is that over time the com-panyrsquos sales persons were directed to focusmainly on the ldquodefaultrdquo regular deductible op-tion28 The effect of other covariates will bediscussed later in the context of the full modelIn unreported probit regressions we also test thequalitative assumptions of the structural model byadding three additional regressors the price ratio(pidi) the average deductible offered d i

24 The unconditional is computed by maximum likeli-hood using the data on claims and observed durations of thepolicies

25 Using the CARA specification as in equation (16) weobtain a slightly lower value of 25 104 NIS1

26 We find positive correlation in the errors of these tworegressions suggesting the existence of adverse selection inthe data and motivating a model with unobserved heteroge-neity in risk This test is similar to the bivariate probit testproposed by Chiappori and Salanie (2000) and replicatesearlier results reported in Cohen (2005)

27 In particular traffic fatalities and counts of trafficaccidents in Israel fell by 11 percent and 18 percent during1998 and 1999 respectively

28 Such biased marketing efforts will bias consumersagainst choosing the low deductible thus making them lookless risk averse This would make our estimate a lowerbound on the true level of risk aversion If only sophisti-cated consumers could see beyond the marketing effort andthis sophistication were related to observables (eg educa-tion) the coefficients on such observables would be biasedupward This is not a major concern given that the coefficientson most of the covariates are fairly stable when we estimatethe benchmark model separately for different years

759VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

TABLE 3mdashNO HETEROGENEITY IN RISK

Variable

Poisson regressiona Probit regressionsc

Dep var Number of claims Dep var 1 if low deductible

(1) (2) (3)

Coeff Std Err IRRb Coeff Std Err dPdX dPdXDemographics Constant 43535 02744 mdash 224994 23562 mdash

Age 00074 00052 09926 01690 00464 00034 00042Age2 989 105 536 105 10001 00017 00005 339 105 455 105Female 00456 00161 09555 08003 01316 00165 00134Family Single omitted omitted omitted

Married 01115 00217 08945 06417 01950 00128 00045Divorced 00717 00346 10743 03191 03127 00064 00044Widower 00155 00527 10156 03441 04573 00069 00025Other (NA) 01347 01755 11442 29518 19836 00511 00370

Education Elementary 00522 00550 09491 02927 04278 00061 00016High school omitted omitted omittedTechnical 00373 00308 10380 05037 02520 00105 00138College 00745 00197 09282 05279 01546 00109 00053Other (NA) 00116 00184 10116 00630 01540 00013 00025

Emigrant 00210 00163 10213 00497 01328 00010 00003

Car attributes Log(Value) 01227 00281 11306 13285 02342 00271 00299Car age 00187 00042 10189 01603 00357 00033 00017Commercial car 01394 00326 08699 09038 02781 00177 00294Log(engine size) 02972 00459 13461 06924 03952 00141 00075

Driving License years 00204 00034 09798 01043 00312 00021 00005License years2 00002 677 105 10002 00015 00005 299 105 167 105

Good driver 00176 00191 09825 07207 01574 00148 00152Any driver 00564 00169 10580 10923 01321 00217 00258Secondary car 00859 00209 09177 00038 01626 00001 00070Business use 01852 00293 12034 07951 02737 00156 00017History length 00527 00110 09486 11218 01450 00228 00171Claims history 06577 00390 19304 04654 05460 00095 00496

Young driver Young driver 05235 00361 16879 28847 06305 00524 00012Gender Male omitted omitted omitted

Female 01475 00288 08629 17959 03135 00396 00195Age 17ndash19 omitted omitted omitted

19ndash21 00701 00532 10726 07800 07509 00153 0014721ndash24 00267 00574 09737 05746 07773 00114 0015624 02082 00567 0812 17869 07328 00397 00136

Experience 1 omitted omitted omitted1ndash3 02416 00458 07854 10175 06577 00217 000093 02827 00532 07538 32513 07386 00762 00410

Company year First year omitted omitted omittedSecond year 00888 00198 09150 44513 01506 00792 00859Third year 00690 00222 09334 85888 01820 01303 01367Fourth year 01817 00232 08339 118277 02102 01616 01734Fifth year 05431 0028 05810 143206 02778 01920 02085

105586Obs 105800 94000 105800Pseudo R2 00162 01364 01296Log likelihood 577459 369595 430869

Significant at the 5 percent confidence levela Maximum likelihood estimates Variation in exposure (policy duration) is accounted forb IRR Incidence rate ratio Each figure should be interpreted as the increasedecrease in claim probability as a result of an increase of one unit in the

right-hand-side variablec There are two separate probit regressions reported in this table Column 2 relies on the deductible choice model and the lognormality assumption As discussed

in the text by including an additional regressor ln([pi((xi)di) 1]d i) (with (xi) predicted from column 1 above) and normalizing its coefficient to 1 we obtaina structural interpretation of this regression with ln (ri) as the dependent variable Thus the reported coefficients are comparable to those estimated for the benchmarkmodel One should be cautious however in interpreting these coefficients Unlike the benchmark model this regression does not allow unobserved heterogeneity inrisk and suffers from some selection bias because observations with high predicted risk rate are omitted (which is why the number of observations is 94000 ratherthan the full sample of 105800) For comparison column 3 reports the marginal effects from a comparable probit regression that uses the full sample and does notcontrol for pricing and predicted risk through the additional structural regressor Column 3 does not have a structural interpretation and its (unreported) coefficientscannot be compared to those estimated from the benchmark model

760 THE AMERICAN ECONOMIC REVIEW JUNE 2007

and the risk rate (xi) as predicted from thePoisson regression of column 1 All three addi-tional regressors enter with the predicted signand with large and highly significant marginaleffects29

Finally column 2 of Table 3 presents animportant specification of the probit regres-sion in which ln([pi(( xi)di) 1]d i) isadded as a regressor and its coefficient isnormalized to 130 As already mentioned ifwe assume that there is no unobserved heter-ogeneity in risk then column 2 is analogousto the ln(ri) equation of the benchmark modelThis restriction of the model is rejected by thedata About 10 percent of the individuals arepredicted to have (xi) (pi di) implying achoice of low deductible for any level of riskaversion Many of these individuals howeverstill choose higher deductible levels Column2 reports the results for the remaining indi-viduals (94000 out of 105800) for which theregression can be run While the signs of theestimated coefficients are similar to those inthe benchmark model presented below therestricted version of the model suggests muchhigher levels and dispersion of risk aversionwell above any reasonable level31 The fullestimation of the benchmark model clearly re-jects this restriction on the model

B Estimation of the Benchmark Model

The Effect of Individual Characteristics onRisk AversionmdashTable 4 presents the estimationresults from the benchmark model The secondcolumn shows how the level of absolute riskaversion is related to individual characteristicsAs the dependent variable is in natural loga-

rithm coefficients on dummy variables can bedirectly interpreted as approximate percentagechanges The vast majority of these coefficientsare quite stable across a wide range of specifi-cations which are mentioned later

The results indicate that women are more riskaverse than men and have a coefficient of ab-solute risk aversion about 20 percent greaterthan that of men These results are consistentwith those of Donkers et al (2001) and Hartoget al (2002) The estimated effect of age sug-gests a nonmonotone pattern of risk preferencesover the life cycle The estimated coefficientsimply that initially (that is at age 18 the youngestindividual in the data) individuals become less riskaverse with age but around the age of 48 indi-viduals start becoming more risk averse32 Mar-ried individuals are estimated to be significantlymore risk averse compared to singles while di-vorced individuals are less (although the coeffi-cient is statistically insignificant)

The analysis suggests that variables that arelikely to be correlated with income or wealthsuch as post-high-school education and thevalue of the car have a positive coefficientindicating that wealthier people have higherlevels of absolute risk aversion Although we donot have data on individualsrsquo income or wealthwe use other proxies for income in additionalspecifications and obtain mixed results Whenwe include as a covariate the average householdincome among those who live in the same zipcode we obtain a significant and negative co-efficient of 0333 (0154) (standard deviationin parentheses) When we match zip code in-come on demographics of the individuals thecoefficient is effectively zero 0036 (0047)and when we use a socioeconomic index ofthe locality in which the individual lives thecoefficient is positive 0127 (0053)33 Thus

29 The estimated marginal effect (z-statistic in parenthe-ses) is 0352 (1376) 16 104 (1481) and 0154(255) for (pidi) d i and (xi) respectively

30 The level is a normalization The sign is estimatedHad the sign on this regressor been positive this would haveimplied a rejection of the model

31 For the implied median level of risk aversion the re-stricted model produces a similar estimate to the estimate wereport below for the benchmark model However since someindividuals who chose the low deductible are estimated to havevery low claim rates the restricted model is ldquoforcedrdquo to estimatevery high risk aversion for these individuals (in contrast thebenchmark model can explain such choices by positive risk re-siduals) resulting in very high dispersion and (due to the lognor-mality assumption) very high average risk aversion which isabout 1025 higher than the benchmark estimates we report below

32 This nonmonotone pattern may explain why age enterswith different signs in the estimation results of Donkers et al(2001) and Hartog et al (2002) A somewhat similar U-shapepattern with respect to age is also reported by Sumit Agarwalet al (2006) in the context of consumer credit markets

33 The full results from these specifications are providedin the online Appendix (available at httpwwwe-aerorgdatajune0720050644_apppdf ) The reason we do not usethese income variables in the benchmark specification istheir imperfect coverage which would require us to omitalmost 20 percent of the individuals Other results fromthese regressions are similar to those we report for thebenchmark specification

761VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

overall we interpret our findings as suggestiveof a nonnegative and perhaps positive associ-ation between incomewealth and absolute riskaversion

At first glance these results may appear tobe inconsistent with the widely held beliefthat absolute risk aversion declines withwealth One should distinguish however be-tween two questions (a) whether for a givenindividual the vNM utility function exhibitsdecreasing absolute risk aversion and (b)how risk preferences vary across individuals

Our results do not speak to the first questionand should not be thought of as a test of thedecreasing absolute risk aversion propertyTesting this property would require observingthe same individual making multiple choicesat different wealth levels Rather our resultsindicate that individuals with greater wealthhave utility functions that involve a greaterdegree of risk aversion It might be thatwealth is endogenous and that risk aversion(or unobserved individual characteristics thatare correlated with it) leads individuals to

TABLE 4mdashTHE BENCHMARK MODEL

Variable Ln() equation Ln(r) equation Additional quantities

Demographics Constant 15406 (00073) 118118 (01032) Var-covar matrix ()

Age 00001 (00026) 00623 (00213) 01498 (00097)Age2 624 106 (263 105) 644 104 (211 104) r 31515 (00773)Female 00006 (00086) 02049 (00643) 08391 (00265)Family Single Omitted Omitted

Married 00198 (00115) 01927 (00974) Unconditional statisticsa

Divorced 00396 (00155) 01754 (01495) Mean 02196 (00013)Widower 00135 (00281) 01320 (02288) Median 02174 (00017)Other (NA) 00557 (00968) 04599 (07397) Std Dev 00483 (00019)

Education Elementary 00194 (00333) 01283 (02156) Mean r 00019 (00002)High school Omitted Omitted Median r 727 106 (756 107)Technical 00017 (00189) 02306 (01341) Std Dev r 00197 (00015)College 00277 (00124) 02177 (00840) Corr(r ) 02067 (00085)Other (NA) 00029 (00107) 00128 (00819)

Emigrant 00030 (00090) 00001 (00651) Obs 105800

Car attributes Log(value) 00794 (00177) 07244 (01272)Car age 00053 (00023) 00411 (00176)Commercial car 00719 (00187) 00313 (01239)Log(engine size) 01299 (00235) 03195 (01847)

Driving License years 00015 (00017) 00157 (00137)License years2 183 105 (351 105) 148 104 (254 104)Good driver 00635 (00112) 00317 (00822)Any driver 00360 (00105) 03000 (00722)Secondary car 00415 (00141) 01209 (00875)Business use 00614 (00134) 03790 (01124)History length 00012 (00052) 03092 (00518)Claims history 01295 (00154) 00459 (01670)

Young driver Young driver 00525 (00253) 02499 (02290)Gender Male Omitted mdash

Female 00355 (00061) mdashAge 17ndash19 Omitted mdash

19ndash21 00387 (00121) mdash21ndash24 00445 (00124) mdash24 00114 (00119) mdash

Experience 1 Omitted mdash1ndash3 00059 (00104) mdash3 00762 (00121) mdash

Company year First year Omitted OmittedSecond year 00771 (00122) 14334 (00853)Third year 00857 (00137) 28459 (01191)Fourth year 01515 (00160) 38089 (01343)Fifth year 04062 (00249) 39525 (01368)

Note Standard deviations based on the draws from the posterior distribution in parentheses Significant at the 5 percent confidence levela Unconditional statistics represent implied quantities for the sample population as a whole ie integrating over the distribution of covariates in the sample (as

well as over the unobserved components)

762 THE AMERICAN ECONOMIC REVIEW JUNE 2007

save more to obtain more education or totake other actions that lead to greaterwealth34

Let us make several additional observationsFirst while owners of more expensive cars ap-pear to have both higher risk exposure andhigher levels of risk aversion owners of biggercars have higher risk exposure but lower levelsof risk aversion This should indicate that thestructure of the model does not constrain therelationship between the coefficients in the twoequations Rather it is the data that ldquospeak uprdquoSecond individuals who are classified by theinsurer as ldquogood driversrdquo indeed have lowerrisk but also appear to have lower risk aversionThis result is somewhat similar to the positivecorrelation between unobserved risk and unob-served risk aversion which we report belowThird policyholders who tend to use the car forbusiness are less risk averse This could bebecause uninsured costs of accidents occurringto such policyholders are tax deductible Fourthpolicyholders who reported three full years ofclaims history are more risk averse but are notdifferent in their risk exposure The attitude thatleads such policyholders to comply with therequest to (voluntarily) report three full years ofclaims history is apparently and not surpris-ingly correlated with higher levels of risk aver-sion In contrast while past claims indicate highrisk they have no significant relationship withrisk aversion Finally we find a strong trendtoward lower levels of risk aversion over timeThis is a replication of the probit results dis-cussed earlier

The Effect of Individual Characteristics onClaim RiskmdashThe first column of Table 4 de-scribes the relationship between observablesand risk exposure Accident risk is higher fordivorced individuals and lower for people witha college education Bigger older more expen-sive and noncommercial cars are all morelikely to be involved in an accident Drivingexperience reduces accident rates as do mea-sures of less intense use of the car while youngdrivers are more exposed to risk Claim propen-sity is highly correlated over time the voluntaryreport of past claims is a strong predictor offuture claims This risk equation produces re-sults that are similar to those of the simplerPoisson regression reported in Table 3 Al-though some of the coefficients lose signifi-cance the magnitude of most coefficients isquite similar The similarity between these twosets of results is to be expected as the riskregression is identified primarily from the dataon claims so incorporating the information ondeductible choice does not qualitatively changethe conceptual identification strategy (see Sec-tion IIC) If the results were not similar thiscould have indicated a misspecification of themodel The slight differences between the riskregressions in Table 3 and Table 4 are drivenmainly by the structural assumptions First thebenchmark model estimates a lognormal mix-ture of Poisson rates rather than a single Pois-son model By incorporating the fatter tails ofthe claim distribution it slightly changes theresults increases the standard errors and de-creases the average predicted claim rate Sec-ond the information on deductible choiceslightly helps us in obtaining more precise es-timates through the correlation structure be-tween the error terms in the two equations

The Implied Level of Risk AversionmdashOne ofthe main goals of the estimation is to obtainmeasures of the level of risk aversion in thepopulation we study Since we use Gibbs sam-pling and augment the latent coefficients ofabsolute risk aversion we can directly obtainthe posterior distribution of various moments ofthis distribution At each iteration of the Gibbssampler we compute the mean standard devi-ation (across individuals) and various percen-tiles of the simulated draws of i and ri and thecorrelation between them The far-right columnof Table 4 reports the averages and standard

34 One may be tempted to interpret the positive wealtheffects as an indirect indication of credit constraints wealth-ier individuals are less credit constrained and therefore canafford to purchase more insurance We do not share thisinterpretation for two reasons First the insurance companyobserves these proxies for wealth and conditions on themwhen setting prices Since the willingness to pay for insur-ance is likely to be correlated with the willingness to pay forthe additional insurance provided by the low deductibleoption premiums already reflect this variation We condi-tion on the level of the premium Second paying less exante implies paying more ex post so applying the creditconstraint argument only to the ex ante payment but not tothe probabilistic ex post deductible payments has no theo-retical foundation Essentially the setup of the model linksthe ex ante decisions to the ex post losses which are bothdriven by the curvature of the vNM utility function

763VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

deviations of these computed quantities over theiterations of the Gibbs sampler35 The impliedrisk aversion of the mean individual is 00019which is about seven times greater than theback-of-the-envelope calculation presented inthe previous section As we assume a lognormal

distribution and estimate a relatively high dis-persion coefficient r the estimates also implysignificantly lower values for the median levelof risk aversion As shown later the qualitativepattern of these results is quite robust acrossspecifications

Table 5 summarizes our findings regardingthe level and dispersion of risk aversion Itpresents two ways to interpret the estimates weobtained from the benchmark model as well ascomparisons to specifications with constant ab-solute risk aversion (CARA) utility and with

35 Note that these estimated quantities are unconditionalIn computing these quantities we integrate over the distri-bution of observable characteristics in the data so onecannot compute these estimates from the estimated param-eter directly

TABLE 5mdashRISK-AVERSION ESTIMATES

Specificationa Absolute risk aversionb Interpretationc Relative risk aversiond

Back-of-the-envelope 10 103 9070 1484Benchmark model

Mean individual 67 103 5605 972225th percentile 23 106 9998 003Median individual 26 105 9974 03775th percentile 29 104 9714 42790th percentile 27 103 7834 390295th percentile 99 103 4937 14327

CARA utilityMean individual 31 103 7651 4436Median individual 34 105 9966 050

Learning modelMean individual 42 103 6886 6140Median individual 56 106 9995 008

Comparable estimatesGertner (1993) 31 104 9699 479Metrick (1995) 66 105 9934 102Holt and Laury (2002)e 32 102 2096 86575Sydnor (2006) 20 103 8329 5395

a The table summarizes the results with respect to the level of risk aversion ldquoBack-of-the-enveloperdquo refers to thecalculation we report in the beginning of Section III ldquobenchmark modelrdquo refers to the results from the benchmark model(Table 4) ldquoCARA utilityrdquo refers to a specification of a CARA utility function and ldquolearning modelrdquo refers to a specificationin which individuals do not know their risk types perfectly (see Section IIID) The last four rows are the closest comparableresults available in the literature

b The second column presents the point estimates for the coefficient of absolute risk aversion converted to $US1 unitsFor the comparable estimates this is given by their estimate of a representative CARA utility maximizer For all otherspecifications this is given by computing the unconditional mean and median in the population using the figures we reportin the mean and median columns of risk aversion in Table 6 (multiplied by 352 the average exchange rate during the periodto convert to US dollars)

c To interpret the absolute risk aversion estimates (ARA) we translate them into x u(w) 1frasl2 u(w 100) 1frasl2 u(w x) That is we report x such that an individual with the estimated ARA is indifferent about participating in a 50ndash50lottery of gaining 100 US dollars and losing x US dollars Note that since our estimate is of absolute risk aversion the quantityx is independent of w To be consistent with the specification we use a quadratic utility function for the back-of-the-envelopebenchmark and learning models and use a CARA utility function for the others

d The last column attempts to translate the ARA estimates into relative risk aversion We follow the literature and do soby multiplying the ARA estimate by average annual income We use the average annual income (after tax) in Israel in 1995(51168 NIS from the Israeli Census) for all our specifications and we use average disposable income in the United Statesin 1987 ($15437) for Gertner (1993) and Metrick (1995) For Holt and Laury (2002) and Sydnor (2006) we use a similarfigure for 2002 ($26974)

e Holt and Laury (2002) do not report a comparable estimate The estimate we provide above is based on estimating aCARA utility model for the 18 subjects in their experiment who participated in the ldquo90rdquo treatment which involved stakescomparable to our setting For these subjects we assume a CARA utility function and a lognormal distribution of theircoefficient of absolute risk aversion The table reports the point estimate of the mean from this distribution

764 THE AMERICAN ECONOMIC REVIEW JUNE 2007

incomplete-information (discussed later) and toother comparable figures in the literature Ourbenchmark estimate suggests that an averagequadratic utility maximizer36 will be indifferentabout participating in a 50ndash50 lottery in whichhe gains $100 or loses $5605 A CARA spec-ification suggests a significantly lower value foraverage risk aversion making the mean individ-ual indifferent about a 50ndash50 lottery of gaining$100 or losing $7651 The results from theincomplete information model for the meanlevel of risk aversion are in between these esti-mates All the results suggest that although themean individual exhibits a significant level ofrisk aversion heterogeneity in risk preferencesis important and the median individual is al-most risk neutral with respect to lotteries of$100 magnitude Although the mean is alwaysgreater than the median under the lognormaldistribution the large difference we find is notimposed In principle we could have obtained ahigh level of risk aversion with less heteroge-neity thereby leading to a smaller differencebetween the mean and the median (the esti-mated distribution of risk types is an example)

Let us briefly discuss the relevance of thecomparison to Gertner (1993) and Metrick(1995) There are two ways one can reconcilethe differences between the estimates Firstboth these papers measure risk aversion fortelevision game show participants these arehighly selected groups in a rather ldquorisk-friendlyrdquo environment37 Second the magni-tudes of the stakes are higher The showparticipants bet on several thousand dollars andmore while our average individual risks muchlower stakes in the range of $100 Thus thedifference in the results may be due to the issuesraised in Rabin (2000) regarding the compara-bility of behavior across different contexts andbet sizes We discuss this further in Section IV

A different way to quantify our estimate is byreporting them in relative terms There is noconsensus in the literature as to the relevantwealth that is taken into account in such deci-

sions Therefore for comparability we followthe closest papers in this respect (eg Gertner1993) and use annual income as the relevantwealth We multiply the estimated coefficient ofabsolute risk aversion by the average annualincome in Israel during the observation periodUnder the (questionable) assumption that an-nual income is a good proxy for the relevantwealth at the time of decision making thisproduct would be a proxy for the coefficient ofrelative risk aversion As Table 5 indicates ourbenchmark specification results in an impliedcoefficient of relative risk aversion of about 97A CARA specification results in a lower coef-ficient of 44 On the other hand the medianestimate for relative risk aversion is well belowone Thus the widely used estimate of a lowsingle-digit coefficient of relative risk aversionfalls somewhere between the median and themean and between the median and the seventy-fifth percentile of the risk aversion distribution

The Relationship between Unobserved Riskand Unobserved Risk AversionmdashTable 4 al-lows us to make observations about the relation-ship between risk and risk aversion We firstdiscuss the relative importance of unobservedheterogeneity of both dimensions In the popu-lation we study unobserved heterogeneity inrisk aversion (r) is much greater than unob-served heterogeneity in risk () This is trueboth in absolute terms (315 compared to 015respectively) and after normalizing by the cor-responding mean level38 using the coefficientof variation as a measure of dispersion (027compared to 01 respectively) It is also true forthe overall unconditional dispersion This couldindicate that selection on risk aversion is moreimportant in our data than adverse selection39 Theright metric to use for such statements is not

36 For such an individual the second-order Taylor ex-pansion we use in Section IIA is exact

37 We suspect that individuals who participate in televi-sion game shows are more adventuresome than the generalpopulation Moreover knowing that the audience mightwish to see them keep betting is likely to further encourageparticipants to take risks

38 All covariates are measured in deviations from theirsample mean so the estimated constant in each equation isthe estimated mean of the left-hand-side variable

39 An additional observation is that given our estimatesobservables account for slightly less than 50 percent of thevariation in ln(ri) but for almost 65 percent of the variation inln(i ) This may seem surprising given the finding that disper-sion in risk aversion is more important and thus should be thefocus of insurance companies However this finding is con-sistent with the conventional wisdom that insurance companiesspend much effort and resources on collecting information thathelps in risk classification but only little effort on informationthat predicts willingness to pay

765VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

entirely clear however as one should projectthese estimated variances onto the same scaleof say willingness to pay or profits Profits forexample are affected directly by risk but not byrisk aversion so the comparison above could bemisleading Therefore we relegate the discus-sion of this issue to Section IIIF where weshow that even when we look at pricing andprofits heterogeneity in risk aversion is moreimportant

Table 4 also indicates a strong and signif-icant positive correlation of 084 between un-observed risk aversion and unobserved riskThis result may be surprising because it isnatural to think that risk aversion with respectto financial decisions is likely to be associatedwith a greater tendency to take precautionsand therefore with lower risk Indeed a recentpaper by Finkelstein and McGarry (2006)supports such intuition by documenting anegative correlation between risk aversionand risk in the market for long-term careinsurance (see also Israel 2005 for the autoinsurance market in Illinois) Our marketmight however be special in ways that couldproduce a positive correlation First in con-trast to most insurance markets where apolicyholderrsquos risk depends on the policy-holderrsquos precautions but not on the precau-tions of others accident risk in the autoinsurance market is a result of an interactionbetween onersquos driving habits and those ofother drivers Second the correlation coeffi-cient may be highly sensitive to the particularway we measure risk and risk aversion Thereare many unobserved omitted factors that arelikely to be related to both dimensions Theintensity of vehicle use for example mightbe an important determinant of risk If indi-viduals who are more risk averse also drivemore miles per year a positive correlationbetween risk and risk aversion could emergeThus our results caution against assumingthat risk and risk aversion are always nega-tively correlated Whether this is the case maydepend on the characteristics of the particularmarket one studies and on the particular mea-sure for risk Indeed one can use estimatedannual mileage to control for one omittedvariable that may potentially work to producea positive correlation between risk aversionand risk Despite its partial coverage in thedata and being considered (by the company)

as unreliable40 controlling for annual mileagereported by policyholders reduces the esti-mated correlation coefficient to 06841 Weview this result as consistent with the possi-bility that underlying unobserved factors thataffect risk play an important role in generat-ing the estimated positive correlation betweenrisk and risk aversion A third explanation forthe positive estimated correlation is the dis-tributional assumption As discussed in Sec-tion IIC the correlation coefficient isprobably the parameter that is most sensitiveto these assumptions Indeed as discussedlater when we change our assumptions aboutthe Poisson process and use an alternativedistribution with extremely thin tails the es-timated correlation coefficient reverses signsWe of course view the assumptions of thebenchmark model as more appropriate andtherefore maintain the view that the data sug-gest a positive correlation Finally note thatwhile the correlation parameter we estimate ishigh the implied unconditional correlationbetween risk and risk aversion is less than025 across all reported specifications This isbecause the coefficients on the same covariate(for example the size of the car or whetherthe car is used for business) often affect riskand risk aversion in opposite directions andbecause of the logarithmic transformation

C Stability of the Risk Aversion Coefficients

Estimating risk preferences is motivated bythe idea that the same (or similar) risk aversionparameter may explain risky decisions acrossmultiple contexts This idea is at the heart of thecurrent debate we mention in the introductionregarding the empirical relevance of expectedutility and whether the standard construct of a

40 Insurance companies typically do not use these self-reported mileage estimates as they are considered unreli-able While companies could verify these estimates at thetime of a claim such reports are hard to enforce Anindividual can always claim that her ex ante estimate waslower than it turned out to be Indeed the estimated elas-ticity of risk with respect to estimated mileage is 006(0012) which seems low suggesting a bias downwardpotentially due to ldquoerrors in variablesrdquo bias

41 The full results from this specification are providedin the online Appendix Other results from this specifi-cation are similar to those we report for the benchmarkspecification

766 THE AMERICAN ECONOMIC REVIEW JUNE 2007

single risk aversion parameter that appliesacross many types of decisions is the right wayto think about consumer behavior To the bestof our knowledge there is no direct empiricalevidence on this issue While we defer much ofthis discussion to the concluding section weverify below that at least over a limited set ofchoices in which we observe the same individ-uals the estimated risk preferences help to ex-plain choices over time and across differentcontexts

Stability across ContextsmdashIdeally onewould like to show that the estimated risk pref-erences are stable across multiple lines of insur-ance and perhaps even across other riskydecisions individuals face Absent data on suchchoices we provide evidence that the estimatedrisk aversion coefficients help in predictingother related risky choices Individuals in ourdata had to make three additional coveragechoices in addition to the main policy choicethat we have analyzed so far They had tochoose whether to purchase coverage for caraudio equipment (bought by 498 percent of thesample of new policyholders) for towing ser-vices (bought by 929 percent) and for wind-shield damage (bought by 954 percent) Thesetypes of coverage are sold by the same companybut are administered by a third party so we donot have information about their realizationTherefore we cannot perform a similar exerciseto the one we perform for the deductible choiceOf course one should not expect these addi-tional coverage choices to be perfectly corre-lated with the deductible choice Even if thesame risk preferences are an important factorfor all of these decisions variation in risk acrosscoverage is also important For example own-ership of an expensive car audio system is likelyto affect the purchase of audio coverage inde-pendent of risk preferences Similarly ownersof old cars may value more coverage for towingservices Risk preferences however should en-ter into all of these coverage choices and there-fore we expect these additional types ofcoverages to be positively correlated with thechoice of a low deductible

We coded these three additional coveragechoices as three dummy variables and use themas additional covariates to verify that they canhelp explain unobserved risk aversion Firstwe add these variables to the probit regression

reported in column 3 of Table 3 They all havea positive statistically and economicallysignificant power in predicting the deductiblechoice42 To verify that these reported correla-tions are not driven by correlation in risks wealso estimate the benchmark model with theseadditional variables as covariates The esti-mated coefficients on the three additional cov-erage choices are positive and two of the threeare significant in both the risk and risk aversionequations they are 0053 (0019) 00002 (0021)and 0057 (0023) in the risk equation and0354 (0064) 0123 (0122) and 0813 (0174)in the risk aversion equation for audio towingand windshield coverage respectively (standarddeviations in parentheses)43 This suggests thatthe estimated risk preferences help in explainingmultiple coverage choices across these relatedcontexts

Stability over TimemdashWe now provide evi-dence that the estimated risk preferences arealso stable over time All the individuals whodecide to renew their policy with the companyafter it expires are at least in principle free tochange their deductible choice However anoverwhelming majority of individuals (morethan 97 percent) do not change their deductiblechoices when they renew Even individuals whoinitially chose the rarely chosen ldquohighrdquo andldquovery highrdquo deductible levels typically do notchange their choices Therefore it is also notsurprising that estimating the model on the firstdeductible choice of individuals and the seconddeductible choice of the same individuals yieldssimilar results as shown at the end of the nextsection Of course changes in deductiblechoices do not necessarily imply changes inrisk preferences Risk may also change overtime and may drive such changes At thesame time while persistence in deductiblechoices over time is consistent with stablerisk preferences it may also be driven bymany other factors For example it is reason-able to think that individuals do not devote

42 They are estimated to increase on average the prob-ability of choosing a low deductible by 0019 (819) 0023(527) and 0041 (758) for audio towing and windshieldcoverage respectively (z stat in parentheses)

43 The full results from this specification are provided in theonline Appendix Other results from this specification aresimilar to those we report for the benchmark specification

767VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

the same amount of thought to their renewaldecision as to their initial choice leading tothe observed persistence This caveat is themain reason why we focus most of the analysisonly on new policies

D Robustness

This section discusses the robustness of themain results Results from all specifications wemention in this section are summarized in Table6 The full results from these specifications arerelegated to the online Appendix

The von NeumannndashMorgenstern Utility Func-tionmdashTo derive the benchmark model we as-sume a negligible third derivative of the vNMutility function This is attractive as it allowsus to summarize risk preferences by a one-dimensional parameter r [u13(w)u(w)]There is however a large literature (eg MilesS Kimball and N Gregory Mankiw 1989) em-phasizing the importance of a (positive) third

derivative of the utility function which leads toprecautionary saving It is therefore importantto address the sensitivity of the results to thisassumption

By allowing a third-order Taylor approxima-tion of equation (4) we obtain

(15) r u13w

uw

p

d 1

d

uw

uw

dh2 dh dl dl

2

6d

which reduces to equation (7) when u(w) 0This illustrates that a positive third derivativeprovides an additional (precautionary) incentiveto insure It also shows that in order to fullydescribe preferences in the presence of a thirdderivative a one-dimensional parameter doesnot suffice one needs to know both r [u13(w)u(w)] and [u(w)u(w)] This makesit less attractive for estimation

TABLE 6mdashROBUSTNESS

Specificationa Sample Obs

Claim risk () Absolute risk aversion (r)

Corr(r ) Mean Median Std dev Mean Median Std dev

Baseline estimatesBenchmark model All new customers 105800 0220 0217 0048 19 103 73 106 0020 0207 0839

The vNM utility functionCARA utility All new customers 105800 0219 0217 0048 87 104 b 98 106 b 0009b 0201 0826

The claim generating processBenchmark model No multiple claims 103260 0182 0180 0040 20 103 28 105 0018 0135 0547Thinner-tail risk distribution All new customers 105800 0205c 0171c 0155c 17 103 19 106 0020 0076 0916

The distribution of risk aversionLower-bound procedure All new customers 105800 mdash mdash mdash 37 104 0 0002 mdash mdash

Incomplete information about riskBenchmark model Experienced drivers 82966 0214 0211 0051 21 103 83 106 0021 0200 0761Benchmark model Inexperienced drivers 22834 0230 0220 0073 30 103 12 107 0032 0186 0572Learning model All new customers 105800 0204 0191 0084 12 103 16 106 0016 0200 0772

Sample SelectionBenchmark model First two years 45739 0244 0235 0066 31 103 26 105 0026 0225 0699Benchmark model Last three years 60061 0203 0201 0043 16 103 34 107 0021 0113 0611Benchmark model Referred by a friend 26434 0213 0205 0065 30 103 84 107 0031 0155 0480Benchmark model Referred by advertising 79366 0219 0216 0051 21 103 76 106 0022 0212 0806Benchmark model Non-stayers 48387 0226 0240 0057 23 103 77 107 0026 0149 0848Benchmark model Stayers 1st choice 57413 0190 0182 0057 29 103 29 105 0024 0152 0463Benchmark model Stayers 2nd choice 57413 0208 0200 0065 30 103 16 105 0026 0211 0637

Note The table presents the key figures from various specifications and subsamples tracing the order they are presented in Section IIID Full results (in the format

of Table 4) from all these specifications are available in the online Appendixa ldquoBenchmark modelrdquo refers to the benchmark specification estimated on various subsamples (the first row replicates the estimates from Table 4) The other

specifications are slightly different and are all described in more detail in the corresponding parts of Section IIIDb The interpretation of r in the CARA model takes a slightly different quantitative meaning when applied to noninfinitesimal lotteries (such as the approximately

$100 stakes we analyze) This is due to the positive third derivative of the CARA utility function compared to the benchmark model in which we assume a small

third derivative Thus these numbers are not fully comparable to the corresponding figures in the other specificationsc The interpretation of in the thinner-tail distribution we estimate is slightly different from the standard Poisson rate which is assumed in the other specifications

Thus these numbers are not fully comparable to the corresponding figures in the other specifications

768 THE AMERICAN ECONOMIC REVIEW JUNE 2007

We can allow for a positive third derivativewithout expanding the dimensionality of pref-erences by imposing a parametric functionalform on the utility function Essentially suchparametric form imposes a relationship between[u13(w)u(w)] and [u(w)u(w)] Two standardforms are those that exhibit CARA and thosethat exhibit constant relative risk aversion(CRRA) CRRA requires us to make additionalassumptions about the relevant (and unob-served) wealth level of each individual makingit less attractive The CARA case is imple-mented below

With CARA utility we substitute u(w) exp(rw) in equation (4) and rearrange toobtain

(16) rp

exprdh exprdl

This equation defines the indifference set Unlikethe benchmark model there is no closed-formrepresentation for ri() a property that makesestimation significantly slower Due to the precau-tionary saving incentive arising from the thirdderivative we can use a lower level of absoluterisk aversion to rationalize the low deductiblechoice given In other words the CARA indif-ference set is flatter in comparison to the bench-mark case depicted in Figure 2 Thus the CARAspecification will generally lead to lower estimatesof the coefficient of absolute risk aversion asshown in Table 6 The mean and dispersion ofestimated absolute risk aversion (but not the me-dian) are smaller by about a factor of two Thegeneral qualitative results however remain thesame As shown in Table 5 the mean individual isquite risk averse with respect to lotteries of $100magnitude the median is close to risk neutral withrespect to such lotteries and heterogeneity in riskaversion is important

The Claim-Generating ProcessmdashIn thebenchmark model we assume that claims aregenerated by a Poisson model This assumptionis both analytically attractive (it is important inderiving equation (4)) and in our view a rea-sonable approximation of the data-generatingprocess for claims The Poisson distribution hasbeen widely used in economics to model acci-dent counts (eg Nancy L Rose 1990 Dionneand Vanasse 1992 Ronald Michener and CarlaTighe 1992) In using this distribution re-

searchers have followed the long history of theuse of Poisson by actuaries and insurance spe-cialists going back to Ladislaus Bortkiewicz(1898)

An important restriction of the Poisson dis-tribution is that its mean and variance are thesame Although some economic studies con-firmed this assumption for particular accidentdata (eg Christopher M Auld et al 2001) it isoften the case that this restriction is falsifiedThe most common deviation from the Poissonrestriction is that of fat tails ie variance that ishigher than the mean This led researchers touse a negative binomial distribution to accom-modate this regularity by introducing a secondparameter which delinks the relationship be-tween the mean and the variance One naturalinterpretation of the fat tails is that of unob-served heterogeneity and the negative binomialdistribution can be viewed as a Gamma mixtureof Poisson processes Consistent with this viewwe assume that the claim-generating processfollows a Poisson process at the individuallevel but allows unobserved heterogeneity inrisk and estimates a lognormal mixture of Pois-son processes which is similar to a negativebinomial The dispersion parameter we esti-mate is a free parameter which is identi-fied by the fatness of the tails of the claimdistribution

While one would like to allow fat tails of theaggregate claim distribution one may criticizethe assumption of a Poisson distribution at theindividual level for having tails that may be ldquotoofatrdquo Recently Jaap H Abbring et al (2003) andMark Israel (2004) provided evidence for neg-ative state dependence in data from auto insur-ance claims similar to the ones we useControlling for heterogeneity across individu-als these papers show that a second claim isless likely than a first This may happen due toexperience rating or perhaps to more carefuldriving and less intense use of a car after anaccident The Poisson distribution assumes thatthe second accident is just as likely so negativestate dependence may suggest thinner tails atthe individual level

To verify that the main results are not sensi-tive to this restriction of the Poisson model weperform two tests First we estimate the bench-mark model on a sample that includes onlyindividuals with one or no claims The model isstill identified using variation in policy durations

769VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

The estimates of the level of risk aversion andits dispersion across individuals remain similarto those of the benchmark model (Table 6) Theestimates of risk and the correlation with riskaversion are significantly lower But this isexpected we selected out of the sample thehigh-risk individuals

As a further test for the sensitivity of theresults to possible negative state dependencewe estimated the model with a different distri-bution at the individual level which gives riseto much thinner tails We could not find anyknown distribution of count variables that givesrise to tails thinner than Poissonrsquos so we madeone up In particular for a given Poisson ratelet pn be the probability of observing n occur-rences The distribution we take to the data issuch that

(17) Prn pn2

yenm0 pm2

Thus this distribution makes the probability ofmultiple claims much lower probably muchmore so than any realistic negative state depen-dence in the data Essentially such a distribu-tion makes the econometrician view individualswith multiple claims as high-risk individualswith much more certainty as it requires muchmore ldquobad luckrdquo for low-risk individuals tohave multiple claims The estimates for thelevel of risk aversion and its dispersion arealmost the same as in the benchmark model(Table 6) The rest of the summary figures aredifferent The difference in the level of risk isnot informative the interpretation of isslightly different given the change in the un-derlying distribution The dispersion in ismuch higher This is a direct result of the thin-tail assumption The dispersion of is identifiedby the tails of the claim distribution at theaggregate level With thinner tails imposed onthe individual distribution more heterogeneityis needed to match the observed tails at theaggregate Finally the correlation coefficient inthis specification changes signs and is now neg-ative and close to 1 This is consistent withour discussion in Section IIC which emphasizesthat the identification of the correlation coeffi-cient is closely tied to the structural assump-tions Once these are changed the estimatedcorrelation coefficient would change too Most

importantly however the mean and dispersionof risk aversion are stable

The Distribution of Absolute Risk Aver-sionmdashIn the benchmark model we assume thatthe coefficient of absolute risk aversion is lognor-mally distributed across individuals Since onlyfew previous studies focused on heterogeneity inrisk preferences there is not much existing evi-dence regarding the distribution of risk prefer-ences The only evidence we are aware of is theexperimental results presented by Steffen Andersenet al (2005) which show a skewed distributionwith a fat right tail which is qualitatively consis-tent with the lognormal distribution we assume

An additional advantage of the normality as-sumption is computational It provides a closed-form conditional distribution allowing us to useonly univariate (rather than bivariate) draws inthe estimation procedure significantly reducingthe computational burden One may be con-cerned about the sensitivity of the results to thisdistributional assumption For example it maydrive the result that the median level of riskaversion is much lower than the mean

Incorporating alternative distributional as-sumptions significantly complicates and slowsthe estimation procedure44 As an alternativewe develop a procedure that we believe pro-vides some guidance as to the sensitivity of theresults to this distributional assumption Thedisadvantage of the procedure is that it cannotaccount for adverse selection Since we foundthat adverse selection is not that important thisexercise is informative The exercise conveysthat the main qualitative results are not drivenby the (fat) tails of the lognormal distributionwe impose Rather they are driven by the highfraction of individuals who chose a low deduct-ible despite being of low risk The model im-plies that such individuals must have a fairlyhigh level of risk aversion

The exercise uses a Gibbs sampler to estimatethe lognormal distribution of i given the co-variates and the observed number of claims Ineach iteration of this Gibbs sampler conditional

44 A simple discrete-type distribution of risk aversion isnot well identified with the given data This is due to thenature of the exercise The distribution is identified bythe binary deductible choice and by the ldquolocalrdquo variation inthe pricing menu This variation is not enough to pin downthe discrete types accurately

770 THE AMERICAN ECONOMIC REVIEW JUNE 2007

on the data and the most recent draws for theindividual irsquos we compute lower bounds forthe level and dispersion of risk aversion (inaddition to generating the draws for ri from theconditional lognormal distribution) To deter-mine the lower bound for the level of risk aver-sion we compute

(18) r i i 0 if choicei 0

max0 pi

idi 113d i

if choicei 1

Namely we make ri as low as possible giventhe assumptions of the model We assume thatindividuals who chose a regular deductible arerisk neutral while individuals who chose a lowdeductible are just indifferent between the twodeductible levels (unless i is high enough inwhich case they are also assumed to be riskneutral) We then compute the average over therirsquos To compute the lower bound for the dis-persion we search for values of rirsquos that areconsistent with the assumptions of the modeland the observed data and that minimize thevariance of ri This turns out to be a simplesearch procedure that is linear in the number ofindividuals

The lower bound of the level of risk aversionthat we obtain from this exercise is 368 104

NIS1 (with standard deviation of 638 106)This level is five times lower than the analogouspoint estimate reported in Table 4 This trans-lates into indifference about participating in a50ndash50 lottery of gaining $100 and losing $885Thus it still suggests a significant level of riskaversion for lotteries of $100 magnitude Simi-larly the conversion to relative risk aversion asin Table 5 implies a relative risk aversion co-efficient of 1883 The result for the lowerbound of the dispersion is 189 103 (withstandard deviation of 816 105) which is tentimes lower than the benchmark estimate Thusthe coefficient of variation for absolute riskaversion declines only by two suggesting ahigh degree of heterogeneity Finally oneshould note that these bounds involve extremeassumptions and are computed separately andtherefore cannot be binding at the same timeThus the ldquocorrectrdquo estimates for both quantities

are likely to be higher closer to the results wereport for the benchmark model

Incomplete Information about RiskmdashThroughout we assume that individuals haveperfect information about their individual-specific objective risk type i This is a strongerassumption than we need Because expectedutility is linear in probabilities it suffices thatindividualsrsquo expected risk rate is equal to theirobjective risk rate ie i E(iIi) i wherei is a random variable representing individualirsquos perceived risk rate and Ii is individual irsquosinformation at the time of the coverage choiceNamely individuals could be uncertain abouttheir risk type but their point estimate has to becorrect

There are several channels through whichincomplete information may operate Let usconsider two such cases First suppose thatindividuals are correct but only on average iethat i i i where E(i) 0 The intuitionfor this case is similar to an ldquoerrors in variablesrdquomodel and in principle will result in an evenless important role for adverse selection Giventhat we find a relatively small effect of adverseselection this bias will not change this conclu-sion This may be even more pronounced ifcorr(i i) 0 which reflects a reasonableassumption of ldquoreversion to the meanrdquo ie thatan individualrsquos estimate of his risk type is someweighted average between his true risk type andthe average risk type of individuals who aresimilar (on observables) to him The conclusionmay go in the other way only if the mistakes goin the other direction according to which indi-viduals who are riskier than average believe thatthey are even more risky than they truly areThis we believe is less plausible

As a test for the sensitivity of the results tothe complete information assumption we per-form two exercises The first exercise estimatesthe benchmark model separately for experi-enced drivers and inexperienced drivers wherewe define an individual to be an experienceddriver if he has ten years or more driving expe-rience This is a conservative definition com-pared to those used in the literature (Chiapporiand Salanie 2000 Cohen 2005) The underlyingassumption consistent with evidence providedin Cohen (2005) is that drivers learn about theirown risk types as they gain more driving expe-rience but that most of this learning occurs

771VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

within the first ten years of driving Thus ourcomplete information model is a more appeal-ing description of experienced drivers Theresults for experienced drivers are almost iden-tical to the results from the benchmark specifi-cation (Table 6) suggesting that the mainresults are not driven by the fraction of individ-uals who are less likely to know their risk typesThe results for inexperienced drivers show asimilar pattern and magnitude Consistent withthe learning hypothesis however they show alarger dispersion in risk preferences which maybe due to incomplete information and there-fore to more random choices of deductiblelevels

The second exercise we perform is a morestructural version of the learning story whichallows even experienced drivers to have incom-plete information regarding their risk types Wedo so by estimating a different specification ofthe model that assumes that individuals areBayesian and update their information abouttheir own (stable) risk types over time usingonly information about the number of claimsthey make each year While we do not observecomplete claims histories of individuals wecan simulate such histories and integrate overthese simulations Thus individualsrsquo informa-tion would be related to their true type andwould be more precise with longer driving his-tories The full modeling and estimation detailsof this specification are provided in the Appen-dix We view this model as an extreme versionof incomplete information as there are manyother sources through which individuals maylearn about their own types and thereby havebetter information about their types than whatwe estimate them to have While the resultsimply that the levels of risk aversion and heter-ogeneity are lower than the benchmark esti-mates (Table 6) the order of magnitude andqualitative pattern are quite similar suggestingthat the main qualitative findings are robust tothe information structure

Sample SelectionmdashThere are two differentways to think about the contract selection pro-cess One possibility is that individuals firstselect an insurer based on advertisement wordof mouth or access to an agent Then individ-uals select the insurance contract from amongseveral contracts the selected insurer offersThis selection process is consistent with the way

the company executives view their business45

Another possibility is that individuals first col-lect information about all available insurancecontracts and then choose their most preferredone According to the industryrsquos conventionalwisdom individuals do not shop much acrossinsurers and therefore this selection processseems less important

The unique characteristics of direct insurersmay attract individuals who are more likely toexperiment with new ways of doing businessand may therefore be less risk averse than thegeneral population In Table 7 we compare thedemographics of our sample of policyholderswith those of the general Israeli populationThis comparison reflects a similar intuitioncompared with the general population our av-erage policyholder is slightly younger moreeducated more likely to be male and less likelyto be married or an immigrant This direction ofselection may also apply to unobserved riskpreferences thereby making our policyholderson average less risk averse than a representa-tive individual This suggests that the level ofrisk aversion that we find may be viewed as alower bound on the level of risk aversion in thegeneral Israeli population

One potential way to model sample selectionis to allow for an additional outside option to beselected For the vast majority of individuals weobserve the outside option is to purchase sim-ilar insurance from competing insurance agen-cies Unfortunately data on the structure ofcompeting contracts their prices and the waythey vary with individual characteristics are un-available This makes us uncomfortable to try tomodel sample selection as results from anysuch model will be driven by our assumptionsrather than by meaningful variation in the dataThe results are still meaningful for two reasonsFirst this is a large population accounting forabout 7 percent of all drivers in Israel Secondto the extent that our estimates suggest higherlevels of risk aversion than previously esti-mated and that the sample selection is likely to

45 As discussed in Section I the company was the firstdirect insurance provider and it offered significantly lowerpremiums than those offered by competitors due to signif-icant cost advantage In Section I we also discuss theliterature which emphasizes that choice of direct insurers isdriven primarily by nonmonetary ldquoamenitiesrdquo

772 THE AMERICAN ECONOMIC REVIEW JUNE 2007

bias these estimates downward the results arestill informative

To assess the magnitude of sample selectionin driving the main results we perform severaltests First we estimate the benchmark modelseparately for the first two years of the compa-nyrsquos operation during which the company wasthe only company to sell insurance directly andfor the remaining three years after additionaldirect insurers entered the market Second weestimate the benchmark model separately forindividuals who were referred to the companyby word of mouth and for those who heardabout the company through advertisement (pri-marily on television) The latter may havesearched more and might be more price sensi-tive Third we estimate the benchmark modelseparately for individuals who renewed theirpolicy with the company and for those who didnot renew It seems likely that individuals whodid not renew are less selected as most of themswitch back to regular insurers who insure themajority of the population Fourth we estimatethe model for the second deductible choicemade by those individuals who renew It couldbe argued that switching to other companies is

costly so outside options are not as important indriving deductible choices for those who renewAll these results are summarized in Table 6 andshow some important differences between thetwo groups within each pair of subsamples prob-ably reflecting selection For all groups howeverthe qualitative pattern of the results and the orderof magnitude and dispersion of risk aversion aresimilar to those of the full sample This is sugges-tive that correcting for sample selection is unlikelyto change the qualitative results

E Caveats

Moral HazardmdashThroughout our analysiswe abstract from moral hazard ie we assumethat i can vary across individuals but is invari-ant to the coverage choice There are two typesof moral hazard that may play a role in thiscontext First individuals with less coveragemay take greater precaution and drive morecarefully thereby reducing their claim risk rateSecond conditional on a claim event peoplewith higher deductibles are less likely to file aclaim there exists a range of claims for which

TABLE 7mdashREPRESENTATIVENESS

Variable Sampleb Populationc Car ownersd

Agea 4114 (1237) 4255 (1801) 4511 (1413)Female 0316 0518 0367Family Single 0143 0233 0067

Married 0780 0651 0838Divorced 0057 0043 0043Widower 0020 0074 0052

Education Elementary 0029 0329 0266High school 0433 0384 0334Technical 0100 0131 0165College 0438 0155 0234

Emigrant 0335 0445 0447

Obs 105800 723615 255435

a For the age variable the only continuous variable in the table we provide both the mean and the standard deviation (inparentheses)

b The figures are derived from Table 1 The family and education variables are renormalized so they add up to one weignore those individuals for whom we do not have family status or education level This is particularly relevant for theeducation variable which is not available for about half of the sample it seems likely that unreported education levels arenot random but associated with lower levels of education This may help in explaining at least some of the gap in educationlevels across the columns

c This column is based on a random sample of the Israeli population as of 1995 We use only adult population ieindividuals who are 18 years of age or older

d This column is based on a subsample of the population sample The data provide information about car ownership onlyat the household level not at the individual level Thus we define an individual as a car owner if (a) the household owns atleast one car and the individual is the head of the household or (b) the household owns at least two cars and the individualis the spouse of the head of the household

773VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

filing is profitable only under a low deductible(the literature often refers to this second effectas ldquoex post moral hazardrdquo)

To the extent that moral hazard exists abstract-ing from it will likely bias our estimates of riskaversion downward To see this note that adjust-ing behavior will help individuals to self-insureagainst uninsured costs Similarly ex post moralhazard would reduce the value of a low deduct-ible as in the event of a claim the gain from alower deductible would sometimes be less thanthe difference between the two deductible levelsBoth these effects will make a low deductible lessattractive requiring individuals to be even morerisk averse than we estimate in order to purchasemore coverage Below we discuss why in ourview abstracting from moral hazard is a reason-able approximation in this setting

It seems reasonable to conjecture that ceterisparibus insured individuals will drive less care-fully than uninsured ones It may also seemreasonable that the existence of a deductiblemay make individuals more careful about smalldamages to their car However when all choicesinclude a deductible and deductibles are similarin their magnitude it seems less likely thatdriving or care behavior will be affected46 Fi-nally to separately identify moral hazard onewould need another dimension of the data overwhich risk types remain fixed but coveragechoices vary exogenously (Israel 2004 Jaap HAbbring et al forthcoming)

We rely on the data to justify why we abstractfrom the second potential effect that of ex postmoral hazard Figure 3 presents data on theclaim amounts and shows that about 99 percentof the claims filed by policyholders with low-deductible policies were for amounts greaterthan the higher deductible level If the distribu-tion of amounts of potential claims does notvary with the deductible choice and if individ-uals file a claim for any loss that exceeds their

deductible level this suggests that 99 percent ofthe claims would have been filed under eitherdeductible choice making the assumption toabstract from moral hazard not very restrictive

Individuals however may choose not to filea claim even when the claim amount exceedsthe deductible level This may happen due toexperience rating which increases future insur-ance premiums These dynamic effects do notdepend on the deductible level at the time of theclaim so they simply enter in an additive wayUsing our data on individuals who renew theirpolicies with the company we can assess howbig the dynamic effects are These data showthat the price effect of a claim lasts for threeyears and is highest when an individual files hissecond claim within a year In such a case hewould face about a 20 percent increase in hisinsurance premium in the subsequent year 10percent in the year after and 5 percent in thethird year after the claim The regular premiumis about twice the regular deductible amount soan upper bound for the dynamic costs is 70percent of the regular deductible In most casesthe actual dynamic costs are much lower thanthis upper bound (the dynamic costs of say thefirst claim within a year are close to zero) Inaddition an individual can opt out of the con-tract and switch to a different insurance pro-vider This is likely to reduce his dynamic costsbecause in Israel unlike in the United Statesand many other countries there is no publicrecord of past claims Therefore insurance pro-viders can take full advantage of past recordsonly for their past customers For this reasonnew customers will of course face higher pre-miums than existing ones but the premium in-crease would not be as high as it would havebeen with the old insurance provider This isdue to the presence of ldquoinnocentrdquo new custom-ers who are pooled together with the switchers(Cohen 2003)47 Using 70 percent as a conser-vative upper bound Figure 3 shows that about93 percent of those claims filed by individualswith a low deductible would have also been46 This assumption is consistent with Cohen and Einav

(2003) who find no evidence for behavioral response tochanges in seat belt laws The following anecdotal obser-vation also supports this In an informal survey we con-ducted among our colleagues all of them were aware of adeductible in their auto insurance policy but fewer than 20percent knew its level This does not imply that 80 percentof our colleagues did not pay attention to their deductiblechoice at the time the choice was made It does implyhowever that their driving or care behavior cannot dependon the deductible level

47 New customers may voluntarily report their claimshistory to their new insurance provider Voluntary disclo-sure of past claims is as may be expected not truthful Ourdata suggest an unconditional claim rate of 02453 in oursample population Our data on claims history as voluntar-ily disclosed by the same individuals suggest a claim rate of00604 which is four times lower

774 THE AMERICAN ECONOMIC REVIEW JUNE 2007

filed with a regular deductible While this is notnegligible it applies for only a small fraction ofindividuals For the vast majority of them the99 percent discussed above is a more appropri-ate benchmark Therefore ex post moral hazardis unlikely to play a major role in this settingand one can abstract from the loss distributionand focus on claim rates as we do in thispaper48

Additional Cost of an AccidentmdashOur modelassumes that in the event of an accident theonly incurred costs are those associated with thedeductible payment In practice however other

transaction costs may be incurred such as thetime spent for appraisal of the damage the costsassociated with renting a replacement car for theduration of a repair etc Such costs could bereadily incorporated into the model To illus-trate we assume that these costs are known inadvance and are given by a constant c (whichcould in principle vary with each individual)Since c will not vary with the chosen level ofdeductible it will not affect the value d andwill enter the empirical model only through itseffect on d In particular equation (7) willchange to

(19) r

p

d 1

d c

and everything else will remain the same

48 We would not be as comfortable with this statementfor the choice of high and very high deductibles which areat much higher levels This is one additional reason to focusonly on the choice between low and regular deductibles

FIGURE 3 CLAIM DISTRIBUTIONS

Notes This figure plots kernel densities of the claim amounts estimated separately depending on the deductible choice Forease of comparison we normalize the claim amounts by the level of the regular deductible (ie the normalization is invariantto the deductible choice) and truncate the distribution at ten (the truncated part which includes a fat tail outside of the figureaccounts for about 25 percent of the distribution and is roughly similar for both deductible choices) The thick line presentsthe distribution of the claim amounts for individuals who chose a low deductible while the thin line does the same for thosewho chose a regular deductible Clearly both distributions are truncated from below at the deductible level The figure showsthat the distributions are fairly similar Assuming that the claim amount distribution is the same the area below the thickerline between 06 and 1 is the fraction of claims that would fall between the two deductible levels and therefore (absentdynamic incentives) would be filed only if a low deductible were chosen This area (between the two dotted vertical lines)amounts to 13 percent implying that the potential bias arising from restricting attention to claim rate (and abstracting fromthe claim distribution) is quite limited As we discuss in the text dynamic incentives due to experience rating may increasethe costs of filing a claim shifting the region in which the deductible choice matters to the right an upper bound to these costsis about 70 percent of the regular deductible covering an area (between the two dashed vertical lines) that integrates to morethan 7 percent Note however that these dynamic incentives are a very conservative upper bound they apply to less than 15percent of the individuals and do not account for the exit option which significantly reduces these dynamic costs

775VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

This implies that in principle such costs willhave no effect on the results of the counterfac-tual exercise we present later The costs willhowever affect the interpretation of the esti-mates of risk aversion In particular instead ofthe distribution of r we will now be estimatingthe distribution of r(d c)d so the reportedestimates of the coefficient of absolute riskaversion will be biased upward The magnitudeof the bias depends on the size of this transac-tion cost c compared to the average deductibled If c is relatively small the bias is negligibleIf however c is as big as the (average) deduct-ible level all our reported estimates of the levelof risk aversion should be divided by two (butthe coefficients on observables which are semi-elasticities will not change) The intuitionwould be similar but more involved if c variesacross individuals but not proportionally to d

Data about such transaction costs are ofcourse not available The following back-of-the-envelope exercise may provide some guidanceas to the magnitude of such costs We collecteddata from the Israeli police about the annual num-bers of accidents49 accidents with fatalities andaccidents with severe injuries in Israel for theyears 1996ndash1999 We then divided these numbersby our estimate of the total number of auto insur-ance claims in Israel (161859)50 We obtain that159 percent of the claims involve reported acci-dents 22 percent involve accidents with severeinjuries and 03 percent involve fatal accidentsThus the majority of claims reflect small unre-ported accidents perhaps suggesting that theseadditional costs of a claim are often not too large

Deviations from Expected Utility TheorymdashThroughout the paper we restrict attention toexpected utility maximizers Despite much ev-idence in the literature against some of the pre-dictions of expected utility theory it still seemsto us the most natural benchmark to specify andone that facilitates comparison to previous stud-ies We note that expected utility theory is as-sumed it is not and cannot be tested within our

framework Given our cross-sectional analysiswhich in principle allows flexible forms ofunobserved heterogeneity in risk preferencesthere are no testable restrictions imposed byexpected utility theory We should also note thatmuch (but not all) of the documented evidenceagainst expected utility theory arises with ex-treme risk probabilities which are close to zeroor one Our data (and our estimates) are basedon risk probabilities that are all in the range of010 to 035 Over this range expected utilityseems to perform better Finally it is importantto stress two points First at the conceptuallevel it is straightforward to use an alternativetheory of decisions under uncertainty If con-ditional on objective risk individuals vary in asingle dimension the same conceptual modeland empirical strategy can be applied All oneneeds to do is to specify the parameter overwhich decisions vary and construct an indiffer-ence set in the space of the specified parameterand (objective) risk types similar to the onepresented in Figure 2 Second any alternativemodel of decisions under uncertainty would re-quire us to take an even stronger view regardingthe parameterized objective function For exam-ple prospect theory (Daniel Kahneman andAmos Tversky 1979) would require us to pa-rameterize not only the curvature of individu-alsrsquo utility functions but also their referencepoints for which there is no natural choice inour context Similar issues arise if we try toapply decision weights (Tversky and PeterWakker 1995) or measures of overconfidencewith respect to driving ability

F Implications for Profits and Pricing

We now look at how firm profits vary withalternative pricing schemes This exercise isinteresting for several reasons First althoughwe do not use supply-side information for esti-mation it shows how one may incorporate suchinformation in estimation Second it illustratesthe conceptual trade-off faced by a monopolistthat operates in a market with adverse selectionAlthough the conceptual trade-off betweenhigher demand and worse selection is wellknown and has been extensively analyzed in thetheoretical literature quantifying it is importantto understand its empirical relevance Finallywe argued earlier that unobserved heterogeneityin risk aversion seems more important than un-

49 An accident is counted in this measure if it was re-ported to the police and a police officer arrived at the scene

50 This is estimated by taking the number of activepolicies at the end of our sample (45871) dividing it by ourbest guess for the share of the market the company had atthe time (7 percent) and multiplying it by the estimatedclaim rate in our data as computed in Table 2 (0245)

776 THE AMERICAN ECONOMIC REVIEW JUNE 2007

observed heterogeneity in risk The current ex-ercise shows that this conclusion also translatesto pricing and profits

Throughout this section we hold fixed thedistribution of risk and risk aversion the firmfaces Since we have little information about thedeterminants of overall demand faced by thefirm and hence how restrictive it is to hold thedistribution fixed we make the simplifying as-sumption that individuals make their choicessequentially They first choose the insuranceprovider by observing only the price of theregular deductible Once they decide to buy apolicy from the insurer they choose the deduct-ible level This seems a reasonable approxima-tion as the regular deductible is the one alwaysadvertised and initially quoted while the otheroptions are revealed only once the potentialcustomer and the insurance sales person ldquogetinto detailsrdquo Consistent with this assumptionwe assume that the regular premium and de-ductible are dictated by competitive conditionsand we focus on the choice of the low deduct-ible and its associated premium

From the companyrsquos standpoint each indi-vidual can be represented by a random draw of(i ri) from the conditional (on observables)distribution of risk and risk aversion

(20) ln i

ln ri Nxi

xi

2 r

r r2

When analyzing the optimal menu to offer suchan individual the company is assumed to berisk neutral and to maximize expected profitsSuppose the company offered only the regulardeductible and premium (dh ph) Let the ex-pected profits from this strategy be 0 Givenour assumptions we proceed by analyzing howthe firmrsquos profits are affected by a choice of anadditional low deductible option (dl pl) withdl dh and pl ph It is easy to use a changein variables and analyze the choice of d dh dl and p pl ph Expected profits arenow given by

(21) maxdp

0 Prri rii d pp

d Eiri rii d p

The trade-off in the companyrsquos decision isstraightforward Each new customer who chooses

the low combination pays an additional p up-front but saves d for each accident she is in-volved in This translates into two effects thatenter the companyrsquos decision problem The first issimilar to a standard pricing problem higher(lower) price difference (deductible difference)p (d) leads to a higher markup (on those indi-viduals who select the low deductible) but tolower quantity (or probability of purchase) asfewer individuals elect to choose the low deduct-ible This effect enters the profit function throughD(d p) Pr(ri ri(i d p)) The secondcomposition effect arises because of adverse se-lection As the price of the low deductible in-creases those individuals who still elect to choosethe low combination are ceteris paribus thosewith higher risk This effect enters through (dp) E(iri ri(i d p)) Its magnitude andsign depend on the relative heterogeneity of i andri and on the correlation between them Sinceneither D(d p) nor (d p) has a closed-form solution we analyze this decision problemgraphically where D(d p) and (d p) arenumerically computed using simulations from thejoint distribution

We illustrate our analysis by using the meanindividual in the data whose expected ln(i)and ln(ri) are 154 and 1181 respectively(based on Table 4) Such an individual has tochoose between a regular contract of ( phdh) (3189 1595) (in NIS) and a lowcontract of ( pl dl) (3381 957) ie (pd) (191 638) Below we discuss severaladditional counterfactual cases First we con-sider a case with a negative correlation be-tween risk and risk aversion (with the samemagnitude ie 084) Second we con-sider cases when the company ignores unob-served heterogeneity in one of the dimensionsie it views individuals as a draw from theestimated marginal distribution on one dimen-sion with the other dimension known and fixedat its estimated mean We do this exercise foreach dimension separately

To get intuition for the different effects Figure2 presents the estimated distribution in the spaceof (i ri) A small increase (decrease) in p (d)shifts the indifference setup to the right therebymaking some marginal individuals who were pre-viously just to the right of it switch to choosingthe regular deductible The demand trade-off isjust the comparison between the marginal loss ofthe company from all the marginal individuals

777VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

who no longer buy higher coverage with thehigher profits made from the inframarginal indi-viduals who still elect to choose higher coverageFigure 2 also helps in illustrating the effect ofadverse selection and the importance of the cor-relation coefficient As the menu shifts to the rightthe positive correlation implies that the marginalindividuals have higher risk than the average Thismeans that ldquolosingrdquo them (namely havingthem buy less coverage) is not as costly forthe insurance company as such individualsare on average more adversely selected Anegative correlation for example would havemade these marginal individuals more valuablethereby decreasing the incentive to increase pricesor deductibles from the current levels

Figure 4 presents the effects of pricing onprofits by varying the low-deductible levelkeeping the premium charged for it fixed at theobserved price (of p 191) It implies that the

current low deductible benefit of 638 NIS re-sults in additional annual profits of about 368NIS per customer This is about 037 percent oftotal operating profits per customer which isabout 1000 NIS Note however that after sub-tracting the administrative and claim-handlingcosts associated with each customer and claimthe relative magnitude of this effect will behigher Note also that the estimates imply thatthe current low-deductible level is suboptimalBy setting a smaller low-deductible benefit ofd 355 NIS (ie increasing the current lowdeductible by 283 NIS) additional profits canbe increased to 659 NIS51 Of course the find-

51 There is no reason of course to limit the choice of thecompany to only one additional deductible level Moredegrees of freedom in choosing the pricing menu will leadto higher profits

FIGURE 4 COUNTERFACTUALSmdashPROFITS

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) We plot the additional profits fromoffering a low deductible as a function of the attractiveness of a low deductible d dh dl fixing its price at the observedlevel The thick solid line presents the counterfactual profits as implied by the estimated benchmark model The other threecurves illustrate how the profits change in response to changes in the assumptions when the correlation between riskand risk aversion is negative (thin solid line) when there is no heterogeneity in risk aversion (dot-dashed line) and whenthere is no heterogeneity in risk (dashed line) The maxima (argmax) of the four curves respectively are 659 (355)714 (583) 0 and 704 (500) The dotted vertical line represents the observed level of d (638) which implies that theadditional profits from the observed low deductible are 368 NIS per policy

778 THE AMERICAN ECONOMIC REVIEW JUNE 2007

ing that the current pricing is suboptimal mayalso be due to a limitation of the model

Consistent with the intuition discussed aboveFigure 4 also shows that the incentive to in-crease prices (or lower the deductible) is higherwith positive correlation When the correlationbetween risk and risk aversion is negative theoptimal low deductible is lower It also showsthat ignoring either dimension of unobservedheterogeneity has an important effect on pric-ing decisions However while ignoring het-erogeneity in risk does not change thequalitative pattern by much ignoring hetero-geneity in risk aversion completely changesthe pricing analysis In fact given the esti-mated coefficients and the observed prices ifindividuals vary only in risk offering a lowdeductible does not increase profits due to ad-verse selection

Figure 5 breaks down the effect on profits bypresenting the pricing effect on the demand forlow deductible D(d p) and on the compo-sition effect (d p) The former is simplygenerated by the distribution of certainty equiv-alents implied by the joint distribution of i andri (Landsberger and Meilijson 1999) It is S-shaped due to the assumption of lognormal dis-tribution The shape of the composition effect isdriven by the relative variance of i and ri andby the correlation coefficient As the estimatesimply that most of the variation in certaintyequivalents is driven by variation in ri thestrong positive correlation implies that the com-position effect is monotonically decreasing inthe deductible level As the low-deductible op-tion becomes more favorable more peoplechoose it with the most risky individuals choos-ing it first The effect of the deductible level onthe composition effect is dramatically differentwhen the correlation between risk and risk aver-sion is negative When this is the case theobserved relationship between the deductiblelevel and the composition effect is mostly re-versed This is because the effect of risk aver-sion dominates that of adverse selection due toits higher variance

IV Concluding Remarks

The paper makes two separate contributionsFirst from a methodological standpoint we layout a conceptual framework through which one

can formulate and estimate a demand system forindividually customized contracts The key datarequirements for this approach are contractchoices individual choice sets and ex post riskrealizations Since such data may be available inmany other contexts the methodological frame-work may be useful to uncover structural pa-rameters in such settings As an example onecould consider annuity data and use guaranteeperiod choices and mortality data to identifyheterogeneity in risk (mortality) and in prefer-ences for wealth after death (Einav Finkelsteinand Paul Schrimpf 2006) Similarly one couldconsider loan data and use down-paymentchoices and default data to identify heterogene-ity in risk (default) and in liquidity (Will Ad-ams Einav and Jonathan D Levin 2006)Second from an economic standpoint we pro-vide a new set of estimates for the degree andheterogeneity of (absolute) risk aversion and itsrelationship with risk We discuss these below

While our estimates of risk aversion help topredict other related insurance decisions it isnatural to ask to what extent these parametersare relevant in other contexts This is essentiallyan empirical question which can be answeredonly by estimating risk aversion parameters fora variety of bet sizes and in a variety of con-texts Since isolating risk preferences in manycontexts is hard such exercises are rare leavingus with no definite answer for the scope ofmarkets for which our estimates may be rele-vant On one hand Rabin (2000) and Rabin andThaler (2001) argue that different decisions inlife are taken in different contexts and thereforemay be subject to different parameters in theutility function On the other hand classicaltheory suggests that each individual has onevalue function over her lifetime wealth so allrisky decisions take into account the same valuefunction and are therefore subject to the samerisk preferences Our view is somewhere inbetween We are more comfortable with extrap-olation of our risk aversion estimates to setupsthat are closer to the auto insurance marketcontext in which these parameters are esti-mated To assess ldquoclosenessrdquo it is important toconsider various factors over which contextsmay differ Such factors may include bet size aswell as informational and behavioral effectssuch as default options framing and rarity of theevents Since we use claim data to identify risktype we leave everything else to be interpreted as

779VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

FIGURE 5 COUNTERFACTUALSmdashSELECTION

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) Parallel to Figure 4 we breakdown the effects on profits to the share of consumers who choose a low deductible (bottom panel) and to the expected riskof this group (top panel) This is presented by the thick solid line for the estimates of the benchmark model As in Figure 4 wealso present these effects for three additional cases when the correlation between risk and risk aversion is negative (thin solidline) when there is no heterogeneity in risk aversion (dot-dashed line) and when there is no heterogeneity in risk (dashedline) The dotted vertical lines represent the observed level of d (638) for which the share of low deductible is 16 percentand their expected risk is 0264 This may be compared with the corresponding figures in Table 2A of 178 and 0309respectively Note however that the figure presents estimated quantities for the average individual in the data while Table2A presents the average quantities in the data so one should not expect the numbers to fit perfectly

780 THE AMERICAN ECONOMIC REVIEW JUNE 2007

risk aversion As an example overconfidence willbe captured by a lower level of estimated riskaversion so if overconfidence is more importantin auto insurance than in health insurance whenextrapolated to health insurance individuals maybehave as if they are more risk averse than weestimate them to be

With this caveat in mind let us discuss ourfour main findings First we find large hetero-geneity in risk preferences across individualsThis heterogeneity is important in various con-texts (a) it cautions against using pure cross-sectional variation to test expected utilitytheory (b) it may create strong selection ofparticipants into particular markets this maymake participants in voluntary markets quitedifferent in their risk preferences from those inmarkets in which participation is either manda-tory or driven by other factors and (c) it maymake the marginal individual quite differentfrom the average one Models in macroeconom-ics and finance which often use a representativeindividual framework may not be able to cap-ture and account for such differences

Our second set of findings concerns the wayrisk aversion relates to observable characteris-tics Our finding that women are more riskaverse than men has been documented in othersettings The finding that risk preferences ex-hibit a U-shaped pattern over the life cycle maybe interesting to explore further in other con-texts Other findings suggest that the estimatedcoefficient of risk aversion increases with ob-servables that are related to income and wealthAs we agree with the widely held belief of thedecreasing absolute risk aversion property ourpreferred interpretation for this finding is thatwealth and income may be endogenous gener-ating the estimated cross-sectional relationshipWhile lower risk aversion may be associatedwith higher propensity to become an entrepre-neur and thereby have higher wealth it mayalso be associated with lower propensity to saveor invest in education affecting wealth the otherway Therefore one important message of thisfinding is that accounting for heterogeneity inpreferences may be important as representativeconsumer models may provide misleading in-terpretations for otherwise natural results

The last two sets of findings concern therelationship between risk and preferences Sincerisk is particular to the context in which it ismeasured these findings may be sensitive to the

market context and may change once risk takesother forms Even within the auto insurancemarket it is important how risk is measuredMoreover risk in the auto insurance marketmay be conceptually different from risk in othermarkets In many markets risk is independentacross individuals In auto insurance howevermuch of the risk depends on coordinationamong drivers and therefore may be more re-lated to relative not absolute characteristicsFor this reason our finding of positive correla-tion between risk and risk aversion can coexistwith findings of negative correlation in othercontexts (Israel 2005 Finkelstein and McGarry2006) The positive correlation we find is alsoconsistent with the fact that the bivariate probittest in our data provides evidence for adverseselection (Cohen 2005) while similar reduced-form tests in other contexts do not

Finally we find that unobserved heterogene-ity in risk preferences is more important thanheterogeneity in risk This may be driven by thecasual evidence that insurance companies exertmuch effort and resources in collecting con-sumer data which are informative about riskclassification but not about preferences52 Weillustrate the empirical importance of our find-ings for the analysis of optimal contracts in autoinsurance The presence of more than one di-mension of unobserved heterogeneity may dra-matically change the nature of these contractsTheory is still not fully developed for suchmultidimensional screening problems as it typ-ically requires a small number of types (Lands-berger and Meilijson 1999) restricts the twodimensions to be independent of each other(Jean-Charles Rochet and Lars A Stole 2002)or assumes that the number of instrumentsavailable to the monopolist is not smaller thanthe dimension of unobserved heterogeneity(Steven Matthews and John Moore 1987 Rich-ard Arnott and Joseph E Stiglitz 1988)53 MarkArmstrong (1999) may be the closest theoreticalwork to the framework suggested here It cannot

52 This choice of data collection efforts may be justifiedif it is easier for such firms to price discriminate based onrisk but harder to price discriminate based on preferencesA common belief is that without cost-based (ie risk-based) justification for prices price discrimination may leadto consumer backlash

53 See also Smart (2000) Villeneuve (2003) and Jullienet al (2007) for related theoretical results

781VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

be directly applied however as it uses simpli-fying linearity assumptions which would behard to impose in the current context Our re-sults indicate that many applications can benefitfrom extending the theory to include the more

general case such as the one analyzed hereSuch a theory may also serve as a guide forusing supply-side restrictions in similar con-texts Our counterfactual analysis is a very pre-liminary start in this direction

APPENDIX A DESCRIPTION OF THE GIBBS SAMPLER

In this appendix we describe the setup of the Gibbs sampler that we use to estimate the modelOne of the main advantages of the Gibbs sampler is its ability to allow for data augmentation oflatent variables (Tanner and Wong 1987) In our context this amounts to augmenting the individual-specific risk aversion and risk type namely i rii1

n as additional parametersWe can write the model as follows

(22) ln i xi i

(23) ln ri xi vi

(24) choicei 1 if ri ri ()0 if ri ri ()

(25) claimsi Poissoni ti

(26) i

vi

iid

N00

2 r

r r2

Let

2 r

r r2 X

x1

xn

0

0 x1

xn

y r and ui i

vi

The set of parameters for which we want to have a posterior distribution is given by uii1

n The prior specifies that are independent of uii1n have a conventional

diffuse prior We adopt a hierarchical prior for uii1N

(27) uii 1n

iid

N0

(28) 1 Wishart2a Q

782 THE AMERICAN ECONOMIC REVIEW JUNE 2007

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 15: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

The structural assumptions allow us to explainhow much of this correlation can be attributedto adverse selection The remaining correlation(positive or negative) is therefore attributed tocorrelation in the underlying distribution of riskand risk aversion

III Results

A Reduced-Form Estimation

To get an initial sense for the levels of abso-lute risk aversion implied by the data we use asimple back-of-the-envelope exercise We com-pute unconditional averages of p d dh dland d (Table 2A)24 and substitute these valuesin equation (7) The implied coefficient of ab-solute risk aversion is 29 104 NIS125 Thisfigure could be thought of as the average indif-ference point implying that about 18 percent ofthe policyholders have coefficients of absoluterisk aversion exceeding it To convert to USdollar amounts one needs to multiply thesefigures by the average exchange rate (352)resulting in an average indifference point of102 103 $US1 This figure is less than halfof a similar estimate reported by Sydnor (2006)for buyers of homeownerrsquos insurance but about3 and 13 times higher than comparable fig-ures reported by Gertner (1993) and Metrick(1995) respectively for television game showparticipants

Table 3 provides reduced-form analysis ofthe relationship between the observables andour two left-hand-side variables the number ofclaims and the deductible choice26 Column 1reports the estimates from a Poisson regressionof the number of claims on observed character-istics This regression is closely related to therisk equation we estimate in the benchmarkmodel It shows that older people women andpeople with a college education are less likely to

have an accident Bigger more expensiveolder and noncommercial cars are more likelyto be involved in an accident Driving experi-ence and variables associated with less intenseuse of the car reduce accident rates As could beimagined claim propensity is highly correlatedover time past claims are a strong predictor offuture claims Young drivers are 50 to 70 per-cent more likely to be involved in an accidentwith young men significantly more likely thanyoung women Finally as indicated by the trendin the estimated year dummies the accident ratesignificantly declined over time Part of thisdecline is likely due to the decline in accidentrates in Israel in general27 This decline mightalso be partly due to the better selection ofindividuals the company obtained over time asit gained more experience (Cohen 2003)

Columns 2 and 3 of Table 3 present estimatesfrom probit regressions in which the dependentvariable is equal to one if the policyholder chosea low deductible and is equal to zero otherwiseColumn 3 shows the marginal effects of thecovariates on the propensity to choose a lowdeductible These marginal effects do not have astructural interpretation as the choice of lowdeductible depends on its price on risk aver-sion and on risk In this regression we againobserve a strong trend over time Fewer policy-holders chose the low deductible as time wentby One reason for this trend according to thecompany executives is that over time the com-panyrsquos sales persons were directed to focusmainly on the ldquodefaultrdquo regular deductible op-tion28 The effect of other covariates will bediscussed later in the context of the full modelIn unreported probit regressions we also test thequalitative assumptions of the structural model byadding three additional regressors the price ratio(pidi) the average deductible offered d i

24 The unconditional is computed by maximum likeli-hood using the data on claims and observed durations of thepolicies

25 Using the CARA specification as in equation (16) weobtain a slightly lower value of 25 104 NIS1

26 We find positive correlation in the errors of these tworegressions suggesting the existence of adverse selection inthe data and motivating a model with unobserved heteroge-neity in risk This test is similar to the bivariate probit testproposed by Chiappori and Salanie (2000) and replicatesearlier results reported in Cohen (2005)

27 In particular traffic fatalities and counts of trafficaccidents in Israel fell by 11 percent and 18 percent during1998 and 1999 respectively

28 Such biased marketing efforts will bias consumersagainst choosing the low deductible thus making them lookless risk averse This would make our estimate a lowerbound on the true level of risk aversion If only sophisti-cated consumers could see beyond the marketing effort andthis sophistication were related to observables (eg educa-tion) the coefficients on such observables would be biasedupward This is not a major concern given that the coefficientson most of the covariates are fairly stable when we estimatethe benchmark model separately for different years

759VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

TABLE 3mdashNO HETEROGENEITY IN RISK

Variable

Poisson regressiona Probit regressionsc

Dep var Number of claims Dep var 1 if low deductible

(1) (2) (3)

Coeff Std Err IRRb Coeff Std Err dPdX dPdXDemographics Constant 43535 02744 mdash 224994 23562 mdash

Age 00074 00052 09926 01690 00464 00034 00042Age2 989 105 536 105 10001 00017 00005 339 105 455 105Female 00456 00161 09555 08003 01316 00165 00134Family Single omitted omitted omitted

Married 01115 00217 08945 06417 01950 00128 00045Divorced 00717 00346 10743 03191 03127 00064 00044Widower 00155 00527 10156 03441 04573 00069 00025Other (NA) 01347 01755 11442 29518 19836 00511 00370

Education Elementary 00522 00550 09491 02927 04278 00061 00016High school omitted omitted omittedTechnical 00373 00308 10380 05037 02520 00105 00138College 00745 00197 09282 05279 01546 00109 00053Other (NA) 00116 00184 10116 00630 01540 00013 00025

Emigrant 00210 00163 10213 00497 01328 00010 00003

Car attributes Log(Value) 01227 00281 11306 13285 02342 00271 00299Car age 00187 00042 10189 01603 00357 00033 00017Commercial car 01394 00326 08699 09038 02781 00177 00294Log(engine size) 02972 00459 13461 06924 03952 00141 00075

Driving License years 00204 00034 09798 01043 00312 00021 00005License years2 00002 677 105 10002 00015 00005 299 105 167 105

Good driver 00176 00191 09825 07207 01574 00148 00152Any driver 00564 00169 10580 10923 01321 00217 00258Secondary car 00859 00209 09177 00038 01626 00001 00070Business use 01852 00293 12034 07951 02737 00156 00017History length 00527 00110 09486 11218 01450 00228 00171Claims history 06577 00390 19304 04654 05460 00095 00496

Young driver Young driver 05235 00361 16879 28847 06305 00524 00012Gender Male omitted omitted omitted

Female 01475 00288 08629 17959 03135 00396 00195Age 17ndash19 omitted omitted omitted

19ndash21 00701 00532 10726 07800 07509 00153 0014721ndash24 00267 00574 09737 05746 07773 00114 0015624 02082 00567 0812 17869 07328 00397 00136

Experience 1 omitted omitted omitted1ndash3 02416 00458 07854 10175 06577 00217 000093 02827 00532 07538 32513 07386 00762 00410

Company year First year omitted omitted omittedSecond year 00888 00198 09150 44513 01506 00792 00859Third year 00690 00222 09334 85888 01820 01303 01367Fourth year 01817 00232 08339 118277 02102 01616 01734Fifth year 05431 0028 05810 143206 02778 01920 02085

105586Obs 105800 94000 105800Pseudo R2 00162 01364 01296Log likelihood 577459 369595 430869

Significant at the 5 percent confidence levela Maximum likelihood estimates Variation in exposure (policy duration) is accounted forb IRR Incidence rate ratio Each figure should be interpreted as the increasedecrease in claim probability as a result of an increase of one unit in the

right-hand-side variablec There are two separate probit regressions reported in this table Column 2 relies on the deductible choice model and the lognormality assumption As discussed

in the text by including an additional regressor ln([pi((xi)di) 1]d i) (with (xi) predicted from column 1 above) and normalizing its coefficient to 1 we obtaina structural interpretation of this regression with ln (ri) as the dependent variable Thus the reported coefficients are comparable to those estimated for the benchmarkmodel One should be cautious however in interpreting these coefficients Unlike the benchmark model this regression does not allow unobserved heterogeneity inrisk and suffers from some selection bias because observations with high predicted risk rate are omitted (which is why the number of observations is 94000 ratherthan the full sample of 105800) For comparison column 3 reports the marginal effects from a comparable probit regression that uses the full sample and does notcontrol for pricing and predicted risk through the additional structural regressor Column 3 does not have a structural interpretation and its (unreported) coefficientscannot be compared to those estimated from the benchmark model

760 THE AMERICAN ECONOMIC REVIEW JUNE 2007

and the risk rate (xi) as predicted from thePoisson regression of column 1 All three addi-tional regressors enter with the predicted signand with large and highly significant marginaleffects29

Finally column 2 of Table 3 presents animportant specification of the probit regres-sion in which ln([pi(( xi)di) 1]d i) isadded as a regressor and its coefficient isnormalized to 130 As already mentioned ifwe assume that there is no unobserved heter-ogeneity in risk then column 2 is analogousto the ln(ri) equation of the benchmark modelThis restriction of the model is rejected by thedata About 10 percent of the individuals arepredicted to have (xi) (pi di) implying achoice of low deductible for any level of riskaversion Many of these individuals howeverstill choose higher deductible levels Column2 reports the results for the remaining indi-viduals (94000 out of 105800) for which theregression can be run While the signs of theestimated coefficients are similar to those inthe benchmark model presented below therestricted version of the model suggests muchhigher levels and dispersion of risk aversionwell above any reasonable level31 The fullestimation of the benchmark model clearly re-jects this restriction on the model

B Estimation of the Benchmark Model

The Effect of Individual Characteristics onRisk AversionmdashTable 4 presents the estimationresults from the benchmark model The secondcolumn shows how the level of absolute riskaversion is related to individual characteristicsAs the dependent variable is in natural loga-

rithm coefficients on dummy variables can bedirectly interpreted as approximate percentagechanges The vast majority of these coefficientsare quite stable across a wide range of specifi-cations which are mentioned later

The results indicate that women are more riskaverse than men and have a coefficient of ab-solute risk aversion about 20 percent greaterthan that of men These results are consistentwith those of Donkers et al (2001) and Hartoget al (2002) The estimated effect of age sug-gests a nonmonotone pattern of risk preferencesover the life cycle The estimated coefficientsimply that initially (that is at age 18 the youngestindividual in the data) individuals become less riskaverse with age but around the age of 48 indi-viduals start becoming more risk averse32 Mar-ried individuals are estimated to be significantlymore risk averse compared to singles while di-vorced individuals are less (although the coeffi-cient is statistically insignificant)

The analysis suggests that variables that arelikely to be correlated with income or wealthsuch as post-high-school education and thevalue of the car have a positive coefficientindicating that wealthier people have higherlevels of absolute risk aversion Although we donot have data on individualsrsquo income or wealthwe use other proxies for income in additionalspecifications and obtain mixed results Whenwe include as a covariate the average householdincome among those who live in the same zipcode we obtain a significant and negative co-efficient of 0333 (0154) (standard deviationin parentheses) When we match zip code in-come on demographics of the individuals thecoefficient is effectively zero 0036 (0047)and when we use a socioeconomic index ofthe locality in which the individual lives thecoefficient is positive 0127 (0053)33 Thus

29 The estimated marginal effect (z-statistic in parenthe-ses) is 0352 (1376) 16 104 (1481) and 0154(255) for (pidi) d i and (xi) respectively

30 The level is a normalization The sign is estimatedHad the sign on this regressor been positive this would haveimplied a rejection of the model

31 For the implied median level of risk aversion the re-stricted model produces a similar estimate to the estimate wereport below for the benchmark model However since someindividuals who chose the low deductible are estimated to havevery low claim rates the restricted model is ldquoforcedrdquo to estimatevery high risk aversion for these individuals (in contrast thebenchmark model can explain such choices by positive risk re-siduals) resulting in very high dispersion and (due to the lognor-mality assumption) very high average risk aversion which isabout 1025 higher than the benchmark estimates we report below

32 This nonmonotone pattern may explain why age enterswith different signs in the estimation results of Donkers et al(2001) and Hartog et al (2002) A somewhat similar U-shapepattern with respect to age is also reported by Sumit Agarwalet al (2006) in the context of consumer credit markets

33 The full results from these specifications are providedin the online Appendix (available at httpwwwe-aerorgdatajune0720050644_apppdf ) The reason we do not usethese income variables in the benchmark specification istheir imperfect coverage which would require us to omitalmost 20 percent of the individuals Other results fromthese regressions are similar to those we report for thebenchmark specification

761VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

overall we interpret our findings as suggestiveof a nonnegative and perhaps positive associ-ation between incomewealth and absolute riskaversion

At first glance these results may appear tobe inconsistent with the widely held beliefthat absolute risk aversion declines withwealth One should distinguish however be-tween two questions (a) whether for a givenindividual the vNM utility function exhibitsdecreasing absolute risk aversion and (b)how risk preferences vary across individuals

Our results do not speak to the first questionand should not be thought of as a test of thedecreasing absolute risk aversion propertyTesting this property would require observingthe same individual making multiple choicesat different wealth levels Rather our resultsindicate that individuals with greater wealthhave utility functions that involve a greaterdegree of risk aversion It might be thatwealth is endogenous and that risk aversion(or unobserved individual characteristics thatare correlated with it) leads individuals to

TABLE 4mdashTHE BENCHMARK MODEL

Variable Ln() equation Ln(r) equation Additional quantities

Demographics Constant 15406 (00073) 118118 (01032) Var-covar matrix ()

Age 00001 (00026) 00623 (00213) 01498 (00097)Age2 624 106 (263 105) 644 104 (211 104) r 31515 (00773)Female 00006 (00086) 02049 (00643) 08391 (00265)Family Single Omitted Omitted

Married 00198 (00115) 01927 (00974) Unconditional statisticsa

Divorced 00396 (00155) 01754 (01495) Mean 02196 (00013)Widower 00135 (00281) 01320 (02288) Median 02174 (00017)Other (NA) 00557 (00968) 04599 (07397) Std Dev 00483 (00019)

Education Elementary 00194 (00333) 01283 (02156) Mean r 00019 (00002)High school Omitted Omitted Median r 727 106 (756 107)Technical 00017 (00189) 02306 (01341) Std Dev r 00197 (00015)College 00277 (00124) 02177 (00840) Corr(r ) 02067 (00085)Other (NA) 00029 (00107) 00128 (00819)

Emigrant 00030 (00090) 00001 (00651) Obs 105800

Car attributes Log(value) 00794 (00177) 07244 (01272)Car age 00053 (00023) 00411 (00176)Commercial car 00719 (00187) 00313 (01239)Log(engine size) 01299 (00235) 03195 (01847)

Driving License years 00015 (00017) 00157 (00137)License years2 183 105 (351 105) 148 104 (254 104)Good driver 00635 (00112) 00317 (00822)Any driver 00360 (00105) 03000 (00722)Secondary car 00415 (00141) 01209 (00875)Business use 00614 (00134) 03790 (01124)History length 00012 (00052) 03092 (00518)Claims history 01295 (00154) 00459 (01670)

Young driver Young driver 00525 (00253) 02499 (02290)Gender Male Omitted mdash

Female 00355 (00061) mdashAge 17ndash19 Omitted mdash

19ndash21 00387 (00121) mdash21ndash24 00445 (00124) mdash24 00114 (00119) mdash

Experience 1 Omitted mdash1ndash3 00059 (00104) mdash3 00762 (00121) mdash

Company year First year Omitted OmittedSecond year 00771 (00122) 14334 (00853)Third year 00857 (00137) 28459 (01191)Fourth year 01515 (00160) 38089 (01343)Fifth year 04062 (00249) 39525 (01368)

Note Standard deviations based on the draws from the posterior distribution in parentheses Significant at the 5 percent confidence levela Unconditional statistics represent implied quantities for the sample population as a whole ie integrating over the distribution of covariates in the sample (as

well as over the unobserved components)

762 THE AMERICAN ECONOMIC REVIEW JUNE 2007

save more to obtain more education or totake other actions that lead to greaterwealth34

Let us make several additional observationsFirst while owners of more expensive cars ap-pear to have both higher risk exposure andhigher levels of risk aversion owners of biggercars have higher risk exposure but lower levelsof risk aversion This should indicate that thestructure of the model does not constrain therelationship between the coefficients in the twoequations Rather it is the data that ldquospeak uprdquoSecond individuals who are classified by theinsurer as ldquogood driversrdquo indeed have lowerrisk but also appear to have lower risk aversionThis result is somewhat similar to the positivecorrelation between unobserved risk and unob-served risk aversion which we report belowThird policyholders who tend to use the car forbusiness are less risk averse This could bebecause uninsured costs of accidents occurringto such policyholders are tax deductible Fourthpolicyholders who reported three full years ofclaims history are more risk averse but are notdifferent in their risk exposure The attitude thatleads such policyholders to comply with therequest to (voluntarily) report three full years ofclaims history is apparently and not surpris-ingly correlated with higher levels of risk aver-sion In contrast while past claims indicate highrisk they have no significant relationship withrisk aversion Finally we find a strong trendtoward lower levels of risk aversion over timeThis is a replication of the probit results dis-cussed earlier

The Effect of Individual Characteristics onClaim RiskmdashThe first column of Table 4 de-scribes the relationship between observablesand risk exposure Accident risk is higher fordivorced individuals and lower for people witha college education Bigger older more expen-sive and noncommercial cars are all morelikely to be involved in an accident Drivingexperience reduces accident rates as do mea-sures of less intense use of the car while youngdrivers are more exposed to risk Claim propen-sity is highly correlated over time the voluntaryreport of past claims is a strong predictor offuture claims This risk equation produces re-sults that are similar to those of the simplerPoisson regression reported in Table 3 Al-though some of the coefficients lose signifi-cance the magnitude of most coefficients isquite similar The similarity between these twosets of results is to be expected as the riskregression is identified primarily from the dataon claims so incorporating the information ondeductible choice does not qualitatively changethe conceptual identification strategy (see Sec-tion IIC) If the results were not similar thiscould have indicated a misspecification of themodel The slight differences between the riskregressions in Table 3 and Table 4 are drivenmainly by the structural assumptions First thebenchmark model estimates a lognormal mix-ture of Poisson rates rather than a single Pois-son model By incorporating the fatter tails ofthe claim distribution it slightly changes theresults increases the standard errors and de-creases the average predicted claim rate Sec-ond the information on deductible choiceslightly helps us in obtaining more precise es-timates through the correlation structure be-tween the error terms in the two equations

The Implied Level of Risk AversionmdashOne ofthe main goals of the estimation is to obtainmeasures of the level of risk aversion in thepopulation we study Since we use Gibbs sam-pling and augment the latent coefficients ofabsolute risk aversion we can directly obtainthe posterior distribution of various moments ofthis distribution At each iteration of the Gibbssampler we compute the mean standard devi-ation (across individuals) and various percen-tiles of the simulated draws of i and ri and thecorrelation between them The far-right columnof Table 4 reports the averages and standard

34 One may be tempted to interpret the positive wealtheffects as an indirect indication of credit constraints wealth-ier individuals are less credit constrained and therefore canafford to purchase more insurance We do not share thisinterpretation for two reasons First the insurance companyobserves these proxies for wealth and conditions on themwhen setting prices Since the willingness to pay for insur-ance is likely to be correlated with the willingness to pay forthe additional insurance provided by the low deductibleoption premiums already reflect this variation We condi-tion on the level of the premium Second paying less exante implies paying more ex post so applying the creditconstraint argument only to the ex ante payment but not tothe probabilistic ex post deductible payments has no theo-retical foundation Essentially the setup of the model linksthe ex ante decisions to the ex post losses which are bothdriven by the curvature of the vNM utility function

763VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

deviations of these computed quantities over theiterations of the Gibbs sampler35 The impliedrisk aversion of the mean individual is 00019which is about seven times greater than theback-of-the-envelope calculation presented inthe previous section As we assume a lognormal

distribution and estimate a relatively high dis-persion coefficient r the estimates also implysignificantly lower values for the median levelof risk aversion As shown later the qualitativepattern of these results is quite robust acrossspecifications

Table 5 summarizes our findings regardingthe level and dispersion of risk aversion Itpresents two ways to interpret the estimates weobtained from the benchmark model as well ascomparisons to specifications with constant ab-solute risk aversion (CARA) utility and with

35 Note that these estimated quantities are unconditionalIn computing these quantities we integrate over the distri-bution of observable characteristics in the data so onecannot compute these estimates from the estimated param-eter directly

TABLE 5mdashRISK-AVERSION ESTIMATES

Specificationa Absolute risk aversionb Interpretationc Relative risk aversiond

Back-of-the-envelope 10 103 9070 1484Benchmark model

Mean individual 67 103 5605 972225th percentile 23 106 9998 003Median individual 26 105 9974 03775th percentile 29 104 9714 42790th percentile 27 103 7834 390295th percentile 99 103 4937 14327

CARA utilityMean individual 31 103 7651 4436Median individual 34 105 9966 050

Learning modelMean individual 42 103 6886 6140Median individual 56 106 9995 008

Comparable estimatesGertner (1993) 31 104 9699 479Metrick (1995) 66 105 9934 102Holt and Laury (2002)e 32 102 2096 86575Sydnor (2006) 20 103 8329 5395

a The table summarizes the results with respect to the level of risk aversion ldquoBack-of-the-enveloperdquo refers to thecalculation we report in the beginning of Section III ldquobenchmark modelrdquo refers to the results from the benchmark model(Table 4) ldquoCARA utilityrdquo refers to a specification of a CARA utility function and ldquolearning modelrdquo refers to a specificationin which individuals do not know their risk types perfectly (see Section IIID) The last four rows are the closest comparableresults available in the literature

b The second column presents the point estimates for the coefficient of absolute risk aversion converted to $US1 unitsFor the comparable estimates this is given by their estimate of a representative CARA utility maximizer For all otherspecifications this is given by computing the unconditional mean and median in the population using the figures we reportin the mean and median columns of risk aversion in Table 6 (multiplied by 352 the average exchange rate during the periodto convert to US dollars)

c To interpret the absolute risk aversion estimates (ARA) we translate them into x u(w) 1frasl2 u(w 100) 1frasl2 u(w x) That is we report x such that an individual with the estimated ARA is indifferent about participating in a 50ndash50lottery of gaining 100 US dollars and losing x US dollars Note that since our estimate is of absolute risk aversion the quantityx is independent of w To be consistent with the specification we use a quadratic utility function for the back-of-the-envelopebenchmark and learning models and use a CARA utility function for the others

d The last column attempts to translate the ARA estimates into relative risk aversion We follow the literature and do soby multiplying the ARA estimate by average annual income We use the average annual income (after tax) in Israel in 1995(51168 NIS from the Israeli Census) for all our specifications and we use average disposable income in the United Statesin 1987 ($15437) for Gertner (1993) and Metrick (1995) For Holt and Laury (2002) and Sydnor (2006) we use a similarfigure for 2002 ($26974)

e Holt and Laury (2002) do not report a comparable estimate The estimate we provide above is based on estimating aCARA utility model for the 18 subjects in their experiment who participated in the ldquo90rdquo treatment which involved stakescomparable to our setting For these subjects we assume a CARA utility function and a lognormal distribution of theircoefficient of absolute risk aversion The table reports the point estimate of the mean from this distribution

764 THE AMERICAN ECONOMIC REVIEW JUNE 2007

incomplete-information (discussed later) and toother comparable figures in the literature Ourbenchmark estimate suggests that an averagequadratic utility maximizer36 will be indifferentabout participating in a 50ndash50 lottery in whichhe gains $100 or loses $5605 A CARA spec-ification suggests a significantly lower value foraverage risk aversion making the mean individ-ual indifferent about a 50ndash50 lottery of gaining$100 or losing $7651 The results from theincomplete information model for the meanlevel of risk aversion are in between these esti-mates All the results suggest that although themean individual exhibits a significant level ofrisk aversion heterogeneity in risk preferencesis important and the median individual is al-most risk neutral with respect to lotteries of$100 magnitude Although the mean is alwaysgreater than the median under the lognormaldistribution the large difference we find is notimposed In principle we could have obtained ahigh level of risk aversion with less heteroge-neity thereby leading to a smaller differencebetween the mean and the median (the esti-mated distribution of risk types is an example)

Let us briefly discuss the relevance of thecomparison to Gertner (1993) and Metrick(1995) There are two ways one can reconcilethe differences between the estimates Firstboth these papers measure risk aversion fortelevision game show participants these arehighly selected groups in a rather ldquorisk-friendlyrdquo environment37 Second the magni-tudes of the stakes are higher The showparticipants bet on several thousand dollars andmore while our average individual risks muchlower stakes in the range of $100 Thus thedifference in the results may be due to the issuesraised in Rabin (2000) regarding the compara-bility of behavior across different contexts andbet sizes We discuss this further in Section IV

A different way to quantify our estimate is byreporting them in relative terms There is noconsensus in the literature as to the relevantwealth that is taken into account in such deci-

sions Therefore for comparability we followthe closest papers in this respect (eg Gertner1993) and use annual income as the relevantwealth We multiply the estimated coefficient ofabsolute risk aversion by the average annualincome in Israel during the observation periodUnder the (questionable) assumption that an-nual income is a good proxy for the relevantwealth at the time of decision making thisproduct would be a proxy for the coefficient ofrelative risk aversion As Table 5 indicates ourbenchmark specification results in an impliedcoefficient of relative risk aversion of about 97A CARA specification results in a lower coef-ficient of 44 On the other hand the medianestimate for relative risk aversion is well belowone Thus the widely used estimate of a lowsingle-digit coefficient of relative risk aversionfalls somewhere between the median and themean and between the median and the seventy-fifth percentile of the risk aversion distribution

The Relationship between Unobserved Riskand Unobserved Risk AversionmdashTable 4 al-lows us to make observations about the relation-ship between risk and risk aversion We firstdiscuss the relative importance of unobservedheterogeneity of both dimensions In the popu-lation we study unobserved heterogeneity inrisk aversion (r) is much greater than unob-served heterogeneity in risk () This is trueboth in absolute terms (315 compared to 015respectively) and after normalizing by the cor-responding mean level38 using the coefficientof variation as a measure of dispersion (027compared to 01 respectively) It is also true forthe overall unconditional dispersion This couldindicate that selection on risk aversion is moreimportant in our data than adverse selection39 Theright metric to use for such statements is not

36 For such an individual the second-order Taylor ex-pansion we use in Section IIA is exact

37 We suspect that individuals who participate in televi-sion game shows are more adventuresome than the generalpopulation Moreover knowing that the audience mightwish to see them keep betting is likely to further encourageparticipants to take risks

38 All covariates are measured in deviations from theirsample mean so the estimated constant in each equation isthe estimated mean of the left-hand-side variable

39 An additional observation is that given our estimatesobservables account for slightly less than 50 percent of thevariation in ln(ri) but for almost 65 percent of the variation inln(i ) This may seem surprising given the finding that disper-sion in risk aversion is more important and thus should be thefocus of insurance companies However this finding is con-sistent with the conventional wisdom that insurance companiesspend much effort and resources on collecting information thathelps in risk classification but only little effort on informationthat predicts willingness to pay

765VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

entirely clear however as one should projectthese estimated variances onto the same scaleof say willingness to pay or profits Profits forexample are affected directly by risk but not byrisk aversion so the comparison above could bemisleading Therefore we relegate the discus-sion of this issue to Section IIIF where weshow that even when we look at pricing andprofits heterogeneity in risk aversion is moreimportant

Table 4 also indicates a strong and signif-icant positive correlation of 084 between un-observed risk aversion and unobserved riskThis result may be surprising because it isnatural to think that risk aversion with respectto financial decisions is likely to be associatedwith a greater tendency to take precautionsand therefore with lower risk Indeed a recentpaper by Finkelstein and McGarry (2006)supports such intuition by documenting anegative correlation between risk aversionand risk in the market for long-term careinsurance (see also Israel 2005 for the autoinsurance market in Illinois) Our marketmight however be special in ways that couldproduce a positive correlation First in con-trast to most insurance markets where apolicyholderrsquos risk depends on the policy-holderrsquos precautions but not on the precau-tions of others accident risk in the autoinsurance market is a result of an interactionbetween onersquos driving habits and those ofother drivers Second the correlation coeffi-cient may be highly sensitive to the particularway we measure risk and risk aversion Thereare many unobserved omitted factors that arelikely to be related to both dimensions Theintensity of vehicle use for example mightbe an important determinant of risk If indi-viduals who are more risk averse also drivemore miles per year a positive correlationbetween risk and risk aversion could emergeThus our results caution against assumingthat risk and risk aversion are always nega-tively correlated Whether this is the case maydepend on the characteristics of the particularmarket one studies and on the particular mea-sure for risk Indeed one can use estimatedannual mileage to control for one omittedvariable that may potentially work to producea positive correlation between risk aversionand risk Despite its partial coverage in thedata and being considered (by the company)

as unreliable40 controlling for annual mileagereported by policyholders reduces the esti-mated correlation coefficient to 06841 Weview this result as consistent with the possi-bility that underlying unobserved factors thataffect risk play an important role in generat-ing the estimated positive correlation betweenrisk and risk aversion A third explanation forthe positive estimated correlation is the dis-tributional assumption As discussed in Sec-tion IIC the correlation coefficient isprobably the parameter that is most sensitiveto these assumptions Indeed as discussedlater when we change our assumptions aboutthe Poisson process and use an alternativedistribution with extremely thin tails the es-timated correlation coefficient reverses signsWe of course view the assumptions of thebenchmark model as more appropriate andtherefore maintain the view that the data sug-gest a positive correlation Finally note thatwhile the correlation parameter we estimate ishigh the implied unconditional correlationbetween risk and risk aversion is less than025 across all reported specifications This isbecause the coefficients on the same covariate(for example the size of the car or whetherthe car is used for business) often affect riskand risk aversion in opposite directions andbecause of the logarithmic transformation

C Stability of the Risk Aversion Coefficients

Estimating risk preferences is motivated bythe idea that the same (or similar) risk aversionparameter may explain risky decisions acrossmultiple contexts This idea is at the heart of thecurrent debate we mention in the introductionregarding the empirical relevance of expectedutility and whether the standard construct of a

40 Insurance companies typically do not use these self-reported mileage estimates as they are considered unreli-able While companies could verify these estimates at thetime of a claim such reports are hard to enforce Anindividual can always claim that her ex ante estimate waslower than it turned out to be Indeed the estimated elas-ticity of risk with respect to estimated mileage is 006(0012) which seems low suggesting a bias downwardpotentially due to ldquoerrors in variablesrdquo bias

41 The full results from this specification are providedin the online Appendix Other results from this specifi-cation are similar to those we report for the benchmarkspecification

766 THE AMERICAN ECONOMIC REVIEW JUNE 2007

single risk aversion parameter that appliesacross many types of decisions is the right wayto think about consumer behavior To the bestof our knowledge there is no direct empiricalevidence on this issue While we defer much ofthis discussion to the concluding section weverify below that at least over a limited set ofchoices in which we observe the same individ-uals the estimated risk preferences help to ex-plain choices over time and across differentcontexts

Stability across ContextsmdashIdeally onewould like to show that the estimated risk pref-erences are stable across multiple lines of insur-ance and perhaps even across other riskydecisions individuals face Absent data on suchchoices we provide evidence that the estimatedrisk aversion coefficients help in predictingother related risky choices Individuals in ourdata had to make three additional coveragechoices in addition to the main policy choicethat we have analyzed so far They had tochoose whether to purchase coverage for caraudio equipment (bought by 498 percent of thesample of new policyholders) for towing ser-vices (bought by 929 percent) and for wind-shield damage (bought by 954 percent) Thesetypes of coverage are sold by the same companybut are administered by a third party so we donot have information about their realizationTherefore we cannot perform a similar exerciseto the one we perform for the deductible choiceOf course one should not expect these addi-tional coverage choices to be perfectly corre-lated with the deductible choice Even if thesame risk preferences are an important factorfor all of these decisions variation in risk acrosscoverage is also important For example own-ership of an expensive car audio system is likelyto affect the purchase of audio coverage inde-pendent of risk preferences Similarly ownersof old cars may value more coverage for towingservices Risk preferences however should en-ter into all of these coverage choices and there-fore we expect these additional types ofcoverages to be positively correlated with thechoice of a low deductible

We coded these three additional coveragechoices as three dummy variables and use themas additional covariates to verify that they canhelp explain unobserved risk aversion Firstwe add these variables to the probit regression

reported in column 3 of Table 3 They all havea positive statistically and economicallysignificant power in predicting the deductiblechoice42 To verify that these reported correla-tions are not driven by correlation in risks wealso estimate the benchmark model with theseadditional variables as covariates The esti-mated coefficients on the three additional cov-erage choices are positive and two of the threeare significant in both the risk and risk aversionequations they are 0053 (0019) 00002 (0021)and 0057 (0023) in the risk equation and0354 (0064) 0123 (0122) and 0813 (0174)in the risk aversion equation for audio towingand windshield coverage respectively (standarddeviations in parentheses)43 This suggests thatthe estimated risk preferences help in explainingmultiple coverage choices across these relatedcontexts

Stability over TimemdashWe now provide evi-dence that the estimated risk preferences arealso stable over time All the individuals whodecide to renew their policy with the companyafter it expires are at least in principle free tochange their deductible choice However anoverwhelming majority of individuals (morethan 97 percent) do not change their deductiblechoices when they renew Even individuals whoinitially chose the rarely chosen ldquohighrdquo andldquovery highrdquo deductible levels typically do notchange their choices Therefore it is also notsurprising that estimating the model on the firstdeductible choice of individuals and the seconddeductible choice of the same individuals yieldssimilar results as shown at the end of the nextsection Of course changes in deductiblechoices do not necessarily imply changes inrisk preferences Risk may also change overtime and may drive such changes At thesame time while persistence in deductiblechoices over time is consistent with stablerisk preferences it may also be driven bymany other factors For example it is reason-able to think that individuals do not devote

42 They are estimated to increase on average the prob-ability of choosing a low deductible by 0019 (819) 0023(527) and 0041 (758) for audio towing and windshieldcoverage respectively (z stat in parentheses)

43 The full results from this specification are provided in theonline Appendix Other results from this specification aresimilar to those we report for the benchmark specification

767VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

the same amount of thought to their renewaldecision as to their initial choice leading tothe observed persistence This caveat is themain reason why we focus most of the analysisonly on new policies

D Robustness

This section discusses the robustness of themain results Results from all specifications wemention in this section are summarized in Table6 The full results from these specifications arerelegated to the online Appendix

The von NeumannndashMorgenstern Utility Func-tionmdashTo derive the benchmark model we as-sume a negligible third derivative of the vNMutility function This is attractive as it allowsus to summarize risk preferences by a one-dimensional parameter r [u13(w)u(w)]There is however a large literature (eg MilesS Kimball and N Gregory Mankiw 1989) em-phasizing the importance of a (positive) third

derivative of the utility function which leads toprecautionary saving It is therefore importantto address the sensitivity of the results to thisassumption

By allowing a third-order Taylor approxima-tion of equation (4) we obtain

(15) r u13w

uw

p

d 1

d

uw

uw

dh2 dh dl dl

2

6d

which reduces to equation (7) when u(w) 0This illustrates that a positive third derivativeprovides an additional (precautionary) incentiveto insure It also shows that in order to fullydescribe preferences in the presence of a thirdderivative a one-dimensional parameter doesnot suffice one needs to know both r [u13(w)u(w)] and [u(w)u(w)] This makesit less attractive for estimation

TABLE 6mdashROBUSTNESS

Specificationa Sample Obs

Claim risk () Absolute risk aversion (r)

Corr(r ) Mean Median Std dev Mean Median Std dev

Baseline estimatesBenchmark model All new customers 105800 0220 0217 0048 19 103 73 106 0020 0207 0839

The vNM utility functionCARA utility All new customers 105800 0219 0217 0048 87 104 b 98 106 b 0009b 0201 0826

The claim generating processBenchmark model No multiple claims 103260 0182 0180 0040 20 103 28 105 0018 0135 0547Thinner-tail risk distribution All new customers 105800 0205c 0171c 0155c 17 103 19 106 0020 0076 0916

The distribution of risk aversionLower-bound procedure All new customers 105800 mdash mdash mdash 37 104 0 0002 mdash mdash

Incomplete information about riskBenchmark model Experienced drivers 82966 0214 0211 0051 21 103 83 106 0021 0200 0761Benchmark model Inexperienced drivers 22834 0230 0220 0073 30 103 12 107 0032 0186 0572Learning model All new customers 105800 0204 0191 0084 12 103 16 106 0016 0200 0772

Sample SelectionBenchmark model First two years 45739 0244 0235 0066 31 103 26 105 0026 0225 0699Benchmark model Last three years 60061 0203 0201 0043 16 103 34 107 0021 0113 0611Benchmark model Referred by a friend 26434 0213 0205 0065 30 103 84 107 0031 0155 0480Benchmark model Referred by advertising 79366 0219 0216 0051 21 103 76 106 0022 0212 0806Benchmark model Non-stayers 48387 0226 0240 0057 23 103 77 107 0026 0149 0848Benchmark model Stayers 1st choice 57413 0190 0182 0057 29 103 29 105 0024 0152 0463Benchmark model Stayers 2nd choice 57413 0208 0200 0065 30 103 16 105 0026 0211 0637

Note The table presents the key figures from various specifications and subsamples tracing the order they are presented in Section IIID Full results (in the format

of Table 4) from all these specifications are available in the online Appendixa ldquoBenchmark modelrdquo refers to the benchmark specification estimated on various subsamples (the first row replicates the estimates from Table 4) The other

specifications are slightly different and are all described in more detail in the corresponding parts of Section IIIDb The interpretation of r in the CARA model takes a slightly different quantitative meaning when applied to noninfinitesimal lotteries (such as the approximately

$100 stakes we analyze) This is due to the positive third derivative of the CARA utility function compared to the benchmark model in which we assume a small

third derivative Thus these numbers are not fully comparable to the corresponding figures in the other specificationsc The interpretation of in the thinner-tail distribution we estimate is slightly different from the standard Poisson rate which is assumed in the other specifications

Thus these numbers are not fully comparable to the corresponding figures in the other specifications

768 THE AMERICAN ECONOMIC REVIEW JUNE 2007

We can allow for a positive third derivativewithout expanding the dimensionality of pref-erences by imposing a parametric functionalform on the utility function Essentially suchparametric form imposes a relationship between[u13(w)u(w)] and [u(w)u(w)] Two standardforms are those that exhibit CARA and thosethat exhibit constant relative risk aversion(CRRA) CRRA requires us to make additionalassumptions about the relevant (and unob-served) wealth level of each individual makingit less attractive The CARA case is imple-mented below

With CARA utility we substitute u(w) exp(rw) in equation (4) and rearrange toobtain

(16) rp

exprdh exprdl

This equation defines the indifference set Unlikethe benchmark model there is no closed-formrepresentation for ri() a property that makesestimation significantly slower Due to the precau-tionary saving incentive arising from the thirdderivative we can use a lower level of absoluterisk aversion to rationalize the low deductiblechoice given In other words the CARA indif-ference set is flatter in comparison to the bench-mark case depicted in Figure 2 Thus the CARAspecification will generally lead to lower estimatesof the coefficient of absolute risk aversion asshown in Table 6 The mean and dispersion ofestimated absolute risk aversion (but not the me-dian) are smaller by about a factor of two Thegeneral qualitative results however remain thesame As shown in Table 5 the mean individual isquite risk averse with respect to lotteries of $100magnitude the median is close to risk neutral withrespect to such lotteries and heterogeneity in riskaversion is important

The Claim-Generating ProcessmdashIn thebenchmark model we assume that claims aregenerated by a Poisson model This assumptionis both analytically attractive (it is important inderiving equation (4)) and in our view a rea-sonable approximation of the data-generatingprocess for claims The Poisson distribution hasbeen widely used in economics to model acci-dent counts (eg Nancy L Rose 1990 Dionneand Vanasse 1992 Ronald Michener and CarlaTighe 1992) In using this distribution re-

searchers have followed the long history of theuse of Poisson by actuaries and insurance spe-cialists going back to Ladislaus Bortkiewicz(1898)

An important restriction of the Poisson dis-tribution is that its mean and variance are thesame Although some economic studies con-firmed this assumption for particular accidentdata (eg Christopher M Auld et al 2001) it isoften the case that this restriction is falsifiedThe most common deviation from the Poissonrestriction is that of fat tails ie variance that ishigher than the mean This led researchers touse a negative binomial distribution to accom-modate this regularity by introducing a secondparameter which delinks the relationship be-tween the mean and the variance One naturalinterpretation of the fat tails is that of unob-served heterogeneity and the negative binomialdistribution can be viewed as a Gamma mixtureof Poisson processes Consistent with this viewwe assume that the claim-generating processfollows a Poisson process at the individuallevel but allows unobserved heterogeneity inrisk and estimates a lognormal mixture of Pois-son processes which is similar to a negativebinomial The dispersion parameter we esti-mate is a free parameter which is identi-fied by the fatness of the tails of the claimdistribution

While one would like to allow fat tails of theaggregate claim distribution one may criticizethe assumption of a Poisson distribution at theindividual level for having tails that may be ldquotoofatrdquo Recently Jaap H Abbring et al (2003) andMark Israel (2004) provided evidence for neg-ative state dependence in data from auto insur-ance claims similar to the ones we useControlling for heterogeneity across individu-als these papers show that a second claim isless likely than a first This may happen due toexperience rating or perhaps to more carefuldriving and less intense use of a car after anaccident The Poisson distribution assumes thatthe second accident is just as likely so negativestate dependence may suggest thinner tails atthe individual level

To verify that the main results are not sensi-tive to this restriction of the Poisson model weperform two tests First we estimate the bench-mark model on a sample that includes onlyindividuals with one or no claims The model isstill identified using variation in policy durations

769VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

The estimates of the level of risk aversion andits dispersion across individuals remain similarto those of the benchmark model (Table 6) Theestimates of risk and the correlation with riskaversion are significantly lower But this isexpected we selected out of the sample thehigh-risk individuals

As a further test for the sensitivity of theresults to possible negative state dependencewe estimated the model with a different distri-bution at the individual level which gives riseto much thinner tails We could not find anyknown distribution of count variables that givesrise to tails thinner than Poissonrsquos so we madeone up In particular for a given Poisson ratelet pn be the probability of observing n occur-rences The distribution we take to the data issuch that

(17) Prn pn2

yenm0 pm2

Thus this distribution makes the probability ofmultiple claims much lower probably muchmore so than any realistic negative state depen-dence in the data Essentially such a distribu-tion makes the econometrician view individualswith multiple claims as high-risk individualswith much more certainty as it requires muchmore ldquobad luckrdquo for low-risk individuals tohave multiple claims The estimates for thelevel of risk aversion and its dispersion arealmost the same as in the benchmark model(Table 6) The rest of the summary figures aredifferent The difference in the level of risk isnot informative the interpretation of isslightly different given the change in the un-derlying distribution The dispersion in ismuch higher This is a direct result of the thin-tail assumption The dispersion of is identifiedby the tails of the claim distribution at theaggregate level With thinner tails imposed onthe individual distribution more heterogeneityis needed to match the observed tails at theaggregate Finally the correlation coefficient inthis specification changes signs and is now neg-ative and close to 1 This is consistent withour discussion in Section IIC which emphasizesthat the identification of the correlation coeffi-cient is closely tied to the structural assump-tions Once these are changed the estimatedcorrelation coefficient would change too Most

importantly however the mean and dispersionof risk aversion are stable

The Distribution of Absolute Risk Aver-sionmdashIn the benchmark model we assume thatthe coefficient of absolute risk aversion is lognor-mally distributed across individuals Since onlyfew previous studies focused on heterogeneity inrisk preferences there is not much existing evi-dence regarding the distribution of risk prefer-ences The only evidence we are aware of is theexperimental results presented by Steffen Andersenet al (2005) which show a skewed distributionwith a fat right tail which is qualitatively consis-tent with the lognormal distribution we assume

An additional advantage of the normality as-sumption is computational It provides a closed-form conditional distribution allowing us to useonly univariate (rather than bivariate) draws inthe estimation procedure significantly reducingthe computational burden One may be con-cerned about the sensitivity of the results to thisdistributional assumption For example it maydrive the result that the median level of riskaversion is much lower than the mean

Incorporating alternative distributional as-sumptions significantly complicates and slowsthe estimation procedure44 As an alternativewe develop a procedure that we believe pro-vides some guidance as to the sensitivity of theresults to this distributional assumption Thedisadvantage of the procedure is that it cannotaccount for adverse selection Since we foundthat adverse selection is not that important thisexercise is informative The exercise conveysthat the main qualitative results are not drivenby the (fat) tails of the lognormal distributionwe impose Rather they are driven by the highfraction of individuals who chose a low deduct-ible despite being of low risk The model im-plies that such individuals must have a fairlyhigh level of risk aversion

The exercise uses a Gibbs sampler to estimatethe lognormal distribution of i given the co-variates and the observed number of claims Ineach iteration of this Gibbs sampler conditional

44 A simple discrete-type distribution of risk aversion isnot well identified with the given data This is due to thenature of the exercise The distribution is identified bythe binary deductible choice and by the ldquolocalrdquo variation inthe pricing menu This variation is not enough to pin downthe discrete types accurately

770 THE AMERICAN ECONOMIC REVIEW JUNE 2007

on the data and the most recent draws for theindividual irsquos we compute lower bounds forthe level and dispersion of risk aversion (inaddition to generating the draws for ri from theconditional lognormal distribution) To deter-mine the lower bound for the level of risk aver-sion we compute

(18) r i i 0 if choicei 0

max0 pi

idi 113d i

if choicei 1

Namely we make ri as low as possible giventhe assumptions of the model We assume thatindividuals who chose a regular deductible arerisk neutral while individuals who chose a lowdeductible are just indifferent between the twodeductible levels (unless i is high enough inwhich case they are also assumed to be riskneutral) We then compute the average over therirsquos To compute the lower bound for the dis-persion we search for values of rirsquos that areconsistent with the assumptions of the modeland the observed data and that minimize thevariance of ri This turns out to be a simplesearch procedure that is linear in the number ofindividuals

The lower bound of the level of risk aversionthat we obtain from this exercise is 368 104

NIS1 (with standard deviation of 638 106)This level is five times lower than the analogouspoint estimate reported in Table 4 This trans-lates into indifference about participating in a50ndash50 lottery of gaining $100 and losing $885Thus it still suggests a significant level of riskaversion for lotteries of $100 magnitude Simi-larly the conversion to relative risk aversion asin Table 5 implies a relative risk aversion co-efficient of 1883 The result for the lowerbound of the dispersion is 189 103 (withstandard deviation of 816 105) which is tentimes lower than the benchmark estimate Thusthe coefficient of variation for absolute riskaversion declines only by two suggesting ahigh degree of heterogeneity Finally oneshould note that these bounds involve extremeassumptions and are computed separately andtherefore cannot be binding at the same timeThus the ldquocorrectrdquo estimates for both quantities

are likely to be higher closer to the results wereport for the benchmark model

Incomplete Information about RiskmdashThroughout we assume that individuals haveperfect information about their individual-specific objective risk type i This is a strongerassumption than we need Because expectedutility is linear in probabilities it suffices thatindividualsrsquo expected risk rate is equal to theirobjective risk rate ie i E(iIi) i wherei is a random variable representing individualirsquos perceived risk rate and Ii is individual irsquosinformation at the time of the coverage choiceNamely individuals could be uncertain abouttheir risk type but their point estimate has to becorrect

There are several channels through whichincomplete information may operate Let usconsider two such cases First suppose thatindividuals are correct but only on average iethat i i i where E(i) 0 The intuitionfor this case is similar to an ldquoerrors in variablesrdquomodel and in principle will result in an evenless important role for adverse selection Giventhat we find a relatively small effect of adverseselection this bias will not change this conclu-sion This may be even more pronounced ifcorr(i i) 0 which reflects a reasonableassumption of ldquoreversion to the meanrdquo ie thatan individualrsquos estimate of his risk type is someweighted average between his true risk type andthe average risk type of individuals who aresimilar (on observables) to him The conclusionmay go in the other way only if the mistakes goin the other direction according to which indi-viduals who are riskier than average believe thatthey are even more risky than they truly areThis we believe is less plausible

As a test for the sensitivity of the results tothe complete information assumption we per-form two exercises The first exercise estimatesthe benchmark model separately for experi-enced drivers and inexperienced drivers wherewe define an individual to be an experienceddriver if he has ten years or more driving expe-rience This is a conservative definition com-pared to those used in the literature (Chiapporiand Salanie 2000 Cohen 2005) The underlyingassumption consistent with evidence providedin Cohen (2005) is that drivers learn about theirown risk types as they gain more driving expe-rience but that most of this learning occurs

771VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

within the first ten years of driving Thus ourcomplete information model is a more appeal-ing description of experienced drivers Theresults for experienced drivers are almost iden-tical to the results from the benchmark specifi-cation (Table 6) suggesting that the mainresults are not driven by the fraction of individ-uals who are less likely to know their risk typesThe results for inexperienced drivers show asimilar pattern and magnitude Consistent withthe learning hypothesis however they show alarger dispersion in risk preferences which maybe due to incomplete information and there-fore to more random choices of deductiblelevels

The second exercise we perform is a morestructural version of the learning story whichallows even experienced drivers to have incom-plete information regarding their risk types Wedo so by estimating a different specification ofthe model that assumes that individuals areBayesian and update their information abouttheir own (stable) risk types over time usingonly information about the number of claimsthey make each year While we do not observecomplete claims histories of individuals wecan simulate such histories and integrate overthese simulations Thus individualsrsquo informa-tion would be related to their true type andwould be more precise with longer driving his-tories The full modeling and estimation detailsof this specification are provided in the Appen-dix We view this model as an extreme versionof incomplete information as there are manyother sources through which individuals maylearn about their own types and thereby havebetter information about their types than whatwe estimate them to have While the resultsimply that the levels of risk aversion and heter-ogeneity are lower than the benchmark esti-mates (Table 6) the order of magnitude andqualitative pattern are quite similar suggestingthat the main qualitative findings are robust tothe information structure

Sample SelectionmdashThere are two differentways to think about the contract selection pro-cess One possibility is that individuals firstselect an insurer based on advertisement wordof mouth or access to an agent Then individ-uals select the insurance contract from amongseveral contracts the selected insurer offersThis selection process is consistent with the way

the company executives view their business45

Another possibility is that individuals first col-lect information about all available insurancecontracts and then choose their most preferredone According to the industryrsquos conventionalwisdom individuals do not shop much acrossinsurers and therefore this selection processseems less important

The unique characteristics of direct insurersmay attract individuals who are more likely toexperiment with new ways of doing businessand may therefore be less risk averse than thegeneral population In Table 7 we compare thedemographics of our sample of policyholderswith those of the general Israeli populationThis comparison reflects a similar intuitioncompared with the general population our av-erage policyholder is slightly younger moreeducated more likely to be male and less likelyto be married or an immigrant This direction ofselection may also apply to unobserved riskpreferences thereby making our policyholderson average less risk averse than a representa-tive individual This suggests that the level ofrisk aversion that we find may be viewed as alower bound on the level of risk aversion in thegeneral Israeli population

One potential way to model sample selectionis to allow for an additional outside option to beselected For the vast majority of individuals weobserve the outside option is to purchase sim-ilar insurance from competing insurance agen-cies Unfortunately data on the structure ofcompeting contracts their prices and the waythey vary with individual characteristics are un-available This makes us uncomfortable to try tomodel sample selection as results from anysuch model will be driven by our assumptionsrather than by meaningful variation in the dataThe results are still meaningful for two reasonsFirst this is a large population accounting forabout 7 percent of all drivers in Israel Secondto the extent that our estimates suggest higherlevels of risk aversion than previously esti-mated and that the sample selection is likely to

45 As discussed in Section I the company was the firstdirect insurance provider and it offered significantly lowerpremiums than those offered by competitors due to signif-icant cost advantage In Section I we also discuss theliterature which emphasizes that choice of direct insurers isdriven primarily by nonmonetary ldquoamenitiesrdquo

772 THE AMERICAN ECONOMIC REVIEW JUNE 2007

bias these estimates downward the results arestill informative

To assess the magnitude of sample selectionin driving the main results we perform severaltests First we estimate the benchmark modelseparately for the first two years of the compa-nyrsquos operation during which the company wasthe only company to sell insurance directly andfor the remaining three years after additionaldirect insurers entered the market Second weestimate the benchmark model separately forindividuals who were referred to the companyby word of mouth and for those who heardabout the company through advertisement (pri-marily on television) The latter may havesearched more and might be more price sensi-tive Third we estimate the benchmark modelseparately for individuals who renewed theirpolicy with the company and for those who didnot renew It seems likely that individuals whodid not renew are less selected as most of themswitch back to regular insurers who insure themajority of the population Fourth we estimatethe model for the second deductible choicemade by those individuals who renew It couldbe argued that switching to other companies is

costly so outside options are not as important indriving deductible choices for those who renewAll these results are summarized in Table 6 andshow some important differences between thetwo groups within each pair of subsamples prob-ably reflecting selection For all groups howeverthe qualitative pattern of the results and the orderof magnitude and dispersion of risk aversion aresimilar to those of the full sample This is sugges-tive that correcting for sample selection is unlikelyto change the qualitative results

E Caveats

Moral HazardmdashThroughout our analysiswe abstract from moral hazard ie we assumethat i can vary across individuals but is invari-ant to the coverage choice There are two typesof moral hazard that may play a role in thiscontext First individuals with less coveragemay take greater precaution and drive morecarefully thereby reducing their claim risk rateSecond conditional on a claim event peoplewith higher deductibles are less likely to file aclaim there exists a range of claims for which

TABLE 7mdashREPRESENTATIVENESS

Variable Sampleb Populationc Car ownersd

Agea 4114 (1237) 4255 (1801) 4511 (1413)Female 0316 0518 0367Family Single 0143 0233 0067

Married 0780 0651 0838Divorced 0057 0043 0043Widower 0020 0074 0052

Education Elementary 0029 0329 0266High school 0433 0384 0334Technical 0100 0131 0165College 0438 0155 0234

Emigrant 0335 0445 0447

Obs 105800 723615 255435

a For the age variable the only continuous variable in the table we provide both the mean and the standard deviation (inparentheses)

b The figures are derived from Table 1 The family and education variables are renormalized so they add up to one weignore those individuals for whom we do not have family status or education level This is particularly relevant for theeducation variable which is not available for about half of the sample it seems likely that unreported education levels arenot random but associated with lower levels of education This may help in explaining at least some of the gap in educationlevels across the columns

c This column is based on a random sample of the Israeli population as of 1995 We use only adult population ieindividuals who are 18 years of age or older

d This column is based on a subsample of the population sample The data provide information about car ownership onlyat the household level not at the individual level Thus we define an individual as a car owner if (a) the household owns atleast one car and the individual is the head of the household or (b) the household owns at least two cars and the individualis the spouse of the head of the household

773VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

filing is profitable only under a low deductible(the literature often refers to this second effectas ldquoex post moral hazardrdquo)

To the extent that moral hazard exists abstract-ing from it will likely bias our estimates of riskaversion downward To see this note that adjust-ing behavior will help individuals to self-insureagainst uninsured costs Similarly ex post moralhazard would reduce the value of a low deduct-ible as in the event of a claim the gain from alower deductible would sometimes be less thanthe difference between the two deductible levelsBoth these effects will make a low deductible lessattractive requiring individuals to be even morerisk averse than we estimate in order to purchasemore coverage Below we discuss why in ourview abstracting from moral hazard is a reason-able approximation in this setting

It seems reasonable to conjecture that ceterisparibus insured individuals will drive less care-fully than uninsured ones It may also seemreasonable that the existence of a deductiblemay make individuals more careful about smalldamages to their car However when all choicesinclude a deductible and deductibles are similarin their magnitude it seems less likely thatdriving or care behavior will be affected46 Fi-nally to separately identify moral hazard onewould need another dimension of the data overwhich risk types remain fixed but coveragechoices vary exogenously (Israel 2004 Jaap HAbbring et al forthcoming)

We rely on the data to justify why we abstractfrom the second potential effect that of ex postmoral hazard Figure 3 presents data on theclaim amounts and shows that about 99 percentof the claims filed by policyholders with low-deductible policies were for amounts greaterthan the higher deductible level If the distribu-tion of amounts of potential claims does notvary with the deductible choice and if individ-uals file a claim for any loss that exceeds their

deductible level this suggests that 99 percent ofthe claims would have been filed under eitherdeductible choice making the assumption toabstract from moral hazard not very restrictive

Individuals however may choose not to filea claim even when the claim amount exceedsthe deductible level This may happen due toexperience rating which increases future insur-ance premiums These dynamic effects do notdepend on the deductible level at the time of theclaim so they simply enter in an additive wayUsing our data on individuals who renew theirpolicies with the company we can assess howbig the dynamic effects are These data showthat the price effect of a claim lasts for threeyears and is highest when an individual files hissecond claim within a year In such a case hewould face about a 20 percent increase in hisinsurance premium in the subsequent year 10percent in the year after and 5 percent in thethird year after the claim The regular premiumis about twice the regular deductible amount soan upper bound for the dynamic costs is 70percent of the regular deductible In most casesthe actual dynamic costs are much lower thanthis upper bound (the dynamic costs of say thefirst claim within a year are close to zero) Inaddition an individual can opt out of the con-tract and switch to a different insurance pro-vider This is likely to reduce his dynamic costsbecause in Israel unlike in the United Statesand many other countries there is no publicrecord of past claims Therefore insurance pro-viders can take full advantage of past recordsonly for their past customers For this reasonnew customers will of course face higher pre-miums than existing ones but the premium in-crease would not be as high as it would havebeen with the old insurance provider This isdue to the presence of ldquoinnocentrdquo new custom-ers who are pooled together with the switchers(Cohen 2003)47 Using 70 percent as a conser-vative upper bound Figure 3 shows that about93 percent of those claims filed by individualswith a low deductible would have also been46 This assumption is consistent with Cohen and Einav

(2003) who find no evidence for behavioral response tochanges in seat belt laws The following anecdotal obser-vation also supports this In an informal survey we con-ducted among our colleagues all of them were aware of adeductible in their auto insurance policy but fewer than 20percent knew its level This does not imply that 80 percentof our colleagues did not pay attention to their deductiblechoice at the time the choice was made It does implyhowever that their driving or care behavior cannot dependon the deductible level

47 New customers may voluntarily report their claimshistory to their new insurance provider Voluntary disclo-sure of past claims is as may be expected not truthful Ourdata suggest an unconditional claim rate of 02453 in oursample population Our data on claims history as voluntar-ily disclosed by the same individuals suggest a claim rate of00604 which is four times lower

774 THE AMERICAN ECONOMIC REVIEW JUNE 2007

filed with a regular deductible While this is notnegligible it applies for only a small fraction ofindividuals For the vast majority of them the99 percent discussed above is a more appropri-ate benchmark Therefore ex post moral hazardis unlikely to play a major role in this settingand one can abstract from the loss distributionand focus on claim rates as we do in thispaper48

Additional Cost of an AccidentmdashOur modelassumes that in the event of an accident theonly incurred costs are those associated with thedeductible payment In practice however other

transaction costs may be incurred such as thetime spent for appraisal of the damage the costsassociated with renting a replacement car for theduration of a repair etc Such costs could bereadily incorporated into the model To illus-trate we assume that these costs are known inadvance and are given by a constant c (whichcould in principle vary with each individual)Since c will not vary with the chosen level ofdeductible it will not affect the value d andwill enter the empirical model only through itseffect on d In particular equation (7) willchange to

(19) r

p

d 1

d c

and everything else will remain the same

48 We would not be as comfortable with this statementfor the choice of high and very high deductibles which areat much higher levels This is one additional reason to focusonly on the choice between low and regular deductibles

FIGURE 3 CLAIM DISTRIBUTIONS

Notes This figure plots kernel densities of the claim amounts estimated separately depending on the deductible choice Forease of comparison we normalize the claim amounts by the level of the regular deductible (ie the normalization is invariantto the deductible choice) and truncate the distribution at ten (the truncated part which includes a fat tail outside of the figureaccounts for about 25 percent of the distribution and is roughly similar for both deductible choices) The thick line presentsthe distribution of the claim amounts for individuals who chose a low deductible while the thin line does the same for thosewho chose a regular deductible Clearly both distributions are truncated from below at the deductible level The figure showsthat the distributions are fairly similar Assuming that the claim amount distribution is the same the area below the thickerline between 06 and 1 is the fraction of claims that would fall between the two deductible levels and therefore (absentdynamic incentives) would be filed only if a low deductible were chosen This area (between the two dotted vertical lines)amounts to 13 percent implying that the potential bias arising from restricting attention to claim rate (and abstracting fromthe claim distribution) is quite limited As we discuss in the text dynamic incentives due to experience rating may increasethe costs of filing a claim shifting the region in which the deductible choice matters to the right an upper bound to these costsis about 70 percent of the regular deductible covering an area (between the two dashed vertical lines) that integrates to morethan 7 percent Note however that these dynamic incentives are a very conservative upper bound they apply to less than 15percent of the individuals and do not account for the exit option which significantly reduces these dynamic costs

775VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

This implies that in principle such costs willhave no effect on the results of the counterfac-tual exercise we present later The costs willhowever affect the interpretation of the esti-mates of risk aversion In particular instead ofthe distribution of r we will now be estimatingthe distribution of r(d c)d so the reportedestimates of the coefficient of absolute riskaversion will be biased upward The magnitudeof the bias depends on the size of this transac-tion cost c compared to the average deductibled If c is relatively small the bias is negligibleIf however c is as big as the (average) deduct-ible level all our reported estimates of the levelof risk aversion should be divided by two (butthe coefficients on observables which are semi-elasticities will not change) The intuitionwould be similar but more involved if c variesacross individuals but not proportionally to d

Data about such transaction costs are ofcourse not available The following back-of-the-envelope exercise may provide some guidanceas to the magnitude of such costs We collecteddata from the Israeli police about the annual num-bers of accidents49 accidents with fatalities andaccidents with severe injuries in Israel for theyears 1996ndash1999 We then divided these numbersby our estimate of the total number of auto insur-ance claims in Israel (161859)50 We obtain that159 percent of the claims involve reported acci-dents 22 percent involve accidents with severeinjuries and 03 percent involve fatal accidentsThus the majority of claims reflect small unre-ported accidents perhaps suggesting that theseadditional costs of a claim are often not too large

Deviations from Expected Utility TheorymdashThroughout the paper we restrict attention toexpected utility maximizers Despite much ev-idence in the literature against some of the pre-dictions of expected utility theory it still seemsto us the most natural benchmark to specify andone that facilitates comparison to previous stud-ies We note that expected utility theory is as-sumed it is not and cannot be tested within our

framework Given our cross-sectional analysiswhich in principle allows flexible forms ofunobserved heterogeneity in risk preferencesthere are no testable restrictions imposed byexpected utility theory We should also note thatmuch (but not all) of the documented evidenceagainst expected utility theory arises with ex-treme risk probabilities which are close to zeroor one Our data (and our estimates) are basedon risk probabilities that are all in the range of010 to 035 Over this range expected utilityseems to perform better Finally it is importantto stress two points First at the conceptuallevel it is straightforward to use an alternativetheory of decisions under uncertainty If con-ditional on objective risk individuals vary in asingle dimension the same conceptual modeland empirical strategy can be applied All oneneeds to do is to specify the parameter overwhich decisions vary and construct an indiffer-ence set in the space of the specified parameterand (objective) risk types similar to the onepresented in Figure 2 Second any alternativemodel of decisions under uncertainty would re-quire us to take an even stronger view regardingthe parameterized objective function For exam-ple prospect theory (Daniel Kahneman andAmos Tversky 1979) would require us to pa-rameterize not only the curvature of individu-alsrsquo utility functions but also their referencepoints for which there is no natural choice inour context Similar issues arise if we try toapply decision weights (Tversky and PeterWakker 1995) or measures of overconfidencewith respect to driving ability

F Implications for Profits and Pricing

We now look at how firm profits vary withalternative pricing schemes This exercise isinteresting for several reasons First althoughwe do not use supply-side information for esti-mation it shows how one may incorporate suchinformation in estimation Second it illustratesthe conceptual trade-off faced by a monopolistthat operates in a market with adverse selectionAlthough the conceptual trade-off betweenhigher demand and worse selection is wellknown and has been extensively analyzed in thetheoretical literature quantifying it is importantto understand its empirical relevance Finallywe argued earlier that unobserved heterogeneityin risk aversion seems more important than un-

49 An accident is counted in this measure if it was re-ported to the police and a police officer arrived at the scene

50 This is estimated by taking the number of activepolicies at the end of our sample (45871) dividing it by ourbest guess for the share of the market the company had atthe time (7 percent) and multiplying it by the estimatedclaim rate in our data as computed in Table 2 (0245)

776 THE AMERICAN ECONOMIC REVIEW JUNE 2007

observed heterogeneity in risk The current ex-ercise shows that this conclusion also translatesto pricing and profits

Throughout this section we hold fixed thedistribution of risk and risk aversion the firmfaces Since we have little information about thedeterminants of overall demand faced by thefirm and hence how restrictive it is to hold thedistribution fixed we make the simplifying as-sumption that individuals make their choicessequentially They first choose the insuranceprovider by observing only the price of theregular deductible Once they decide to buy apolicy from the insurer they choose the deduct-ible level This seems a reasonable approxima-tion as the regular deductible is the one alwaysadvertised and initially quoted while the otheroptions are revealed only once the potentialcustomer and the insurance sales person ldquogetinto detailsrdquo Consistent with this assumptionwe assume that the regular premium and de-ductible are dictated by competitive conditionsand we focus on the choice of the low deduct-ible and its associated premium

From the companyrsquos standpoint each indi-vidual can be represented by a random draw of(i ri) from the conditional (on observables)distribution of risk and risk aversion

(20) ln i

ln ri Nxi

xi

2 r

r r2

When analyzing the optimal menu to offer suchan individual the company is assumed to berisk neutral and to maximize expected profitsSuppose the company offered only the regulardeductible and premium (dh ph) Let the ex-pected profits from this strategy be 0 Givenour assumptions we proceed by analyzing howthe firmrsquos profits are affected by a choice of anadditional low deductible option (dl pl) withdl dh and pl ph It is easy to use a changein variables and analyze the choice of d dh dl and p pl ph Expected profits arenow given by

(21) maxdp

0 Prri rii d pp

d Eiri rii d p

The trade-off in the companyrsquos decision isstraightforward Each new customer who chooses

the low combination pays an additional p up-front but saves d for each accident she is in-volved in This translates into two effects thatenter the companyrsquos decision problem The first issimilar to a standard pricing problem higher(lower) price difference (deductible difference)p (d) leads to a higher markup (on those indi-viduals who select the low deductible) but tolower quantity (or probability of purchase) asfewer individuals elect to choose the low deduct-ible This effect enters the profit function throughD(d p) Pr(ri ri(i d p)) The secondcomposition effect arises because of adverse se-lection As the price of the low deductible in-creases those individuals who still elect to choosethe low combination are ceteris paribus thosewith higher risk This effect enters through (dp) E(iri ri(i d p)) Its magnitude andsign depend on the relative heterogeneity of i andri and on the correlation between them Sinceneither D(d p) nor (d p) has a closed-form solution we analyze this decision problemgraphically where D(d p) and (d p) arenumerically computed using simulations from thejoint distribution

We illustrate our analysis by using the meanindividual in the data whose expected ln(i)and ln(ri) are 154 and 1181 respectively(based on Table 4) Such an individual has tochoose between a regular contract of ( phdh) (3189 1595) (in NIS) and a lowcontract of ( pl dl) (3381 957) ie (pd) (191 638) Below we discuss severaladditional counterfactual cases First we con-sider a case with a negative correlation be-tween risk and risk aversion (with the samemagnitude ie 084) Second we con-sider cases when the company ignores unob-served heterogeneity in one of the dimensionsie it views individuals as a draw from theestimated marginal distribution on one dimen-sion with the other dimension known and fixedat its estimated mean We do this exercise foreach dimension separately

To get intuition for the different effects Figure2 presents the estimated distribution in the spaceof (i ri) A small increase (decrease) in p (d)shifts the indifference setup to the right therebymaking some marginal individuals who were pre-viously just to the right of it switch to choosingthe regular deductible The demand trade-off isjust the comparison between the marginal loss ofthe company from all the marginal individuals

777VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

who no longer buy higher coverage with thehigher profits made from the inframarginal indi-viduals who still elect to choose higher coverageFigure 2 also helps in illustrating the effect ofadverse selection and the importance of the cor-relation coefficient As the menu shifts to the rightthe positive correlation implies that the marginalindividuals have higher risk than the average Thismeans that ldquolosingrdquo them (namely havingthem buy less coverage) is not as costly forthe insurance company as such individualsare on average more adversely selected Anegative correlation for example would havemade these marginal individuals more valuablethereby decreasing the incentive to increase pricesor deductibles from the current levels

Figure 4 presents the effects of pricing onprofits by varying the low-deductible levelkeeping the premium charged for it fixed at theobserved price (of p 191) It implies that the

current low deductible benefit of 638 NIS re-sults in additional annual profits of about 368NIS per customer This is about 037 percent oftotal operating profits per customer which isabout 1000 NIS Note however that after sub-tracting the administrative and claim-handlingcosts associated with each customer and claimthe relative magnitude of this effect will behigher Note also that the estimates imply thatthe current low-deductible level is suboptimalBy setting a smaller low-deductible benefit ofd 355 NIS (ie increasing the current lowdeductible by 283 NIS) additional profits canbe increased to 659 NIS51 Of course the find-

51 There is no reason of course to limit the choice of thecompany to only one additional deductible level Moredegrees of freedom in choosing the pricing menu will leadto higher profits

FIGURE 4 COUNTERFACTUALSmdashPROFITS

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) We plot the additional profits fromoffering a low deductible as a function of the attractiveness of a low deductible d dh dl fixing its price at the observedlevel The thick solid line presents the counterfactual profits as implied by the estimated benchmark model The other threecurves illustrate how the profits change in response to changes in the assumptions when the correlation between riskand risk aversion is negative (thin solid line) when there is no heterogeneity in risk aversion (dot-dashed line) and whenthere is no heterogeneity in risk (dashed line) The maxima (argmax) of the four curves respectively are 659 (355)714 (583) 0 and 704 (500) The dotted vertical line represents the observed level of d (638) which implies that theadditional profits from the observed low deductible are 368 NIS per policy

778 THE AMERICAN ECONOMIC REVIEW JUNE 2007

ing that the current pricing is suboptimal mayalso be due to a limitation of the model

Consistent with the intuition discussed aboveFigure 4 also shows that the incentive to in-crease prices (or lower the deductible) is higherwith positive correlation When the correlationbetween risk and risk aversion is negative theoptimal low deductible is lower It also showsthat ignoring either dimension of unobservedheterogeneity has an important effect on pric-ing decisions However while ignoring het-erogeneity in risk does not change thequalitative pattern by much ignoring hetero-geneity in risk aversion completely changesthe pricing analysis In fact given the esti-mated coefficients and the observed prices ifindividuals vary only in risk offering a lowdeductible does not increase profits due to ad-verse selection

Figure 5 breaks down the effect on profits bypresenting the pricing effect on the demand forlow deductible D(d p) and on the compo-sition effect (d p) The former is simplygenerated by the distribution of certainty equiv-alents implied by the joint distribution of i andri (Landsberger and Meilijson 1999) It is S-shaped due to the assumption of lognormal dis-tribution The shape of the composition effect isdriven by the relative variance of i and ri andby the correlation coefficient As the estimatesimply that most of the variation in certaintyequivalents is driven by variation in ri thestrong positive correlation implies that the com-position effect is monotonically decreasing inthe deductible level As the low-deductible op-tion becomes more favorable more peoplechoose it with the most risky individuals choos-ing it first The effect of the deductible level onthe composition effect is dramatically differentwhen the correlation between risk and risk aver-sion is negative When this is the case theobserved relationship between the deductiblelevel and the composition effect is mostly re-versed This is because the effect of risk aver-sion dominates that of adverse selection due toits higher variance

IV Concluding Remarks

The paper makes two separate contributionsFirst from a methodological standpoint we layout a conceptual framework through which one

can formulate and estimate a demand system forindividually customized contracts The key datarequirements for this approach are contractchoices individual choice sets and ex post riskrealizations Since such data may be available inmany other contexts the methodological frame-work may be useful to uncover structural pa-rameters in such settings As an example onecould consider annuity data and use guaranteeperiod choices and mortality data to identifyheterogeneity in risk (mortality) and in prefer-ences for wealth after death (Einav Finkelsteinand Paul Schrimpf 2006) Similarly one couldconsider loan data and use down-paymentchoices and default data to identify heterogene-ity in risk (default) and in liquidity (Will Ad-ams Einav and Jonathan D Levin 2006)Second from an economic standpoint we pro-vide a new set of estimates for the degree andheterogeneity of (absolute) risk aversion and itsrelationship with risk We discuss these below

While our estimates of risk aversion help topredict other related insurance decisions it isnatural to ask to what extent these parametersare relevant in other contexts This is essentiallyan empirical question which can be answeredonly by estimating risk aversion parameters fora variety of bet sizes and in a variety of con-texts Since isolating risk preferences in manycontexts is hard such exercises are rare leavingus with no definite answer for the scope ofmarkets for which our estimates may be rele-vant On one hand Rabin (2000) and Rabin andThaler (2001) argue that different decisions inlife are taken in different contexts and thereforemay be subject to different parameters in theutility function On the other hand classicaltheory suggests that each individual has onevalue function over her lifetime wealth so allrisky decisions take into account the same valuefunction and are therefore subject to the samerisk preferences Our view is somewhere inbetween We are more comfortable with extrap-olation of our risk aversion estimates to setupsthat are closer to the auto insurance marketcontext in which these parameters are esti-mated To assess ldquoclosenessrdquo it is important toconsider various factors over which contextsmay differ Such factors may include bet size aswell as informational and behavioral effectssuch as default options framing and rarity of theevents Since we use claim data to identify risktype we leave everything else to be interpreted as

779VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

FIGURE 5 COUNTERFACTUALSmdashSELECTION

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) Parallel to Figure 4 we breakdown the effects on profits to the share of consumers who choose a low deductible (bottom panel) and to the expected riskof this group (top panel) This is presented by the thick solid line for the estimates of the benchmark model As in Figure 4 wealso present these effects for three additional cases when the correlation between risk and risk aversion is negative (thin solidline) when there is no heterogeneity in risk aversion (dot-dashed line) and when there is no heterogeneity in risk (dashedline) The dotted vertical lines represent the observed level of d (638) for which the share of low deductible is 16 percentand their expected risk is 0264 This may be compared with the corresponding figures in Table 2A of 178 and 0309respectively Note however that the figure presents estimated quantities for the average individual in the data while Table2A presents the average quantities in the data so one should not expect the numbers to fit perfectly

780 THE AMERICAN ECONOMIC REVIEW JUNE 2007

risk aversion As an example overconfidence willbe captured by a lower level of estimated riskaversion so if overconfidence is more importantin auto insurance than in health insurance whenextrapolated to health insurance individuals maybehave as if they are more risk averse than weestimate them to be

With this caveat in mind let us discuss ourfour main findings First we find large hetero-geneity in risk preferences across individualsThis heterogeneity is important in various con-texts (a) it cautions against using pure cross-sectional variation to test expected utilitytheory (b) it may create strong selection ofparticipants into particular markets this maymake participants in voluntary markets quitedifferent in their risk preferences from those inmarkets in which participation is either manda-tory or driven by other factors and (c) it maymake the marginal individual quite differentfrom the average one Models in macroeconom-ics and finance which often use a representativeindividual framework may not be able to cap-ture and account for such differences

Our second set of findings concerns the wayrisk aversion relates to observable characteris-tics Our finding that women are more riskaverse than men has been documented in othersettings The finding that risk preferences ex-hibit a U-shaped pattern over the life cycle maybe interesting to explore further in other con-texts Other findings suggest that the estimatedcoefficient of risk aversion increases with ob-servables that are related to income and wealthAs we agree with the widely held belief of thedecreasing absolute risk aversion property ourpreferred interpretation for this finding is thatwealth and income may be endogenous gener-ating the estimated cross-sectional relationshipWhile lower risk aversion may be associatedwith higher propensity to become an entrepre-neur and thereby have higher wealth it mayalso be associated with lower propensity to saveor invest in education affecting wealth the otherway Therefore one important message of thisfinding is that accounting for heterogeneity inpreferences may be important as representativeconsumer models may provide misleading in-terpretations for otherwise natural results

The last two sets of findings concern therelationship between risk and preferences Sincerisk is particular to the context in which it ismeasured these findings may be sensitive to the

market context and may change once risk takesother forms Even within the auto insurancemarket it is important how risk is measuredMoreover risk in the auto insurance marketmay be conceptually different from risk in othermarkets In many markets risk is independentacross individuals In auto insurance howevermuch of the risk depends on coordinationamong drivers and therefore may be more re-lated to relative not absolute characteristicsFor this reason our finding of positive correla-tion between risk and risk aversion can coexistwith findings of negative correlation in othercontexts (Israel 2005 Finkelstein and McGarry2006) The positive correlation we find is alsoconsistent with the fact that the bivariate probittest in our data provides evidence for adverseselection (Cohen 2005) while similar reduced-form tests in other contexts do not

Finally we find that unobserved heterogene-ity in risk preferences is more important thanheterogeneity in risk This may be driven by thecasual evidence that insurance companies exertmuch effort and resources in collecting con-sumer data which are informative about riskclassification but not about preferences52 Weillustrate the empirical importance of our find-ings for the analysis of optimal contracts in autoinsurance The presence of more than one di-mension of unobserved heterogeneity may dra-matically change the nature of these contractsTheory is still not fully developed for suchmultidimensional screening problems as it typ-ically requires a small number of types (Lands-berger and Meilijson 1999) restricts the twodimensions to be independent of each other(Jean-Charles Rochet and Lars A Stole 2002)or assumes that the number of instrumentsavailable to the monopolist is not smaller thanthe dimension of unobserved heterogeneity(Steven Matthews and John Moore 1987 Rich-ard Arnott and Joseph E Stiglitz 1988)53 MarkArmstrong (1999) may be the closest theoreticalwork to the framework suggested here It cannot

52 This choice of data collection efforts may be justifiedif it is easier for such firms to price discriminate based onrisk but harder to price discriminate based on preferencesA common belief is that without cost-based (ie risk-based) justification for prices price discrimination may leadto consumer backlash

53 See also Smart (2000) Villeneuve (2003) and Jullienet al (2007) for related theoretical results

781VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

be directly applied however as it uses simpli-fying linearity assumptions which would behard to impose in the current context Our re-sults indicate that many applications can benefitfrom extending the theory to include the more

general case such as the one analyzed hereSuch a theory may also serve as a guide forusing supply-side restrictions in similar con-texts Our counterfactual analysis is a very pre-liminary start in this direction

APPENDIX A DESCRIPTION OF THE GIBBS SAMPLER

In this appendix we describe the setup of the Gibbs sampler that we use to estimate the modelOne of the main advantages of the Gibbs sampler is its ability to allow for data augmentation oflatent variables (Tanner and Wong 1987) In our context this amounts to augmenting the individual-specific risk aversion and risk type namely i rii1

n as additional parametersWe can write the model as follows

(22) ln i xi i

(23) ln ri xi vi

(24) choicei 1 if ri ri ()0 if ri ri ()

(25) claimsi Poissoni ti

(26) i

vi

iid

N00

2 r

r r2

Let

2 r

r r2 X

x1

xn

0

0 x1

xn

y r and ui i

vi

The set of parameters for which we want to have a posterior distribution is given by uii1

n The prior specifies that are independent of uii1n have a conventional

diffuse prior We adopt a hierarchical prior for uii1N

(27) uii 1n

iid

N0

(28) 1 Wishart2a Q

782 THE AMERICAN ECONOMIC REVIEW JUNE 2007

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 16: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

TABLE 3mdashNO HETEROGENEITY IN RISK

Variable

Poisson regressiona Probit regressionsc

Dep var Number of claims Dep var 1 if low deductible

(1) (2) (3)

Coeff Std Err IRRb Coeff Std Err dPdX dPdXDemographics Constant 43535 02744 mdash 224994 23562 mdash

Age 00074 00052 09926 01690 00464 00034 00042Age2 989 105 536 105 10001 00017 00005 339 105 455 105Female 00456 00161 09555 08003 01316 00165 00134Family Single omitted omitted omitted

Married 01115 00217 08945 06417 01950 00128 00045Divorced 00717 00346 10743 03191 03127 00064 00044Widower 00155 00527 10156 03441 04573 00069 00025Other (NA) 01347 01755 11442 29518 19836 00511 00370

Education Elementary 00522 00550 09491 02927 04278 00061 00016High school omitted omitted omittedTechnical 00373 00308 10380 05037 02520 00105 00138College 00745 00197 09282 05279 01546 00109 00053Other (NA) 00116 00184 10116 00630 01540 00013 00025

Emigrant 00210 00163 10213 00497 01328 00010 00003

Car attributes Log(Value) 01227 00281 11306 13285 02342 00271 00299Car age 00187 00042 10189 01603 00357 00033 00017Commercial car 01394 00326 08699 09038 02781 00177 00294Log(engine size) 02972 00459 13461 06924 03952 00141 00075

Driving License years 00204 00034 09798 01043 00312 00021 00005License years2 00002 677 105 10002 00015 00005 299 105 167 105

Good driver 00176 00191 09825 07207 01574 00148 00152Any driver 00564 00169 10580 10923 01321 00217 00258Secondary car 00859 00209 09177 00038 01626 00001 00070Business use 01852 00293 12034 07951 02737 00156 00017History length 00527 00110 09486 11218 01450 00228 00171Claims history 06577 00390 19304 04654 05460 00095 00496

Young driver Young driver 05235 00361 16879 28847 06305 00524 00012Gender Male omitted omitted omitted

Female 01475 00288 08629 17959 03135 00396 00195Age 17ndash19 omitted omitted omitted

19ndash21 00701 00532 10726 07800 07509 00153 0014721ndash24 00267 00574 09737 05746 07773 00114 0015624 02082 00567 0812 17869 07328 00397 00136

Experience 1 omitted omitted omitted1ndash3 02416 00458 07854 10175 06577 00217 000093 02827 00532 07538 32513 07386 00762 00410

Company year First year omitted omitted omittedSecond year 00888 00198 09150 44513 01506 00792 00859Third year 00690 00222 09334 85888 01820 01303 01367Fourth year 01817 00232 08339 118277 02102 01616 01734Fifth year 05431 0028 05810 143206 02778 01920 02085

105586Obs 105800 94000 105800Pseudo R2 00162 01364 01296Log likelihood 577459 369595 430869

Significant at the 5 percent confidence levela Maximum likelihood estimates Variation in exposure (policy duration) is accounted forb IRR Incidence rate ratio Each figure should be interpreted as the increasedecrease in claim probability as a result of an increase of one unit in the

right-hand-side variablec There are two separate probit regressions reported in this table Column 2 relies on the deductible choice model and the lognormality assumption As discussed

in the text by including an additional regressor ln([pi((xi)di) 1]d i) (with (xi) predicted from column 1 above) and normalizing its coefficient to 1 we obtaina structural interpretation of this regression with ln (ri) as the dependent variable Thus the reported coefficients are comparable to those estimated for the benchmarkmodel One should be cautious however in interpreting these coefficients Unlike the benchmark model this regression does not allow unobserved heterogeneity inrisk and suffers from some selection bias because observations with high predicted risk rate are omitted (which is why the number of observations is 94000 ratherthan the full sample of 105800) For comparison column 3 reports the marginal effects from a comparable probit regression that uses the full sample and does notcontrol for pricing and predicted risk through the additional structural regressor Column 3 does not have a structural interpretation and its (unreported) coefficientscannot be compared to those estimated from the benchmark model

760 THE AMERICAN ECONOMIC REVIEW JUNE 2007

and the risk rate (xi) as predicted from thePoisson regression of column 1 All three addi-tional regressors enter with the predicted signand with large and highly significant marginaleffects29

Finally column 2 of Table 3 presents animportant specification of the probit regres-sion in which ln([pi(( xi)di) 1]d i) isadded as a regressor and its coefficient isnormalized to 130 As already mentioned ifwe assume that there is no unobserved heter-ogeneity in risk then column 2 is analogousto the ln(ri) equation of the benchmark modelThis restriction of the model is rejected by thedata About 10 percent of the individuals arepredicted to have (xi) (pi di) implying achoice of low deductible for any level of riskaversion Many of these individuals howeverstill choose higher deductible levels Column2 reports the results for the remaining indi-viduals (94000 out of 105800) for which theregression can be run While the signs of theestimated coefficients are similar to those inthe benchmark model presented below therestricted version of the model suggests muchhigher levels and dispersion of risk aversionwell above any reasonable level31 The fullestimation of the benchmark model clearly re-jects this restriction on the model

B Estimation of the Benchmark Model

The Effect of Individual Characteristics onRisk AversionmdashTable 4 presents the estimationresults from the benchmark model The secondcolumn shows how the level of absolute riskaversion is related to individual characteristicsAs the dependent variable is in natural loga-

rithm coefficients on dummy variables can bedirectly interpreted as approximate percentagechanges The vast majority of these coefficientsare quite stable across a wide range of specifi-cations which are mentioned later

The results indicate that women are more riskaverse than men and have a coefficient of ab-solute risk aversion about 20 percent greaterthan that of men These results are consistentwith those of Donkers et al (2001) and Hartoget al (2002) The estimated effect of age sug-gests a nonmonotone pattern of risk preferencesover the life cycle The estimated coefficientsimply that initially (that is at age 18 the youngestindividual in the data) individuals become less riskaverse with age but around the age of 48 indi-viduals start becoming more risk averse32 Mar-ried individuals are estimated to be significantlymore risk averse compared to singles while di-vorced individuals are less (although the coeffi-cient is statistically insignificant)

The analysis suggests that variables that arelikely to be correlated with income or wealthsuch as post-high-school education and thevalue of the car have a positive coefficientindicating that wealthier people have higherlevels of absolute risk aversion Although we donot have data on individualsrsquo income or wealthwe use other proxies for income in additionalspecifications and obtain mixed results Whenwe include as a covariate the average householdincome among those who live in the same zipcode we obtain a significant and negative co-efficient of 0333 (0154) (standard deviationin parentheses) When we match zip code in-come on demographics of the individuals thecoefficient is effectively zero 0036 (0047)and when we use a socioeconomic index ofthe locality in which the individual lives thecoefficient is positive 0127 (0053)33 Thus

29 The estimated marginal effect (z-statistic in parenthe-ses) is 0352 (1376) 16 104 (1481) and 0154(255) for (pidi) d i and (xi) respectively

30 The level is a normalization The sign is estimatedHad the sign on this regressor been positive this would haveimplied a rejection of the model

31 For the implied median level of risk aversion the re-stricted model produces a similar estimate to the estimate wereport below for the benchmark model However since someindividuals who chose the low deductible are estimated to havevery low claim rates the restricted model is ldquoforcedrdquo to estimatevery high risk aversion for these individuals (in contrast thebenchmark model can explain such choices by positive risk re-siduals) resulting in very high dispersion and (due to the lognor-mality assumption) very high average risk aversion which isabout 1025 higher than the benchmark estimates we report below

32 This nonmonotone pattern may explain why age enterswith different signs in the estimation results of Donkers et al(2001) and Hartog et al (2002) A somewhat similar U-shapepattern with respect to age is also reported by Sumit Agarwalet al (2006) in the context of consumer credit markets

33 The full results from these specifications are providedin the online Appendix (available at httpwwwe-aerorgdatajune0720050644_apppdf ) The reason we do not usethese income variables in the benchmark specification istheir imperfect coverage which would require us to omitalmost 20 percent of the individuals Other results fromthese regressions are similar to those we report for thebenchmark specification

761VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

overall we interpret our findings as suggestiveof a nonnegative and perhaps positive associ-ation between incomewealth and absolute riskaversion

At first glance these results may appear tobe inconsistent with the widely held beliefthat absolute risk aversion declines withwealth One should distinguish however be-tween two questions (a) whether for a givenindividual the vNM utility function exhibitsdecreasing absolute risk aversion and (b)how risk preferences vary across individuals

Our results do not speak to the first questionand should not be thought of as a test of thedecreasing absolute risk aversion propertyTesting this property would require observingthe same individual making multiple choicesat different wealth levels Rather our resultsindicate that individuals with greater wealthhave utility functions that involve a greaterdegree of risk aversion It might be thatwealth is endogenous and that risk aversion(or unobserved individual characteristics thatare correlated with it) leads individuals to

TABLE 4mdashTHE BENCHMARK MODEL

Variable Ln() equation Ln(r) equation Additional quantities

Demographics Constant 15406 (00073) 118118 (01032) Var-covar matrix ()

Age 00001 (00026) 00623 (00213) 01498 (00097)Age2 624 106 (263 105) 644 104 (211 104) r 31515 (00773)Female 00006 (00086) 02049 (00643) 08391 (00265)Family Single Omitted Omitted

Married 00198 (00115) 01927 (00974) Unconditional statisticsa

Divorced 00396 (00155) 01754 (01495) Mean 02196 (00013)Widower 00135 (00281) 01320 (02288) Median 02174 (00017)Other (NA) 00557 (00968) 04599 (07397) Std Dev 00483 (00019)

Education Elementary 00194 (00333) 01283 (02156) Mean r 00019 (00002)High school Omitted Omitted Median r 727 106 (756 107)Technical 00017 (00189) 02306 (01341) Std Dev r 00197 (00015)College 00277 (00124) 02177 (00840) Corr(r ) 02067 (00085)Other (NA) 00029 (00107) 00128 (00819)

Emigrant 00030 (00090) 00001 (00651) Obs 105800

Car attributes Log(value) 00794 (00177) 07244 (01272)Car age 00053 (00023) 00411 (00176)Commercial car 00719 (00187) 00313 (01239)Log(engine size) 01299 (00235) 03195 (01847)

Driving License years 00015 (00017) 00157 (00137)License years2 183 105 (351 105) 148 104 (254 104)Good driver 00635 (00112) 00317 (00822)Any driver 00360 (00105) 03000 (00722)Secondary car 00415 (00141) 01209 (00875)Business use 00614 (00134) 03790 (01124)History length 00012 (00052) 03092 (00518)Claims history 01295 (00154) 00459 (01670)

Young driver Young driver 00525 (00253) 02499 (02290)Gender Male Omitted mdash

Female 00355 (00061) mdashAge 17ndash19 Omitted mdash

19ndash21 00387 (00121) mdash21ndash24 00445 (00124) mdash24 00114 (00119) mdash

Experience 1 Omitted mdash1ndash3 00059 (00104) mdash3 00762 (00121) mdash

Company year First year Omitted OmittedSecond year 00771 (00122) 14334 (00853)Third year 00857 (00137) 28459 (01191)Fourth year 01515 (00160) 38089 (01343)Fifth year 04062 (00249) 39525 (01368)

Note Standard deviations based on the draws from the posterior distribution in parentheses Significant at the 5 percent confidence levela Unconditional statistics represent implied quantities for the sample population as a whole ie integrating over the distribution of covariates in the sample (as

well as over the unobserved components)

762 THE AMERICAN ECONOMIC REVIEW JUNE 2007

save more to obtain more education or totake other actions that lead to greaterwealth34

Let us make several additional observationsFirst while owners of more expensive cars ap-pear to have both higher risk exposure andhigher levels of risk aversion owners of biggercars have higher risk exposure but lower levelsof risk aversion This should indicate that thestructure of the model does not constrain therelationship between the coefficients in the twoequations Rather it is the data that ldquospeak uprdquoSecond individuals who are classified by theinsurer as ldquogood driversrdquo indeed have lowerrisk but also appear to have lower risk aversionThis result is somewhat similar to the positivecorrelation between unobserved risk and unob-served risk aversion which we report belowThird policyholders who tend to use the car forbusiness are less risk averse This could bebecause uninsured costs of accidents occurringto such policyholders are tax deductible Fourthpolicyholders who reported three full years ofclaims history are more risk averse but are notdifferent in their risk exposure The attitude thatleads such policyholders to comply with therequest to (voluntarily) report three full years ofclaims history is apparently and not surpris-ingly correlated with higher levels of risk aver-sion In contrast while past claims indicate highrisk they have no significant relationship withrisk aversion Finally we find a strong trendtoward lower levels of risk aversion over timeThis is a replication of the probit results dis-cussed earlier

The Effect of Individual Characteristics onClaim RiskmdashThe first column of Table 4 de-scribes the relationship between observablesand risk exposure Accident risk is higher fordivorced individuals and lower for people witha college education Bigger older more expen-sive and noncommercial cars are all morelikely to be involved in an accident Drivingexperience reduces accident rates as do mea-sures of less intense use of the car while youngdrivers are more exposed to risk Claim propen-sity is highly correlated over time the voluntaryreport of past claims is a strong predictor offuture claims This risk equation produces re-sults that are similar to those of the simplerPoisson regression reported in Table 3 Al-though some of the coefficients lose signifi-cance the magnitude of most coefficients isquite similar The similarity between these twosets of results is to be expected as the riskregression is identified primarily from the dataon claims so incorporating the information ondeductible choice does not qualitatively changethe conceptual identification strategy (see Sec-tion IIC) If the results were not similar thiscould have indicated a misspecification of themodel The slight differences between the riskregressions in Table 3 and Table 4 are drivenmainly by the structural assumptions First thebenchmark model estimates a lognormal mix-ture of Poisson rates rather than a single Pois-son model By incorporating the fatter tails ofthe claim distribution it slightly changes theresults increases the standard errors and de-creases the average predicted claim rate Sec-ond the information on deductible choiceslightly helps us in obtaining more precise es-timates through the correlation structure be-tween the error terms in the two equations

The Implied Level of Risk AversionmdashOne ofthe main goals of the estimation is to obtainmeasures of the level of risk aversion in thepopulation we study Since we use Gibbs sam-pling and augment the latent coefficients ofabsolute risk aversion we can directly obtainthe posterior distribution of various moments ofthis distribution At each iteration of the Gibbssampler we compute the mean standard devi-ation (across individuals) and various percen-tiles of the simulated draws of i and ri and thecorrelation between them The far-right columnof Table 4 reports the averages and standard

34 One may be tempted to interpret the positive wealtheffects as an indirect indication of credit constraints wealth-ier individuals are less credit constrained and therefore canafford to purchase more insurance We do not share thisinterpretation for two reasons First the insurance companyobserves these proxies for wealth and conditions on themwhen setting prices Since the willingness to pay for insur-ance is likely to be correlated with the willingness to pay forthe additional insurance provided by the low deductibleoption premiums already reflect this variation We condi-tion on the level of the premium Second paying less exante implies paying more ex post so applying the creditconstraint argument only to the ex ante payment but not tothe probabilistic ex post deductible payments has no theo-retical foundation Essentially the setup of the model linksthe ex ante decisions to the ex post losses which are bothdriven by the curvature of the vNM utility function

763VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

deviations of these computed quantities over theiterations of the Gibbs sampler35 The impliedrisk aversion of the mean individual is 00019which is about seven times greater than theback-of-the-envelope calculation presented inthe previous section As we assume a lognormal

distribution and estimate a relatively high dis-persion coefficient r the estimates also implysignificantly lower values for the median levelof risk aversion As shown later the qualitativepattern of these results is quite robust acrossspecifications

Table 5 summarizes our findings regardingthe level and dispersion of risk aversion Itpresents two ways to interpret the estimates weobtained from the benchmark model as well ascomparisons to specifications with constant ab-solute risk aversion (CARA) utility and with

35 Note that these estimated quantities are unconditionalIn computing these quantities we integrate over the distri-bution of observable characteristics in the data so onecannot compute these estimates from the estimated param-eter directly

TABLE 5mdashRISK-AVERSION ESTIMATES

Specificationa Absolute risk aversionb Interpretationc Relative risk aversiond

Back-of-the-envelope 10 103 9070 1484Benchmark model

Mean individual 67 103 5605 972225th percentile 23 106 9998 003Median individual 26 105 9974 03775th percentile 29 104 9714 42790th percentile 27 103 7834 390295th percentile 99 103 4937 14327

CARA utilityMean individual 31 103 7651 4436Median individual 34 105 9966 050

Learning modelMean individual 42 103 6886 6140Median individual 56 106 9995 008

Comparable estimatesGertner (1993) 31 104 9699 479Metrick (1995) 66 105 9934 102Holt and Laury (2002)e 32 102 2096 86575Sydnor (2006) 20 103 8329 5395

a The table summarizes the results with respect to the level of risk aversion ldquoBack-of-the-enveloperdquo refers to thecalculation we report in the beginning of Section III ldquobenchmark modelrdquo refers to the results from the benchmark model(Table 4) ldquoCARA utilityrdquo refers to a specification of a CARA utility function and ldquolearning modelrdquo refers to a specificationin which individuals do not know their risk types perfectly (see Section IIID) The last four rows are the closest comparableresults available in the literature

b The second column presents the point estimates for the coefficient of absolute risk aversion converted to $US1 unitsFor the comparable estimates this is given by their estimate of a representative CARA utility maximizer For all otherspecifications this is given by computing the unconditional mean and median in the population using the figures we reportin the mean and median columns of risk aversion in Table 6 (multiplied by 352 the average exchange rate during the periodto convert to US dollars)

c To interpret the absolute risk aversion estimates (ARA) we translate them into x u(w) 1frasl2 u(w 100) 1frasl2 u(w x) That is we report x such that an individual with the estimated ARA is indifferent about participating in a 50ndash50lottery of gaining 100 US dollars and losing x US dollars Note that since our estimate is of absolute risk aversion the quantityx is independent of w To be consistent with the specification we use a quadratic utility function for the back-of-the-envelopebenchmark and learning models and use a CARA utility function for the others

d The last column attempts to translate the ARA estimates into relative risk aversion We follow the literature and do soby multiplying the ARA estimate by average annual income We use the average annual income (after tax) in Israel in 1995(51168 NIS from the Israeli Census) for all our specifications and we use average disposable income in the United Statesin 1987 ($15437) for Gertner (1993) and Metrick (1995) For Holt and Laury (2002) and Sydnor (2006) we use a similarfigure for 2002 ($26974)

e Holt and Laury (2002) do not report a comparable estimate The estimate we provide above is based on estimating aCARA utility model for the 18 subjects in their experiment who participated in the ldquo90rdquo treatment which involved stakescomparable to our setting For these subjects we assume a CARA utility function and a lognormal distribution of theircoefficient of absolute risk aversion The table reports the point estimate of the mean from this distribution

764 THE AMERICAN ECONOMIC REVIEW JUNE 2007

incomplete-information (discussed later) and toother comparable figures in the literature Ourbenchmark estimate suggests that an averagequadratic utility maximizer36 will be indifferentabout participating in a 50ndash50 lottery in whichhe gains $100 or loses $5605 A CARA spec-ification suggests a significantly lower value foraverage risk aversion making the mean individ-ual indifferent about a 50ndash50 lottery of gaining$100 or losing $7651 The results from theincomplete information model for the meanlevel of risk aversion are in between these esti-mates All the results suggest that although themean individual exhibits a significant level ofrisk aversion heterogeneity in risk preferencesis important and the median individual is al-most risk neutral with respect to lotteries of$100 magnitude Although the mean is alwaysgreater than the median under the lognormaldistribution the large difference we find is notimposed In principle we could have obtained ahigh level of risk aversion with less heteroge-neity thereby leading to a smaller differencebetween the mean and the median (the esti-mated distribution of risk types is an example)

Let us briefly discuss the relevance of thecomparison to Gertner (1993) and Metrick(1995) There are two ways one can reconcilethe differences between the estimates Firstboth these papers measure risk aversion fortelevision game show participants these arehighly selected groups in a rather ldquorisk-friendlyrdquo environment37 Second the magni-tudes of the stakes are higher The showparticipants bet on several thousand dollars andmore while our average individual risks muchlower stakes in the range of $100 Thus thedifference in the results may be due to the issuesraised in Rabin (2000) regarding the compara-bility of behavior across different contexts andbet sizes We discuss this further in Section IV

A different way to quantify our estimate is byreporting them in relative terms There is noconsensus in the literature as to the relevantwealth that is taken into account in such deci-

sions Therefore for comparability we followthe closest papers in this respect (eg Gertner1993) and use annual income as the relevantwealth We multiply the estimated coefficient ofabsolute risk aversion by the average annualincome in Israel during the observation periodUnder the (questionable) assumption that an-nual income is a good proxy for the relevantwealth at the time of decision making thisproduct would be a proxy for the coefficient ofrelative risk aversion As Table 5 indicates ourbenchmark specification results in an impliedcoefficient of relative risk aversion of about 97A CARA specification results in a lower coef-ficient of 44 On the other hand the medianestimate for relative risk aversion is well belowone Thus the widely used estimate of a lowsingle-digit coefficient of relative risk aversionfalls somewhere between the median and themean and between the median and the seventy-fifth percentile of the risk aversion distribution

The Relationship between Unobserved Riskand Unobserved Risk AversionmdashTable 4 al-lows us to make observations about the relation-ship between risk and risk aversion We firstdiscuss the relative importance of unobservedheterogeneity of both dimensions In the popu-lation we study unobserved heterogeneity inrisk aversion (r) is much greater than unob-served heterogeneity in risk () This is trueboth in absolute terms (315 compared to 015respectively) and after normalizing by the cor-responding mean level38 using the coefficientof variation as a measure of dispersion (027compared to 01 respectively) It is also true forthe overall unconditional dispersion This couldindicate that selection on risk aversion is moreimportant in our data than adverse selection39 Theright metric to use for such statements is not

36 For such an individual the second-order Taylor ex-pansion we use in Section IIA is exact

37 We suspect that individuals who participate in televi-sion game shows are more adventuresome than the generalpopulation Moreover knowing that the audience mightwish to see them keep betting is likely to further encourageparticipants to take risks

38 All covariates are measured in deviations from theirsample mean so the estimated constant in each equation isthe estimated mean of the left-hand-side variable

39 An additional observation is that given our estimatesobservables account for slightly less than 50 percent of thevariation in ln(ri) but for almost 65 percent of the variation inln(i ) This may seem surprising given the finding that disper-sion in risk aversion is more important and thus should be thefocus of insurance companies However this finding is con-sistent with the conventional wisdom that insurance companiesspend much effort and resources on collecting information thathelps in risk classification but only little effort on informationthat predicts willingness to pay

765VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

entirely clear however as one should projectthese estimated variances onto the same scaleof say willingness to pay or profits Profits forexample are affected directly by risk but not byrisk aversion so the comparison above could bemisleading Therefore we relegate the discus-sion of this issue to Section IIIF where weshow that even when we look at pricing andprofits heterogeneity in risk aversion is moreimportant

Table 4 also indicates a strong and signif-icant positive correlation of 084 between un-observed risk aversion and unobserved riskThis result may be surprising because it isnatural to think that risk aversion with respectto financial decisions is likely to be associatedwith a greater tendency to take precautionsand therefore with lower risk Indeed a recentpaper by Finkelstein and McGarry (2006)supports such intuition by documenting anegative correlation between risk aversionand risk in the market for long-term careinsurance (see also Israel 2005 for the autoinsurance market in Illinois) Our marketmight however be special in ways that couldproduce a positive correlation First in con-trast to most insurance markets where apolicyholderrsquos risk depends on the policy-holderrsquos precautions but not on the precau-tions of others accident risk in the autoinsurance market is a result of an interactionbetween onersquos driving habits and those ofother drivers Second the correlation coeffi-cient may be highly sensitive to the particularway we measure risk and risk aversion Thereare many unobserved omitted factors that arelikely to be related to both dimensions Theintensity of vehicle use for example mightbe an important determinant of risk If indi-viduals who are more risk averse also drivemore miles per year a positive correlationbetween risk and risk aversion could emergeThus our results caution against assumingthat risk and risk aversion are always nega-tively correlated Whether this is the case maydepend on the characteristics of the particularmarket one studies and on the particular mea-sure for risk Indeed one can use estimatedannual mileage to control for one omittedvariable that may potentially work to producea positive correlation between risk aversionand risk Despite its partial coverage in thedata and being considered (by the company)

as unreliable40 controlling for annual mileagereported by policyholders reduces the esti-mated correlation coefficient to 06841 Weview this result as consistent with the possi-bility that underlying unobserved factors thataffect risk play an important role in generat-ing the estimated positive correlation betweenrisk and risk aversion A third explanation forthe positive estimated correlation is the dis-tributional assumption As discussed in Sec-tion IIC the correlation coefficient isprobably the parameter that is most sensitiveto these assumptions Indeed as discussedlater when we change our assumptions aboutthe Poisson process and use an alternativedistribution with extremely thin tails the es-timated correlation coefficient reverses signsWe of course view the assumptions of thebenchmark model as more appropriate andtherefore maintain the view that the data sug-gest a positive correlation Finally note thatwhile the correlation parameter we estimate ishigh the implied unconditional correlationbetween risk and risk aversion is less than025 across all reported specifications This isbecause the coefficients on the same covariate(for example the size of the car or whetherthe car is used for business) often affect riskand risk aversion in opposite directions andbecause of the logarithmic transformation

C Stability of the Risk Aversion Coefficients

Estimating risk preferences is motivated bythe idea that the same (or similar) risk aversionparameter may explain risky decisions acrossmultiple contexts This idea is at the heart of thecurrent debate we mention in the introductionregarding the empirical relevance of expectedutility and whether the standard construct of a

40 Insurance companies typically do not use these self-reported mileage estimates as they are considered unreli-able While companies could verify these estimates at thetime of a claim such reports are hard to enforce Anindividual can always claim that her ex ante estimate waslower than it turned out to be Indeed the estimated elas-ticity of risk with respect to estimated mileage is 006(0012) which seems low suggesting a bias downwardpotentially due to ldquoerrors in variablesrdquo bias

41 The full results from this specification are providedin the online Appendix Other results from this specifi-cation are similar to those we report for the benchmarkspecification

766 THE AMERICAN ECONOMIC REVIEW JUNE 2007

single risk aversion parameter that appliesacross many types of decisions is the right wayto think about consumer behavior To the bestof our knowledge there is no direct empiricalevidence on this issue While we defer much ofthis discussion to the concluding section weverify below that at least over a limited set ofchoices in which we observe the same individ-uals the estimated risk preferences help to ex-plain choices over time and across differentcontexts

Stability across ContextsmdashIdeally onewould like to show that the estimated risk pref-erences are stable across multiple lines of insur-ance and perhaps even across other riskydecisions individuals face Absent data on suchchoices we provide evidence that the estimatedrisk aversion coefficients help in predictingother related risky choices Individuals in ourdata had to make three additional coveragechoices in addition to the main policy choicethat we have analyzed so far They had tochoose whether to purchase coverage for caraudio equipment (bought by 498 percent of thesample of new policyholders) for towing ser-vices (bought by 929 percent) and for wind-shield damage (bought by 954 percent) Thesetypes of coverage are sold by the same companybut are administered by a third party so we donot have information about their realizationTherefore we cannot perform a similar exerciseto the one we perform for the deductible choiceOf course one should not expect these addi-tional coverage choices to be perfectly corre-lated with the deductible choice Even if thesame risk preferences are an important factorfor all of these decisions variation in risk acrosscoverage is also important For example own-ership of an expensive car audio system is likelyto affect the purchase of audio coverage inde-pendent of risk preferences Similarly ownersof old cars may value more coverage for towingservices Risk preferences however should en-ter into all of these coverage choices and there-fore we expect these additional types ofcoverages to be positively correlated with thechoice of a low deductible

We coded these three additional coveragechoices as three dummy variables and use themas additional covariates to verify that they canhelp explain unobserved risk aversion Firstwe add these variables to the probit regression

reported in column 3 of Table 3 They all havea positive statistically and economicallysignificant power in predicting the deductiblechoice42 To verify that these reported correla-tions are not driven by correlation in risks wealso estimate the benchmark model with theseadditional variables as covariates The esti-mated coefficients on the three additional cov-erage choices are positive and two of the threeare significant in both the risk and risk aversionequations they are 0053 (0019) 00002 (0021)and 0057 (0023) in the risk equation and0354 (0064) 0123 (0122) and 0813 (0174)in the risk aversion equation for audio towingand windshield coverage respectively (standarddeviations in parentheses)43 This suggests thatthe estimated risk preferences help in explainingmultiple coverage choices across these relatedcontexts

Stability over TimemdashWe now provide evi-dence that the estimated risk preferences arealso stable over time All the individuals whodecide to renew their policy with the companyafter it expires are at least in principle free tochange their deductible choice However anoverwhelming majority of individuals (morethan 97 percent) do not change their deductiblechoices when they renew Even individuals whoinitially chose the rarely chosen ldquohighrdquo andldquovery highrdquo deductible levels typically do notchange their choices Therefore it is also notsurprising that estimating the model on the firstdeductible choice of individuals and the seconddeductible choice of the same individuals yieldssimilar results as shown at the end of the nextsection Of course changes in deductiblechoices do not necessarily imply changes inrisk preferences Risk may also change overtime and may drive such changes At thesame time while persistence in deductiblechoices over time is consistent with stablerisk preferences it may also be driven bymany other factors For example it is reason-able to think that individuals do not devote

42 They are estimated to increase on average the prob-ability of choosing a low deductible by 0019 (819) 0023(527) and 0041 (758) for audio towing and windshieldcoverage respectively (z stat in parentheses)

43 The full results from this specification are provided in theonline Appendix Other results from this specification aresimilar to those we report for the benchmark specification

767VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

the same amount of thought to their renewaldecision as to their initial choice leading tothe observed persistence This caveat is themain reason why we focus most of the analysisonly on new policies

D Robustness

This section discusses the robustness of themain results Results from all specifications wemention in this section are summarized in Table6 The full results from these specifications arerelegated to the online Appendix

The von NeumannndashMorgenstern Utility Func-tionmdashTo derive the benchmark model we as-sume a negligible third derivative of the vNMutility function This is attractive as it allowsus to summarize risk preferences by a one-dimensional parameter r [u13(w)u(w)]There is however a large literature (eg MilesS Kimball and N Gregory Mankiw 1989) em-phasizing the importance of a (positive) third

derivative of the utility function which leads toprecautionary saving It is therefore importantto address the sensitivity of the results to thisassumption

By allowing a third-order Taylor approxima-tion of equation (4) we obtain

(15) r u13w

uw

p

d 1

d

uw

uw

dh2 dh dl dl

2

6d

which reduces to equation (7) when u(w) 0This illustrates that a positive third derivativeprovides an additional (precautionary) incentiveto insure It also shows that in order to fullydescribe preferences in the presence of a thirdderivative a one-dimensional parameter doesnot suffice one needs to know both r [u13(w)u(w)] and [u(w)u(w)] This makesit less attractive for estimation

TABLE 6mdashROBUSTNESS

Specificationa Sample Obs

Claim risk () Absolute risk aversion (r)

Corr(r ) Mean Median Std dev Mean Median Std dev

Baseline estimatesBenchmark model All new customers 105800 0220 0217 0048 19 103 73 106 0020 0207 0839

The vNM utility functionCARA utility All new customers 105800 0219 0217 0048 87 104 b 98 106 b 0009b 0201 0826

The claim generating processBenchmark model No multiple claims 103260 0182 0180 0040 20 103 28 105 0018 0135 0547Thinner-tail risk distribution All new customers 105800 0205c 0171c 0155c 17 103 19 106 0020 0076 0916

The distribution of risk aversionLower-bound procedure All new customers 105800 mdash mdash mdash 37 104 0 0002 mdash mdash

Incomplete information about riskBenchmark model Experienced drivers 82966 0214 0211 0051 21 103 83 106 0021 0200 0761Benchmark model Inexperienced drivers 22834 0230 0220 0073 30 103 12 107 0032 0186 0572Learning model All new customers 105800 0204 0191 0084 12 103 16 106 0016 0200 0772

Sample SelectionBenchmark model First two years 45739 0244 0235 0066 31 103 26 105 0026 0225 0699Benchmark model Last three years 60061 0203 0201 0043 16 103 34 107 0021 0113 0611Benchmark model Referred by a friend 26434 0213 0205 0065 30 103 84 107 0031 0155 0480Benchmark model Referred by advertising 79366 0219 0216 0051 21 103 76 106 0022 0212 0806Benchmark model Non-stayers 48387 0226 0240 0057 23 103 77 107 0026 0149 0848Benchmark model Stayers 1st choice 57413 0190 0182 0057 29 103 29 105 0024 0152 0463Benchmark model Stayers 2nd choice 57413 0208 0200 0065 30 103 16 105 0026 0211 0637

Note The table presents the key figures from various specifications and subsamples tracing the order they are presented in Section IIID Full results (in the format

of Table 4) from all these specifications are available in the online Appendixa ldquoBenchmark modelrdquo refers to the benchmark specification estimated on various subsamples (the first row replicates the estimates from Table 4) The other

specifications are slightly different and are all described in more detail in the corresponding parts of Section IIIDb The interpretation of r in the CARA model takes a slightly different quantitative meaning when applied to noninfinitesimal lotteries (such as the approximately

$100 stakes we analyze) This is due to the positive third derivative of the CARA utility function compared to the benchmark model in which we assume a small

third derivative Thus these numbers are not fully comparable to the corresponding figures in the other specificationsc The interpretation of in the thinner-tail distribution we estimate is slightly different from the standard Poisson rate which is assumed in the other specifications

Thus these numbers are not fully comparable to the corresponding figures in the other specifications

768 THE AMERICAN ECONOMIC REVIEW JUNE 2007

We can allow for a positive third derivativewithout expanding the dimensionality of pref-erences by imposing a parametric functionalform on the utility function Essentially suchparametric form imposes a relationship between[u13(w)u(w)] and [u(w)u(w)] Two standardforms are those that exhibit CARA and thosethat exhibit constant relative risk aversion(CRRA) CRRA requires us to make additionalassumptions about the relevant (and unob-served) wealth level of each individual makingit less attractive The CARA case is imple-mented below

With CARA utility we substitute u(w) exp(rw) in equation (4) and rearrange toobtain

(16) rp

exprdh exprdl

This equation defines the indifference set Unlikethe benchmark model there is no closed-formrepresentation for ri() a property that makesestimation significantly slower Due to the precau-tionary saving incentive arising from the thirdderivative we can use a lower level of absoluterisk aversion to rationalize the low deductiblechoice given In other words the CARA indif-ference set is flatter in comparison to the bench-mark case depicted in Figure 2 Thus the CARAspecification will generally lead to lower estimatesof the coefficient of absolute risk aversion asshown in Table 6 The mean and dispersion ofestimated absolute risk aversion (but not the me-dian) are smaller by about a factor of two Thegeneral qualitative results however remain thesame As shown in Table 5 the mean individual isquite risk averse with respect to lotteries of $100magnitude the median is close to risk neutral withrespect to such lotteries and heterogeneity in riskaversion is important

The Claim-Generating ProcessmdashIn thebenchmark model we assume that claims aregenerated by a Poisson model This assumptionis both analytically attractive (it is important inderiving equation (4)) and in our view a rea-sonable approximation of the data-generatingprocess for claims The Poisson distribution hasbeen widely used in economics to model acci-dent counts (eg Nancy L Rose 1990 Dionneand Vanasse 1992 Ronald Michener and CarlaTighe 1992) In using this distribution re-

searchers have followed the long history of theuse of Poisson by actuaries and insurance spe-cialists going back to Ladislaus Bortkiewicz(1898)

An important restriction of the Poisson dis-tribution is that its mean and variance are thesame Although some economic studies con-firmed this assumption for particular accidentdata (eg Christopher M Auld et al 2001) it isoften the case that this restriction is falsifiedThe most common deviation from the Poissonrestriction is that of fat tails ie variance that ishigher than the mean This led researchers touse a negative binomial distribution to accom-modate this regularity by introducing a secondparameter which delinks the relationship be-tween the mean and the variance One naturalinterpretation of the fat tails is that of unob-served heterogeneity and the negative binomialdistribution can be viewed as a Gamma mixtureof Poisson processes Consistent with this viewwe assume that the claim-generating processfollows a Poisson process at the individuallevel but allows unobserved heterogeneity inrisk and estimates a lognormal mixture of Pois-son processes which is similar to a negativebinomial The dispersion parameter we esti-mate is a free parameter which is identi-fied by the fatness of the tails of the claimdistribution

While one would like to allow fat tails of theaggregate claim distribution one may criticizethe assumption of a Poisson distribution at theindividual level for having tails that may be ldquotoofatrdquo Recently Jaap H Abbring et al (2003) andMark Israel (2004) provided evidence for neg-ative state dependence in data from auto insur-ance claims similar to the ones we useControlling for heterogeneity across individu-als these papers show that a second claim isless likely than a first This may happen due toexperience rating or perhaps to more carefuldriving and less intense use of a car after anaccident The Poisson distribution assumes thatthe second accident is just as likely so negativestate dependence may suggest thinner tails atthe individual level

To verify that the main results are not sensi-tive to this restriction of the Poisson model weperform two tests First we estimate the bench-mark model on a sample that includes onlyindividuals with one or no claims The model isstill identified using variation in policy durations

769VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

The estimates of the level of risk aversion andits dispersion across individuals remain similarto those of the benchmark model (Table 6) Theestimates of risk and the correlation with riskaversion are significantly lower But this isexpected we selected out of the sample thehigh-risk individuals

As a further test for the sensitivity of theresults to possible negative state dependencewe estimated the model with a different distri-bution at the individual level which gives riseto much thinner tails We could not find anyknown distribution of count variables that givesrise to tails thinner than Poissonrsquos so we madeone up In particular for a given Poisson ratelet pn be the probability of observing n occur-rences The distribution we take to the data issuch that

(17) Prn pn2

yenm0 pm2

Thus this distribution makes the probability ofmultiple claims much lower probably muchmore so than any realistic negative state depen-dence in the data Essentially such a distribu-tion makes the econometrician view individualswith multiple claims as high-risk individualswith much more certainty as it requires muchmore ldquobad luckrdquo for low-risk individuals tohave multiple claims The estimates for thelevel of risk aversion and its dispersion arealmost the same as in the benchmark model(Table 6) The rest of the summary figures aredifferent The difference in the level of risk isnot informative the interpretation of isslightly different given the change in the un-derlying distribution The dispersion in ismuch higher This is a direct result of the thin-tail assumption The dispersion of is identifiedby the tails of the claim distribution at theaggregate level With thinner tails imposed onthe individual distribution more heterogeneityis needed to match the observed tails at theaggregate Finally the correlation coefficient inthis specification changes signs and is now neg-ative and close to 1 This is consistent withour discussion in Section IIC which emphasizesthat the identification of the correlation coeffi-cient is closely tied to the structural assump-tions Once these are changed the estimatedcorrelation coefficient would change too Most

importantly however the mean and dispersionof risk aversion are stable

The Distribution of Absolute Risk Aver-sionmdashIn the benchmark model we assume thatthe coefficient of absolute risk aversion is lognor-mally distributed across individuals Since onlyfew previous studies focused on heterogeneity inrisk preferences there is not much existing evi-dence regarding the distribution of risk prefer-ences The only evidence we are aware of is theexperimental results presented by Steffen Andersenet al (2005) which show a skewed distributionwith a fat right tail which is qualitatively consis-tent with the lognormal distribution we assume

An additional advantage of the normality as-sumption is computational It provides a closed-form conditional distribution allowing us to useonly univariate (rather than bivariate) draws inthe estimation procedure significantly reducingthe computational burden One may be con-cerned about the sensitivity of the results to thisdistributional assumption For example it maydrive the result that the median level of riskaversion is much lower than the mean

Incorporating alternative distributional as-sumptions significantly complicates and slowsthe estimation procedure44 As an alternativewe develop a procedure that we believe pro-vides some guidance as to the sensitivity of theresults to this distributional assumption Thedisadvantage of the procedure is that it cannotaccount for adverse selection Since we foundthat adverse selection is not that important thisexercise is informative The exercise conveysthat the main qualitative results are not drivenby the (fat) tails of the lognormal distributionwe impose Rather they are driven by the highfraction of individuals who chose a low deduct-ible despite being of low risk The model im-plies that such individuals must have a fairlyhigh level of risk aversion

The exercise uses a Gibbs sampler to estimatethe lognormal distribution of i given the co-variates and the observed number of claims Ineach iteration of this Gibbs sampler conditional

44 A simple discrete-type distribution of risk aversion isnot well identified with the given data This is due to thenature of the exercise The distribution is identified bythe binary deductible choice and by the ldquolocalrdquo variation inthe pricing menu This variation is not enough to pin downthe discrete types accurately

770 THE AMERICAN ECONOMIC REVIEW JUNE 2007

on the data and the most recent draws for theindividual irsquos we compute lower bounds forthe level and dispersion of risk aversion (inaddition to generating the draws for ri from theconditional lognormal distribution) To deter-mine the lower bound for the level of risk aver-sion we compute

(18) r i i 0 if choicei 0

max0 pi

idi 113d i

if choicei 1

Namely we make ri as low as possible giventhe assumptions of the model We assume thatindividuals who chose a regular deductible arerisk neutral while individuals who chose a lowdeductible are just indifferent between the twodeductible levels (unless i is high enough inwhich case they are also assumed to be riskneutral) We then compute the average over therirsquos To compute the lower bound for the dis-persion we search for values of rirsquos that areconsistent with the assumptions of the modeland the observed data and that minimize thevariance of ri This turns out to be a simplesearch procedure that is linear in the number ofindividuals

The lower bound of the level of risk aversionthat we obtain from this exercise is 368 104

NIS1 (with standard deviation of 638 106)This level is five times lower than the analogouspoint estimate reported in Table 4 This trans-lates into indifference about participating in a50ndash50 lottery of gaining $100 and losing $885Thus it still suggests a significant level of riskaversion for lotteries of $100 magnitude Simi-larly the conversion to relative risk aversion asin Table 5 implies a relative risk aversion co-efficient of 1883 The result for the lowerbound of the dispersion is 189 103 (withstandard deviation of 816 105) which is tentimes lower than the benchmark estimate Thusthe coefficient of variation for absolute riskaversion declines only by two suggesting ahigh degree of heterogeneity Finally oneshould note that these bounds involve extremeassumptions and are computed separately andtherefore cannot be binding at the same timeThus the ldquocorrectrdquo estimates for both quantities

are likely to be higher closer to the results wereport for the benchmark model

Incomplete Information about RiskmdashThroughout we assume that individuals haveperfect information about their individual-specific objective risk type i This is a strongerassumption than we need Because expectedutility is linear in probabilities it suffices thatindividualsrsquo expected risk rate is equal to theirobjective risk rate ie i E(iIi) i wherei is a random variable representing individualirsquos perceived risk rate and Ii is individual irsquosinformation at the time of the coverage choiceNamely individuals could be uncertain abouttheir risk type but their point estimate has to becorrect

There are several channels through whichincomplete information may operate Let usconsider two such cases First suppose thatindividuals are correct but only on average iethat i i i where E(i) 0 The intuitionfor this case is similar to an ldquoerrors in variablesrdquomodel and in principle will result in an evenless important role for adverse selection Giventhat we find a relatively small effect of adverseselection this bias will not change this conclu-sion This may be even more pronounced ifcorr(i i) 0 which reflects a reasonableassumption of ldquoreversion to the meanrdquo ie thatan individualrsquos estimate of his risk type is someweighted average between his true risk type andthe average risk type of individuals who aresimilar (on observables) to him The conclusionmay go in the other way only if the mistakes goin the other direction according to which indi-viduals who are riskier than average believe thatthey are even more risky than they truly areThis we believe is less plausible

As a test for the sensitivity of the results tothe complete information assumption we per-form two exercises The first exercise estimatesthe benchmark model separately for experi-enced drivers and inexperienced drivers wherewe define an individual to be an experienceddriver if he has ten years or more driving expe-rience This is a conservative definition com-pared to those used in the literature (Chiapporiand Salanie 2000 Cohen 2005) The underlyingassumption consistent with evidence providedin Cohen (2005) is that drivers learn about theirown risk types as they gain more driving expe-rience but that most of this learning occurs

771VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

within the first ten years of driving Thus ourcomplete information model is a more appeal-ing description of experienced drivers Theresults for experienced drivers are almost iden-tical to the results from the benchmark specifi-cation (Table 6) suggesting that the mainresults are not driven by the fraction of individ-uals who are less likely to know their risk typesThe results for inexperienced drivers show asimilar pattern and magnitude Consistent withthe learning hypothesis however they show alarger dispersion in risk preferences which maybe due to incomplete information and there-fore to more random choices of deductiblelevels

The second exercise we perform is a morestructural version of the learning story whichallows even experienced drivers to have incom-plete information regarding their risk types Wedo so by estimating a different specification ofthe model that assumes that individuals areBayesian and update their information abouttheir own (stable) risk types over time usingonly information about the number of claimsthey make each year While we do not observecomplete claims histories of individuals wecan simulate such histories and integrate overthese simulations Thus individualsrsquo informa-tion would be related to their true type andwould be more precise with longer driving his-tories The full modeling and estimation detailsof this specification are provided in the Appen-dix We view this model as an extreme versionof incomplete information as there are manyother sources through which individuals maylearn about their own types and thereby havebetter information about their types than whatwe estimate them to have While the resultsimply that the levels of risk aversion and heter-ogeneity are lower than the benchmark esti-mates (Table 6) the order of magnitude andqualitative pattern are quite similar suggestingthat the main qualitative findings are robust tothe information structure

Sample SelectionmdashThere are two differentways to think about the contract selection pro-cess One possibility is that individuals firstselect an insurer based on advertisement wordof mouth or access to an agent Then individ-uals select the insurance contract from amongseveral contracts the selected insurer offersThis selection process is consistent with the way

the company executives view their business45

Another possibility is that individuals first col-lect information about all available insurancecontracts and then choose their most preferredone According to the industryrsquos conventionalwisdom individuals do not shop much acrossinsurers and therefore this selection processseems less important

The unique characteristics of direct insurersmay attract individuals who are more likely toexperiment with new ways of doing businessand may therefore be less risk averse than thegeneral population In Table 7 we compare thedemographics of our sample of policyholderswith those of the general Israeli populationThis comparison reflects a similar intuitioncompared with the general population our av-erage policyholder is slightly younger moreeducated more likely to be male and less likelyto be married or an immigrant This direction ofselection may also apply to unobserved riskpreferences thereby making our policyholderson average less risk averse than a representa-tive individual This suggests that the level ofrisk aversion that we find may be viewed as alower bound on the level of risk aversion in thegeneral Israeli population

One potential way to model sample selectionis to allow for an additional outside option to beselected For the vast majority of individuals weobserve the outside option is to purchase sim-ilar insurance from competing insurance agen-cies Unfortunately data on the structure ofcompeting contracts their prices and the waythey vary with individual characteristics are un-available This makes us uncomfortable to try tomodel sample selection as results from anysuch model will be driven by our assumptionsrather than by meaningful variation in the dataThe results are still meaningful for two reasonsFirst this is a large population accounting forabout 7 percent of all drivers in Israel Secondto the extent that our estimates suggest higherlevels of risk aversion than previously esti-mated and that the sample selection is likely to

45 As discussed in Section I the company was the firstdirect insurance provider and it offered significantly lowerpremiums than those offered by competitors due to signif-icant cost advantage In Section I we also discuss theliterature which emphasizes that choice of direct insurers isdriven primarily by nonmonetary ldquoamenitiesrdquo

772 THE AMERICAN ECONOMIC REVIEW JUNE 2007

bias these estimates downward the results arestill informative

To assess the magnitude of sample selectionin driving the main results we perform severaltests First we estimate the benchmark modelseparately for the first two years of the compa-nyrsquos operation during which the company wasthe only company to sell insurance directly andfor the remaining three years after additionaldirect insurers entered the market Second weestimate the benchmark model separately forindividuals who were referred to the companyby word of mouth and for those who heardabout the company through advertisement (pri-marily on television) The latter may havesearched more and might be more price sensi-tive Third we estimate the benchmark modelseparately for individuals who renewed theirpolicy with the company and for those who didnot renew It seems likely that individuals whodid not renew are less selected as most of themswitch back to regular insurers who insure themajority of the population Fourth we estimatethe model for the second deductible choicemade by those individuals who renew It couldbe argued that switching to other companies is

costly so outside options are not as important indriving deductible choices for those who renewAll these results are summarized in Table 6 andshow some important differences between thetwo groups within each pair of subsamples prob-ably reflecting selection For all groups howeverthe qualitative pattern of the results and the orderof magnitude and dispersion of risk aversion aresimilar to those of the full sample This is sugges-tive that correcting for sample selection is unlikelyto change the qualitative results

E Caveats

Moral HazardmdashThroughout our analysiswe abstract from moral hazard ie we assumethat i can vary across individuals but is invari-ant to the coverage choice There are two typesof moral hazard that may play a role in thiscontext First individuals with less coveragemay take greater precaution and drive morecarefully thereby reducing their claim risk rateSecond conditional on a claim event peoplewith higher deductibles are less likely to file aclaim there exists a range of claims for which

TABLE 7mdashREPRESENTATIVENESS

Variable Sampleb Populationc Car ownersd

Agea 4114 (1237) 4255 (1801) 4511 (1413)Female 0316 0518 0367Family Single 0143 0233 0067

Married 0780 0651 0838Divorced 0057 0043 0043Widower 0020 0074 0052

Education Elementary 0029 0329 0266High school 0433 0384 0334Technical 0100 0131 0165College 0438 0155 0234

Emigrant 0335 0445 0447

Obs 105800 723615 255435

a For the age variable the only continuous variable in the table we provide both the mean and the standard deviation (inparentheses)

b The figures are derived from Table 1 The family and education variables are renormalized so they add up to one weignore those individuals for whom we do not have family status or education level This is particularly relevant for theeducation variable which is not available for about half of the sample it seems likely that unreported education levels arenot random but associated with lower levels of education This may help in explaining at least some of the gap in educationlevels across the columns

c This column is based on a random sample of the Israeli population as of 1995 We use only adult population ieindividuals who are 18 years of age or older

d This column is based on a subsample of the population sample The data provide information about car ownership onlyat the household level not at the individual level Thus we define an individual as a car owner if (a) the household owns atleast one car and the individual is the head of the household or (b) the household owns at least two cars and the individualis the spouse of the head of the household

773VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

filing is profitable only under a low deductible(the literature often refers to this second effectas ldquoex post moral hazardrdquo)

To the extent that moral hazard exists abstract-ing from it will likely bias our estimates of riskaversion downward To see this note that adjust-ing behavior will help individuals to self-insureagainst uninsured costs Similarly ex post moralhazard would reduce the value of a low deduct-ible as in the event of a claim the gain from alower deductible would sometimes be less thanthe difference between the two deductible levelsBoth these effects will make a low deductible lessattractive requiring individuals to be even morerisk averse than we estimate in order to purchasemore coverage Below we discuss why in ourview abstracting from moral hazard is a reason-able approximation in this setting

It seems reasonable to conjecture that ceterisparibus insured individuals will drive less care-fully than uninsured ones It may also seemreasonable that the existence of a deductiblemay make individuals more careful about smalldamages to their car However when all choicesinclude a deductible and deductibles are similarin their magnitude it seems less likely thatdriving or care behavior will be affected46 Fi-nally to separately identify moral hazard onewould need another dimension of the data overwhich risk types remain fixed but coveragechoices vary exogenously (Israel 2004 Jaap HAbbring et al forthcoming)

We rely on the data to justify why we abstractfrom the second potential effect that of ex postmoral hazard Figure 3 presents data on theclaim amounts and shows that about 99 percentof the claims filed by policyholders with low-deductible policies were for amounts greaterthan the higher deductible level If the distribu-tion of amounts of potential claims does notvary with the deductible choice and if individ-uals file a claim for any loss that exceeds their

deductible level this suggests that 99 percent ofthe claims would have been filed under eitherdeductible choice making the assumption toabstract from moral hazard not very restrictive

Individuals however may choose not to filea claim even when the claim amount exceedsthe deductible level This may happen due toexperience rating which increases future insur-ance premiums These dynamic effects do notdepend on the deductible level at the time of theclaim so they simply enter in an additive wayUsing our data on individuals who renew theirpolicies with the company we can assess howbig the dynamic effects are These data showthat the price effect of a claim lasts for threeyears and is highest when an individual files hissecond claim within a year In such a case hewould face about a 20 percent increase in hisinsurance premium in the subsequent year 10percent in the year after and 5 percent in thethird year after the claim The regular premiumis about twice the regular deductible amount soan upper bound for the dynamic costs is 70percent of the regular deductible In most casesthe actual dynamic costs are much lower thanthis upper bound (the dynamic costs of say thefirst claim within a year are close to zero) Inaddition an individual can opt out of the con-tract and switch to a different insurance pro-vider This is likely to reduce his dynamic costsbecause in Israel unlike in the United Statesand many other countries there is no publicrecord of past claims Therefore insurance pro-viders can take full advantage of past recordsonly for their past customers For this reasonnew customers will of course face higher pre-miums than existing ones but the premium in-crease would not be as high as it would havebeen with the old insurance provider This isdue to the presence of ldquoinnocentrdquo new custom-ers who are pooled together with the switchers(Cohen 2003)47 Using 70 percent as a conser-vative upper bound Figure 3 shows that about93 percent of those claims filed by individualswith a low deductible would have also been46 This assumption is consistent with Cohen and Einav

(2003) who find no evidence for behavioral response tochanges in seat belt laws The following anecdotal obser-vation also supports this In an informal survey we con-ducted among our colleagues all of them were aware of adeductible in their auto insurance policy but fewer than 20percent knew its level This does not imply that 80 percentof our colleagues did not pay attention to their deductiblechoice at the time the choice was made It does implyhowever that their driving or care behavior cannot dependon the deductible level

47 New customers may voluntarily report their claimshistory to their new insurance provider Voluntary disclo-sure of past claims is as may be expected not truthful Ourdata suggest an unconditional claim rate of 02453 in oursample population Our data on claims history as voluntar-ily disclosed by the same individuals suggest a claim rate of00604 which is four times lower

774 THE AMERICAN ECONOMIC REVIEW JUNE 2007

filed with a regular deductible While this is notnegligible it applies for only a small fraction ofindividuals For the vast majority of them the99 percent discussed above is a more appropri-ate benchmark Therefore ex post moral hazardis unlikely to play a major role in this settingand one can abstract from the loss distributionand focus on claim rates as we do in thispaper48

Additional Cost of an AccidentmdashOur modelassumes that in the event of an accident theonly incurred costs are those associated with thedeductible payment In practice however other

transaction costs may be incurred such as thetime spent for appraisal of the damage the costsassociated with renting a replacement car for theduration of a repair etc Such costs could bereadily incorporated into the model To illus-trate we assume that these costs are known inadvance and are given by a constant c (whichcould in principle vary with each individual)Since c will not vary with the chosen level ofdeductible it will not affect the value d andwill enter the empirical model only through itseffect on d In particular equation (7) willchange to

(19) r

p

d 1

d c

and everything else will remain the same

48 We would not be as comfortable with this statementfor the choice of high and very high deductibles which areat much higher levels This is one additional reason to focusonly on the choice between low and regular deductibles

FIGURE 3 CLAIM DISTRIBUTIONS

Notes This figure plots kernel densities of the claim amounts estimated separately depending on the deductible choice Forease of comparison we normalize the claim amounts by the level of the regular deductible (ie the normalization is invariantto the deductible choice) and truncate the distribution at ten (the truncated part which includes a fat tail outside of the figureaccounts for about 25 percent of the distribution and is roughly similar for both deductible choices) The thick line presentsthe distribution of the claim amounts for individuals who chose a low deductible while the thin line does the same for thosewho chose a regular deductible Clearly both distributions are truncated from below at the deductible level The figure showsthat the distributions are fairly similar Assuming that the claim amount distribution is the same the area below the thickerline between 06 and 1 is the fraction of claims that would fall between the two deductible levels and therefore (absentdynamic incentives) would be filed only if a low deductible were chosen This area (between the two dotted vertical lines)amounts to 13 percent implying that the potential bias arising from restricting attention to claim rate (and abstracting fromthe claim distribution) is quite limited As we discuss in the text dynamic incentives due to experience rating may increasethe costs of filing a claim shifting the region in which the deductible choice matters to the right an upper bound to these costsis about 70 percent of the regular deductible covering an area (between the two dashed vertical lines) that integrates to morethan 7 percent Note however that these dynamic incentives are a very conservative upper bound they apply to less than 15percent of the individuals and do not account for the exit option which significantly reduces these dynamic costs

775VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

This implies that in principle such costs willhave no effect on the results of the counterfac-tual exercise we present later The costs willhowever affect the interpretation of the esti-mates of risk aversion In particular instead ofthe distribution of r we will now be estimatingthe distribution of r(d c)d so the reportedestimates of the coefficient of absolute riskaversion will be biased upward The magnitudeof the bias depends on the size of this transac-tion cost c compared to the average deductibled If c is relatively small the bias is negligibleIf however c is as big as the (average) deduct-ible level all our reported estimates of the levelof risk aversion should be divided by two (butthe coefficients on observables which are semi-elasticities will not change) The intuitionwould be similar but more involved if c variesacross individuals but not proportionally to d

Data about such transaction costs are ofcourse not available The following back-of-the-envelope exercise may provide some guidanceas to the magnitude of such costs We collecteddata from the Israeli police about the annual num-bers of accidents49 accidents with fatalities andaccidents with severe injuries in Israel for theyears 1996ndash1999 We then divided these numbersby our estimate of the total number of auto insur-ance claims in Israel (161859)50 We obtain that159 percent of the claims involve reported acci-dents 22 percent involve accidents with severeinjuries and 03 percent involve fatal accidentsThus the majority of claims reflect small unre-ported accidents perhaps suggesting that theseadditional costs of a claim are often not too large

Deviations from Expected Utility TheorymdashThroughout the paper we restrict attention toexpected utility maximizers Despite much ev-idence in the literature against some of the pre-dictions of expected utility theory it still seemsto us the most natural benchmark to specify andone that facilitates comparison to previous stud-ies We note that expected utility theory is as-sumed it is not and cannot be tested within our

framework Given our cross-sectional analysiswhich in principle allows flexible forms ofunobserved heterogeneity in risk preferencesthere are no testable restrictions imposed byexpected utility theory We should also note thatmuch (but not all) of the documented evidenceagainst expected utility theory arises with ex-treme risk probabilities which are close to zeroor one Our data (and our estimates) are basedon risk probabilities that are all in the range of010 to 035 Over this range expected utilityseems to perform better Finally it is importantto stress two points First at the conceptuallevel it is straightforward to use an alternativetheory of decisions under uncertainty If con-ditional on objective risk individuals vary in asingle dimension the same conceptual modeland empirical strategy can be applied All oneneeds to do is to specify the parameter overwhich decisions vary and construct an indiffer-ence set in the space of the specified parameterand (objective) risk types similar to the onepresented in Figure 2 Second any alternativemodel of decisions under uncertainty would re-quire us to take an even stronger view regardingthe parameterized objective function For exam-ple prospect theory (Daniel Kahneman andAmos Tversky 1979) would require us to pa-rameterize not only the curvature of individu-alsrsquo utility functions but also their referencepoints for which there is no natural choice inour context Similar issues arise if we try toapply decision weights (Tversky and PeterWakker 1995) or measures of overconfidencewith respect to driving ability

F Implications for Profits and Pricing

We now look at how firm profits vary withalternative pricing schemes This exercise isinteresting for several reasons First althoughwe do not use supply-side information for esti-mation it shows how one may incorporate suchinformation in estimation Second it illustratesthe conceptual trade-off faced by a monopolistthat operates in a market with adverse selectionAlthough the conceptual trade-off betweenhigher demand and worse selection is wellknown and has been extensively analyzed in thetheoretical literature quantifying it is importantto understand its empirical relevance Finallywe argued earlier that unobserved heterogeneityin risk aversion seems more important than un-

49 An accident is counted in this measure if it was re-ported to the police and a police officer arrived at the scene

50 This is estimated by taking the number of activepolicies at the end of our sample (45871) dividing it by ourbest guess for the share of the market the company had atthe time (7 percent) and multiplying it by the estimatedclaim rate in our data as computed in Table 2 (0245)

776 THE AMERICAN ECONOMIC REVIEW JUNE 2007

observed heterogeneity in risk The current ex-ercise shows that this conclusion also translatesto pricing and profits

Throughout this section we hold fixed thedistribution of risk and risk aversion the firmfaces Since we have little information about thedeterminants of overall demand faced by thefirm and hence how restrictive it is to hold thedistribution fixed we make the simplifying as-sumption that individuals make their choicessequentially They first choose the insuranceprovider by observing only the price of theregular deductible Once they decide to buy apolicy from the insurer they choose the deduct-ible level This seems a reasonable approxima-tion as the regular deductible is the one alwaysadvertised and initially quoted while the otheroptions are revealed only once the potentialcustomer and the insurance sales person ldquogetinto detailsrdquo Consistent with this assumptionwe assume that the regular premium and de-ductible are dictated by competitive conditionsand we focus on the choice of the low deduct-ible and its associated premium

From the companyrsquos standpoint each indi-vidual can be represented by a random draw of(i ri) from the conditional (on observables)distribution of risk and risk aversion

(20) ln i

ln ri Nxi

xi

2 r

r r2

When analyzing the optimal menu to offer suchan individual the company is assumed to berisk neutral and to maximize expected profitsSuppose the company offered only the regulardeductible and premium (dh ph) Let the ex-pected profits from this strategy be 0 Givenour assumptions we proceed by analyzing howthe firmrsquos profits are affected by a choice of anadditional low deductible option (dl pl) withdl dh and pl ph It is easy to use a changein variables and analyze the choice of d dh dl and p pl ph Expected profits arenow given by

(21) maxdp

0 Prri rii d pp

d Eiri rii d p

The trade-off in the companyrsquos decision isstraightforward Each new customer who chooses

the low combination pays an additional p up-front but saves d for each accident she is in-volved in This translates into two effects thatenter the companyrsquos decision problem The first issimilar to a standard pricing problem higher(lower) price difference (deductible difference)p (d) leads to a higher markup (on those indi-viduals who select the low deductible) but tolower quantity (or probability of purchase) asfewer individuals elect to choose the low deduct-ible This effect enters the profit function throughD(d p) Pr(ri ri(i d p)) The secondcomposition effect arises because of adverse se-lection As the price of the low deductible in-creases those individuals who still elect to choosethe low combination are ceteris paribus thosewith higher risk This effect enters through (dp) E(iri ri(i d p)) Its magnitude andsign depend on the relative heterogeneity of i andri and on the correlation between them Sinceneither D(d p) nor (d p) has a closed-form solution we analyze this decision problemgraphically where D(d p) and (d p) arenumerically computed using simulations from thejoint distribution

We illustrate our analysis by using the meanindividual in the data whose expected ln(i)and ln(ri) are 154 and 1181 respectively(based on Table 4) Such an individual has tochoose between a regular contract of ( phdh) (3189 1595) (in NIS) and a lowcontract of ( pl dl) (3381 957) ie (pd) (191 638) Below we discuss severaladditional counterfactual cases First we con-sider a case with a negative correlation be-tween risk and risk aversion (with the samemagnitude ie 084) Second we con-sider cases when the company ignores unob-served heterogeneity in one of the dimensionsie it views individuals as a draw from theestimated marginal distribution on one dimen-sion with the other dimension known and fixedat its estimated mean We do this exercise foreach dimension separately

To get intuition for the different effects Figure2 presents the estimated distribution in the spaceof (i ri) A small increase (decrease) in p (d)shifts the indifference setup to the right therebymaking some marginal individuals who were pre-viously just to the right of it switch to choosingthe regular deductible The demand trade-off isjust the comparison between the marginal loss ofthe company from all the marginal individuals

777VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

who no longer buy higher coverage with thehigher profits made from the inframarginal indi-viduals who still elect to choose higher coverageFigure 2 also helps in illustrating the effect ofadverse selection and the importance of the cor-relation coefficient As the menu shifts to the rightthe positive correlation implies that the marginalindividuals have higher risk than the average Thismeans that ldquolosingrdquo them (namely havingthem buy less coverage) is not as costly forthe insurance company as such individualsare on average more adversely selected Anegative correlation for example would havemade these marginal individuals more valuablethereby decreasing the incentive to increase pricesor deductibles from the current levels

Figure 4 presents the effects of pricing onprofits by varying the low-deductible levelkeeping the premium charged for it fixed at theobserved price (of p 191) It implies that the

current low deductible benefit of 638 NIS re-sults in additional annual profits of about 368NIS per customer This is about 037 percent oftotal operating profits per customer which isabout 1000 NIS Note however that after sub-tracting the administrative and claim-handlingcosts associated with each customer and claimthe relative magnitude of this effect will behigher Note also that the estimates imply thatthe current low-deductible level is suboptimalBy setting a smaller low-deductible benefit ofd 355 NIS (ie increasing the current lowdeductible by 283 NIS) additional profits canbe increased to 659 NIS51 Of course the find-

51 There is no reason of course to limit the choice of thecompany to only one additional deductible level Moredegrees of freedom in choosing the pricing menu will leadto higher profits

FIGURE 4 COUNTERFACTUALSmdashPROFITS

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) We plot the additional profits fromoffering a low deductible as a function of the attractiveness of a low deductible d dh dl fixing its price at the observedlevel The thick solid line presents the counterfactual profits as implied by the estimated benchmark model The other threecurves illustrate how the profits change in response to changes in the assumptions when the correlation between riskand risk aversion is negative (thin solid line) when there is no heterogeneity in risk aversion (dot-dashed line) and whenthere is no heterogeneity in risk (dashed line) The maxima (argmax) of the four curves respectively are 659 (355)714 (583) 0 and 704 (500) The dotted vertical line represents the observed level of d (638) which implies that theadditional profits from the observed low deductible are 368 NIS per policy

778 THE AMERICAN ECONOMIC REVIEW JUNE 2007

ing that the current pricing is suboptimal mayalso be due to a limitation of the model

Consistent with the intuition discussed aboveFigure 4 also shows that the incentive to in-crease prices (or lower the deductible) is higherwith positive correlation When the correlationbetween risk and risk aversion is negative theoptimal low deductible is lower It also showsthat ignoring either dimension of unobservedheterogeneity has an important effect on pric-ing decisions However while ignoring het-erogeneity in risk does not change thequalitative pattern by much ignoring hetero-geneity in risk aversion completely changesthe pricing analysis In fact given the esti-mated coefficients and the observed prices ifindividuals vary only in risk offering a lowdeductible does not increase profits due to ad-verse selection

Figure 5 breaks down the effect on profits bypresenting the pricing effect on the demand forlow deductible D(d p) and on the compo-sition effect (d p) The former is simplygenerated by the distribution of certainty equiv-alents implied by the joint distribution of i andri (Landsberger and Meilijson 1999) It is S-shaped due to the assumption of lognormal dis-tribution The shape of the composition effect isdriven by the relative variance of i and ri andby the correlation coefficient As the estimatesimply that most of the variation in certaintyequivalents is driven by variation in ri thestrong positive correlation implies that the com-position effect is monotonically decreasing inthe deductible level As the low-deductible op-tion becomes more favorable more peoplechoose it with the most risky individuals choos-ing it first The effect of the deductible level onthe composition effect is dramatically differentwhen the correlation between risk and risk aver-sion is negative When this is the case theobserved relationship between the deductiblelevel and the composition effect is mostly re-versed This is because the effect of risk aver-sion dominates that of adverse selection due toits higher variance

IV Concluding Remarks

The paper makes two separate contributionsFirst from a methodological standpoint we layout a conceptual framework through which one

can formulate and estimate a demand system forindividually customized contracts The key datarequirements for this approach are contractchoices individual choice sets and ex post riskrealizations Since such data may be available inmany other contexts the methodological frame-work may be useful to uncover structural pa-rameters in such settings As an example onecould consider annuity data and use guaranteeperiod choices and mortality data to identifyheterogeneity in risk (mortality) and in prefer-ences for wealth after death (Einav Finkelsteinand Paul Schrimpf 2006) Similarly one couldconsider loan data and use down-paymentchoices and default data to identify heterogene-ity in risk (default) and in liquidity (Will Ad-ams Einav and Jonathan D Levin 2006)Second from an economic standpoint we pro-vide a new set of estimates for the degree andheterogeneity of (absolute) risk aversion and itsrelationship with risk We discuss these below

While our estimates of risk aversion help topredict other related insurance decisions it isnatural to ask to what extent these parametersare relevant in other contexts This is essentiallyan empirical question which can be answeredonly by estimating risk aversion parameters fora variety of bet sizes and in a variety of con-texts Since isolating risk preferences in manycontexts is hard such exercises are rare leavingus with no definite answer for the scope ofmarkets for which our estimates may be rele-vant On one hand Rabin (2000) and Rabin andThaler (2001) argue that different decisions inlife are taken in different contexts and thereforemay be subject to different parameters in theutility function On the other hand classicaltheory suggests that each individual has onevalue function over her lifetime wealth so allrisky decisions take into account the same valuefunction and are therefore subject to the samerisk preferences Our view is somewhere inbetween We are more comfortable with extrap-olation of our risk aversion estimates to setupsthat are closer to the auto insurance marketcontext in which these parameters are esti-mated To assess ldquoclosenessrdquo it is important toconsider various factors over which contextsmay differ Such factors may include bet size aswell as informational and behavioral effectssuch as default options framing and rarity of theevents Since we use claim data to identify risktype we leave everything else to be interpreted as

779VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

FIGURE 5 COUNTERFACTUALSmdashSELECTION

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) Parallel to Figure 4 we breakdown the effects on profits to the share of consumers who choose a low deductible (bottom panel) and to the expected riskof this group (top panel) This is presented by the thick solid line for the estimates of the benchmark model As in Figure 4 wealso present these effects for three additional cases when the correlation between risk and risk aversion is negative (thin solidline) when there is no heterogeneity in risk aversion (dot-dashed line) and when there is no heterogeneity in risk (dashedline) The dotted vertical lines represent the observed level of d (638) for which the share of low deductible is 16 percentand their expected risk is 0264 This may be compared with the corresponding figures in Table 2A of 178 and 0309respectively Note however that the figure presents estimated quantities for the average individual in the data while Table2A presents the average quantities in the data so one should not expect the numbers to fit perfectly

780 THE AMERICAN ECONOMIC REVIEW JUNE 2007

risk aversion As an example overconfidence willbe captured by a lower level of estimated riskaversion so if overconfidence is more importantin auto insurance than in health insurance whenextrapolated to health insurance individuals maybehave as if they are more risk averse than weestimate them to be

With this caveat in mind let us discuss ourfour main findings First we find large hetero-geneity in risk preferences across individualsThis heterogeneity is important in various con-texts (a) it cautions against using pure cross-sectional variation to test expected utilitytheory (b) it may create strong selection ofparticipants into particular markets this maymake participants in voluntary markets quitedifferent in their risk preferences from those inmarkets in which participation is either manda-tory or driven by other factors and (c) it maymake the marginal individual quite differentfrom the average one Models in macroeconom-ics and finance which often use a representativeindividual framework may not be able to cap-ture and account for such differences

Our second set of findings concerns the wayrisk aversion relates to observable characteris-tics Our finding that women are more riskaverse than men has been documented in othersettings The finding that risk preferences ex-hibit a U-shaped pattern over the life cycle maybe interesting to explore further in other con-texts Other findings suggest that the estimatedcoefficient of risk aversion increases with ob-servables that are related to income and wealthAs we agree with the widely held belief of thedecreasing absolute risk aversion property ourpreferred interpretation for this finding is thatwealth and income may be endogenous gener-ating the estimated cross-sectional relationshipWhile lower risk aversion may be associatedwith higher propensity to become an entrepre-neur and thereby have higher wealth it mayalso be associated with lower propensity to saveor invest in education affecting wealth the otherway Therefore one important message of thisfinding is that accounting for heterogeneity inpreferences may be important as representativeconsumer models may provide misleading in-terpretations for otherwise natural results

The last two sets of findings concern therelationship between risk and preferences Sincerisk is particular to the context in which it ismeasured these findings may be sensitive to the

market context and may change once risk takesother forms Even within the auto insurancemarket it is important how risk is measuredMoreover risk in the auto insurance marketmay be conceptually different from risk in othermarkets In many markets risk is independentacross individuals In auto insurance howevermuch of the risk depends on coordinationamong drivers and therefore may be more re-lated to relative not absolute characteristicsFor this reason our finding of positive correla-tion between risk and risk aversion can coexistwith findings of negative correlation in othercontexts (Israel 2005 Finkelstein and McGarry2006) The positive correlation we find is alsoconsistent with the fact that the bivariate probittest in our data provides evidence for adverseselection (Cohen 2005) while similar reduced-form tests in other contexts do not

Finally we find that unobserved heterogene-ity in risk preferences is more important thanheterogeneity in risk This may be driven by thecasual evidence that insurance companies exertmuch effort and resources in collecting con-sumer data which are informative about riskclassification but not about preferences52 Weillustrate the empirical importance of our find-ings for the analysis of optimal contracts in autoinsurance The presence of more than one di-mension of unobserved heterogeneity may dra-matically change the nature of these contractsTheory is still not fully developed for suchmultidimensional screening problems as it typ-ically requires a small number of types (Lands-berger and Meilijson 1999) restricts the twodimensions to be independent of each other(Jean-Charles Rochet and Lars A Stole 2002)or assumes that the number of instrumentsavailable to the monopolist is not smaller thanthe dimension of unobserved heterogeneity(Steven Matthews and John Moore 1987 Rich-ard Arnott and Joseph E Stiglitz 1988)53 MarkArmstrong (1999) may be the closest theoreticalwork to the framework suggested here It cannot

52 This choice of data collection efforts may be justifiedif it is easier for such firms to price discriminate based onrisk but harder to price discriminate based on preferencesA common belief is that without cost-based (ie risk-based) justification for prices price discrimination may leadto consumer backlash

53 See also Smart (2000) Villeneuve (2003) and Jullienet al (2007) for related theoretical results

781VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

be directly applied however as it uses simpli-fying linearity assumptions which would behard to impose in the current context Our re-sults indicate that many applications can benefitfrom extending the theory to include the more

general case such as the one analyzed hereSuch a theory may also serve as a guide forusing supply-side restrictions in similar con-texts Our counterfactual analysis is a very pre-liminary start in this direction

APPENDIX A DESCRIPTION OF THE GIBBS SAMPLER

In this appendix we describe the setup of the Gibbs sampler that we use to estimate the modelOne of the main advantages of the Gibbs sampler is its ability to allow for data augmentation oflatent variables (Tanner and Wong 1987) In our context this amounts to augmenting the individual-specific risk aversion and risk type namely i rii1

n as additional parametersWe can write the model as follows

(22) ln i xi i

(23) ln ri xi vi

(24) choicei 1 if ri ri ()0 if ri ri ()

(25) claimsi Poissoni ti

(26) i

vi

iid

N00

2 r

r r2

Let

2 r

r r2 X

x1

xn

0

0 x1

xn

y r and ui i

vi

The set of parameters for which we want to have a posterior distribution is given by uii1

n The prior specifies that are independent of uii1n have a conventional

diffuse prior We adopt a hierarchical prior for uii1N

(27) uii 1n

iid

N0

(28) 1 Wishart2a Q

782 THE AMERICAN ECONOMIC REVIEW JUNE 2007

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 17: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

and the risk rate (xi) as predicted from thePoisson regression of column 1 All three addi-tional regressors enter with the predicted signand with large and highly significant marginaleffects29

Finally column 2 of Table 3 presents animportant specification of the probit regres-sion in which ln([pi(( xi)di) 1]d i) isadded as a regressor and its coefficient isnormalized to 130 As already mentioned ifwe assume that there is no unobserved heter-ogeneity in risk then column 2 is analogousto the ln(ri) equation of the benchmark modelThis restriction of the model is rejected by thedata About 10 percent of the individuals arepredicted to have (xi) (pi di) implying achoice of low deductible for any level of riskaversion Many of these individuals howeverstill choose higher deductible levels Column2 reports the results for the remaining indi-viduals (94000 out of 105800) for which theregression can be run While the signs of theestimated coefficients are similar to those inthe benchmark model presented below therestricted version of the model suggests muchhigher levels and dispersion of risk aversionwell above any reasonable level31 The fullestimation of the benchmark model clearly re-jects this restriction on the model

B Estimation of the Benchmark Model

The Effect of Individual Characteristics onRisk AversionmdashTable 4 presents the estimationresults from the benchmark model The secondcolumn shows how the level of absolute riskaversion is related to individual characteristicsAs the dependent variable is in natural loga-

rithm coefficients on dummy variables can bedirectly interpreted as approximate percentagechanges The vast majority of these coefficientsare quite stable across a wide range of specifi-cations which are mentioned later

The results indicate that women are more riskaverse than men and have a coefficient of ab-solute risk aversion about 20 percent greaterthan that of men These results are consistentwith those of Donkers et al (2001) and Hartoget al (2002) The estimated effect of age sug-gests a nonmonotone pattern of risk preferencesover the life cycle The estimated coefficientsimply that initially (that is at age 18 the youngestindividual in the data) individuals become less riskaverse with age but around the age of 48 indi-viduals start becoming more risk averse32 Mar-ried individuals are estimated to be significantlymore risk averse compared to singles while di-vorced individuals are less (although the coeffi-cient is statistically insignificant)

The analysis suggests that variables that arelikely to be correlated with income or wealthsuch as post-high-school education and thevalue of the car have a positive coefficientindicating that wealthier people have higherlevels of absolute risk aversion Although we donot have data on individualsrsquo income or wealthwe use other proxies for income in additionalspecifications and obtain mixed results Whenwe include as a covariate the average householdincome among those who live in the same zipcode we obtain a significant and negative co-efficient of 0333 (0154) (standard deviationin parentheses) When we match zip code in-come on demographics of the individuals thecoefficient is effectively zero 0036 (0047)and when we use a socioeconomic index ofthe locality in which the individual lives thecoefficient is positive 0127 (0053)33 Thus

29 The estimated marginal effect (z-statistic in parenthe-ses) is 0352 (1376) 16 104 (1481) and 0154(255) for (pidi) d i and (xi) respectively

30 The level is a normalization The sign is estimatedHad the sign on this regressor been positive this would haveimplied a rejection of the model

31 For the implied median level of risk aversion the re-stricted model produces a similar estimate to the estimate wereport below for the benchmark model However since someindividuals who chose the low deductible are estimated to havevery low claim rates the restricted model is ldquoforcedrdquo to estimatevery high risk aversion for these individuals (in contrast thebenchmark model can explain such choices by positive risk re-siduals) resulting in very high dispersion and (due to the lognor-mality assumption) very high average risk aversion which isabout 1025 higher than the benchmark estimates we report below

32 This nonmonotone pattern may explain why age enterswith different signs in the estimation results of Donkers et al(2001) and Hartog et al (2002) A somewhat similar U-shapepattern with respect to age is also reported by Sumit Agarwalet al (2006) in the context of consumer credit markets

33 The full results from these specifications are providedin the online Appendix (available at httpwwwe-aerorgdatajune0720050644_apppdf ) The reason we do not usethese income variables in the benchmark specification istheir imperfect coverage which would require us to omitalmost 20 percent of the individuals Other results fromthese regressions are similar to those we report for thebenchmark specification

761VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

overall we interpret our findings as suggestiveof a nonnegative and perhaps positive associ-ation between incomewealth and absolute riskaversion

At first glance these results may appear tobe inconsistent with the widely held beliefthat absolute risk aversion declines withwealth One should distinguish however be-tween two questions (a) whether for a givenindividual the vNM utility function exhibitsdecreasing absolute risk aversion and (b)how risk preferences vary across individuals

Our results do not speak to the first questionand should not be thought of as a test of thedecreasing absolute risk aversion propertyTesting this property would require observingthe same individual making multiple choicesat different wealth levels Rather our resultsindicate that individuals with greater wealthhave utility functions that involve a greaterdegree of risk aversion It might be thatwealth is endogenous and that risk aversion(or unobserved individual characteristics thatare correlated with it) leads individuals to

TABLE 4mdashTHE BENCHMARK MODEL

Variable Ln() equation Ln(r) equation Additional quantities

Demographics Constant 15406 (00073) 118118 (01032) Var-covar matrix ()

Age 00001 (00026) 00623 (00213) 01498 (00097)Age2 624 106 (263 105) 644 104 (211 104) r 31515 (00773)Female 00006 (00086) 02049 (00643) 08391 (00265)Family Single Omitted Omitted

Married 00198 (00115) 01927 (00974) Unconditional statisticsa

Divorced 00396 (00155) 01754 (01495) Mean 02196 (00013)Widower 00135 (00281) 01320 (02288) Median 02174 (00017)Other (NA) 00557 (00968) 04599 (07397) Std Dev 00483 (00019)

Education Elementary 00194 (00333) 01283 (02156) Mean r 00019 (00002)High school Omitted Omitted Median r 727 106 (756 107)Technical 00017 (00189) 02306 (01341) Std Dev r 00197 (00015)College 00277 (00124) 02177 (00840) Corr(r ) 02067 (00085)Other (NA) 00029 (00107) 00128 (00819)

Emigrant 00030 (00090) 00001 (00651) Obs 105800

Car attributes Log(value) 00794 (00177) 07244 (01272)Car age 00053 (00023) 00411 (00176)Commercial car 00719 (00187) 00313 (01239)Log(engine size) 01299 (00235) 03195 (01847)

Driving License years 00015 (00017) 00157 (00137)License years2 183 105 (351 105) 148 104 (254 104)Good driver 00635 (00112) 00317 (00822)Any driver 00360 (00105) 03000 (00722)Secondary car 00415 (00141) 01209 (00875)Business use 00614 (00134) 03790 (01124)History length 00012 (00052) 03092 (00518)Claims history 01295 (00154) 00459 (01670)

Young driver Young driver 00525 (00253) 02499 (02290)Gender Male Omitted mdash

Female 00355 (00061) mdashAge 17ndash19 Omitted mdash

19ndash21 00387 (00121) mdash21ndash24 00445 (00124) mdash24 00114 (00119) mdash

Experience 1 Omitted mdash1ndash3 00059 (00104) mdash3 00762 (00121) mdash

Company year First year Omitted OmittedSecond year 00771 (00122) 14334 (00853)Third year 00857 (00137) 28459 (01191)Fourth year 01515 (00160) 38089 (01343)Fifth year 04062 (00249) 39525 (01368)

Note Standard deviations based on the draws from the posterior distribution in parentheses Significant at the 5 percent confidence levela Unconditional statistics represent implied quantities for the sample population as a whole ie integrating over the distribution of covariates in the sample (as

well as over the unobserved components)

762 THE AMERICAN ECONOMIC REVIEW JUNE 2007

save more to obtain more education or totake other actions that lead to greaterwealth34

Let us make several additional observationsFirst while owners of more expensive cars ap-pear to have both higher risk exposure andhigher levels of risk aversion owners of biggercars have higher risk exposure but lower levelsof risk aversion This should indicate that thestructure of the model does not constrain therelationship between the coefficients in the twoequations Rather it is the data that ldquospeak uprdquoSecond individuals who are classified by theinsurer as ldquogood driversrdquo indeed have lowerrisk but also appear to have lower risk aversionThis result is somewhat similar to the positivecorrelation between unobserved risk and unob-served risk aversion which we report belowThird policyholders who tend to use the car forbusiness are less risk averse This could bebecause uninsured costs of accidents occurringto such policyholders are tax deductible Fourthpolicyholders who reported three full years ofclaims history are more risk averse but are notdifferent in their risk exposure The attitude thatleads such policyholders to comply with therequest to (voluntarily) report three full years ofclaims history is apparently and not surpris-ingly correlated with higher levels of risk aver-sion In contrast while past claims indicate highrisk they have no significant relationship withrisk aversion Finally we find a strong trendtoward lower levels of risk aversion over timeThis is a replication of the probit results dis-cussed earlier

The Effect of Individual Characteristics onClaim RiskmdashThe first column of Table 4 de-scribes the relationship between observablesand risk exposure Accident risk is higher fordivorced individuals and lower for people witha college education Bigger older more expen-sive and noncommercial cars are all morelikely to be involved in an accident Drivingexperience reduces accident rates as do mea-sures of less intense use of the car while youngdrivers are more exposed to risk Claim propen-sity is highly correlated over time the voluntaryreport of past claims is a strong predictor offuture claims This risk equation produces re-sults that are similar to those of the simplerPoisson regression reported in Table 3 Al-though some of the coefficients lose signifi-cance the magnitude of most coefficients isquite similar The similarity between these twosets of results is to be expected as the riskregression is identified primarily from the dataon claims so incorporating the information ondeductible choice does not qualitatively changethe conceptual identification strategy (see Sec-tion IIC) If the results were not similar thiscould have indicated a misspecification of themodel The slight differences between the riskregressions in Table 3 and Table 4 are drivenmainly by the structural assumptions First thebenchmark model estimates a lognormal mix-ture of Poisson rates rather than a single Pois-son model By incorporating the fatter tails ofthe claim distribution it slightly changes theresults increases the standard errors and de-creases the average predicted claim rate Sec-ond the information on deductible choiceslightly helps us in obtaining more precise es-timates through the correlation structure be-tween the error terms in the two equations

The Implied Level of Risk AversionmdashOne ofthe main goals of the estimation is to obtainmeasures of the level of risk aversion in thepopulation we study Since we use Gibbs sam-pling and augment the latent coefficients ofabsolute risk aversion we can directly obtainthe posterior distribution of various moments ofthis distribution At each iteration of the Gibbssampler we compute the mean standard devi-ation (across individuals) and various percen-tiles of the simulated draws of i and ri and thecorrelation between them The far-right columnof Table 4 reports the averages and standard

34 One may be tempted to interpret the positive wealtheffects as an indirect indication of credit constraints wealth-ier individuals are less credit constrained and therefore canafford to purchase more insurance We do not share thisinterpretation for two reasons First the insurance companyobserves these proxies for wealth and conditions on themwhen setting prices Since the willingness to pay for insur-ance is likely to be correlated with the willingness to pay forthe additional insurance provided by the low deductibleoption premiums already reflect this variation We condi-tion on the level of the premium Second paying less exante implies paying more ex post so applying the creditconstraint argument only to the ex ante payment but not tothe probabilistic ex post deductible payments has no theo-retical foundation Essentially the setup of the model linksthe ex ante decisions to the ex post losses which are bothdriven by the curvature of the vNM utility function

763VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

deviations of these computed quantities over theiterations of the Gibbs sampler35 The impliedrisk aversion of the mean individual is 00019which is about seven times greater than theback-of-the-envelope calculation presented inthe previous section As we assume a lognormal

distribution and estimate a relatively high dis-persion coefficient r the estimates also implysignificantly lower values for the median levelof risk aversion As shown later the qualitativepattern of these results is quite robust acrossspecifications

Table 5 summarizes our findings regardingthe level and dispersion of risk aversion Itpresents two ways to interpret the estimates weobtained from the benchmark model as well ascomparisons to specifications with constant ab-solute risk aversion (CARA) utility and with

35 Note that these estimated quantities are unconditionalIn computing these quantities we integrate over the distri-bution of observable characteristics in the data so onecannot compute these estimates from the estimated param-eter directly

TABLE 5mdashRISK-AVERSION ESTIMATES

Specificationa Absolute risk aversionb Interpretationc Relative risk aversiond

Back-of-the-envelope 10 103 9070 1484Benchmark model

Mean individual 67 103 5605 972225th percentile 23 106 9998 003Median individual 26 105 9974 03775th percentile 29 104 9714 42790th percentile 27 103 7834 390295th percentile 99 103 4937 14327

CARA utilityMean individual 31 103 7651 4436Median individual 34 105 9966 050

Learning modelMean individual 42 103 6886 6140Median individual 56 106 9995 008

Comparable estimatesGertner (1993) 31 104 9699 479Metrick (1995) 66 105 9934 102Holt and Laury (2002)e 32 102 2096 86575Sydnor (2006) 20 103 8329 5395

a The table summarizes the results with respect to the level of risk aversion ldquoBack-of-the-enveloperdquo refers to thecalculation we report in the beginning of Section III ldquobenchmark modelrdquo refers to the results from the benchmark model(Table 4) ldquoCARA utilityrdquo refers to a specification of a CARA utility function and ldquolearning modelrdquo refers to a specificationin which individuals do not know their risk types perfectly (see Section IIID) The last four rows are the closest comparableresults available in the literature

b The second column presents the point estimates for the coefficient of absolute risk aversion converted to $US1 unitsFor the comparable estimates this is given by their estimate of a representative CARA utility maximizer For all otherspecifications this is given by computing the unconditional mean and median in the population using the figures we reportin the mean and median columns of risk aversion in Table 6 (multiplied by 352 the average exchange rate during the periodto convert to US dollars)

c To interpret the absolute risk aversion estimates (ARA) we translate them into x u(w) 1frasl2 u(w 100) 1frasl2 u(w x) That is we report x such that an individual with the estimated ARA is indifferent about participating in a 50ndash50lottery of gaining 100 US dollars and losing x US dollars Note that since our estimate is of absolute risk aversion the quantityx is independent of w To be consistent with the specification we use a quadratic utility function for the back-of-the-envelopebenchmark and learning models and use a CARA utility function for the others

d The last column attempts to translate the ARA estimates into relative risk aversion We follow the literature and do soby multiplying the ARA estimate by average annual income We use the average annual income (after tax) in Israel in 1995(51168 NIS from the Israeli Census) for all our specifications and we use average disposable income in the United Statesin 1987 ($15437) for Gertner (1993) and Metrick (1995) For Holt and Laury (2002) and Sydnor (2006) we use a similarfigure for 2002 ($26974)

e Holt and Laury (2002) do not report a comparable estimate The estimate we provide above is based on estimating aCARA utility model for the 18 subjects in their experiment who participated in the ldquo90rdquo treatment which involved stakescomparable to our setting For these subjects we assume a CARA utility function and a lognormal distribution of theircoefficient of absolute risk aversion The table reports the point estimate of the mean from this distribution

764 THE AMERICAN ECONOMIC REVIEW JUNE 2007

incomplete-information (discussed later) and toother comparable figures in the literature Ourbenchmark estimate suggests that an averagequadratic utility maximizer36 will be indifferentabout participating in a 50ndash50 lottery in whichhe gains $100 or loses $5605 A CARA spec-ification suggests a significantly lower value foraverage risk aversion making the mean individ-ual indifferent about a 50ndash50 lottery of gaining$100 or losing $7651 The results from theincomplete information model for the meanlevel of risk aversion are in between these esti-mates All the results suggest that although themean individual exhibits a significant level ofrisk aversion heterogeneity in risk preferencesis important and the median individual is al-most risk neutral with respect to lotteries of$100 magnitude Although the mean is alwaysgreater than the median under the lognormaldistribution the large difference we find is notimposed In principle we could have obtained ahigh level of risk aversion with less heteroge-neity thereby leading to a smaller differencebetween the mean and the median (the esti-mated distribution of risk types is an example)

Let us briefly discuss the relevance of thecomparison to Gertner (1993) and Metrick(1995) There are two ways one can reconcilethe differences between the estimates Firstboth these papers measure risk aversion fortelevision game show participants these arehighly selected groups in a rather ldquorisk-friendlyrdquo environment37 Second the magni-tudes of the stakes are higher The showparticipants bet on several thousand dollars andmore while our average individual risks muchlower stakes in the range of $100 Thus thedifference in the results may be due to the issuesraised in Rabin (2000) regarding the compara-bility of behavior across different contexts andbet sizes We discuss this further in Section IV

A different way to quantify our estimate is byreporting them in relative terms There is noconsensus in the literature as to the relevantwealth that is taken into account in such deci-

sions Therefore for comparability we followthe closest papers in this respect (eg Gertner1993) and use annual income as the relevantwealth We multiply the estimated coefficient ofabsolute risk aversion by the average annualincome in Israel during the observation periodUnder the (questionable) assumption that an-nual income is a good proxy for the relevantwealth at the time of decision making thisproduct would be a proxy for the coefficient ofrelative risk aversion As Table 5 indicates ourbenchmark specification results in an impliedcoefficient of relative risk aversion of about 97A CARA specification results in a lower coef-ficient of 44 On the other hand the medianestimate for relative risk aversion is well belowone Thus the widely used estimate of a lowsingle-digit coefficient of relative risk aversionfalls somewhere between the median and themean and between the median and the seventy-fifth percentile of the risk aversion distribution

The Relationship between Unobserved Riskand Unobserved Risk AversionmdashTable 4 al-lows us to make observations about the relation-ship between risk and risk aversion We firstdiscuss the relative importance of unobservedheterogeneity of both dimensions In the popu-lation we study unobserved heterogeneity inrisk aversion (r) is much greater than unob-served heterogeneity in risk () This is trueboth in absolute terms (315 compared to 015respectively) and after normalizing by the cor-responding mean level38 using the coefficientof variation as a measure of dispersion (027compared to 01 respectively) It is also true forthe overall unconditional dispersion This couldindicate that selection on risk aversion is moreimportant in our data than adverse selection39 Theright metric to use for such statements is not

36 For such an individual the second-order Taylor ex-pansion we use in Section IIA is exact

37 We suspect that individuals who participate in televi-sion game shows are more adventuresome than the generalpopulation Moreover knowing that the audience mightwish to see them keep betting is likely to further encourageparticipants to take risks

38 All covariates are measured in deviations from theirsample mean so the estimated constant in each equation isthe estimated mean of the left-hand-side variable

39 An additional observation is that given our estimatesobservables account for slightly less than 50 percent of thevariation in ln(ri) but for almost 65 percent of the variation inln(i ) This may seem surprising given the finding that disper-sion in risk aversion is more important and thus should be thefocus of insurance companies However this finding is con-sistent with the conventional wisdom that insurance companiesspend much effort and resources on collecting information thathelps in risk classification but only little effort on informationthat predicts willingness to pay

765VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

entirely clear however as one should projectthese estimated variances onto the same scaleof say willingness to pay or profits Profits forexample are affected directly by risk but not byrisk aversion so the comparison above could bemisleading Therefore we relegate the discus-sion of this issue to Section IIIF where weshow that even when we look at pricing andprofits heterogeneity in risk aversion is moreimportant

Table 4 also indicates a strong and signif-icant positive correlation of 084 between un-observed risk aversion and unobserved riskThis result may be surprising because it isnatural to think that risk aversion with respectto financial decisions is likely to be associatedwith a greater tendency to take precautionsand therefore with lower risk Indeed a recentpaper by Finkelstein and McGarry (2006)supports such intuition by documenting anegative correlation between risk aversionand risk in the market for long-term careinsurance (see also Israel 2005 for the autoinsurance market in Illinois) Our marketmight however be special in ways that couldproduce a positive correlation First in con-trast to most insurance markets where apolicyholderrsquos risk depends on the policy-holderrsquos precautions but not on the precau-tions of others accident risk in the autoinsurance market is a result of an interactionbetween onersquos driving habits and those ofother drivers Second the correlation coeffi-cient may be highly sensitive to the particularway we measure risk and risk aversion Thereare many unobserved omitted factors that arelikely to be related to both dimensions Theintensity of vehicle use for example mightbe an important determinant of risk If indi-viduals who are more risk averse also drivemore miles per year a positive correlationbetween risk and risk aversion could emergeThus our results caution against assumingthat risk and risk aversion are always nega-tively correlated Whether this is the case maydepend on the characteristics of the particularmarket one studies and on the particular mea-sure for risk Indeed one can use estimatedannual mileage to control for one omittedvariable that may potentially work to producea positive correlation between risk aversionand risk Despite its partial coverage in thedata and being considered (by the company)

as unreliable40 controlling for annual mileagereported by policyholders reduces the esti-mated correlation coefficient to 06841 Weview this result as consistent with the possi-bility that underlying unobserved factors thataffect risk play an important role in generat-ing the estimated positive correlation betweenrisk and risk aversion A third explanation forthe positive estimated correlation is the dis-tributional assumption As discussed in Sec-tion IIC the correlation coefficient isprobably the parameter that is most sensitiveto these assumptions Indeed as discussedlater when we change our assumptions aboutthe Poisson process and use an alternativedistribution with extremely thin tails the es-timated correlation coefficient reverses signsWe of course view the assumptions of thebenchmark model as more appropriate andtherefore maintain the view that the data sug-gest a positive correlation Finally note thatwhile the correlation parameter we estimate ishigh the implied unconditional correlationbetween risk and risk aversion is less than025 across all reported specifications This isbecause the coefficients on the same covariate(for example the size of the car or whetherthe car is used for business) often affect riskand risk aversion in opposite directions andbecause of the logarithmic transformation

C Stability of the Risk Aversion Coefficients

Estimating risk preferences is motivated bythe idea that the same (or similar) risk aversionparameter may explain risky decisions acrossmultiple contexts This idea is at the heart of thecurrent debate we mention in the introductionregarding the empirical relevance of expectedutility and whether the standard construct of a

40 Insurance companies typically do not use these self-reported mileage estimates as they are considered unreli-able While companies could verify these estimates at thetime of a claim such reports are hard to enforce Anindividual can always claim that her ex ante estimate waslower than it turned out to be Indeed the estimated elas-ticity of risk with respect to estimated mileage is 006(0012) which seems low suggesting a bias downwardpotentially due to ldquoerrors in variablesrdquo bias

41 The full results from this specification are providedin the online Appendix Other results from this specifi-cation are similar to those we report for the benchmarkspecification

766 THE AMERICAN ECONOMIC REVIEW JUNE 2007

single risk aversion parameter that appliesacross many types of decisions is the right wayto think about consumer behavior To the bestof our knowledge there is no direct empiricalevidence on this issue While we defer much ofthis discussion to the concluding section weverify below that at least over a limited set ofchoices in which we observe the same individ-uals the estimated risk preferences help to ex-plain choices over time and across differentcontexts

Stability across ContextsmdashIdeally onewould like to show that the estimated risk pref-erences are stable across multiple lines of insur-ance and perhaps even across other riskydecisions individuals face Absent data on suchchoices we provide evidence that the estimatedrisk aversion coefficients help in predictingother related risky choices Individuals in ourdata had to make three additional coveragechoices in addition to the main policy choicethat we have analyzed so far They had tochoose whether to purchase coverage for caraudio equipment (bought by 498 percent of thesample of new policyholders) for towing ser-vices (bought by 929 percent) and for wind-shield damage (bought by 954 percent) Thesetypes of coverage are sold by the same companybut are administered by a third party so we donot have information about their realizationTherefore we cannot perform a similar exerciseto the one we perform for the deductible choiceOf course one should not expect these addi-tional coverage choices to be perfectly corre-lated with the deductible choice Even if thesame risk preferences are an important factorfor all of these decisions variation in risk acrosscoverage is also important For example own-ership of an expensive car audio system is likelyto affect the purchase of audio coverage inde-pendent of risk preferences Similarly ownersof old cars may value more coverage for towingservices Risk preferences however should en-ter into all of these coverage choices and there-fore we expect these additional types ofcoverages to be positively correlated with thechoice of a low deductible

We coded these three additional coveragechoices as three dummy variables and use themas additional covariates to verify that they canhelp explain unobserved risk aversion Firstwe add these variables to the probit regression

reported in column 3 of Table 3 They all havea positive statistically and economicallysignificant power in predicting the deductiblechoice42 To verify that these reported correla-tions are not driven by correlation in risks wealso estimate the benchmark model with theseadditional variables as covariates The esti-mated coefficients on the three additional cov-erage choices are positive and two of the threeare significant in both the risk and risk aversionequations they are 0053 (0019) 00002 (0021)and 0057 (0023) in the risk equation and0354 (0064) 0123 (0122) and 0813 (0174)in the risk aversion equation for audio towingand windshield coverage respectively (standarddeviations in parentheses)43 This suggests thatthe estimated risk preferences help in explainingmultiple coverage choices across these relatedcontexts

Stability over TimemdashWe now provide evi-dence that the estimated risk preferences arealso stable over time All the individuals whodecide to renew their policy with the companyafter it expires are at least in principle free tochange their deductible choice However anoverwhelming majority of individuals (morethan 97 percent) do not change their deductiblechoices when they renew Even individuals whoinitially chose the rarely chosen ldquohighrdquo andldquovery highrdquo deductible levels typically do notchange their choices Therefore it is also notsurprising that estimating the model on the firstdeductible choice of individuals and the seconddeductible choice of the same individuals yieldssimilar results as shown at the end of the nextsection Of course changes in deductiblechoices do not necessarily imply changes inrisk preferences Risk may also change overtime and may drive such changes At thesame time while persistence in deductiblechoices over time is consistent with stablerisk preferences it may also be driven bymany other factors For example it is reason-able to think that individuals do not devote

42 They are estimated to increase on average the prob-ability of choosing a low deductible by 0019 (819) 0023(527) and 0041 (758) for audio towing and windshieldcoverage respectively (z stat in parentheses)

43 The full results from this specification are provided in theonline Appendix Other results from this specification aresimilar to those we report for the benchmark specification

767VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

the same amount of thought to their renewaldecision as to their initial choice leading tothe observed persistence This caveat is themain reason why we focus most of the analysisonly on new policies

D Robustness

This section discusses the robustness of themain results Results from all specifications wemention in this section are summarized in Table6 The full results from these specifications arerelegated to the online Appendix

The von NeumannndashMorgenstern Utility Func-tionmdashTo derive the benchmark model we as-sume a negligible third derivative of the vNMutility function This is attractive as it allowsus to summarize risk preferences by a one-dimensional parameter r [u13(w)u(w)]There is however a large literature (eg MilesS Kimball and N Gregory Mankiw 1989) em-phasizing the importance of a (positive) third

derivative of the utility function which leads toprecautionary saving It is therefore importantto address the sensitivity of the results to thisassumption

By allowing a third-order Taylor approxima-tion of equation (4) we obtain

(15) r u13w

uw

p

d 1

d

uw

uw

dh2 dh dl dl

2

6d

which reduces to equation (7) when u(w) 0This illustrates that a positive third derivativeprovides an additional (precautionary) incentiveto insure It also shows that in order to fullydescribe preferences in the presence of a thirdderivative a one-dimensional parameter doesnot suffice one needs to know both r [u13(w)u(w)] and [u(w)u(w)] This makesit less attractive for estimation

TABLE 6mdashROBUSTNESS

Specificationa Sample Obs

Claim risk () Absolute risk aversion (r)

Corr(r ) Mean Median Std dev Mean Median Std dev

Baseline estimatesBenchmark model All new customers 105800 0220 0217 0048 19 103 73 106 0020 0207 0839

The vNM utility functionCARA utility All new customers 105800 0219 0217 0048 87 104 b 98 106 b 0009b 0201 0826

The claim generating processBenchmark model No multiple claims 103260 0182 0180 0040 20 103 28 105 0018 0135 0547Thinner-tail risk distribution All new customers 105800 0205c 0171c 0155c 17 103 19 106 0020 0076 0916

The distribution of risk aversionLower-bound procedure All new customers 105800 mdash mdash mdash 37 104 0 0002 mdash mdash

Incomplete information about riskBenchmark model Experienced drivers 82966 0214 0211 0051 21 103 83 106 0021 0200 0761Benchmark model Inexperienced drivers 22834 0230 0220 0073 30 103 12 107 0032 0186 0572Learning model All new customers 105800 0204 0191 0084 12 103 16 106 0016 0200 0772

Sample SelectionBenchmark model First two years 45739 0244 0235 0066 31 103 26 105 0026 0225 0699Benchmark model Last three years 60061 0203 0201 0043 16 103 34 107 0021 0113 0611Benchmark model Referred by a friend 26434 0213 0205 0065 30 103 84 107 0031 0155 0480Benchmark model Referred by advertising 79366 0219 0216 0051 21 103 76 106 0022 0212 0806Benchmark model Non-stayers 48387 0226 0240 0057 23 103 77 107 0026 0149 0848Benchmark model Stayers 1st choice 57413 0190 0182 0057 29 103 29 105 0024 0152 0463Benchmark model Stayers 2nd choice 57413 0208 0200 0065 30 103 16 105 0026 0211 0637

Note The table presents the key figures from various specifications and subsamples tracing the order they are presented in Section IIID Full results (in the format

of Table 4) from all these specifications are available in the online Appendixa ldquoBenchmark modelrdquo refers to the benchmark specification estimated on various subsamples (the first row replicates the estimates from Table 4) The other

specifications are slightly different and are all described in more detail in the corresponding parts of Section IIIDb The interpretation of r in the CARA model takes a slightly different quantitative meaning when applied to noninfinitesimal lotteries (such as the approximately

$100 stakes we analyze) This is due to the positive third derivative of the CARA utility function compared to the benchmark model in which we assume a small

third derivative Thus these numbers are not fully comparable to the corresponding figures in the other specificationsc The interpretation of in the thinner-tail distribution we estimate is slightly different from the standard Poisson rate which is assumed in the other specifications

Thus these numbers are not fully comparable to the corresponding figures in the other specifications

768 THE AMERICAN ECONOMIC REVIEW JUNE 2007

We can allow for a positive third derivativewithout expanding the dimensionality of pref-erences by imposing a parametric functionalform on the utility function Essentially suchparametric form imposes a relationship between[u13(w)u(w)] and [u(w)u(w)] Two standardforms are those that exhibit CARA and thosethat exhibit constant relative risk aversion(CRRA) CRRA requires us to make additionalassumptions about the relevant (and unob-served) wealth level of each individual makingit less attractive The CARA case is imple-mented below

With CARA utility we substitute u(w) exp(rw) in equation (4) and rearrange toobtain

(16) rp

exprdh exprdl

This equation defines the indifference set Unlikethe benchmark model there is no closed-formrepresentation for ri() a property that makesestimation significantly slower Due to the precau-tionary saving incentive arising from the thirdderivative we can use a lower level of absoluterisk aversion to rationalize the low deductiblechoice given In other words the CARA indif-ference set is flatter in comparison to the bench-mark case depicted in Figure 2 Thus the CARAspecification will generally lead to lower estimatesof the coefficient of absolute risk aversion asshown in Table 6 The mean and dispersion ofestimated absolute risk aversion (but not the me-dian) are smaller by about a factor of two Thegeneral qualitative results however remain thesame As shown in Table 5 the mean individual isquite risk averse with respect to lotteries of $100magnitude the median is close to risk neutral withrespect to such lotteries and heterogeneity in riskaversion is important

The Claim-Generating ProcessmdashIn thebenchmark model we assume that claims aregenerated by a Poisson model This assumptionis both analytically attractive (it is important inderiving equation (4)) and in our view a rea-sonable approximation of the data-generatingprocess for claims The Poisson distribution hasbeen widely used in economics to model acci-dent counts (eg Nancy L Rose 1990 Dionneand Vanasse 1992 Ronald Michener and CarlaTighe 1992) In using this distribution re-

searchers have followed the long history of theuse of Poisson by actuaries and insurance spe-cialists going back to Ladislaus Bortkiewicz(1898)

An important restriction of the Poisson dis-tribution is that its mean and variance are thesame Although some economic studies con-firmed this assumption for particular accidentdata (eg Christopher M Auld et al 2001) it isoften the case that this restriction is falsifiedThe most common deviation from the Poissonrestriction is that of fat tails ie variance that ishigher than the mean This led researchers touse a negative binomial distribution to accom-modate this regularity by introducing a secondparameter which delinks the relationship be-tween the mean and the variance One naturalinterpretation of the fat tails is that of unob-served heterogeneity and the negative binomialdistribution can be viewed as a Gamma mixtureof Poisson processes Consistent with this viewwe assume that the claim-generating processfollows a Poisson process at the individuallevel but allows unobserved heterogeneity inrisk and estimates a lognormal mixture of Pois-son processes which is similar to a negativebinomial The dispersion parameter we esti-mate is a free parameter which is identi-fied by the fatness of the tails of the claimdistribution

While one would like to allow fat tails of theaggregate claim distribution one may criticizethe assumption of a Poisson distribution at theindividual level for having tails that may be ldquotoofatrdquo Recently Jaap H Abbring et al (2003) andMark Israel (2004) provided evidence for neg-ative state dependence in data from auto insur-ance claims similar to the ones we useControlling for heterogeneity across individu-als these papers show that a second claim isless likely than a first This may happen due toexperience rating or perhaps to more carefuldriving and less intense use of a car after anaccident The Poisson distribution assumes thatthe second accident is just as likely so negativestate dependence may suggest thinner tails atthe individual level

To verify that the main results are not sensi-tive to this restriction of the Poisson model weperform two tests First we estimate the bench-mark model on a sample that includes onlyindividuals with one or no claims The model isstill identified using variation in policy durations

769VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

The estimates of the level of risk aversion andits dispersion across individuals remain similarto those of the benchmark model (Table 6) Theestimates of risk and the correlation with riskaversion are significantly lower But this isexpected we selected out of the sample thehigh-risk individuals

As a further test for the sensitivity of theresults to possible negative state dependencewe estimated the model with a different distri-bution at the individual level which gives riseto much thinner tails We could not find anyknown distribution of count variables that givesrise to tails thinner than Poissonrsquos so we madeone up In particular for a given Poisson ratelet pn be the probability of observing n occur-rences The distribution we take to the data issuch that

(17) Prn pn2

yenm0 pm2

Thus this distribution makes the probability ofmultiple claims much lower probably muchmore so than any realistic negative state depen-dence in the data Essentially such a distribu-tion makes the econometrician view individualswith multiple claims as high-risk individualswith much more certainty as it requires muchmore ldquobad luckrdquo for low-risk individuals tohave multiple claims The estimates for thelevel of risk aversion and its dispersion arealmost the same as in the benchmark model(Table 6) The rest of the summary figures aredifferent The difference in the level of risk isnot informative the interpretation of isslightly different given the change in the un-derlying distribution The dispersion in ismuch higher This is a direct result of the thin-tail assumption The dispersion of is identifiedby the tails of the claim distribution at theaggregate level With thinner tails imposed onthe individual distribution more heterogeneityis needed to match the observed tails at theaggregate Finally the correlation coefficient inthis specification changes signs and is now neg-ative and close to 1 This is consistent withour discussion in Section IIC which emphasizesthat the identification of the correlation coeffi-cient is closely tied to the structural assump-tions Once these are changed the estimatedcorrelation coefficient would change too Most

importantly however the mean and dispersionof risk aversion are stable

The Distribution of Absolute Risk Aver-sionmdashIn the benchmark model we assume thatthe coefficient of absolute risk aversion is lognor-mally distributed across individuals Since onlyfew previous studies focused on heterogeneity inrisk preferences there is not much existing evi-dence regarding the distribution of risk prefer-ences The only evidence we are aware of is theexperimental results presented by Steffen Andersenet al (2005) which show a skewed distributionwith a fat right tail which is qualitatively consis-tent with the lognormal distribution we assume

An additional advantage of the normality as-sumption is computational It provides a closed-form conditional distribution allowing us to useonly univariate (rather than bivariate) draws inthe estimation procedure significantly reducingthe computational burden One may be con-cerned about the sensitivity of the results to thisdistributional assumption For example it maydrive the result that the median level of riskaversion is much lower than the mean

Incorporating alternative distributional as-sumptions significantly complicates and slowsthe estimation procedure44 As an alternativewe develop a procedure that we believe pro-vides some guidance as to the sensitivity of theresults to this distributional assumption Thedisadvantage of the procedure is that it cannotaccount for adverse selection Since we foundthat adverse selection is not that important thisexercise is informative The exercise conveysthat the main qualitative results are not drivenby the (fat) tails of the lognormal distributionwe impose Rather they are driven by the highfraction of individuals who chose a low deduct-ible despite being of low risk The model im-plies that such individuals must have a fairlyhigh level of risk aversion

The exercise uses a Gibbs sampler to estimatethe lognormal distribution of i given the co-variates and the observed number of claims Ineach iteration of this Gibbs sampler conditional

44 A simple discrete-type distribution of risk aversion isnot well identified with the given data This is due to thenature of the exercise The distribution is identified bythe binary deductible choice and by the ldquolocalrdquo variation inthe pricing menu This variation is not enough to pin downthe discrete types accurately

770 THE AMERICAN ECONOMIC REVIEW JUNE 2007

on the data and the most recent draws for theindividual irsquos we compute lower bounds forthe level and dispersion of risk aversion (inaddition to generating the draws for ri from theconditional lognormal distribution) To deter-mine the lower bound for the level of risk aver-sion we compute

(18) r i i 0 if choicei 0

max0 pi

idi 113d i

if choicei 1

Namely we make ri as low as possible giventhe assumptions of the model We assume thatindividuals who chose a regular deductible arerisk neutral while individuals who chose a lowdeductible are just indifferent between the twodeductible levels (unless i is high enough inwhich case they are also assumed to be riskneutral) We then compute the average over therirsquos To compute the lower bound for the dis-persion we search for values of rirsquos that areconsistent with the assumptions of the modeland the observed data and that minimize thevariance of ri This turns out to be a simplesearch procedure that is linear in the number ofindividuals

The lower bound of the level of risk aversionthat we obtain from this exercise is 368 104

NIS1 (with standard deviation of 638 106)This level is five times lower than the analogouspoint estimate reported in Table 4 This trans-lates into indifference about participating in a50ndash50 lottery of gaining $100 and losing $885Thus it still suggests a significant level of riskaversion for lotteries of $100 magnitude Simi-larly the conversion to relative risk aversion asin Table 5 implies a relative risk aversion co-efficient of 1883 The result for the lowerbound of the dispersion is 189 103 (withstandard deviation of 816 105) which is tentimes lower than the benchmark estimate Thusthe coefficient of variation for absolute riskaversion declines only by two suggesting ahigh degree of heterogeneity Finally oneshould note that these bounds involve extremeassumptions and are computed separately andtherefore cannot be binding at the same timeThus the ldquocorrectrdquo estimates for both quantities

are likely to be higher closer to the results wereport for the benchmark model

Incomplete Information about RiskmdashThroughout we assume that individuals haveperfect information about their individual-specific objective risk type i This is a strongerassumption than we need Because expectedutility is linear in probabilities it suffices thatindividualsrsquo expected risk rate is equal to theirobjective risk rate ie i E(iIi) i wherei is a random variable representing individualirsquos perceived risk rate and Ii is individual irsquosinformation at the time of the coverage choiceNamely individuals could be uncertain abouttheir risk type but their point estimate has to becorrect

There are several channels through whichincomplete information may operate Let usconsider two such cases First suppose thatindividuals are correct but only on average iethat i i i where E(i) 0 The intuitionfor this case is similar to an ldquoerrors in variablesrdquomodel and in principle will result in an evenless important role for adverse selection Giventhat we find a relatively small effect of adverseselection this bias will not change this conclu-sion This may be even more pronounced ifcorr(i i) 0 which reflects a reasonableassumption of ldquoreversion to the meanrdquo ie thatan individualrsquos estimate of his risk type is someweighted average between his true risk type andthe average risk type of individuals who aresimilar (on observables) to him The conclusionmay go in the other way only if the mistakes goin the other direction according to which indi-viduals who are riskier than average believe thatthey are even more risky than they truly areThis we believe is less plausible

As a test for the sensitivity of the results tothe complete information assumption we per-form two exercises The first exercise estimatesthe benchmark model separately for experi-enced drivers and inexperienced drivers wherewe define an individual to be an experienceddriver if he has ten years or more driving expe-rience This is a conservative definition com-pared to those used in the literature (Chiapporiand Salanie 2000 Cohen 2005) The underlyingassumption consistent with evidence providedin Cohen (2005) is that drivers learn about theirown risk types as they gain more driving expe-rience but that most of this learning occurs

771VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

within the first ten years of driving Thus ourcomplete information model is a more appeal-ing description of experienced drivers Theresults for experienced drivers are almost iden-tical to the results from the benchmark specifi-cation (Table 6) suggesting that the mainresults are not driven by the fraction of individ-uals who are less likely to know their risk typesThe results for inexperienced drivers show asimilar pattern and magnitude Consistent withthe learning hypothesis however they show alarger dispersion in risk preferences which maybe due to incomplete information and there-fore to more random choices of deductiblelevels

The second exercise we perform is a morestructural version of the learning story whichallows even experienced drivers to have incom-plete information regarding their risk types Wedo so by estimating a different specification ofthe model that assumes that individuals areBayesian and update their information abouttheir own (stable) risk types over time usingonly information about the number of claimsthey make each year While we do not observecomplete claims histories of individuals wecan simulate such histories and integrate overthese simulations Thus individualsrsquo informa-tion would be related to their true type andwould be more precise with longer driving his-tories The full modeling and estimation detailsof this specification are provided in the Appen-dix We view this model as an extreme versionof incomplete information as there are manyother sources through which individuals maylearn about their own types and thereby havebetter information about their types than whatwe estimate them to have While the resultsimply that the levels of risk aversion and heter-ogeneity are lower than the benchmark esti-mates (Table 6) the order of magnitude andqualitative pattern are quite similar suggestingthat the main qualitative findings are robust tothe information structure

Sample SelectionmdashThere are two differentways to think about the contract selection pro-cess One possibility is that individuals firstselect an insurer based on advertisement wordof mouth or access to an agent Then individ-uals select the insurance contract from amongseveral contracts the selected insurer offersThis selection process is consistent with the way

the company executives view their business45

Another possibility is that individuals first col-lect information about all available insurancecontracts and then choose their most preferredone According to the industryrsquos conventionalwisdom individuals do not shop much acrossinsurers and therefore this selection processseems less important

The unique characteristics of direct insurersmay attract individuals who are more likely toexperiment with new ways of doing businessand may therefore be less risk averse than thegeneral population In Table 7 we compare thedemographics of our sample of policyholderswith those of the general Israeli populationThis comparison reflects a similar intuitioncompared with the general population our av-erage policyholder is slightly younger moreeducated more likely to be male and less likelyto be married or an immigrant This direction ofselection may also apply to unobserved riskpreferences thereby making our policyholderson average less risk averse than a representa-tive individual This suggests that the level ofrisk aversion that we find may be viewed as alower bound on the level of risk aversion in thegeneral Israeli population

One potential way to model sample selectionis to allow for an additional outside option to beselected For the vast majority of individuals weobserve the outside option is to purchase sim-ilar insurance from competing insurance agen-cies Unfortunately data on the structure ofcompeting contracts their prices and the waythey vary with individual characteristics are un-available This makes us uncomfortable to try tomodel sample selection as results from anysuch model will be driven by our assumptionsrather than by meaningful variation in the dataThe results are still meaningful for two reasonsFirst this is a large population accounting forabout 7 percent of all drivers in Israel Secondto the extent that our estimates suggest higherlevels of risk aversion than previously esti-mated and that the sample selection is likely to

45 As discussed in Section I the company was the firstdirect insurance provider and it offered significantly lowerpremiums than those offered by competitors due to signif-icant cost advantage In Section I we also discuss theliterature which emphasizes that choice of direct insurers isdriven primarily by nonmonetary ldquoamenitiesrdquo

772 THE AMERICAN ECONOMIC REVIEW JUNE 2007

bias these estimates downward the results arestill informative

To assess the magnitude of sample selectionin driving the main results we perform severaltests First we estimate the benchmark modelseparately for the first two years of the compa-nyrsquos operation during which the company wasthe only company to sell insurance directly andfor the remaining three years after additionaldirect insurers entered the market Second weestimate the benchmark model separately forindividuals who were referred to the companyby word of mouth and for those who heardabout the company through advertisement (pri-marily on television) The latter may havesearched more and might be more price sensi-tive Third we estimate the benchmark modelseparately for individuals who renewed theirpolicy with the company and for those who didnot renew It seems likely that individuals whodid not renew are less selected as most of themswitch back to regular insurers who insure themajority of the population Fourth we estimatethe model for the second deductible choicemade by those individuals who renew It couldbe argued that switching to other companies is

costly so outside options are not as important indriving deductible choices for those who renewAll these results are summarized in Table 6 andshow some important differences between thetwo groups within each pair of subsamples prob-ably reflecting selection For all groups howeverthe qualitative pattern of the results and the orderof magnitude and dispersion of risk aversion aresimilar to those of the full sample This is sugges-tive that correcting for sample selection is unlikelyto change the qualitative results

E Caveats

Moral HazardmdashThroughout our analysiswe abstract from moral hazard ie we assumethat i can vary across individuals but is invari-ant to the coverage choice There are two typesof moral hazard that may play a role in thiscontext First individuals with less coveragemay take greater precaution and drive morecarefully thereby reducing their claim risk rateSecond conditional on a claim event peoplewith higher deductibles are less likely to file aclaim there exists a range of claims for which

TABLE 7mdashREPRESENTATIVENESS

Variable Sampleb Populationc Car ownersd

Agea 4114 (1237) 4255 (1801) 4511 (1413)Female 0316 0518 0367Family Single 0143 0233 0067

Married 0780 0651 0838Divorced 0057 0043 0043Widower 0020 0074 0052

Education Elementary 0029 0329 0266High school 0433 0384 0334Technical 0100 0131 0165College 0438 0155 0234

Emigrant 0335 0445 0447

Obs 105800 723615 255435

a For the age variable the only continuous variable in the table we provide both the mean and the standard deviation (inparentheses)

b The figures are derived from Table 1 The family and education variables are renormalized so they add up to one weignore those individuals for whom we do not have family status or education level This is particularly relevant for theeducation variable which is not available for about half of the sample it seems likely that unreported education levels arenot random but associated with lower levels of education This may help in explaining at least some of the gap in educationlevels across the columns

c This column is based on a random sample of the Israeli population as of 1995 We use only adult population ieindividuals who are 18 years of age or older

d This column is based on a subsample of the population sample The data provide information about car ownership onlyat the household level not at the individual level Thus we define an individual as a car owner if (a) the household owns atleast one car and the individual is the head of the household or (b) the household owns at least two cars and the individualis the spouse of the head of the household

773VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

filing is profitable only under a low deductible(the literature often refers to this second effectas ldquoex post moral hazardrdquo)

To the extent that moral hazard exists abstract-ing from it will likely bias our estimates of riskaversion downward To see this note that adjust-ing behavior will help individuals to self-insureagainst uninsured costs Similarly ex post moralhazard would reduce the value of a low deduct-ible as in the event of a claim the gain from alower deductible would sometimes be less thanthe difference between the two deductible levelsBoth these effects will make a low deductible lessattractive requiring individuals to be even morerisk averse than we estimate in order to purchasemore coverage Below we discuss why in ourview abstracting from moral hazard is a reason-able approximation in this setting

It seems reasonable to conjecture that ceterisparibus insured individuals will drive less care-fully than uninsured ones It may also seemreasonable that the existence of a deductiblemay make individuals more careful about smalldamages to their car However when all choicesinclude a deductible and deductibles are similarin their magnitude it seems less likely thatdriving or care behavior will be affected46 Fi-nally to separately identify moral hazard onewould need another dimension of the data overwhich risk types remain fixed but coveragechoices vary exogenously (Israel 2004 Jaap HAbbring et al forthcoming)

We rely on the data to justify why we abstractfrom the second potential effect that of ex postmoral hazard Figure 3 presents data on theclaim amounts and shows that about 99 percentof the claims filed by policyholders with low-deductible policies were for amounts greaterthan the higher deductible level If the distribu-tion of amounts of potential claims does notvary with the deductible choice and if individ-uals file a claim for any loss that exceeds their

deductible level this suggests that 99 percent ofthe claims would have been filed under eitherdeductible choice making the assumption toabstract from moral hazard not very restrictive

Individuals however may choose not to filea claim even when the claim amount exceedsthe deductible level This may happen due toexperience rating which increases future insur-ance premiums These dynamic effects do notdepend on the deductible level at the time of theclaim so they simply enter in an additive wayUsing our data on individuals who renew theirpolicies with the company we can assess howbig the dynamic effects are These data showthat the price effect of a claim lasts for threeyears and is highest when an individual files hissecond claim within a year In such a case hewould face about a 20 percent increase in hisinsurance premium in the subsequent year 10percent in the year after and 5 percent in thethird year after the claim The regular premiumis about twice the regular deductible amount soan upper bound for the dynamic costs is 70percent of the regular deductible In most casesthe actual dynamic costs are much lower thanthis upper bound (the dynamic costs of say thefirst claim within a year are close to zero) Inaddition an individual can opt out of the con-tract and switch to a different insurance pro-vider This is likely to reduce his dynamic costsbecause in Israel unlike in the United Statesand many other countries there is no publicrecord of past claims Therefore insurance pro-viders can take full advantage of past recordsonly for their past customers For this reasonnew customers will of course face higher pre-miums than existing ones but the premium in-crease would not be as high as it would havebeen with the old insurance provider This isdue to the presence of ldquoinnocentrdquo new custom-ers who are pooled together with the switchers(Cohen 2003)47 Using 70 percent as a conser-vative upper bound Figure 3 shows that about93 percent of those claims filed by individualswith a low deductible would have also been46 This assumption is consistent with Cohen and Einav

(2003) who find no evidence for behavioral response tochanges in seat belt laws The following anecdotal obser-vation also supports this In an informal survey we con-ducted among our colleagues all of them were aware of adeductible in their auto insurance policy but fewer than 20percent knew its level This does not imply that 80 percentof our colleagues did not pay attention to their deductiblechoice at the time the choice was made It does implyhowever that their driving or care behavior cannot dependon the deductible level

47 New customers may voluntarily report their claimshistory to their new insurance provider Voluntary disclo-sure of past claims is as may be expected not truthful Ourdata suggest an unconditional claim rate of 02453 in oursample population Our data on claims history as voluntar-ily disclosed by the same individuals suggest a claim rate of00604 which is four times lower

774 THE AMERICAN ECONOMIC REVIEW JUNE 2007

filed with a regular deductible While this is notnegligible it applies for only a small fraction ofindividuals For the vast majority of them the99 percent discussed above is a more appropri-ate benchmark Therefore ex post moral hazardis unlikely to play a major role in this settingand one can abstract from the loss distributionand focus on claim rates as we do in thispaper48

Additional Cost of an AccidentmdashOur modelassumes that in the event of an accident theonly incurred costs are those associated with thedeductible payment In practice however other

transaction costs may be incurred such as thetime spent for appraisal of the damage the costsassociated with renting a replacement car for theduration of a repair etc Such costs could bereadily incorporated into the model To illus-trate we assume that these costs are known inadvance and are given by a constant c (whichcould in principle vary with each individual)Since c will not vary with the chosen level ofdeductible it will not affect the value d andwill enter the empirical model only through itseffect on d In particular equation (7) willchange to

(19) r

p

d 1

d c

and everything else will remain the same

48 We would not be as comfortable with this statementfor the choice of high and very high deductibles which areat much higher levels This is one additional reason to focusonly on the choice between low and regular deductibles

FIGURE 3 CLAIM DISTRIBUTIONS

Notes This figure plots kernel densities of the claim amounts estimated separately depending on the deductible choice Forease of comparison we normalize the claim amounts by the level of the regular deductible (ie the normalization is invariantto the deductible choice) and truncate the distribution at ten (the truncated part which includes a fat tail outside of the figureaccounts for about 25 percent of the distribution and is roughly similar for both deductible choices) The thick line presentsthe distribution of the claim amounts for individuals who chose a low deductible while the thin line does the same for thosewho chose a regular deductible Clearly both distributions are truncated from below at the deductible level The figure showsthat the distributions are fairly similar Assuming that the claim amount distribution is the same the area below the thickerline between 06 and 1 is the fraction of claims that would fall between the two deductible levels and therefore (absentdynamic incentives) would be filed only if a low deductible were chosen This area (between the two dotted vertical lines)amounts to 13 percent implying that the potential bias arising from restricting attention to claim rate (and abstracting fromthe claim distribution) is quite limited As we discuss in the text dynamic incentives due to experience rating may increasethe costs of filing a claim shifting the region in which the deductible choice matters to the right an upper bound to these costsis about 70 percent of the regular deductible covering an area (between the two dashed vertical lines) that integrates to morethan 7 percent Note however that these dynamic incentives are a very conservative upper bound they apply to less than 15percent of the individuals and do not account for the exit option which significantly reduces these dynamic costs

775VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

This implies that in principle such costs willhave no effect on the results of the counterfac-tual exercise we present later The costs willhowever affect the interpretation of the esti-mates of risk aversion In particular instead ofthe distribution of r we will now be estimatingthe distribution of r(d c)d so the reportedestimates of the coefficient of absolute riskaversion will be biased upward The magnitudeof the bias depends on the size of this transac-tion cost c compared to the average deductibled If c is relatively small the bias is negligibleIf however c is as big as the (average) deduct-ible level all our reported estimates of the levelof risk aversion should be divided by two (butthe coefficients on observables which are semi-elasticities will not change) The intuitionwould be similar but more involved if c variesacross individuals but not proportionally to d

Data about such transaction costs are ofcourse not available The following back-of-the-envelope exercise may provide some guidanceas to the magnitude of such costs We collecteddata from the Israeli police about the annual num-bers of accidents49 accidents with fatalities andaccidents with severe injuries in Israel for theyears 1996ndash1999 We then divided these numbersby our estimate of the total number of auto insur-ance claims in Israel (161859)50 We obtain that159 percent of the claims involve reported acci-dents 22 percent involve accidents with severeinjuries and 03 percent involve fatal accidentsThus the majority of claims reflect small unre-ported accidents perhaps suggesting that theseadditional costs of a claim are often not too large

Deviations from Expected Utility TheorymdashThroughout the paper we restrict attention toexpected utility maximizers Despite much ev-idence in the literature against some of the pre-dictions of expected utility theory it still seemsto us the most natural benchmark to specify andone that facilitates comparison to previous stud-ies We note that expected utility theory is as-sumed it is not and cannot be tested within our

framework Given our cross-sectional analysiswhich in principle allows flexible forms ofunobserved heterogeneity in risk preferencesthere are no testable restrictions imposed byexpected utility theory We should also note thatmuch (but not all) of the documented evidenceagainst expected utility theory arises with ex-treme risk probabilities which are close to zeroor one Our data (and our estimates) are basedon risk probabilities that are all in the range of010 to 035 Over this range expected utilityseems to perform better Finally it is importantto stress two points First at the conceptuallevel it is straightforward to use an alternativetheory of decisions under uncertainty If con-ditional on objective risk individuals vary in asingle dimension the same conceptual modeland empirical strategy can be applied All oneneeds to do is to specify the parameter overwhich decisions vary and construct an indiffer-ence set in the space of the specified parameterand (objective) risk types similar to the onepresented in Figure 2 Second any alternativemodel of decisions under uncertainty would re-quire us to take an even stronger view regardingthe parameterized objective function For exam-ple prospect theory (Daniel Kahneman andAmos Tversky 1979) would require us to pa-rameterize not only the curvature of individu-alsrsquo utility functions but also their referencepoints for which there is no natural choice inour context Similar issues arise if we try toapply decision weights (Tversky and PeterWakker 1995) or measures of overconfidencewith respect to driving ability

F Implications for Profits and Pricing

We now look at how firm profits vary withalternative pricing schemes This exercise isinteresting for several reasons First althoughwe do not use supply-side information for esti-mation it shows how one may incorporate suchinformation in estimation Second it illustratesthe conceptual trade-off faced by a monopolistthat operates in a market with adverse selectionAlthough the conceptual trade-off betweenhigher demand and worse selection is wellknown and has been extensively analyzed in thetheoretical literature quantifying it is importantto understand its empirical relevance Finallywe argued earlier that unobserved heterogeneityin risk aversion seems more important than un-

49 An accident is counted in this measure if it was re-ported to the police and a police officer arrived at the scene

50 This is estimated by taking the number of activepolicies at the end of our sample (45871) dividing it by ourbest guess for the share of the market the company had atthe time (7 percent) and multiplying it by the estimatedclaim rate in our data as computed in Table 2 (0245)

776 THE AMERICAN ECONOMIC REVIEW JUNE 2007

observed heterogeneity in risk The current ex-ercise shows that this conclusion also translatesto pricing and profits

Throughout this section we hold fixed thedistribution of risk and risk aversion the firmfaces Since we have little information about thedeterminants of overall demand faced by thefirm and hence how restrictive it is to hold thedistribution fixed we make the simplifying as-sumption that individuals make their choicessequentially They first choose the insuranceprovider by observing only the price of theregular deductible Once they decide to buy apolicy from the insurer they choose the deduct-ible level This seems a reasonable approxima-tion as the regular deductible is the one alwaysadvertised and initially quoted while the otheroptions are revealed only once the potentialcustomer and the insurance sales person ldquogetinto detailsrdquo Consistent with this assumptionwe assume that the regular premium and de-ductible are dictated by competitive conditionsand we focus on the choice of the low deduct-ible and its associated premium

From the companyrsquos standpoint each indi-vidual can be represented by a random draw of(i ri) from the conditional (on observables)distribution of risk and risk aversion

(20) ln i

ln ri Nxi

xi

2 r

r r2

When analyzing the optimal menu to offer suchan individual the company is assumed to berisk neutral and to maximize expected profitsSuppose the company offered only the regulardeductible and premium (dh ph) Let the ex-pected profits from this strategy be 0 Givenour assumptions we proceed by analyzing howthe firmrsquos profits are affected by a choice of anadditional low deductible option (dl pl) withdl dh and pl ph It is easy to use a changein variables and analyze the choice of d dh dl and p pl ph Expected profits arenow given by

(21) maxdp

0 Prri rii d pp

d Eiri rii d p

The trade-off in the companyrsquos decision isstraightforward Each new customer who chooses

the low combination pays an additional p up-front but saves d for each accident she is in-volved in This translates into two effects thatenter the companyrsquos decision problem The first issimilar to a standard pricing problem higher(lower) price difference (deductible difference)p (d) leads to a higher markup (on those indi-viduals who select the low deductible) but tolower quantity (or probability of purchase) asfewer individuals elect to choose the low deduct-ible This effect enters the profit function throughD(d p) Pr(ri ri(i d p)) The secondcomposition effect arises because of adverse se-lection As the price of the low deductible in-creases those individuals who still elect to choosethe low combination are ceteris paribus thosewith higher risk This effect enters through (dp) E(iri ri(i d p)) Its magnitude andsign depend on the relative heterogeneity of i andri and on the correlation between them Sinceneither D(d p) nor (d p) has a closed-form solution we analyze this decision problemgraphically where D(d p) and (d p) arenumerically computed using simulations from thejoint distribution

We illustrate our analysis by using the meanindividual in the data whose expected ln(i)and ln(ri) are 154 and 1181 respectively(based on Table 4) Such an individual has tochoose between a regular contract of ( phdh) (3189 1595) (in NIS) and a lowcontract of ( pl dl) (3381 957) ie (pd) (191 638) Below we discuss severaladditional counterfactual cases First we con-sider a case with a negative correlation be-tween risk and risk aversion (with the samemagnitude ie 084) Second we con-sider cases when the company ignores unob-served heterogeneity in one of the dimensionsie it views individuals as a draw from theestimated marginal distribution on one dimen-sion with the other dimension known and fixedat its estimated mean We do this exercise foreach dimension separately

To get intuition for the different effects Figure2 presents the estimated distribution in the spaceof (i ri) A small increase (decrease) in p (d)shifts the indifference setup to the right therebymaking some marginal individuals who were pre-viously just to the right of it switch to choosingthe regular deductible The demand trade-off isjust the comparison between the marginal loss ofthe company from all the marginal individuals

777VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

who no longer buy higher coverage with thehigher profits made from the inframarginal indi-viduals who still elect to choose higher coverageFigure 2 also helps in illustrating the effect ofadverse selection and the importance of the cor-relation coefficient As the menu shifts to the rightthe positive correlation implies that the marginalindividuals have higher risk than the average Thismeans that ldquolosingrdquo them (namely havingthem buy less coverage) is not as costly forthe insurance company as such individualsare on average more adversely selected Anegative correlation for example would havemade these marginal individuals more valuablethereby decreasing the incentive to increase pricesor deductibles from the current levels

Figure 4 presents the effects of pricing onprofits by varying the low-deductible levelkeeping the premium charged for it fixed at theobserved price (of p 191) It implies that the

current low deductible benefit of 638 NIS re-sults in additional annual profits of about 368NIS per customer This is about 037 percent oftotal operating profits per customer which isabout 1000 NIS Note however that after sub-tracting the administrative and claim-handlingcosts associated with each customer and claimthe relative magnitude of this effect will behigher Note also that the estimates imply thatthe current low-deductible level is suboptimalBy setting a smaller low-deductible benefit ofd 355 NIS (ie increasing the current lowdeductible by 283 NIS) additional profits canbe increased to 659 NIS51 Of course the find-

51 There is no reason of course to limit the choice of thecompany to only one additional deductible level Moredegrees of freedom in choosing the pricing menu will leadto higher profits

FIGURE 4 COUNTERFACTUALSmdashPROFITS

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) We plot the additional profits fromoffering a low deductible as a function of the attractiveness of a low deductible d dh dl fixing its price at the observedlevel The thick solid line presents the counterfactual profits as implied by the estimated benchmark model The other threecurves illustrate how the profits change in response to changes in the assumptions when the correlation between riskand risk aversion is negative (thin solid line) when there is no heterogeneity in risk aversion (dot-dashed line) and whenthere is no heterogeneity in risk (dashed line) The maxima (argmax) of the four curves respectively are 659 (355)714 (583) 0 and 704 (500) The dotted vertical line represents the observed level of d (638) which implies that theadditional profits from the observed low deductible are 368 NIS per policy

778 THE AMERICAN ECONOMIC REVIEW JUNE 2007

ing that the current pricing is suboptimal mayalso be due to a limitation of the model

Consistent with the intuition discussed aboveFigure 4 also shows that the incentive to in-crease prices (or lower the deductible) is higherwith positive correlation When the correlationbetween risk and risk aversion is negative theoptimal low deductible is lower It also showsthat ignoring either dimension of unobservedheterogeneity has an important effect on pric-ing decisions However while ignoring het-erogeneity in risk does not change thequalitative pattern by much ignoring hetero-geneity in risk aversion completely changesthe pricing analysis In fact given the esti-mated coefficients and the observed prices ifindividuals vary only in risk offering a lowdeductible does not increase profits due to ad-verse selection

Figure 5 breaks down the effect on profits bypresenting the pricing effect on the demand forlow deductible D(d p) and on the compo-sition effect (d p) The former is simplygenerated by the distribution of certainty equiv-alents implied by the joint distribution of i andri (Landsberger and Meilijson 1999) It is S-shaped due to the assumption of lognormal dis-tribution The shape of the composition effect isdriven by the relative variance of i and ri andby the correlation coefficient As the estimatesimply that most of the variation in certaintyequivalents is driven by variation in ri thestrong positive correlation implies that the com-position effect is monotonically decreasing inthe deductible level As the low-deductible op-tion becomes more favorable more peoplechoose it with the most risky individuals choos-ing it first The effect of the deductible level onthe composition effect is dramatically differentwhen the correlation between risk and risk aver-sion is negative When this is the case theobserved relationship between the deductiblelevel and the composition effect is mostly re-versed This is because the effect of risk aver-sion dominates that of adverse selection due toits higher variance

IV Concluding Remarks

The paper makes two separate contributionsFirst from a methodological standpoint we layout a conceptual framework through which one

can formulate and estimate a demand system forindividually customized contracts The key datarequirements for this approach are contractchoices individual choice sets and ex post riskrealizations Since such data may be available inmany other contexts the methodological frame-work may be useful to uncover structural pa-rameters in such settings As an example onecould consider annuity data and use guaranteeperiod choices and mortality data to identifyheterogeneity in risk (mortality) and in prefer-ences for wealth after death (Einav Finkelsteinand Paul Schrimpf 2006) Similarly one couldconsider loan data and use down-paymentchoices and default data to identify heterogene-ity in risk (default) and in liquidity (Will Ad-ams Einav and Jonathan D Levin 2006)Second from an economic standpoint we pro-vide a new set of estimates for the degree andheterogeneity of (absolute) risk aversion and itsrelationship with risk We discuss these below

While our estimates of risk aversion help topredict other related insurance decisions it isnatural to ask to what extent these parametersare relevant in other contexts This is essentiallyan empirical question which can be answeredonly by estimating risk aversion parameters fora variety of bet sizes and in a variety of con-texts Since isolating risk preferences in manycontexts is hard such exercises are rare leavingus with no definite answer for the scope ofmarkets for which our estimates may be rele-vant On one hand Rabin (2000) and Rabin andThaler (2001) argue that different decisions inlife are taken in different contexts and thereforemay be subject to different parameters in theutility function On the other hand classicaltheory suggests that each individual has onevalue function over her lifetime wealth so allrisky decisions take into account the same valuefunction and are therefore subject to the samerisk preferences Our view is somewhere inbetween We are more comfortable with extrap-olation of our risk aversion estimates to setupsthat are closer to the auto insurance marketcontext in which these parameters are esti-mated To assess ldquoclosenessrdquo it is important toconsider various factors over which contextsmay differ Such factors may include bet size aswell as informational and behavioral effectssuch as default options framing and rarity of theevents Since we use claim data to identify risktype we leave everything else to be interpreted as

779VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

FIGURE 5 COUNTERFACTUALSmdashSELECTION

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) Parallel to Figure 4 we breakdown the effects on profits to the share of consumers who choose a low deductible (bottom panel) and to the expected riskof this group (top panel) This is presented by the thick solid line for the estimates of the benchmark model As in Figure 4 wealso present these effects for three additional cases when the correlation between risk and risk aversion is negative (thin solidline) when there is no heterogeneity in risk aversion (dot-dashed line) and when there is no heterogeneity in risk (dashedline) The dotted vertical lines represent the observed level of d (638) for which the share of low deductible is 16 percentand their expected risk is 0264 This may be compared with the corresponding figures in Table 2A of 178 and 0309respectively Note however that the figure presents estimated quantities for the average individual in the data while Table2A presents the average quantities in the data so one should not expect the numbers to fit perfectly

780 THE AMERICAN ECONOMIC REVIEW JUNE 2007

risk aversion As an example overconfidence willbe captured by a lower level of estimated riskaversion so if overconfidence is more importantin auto insurance than in health insurance whenextrapolated to health insurance individuals maybehave as if they are more risk averse than weestimate them to be

With this caveat in mind let us discuss ourfour main findings First we find large hetero-geneity in risk preferences across individualsThis heterogeneity is important in various con-texts (a) it cautions against using pure cross-sectional variation to test expected utilitytheory (b) it may create strong selection ofparticipants into particular markets this maymake participants in voluntary markets quitedifferent in their risk preferences from those inmarkets in which participation is either manda-tory or driven by other factors and (c) it maymake the marginal individual quite differentfrom the average one Models in macroeconom-ics and finance which often use a representativeindividual framework may not be able to cap-ture and account for such differences

Our second set of findings concerns the wayrisk aversion relates to observable characteris-tics Our finding that women are more riskaverse than men has been documented in othersettings The finding that risk preferences ex-hibit a U-shaped pattern over the life cycle maybe interesting to explore further in other con-texts Other findings suggest that the estimatedcoefficient of risk aversion increases with ob-servables that are related to income and wealthAs we agree with the widely held belief of thedecreasing absolute risk aversion property ourpreferred interpretation for this finding is thatwealth and income may be endogenous gener-ating the estimated cross-sectional relationshipWhile lower risk aversion may be associatedwith higher propensity to become an entrepre-neur and thereby have higher wealth it mayalso be associated with lower propensity to saveor invest in education affecting wealth the otherway Therefore one important message of thisfinding is that accounting for heterogeneity inpreferences may be important as representativeconsumer models may provide misleading in-terpretations for otherwise natural results

The last two sets of findings concern therelationship between risk and preferences Sincerisk is particular to the context in which it ismeasured these findings may be sensitive to the

market context and may change once risk takesother forms Even within the auto insurancemarket it is important how risk is measuredMoreover risk in the auto insurance marketmay be conceptually different from risk in othermarkets In many markets risk is independentacross individuals In auto insurance howevermuch of the risk depends on coordinationamong drivers and therefore may be more re-lated to relative not absolute characteristicsFor this reason our finding of positive correla-tion between risk and risk aversion can coexistwith findings of negative correlation in othercontexts (Israel 2005 Finkelstein and McGarry2006) The positive correlation we find is alsoconsistent with the fact that the bivariate probittest in our data provides evidence for adverseselection (Cohen 2005) while similar reduced-form tests in other contexts do not

Finally we find that unobserved heterogene-ity in risk preferences is more important thanheterogeneity in risk This may be driven by thecasual evidence that insurance companies exertmuch effort and resources in collecting con-sumer data which are informative about riskclassification but not about preferences52 Weillustrate the empirical importance of our find-ings for the analysis of optimal contracts in autoinsurance The presence of more than one di-mension of unobserved heterogeneity may dra-matically change the nature of these contractsTheory is still not fully developed for suchmultidimensional screening problems as it typ-ically requires a small number of types (Lands-berger and Meilijson 1999) restricts the twodimensions to be independent of each other(Jean-Charles Rochet and Lars A Stole 2002)or assumes that the number of instrumentsavailable to the monopolist is not smaller thanthe dimension of unobserved heterogeneity(Steven Matthews and John Moore 1987 Rich-ard Arnott and Joseph E Stiglitz 1988)53 MarkArmstrong (1999) may be the closest theoreticalwork to the framework suggested here It cannot

52 This choice of data collection efforts may be justifiedif it is easier for such firms to price discriminate based onrisk but harder to price discriminate based on preferencesA common belief is that without cost-based (ie risk-based) justification for prices price discrimination may leadto consumer backlash

53 See also Smart (2000) Villeneuve (2003) and Jullienet al (2007) for related theoretical results

781VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

be directly applied however as it uses simpli-fying linearity assumptions which would behard to impose in the current context Our re-sults indicate that many applications can benefitfrom extending the theory to include the more

general case such as the one analyzed hereSuch a theory may also serve as a guide forusing supply-side restrictions in similar con-texts Our counterfactual analysis is a very pre-liminary start in this direction

APPENDIX A DESCRIPTION OF THE GIBBS SAMPLER

In this appendix we describe the setup of the Gibbs sampler that we use to estimate the modelOne of the main advantages of the Gibbs sampler is its ability to allow for data augmentation oflatent variables (Tanner and Wong 1987) In our context this amounts to augmenting the individual-specific risk aversion and risk type namely i rii1

n as additional parametersWe can write the model as follows

(22) ln i xi i

(23) ln ri xi vi

(24) choicei 1 if ri ri ()0 if ri ri ()

(25) claimsi Poissoni ti

(26) i

vi

iid

N00

2 r

r r2

Let

2 r

r r2 X

x1

xn

0

0 x1

xn

y r and ui i

vi

The set of parameters for which we want to have a posterior distribution is given by uii1

n The prior specifies that are independent of uii1n have a conventional

diffuse prior We adopt a hierarchical prior for uii1N

(27) uii 1n

iid

N0

(28) 1 Wishart2a Q

782 THE AMERICAN ECONOMIC REVIEW JUNE 2007

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 18: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

overall we interpret our findings as suggestiveof a nonnegative and perhaps positive associ-ation between incomewealth and absolute riskaversion

At first glance these results may appear tobe inconsistent with the widely held beliefthat absolute risk aversion declines withwealth One should distinguish however be-tween two questions (a) whether for a givenindividual the vNM utility function exhibitsdecreasing absolute risk aversion and (b)how risk preferences vary across individuals

Our results do not speak to the first questionand should not be thought of as a test of thedecreasing absolute risk aversion propertyTesting this property would require observingthe same individual making multiple choicesat different wealth levels Rather our resultsindicate that individuals with greater wealthhave utility functions that involve a greaterdegree of risk aversion It might be thatwealth is endogenous and that risk aversion(or unobserved individual characteristics thatare correlated with it) leads individuals to

TABLE 4mdashTHE BENCHMARK MODEL

Variable Ln() equation Ln(r) equation Additional quantities

Demographics Constant 15406 (00073) 118118 (01032) Var-covar matrix ()

Age 00001 (00026) 00623 (00213) 01498 (00097)Age2 624 106 (263 105) 644 104 (211 104) r 31515 (00773)Female 00006 (00086) 02049 (00643) 08391 (00265)Family Single Omitted Omitted

Married 00198 (00115) 01927 (00974) Unconditional statisticsa

Divorced 00396 (00155) 01754 (01495) Mean 02196 (00013)Widower 00135 (00281) 01320 (02288) Median 02174 (00017)Other (NA) 00557 (00968) 04599 (07397) Std Dev 00483 (00019)

Education Elementary 00194 (00333) 01283 (02156) Mean r 00019 (00002)High school Omitted Omitted Median r 727 106 (756 107)Technical 00017 (00189) 02306 (01341) Std Dev r 00197 (00015)College 00277 (00124) 02177 (00840) Corr(r ) 02067 (00085)Other (NA) 00029 (00107) 00128 (00819)

Emigrant 00030 (00090) 00001 (00651) Obs 105800

Car attributes Log(value) 00794 (00177) 07244 (01272)Car age 00053 (00023) 00411 (00176)Commercial car 00719 (00187) 00313 (01239)Log(engine size) 01299 (00235) 03195 (01847)

Driving License years 00015 (00017) 00157 (00137)License years2 183 105 (351 105) 148 104 (254 104)Good driver 00635 (00112) 00317 (00822)Any driver 00360 (00105) 03000 (00722)Secondary car 00415 (00141) 01209 (00875)Business use 00614 (00134) 03790 (01124)History length 00012 (00052) 03092 (00518)Claims history 01295 (00154) 00459 (01670)

Young driver Young driver 00525 (00253) 02499 (02290)Gender Male Omitted mdash

Female 00355 (00061) mdashAge 17ndash19 Omitted mdash

19ndash21 00387 (00121) mdash21ndash24 00445 (00124) mdash24 00114 (00119) mdash

Experience 1 Omitted mdash1ndash3 00059 (00104) mdash3 00762 (00121) mdash

Company year First year Omitted OmittedSecond year 00771 (00122) 14334 (00853)Third year 00857 (00137) 28459 (01191)Fourth year 01515 (00160) 38089 (01343)Fifth year 04062 (00249) 39525 (01368)

Note Standard deviations based on the draws from the posterior distribution in parentheses Significant at the 5 percent confidence levela Unconditional statistics represent implied quantities for the sample population as a whole ie integrating over the distribution of covariates in the sample (as

well as over the unobserved components)

762 THE AMERICAN ECONOMIC REVIEW JUNE 2007

save more to obtain more education or totake other actions that lead to greaterwealth34

Let us make several additional observationsFirst while owners of more expensive cars ap-pear to have both higher risk exposure andhigher levels of risk aversion owners of biggercars have higher risk exposure but lower levelsof risk aversion This should indicate that thestructure of the model does not constrain therelationship between the coefficients in the twoequations Rather it is the data that ldquospeak uprdquoSecond individuals who are classified by theinsurer as ldquogood driversrdquo indeed have lowerrisk but also appear to have lower risk aversionThis result is somewhat similar to the positivecorrelation between unobserved risk and unob-served risk aversion which we report belowThird policyholders who tend to use the car forbusiness are less risk averse This could bebecause uninsured costs of accidents occurringto such policyholders are tax deductible Fourthpolicyholders who reported three full years ofclaims history are more risk averse but are notdifferent in their risk exposure The attitude thatleads such policyholders to comply with therequest to (voluntarily) report three full years ofclaims history is apparently and not surpris-ingly correlated with higher levels of risk aver-sion In contrast while past claims indicate highrisk they have no significant relationship withrisk aversion Finally we find a strong trendtoward lower levels of risk aversion over timeThis is a replication of the probit results dis-cussed earlier

The Effect of Individual Characteristics onClaim RiskmdashThe first column of Table 4 de-scribes the relationship between observablesand risk exposure Accident risk is higher fordivorced individuals and lower for people witha college education Bigger older more expen-sive and noncommercial cars are all morelikely to be involved in an accident Drivingexperience reduces accident rates as do mea-sures of less intense use of the car while youngdrivers are more exposed to risk Claim propen-sity is highly correlated over time the voluntaryreport of past claims is a strong predictor offuture claims This risk equation produces re-sults that are similar to those of the simplerPoisson regression reported in Table 3 Al-though some of the coefficients lose signifi-cance the magnitude of most coefficients isquite similar The similarity between these twosets of results is to be expected as the riskregression is identified primarily from the dataon claims so incorporating the information ondeductible choice does not qualitatively changethe conceptual identification strategy (see Sec-tion IIC) If the results were not similar thiscould have indicated a misspecification of themodel The slight differences between the riskregressions in Table 3 and Table 4 are drivenmainly by the structural assumptions First thebenchmark model estimates a lognormal mix-ture of Poisson rates rather than a single Pois-son model By incorporating the fatter tails ofthe claim distribution it slightly changes theresults increases the standard errors and de-creases the average predicted claim rate Sec-ond the information on deductible choiceslightly helps us in obtaining more precise es-timates through the correlation structure be-tween the error terms in the two equations

The Implied Level of Risk AversionmdashOne ofthe main goals of the estimation is to obtainmeasures of the level of risk aversion in thepopulation we study Since we use Gibbs sam-pling and augment the latent coefficients ofabsolute risk aversion we can directly obtainthe posterior distribution of various moments ofthis distribution At each iteration of the Gibbssampler we compute the mean standard devi-ation (across individuals) and various percen-tiles of the simulated draws of i and ri and thecorrelation between them The far-right columnof Table 4 reports the averages and standard

34 One may be tempted to interpret the positive wealtheffects as an indirect indication of credit constraints wealth-ier individuals are less credit constrained and therefore canafford to purchase more insurance We do not share thisinterpretation for two reasons First the insurance companyobserves these proxies for wealth and conditions on themwhen setting prices Since the willingness to pay for insur-ance is likely to be correlated with the willingness to pay forthe additional insurance provided by the low deductibleoption premiums already reflect this variation We condi-tion on the level of the premium Second paying less exante implies paying more ex post so applying the creditconstraint argument only to the ex ante payment but not tothe probabilistic ex post deductible payments has no theo-retical foundation Essentially the setup of the model linksthe ex ante decisions to the ex post losses which are bothdriven by the curvature of the vNM utility function

763VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

deviations of these computed quantities over theiterations of the Gibbs sampler35 The impliedrisk aversion of the mean individual is 00019which is about seven times greater than theback-of-the-envelope calculation presented inthe previous section As we assume a lognormal

distribution and estimate a relatively high dis-persion coefficient r the estimates also implysignificantly lower values for the median levelof risk aversion As shown later the qualitativepattern of these results is quite robust acrossspecifications

Table 5 summarizes our findings regardingthe level and dispersion of risk aversion Itpresents two ways to interpret the estimates weobtained from the benchmark model as well ascomparisons to specifications with constant ab-solute risk aversion (CARA) utility and with

35 Note that these estimated quantities are unconditionalIn computing these quantities we integrate over the distri-bution of observable characteristics in the data so onecannot compute these estimates from the estimated param-eter directly

TABLE 5mdashRISK-AVERSION ESTIMATES

Specificationa Absolute risk aversionb Interpretationc Relative risk aversiond

Back-of-the-envelope 10 103 9070 1484Benchmark model

Mean individual 67 103 5605 972225th percentile 23 106 9998 003Median individual 26 105 9974 03775th percentile 29 104 9714 42790th percentile 27 103 7834 390295th percentile 99 103 4937 14327

CARA utilityMean individual 31 103 7651 4436Median individual 34 105 9966 050

Learning modelMean individual 42 103 6886 6140Median individual 56 106 9995 008

Comparable estimatesGertner (1993) 31 104 9699 479Metrick (1995) 66 105 9934 102Holt and Laury (2002)e 32 102 2096 86575Sydnor (2006) 20 103 8329 5395

a The table summarizes the results with respect to the level of risk aversion ldquoBack-of-the-enveloperdquo refers to thecalculation we report in the beginning of Section III ldquobenchmark modelrdquo refers to the results from the benchmark model(Table 4) ldquoCARA utilityrdquo refers to a specification of a CARA utility function and ldquolearning modelrdquo refers to a specificationin which individuals do not know their risk types perfectly (see Section IIID) The last four rows are the closest comparableresults available in the literature

b The second column presents the point estimates for the coefficient of absolute risk aversion converted to $US1 unitsFor the comparable estimates this is given by their estimate of a representative CARA utility maximizer For all otherspecifications this is given by computing the unconditional mean and median in the population using the figures we reportin the mean and median columns of risk aversion in Table 6 (multiplied by 352 the average exchange rate during the periodto convert to US dollars)

c To interpret the absolute risk aversion estimates (ARA) we translate them into x u(w) 1frasl2 u(w 100) 1frasl2 u(w x) That is we report x such that an individual with the estimated ARA is indifferent about participating in a 50ndash50lottery of gaining 100 US dollars and losing x US dollars Note that since our estimate is of absolute risk aversion the quantityx is independent of w To be consistent with the specification we use a quadratic utility function for the back-of-the-envelopebenchmark and learning models and use a CARA utility function for the others

d The last column attempts to translate the ARA estimates into relative risk aversion We follow the literature and do soby multiplying the ARA estimate by average annual income We use the average annual income (after tax) in Israel in 1995(51168 NIS from the Israeli Census) for all our specifications and we use average disposable income in the United Statesin 1987 ($15437) for Gertner (1993) and Metrick (1995) For Holt and Laury (2002) and Sydnor (2006) we use a similarfigure for 2002 ($26974)

e Holt and Laury (2002) do not report a comparable estimate The estimate we provide above is based on estimating aCARA utility model for the 18 subjects in their experiment who participated in the ldquo90rdquo treatment which involved stakescomparable to our setting For these subjects we assume a CARA utility function and a lognormal distribution of theircoefficient of absolute risk aversion The table reports the point estimate of the mean from this distribution

764 THE AMERICAN ECONOMIC REVIEW JUNE 2007

incomplete-information (discussed later) and toother comparable figures in the literature Ourbenchmark estimate suggests that an averagequadratic utility maximizer36 will be indifferentabout participating in a 50ndash50 lottery in whichhe gains $100 or loses $5605 A CARA spec-ification suggests a significantly lower value foraverage risk aversion making the mean individ-ual indifferent about a 50ndash50 lottery of gaining$100 or losing $7651 The results from theincomplete information model for the meanlevel of risk aversion are in between these esti-mates All the results suggest that although themean individual exhibits a significant level ofrisk aversion heterogeneity in risk preferencesis important and the median individual is al-most risk neutral with respect to lotteries of$100 magnitude Although the mean is alwaysgreater than the median under the lognormaldistribution the large difference we find is notimposed In principle we could have obtained ahigh level of risk aversion with less heteroge-neity thereby leading to a smaller differencebetween the mean and the median (the esti-mated distribution of risk types is an example)

Let us briefly discuss the relevance of thecomparison to Gertner (1993) and Metrick(1995) There are two ways one can reconcilethe differences between the estimates Firstboth these papers measure risk aversion fortelevision game show participants these arehighly selected groups in a rather ldquorisk-friendlyrdquo environment37 Second the magni-tudes of the stakes are higher The showparticipants bet on several thousand dollars andmore while our average individual risks muchlower stakes in the range of $100 Thus thedifference in the results may be due to the issuesraised in Rabin (2000) regarding the compara-bility of behavior across different contexts andbet sizes We discuss this further in Section IV

A different way to quantify our estimate is byreporting them in relative terms There is noconsensus in the literature as to the relevantwealth that is taken into account in such deci-

sions Therefore for comparability we followthe closest papers in this respect (eg Gertner1993) and use annual income as the relevantwealth We multiply the estimated coefficient ofabsolute risk aversion by the average annualincome in Israel during the observation periodUnder the (questionable) assumption that an-nual income is a good proxy for the relevantwealth at the time of decision making thisproduct would be a proxy for the coefficient ofrelative risk aversion As Table 5 indicates ourbenchmark specification results in an impliedcoefficient of relative risk aversion of about 97A CARA specification results in a lower coef-ficient of 44 On the other hand the medianestimate for relative risk aversion is well belowone Thus the widely used estimate of a lowsingle-digit coefficient of relative risk aversionfalls somewhere between the median and themean and between the median and the seventy-fifth percentile of the risk aversion distribution

The Relationship between Unobserved Riskand Unobserved Risk AversionmdashTable 4 al-lows us to make observations about the relation-ship between risk and risk aversion We firstdiscuss the relative importance of unobservedheterogeneity of both dimensions In the popu-lation we study unobserved heterogeneity inrisk aversion (r) is much greater than unob-served heterogeneity in risk () This is trueboth in absolute terms (315 compared to 015respectively) and after normalizing by the cor-responding mean level38 using the coefficientof variation as a measure of dispersion (027compared to 01 respectively) It is also true forthe overall unconditional dispersion This couldindicate that selection on risk aversion is moreimportant in our data than adverse selection39 Theright metric to use for such statements is not

36 For such an individual the second-order Taylor ex-pansion we use in Section IIA is exact

37 We suspect that individuals who participate in televi-sion game shows are more adventuresome than the generalpopulation Moreover knowing that the audience mightwish to see them keep betting is likely to further encourageparticipants to take risks

38 All covariates are measured in deviations from theirsample mean so the estimated constant in each equation isthe estimated mean of the left-hand-side variable

39 An additional observation is that given our estimatesobservables account for slightly less than 50 percent of thevariation in ln(ri) but for almost 65 percent of the variation inln(i ) This may seem surprising given the finding that disper-sion in risk aversion is more important and thus should be thefocus of insurance companies However this finding is con-sistent with the conventional wisdom that insurance companiesspend much effort and resources on collecting information thathelps in risk classification but only little effort on informationthat predicts willingness to pay

765VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

entirely clear however as one should projectthese estimated variances onto the same scaleof say willingness to pay or profits Profits forexample are affected directly by risk but not byrisk aversion so the comparison above could bemisleading Therefore we relegate the discus-sion of this issue to Section IIIF where weshow that even when we look at pricing andprofits heterogeneity in risk aversion is moreimportant

Table 4 also indicates a strong and signif-icant positive correlation of 084 between un-observed risk aversion and unobserved riskThis result may be surprising because it isnatural to think that risk aversion with respectto financial decisions is likely to be associatedwith a greater tendency to take precautionsand therefore with lower risk Indeed a recentpaper by Finkelstein and McGarry (2006)supports such intuition by documenting anegative correlation between risk aversionand risk in the market for long-term careinsurance (see also Israel 2005 for the autoinsurance market in Illinois) Our marketmight however be special in ways that couldproduce a positive correlation First in con-trast to most insurance markets where apolicyholderrsquos risk depends on the policy-holderrsquos precautions but not on the precau-tions of others accident risk in the autoinsurance market is a result of an interactionbetween onersquos driving habits and those ofother drivers Second the correlation coeffi-cient may be highly sensitive to the particularway we measure risk and risk aversion Thereare many unobserved omitted factors that arelikely to be related to both dimensions Theintensity of vehicle use for example mightbe an important determinant of risk If indi-viduals who are more risk averse also drivemore miles per year a positive correlationbetween risk and risk aversion could emergeThus our results caution against assumingthat risk and risk aversion are always nega-tively correlated Whether this is the case maydepend on the characteristics of the particularmarket one studies and on the particular mea-sure for risk Indeed one can use estimatedannual mileage to control for one omittedvariable that may potentially work to producea positive correlation between risk aversionand risk Despite its partial coverage in thedata and being considered (by the company)

as unreliable40 controlling for annual mileagereported by policyholders reduces the esti-mated correlation coefficient to 06841 Weview this result as consistent with the possi-bility that underlying unobserved factors thataffect risk play an important role in generat-ing the estimated positive correlation betweenrisk and risk aversion A third explanation forthe positive estimated correlation is the dis-tributional assumption As discussed in Sec-tion IIC the correlation coefficient isprobably the parameter that is most sensitiveto these assumptions Indeed as discussedlater when we change our assumptions aboutthe Poisson process and use an alternativedistribution with extremely thin tails the es-timated correlation coefficient reverses signsWe of course view the assumptions of thebenchmark model as more appropriate andtherefore maintain the view that the data sug-gest a positive correlation Finally note thatwhile the correlation parameter we estimate ishigh the implied unconditional correlationbetween risk and risk aversion is less than025 across all reported specifications This isbecause the coefficients on the same covariate(for example the size of the car or whetherthe car is used for business) often affect riskand risk aversion in opposite directions andbecause of the logarithmic transformation

C Stability of the Risk Aversion Coefficients

Estimating risk preferences is motivated bythe idea that the same (or similar) risk aversionparameter may explain risky decisions acrossmultiple contexts This idea is at the heart of thecurrent debate we mention in the introductionregarding the empirical relevance of expectedutility and whether the standard construct of a

40 Insurance companies typically do not use these self-reported mileage estimates as they are considered unreli-able While companies could verify these estimates at thetime of a claim such reports are hard to enforce Anindividual can always claim that her ex ante estimate waslower than it turned out to be Indeed the estimated elas-ticity of risk with respect to estimated mileage is 006(0012) which seems low suggesting a bias downwardpotentially due to ldquoerrors in variablesrdquo bias

41 The full results from this specification are providedin the online Appendix Other results from this specifi-cation are similar to those we report for the benchmarkspecification

766 THE AMERICAN ECONOMIC REVIEW JUNE 2007

single risk aversion parameter that appliesacross many types of decisions is the right wayto think about consumer behavior To the bestof our knowledge there is no direct empiricalevidence on this issue While we defer much ofthis discussion to the concluding section weverify below that at least over a limited set ofchoices in which we observe the same individ-uals the estimated risk preferences help to ex-plain choices over time and across differentcontexts

Stability across ContextsmdashIdeally onewould like to show that the estimated risk pref-erences are stable across multiple lines of insur-ance and perhaps even across other riskydecisions individuals face Absent data on suchchoices we provide evidence that the estimatedrisk aversion coefficients help in predictingother related risky choices Individuals in ourdata had to make three additional coveragechoices in addition to the main policy choicethat we have analyzed so far They had tochoose whether to purchase coverage for caraudio equipment (bought by 498 percent of thesample of new policyholders) for towing ser-vices (bought by 929 percent) and for wind-shield damage (bought by 954 percent) Thesetypes of coverage are sold by the same companybut are administered by a third party so we donot have information about their realizationTherefore we cannot perform a similar exerciseto the one we perform for the deductible choiceOf course one should not expect these addi-tional coverage choices to be perfectly corre-lated with the deductible choice Even if thesame risk preferences are an important factorfor all of these decisions variation in risk acrosscoverage is also important For example own-ership of an expensive car audio system is likelyto affect the purchase of audio coverage inde-pendent of risk preferences Similarly ownersof old cars may value more coverage for towingservices Risk preferences however should en-ter into all of these coverage choices and there-fore we expect these additional types ofcoverages to be positively correlated with thechoice of a low deductible

We coded these three additional coveragechoices as three dummy variables and use themas additional covariates to verify that they canhelp explain unobserved risk aversion Firstwe add these variables to the probit regression

reported in column 3 of Table 3 They all havea positive statistically and economicallysignificant power in predicting the deductiblechoice42 To verify that these reported correla-tions are not driven by correlation in risks wealso estimate the benchmark model with theseadditional variables as covariates The esti-mated coefficients on the three additional cov-erage choices are positive and two of the threeare significant in both the risk and risk aversionequations they are 0053 (0019) 00002 (0021)and 0057 (0023) in the risk equation and0354 (0064) 0123 (0122) and 0813 (0174)in the risk aversion equation for audio towingand windshield coverage respectively (standarddeviations in parentheses)43 This suggests thatthe estimated risk preferences help in explainingmultiple coverage choices across these relatedcontexts

Stability over TimemdashWe now provide evi-dence that the estimated risk preferences arealso stable over time All the individuals whodecide to renew their policy with the companyafter it expires are at least in principle free tochange their deductible choice However anoverwhelming majority of individuals (morethan 97 percent) do not change their deductiblechoices when they renew Even individuals whoinitially chose the rarely chosen ldquohighrdquo andldquovery highrdquo deductible levels typically do notchange their choices Therefore it is also notsurprising that estimating the model on the firstdeductible choice of individuals and the seconddeductible choice of the same individuals yieldssimilar results as shown at the end of the nextsection Of course changes in deductiblechoices do not necessarily imply changes inrisk preferences Risk may also change overtime and may drive such changes At thesame time while persistence in deductiblechoices over time is consistent with stablerisk preferences it may also be driven bymany other factors For example it is reason-able to think that individuals do not devote

42 They are estimated to increase on average the prob-ability of choosing a low deductible by 0019 (819) 0023(527) and 0041 (758) for audio towing and windshieldcoverage respectively (z stat in parentheses)

43 The full results from this specification are provided in theonline Appendix Other results from this specification aresimilar to those we report for the benchmark specification

767VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

the same amount of thought to their renewaldecision as to their initial choice leading tothe observed persistence This caveat is themain reason why we focus most of the analysisonly on new policies

D Robustness

This section discusses the robustness of themain results Results from all specifications wemention in this section are summarized in Table6 The full results from these specifications arerelegated to the online Appendix

The von NeumannndashMorgenstern Utility Func-tionmdashTo derive the benchmark model we as-sume a negligible third derivative of the vNMutility function This is attractive as it allowsus to summarize risk preferences by a one-dimensional parameter r [u13(w)u(w)]There is however a large literature (eg MilesS Kimball and N Gregory Mankiw 1989) em-phasizing the importance of a (positive) third

derivative of the utility function which leads toprecautionary saving It is therefore importantto address the sensitivity of the results to thisassumption

By allowing a third-order Taylor approxima-tion of equation (4) we obtain

(15) r u13w

uw

p

d 1

d

uw

uw

dh2 dh dl dl

2

6d

which reduces to equation (7) when u(w) 0This illustrates that a positive third derivativeprovides an additional (precautionary) incentiveto insure It also shows that in order to fullydescribe preferences in the presence of a thirdderivative a one-dimensional parameter doesnot suffice one needs to know both r [u13(w)u(w)] and [u(w)u(w)] This makesit less attractive for estimation

TABLE 6mdashROBUSTNESS

Specificationa Sample Obs

Claim risk () Absolute risk aversion (r)

Corr(r ) Mean Median Std dev Mean Median Std dev

Baseline estimatesBenchmark model All new customers 105800 0220 0217 0048 19 103 73 106 0020 0207 0839

The vNM utility functionCARA utility All new customers 105800 0219 0217 0048 87 104 b 98 106 b 0009b 0201 0826

The claim generating processBenchmark model No multiple claims 103260 0182 0180 0040 20 103 28 105 0018 0135 0547Thinner-tail risk distribution All new customers 105800 0205c 0171c 0155c 17 103 19 106 0020 0076 0916

The distribution of risk aversionLower-bound procedure All new customers 105800 mdash mdash mdash 37 104 0 0002 mdash mdash

Incomplete information about riskBenchmark model Experienced drivers 82966 0214 0211 0051 21 103 83 106 0021 0200 0761Benchmark model Inexperienced drivers 22834 0230 0220 0073 30 103 12 107 0032 0186 0572Learning model All new customers 105800 0204 0191 0084 12 103 16 106 0016 0200 0772

Sample SelectionBenchmark model First two years 45739 0244 0235 0066 31 103 26 105 0026 0225 0699Benchmark model Last three years 60061 0203 0201 0043 16 103 34 107 0021 0113 0611Benchmark model Referred by a friend 26434 0213 0205 0065 30 103 84 107 0031 0155 0480Benchmark model Referred by advertising 79366 0219 0216 0051 21 103 76 106 0022 0212 0806Benchmark model Non-stayers 48387 0226 0240 0057 23 103 77 107 0026 0149 0848Benchmark model Stayers 1st choice 57413 0190 0182 0057 29 103 29 105 0024 0152 0463Benchmark model Stayers 2nd choice 57413 0208 0200 0065 30 103 16 105 0026 0211 0637

Note The table presents the key figures from various specifications and subsamples tracing the order they are presented in Section IIID Full results (in the format

of Table 4) from all these specifications are available in the online Appendixa ldquoBenchmark modelrdquo refers to the benchmark specification estimated on various subsamples (the first row replicates the estimates from Table 4) The other

specifications are slightly different and are all described in more detail in the corresponding parts of Section IIIDb The interpretation of r in the CARA model takes a slightly different quantitative meaning when applied to noninfinitesimal lotteries (such as the approximately

$100 stakes we analyze) This is due to the positive third derivative of the CARA utility function compared to the benchmark model in which we assume a small

third derivative Thus these numbers are not fully comparable to the corresponding figures in the other specificationsc The interpretation of in the thinner-tail distribution we estimate is slightly different from the standard Poisson rate which is assumed in the other specifications

Thus these numbers are not fully comparable to the corresponding figures in the other specifications

768 THE AMERICAN ECONOMIC REVIEW JUNE 2007

We can allow for a positive third derivativewithout expanding the dimensionality of pref-erences by imposing a parametric functionalform on the utility function Essentially suchparametric form imposes a relationship between[u13(w)u(w)] and [u(w)u(w)] Two standardforms are those that exhibit CARA and thosethat exhibit constant relative risk aversion(CRRA) CRRA requires us to make additionalassumptions about the relevant (and unob-served) wealth level of each individual makingit less attractive The CARA case is imple-mented below

With CARA utility we substitute u(w) exp(rw) in equation (4) and rearrange toobtain

(16) rp

exprdh exprdl

This equation defines the indifference set Unlikethe benchmark model there is no closed-formrepresentation for ri() a property that makesestimation significantly slower Due to the precau-tionary saving incentive arising from the thirdderivative we can use a lower level of absoluterisk aversion to rationalize the low deductiblechoice given In other words the CARA indif-ference set is flatter in comparison to the bench-mark case depicted in Figure 2 Thus the CARAspecification will generally lead to lower estimatesof the coefficient of absolute risk aversion asshown in Table 6 The mean and dispersion ofestimated absolute risk aversion (but not the me-dian) are smaller by about a factor of two Thegeneral qualitative results however remain thesame As shown in Table 5 the mean individual isquite risk averse with respect to lotteries of $100magnitude the median is close to risk neutral withrespect to such lotteries and heterogeneity in riskaversion is important

The Claim-Generating ProcessmdashIn thebenchmark model we assume that claims aregenerated by a Poisson model This assumptionis both analytically attractive (it is important inderiving equation (4)) and in our view a rea-sonable approximation of the data-generatingprocess for claims The Poisson distribution hasbeen widely used in economics to model acci-dent counts (eg Nancy L Rose 1990 Dionneand Vanasse 1992 Ronald Michener and CarlaTighe 1992) In using this distribution re-

searchers have followed the long history of theuse of Poisson by actuaries and insurance spe-cialists going back to Ladislaus Bortkiewicz(1898)

An important restriction of the Poisson dis-tribution is that its mean and variance are thesame Although some economic studies con-firmed this assumption for particular accidentdata (eg Christopher M Auld et al 2001) it isoften the case that this restriction is falsifiedThe most common deviation from the Poissonrestriction is that of fat tails ie variance that ishigher than the mean This led researchers touse a negative binomial distribution to accom-modate this regularity by introducing a secondparameter which delinks the relationship be-tween the mean and the variance One naturalinterpretation of the fat tails is that of unob-served heterogeneity and the negative binomialdistribution can be viewed as a Gamma mixtureof Poisson processes Consistent with this viewwe assume that the claim-generating processfollows a Poisson process at the individuallevel but allows unobserved heterogeneity inrisk and estimates a lognormal mixture of Pois-son processes which is similar to a negativebinomial The dispersion parameter we esti-mate is a free parameter which is identi-fied by the fatness of the tails of the claimdistribution

While one would like to allow fat tails of theaggregate claim distribution one may criticizethe assumption of a Poisson distribution at theindividual level for having tails that may be ldquotoofatrdquo Recently Jaap H Abbring et al (2003) andMark Israel (2004) provided evidence for neg-ative state dependence in data from auto insur-ance claims similar to the ones we useControlling for heterogeneity across individu-als these papers show that a second claim isless likely than a first This may happen due toexperience rating or perhaps to more carefuldriving and less intense use of a car after anaccident The Poisson distribution assumes thatthe second accident is just as likely so negativestate dependence may suggest thinner tails atthe individual level

To verify that the main results are not sensi-tive to this restriction of the Poisson model weperform two tests First we estimate the bench-mark model on a sample that includes onlyindividuals with one or no claims The model isstill identified using variation in policy durations

769VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

The estimates of the level of risk aversion andits dispersion across individuals remain similarto those of the benchmark model (Table 6) Theestimates of risk and the correlation with riskaversion are significantly lower But this isexpected we selected out of the sample thehigh-risk individuals

As a further test for the sensitivity of theresults to possible negative state dependencewe estimated the model with a different distri-bution at the individual level which gives riseto much thinner tails We could not find anyknown distribution of count variables that givesrise to tails thinner than Poissonrsquos so we madeone up In particular for a given Poisson ratelet pn be the probability of observing n occur-rences The distribution we take to the data issuch that

(17) Prn pn2

yenm0 pm2

Thus this distribution makes the probability ofmultiple claims much lower probably muchmore so than any realistic negative state depen-dence in the data Essentially such a distribu-tion makes the econometrician view individualswith multiple claims as high-risk individualswith much more certainty as it requires muchmore ldquobad luckrdquo for low-risk individuals tohave multiple claims The estimates for thelevel of risk aversion and its dispersion arealmost the same as in the benchmark model(Table 6) The rest of the summary figures aredifferent The difference in the level of risk isnot informative the interpretation of isslightly different given the change in the un-derlying distribution The dispersion in ismuch higher This is a direct result of the thin-tail assumption The dispersion of is identifiedby the tails of the claim distribution at theaggregate level With thinner tails imposed onthe individual distribution more heterogeneityis needed to match the observed tails at theaggregate Finally the correlation coefficient inthis specification changes signs and is now neg-ative and close to 1 This is consistent withour discussion in Section IIC which emphasizesthat the identification of the correlation coeffi-cient is closely tied to the structural assump-tions Once these are changed the estimatedcorrelation coefficient would change too Most

importantly however the mean and dispersionof risk aversion are stable

The Distribution of Absolute Risk Aver-sionmdashIn the benchmark model we assume thatthe coefficient of absolute risk aversion is lognor-mally distributed across individuals Since onlyfew previous studies focused on heterogeneity inrisk preferences there is not much existing evi-dence regarding the distribution of risk prefer-ences The only evidence we are aware of is theexperimental results presented by Steffen Andersenet al (2005) which show a skewed distributionwith a fat right tail which is qualitatively consis-tent with the lognormal distribution we assume

An additional advantage of the normality as-sumption is computational It provides a closed-form conditional distribution allowing us to useonly univariate (rather than bivariate) draws inthe estimation procedure significantly reducingthe computational burden One may be con-cerned about the sensitivity of the results to thisdistributional assumption For example it maydrive the result that the median level of riskaversion is much lower than the mean

Incorporating alternative distributional as-sumptions significantly complicates and slowsthe estimation procedure44 As an alternativewe develop a procedure that we believe pro-vides some guidance as to the sensitivity of theresults to this distributional assumption Thedisadvantage of the procedure is that it cannotaccount for adverse selection Since we foundthat adverse selection is not that important thisexercise is informative The exercise conveysthat the main qualitative results are not drivenby the (fat) tails of the lognormal distributionwe impose Rather they are driven by the highfraction of individuals who chose a low deduct-ible despite being of low risk The model im-plies that such individuals must have a fairlyhigh level of risk aversion

The exercise uses a Gibbs sampler to estimatethe lognormal distribution of i given the co-variates and the observed number of claims Ineach iteration of this Gibbs sampler conditional

44 A simple discrete-type distribution of risk aversion isnot well identified with the given data This is due to thenature of the exercise The distribution is identified bythe binary deductible choice and by the ldquolocalrdquo variation inthe pricing menu This variation is not enough to pin downthe discrete types accurately

770 THE AMERICAN ECONOMIC REVIEW JUNE 2007

on the data and the most recent draws for theindividual irsquos we compute lower bounds forthe level and dispersion of risk aversion (inaddition to generating the draws for ri from theconditional lognormal distribution) To deter-mine the lower bound for the level of risk aver-sion we compute

(18) r i i 0 if choicei 0

max0 pi

idi 113d i

if choicei 1

Namely we make ri as low as possible giventhe assumptions of the model We assume thatindividuals who chose a regular deductible arerisk neutral while individuals who chose a lowdeductible are just indifferent between the twodeductible levels (unless i is high enough inwhich case they are also assumed to be riskneutral) We then compute the average over therirsquos To compute the lower bound for the dis-persion we search for values of rirsquos that areconsistent with the assumptions of the modeland the observed data and that minimize thevariance of ri This turns out to be a simplesearch procedure that is linear in the number ofindividuals

The lower bound of the level of risk aversionthat we obtain from this exercise is 368 104

NIS1 (with standard deviation of 638 106)This level is five times lower than the analogouspoint estimate reported in Table 4 This trans-lates into indifference about participating in a50ndash50 lottery of gaining $100 and losing $885Thus it still suggests a significant level of riskaversion for lotteries of $100 magnitude Simi-larly the conversion to relative risk aversion asin Table 5 implies a relative risk aversion co-efficient of 1883 The result for the lowerbound of the dispersion is 189 103 (withstandard deviation of 816 105) which is tentimes lower than the benchmark estimate Thusthe coefficient of variation for absolute riskaversion declines only by two suggesting ahigh degree of heterogeneity Finally oneshould note that these bounds involve extremeassumptions and are computed separately andtherefore cannot be binding at the same timeThus the ldquocorrectrdquo estimates for both quantities

are likely to be higher closer to the results wereport for the benchmark model

Incomplete Information about RiskmdashThroughout we assume that individuals haveperfect information about their individual-specific objective risk type i This is a strongerassumption than we need Because expectedutility is linear in probabilities it suffices thatindividualsrsquo expected risk rate is equal to theirobjective risk rate ie i E(iIi) i wherei is a random variable representing individualirsquos perceived risk rate and Ii is individual irsquosinformation at the time of the coverage choiceNamely individuals could be uncertain abouttheir risk type but their point estimate has to becorrect

There are several channels through whichincomplete information may operate Let usconsider two such cases First suppose thatindividuals are correct but only on average iethat i i i where E(i) 0 The intuitionfor this case is similar to an ldquoerrors in variablesrdquomodel and in principle will result in an evenless important role for adverse selection Giventhat we find a relatively small effect of adverseselection this bias will not change this conclu-sion This may be even more pronounced ifcorr(i i) 0 which reflects a reasonableassumption of ldquoreversion to the meanrdquo ie thatan individualrsquos estimate of his risk type is someweighted average between his true risk type andthe average risk type of individuals who aresimilar (on observables) to him The conclusionmay go in the other way only if the mistakes goin the other direction according to which indi-viduals who are riskier than average believe thatthey are even more risky than they truly areThis we believe is less plausible

As a test for the sensitivity of the results tothe complete information assumption we per-form two exercises The first exercise estimatesthe benchmark model separately for experi-enced drivers and inexperienced drivers wherewe define an individual to be an experienceddriver if he has ten years or more driving expe-rience This is a conservative definition com-pared to those used in the literature (Chiapporiand Salanie 2000 Cohen 2005) The underlyingassumption consistent with evidence providedin Cohen (2005) is that drivers learn about theirown risk types as they gain more driving expe-rience but that most of this learning occurs

771VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

within the first ten years of driving Thus ourcomplete information model is a more appeal-ing description of experienced drivers Theresults for experienced drivers are almost iden-tical to the results from the benchmark specifi-cation (Table 6) suggesting that the mainresults are not driven by the fraction of individ-uals who are less likely to know their risk typesThe results for inexperienced drivers show asimilar pattern and magnitude Consistent withthe learning hypothesis however they show alarger dispersion in risk preferences which maybe due to incomplete information and there-fore to more random choices of deductiblelevels

The second exercise we perform is a morestructural version of the learning story whichallows even experienced drivers to have incom-plete information regarding their risk types Wedo so by estimating a different specification ofthe model that assumes that individuals areBayesian and update their information abouttheir own (stable) risk types over time usingonly information about the number of claimsthey make each year While we do not observecomplete claims histories of individuals wecan simulate such histories and integrate overthese simulations Thus individualsrsquo informa-tion would be related to their true type andwould be more precise with longer driving his-tories The full modeling and estimation detailsof this specification are provided in the Appen-dix We view this model as an extreme versionof incomplete information as there are manyother sources through which individuals maylearn about their own types and thereby havebetter information about their types than whatwe estimate them to have While the resultsimply that the levels of risk aversion and heter-ogeneity are lower than the benchmark esti-mates (Table 6) the order of magnitude andqualitative pattern are quite similar suggestingthat the main qualitative findings are robust tothe information structure

Sample SelectionmdashThere are two differentways to think about the contract selection pro-cess One possibility is that individuals firstselect an insurer based on advertisement wordof mouth or access to an agent Then individ-uals select the insurance contract from amongseveral contracts the selected insurer offersThis selection process is consistent with the way

the company executives view their business45

Another possibility is that individuals first col-lect information about all available insurancecontracts and then choose their most preferredone According to the industryrsquos conventionalwisdom individuals do not shop much acrossinsurers and therefore this selection processseems less important

The unique characteristics of direct insurersmay attract individuals who are more likely toexperiment with new ways of doing businessand may therefore be less risk averse than thegeneral population In Table 7 we compare thedemographics of our sample of policyholderswith those of the general Israeli populationThis comparison reflects a similar intuitioncompared with the general population our av-erage policyholder is slightly younger moreeducated more likely to be male and less likelyto be married or an immigrant This direction ofselection may also apply to unobserved riskpreferences thereby making our policyholderson average less risk averse than a representa-tive individual This suggests that the level ofrisk aversion that we find may be viewed as alower bound on the level of risk aversion in thegeneral Israeli population

One potential way to model sample selectionis to allow for an additional outside option to beselected For the vast majority of individuals weobserve the outside option is to purchase sim-ilar insurance from competing insurance agen-cies Unfortunately data on the structure ofcompeting contracts their prices and the waythey vary with individual characteristics are un-available This makes us uncomfortable to try tomodel sample selection as results from anysuch model will be driven by our assumptionsrather than by meaningful variation in the dataThe results are still meaningful for two reasonsFirst this is a large population accounting forabout 7 percent of all drivers in Israel Secondto the extent that our estimates suggest higherlevels of risk aversion than previously esti-mated and that the sample selection is likely to

45 As discussed in Section I the company was the firstdirect insurance provider and it offered significantly lowerpremiums than those offered by competitors due to signif-icant cost advantage In Section I we also discuss theliterature which emphasizes that choice of direct insurers isdriven primarily by nonmonetary ldquoamenitiesrdquo

772 THE AMERICAN ECONOMIC REVIEW JUNE 2007

bias these estimates downward the results arestill informative

To assess the magnitude of sample selectionin driving the main results we perform severaltests First we estimate the benchmark modelseparately for the first two years of the compa-nyrsquos operation during which the company wasthe only company to sell insurance directly andfor the remaining three years after additionaldirect insurers entered the market Second weestimate the benchmark model separately forindividuals who were referred to the companyby word of mouth and for those who heardabout the company through advertisement (pri-marily on television) The latter may havesearched more and might be more price sensi-tive Third we estimate the benchmark modelseparately for individuals who renewed theirpolicy with the company and for those who didnot renew It seems likely that individuals whodid not renew are less selected as most of themswitch back to regular insurers who insure themajority of the population Fourth we estimatethe model for the second deductible choicemade by those individuals who renew It couldbe argued that switching to other companies is

costly so outside options are not as important indriving deductible choices for those who renewAll these results are summarized in Table 6 andshow some important differences between thetwo groups within each pair of subsamples prob-ably reflecting selection For all groups howeverthe qualitative pattern of the results and the orderof magnitude and dispersion of risk aversion aresimilar to those of the full sample This is sugges-tive that correcting for sample selection is unlikelyto change the qualitative results

E Caveats

Moral HazardmdashThroughout our analysiswe abstract from moral hazard ie we assumethat i can vary across individuals but is invari-ant to the coverage choice There are two typesof moral hazard that may play a role in thiscontext First individuals with less coveragemay take greater precaution and drive morecarefully thereby reducing their claim risk rateSecond conditional on a claim event peoplewith higher deductibles are less likely to file aclaim there exists a range of claims for which

TABLE 7mdashREPRESENTATIVENESS

Variable Sampleb Populationc Car ownersd

Agea 4114 (1237) 4255 (1801) 4511 (1413)Female 0316 0518 0367Family Single 0143 0233 0067

Married 0780 0651 0838Divorced 0057 0043 0043Widower 0020 0074 0052

Education Elementary 0029 0329 0266High school 0433 0384 0334Technical 0100 0131 0165College 0438 0155 0234

Emigrant 0335 0445 0447

Obs 105800 723615 255435

a For the age variable the only continuous variable in the table we provide both the mean and the standard deviation (inparentheses)

b The figures are derived from Table 1 The family and education variables are renormalized so they add up to one weignore those individuals for whom we do not have family status or education level This is particularly relevant for theeducation variable which is not available for about half of the sample it seems likely that unreported education levels arenot random but associated with lower levels of education This may help in explaining at least some of the gap in educationlevels across the columns

c This column is based on a random sample of the Israeli population as of 1995 We use only adult population ieindividuals who are 18 years of age or older

d This column is based on a subsample of the population sample The data provide information about car ownership onlyat the household level not at the individual level Thus we define an individual as a car owner if (a) the household owns atleast one car and the individual is the head of the household or (b) the household owns at least two cars and the individualis the spouse of the head of the household

773VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

filing is profitable only under a low deductible(the literature often refers to this second effectas ldquoex post moral hazardrdquo)

To the extent that moral hazard exists abstract-ing from it will likely bias our estimates of riskaversion downward To see this note that adjust-ing behavior will help individuals to self-insureagainst uninsured costs Similarly ex post moralhazard would reduce the value of a low deduct-ible as in the event of a claim the gain from alower deductible would sometimes be less thanthe difference between the two deductible levelsBoth these effects will make a low deductible lessattractive requiring individuals to be even morerisk averse than we estimate in order to purchasemore coverage Below we discuss why in ourview abstracting from moral hazard is a reason-able approximation in this setting

It seems reasonable to conjecture that ceterisparibus insured individuals will drive less care-fully than uninsured ones It may also seemreasonable that the existence of a deductiblemay make individuals more careful about smalldamages to their car However when all choicesinclude a deductible and deductibles are similarin their magnitude it seems less likely thatdriving or care behavior will be affected46 Fi-nally to separately identify moral hazard onewould need another dimension of the data overwhich risk types remain fixed but coveragechoices vary exogenously (Israel 2004 Jaap HAbbring et al forthcoming)

We rely on the data to justify why we abstractfrom the second potential effect that of ex postmoral hazard Figure 3 presents data on theclaim amounts and shows that about 99 percentof the claims filed by policyholders with low-deductible policies were for amounts greaterthan the higher deductible level If the distribu-tion of amounts of potential claims does notvary with the deductible choice and if individ-uals file a claim for any loss that exceeds their

deductible level this suggests that 99 percent ofthe claims would have been filed under eitherdeductible choice making the assumption toabstract from moral hazard not very restrictive

Individuals however may choose not to filea claim even when the claim amount exceedsthe deductible level This may happen due toexperience rating which increases future insur-ance premiums These dynamic effects do notdepend on the deductible level at the time of theclaim so they simply enter in an additive wayUsing our data on individuals who renew theirpolicies with the company we can assess howbig the dynamic effects are These data showthat the price effect of a claim lasts for threeyears and is highest when an individual files hissecond claim within a year In such a case hewould face about a 20 percent increase in hisinsurance premium in the subsequent year 10percent in the year after and 5 percent in thethird year after the claim The regular premiumis about twice the regular deductible amount soan upper bound for the dynamic costs is 70percent of the regular deductible In most casesthe actual dynamic costs are much lower thanthis upper bound (the dynamic costs of say thefirst claim within a year are close to zero) Inaddition an individual can opt out of the con-tract and switch to a different insurance pro-vider This is likely to reduce his dynamic costsbecause in Israel unlike in the United Statesand many other countries there is no publicrecord of past claims Therefore insurance pro-viders can take full advantage of past recordsonly for their past customers For this reasonnew customers will of course face higher pre-miums than existing ones but the premium in-crease would not be as high as it would havebeen with the old insurance provider This isdue to the presence of ldquoinnocentrdquo new custom-ers who are pooled together with the switchers(Cohen 2003)47 Using 70 percent as a conser-vative upper bound Figure 3 shows that about93 percent of those claims filed by individualswith a low deductible would have also been46 This assumption is consistent with Cohen and Einav

(2003) who find no evidence for behavioral response tochanges in seat belt laws The following anecdotal obser-vation also supports this In an informal survey we con-ducted among our colleagues all of them were aware of adeductible in their auto insurance policy but fewer than 20percent knew its level This does not imply that 80 percentof our colleagues did not pay attention to their deductiblechoice at the time the choice was made It does implyhowever that their driving or care behavior cannot dependon the deductible level

47 New customers may voluntarily report their claimshistory to their new insurance provider Voluntary disclo-sure of past claims is as may be expected not truthful Ourdata suggest an unconditional claim rate of 02453 in oursample population Our data on claims history as voluntar-ily disclosed by the same individuals suggest a claim rate of00604 which is four times lower

774 THE AMERICAN ECONOMIC REVIEW JUNE 2007

filed with a regular deductible While this is notnegligible it applies for only a small fraction ofindividuals For the vast majority of them the99 percent discussed above is a more appropri-ate benchmark Therefore ex post moral hazardis unlikely to play a major role in this settingand one can abstract from the loss distributionand focus on claim rates as we do in thispaper48

Additional Cost of an AccidentmdashOur modelassumes that in the event of an accident theonly incurred costs are those associated with thedeductible payment In practice however other

transaction costs may be incurred such as thetime spent for appraisal of the damage the costsassociated with renting a replacement car for theduration of a repair etc Such costs could bereadily incorporated into the model To illus-trate we assume that these costs are known inadvance and are given by a constant c (whichcould in principle vary with each individual)Since c will not vary with the chosen level ofdeductible it will not affect the value d andwill enter the empirical model only through itseffect on d In particular equation (7) willchange to

(19) r

p

d 1

d c

and everything else will remain the same

48 We would not be as comfortable with this statementfor the choice of high and very high deductibles which areat much higher levels This is one additional reason to focusonly on the choice between low and regular deductibles

FIGURE 3 CLAIM DISTRIBUTIONS

Notes This figure plots kernel densities of the claim amounts estimated separately depending on the deductible choice Forease of comparison we normalize the claim amounts by the level of the regular deductible (ie the normalization is invariantto the deductible choice) and truncate the distribution at ten (the truncated part which includes a fat tail outside of the figureaccounts for about 25 percent of the distribution and is roughly similar for both deductible choices) The thick line presentsthe distribution of the claim amounts for individuals who chose a low deductible while the thin line does the same for thosewho chose a regular deductible Clearly both distributions are truncated from below at the deductible level The figure showsthat the distributions are fairly similar Assuming that the claim amount distribution is the same the area below the thickerline between 06 and 1 is the fraction of claims that would fall between the two deductible levels and therefore (absentdynamic incentives) would be filed only if a low deductible were chosen This area (between the two dotted vertical lines)amounts to 13 percent implying that the potential bias arising from restricting attention to claim rate (and abstracting fromthe claim distribution) is quite limited As we discuss in the text dynamic incentives due to experience rating may increasethe costs of filing a claim shifting the region in which the deductible choice matters to the right an upper bound to these costsis about 70 percent of the regular deductible covering an area (between the two dashed vertical lines) that integrates to morethan 7 percent Note however that these dynamic incentives are a very conservative upper bound they apply to less than 15percent of the individuals and do not account for the exit option which significantly reduces these dynamic costs

775VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

This implies that in principle such costs willhave no effect on the results of the counterfac-tual exercise we present later The costs willhowever affect the interpretation of the esti-mates of risk aversion In particular instead ofthe distribution of r we will now be estimatingthe distribution of r(d c)d so the reportedestimates of the coefficient of absolute riskaversion will be biased upward The magnitudeof the bias depends on the size of this transac-tion cost c compared to the average deductibled If c is relatively small the bias is negligibleIf however c is as big as the (average) deduct-ible level all our reported estimates of the levelof risk aversion should be divided by two (butthe coefficients on observables which are semi-elasticities will not change) The intuitionwould be similar but more involved if c variesacross individuals but not proportionally to d

Data about such transaction costs are ofcourse not available The following back-of-the-envelope exercise may provide some guidanceas to the magnitude of such costs We collecteddata from the Israeli police about the annual num-bers of accidents49 accidents with fatalities andaccidents with severe injuries in Israel for theyears 1996ndash1999 We then divided these numbersby our estimate of the total number of auto insur-ance claims in Israel (161859)50 We obtain that159 percent of the claims involve reported acci-dents 22 percent involve accidents with severeinjuries and 03 percent involve fatal accidentsThus the majority of claims reflect small unre-ported accidents perhaps suggesting that theseadditional costs of a claim are often not too large

Deviations from Expected Utility TheorymdashThroughout the paper we restrict attention toexpected utility maximizers Despite much ev-idence in the literature against some of the pre-dictions of expected utility theory it still seemsto us the most natural benchmark to specify andone that facilitates comparison to previous stud-ies We note that expected utility theory is as-sumed it is not and cannot be tested within our

framework Given our cross-sectional analysiswhich in principle allows flexible forms ofunobserved heterogeneity in risk preferencesthere are no testable restrictions imposed byexpected utility theory We should also note thatmuch (but not all) of the documented evidenceagainst expected utility theory arises with ex-treme risk probabilities which are close to zeroor one Our data (and our estimates) are basedon risk probabilities that are all in the range of010 to 035 Over this range expected utilityseems to perform better Finally it is importantto stress two points First at the conceptuallevel it is straightforward to use an alternativetheory of decisions under uncertainty If con-ditional on objective risk individuals vary in asingle dimension the same conceptual modeland empirical strategy can be applied All oneneeds to do is to specify the parameter overwhich decisions vary and construct an indiffer-ence set in the space of the specified parameterand (objective) risk types similar to the onepresented in Figure 2 Second any alternativemodel of decisions under uncertainty would re-quire us to take an even stronger view regardingthe parameterized objective function For exam-ple prospect theory (Daniel Kahneman andAmos Tversky 1979) would require us to pa-rameterize not only the curvature of individu-alsrsquo utility functions but also their referencepoints for which there is no natural choice inour context Similar issues arise if we try toapply decision weights (Tversky and PeterWakker 1995) or measures of overconfidencewith respect to driving ability

F Implications for Profits and Pricing

We now look at how firm profits vary withalternative pricing schemes This exercise isinteresting for several reasons First althoughwe do not use supply-side information for esti-mation it shows how one may incorporate suchinformation in estimation Second it illustratesthe conceptual trade-off faced by a monopolistthat operates in a market with adverse selectionAlthough the conceptual trade-off betweenhigher demand and worse selection is wellknown and has been extensively analyzed in thetheoretical literature quantifying it is importantto understand its empirical relevance Finallywe argued earlier that unobserved heterogeneityin risk aversion seems more important than un-

49 An accident is counted in this measure if it was re-ported to the police and a police officer arrived at the scene

50 This is estimated by taking the number of activepolicies at the end of our sample (45871) dividing it by ourbest guess for the share of the market the company had atthe time (7 percent) and multiplying it by the estimatedclaim rate in our data as computed in Table 2 (0245)

776 THE AMERICAN ECONOMIC REVIEW JUNE 2007

observed heterogeneity in risk The current ex-ercise shows that this conclusion also translatesto pricing and profits

Throughout this section we hold fixed thedistribution of risk and risk aversion the firmfaces Since we have little information about thedeterminants of overall demand faced by thefirm and hence how restrictive it is to hold thedistribution fixed we make the simplifying as-sumption that individuals make their choicessequentially They first choose the insuranceprovider by observing only the price of theregular deductible Once they decide to buy apolicy from the insurer they choose the deduct-ible level This seems a reasonable approxima-tion as the regular deductible is the one alwaysadvertised and initially quoted while the otheroptions are revealed only once the potentialcustomer and the insurance sales person ldquogetinto detailsrdquo Consistent with this assumptionwe assume that the regular premium and de-ductible are dictated by competitive conditionsand we focus on the choice of the low deduct-ible and its associated premium

From the companyrsquos standpoint each indi-vidual can be represented by a random draw of(i ri) from the conditional (on observables)distribution of risk and risk aversion

(20) ln i

ln ri Nxi

xi

2 r

r r2

When analyzing the optimal menu to offer suchan individual the company is assumed to berisk neutral and to maximize expected profitsSuppose the company offered only the regulardeductible and premium (dh ph) Let the ex-pected profits from this strategy be 0 Givenour assumptions we proceed by analyzing howthe firmrsquos profits are affected by a choice of anadditional low deductible option (dl pl) withdl dh and pl ph It is easy to use a changein variables and analyze the choice of d dh dl and p pl ph Expected profits arenow given by

(21) maxdp

0 Prri rii d pp

d Eiri rii d p

The trade-off in the companyrsquos decision isstraightforward Each new customer who chooses

the low combination pays an additional p up-front but saves d for each accident she is in-volved in This translates into two effects thatenter the companyrsquos decision problem The first issimilar to a standard pricing problem higher(lower) price difference (deductible difference)p (d) leads to a higher markup (on those indi-viduals who select the low deductible) but tolower quantity (or probability of purchase) asfewer individuals elect to choose the low deduct-ible This effect enters the profit function throughD(d p) Pr(ri ri(i d p)) The secondcomposition effect arises because of adverse se-lection As the price of the low deductible in-creases those individuals who still elect to choosethe low combination are ceteris paribus thosewith higher risk This effect enters through (dp) E(iri ri(i d p)) Its magnitude andsign depend on the relative heterogeneity of i andri and on the correlation between them Sinceneither D(d p) nor (d p) has a closed-form solution we analyze this decision problemgraphically where D(d p) and (d p) arenumerically computed using simulations from thejoint distribution

We illustrate our analysis by using the meanindividual in the data whose expected ln(i)and ln(ri) are 154 and 1181 respectively(based on Table 4) Such an individual has tochoose between a regular contract of ( phdh) (3189 1595) (in NIS) and a lowcontract of ( pl dl) (3381 957) ie (pd) (191 638) Below we discuss severaladditional counterfactual cases First we con-sider a case with a negative correlation be-tween risk and risk aversion (with the samemagnitude ie 084) Second we con-sider cases when the company ignores unob-served heterogeneity in one of the dimensionsie it views individuals as a draw from theestimated marginal distribution on one dimen-sion with the other dimension known and fixedat its estimated mean We do this exercise foreach dimension separately

To get intuition for the different effects Figure2 presents the estimated distribution in the spaceof (i ri) A small increase (decrease) in p (d)shifts the indifference setup to the right therebymaking some marginal individuals who were pre-viously just to the right of it switch to choosingthe regular deductible The demand trade-off isjust the comparison between the marginal loss ofthe company from all the marginal individuals

777VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

who no longer buy higher coverage with thehigher profits made from the inframarginal indi-viduals who still elect to choose higher coverageFigure 2 also helps in illustrating the effect ofadverse selection and the importance of the cor-relation coefficient As the menu shifts to the rightthe positive correlation implies that the marginalindividuals have higher risk than the average Thismeans that ldquolosingrdquo them (namely havingthem buy less coverage) is not as costly forthe insurance company as such individualsare on average more adversely selected Anegative correlation for example would havemade these marginal individuals more valuablethereby decreasing the incentive to increase pricesor deductibles from the current levels

Figure 4 presents the effects of pricing onprofits by varying the low-deductible levelkeeping the premium charged for it fixed at theobserved price (of p 191) It implies that the

current low deductible benefit of 638 NIS re-sults in additional annual profits of about 368NIS per customer This is about 037 percent oftotal operating profits per customer which isabout 1000 NIS Note however that after sub-tracting the administrative and claim-handlingcosts associated with each customer and claimthe relative magnitude of this effect will behigher Note also that the estimates imply thatthe current low-deductible level is suboptimalBy setting a smaller low-deductible benefit ofd 355 NIS (ie increasing the current lowdeductible by 283 NIS) additional profits canbe increased to 659 NIS51 Of course the find-

51 There is no reason of course to limit the choice of thecompany to only one additional deductible level Moredegrees of freedom in choosing the pricing menu will leadto higher profits

FIGURE 4 COUNTERFACTUALSmdashPROFITS

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) We plot the additional profits fromoffering a low deductible as a function of the attractiveness of a low deductible d dh dl fixing its price at the observedlevel The thick solid line presents the counterfactual profits as implied by the estimated benchmark model The other threecurves illustrate how the profits change in response to changes in the assumptions when the correlation between riskand risk aversion is negative (thin solid line) when there is no heterogeneity in risk aversion (dot-dashed line) and whenthere is no heterogeneity in risk (dashed line) The maxima (argmax) of the four curves respectively are 659 (355)714 (583) 0 and 704 (500) The dotted vertical line represents the observed level of d (638) which implies that theadditional profits from the observed low deductible are 368 NIS per policy

778 THE AMERICAN ECONOMIC REVIEW JUNE 2007

ing that the current pricing is suboptimal mayalso be due to a limitation of the model

Consistent with the intuition discussed aboveFigure 4 also shows that the incentive to in-crease prices (or lower the deductible) is higherwith positive correlation When the correlationbetween risk and risk aversion is negative theoptimal low deductible is lower It also showsthat ignoring either dimension of unobservedheterogeneity has an important effect on pric-ing decisions However while ignoring het-erogeneity in risk does not change thequalitative pattern by much ignoring hetero-geneity in risk aversion completely changesthe pricing analysis In fact given the esti-mated coefficients and the observed prices ifindividuals vary only in risk offering a lowdeductible does not increase profits due to ad-verse selection

Figure 5 breaks down the effect on profits bypresenting the pricing effect on the demand forlow deductible D(d p) and on the compo-sition effect (d p) The former is simplygenerated by the distribution of certainty equiv-alents implied by the joint distribution of i andri (Landsberger and Meilijson 1999) It is S-shaped due to the assumption of lognormal dis-tribution The shape of the composition effect isdriven by the relative variance of i and ri andby the correlation coefficient As the estimatesimply that most of the variation in certaintyequivalents is driven by variation in ri thestrong positive correlation implies that the com-position effect is monotonically decreasing inthe deductible level As the low-deductible op-tion becomes more favorable more peoplechoose it with the most risky individuals choos-ing it first The effect of the deductible level onthe composition effect is dramatically differentwhen the correlation between risk and risk aver-sion is negative When this is the case theobserved relationship between the deductiblelevel and the composition effect is mostly re-versed This is because the effect of risk aver-sion dominates that of adverse selection due toits higher variance

IV Concluding Remarks

The paper makes two separate contributionsFirst from a methodological standpoint we layout a conceptual framework through which one

can formulate and estimate a demand system forindividually customized contracts The key datarequirements for this approach are contractchoices individual choice sets and ex post riskrealizations Since such data may be available inmany other contexts the methodological frame-work may be useful to uncover structural pa-rameters in such settings As an example onecould consider annuity data and use guaranteeperiod choices and mortality data to identifyheterogeneity in risk (mortality) and in prefer-ences for wealth after death (Einav Finkelsteinand Paul Schrimpf 2006) Similarly one couldconsider loan data and use down-paymentchoices and default data to identify heterogene-ity in risk (default) and in liquidity (Will Ad-ams Einav and Jonathan D Levin 2006)Second from an economic standpoint we pro-vide a new set of estimates for the degree andheterogeneity of (absolute) risk aversion and itsrelationship with risk We discuss these below

While our estimates of risk aversion help topredict other related insurance decisions it isnatural to ask to what extent these parametersare relevant in other contexts This is essentiallyan empirical question which can be answeredonly by estimating risk aversion parameters fora variety of bet sizes and in a variety of con-texts Since isolating risk preferences in manycontexts is hard such exercises are rare leavingus with no definite answer for the scope ofmarkets for which our estimates may be rele-vant On one hand Rabin (2000) and Rabin andThaler (2001) argue that different decisions inlife are taken in different contexts and thereforemay be subject to different parameters in theutility function On the other hand classicaltheory suggests that each individual has onevalue function over her lifetime wealth so allrisky decisions take into account the same valuefunction and are therefore subject to the samerisk preferences Our view is somewhere inbetween We are more comfortable with extrap-olation of our risk aversion estimates to setupsthat are closer to the auto insurance marketcontext in which these parameters are esti-mated To assess ldquoclosenessrdquo it is important toconsider various factors over which contextsmay differ Such factors may include bet size aswell as informational and behavioral effectssuch as default options framing and rarity of theevents Since we use claim data to identify risktype we leave everything else to be interpreted as

779VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

FIGURE 5 COUNTERFACTUALSmdashSELECTION

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) Parallel to Figure 4 we breakdown the effects on profits to the share of consumers who choose a low deductible (bottom panel) and to the expected riskof this group (top panel) This is presented by the thick solid line for the estimates of the benchmark model As in Figure 4 wealso present these effects for three additional cases when the correlation between risk and risk aversion is negative (thin solidline) when there is no heterogeneity in risk aversion (dot-dashed line) and when there is no heterogeneity in risk (dashedline) The dotted vertical lines represent the observed level of d (638) for which the share of low deductible is 16 percentand their expected risk is 0264 This may be compared with the corresponding figures in Table 2A of 178 and 0309respectively Note however that the figure presents estimated quantities for the average individual in the data while Table2A presents the average quantities in the data so one should not expect the numbers to fit perfectly

780 THE AMERICAN ECONOMIC REVIEW JUNE 2007

risk aversion As an example overconfidence willbe captured by a lower level of estimated riskaversion so if overconfidence is more importantin auto insurance than in health insurance whenextrapolated to health insurance individuals maybehave as if they are more risk averse than weestimate them to be

With this caveat in mind let us discuss ourfour main findings First we find large hetero-geneity in risk preferences across individualsThis heterogeneity is important in various con-texts (a) it cautions against using pure cross-sectional variation to test expected utilitytheory (b) it may create strong selection ofparticipants into particular markets this maymake participants in voluntary markets quitedifferent in their risk preferences from those inmarkets in which participation is either manda-tory or driven by other factors and (c) it maymake the marginal individual quite differentfrom the average one Models in macroeconom-ics and finance which often use a representativeindividual framework may not be able to cap-ture and account for such differences

Our second set of findings concerns the wayrisk aversion relates to observable characteris-tics Our finding that women are more riskaverse than men has been documented in othersettings The finding that risk preferences ex-hibit a U-shaped pattern over the life cycle maybe interesting to explore further in other con-texts Other findings suggest that the estimatedcoefficient of risk aversion increases with ob-servables that are related to income and wealthAs we agree with the widely held belief of thedecreasing absolute risk aversion property ourpreferred interpretation for this finding is thatwealth and income may be endogenous gener-ating the estimated cross-sectional relationshipWhile lower risk aversion may be associatedwith higher propensity to become an entrepre-neur and thereby have higher wealth it mayalso be associated with lower propensity to saveor invest in education affecting wealth the otherway Therefore one important message of thisfinding is that accounting for heterogeneity inpreferences may be important as representativeconsumer models may provide misleading in-terpretations for otherwise natural results

The last two sets of findings concern therelationship between risk and preferences Sincerisk is particular to the context in which it ismeasured these findings may be sensitive to the

market context and may change once risk takesother forms Even within the auto insurancemarket it is important how risk is measuredMoreover risk in the auto insurance marketmay be conceptually different from risk in othermarkets In many markets risk is independentacross individuals In auto insurance howevermuch of the risk depends on coordinationamong drivers and therefore may be more re-lated to relative not absolute characteristicsFor this reason our finding of positive correla-tion between risk and risk aversion can coexistwith findings of negative correlation in othercontexts (Israel 2005 Finkelstein and McGarry2006) The positive correlation we find is alsoconsistent with the fact that the bivariate probittest in our data provides evidence for adverseselection (Cohen 2005) while similar reduced-form tests in other contexts do not

Finally we find that unobserved heterogene-ity in risk preferences is more important thanheterogeneity in risk This may be driven by thecasual evidence that insurance companies exertmuch effort and resources in collecting con-sumer data which are informative about riskclassification but not about preferences52 Weillustrate the empirical importance of our find-ings for the analysis of optimal contracts in autoinsurance The presence of more than one di-mension of unobserved heterogeneity may dra-matically change the nature of these contractsTheory is still not fully developed for suchmultidimensional screening problems as it typ-ically requires a small number of types (Lands-berger and Meilijson 1999) restricts the twodimensions to be independent of each other(Jean-Charles Rochet and Lars A Stole 2002)or assumes that the number of instrumentsavailable to the monopolist is not smaller thanthe dimension of unobserved heterogeneity(Steven Matthews and John Moore 1987 Rich-ard Arnott and Joseph E Stiglitz 1988)53 MarkArmstrong (1999) may be the closest theoreticalwork to the framework suggested here It cannot

52 This choice of data collection efforts may be justifiedif it is easier for such firms to price discriminate based onrisk but harder to price discriminate based on preferencesA common belief is that without cost-based (ie risk-based) justification for prices price discrimination may leadto consumer backlash

53 See also Smart (2000) Villeneuve (2003) and Jullienet al (2007) for related theoretical results

781VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

be directly applied however as it uses simpli-fying linearity assumptions which would behard to impose in the current context Our re-sults indicate that many applications can benefitfrom extending the theory to include the more

general case such as the one analyzed hereSuch a theory may also serve as a guide forusing supply-side restrictions in similar con-texts Our counterfactual analysis is a very pre-liminary start in this direction

APPENDIX A DESCRIPTION OF THE GIBBS SAMPLER

In this appendix we describe the setup of the Gibbs sampler that we use to estimate the modelOne of the main advantages of the Gibbs sampler is its ability to allow for data augmentation oflatent variables (Tanner and Wong 1987) In our context this amounts to augmenting the individual-specific risk aversion and risk type namely i rii1

n as additional parametersWe can write the model as follows

(22) ln i xi i

(23) ln ri xi vi

(24) choicei 1 if ri ri ()0 if ri ri ()

(25) claimsi Poissoni ti

(26) i

vi

iid

N00

2 r

r r2

Let

2 r

r r2 X

x1

xn

0

0 x1

xn

y r and ui i

vi

The set of parameters for which we want to have a posterior distribution is given by uii1

n The prior specifies that are independent of uii1n have a conventional

diffuse prior We adopt a hierarchical prior for uii1N

(27) uii 1n

iid

N0

(28) 1 Wishart2a Q

782 THE AMERICAN ECONOMIC REVIEW JUNE 2007

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 19: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

save more to obtain more education or totake other actions that lead to greaterwealth34

Let us make several additional observationsFirst while owners of more expensive cars ap-pear to have both higher risk exposure andhigher levels of risk aversion owners of biggercars have higher risk exposure but lower levelsof risk aversion This should indicate that thestructure of the model does not constrain therelationship between the coefficients in the twoequations Rather it is the data that ldquospeak uprdquoSecond individuals who are classified by theinsurer as ldquogood driversrdquo indeed have lowerrisk but also appear to have lower risk aversionThis result is somewhat similar to the positivecorrelation between unobserved risk and unob-served risk aversion which we report belowThird policyholders who tend to use the car forbusiness are less risk averse This could bebecause uninsured costs of accidents occurringto such policyholders are tax deductible Fourthpolicyholders who reported three full years ofclaims history are more risk averse but are notdifferent in their risk exposure The attitude thatleads such policyholders to comply with therequest to (voluntarily) report three full years ofclaims history is apparently and not surpris-ingly correlated with higher levels of risk aver-sion In contrast while past claims indicate highrisk they have no significant relationship withrisk aversion Finally we find a strong trendtoward lower levels of risk aversion over timeThis is a replication of the probit results dis-cussed earlier

The Effect of Individual Characteristics onClaim RiskmdashThe first column of Table 4 de-scribes the relationship between observablesand risk exposure Accident risk is higher fordivorced individuals and lower for people witha college education Bigger older more expen-sive and noncommercial cars are all morelikely to be involved in an accident Drivingexperience reduces accident rates as do mea-sures of less intense use of the car while youngdrivers are more exposed to risk Claim propen-sity is highly correlated over time the voluntaryreport of past claims is a strong predictor offuture claims This risk equation produces re-sults that are similar to those of the simplerPoisson regression reported in Table 3 Al-though some of the coefficients lose signifi-cance the magnitude of most coefficients isquite similar The similarity between these twosets of results is to be expected as the riskregression is identified primarily from the dataon claims so incorporating the information ondeductible choice does not qualitatively changethe conceptual identification strategy (see Sec-tion IIC) If the results were not similar thiscould have indicated a misspecification of themodel The slight differences between the riskregressions in Table 3 and Table 4 are drivenmainly by the structural assumptions First thebenchmark model estimates a lognormal mix-ture of Poisson rates rather than a single Pois-son model By incorporating the fatter tails ofthe claim distribution it slightly changes theresults increases the standard errors and de-creases the average predicted claim rate Sec-ond the information on deductible choiceslightly helps us in obtaining more precise es-timates through the correlation structure be-tween the error terms in the two equations

The Implied Level of Risk AversionmdashOne ofthe main goals of the estimation is to obtainmeasures of the level of risk aversion in thepopulation we study Since we use Gibbs sam-pling and augment the latent coefficients ofabsolute risk aversion we can directly obtainthe posterior distribution of various moments ofthis distribution At each iteration of the Gibbssampler we compute the mean standard devi-ation (across individuals) and various percen-tiles of the simulated draws of i and ri and thecorrelation between them The far-right columnof Table 4 reports the averages and standard

34 One may be tempted to interpret the positive wealtheffects as an indirect indication of credit constraints wealth-ier individuals are less credit constrained and therefore canafford to purchase more insurance We do not share thisinterpretation for two reasons First the insurance companyobserves these proxies for wealth and conditions on themwhen setting prices Since the willingness to pay for insur-ance is likely to be correlated with the willingness to pay forthe additional insurance provided by the low deductibleoption premiums already reflect this variation We condi-tion on the level of the premium Second paying less exante implies paying more ex post so applying the creditconstraint argument only to the ex ante payment but not tothe probabilistic ex post deductible payments has no theo-retical foundation Essentially the setup of the model linksthe ex ante decisions to the ex post losses which are bothdriven by the curvature of the vNM utility function

763VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

deviations of these computed quantities over theiterations of the Gibbs sampler35 The impliedrisk aversion of the mean individual is 00019which is about seven times greater than theback-of-the-envelope calculation presented inthe previous section As we assume a lognormal

distribution and estimate a relatively high dis-persion coefficient r the estimates also implysignificantly lower values for the median levelof risk aversion As shown later the qualitativepattern of these results is quite robust acrossspecifications

Table 5 summarizes our findings regardingthe level and dispersion of risk aversion Itpresents two ways to interpret the estimates weobtained from the benchmark model as well ascomparisons to specifications with constant ab-solute risk aversion (CARA) utility and with

35 Note that these estimated quantities are unconditionalIn computing these quantities we integrate over the distri-bution of observable characteristics in the data so onecannot compute these estimates from the estimated param-eter directly

TABLE 5mdashRISK-AVERSION ESTIMATES

Specificationa Absolute risk aversionb Interpretationc Relative risk aversiond

Back-of-the-envelope 10 103 9070 1484Benchmark model

Mean individual 67 103 5605 972225th percentile 23 106 9998 003Median individual 26 105 9974 03775th percentile 29 104 9714 42790th percentile 27 103 7834 390295th percentile 99 103 4937 14327

CARA utilityMean individual 31 103 7651 4436Median individual 34 105 9966 050

Learning modelMean individual 42 103 6886 6140Median individual 56 106 9995 008

Comparable estimatesGertner (1993) 31 104 9699 479Metrick (1995) 66 105 9934 102Holt and Laury (2002)e 32 102 2096 86575Sydnor (2006) 20 103 8329 5395

a The table summarizes the results with respect to the level of risk aversion ldquoBack-of-the-enveloperdquo refers to thecalculation we report in the beginning of Section III ldquobenchmark modelrdquo refers to the results from the benchmark model(Table 4) ldquoCARA utilityrdquo refers to a specification of a CARA utility function and ldquolearning modelrdquo refers to a specificationin which individuals do not know their risk types perfectly (see Section IIID) The last four rows are the closest comparableresults available in the literature

b The second column presents the point estimates for the coefficient of absolute risk aversion converted to $US1 unitsFor the comparable estimates this is given by their estimate of a representative CARA utility maximizer For all otherspecifications this is given by computing the unconditional mean and median in the population using the figures we reportin the mean and median columns of risk aversion in Table 6 (multiplied by 352 the average exchange rate during the periodto convert to US dollars)

c To interpret the absolute risk aversion estimates (ARA) we translate them into x u(w) 1frasl2 u(w 100) 1frasl2 u(w x) That is we report x such that an individual with the estimated ARA is indifferent about participating in a 50ndash50lottery of gaining 100 US dollars and losing x US dollars Note that since our estimate is of absolute risk aversion the quantityx is independent of w To be consistent with the specification we use a quadratic utility function for the back-of-the-envelopebenchmark and learning models and use a CARA utility function for the others

d The last column attempts to translate the ARA estimates into relative risk aversion We follow the literature and do soby multiplying the ARA estimate by average annual income We use the average annual income (after tax) in Israel in 1995(51168 NIS from the Israeli Census) for all our specifications and we use average disposable income in the United Statesin 1987 ($15437) for Gertner (1993) and Metrick (1995) For Holt and Laury (2002) and Sydnor (2006) we use a similarfigure for 2002 ($26974)

e Holt and Laury (2002) do not report a comparable estimate The estimate we provide above is based on estimating aCARA utility model for the 18 subjects in their experiment who participated in the ldquo90rdquo treatment which involved stakescomparable to our setting For these subjects we assume a CARA utility function and a lognormal distribution of theircoefficient of absolute risk aversion The table reports the point estimate of the mean from this distribution

764 THE AMERICAN ECONOMIC REVIEW JUNE 2007

incomplete-information (discussed later) and toother comparable figures in the literature Ourbenchmark estimate suggests that an averagequadratic utility maximizer36 will be indifferentabout participating in a 50ndash50 lottery in whichhe gains $100 or loses $5605 A CARA spec-ification suggests a significantly lower value foraverage risk aversion making the mean individ-ual indifferent about a 50ndash50 lottery of gaining$100 or losing $7651 The results from theincomplete information model for the meanlevel of risk aversion are in between these esti-mates All the results suggest that although themean individual exhibits a significant level ofrisk aversion heterogeneity in risk preferencesis important and the median individual is al-most risk neutral with respect to lotteries of$100 magnitude Although the mean is alwaysgreater than the median under the lognormaldistribution the large difference we find is notimposed In principle we could have obtained ahigh level of risk aversion with less heteroge-neity thereby leading to a smaller differencebetween the mean and the median (the esti-mated distribution of risk types is an example)

Let us briefly discuss the relevance of thecomparison to Gertner (1993) and Metrick(1995) There are two ways one can reconcilethe differences between the estimates Firstboth these papers measure risk aversion fortelevision game show participants these arehighly selected groups in a rather ldquorisk-friendlyrdquo environment37 Second the magni-tudes of the stakes are higher The showparticipants bet on several thousand dollars andmore while our average individual risks muchlower stakes in the range of $100 Thus thedifference in the results may be due to the issuesraised in Rabin (2000) regarding the compara-bility of behavior across different contexts andbet sizes We discuss this further in Section IV

A different way to quantify our estimate is byreporting them in relative terms There is noconsensus in the literature as to the relevantwealth that is taken into account in such deci-

sions Therefore for comparability we followthe closest papers in this respect (eg Gertner1993) and use annual income as the relevantwealth We multiply the estimated coefficient ofabsolute risk aversion by the average annualincome in Israel during the observation periodUnder the (questionable) assumption that an-nual income is a good proxy for the relevantwealth at the time of decision making thisproduct would be a proxy for the coefficient ofrelative risk aversion As Table 5 indicates ourbenchmark specification results in an impliedcoefficient of relative risk aversion of about 97A CARA specification results in a lower coef-ficient of 44 On the other hand the medianestimate for relative risk aversion is well belowone Thus the widely used estimate of a lowsingle-digit coefficient of relative risk aversionfalls somewhere between the median and themean and between the median and the seventy-fifth percentile of the risk aversion distribution

The Relationship between Unobserved Riskand Unobserved Risk AversionmdashTable 4 al-lows us to make observations about the relation-ship between risk and risk aversion We firstdiscuss the relative importance of unobservedheterogeneity of both dimensions In the popu-lation we study unobserved heterogeneity inrisk aversion (r) is much greater than unob-served heterogeneity in risk () This is trueboth in absolute terms (315 compared to 015respectively) and after normalizing by the cor-responding mean level38 using the coefficientof variation as a measure of dispersion (027compared to 01 respectively) It is also true forthe overall unconditional dispersion This couldindicate that selection on risk aversion is moreimportant in our data than adverse selection39 Theright metric to use for such statements is not

36 For such an individual the second-order Taylor ex-pansion we use in Section IIA is exact

37 We suspect that individuals who participate in televi-sion game shows are more adventuresome than the generalpopulation Moreover knowing that the audience mightwish to see them keep betting is likely to further encourageparticipants to take risks

38 All covariates are measured in deviations from theirsample mean so the estimated constant in each equation isthe estimated mean of the left-hand-side variable

39 An additional observation is that given our estimatesobservables account for slightly less than 50 percent of thevariation in ln(ri) but for almost 65 percent of the variation inln(i ) This may seem surprising given the finding that disper-sion in risk aversion is more important and thus should be thefocus of insurance companies However this finding is con-sistent with the conventional wisdom that insurance companiesspend much effort and resources on collecting information thathelps in risk classification but only little effort on informationthat predicts willingness to pay

765VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

entirely clear however as one should projectthese estimated variances onto the same scaleof say willingness to pay or profits Profits forexample are affected directly by risk but not byrisk aversion so the comparison above could bemisleading Therefore we relegate the discus-sion of this issue to Section IIIF where weshow that even when we look at pricing andprofits heterogeneity in risk aversion is moreimportant

Table 4 also indicates a strong and signif-icant positive correlation of 084 between un-observed risk aversion and unobserved riskThis result may be surprising because it isnatural to think that risk aversion with respectto financial decisions is likely to be associatedwith a greater tendency to take precautionsand therefore with lower risk Indeed a recentpaper by Finkelstein and McGarry (2006)supports such intuition by documenting anegative correlation between risk aversionand risk in the market for long-term careinsurance (see also Israel 2005 for the autoinsurance market in Illinois) Our marketmight however be special in ways that couldproduce a positive correlation First in con-trast to most insurance markets where apolicyholderrsquos risk depends on the policy-holderrsquos precautions but not on the precau-tions of others accident risk in the autoinsurance market is a result of an interactionbetween onersquos driving habits and those ofother drivers Second the correlation coeffi-cient may be highly sensitive to the particularway we measure risk and risk aversion Thereare many unobserved omitted factors that arelikely to be related to both dimensions Theintensity of vehicle use for example mightbe an important determinant of risk If indi-viduals who are more risk averse also drivemore miles per year a positive correlationbetween risk and risk aversion could emergeThus our results caution against assumingthat risk and risk aversion are always nega-tively correlated Whether this is the case maydepend on the characteristics of the particularmarket one studies and on the particular mea-sure for risk Indeed one can use estimatedannual mileage to control for one omittedvariable that may potentially work to producea positive correlation between risk aversionand risk Despite its partial coverage in thedata and being considered (by the company)

as unreliable40 controlling for annual mileagereported by policyholders reduces the esti-mated correlation coefficient to 06841 Weview this result as consistent with the possi-bility that underlying unobserved factors thataffect risk play an important role in generat-ing the estimated positive correlation betweenrisk and risk aversion A third explanation forthe positive estimated correlation is the dis-tributional assumption As discussed in Sec-tion IIC the correlation coefficient isprobably the parameter that is most sensitiveto these assumptions Indeed as discussedlater when we change our assumptions aboutthe Poisson process and use an alternativedistribution with extremely thin tails the es-timated correlation coefficient reverses signsWe of course view the assumptions of thebenchmark model as more appropriate andtherefore maintain the view that the data sug-gest a positive correlation Finally note thatwhile the correlation parameter we estimate ishigh the implied unconditional correlationbetween risk and risk aversion is less than025 across all reported specifications This isbecause the coefficients on the same covariate(for example the size of the car or whetherthe car is used for business) often affect riskand risk aversion in opposite directions andbecause of the logarithmic transformation

C Stability of the Risk Aversion Coefficients

Estimating risk preferences is motivated bythe idea that the same (or similar) risk aversionparameter may explain risky decisions acrossmultiple contexts This idea is at the heart of thecurrent debate we mention in the introductionregarding the empirical relevance of expectedutility and whether the standard construct of a

40 Insurance companies typically do not use these self-reported mileage estimates as they are considered unreli-able While companies could verify these estimates at thetime of a claim such reports are hard to enforce Anindividual can always claim that her ex ante estimate waslower than it turned out to be Indeed the estimated elas-ticity of risk with respect to estimated mileage is 006(0012) which seems low suggesting a bias downwardpotentially due to ldquoerrors in variablesrdquo bias

41 The full results from this specification are providedin the online Appendix Other results from this specifi-cation are similar to those we report for the benchmarkspecification

766 THE AMERICAN ECONOMIC REVIEW JUNE 2007

single risk aversion parameter that appliesacross many types of decisions is the right wayto think about consumer behavior To the bestof our knowledge there is no direct empiricalevidence on this issue While we defer much ofthis discussion to the concluding section weverify below that at least over a limited set ofchoices in which we observe the same individ-uals the estimated risk preferences help to ex-plain choices over time and across differentcontexts

Stability across ContextsmdashIdeally onewould like to show that the estimated risk pref-erences are stable across multiple lines of insur-ance and perhaps even across other riskydecisions individuals face Absent data on suchchoices we provide evidence that the estimatedrisk aversion coefficients help in predictingother related risky choices Individuals in ourdata had to make three additional coveragechoices in addition to the main policy choicethat we have analyzed so far They had tochoose whether to purchase coverage for caraudio equipment (bought by 498 percent of thesample of new policyholders) for towing ser-vices (bought by 929 percent) and for wind-shield damage (bought by 954 percent) Thesetypes of coverage are sold by the same companybut are administered by a third party so we donot have information about their realizationTherefore we cannot perform a similar exerciseto the one we perform for the deductible choiceOf course one should not expect these addi-tional coverage choices to be perfectly corre-lated with the deductible choice Even if thesame risk preferences are an important factorfor all of these decisions variation in risk acrosscoverage is also important For example own-ership of an expensive car audio system is likelyto affect the purchase of audio coverage inde-pendent of risk preferences Similarly ownersof old cars may value more coverage for towingservices Risk preferences however should en-ter into all of these coverage choices and there-fore we expect these additional types ofcoverages to be positively correlated with thechoice of a low deductible

We coded these three additional coveragechoices as three dummy variables and use themas additional covariates to verify that they canhelp explain unobserved risk aversion Firstwe add these variables to the probit regression

reported in column 3 of Table 3 They all havea positive statistically and economicallysignificant power in predicting the deductiblechoice42 To verify that these reported correla-tions are not driven by correlation in risks wealso estimate the benchmark model with theseadditional variables as covariates The esti-mated coefficients on the three additional cov-erage choices are positive and two of the threeare significant in both the risk and risk aversionequations they are 0053 (0019) 00002 (0021)and 0057 (0023) in the risk equation and0354 (0064) 0123 (0122) and 0813 (0174)in the risk aversion equation for audio towingand windshield coverage respectively (standarddeviations in parentheses)43 This suggests thatthe estimated risk preferences help in explainingmultiple coverage choices across these relatedcontexts

Stability over TimemdashWe now provide evi-dence that the estimated risk preferences arealso stable over time All the individuals whodecide to renew their policy with the companyafter it expires are at least in principle free tochange their deductible choice However anoverwhelming majority of individuals (morethan 97 percent) do not change their deductiblechoices when they renew Even individuals whoinitially chose the rarely chosen ldquohighrdquo andldquovery highrdquo deductible levels typically do notchange their choices Therefore it is also notsurprising that estimating the model on the firstdeductible choice of individuals and the seconddeductible choice of the same individuals yieldssimilar results as shown at the end of the nextsection Of course changes in deductiblechoices do not necessarily imply changes inrisk preferences Risk may also change overtime and may drive such changes At thesame time while persistence in deductiblechoices over time is consistent with stablerisk preferences it may also be driven bymany other factors For example it is reason-able to think that individuals do not devote

42 They are estimated to increase on average the prob-ability of choosing a low deductible by 0019 (819) 0023(527) and 0041 (758) for audio towing and windshieldcoverage respectively (z stat in parentheses)

43 The full results from this specification are provided in theonline Appendix Other results from this specification aresimilar to those we report for the benchmark specification

767VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

the same amount of thought to their renewaldecision as to their initial choice leading tothe observed persistence This caveat is themain reason why we focus most of the analysisonly on new policies

D Robustness

This section discusses the robustness of themain results Results from all specifications wemention in this section are summarized in Table6 The full results from these specifications arerelegated to the online Appendix

The von NeumannndashMorgenstern Utility Func-tionmdashTo derive the benchmark model we as-sume a negligible third derivative of the vNMutility function This is attractive as it allowsus to summarize risk preferences by a one-dimensional parameter r [u13(w)u(w)]There is however a large literature (eg MilesS Kimball and N Gregory Mankiw 1989) em-phasizing the importance of a (positive) third

derivative of the utility function which leads toprecautionary saving It is therefore importantto address the sensitivity of the results to thisassumption

By allowing a third-order Taylor approxima-tion of equation (4) we obtain

(15) r u13w

uw

p

d 1

d

uw

uw

dh2 dh dl dl

2

6d

which reduces to equation (7) when u(w) 0This illustrates that a positive third derivativeprovides an additional (precautionary) incentiveto insure It also shows that in order to fullydescribe preferences in the presence of a thirdderivative a one-dimensional parameter doesnot suffice one needs to know both r [u13(w)u(w)] and [u(w)u(w)] This makesit less attractive for estimation

TABLE 6mdashROBUSTNESS

Specificationa Sample Obs

Claim risk () Absolute risk aversion (r)

Corr(r ) Mean Median Std dev Mean Median Std dev

Baseline estimatesBenchmark model All new customers 105800 0220 0217 0048 19 103 73 106 0020 0207 0839

The vNM utility functionCARA utility All new customers 105800 0219 0217 0048 87 104 b 98 106 b 0009b 0201 0826

The claim generating processBenchmark model No multiple claims 103260 0182 0180 0040 20 103 28 105 0018 0135 0547Thinner-tail risk distribution All new customers 105800 0205c 0171c 0155c 17 103 19 106 0020 0076 0916

The distribution of risk aversionLower-bound procedure All new customers 105800 mdash mdash mdash 37 104 0 0002 mdash mdash

Incomplete information about riskBenchmark model Experienced drivers 82966 0214 0211 0051 21 103 83 106 0021 0200 0761Benchmark model Inexperienced drivers 22834 0230 0220 0073 30 103 12 107 0032 0186 0572Learning model All new customers 105800 0204 0191 0084 12 103 16 106 0016 0200 0772

Sample SelectionBenchmark model First two years 45739 0244 0235 0066 31 103 26 105 0026 0225 0699Benchmark model Last three years 60061 0203 0201 0043 16 103 34 107 0021 0113 0611Benchmark model Referred by a friend 26434 0213 0205 0065 30 103 84 107 0031 0155 0480Benchmark model Referred by advertising 79366 0219 0216 0051 21 103 76 106 0022 0212 0806Benchmark model Non-stayers 48387 0226 0240 0057 23 103 77 107 0026 0149 0848Benchmark model Stayers 1st choice 57413 0190 0182 0057 29 103 29 105 0024 0152 0463Benchmark model Stayers 2nd choice 57413 0208 0200 0065 30 103 16 105 0026 0211 0637

Note The table presents the key figures from various specifications and subsamples tracing the order they are presented in Section IIID Full results (in the format

of Table 4) from all these specifications are available in the online Appendixa ldquoBenchmark modelrdquo refers to the benchmark specification estimated on various subsamples (the first row replicates the estimates from Table 4) The other

specifications are slightly different and are all described in more detail in the corresponding parts of Section IIIDb The interpretation of r in the CARA model takes a slightly different quantitative meaning when applied to noninfinitesimal lotteries (such as the approximately

$100 stakes we analyze) This is due to the positive third derivative of the CARA utility function compared to the benchmark model in which we assume a small

third derivative Thus these numbers are not fully comparable to the corresponding figures in the other specificationsc The interpretation of in the thinner-tail distribution we estimate is slightly different from the standard Poisson rate which is assumed in the other specifications

Thus these numbers are not fully comparable to the corresponding figures in the other specifications

768 THE AMERICAN ECONOMIC REVIEW JUNE 2007

We can allow for a positive third derivativewithout expanding the dimensionality of pref-erences by imposing a parametric functionalform on the utility function Essentially suchparametric form imposes a relationship between[u13(w)u(w)] and [u(w)u(w)] Two standardforms are those that exhibit CARA and thosethat exhibit constant relative risk aversion(CRRA) CRRA requires us to make additionalassumptions about the relevant (and unob-served) wealth level of each individual makingit less attractive The CARA case is imple-mented below

With CARA utility we substitute u(w) exp(rw) in equation (4) and rearrange toobtain

(16) rp

exprdh exprdl

This equation defines the indifference set Unlikethe benchmark model there is no closed-formrepresentation for ri() a property that makesestimation significantly slower Due to the precau-tionary saving incentive arising from the thirdderivative we can use a lower level of absoluterisk aversion to rationalize the low deductiblechoice given In other words the CARA indif-ference set is flatter in comparison to the bench-mark case depicted in Figure 2 Thus the CARAspecification will generally lead to lower estimatesof the coefficient of absolute risk aversion asshown in Table 6 The mean and dispersion ofestimated absolute risk aversion (but not the me-dian) are smaller by about a factor of two Thegeneral qualitative results however remain thesame As shown in Table 5 the mean individual isquite risk averse with respect to lotteries of $100magnitude the median is close to risk neutral withrespect to such lotteries and heterogeneity in riskaversion is important

The Claim-Generating ProcessmdashIn thebenchmark model we assume that claims aregenerated by a Poisson model This assumptionis both analytically attractive (it is important inderiving equation (4)) and in our view a rea-sonable approximation of the data-generatingprocess for claims The Poisson distribution hasbeen widely used in economics to model acci-dent counts (eg Nancy L Rose 1990 Dionneand Vanasse 1992 Ronald Michener and CarlaTighe 1992) In using this distribution re-

searchers have followed the long history of theuse of Poisson by actuaries and insurance spe-cialists going back to Ladislaus Bortkiewicz(1898)

An important restriction of the Poisson dis-tribution is that its mean and variance are thesame Although some economic studies con-firmed this assumption for particular accidentdata (eg Christopher M Auld et al 2001) it isoften the case that this restriction is falsifiedThe most common deviation from the Poissonrestriction is that of fat tails ie variance that ishigher than the mean This led researchers touse a negative binomial distribution to accom-modate this regularity by introducing a secondparameter which delinks the relationship be-tween the mean and the variance One naturalinterpretation of the fat tails is that of unob-served heterogeneity and the negative binomialdistribution can be viewed as a Gamma mixtureof Poisson processes Consistent with this viewwe assume that the claim-generating processfollows a Poisson process at the individuallevel but allows unobserved heterogeneity inrisk and estimates a lognormal mixture of Pois-son processes which is similar to a negativebinomial The dispersion parameter we esti-mate is a free parameter which is identi-fied by the fatness of the tails of the claimdistribution

While one would like to allow fat tails of theaggregate claim distribution one may criticizethe assumption of a Poisson distribution at theindividual level for having tails that may be ldquotoofatrdquo Recently Jaap H Abbring et al (2003) andMark Israel (2004) provided evidence for neg-ative state dependence in data from auto insur-ance claims similar to the ones we useControlling for heterogeneity across individu-als these papers show that a second claim isless likely than a first This may happen due toexperience rating or perhaps to more carefuldriving and less intense use of a car after anaccident The Poisson distribution assumes thatthe second accident is just as likely so negativestate dependence may suggest thinner tails atthe individual level

To verify that the main results are not sensi-tive to this restriction of the Poisson model weperform two tests First we estimate the bench-mark model on a sample that includes onlyindividuals with one or no claims The model isstill identified using variation in policy durations

769VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

The estimates of the level of risk aversion andits dispersion across individuals remain similarto those of the benchmark model (Table 6) Theestimates of risk and the correlation with riskaversion are significantly lower But this isexpected we selected out of the sample thehigh-risk individuals

As a further test for the sensitivity of theresults to possible negative state dependencewe estimated the model with a different distri-bution at the individual level which gives riseto much thinner tails We could not find anyknown distribution of count variables that givesrise to tails thinner than Poissonrsquos so we madeone up In particular for a given Poisson ratelet pn be the probability of observing n occur-rences The distribution we take to the data issuch that

(17) Prn pn2

yenm0 pm2

Thus this distribution makes the probability ofmultiple claims much lower probably muchmore so than any realistic negative state depen-dence in the data Essentially such a distribu-tion makes the econometrician view individualswith multiple claims as high-risk individualswith much more certainty as it requires muchmore ldquobad luckrdquo for low-risk individuals tohave multiple claims The estimates for thelevel of risk aversion and its dispersion arealmost the same as in the benchmark model(Table 6) The rest of the summary figures aredifferent The difference in the level of risk isnot informative the interpretation of isslightly different given the change in the un-derlying distribution The dispersion in ismuch higher This is a direct result of the thin-tail assumption The dispersion of is identifiedby the tails of the claim distribution at theaggregate level With thinner tails imposed onthe individual distribution more heterogeneityis needed to match the observed tails at theaggregate Finally the correlation coefficient inthis specification changes signs and is now neg-ative and close to 1 This is consistent withour discussion in Section IIC which emphasizesthat the identification of the correlation coeffi-cient is closely tied to the structural assump-tions Once these are changed the estimatedcorrelation coefficient would change too Most

importantly however the mean and dispersionof risk aversion are stable

The Distribution of Absolute Risk Aver-sionmdashIn the benchmark model we assume thatthe coefficient of absolute risk aversion is lognor-mally distributed across individuals Since onlyfew previous studies focused on heterogeneity inrisk preferences there is not much existing evi-dence regarding the distribution of risk prefer-ences The only evidence we are aware of is theexperimental results presented by Steffen Andersenet al (2005) which show a skewed distributionwith a fat right tail which is qualitatively consis-tent with the lognormal distribution we assume

An additional advantage of the normality as-sumption is computational It provides a closed-form conditional distribution allowing us to useonly univariate (rather than bivariate) draws inthe estimation procedure significantly reducingthe computational burden One may be con-cerned about the sensitivity of the results to thisdistributional assumption For example it maydrive the result that the median level of riskaversion is much lower than the mean

Incorporating alternative distributional as-sumptions significantly complicates and slowsthe estimation procedure44 As an alternativewe develop a procedure that we believe pro-vides some guidance as to the sensitivity of theresults to this distributional assumption Thedisadvantage of the procedure is that it cannotaccount for adverse selection Since we foundthat adverse selection is not that important thisexercise is informative The exercise conveysthat the main qualitative results are not drivenby the (fat) tails of the lognormal distributionwe impose Rather they are driven by the highfraction of individuals who chose a low deduct-ible despite being of low risk The model im-plies that such individuals must have a fairlyhigh level of risk aversion

The exercise uses a Gibbs sampler to estimatethe lognormal distribution of i given the co-variates and the observed number of claims Ineach iteration of this Gibbs sampler conditional

44 A simple discrete-type distribution of risk aversion isnot well identified with the given data This is due to thenature of the exercise The distribution is identified bythe binary deductible choice and by the ldquolocalrdquo variation inthe pricing menu This variation is not enough to pin downthe discrete types accurately

770 THE AMERICAN ECONOMIC REVIEW JUNE 2007

on the data and the most recent draws for theindividual irsquos we compute lower bounds forthe level and dispersion of risk aversion (inaddition to generating the draws for ri from theconditional lognormal distribution) To deter-mine the lower bound for the level of risk aver-sion we compute

(18) r i i 0 if choicei 0

max0 pi

idi 113d i

if choicei 1

Namely we make ri as low as possible giventhe assumptions of the model We assume thatindividuals who chose a regular deductible arerisk neutral while individuals who chose a lowdeductible are just indifferent between the twodeductible levels (unless i is high enough inwhich case they are also assumed to be riskneutral) We then compute the average over therirsquos To compute the lower bound for the dis-persion we search for values of rirsquos that areconsistent with the assumptions of the modeland the observed data and that minimize thevariance of ri This turns out to be a simplesearch procedure that is linear in the number ofindividuals

The lower bound of the level of risk aversionthat we obtain from this exercise is 368 104

NIS1 (with standard deviation of 638 106)This level is five times lower than the analogouspoint estimate reported in Table 4 This trans-lates into indifference about participating in a50ndash50 lottery of gaining $100 and losing $885Thus it still suggests a significant level of riskaversion for lotteries of $100 magnitude Simi-larly the conversion to relative risk aversion asin Table 5 implies a relative risk aversion co-efficient of 1883 The result for the lowerbound of the dispersion is 189 103 (withstandard deviation of 816 105) which is tentimes lower than the benchmark estimate Thusthe coefficient of variation for absolute riskaversion declines only by two suggesting ahigh degree of heterogeneity Finally oneshould note that these bounds involve extremeassumptions and are computed separately andtherefore cannot be binding at the same timeThus the ldquocorrectrdquo estimates for both quantities

are likely to be higher closer to the results wereport for the benchmark model

Incomplete Information about RiskmdashThroughout we assume that individuals haveperfect information about their individual-specific objective risk type i This is a strongerassumption than we need Because expectedutility is linear in probabilities it suffices thatindividualsrsquo expected risk rate is equal to theirobjective risk rate ie i E(iIi) i wherei is a random variable representing individualirsquos perceived risk rate and Ii is individual irsquosinformation at the time of the coverage choiceNamely individuals could be uncertain abouttheir risk type but their point estimate has to becorrect

There are several channels through whichincomplete information may operate Let usconsider two such cases First suppose thatindividuals are correct but only on average iethat i i i where E(i) 0 The intuitionfor this case is similar to an ldquoerrors in variablesrdquomodel and in principle will result in an evenless important role for adverse selection Giventhat we find a relatively small effect of adverseselection this bias will not change this conclu-sion This may be even more pronounced ifcorr(i i) 0 which reflects a reasonableassumption of ldquoreversion to the meanrdquo ie thatan individualrsquos estimate of his risk type is someweighted average between his true risk type andthe average risk type of individuals who aresimilar (on observables) to him The conclusionmay go in the other way only if the mistakes goin the other direction according to which indi-viduals who are riskier than average believe thatthey are even more risky than they truly areThis we believe is less plausible

As a test for the sensitivity of the results tothe complete information assumption we per-form two exercises The first exercise estimatesthe benchmark model separately for experi-enced drivers and inexperienced drivers wherewe define an individual to be an experienceddriver if he has ten years or more driving expe-rience This is a conservative definition com-pared to those used in the literature (Chiapporiand Salanie 2000 Cohen 2005) The underlyingassumption consistent with evidence providedin Cohen (2005) is that drivers learn about theirown risk types as they gain more driving expe-rience but that most of this learning occurs

771VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

within the first ten years of driving Thus ourcomplete information model is a more appeal-ing description of experienced drivers Theresults for experienced drivers are almost iden-tical to the results from the benchmark specifi-cation (Table 6) suggesting that the mainresults are not driven by the fraction of individ-uals who are less likely to know their risk typesThe results for inexperienced drivers show asimilar pattern and magnitude Consistent withthe learning hypothesis however they show alarger dispersion in risk preferences which maybe due to incomplete information and there-fore to more random choices of deductiblelevels

The second exercise we perform is a morestructural version of the learning story whichallows even experienced drivers to have incom-plete information regarding their risk types Wedo so by estimating a different specification ofthe model that assumes that individuals areBayesian and update their information abouttheir own (stable) risk types over time usingonly information about the number of claimsthey make each year While we do not observecomplete claims histories of individuals wecan simulate such histories and integrate overthese simulations Thus individualsrsquo informa-tion would be related to their true type andwould be more precise with longer driving his-tories The full modeling and estimation detailsof this specification are provided in the Appen-dix We view this model as an extreme versionof incomplete information as there are manyother sources through which individuals maylearn about their own types and thereby havebetter information about their types than whatwe estimate them to have While the resultsimply that the levels of risk aversion and heter-ogeneity are lower than the benchmark esti-mates (Table 6) the order of magnitude andqualitative pattern are quite similar suggestingthat the main qualitative findings are robust tothe information structure

Sample SelectionmdashThere are two differentways to think about the contract selection pro-cess One possibility is that individuals firstselect an insurer based on advertisement wordof mouth or access to an agent Then individ-uals select the insurance contract from amongseveral contracts the selected insurer offersThis selection process is consistent with the way

the company executives view their business45

Another possibility is that individuals first col-lect information about all available insurancecontracts and then choose their most preferredone According to the industryrsquos conventionalwisdom individuals do not shop much acrossinsurers and therefore this selection processseems less important

The unique characteristics of direct insurersmay attract individuals who are more likely toexperiment with new ways of doing businessand may therefore be less risk averse than thegeneral population In Table 7 we compare thedemographics of our sample of policyholderswith those of the general Israeli populationThis comparison reflects a similar intuitioncompared with the general population our av-erage policyholder is slightly younger moreeducated more likely to be male and less likelyto be married or an immigrant This direction ofselection may also apply to unobserved riskpreferences thereby making our policyholderson average less risk averse than a representa-tive individual This suggests that the level ofrisk aversion that we find may be viewed as alower bound on the level of risk aversion in thegeneral Israeli population

One potential way to model sample selectionis to allow for an additional outside option to beselected For the vast majority of individuals weobserve the outside option is to purchase sim-ilar insurance from competing insurance agen-cies Unfortunately data on the structure ofcompeting contracts their prices and the waythey vary with individual characteristics are un-available This makes us uncomfortable to try tomodel sample selection as results from anysuch model will be driven by our assumptionsrather than by meaningful variation in the dataThe results are still meaningful for two reasonsFirst this is a large population accounting forabout 7 percent of all drivers in Israel Secondto the extent that our estimates suggest higherlevels of risk aversion than previously esti-mated and that the sample selection is likely to

45 As discussed in Section I the company was the firstdirect insurance provider and it offered significantly lowerpremiums than those offered by competitors due to signif-icant cost advantage In Section I we also discuss theliterature which emphasizes that choice of direct insurers isdriven primarily by nonmonetary ldquoamenitiesrdquo

772 THE AMERICAN ECONOMIC REVIEW JUNE 2007

bias these estimates downward the results arestill informative

To assess the magnitude of sample selectionin driving the main results we perform severaltests First we estimate the benchmark modelseparately for the first two years of the compa-nyrsquos operation during which the company wasthe only company to sell insurance directly andfor the remaining three years after additionaldirect insurers entered the market Second weestimate the benchmark model separately forindividuals who were referred to the companyby word of mouth and for those who heardabout the company through advertisement (pri-marily on television) The latter may havesearched more and might be more price sensi-tive Third we estimate the benchmark modelseparately for individuals who renewed theirpolicy with the company and for those who didnot renew It seems likely that individuals whodid not renew are less selected as most of themswitch back to regular insurers who insure themajority of the population Fourth we estimatethe model for the second deductible choicemade by those individuals who renew It couldbe argued that switching to other companies is

costly so outside options are not as important indriving deductible choices for those who renewAll these results are summarized in Table 6 andshow some important differences between thetwo groups within each pair of subsamples prob-ably reflecting selection For all groups howeverthe qualitative pattern of the results and the orderof magnitude and dispersion of risk aversion aresimilar to those of the full sample This is sugges-tive that correcting for sample selection is unlikelyto change the qualitative results

E Caveats

Moral HazardmdashThroughout our analysiswe abstract from moral hazard ie we assumethat i can vary across individuals but is invari-ant to the coverage choice There are two typesof moral hazard that may play a role in thiscontext First individuals with less coveragemay take greater precaution and drive morecarefully thereby reducing their claim risk rateSecond conditional on a claim event peoplewith higher deductibles are less likely to file aclaim there exists a range of claims for which

TABLE 7mdashREPRESENTATIVENESS

Variable Sampleb Populationc Car ownersd

Agea 4114 (1237) 4255 (1801) 4511 (1413)Female 0316 0518 0367Family Single 0143 0233 0067

Married 0780 0651 0838Divorced 0057 0043 0043Widower 0020 0074 0052

Education Elementary 0029 0329 0266High school 0433 0384 0334Technical 0100 0131 0165College 0438 0155 0234

Emigrant 0335 0445 0447

Obs 105800 723615 255435

a For the age variable the only continuous variable in the table we provide both the mean and the standard deviation (inparentheses)

b The figures are derived from Table 1 The family and education variables are renormalized so they add up to one weignore those individuals for whom we do not have family status or education level This is particularly relevant for theeducation variable which is not available for about half of the sample it seems likely that unreported education levels arenot random but associated with lower levels of education This may help in explaining at least some of the gap in educationlevels across the columns

c This column is based on a random sample of the Israeli population as of 1995 We use only adult population ieindividuals who are 18 years of age or older

d This column is based on a subsample of the population sample The data provide information about car ownership onlyat the household level not at the individual level Thus we define an individual as a car owner if (a) the household owns atleast one car and the individual is the head of the household or (b) the household owns at least two cars and the individualis the spouse of the head of the household

773VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

filing is profitable only under a low deductible(the literature often refers to this second effectas ldquoex post moral hazardrdquo)

To the extent that moral hazard exists abstract-ing from it will likely bias our estimates of riskaversion downward To see this note that adjust-ing behavior will help individuals to self-insureagainst uninsured costs Similarly ex post moralhazard would reduce the value of a low deduct-ible as in the event of a claim the gain from alower deductible would sometimes be less thanthe difference between the two deductible levelsBoth these effects will make a low deductible lessattractive requiring individuals to be even morerisk averse than we estimate in order to purchasemore coverage Below we discuss why in ourview abstracting from moral hazard is a reason-able approximation in this setting

It seems reasonable to conjecture that ceterisparibus insured individuals will drive less care-fully than uninsured ones It may also seemreasonable that the existence of a deductiblemay make individuals more careful about smalldamages to their car However when all choicesinclude a deductible and deductibles are similarin their magnitude it seems less likely thatdriving or care behavior will be affected46 Fi-nally to separately identify moral hazard onewould need another dimension of the data overwhich risk types remain fixed but coveragechoices vary exogenously (Israel 2004 Jaap HAbbring et al forthcoming)

We rely on the data to justify why we abstractfrom the second potential effect that of ex postmoral hazard Figure 3 presents data on theclaim amounts and shows that about 99 percentof the claims filed by policyholders with low-deductible policies were for amounts greaterthan the higher deductible level If the distribu-tion of amounts of potential claims does notvary with the deductible choice and if individ-uals file a claim for any loss that exceeds their

deductible level this suggests that 99 percent ofthe claims would have been filed under eitherdeductible choice making the assumption toabstract from moral hazard not very restrictive

Individuals however may choose not to filea claim even when the claim amount exceedsthe deductible level This may happen due toexperience rating which increases future insur-ance premiums These dynamic effects do notdepend on the deductible level at the time of theclaim so they simply enter in an additive wayUsing our data on individuals who renew theirpolicies with the company we can assess howbig the dynamic effects are These data showthat the price effect of a claim lasts for threeyears and is highest when an individual files hissecond claim within a year In such a case hewould face about a 20 percent increase in hisinsurance premium in the subsequent year 10percent in the year after and 5 percent in thethird year after the claim The regular premiumis about twice the regular deductible amount soan upper bound for the dynamic costs is 70percent of the regular deductible In most casesthe actual dynamic costs are much lower thanthis upper bound (the dynamic costs of say thefirst claim within a year are close to zero) Inaddition an individual can opt out of the con-tract and switch to a different insurance pro-vider This is likely to reduce his dynamic costsbecause in Israel unlike in the United Statesand many other countries there is no publicrecord of past claims Therefore insurance pro-viders can take full advantage of past recordsonly for their past customers For this reasonnew customers will of course face higher pre-miums than existing ones but the premium in-crease would not be as high as it would havebeen with the old insurance provider This isdue to the presence of ldquoinnocentrdquo new custom-ers who are pooled together with the switchers(Cohen 2003)47 Using 70 percent as a conser-vative upper bound Figure 3 shows that about93 percent of those claims filed by individualswith a low deductible would have also been46 This assumption is consistent with Cohen and Einav

(2003) who find no evidence for behavioral response tochanges in seat belt laws The following anecdotal obser-vation also supports this In an informal survey we con-ducted among our colleagues all of them were aware of adeductible in their auto insurance policy but fewer than 20percent knew its level This does not imply that 80 percentof our colleagues did not pay attention to their deductiblechoice at the time the choice was made It does implyhowever that their driving or care behavior cannot dependon the deductible level

47 New customers may voluntarily report their claimshistory to their new insurance provider Voluntary disclo-sure of past claims is as may be expected not truthful Ourdata suggest an unconditional claim rate of 02453 in oursample population Our data on claims history as voluntar-ily disclosed by the same individuals suggest a claim rate of00604 which is four times lower

774 THE AMERICAN ECONOMIC REVIEW JUNE 2007

filed with a regular deductible While this is notnegligible it applies for only a small fraction ofindividuals For the vast majority of them the99 percent discussed above is a more appropri-ate benchmark Therefore ex post moral hazardis unlikely to play a major role in this settingand one can abstract from the loss distributionand focus on claim rates as we do in thispaper48

Additional Cost of an AccidentmdashOur modelassumes that in the event of an accident theonly incurred costs are those associated with thedeductible payment In practice however other

transaction costs may be incurred such as thetime spent for appraisal of the damage the costsassociated with renting a replacement car for theduration of a repair etc Such costs could bereadily incorporated into the model To illus-trate we assume that these costs are known inadvance and are given by a constant c (whichcould in principle vary with each individual)Since c will not vary with the chosen level ofdeductible it will not affect the value d andwill enter the empirical model only through itseffect on d In particular equation (7) willchange to

(19) r

p

d 1

d c

and everything else will remain the same

48 We would not be as comfortable with this statementfor the choice of high and very high deductibles which areat much higher levels This is one additional reason to focusonly on the choice between low and regular deductibles

FIGURE 3 CLAIM DISTRIBUTIONS

Notes This figure plots kernel densities of the claim amounts estimated separately depending on the deductible choice Forease of comparison we normalize the claim amounts by the level of the regular deductible (ie the normalization is invariantto the deductible choice) and truncate the distribution at ten (the truncated part which includes a fat tail outside of the figureaccounts for about 25 percent of the distribution and is roughly similar for both deductible choices) The thick line presentsthe distribution of the claim amounts for individuals who chose a low deductible while the thin line does the same for thosewho chose a regular deductible Clearly both distributions are truncated from below at the deductible level The figure showsthat the distributions are fairly similar Assuming that the claim amount distribution is the same the area below the thickerline between 06 and 1 is the fraction of claims that would fall between the two deductible levels and therefore (absentdynamic incentives) would be filed only if a low deductible were chosen This area (between the two dotted vertical lines)amounts to 13 percent implying that the potential bias arising from restricting attention to claim rate (and abstracting fromthe claim distribution) is quite limited As we discuss in the text dynamic incentives due to experience rating may increasethe costs of filing a claim shifting the region in which the deductible choice matters to the right an upper bound to these costsis about 70 percent of the regular deductible covering an area (between the two dashed vertical lines) that integrates to morethan 7 percent Note however that these dynamic incentives are a very conservative upper bound they apply to less than 15percent of the individuals and do not account for the exit option which significantly reduces these dynamic costs

775VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

This implies that in principle such costs willhave no effect on the results of the counterfac-tual exercise we present later The costs willhowever affect the interpretation of the esti-mates of risk aversion In particular instead ofthe distribution of r we will now be estimatingthe distribution of r(d c)d so the reportedestimates of the coefficient of absolute riskaversion will be biased upward The magnitudeof the bias depends on the size of this transac-tion cost c compared to the average deductibled If c is relatively small the bias is negligibleIf however c is as big as the (average) deduct-ible level all our reported estimates of the levelof risk aversion should be divided by two (butthe coefficients on observables which are semi-elasticities will not change) The intuitionwould be similar but more involved if c variesacross individuals but not proportionally to d

Data about such transaction costs are ofcourse not available The following back-of-the-envelope exercise may provide some guidanceas to the magnitude of such costs We collecteddata from the Israeli police about the annual num-bers of accidents49 accidents with fatalities andaccidents with severe injuries in Israel for theyears 1996ndash1999 We then divided these numbersby our estimate of the total number of auto insur-ance claims in Israel (161859)50 We obtain that159 percent of the claims involve reported acci-dents 22 percent involve accidents with severeinjuries and 03 percent involve fatal accidentsThus the majority of claims reflect small unre-ported accidents perhaps suggesting that theseadditional costs of a claim are often not too large

Deviations from Expected Utility TheorymdashThroughout the paper we restrict attention toexpected utility maximizers Despite much ev-idence in the literature against some of the pre-dictions of expected utility theory it still seemsto us the most natural benchmark to specify andone that facilitates comparison to previous stud-ies We note that expected utility theory is as-sumed it is not and cannot be tested within our

framework Given our cross-sectional analysiswhich in principle allows flexible forms ofunobserved heterogeneity in risk preferencesthere are no testable restrictions imposed byexpected utility theory We should also note thatmuch (but not all) of the documented evidenceagainst expected utility theory arises with ex-treme risk probabilities which are close to zeroor one Our data (and our estimates) are basedon risk probabilities that are all in the range of010 to 035 Over this range expected utilityseems to perform better Finally it is importantto stress two points First at the conceptuallevel it is straightforward to use an alternativetheory of decisions under uncertainty If con-ditional on objective risk individuals vary in asingle dimension the same conceptual modeland empirical strategy can be applied All oneneeds to do is to specify the parameter overwhich decisions vary and construct an indiffer-ence set in the space of the specified parameterand (objective) risk types similar to the onepresented in Figure 2 Second any alternativemodel of decisions under uncertainty would re-quire us to take an even stronger view regardingthe parameterized objective function For exam-ple prospect theory (Daniel Kahneman andAmos Tversky 1979) would require us to pa-rameterize not only the curvature of individu-alsrsquo utility functions but also their referencepoints for which there is no natural choice inour context Similar issues arise if we try toapply decision weights (Tversky and PeterWakker 1995) or measures of overconfidencewith respect to driving ability

F Implications for Profits and Pricing

We now look at how firm profits vary withalternative pricing schemes This exercise isinteresting for several reasons First althoughwe do not use supply-side information for esti-mation it shows how one may incorporate suchinformation in estimation Second it illustratesthe conceptual trade-off faced by a monopolistthat operates in a market with adverse selectionAlthough the conceptual trade-off betweenhigher demand and worse selection is wellknown and has been extensively analyzed in thetheoretical literature quantifying it is importantto understand its empirical relevance Finallywe argued earlier that unobserved heterogeneityin risk aversion seems more important than un-

49 An accident is counted in this measure if it was re-ported to the police and a police officer arrived at the scene

50 This is estimated by taking the number of activepolicies at the end of our sample (45871) dividing it by ourbest guess for the share of the market the company had atthe time (7 percent) and multiplying it by the estimatedclaim rate in our data as computed in Table 2 (0245)

776 THE AMERICAN ECONOMIC REVIEW JUNE 2007

observed heterogeneity in risk The current ex-ercise shows that this conclusion also translatesto pricing and profits

Throughout this section we hold fixed thedistribution of risk and risk aversion the firmfaces Since we have little information about thedeterminants of overall demand faced by thefirm and hence how restrictive it is to hold thedistribution fixed we make the simplifying as-sumption that individuals make their choicessequentially They first choose the insuranceprovider by observing only the price of theregular deductible Once they decide to buy apolicy from the insurer they choose the deduct-ible level This seems a reasonable approxima-tion as the regular deductible is the one alwaysadvertised and initially quoted while the otheroptions are revealed only once the potentialcustomer and the insurance sales person ldquogetinto detailsrdquo Consistent with this assumptionwe assume that the regular premium and de-ductible are dictated by competitive conditionsand we focus on the choice of the low deduct-ible and its associated premium

From the companyrsquos standpoint each indi-vidual can be represented by a random draw of(i ri) from the conditional (on observables)distribution of risk and risk aversion

(20) ln i

ln ri Nxi

xi

2 r

r r2

When analyzing the optimal menu to offer suchan individual the company is assumed to berisk neutral and to maximize expected profitsSuppose the company offered only the regulardeductible and premium (dh ph) Let the ex-pected profits from this strategy be 0 Givenour assumptions we proceed by analyzing howthe firmrsquos profits are affected by a choice of anadditional low deductible option (dl pl) withdl dh and pl ph It is easy to use a changein variables and analyze the choice of d dh dl and p pl ph Expected profits arenow given by

(21) maxdp

0 Prri rii d pp

d Eiri rii d p

The trade-off in the companyrsquos decision isstraightforward Each new customer who chooses

the low combination pays an additional p up-front but saves d for each accident she is in-volved in This translates into two effects thatenter the companyrsquos decision problem The first issimilar to a standard pricing problem higher(lower) price difference (deductible difference)p (d) leads to a higher markup (on those indi-viduals who select the low deductible) but tolower quantity (or probability of purchase) asfewer individuals elect to choose the low deduct-ible This effect enters the profit function throughD(d p) Pr(ri ri(i d p)) The secondcomposition effect arises because of adverse se-lection As the price of the low deductible in-creases those individuals who still elect to choosethe low combination are ceteris paribus thosewith higher risk This effect enters through (dp) E(iri ri(i d p)) Its magnitude andsign depend on the relative heterogeneity of i andri and on the correlation between them Sinceneither D(d p) nor (d p) has a closed-form solution we analyze this decision problemgraphically where D(d p) and (d p) arenumerically computed using simulations from thejoint distribution

We illustrate our analysis by using the meanindividual in the data whose expected ln(i)and ln(ri) are 154 and 1181 respectively(based on Table 4) Such an individual has tochoose between a regular contract of ( phdh) (3189 1595) (in NIS) and a lowcontract of ( pl dl) (3381 957) ie (pd) (191 638) Below we discuss severaladditional counterfactual cases First we con-sider a case with a negative correlation be-tween risk and risk aversion (with the samemagnitude ie 084) Second we con-sider cases when the company ignores unob-served heterogeneity in one of the dimensionsie it views individuals as a draw from theestimated marginal distribution on one dimen-sion with the other dimension known and fixedat its estimated mean We do this exercise foreach dimension separately

To get intuition for the different effects Figure2 presents the estimated distribution in the spaceof (i ri) A small increase (decrease) in p (d)shifts the indifference setup to the right therebymaking some marginal individuals who were pre-viously just to the right of it switch to choosingthe regular deductible The demand trade-off isjust the comparison between the marginal loss ofthe company from all the marginal individuals

777VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

who no longer buy higher coverage with thehigher profits made from the inframarginal indi-viduals who still elect to choose higher coverageFigure 2 also helps in illustrating the effect ofadverse selection and the importance of the cor-relation coefficient As the menu shifts to the rightthe positive correlation implies that the marginalindividuals have higher risk than the average Thismeans that ldquolosingrdquo them (namely havingthem buy less coverage) is not as costly forthe insurance company as such individualsare on average more adversely selected Anegative correlation for example would havemade these marginal individuals more valuablethereby decreasing the incentive to increase pricesor deductibles from the current levels

Figure 4 presents the effects of pricing onprofits by varying the low-deductible levelkeeping the premium charged for it fixed at theobserved price (of p 191) It implies that the

current low deductible benefit of 638 NIS re-sults in additional annual profits of about 368NIS per customer This is about 037 percent oftotal operating profits per customer which isabout 1000 NIS Note however that after sub-tracting the administrative and claim-handlingcosts associated with each customer and claimthe relative magnitude of this effect will behigher Note also that the estimates imply thatthe current low-deductible level is suboptimalBy setting a smaller low-deductible benefit ofd 355 NIS (ie increasing the current lowdeductible by 283 NIS) additional profits canbe increased to 659 NIS51 Of course the find-

51 There is no reason of course to limit the choice of thecompany to only one additional deductible level Moredegrees of freedom in choosing the pricing menu will leadto higher profits

FIGURE 4 COUNTERFACTUALSmdashPROFITS

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) We plot the additional profits fromoffering a low deductible as a function of the attractiveness of a low deductible d dh dl fixing its price at the observedlevel The thick solid line presents the counterfactual profits as implied by the estimated benchmark model The other threecurves illustrate how the profits change in response to changes in the assumptions when the correlation between riskand risk aversion is negative (thin solid line) when there is no heterogeneity in risk aversion (dot-dashed line) and whenthere is no heterogeneity in risk (dashed line) The maxima (argmax) of the four curves respectively are 659 (355)714 (583) 0 and 704 (500) The dotted vertical line represents the observed level of d (638) which implies that theadditional profits from the observed low deductible are 368 NIS per policy

778 THE AMERICAN ECONOMIC REVIEW JUNE 2007

ing that the current pricing is suboptimal mayalso be due to a limitation of the model

Consistent with the intuition discussed aboveFigure 4 also shows that the incentive to in-crease prices (or lower the deductible) is higherwith positive correlation When the correlationbetween risk and risk aversion is negative theoptimal low deductible is lower It also showsthat ignoring either dimension of unobservedheterogeneity has an important effect on pric-ing decisions However while ignoring het-erogeneity in risk does not change thequalitative pattern by much ignoring hetero-geneity in risk aversion completely changesthe pricing analysis In fact given the esti-mated coefficients and the observed prices ifindividuals vary only in risk offering a lowdeductible does not increase profits due to ad-verse selection

Figure 5 breaks down the effect on profits bypresenting the pricing effect on the demand forlow deductible D(d p) and on the compo-sition effect (d p) The former is simplygenerated by the distribution of certainty equiv-alents implied by the joint distribution of i andri (Landsberger and Meilijson 1999) It is S-shaped due to the assumption of lognormal dis-tribution The shape of the composition effect isdriven by the relative variance of i and ri andby the correlation coefficient As the estimatesimply that most of the variation in certaintyequivalents is driven by variation in ri thestrong positive correlation implies that the com-position effect is monotonically decreasing inthe deductible level As the low-deductible op-tion becomes more favorable more peoplechoose it with the most risky individuals choos-ing it first The effect of the deductible level onthe composition effect is dramatically differentwhen the correlation between risk and risk aver-sion is negative When this is the case theobserved relationship between the deductiblelevel and the composition effect is mostly re-versed This is because the effect of risk aver-sion dominates that of adverse selection due toits higher variance

IV Concluding Remarks

The paper makes two separate contributionsFirst from a methodological standpoint we layout a conceptual framework through which one

can formulate and estimate a demand system forindividually customized contracts The key datarequirements for this approach are contractchoices individual choice sets and ex post riskrealizations Since such data may be available inmany other contexts the methodological frame-work may be useful to uncover structural pa-rameters in such settings As an example onecould consider annuity data and use guaranteeperiod choices and mortality data to identifyheterogeneity in risk (mortality) and in prefer-ences for wealth after death (Einav Finkelsteinand Paul Schrimpf 2006) Similarly one couldconsider loan data and use down-paymentchoices and default data to identify heterogene-ity in risk (default) and in liquidity (Will Ad-ams Einav and Jonathan D Levin 2006)Second from an economic standpoint we pro-vide a new set of estimates for the degree andheterogeneity of (absolute) risk aversion and itsrelationship with risk We discuss these below

While our estimates of risk aversion help topredict other related insurance decisions it isnatural to ask to what extent these parametersare relevant in other contexts This is essentiallyan empirical question which can be answeredonly by estimating risk aversion parameters fora variety of bet sizes and in a variety of con-texts Since isolating risk preferences in manycontexts is hard such exercises are rare leavingus with no definite answer for the scope ofmarkets for which our estimates may be rele-vant On one hand Rabin (2000) and Rabin andThaler (2001) argue that different decisions inlife are taken in different contexts and thereforemay be subject to different parameters in theutility function On the other hand classicaltheory suggests that each individual has onevalue function over her lifetime wealth so allrisky decisions take into account the same valuefunction and are therefore subject to the samerisk preferences Our view is somewhere inbetween We are more comfortable with extrap-olation of our risk aversion estimates to setupsthat are closer to the auto insurance marketcontext in which these parameters are esti-mated To assess ldquoclosenessrdquo it is important toconsider various factors over which contextsmay differ Such factors may include bet size aswell as informational and behavioral effectssuch as default options framing and rarity of theevents Since we use claim data to identify risktype we leave everything else to be interpreted as

779VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

FIGURE 5 COUNTERFACTUALSmdashSELECTION

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) Parallel to Figure 4 we breakdown the effects on profits to the share of consumers who choose a low deductible (bottom panel) and to the expected riskof this group (top panel) This is presented by the thick solid line for the estimates of the benchmark model As in Figure 4 wealso present these effects for three additional cases when the correlation between risk and risk aversion is negative (thin solidline) when there is no heterogeneity in risk aversion (dot-dashed line) and when there is no heterogeneity in risk (dashedline) The dotted vertical lines represent the observed level of d (638) for which the share of low deductible is 16 percentand their expected risk is 0264 This may be compared with the corresponding figures in Table 2A of 178 and 0309respectively Note however that the figure presents estimated quantities for the average individual in the data while Table2A presents the average quantities in the data so one should not expect the numbers to fit perfectly

780 THE AMERICAN ECONOMIC REVIEW JUNE 2007

risk aversion As an example overconfidence willbe captured by a lower level of estimated riskaversion so if overconfidence is more importantin auto insurance than in health insurance whenextrapolated to health insurance individuals maybehave as if they are more risk averse than weestimate them to be

With this caveat in mind let us discuss ourfour main findings First we find large hetero-geneity in risk preferences across individualsThis heterogeneity is important in various con-texts (a) it cautions against using pure cross-sectional variation to test expected utilitytheory (b) it may create strong selection ofparticipants into particular markets this maymake participants in voluntary markets quitedifferent in their risk preferences from those inmarkets in which participation is either manda-tory or driven by other factors and (c) it maymake the marginal individual quite differentfrom the average one Models in macroeconom-ics and finance which often use a representativeindividual framework may not be able to cap-ture and account for such differences

Our second set of findings concerns the wayrisk aversion relates to observable characteris-tics Our finding that women are more riskaverse than men has been documented in othersettings The finding that risk preferences ex-hibit a U-shaped pattern over the life cycle maybe interesting to explore further in other con-texts Other findings suggest that the estimatedcoefficient of risk aversion increases with ob-servables that are related to income and wealthAs we agree with the widely held belief of thedecreasing absolute risk aversion property ourpreferred interpretation for this finding is thatwealth and income may be endogenous gener-ating the estimated cross-sectional relationshipWhile lower risk aversion may be associatedwith higher propensity to become an entrepre-neur and thereby have higher wealth it mayalso be associated with lower propensity to saveor invest in education affecting wealth the otherway Therefore one important message of thisfinding is that accounting for heterogeneity inpreferences may be important as representativeconsumer models may provide misleading in-terpretations for otherwise natural results

The last two sets of findings concern therelationship between risk and preferences Sincerisk is particular to the context in which it ismeasured these findings may be sensitive to the

market context and may change once risk takesother forms Even within the auto insurancemarket it is important how risk is measuredMoreover risk in the auto insurance marketmay be conceptually different from risk in othermarkets In many markets risk is independentacross individuals In auto insurance howevermuch of the risk depends on coordinationamong drivers and therefore may be more re-lated to relative not absolute characteristicsFor this reason our finding of positive correla-tion between risk and risk aversion can coexistwith findings of negative correlation in othercontexts (Israel 2005 Finkelstein and McGarry2006) The positive correlation we find is alsoconsistent with the fact that the bivariate probittest in our data provides evidence for adverseselection (Cohen 2005) while similar reduced-form tests in other contexts do not

Finally we find that unobserved heterogene-ity in risk preferences is more important thanheterogeneity in risk This may be driven by thecasual evidence that insurance companies exertmuch effort and resources in collecting con-sumer data which are informative about riskclassification but not about preferences52 Weillustrate the empirical importance of our find-ings for the analysis of optimal contracts in autoinsurance The presence of more than one di-mension of unobserved heterogeneity may dra-matically change the nature of these contractsTheory is still not fully developed for suchmultidimensional screening problems as it typ-ically requires a small number of types (Lands-berger and Meilijson 1999) restricts the twodimensions to be independent of each other(Jean-Charles Rochet and Lars A Stole 2002)or assumes that the number of instrumentsavailable to the monopolist is not smaller thanthe dimension of unobserved heterogeneity(Steven Matthews and John Moore 1987 Rich-ard Arnott and Joseph E Stiglitz 1988)53 MarkArmstrong (1999) may be the closest theoreticalwork to the framework suggested here It cannot

52 This choice of data collection efforts may be justifiedif it is easier for such firms to price discriminate based onrisk but harder to price discriminate based on preferencesA common belief is that without cost-based (ie risk-based) justification for prices price discrimination may leadto consumer backlash

53 See also Smart (2000) Villeneuve (2003) and Jullienet al (2007) for related theoretical results

781VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

be directly applied however as it uses simpli-fying linearity assumptions which would behard to impose in the current context Our re-sults indicate that many applications can benefitfrom extending the theory to include the more

general case such as the one analyzed hereSuch a theory may also serve as a guide forusing supply-side restrictions in similar con-texts Our counterfactual analysis is a very pre-liminary start in this direction

APPENDIX A DESCRIPTION OF THE GIBBS SAMPLER

In this appendix we describe the setup of the Gibbs sampler that we use to estimate the modelOne of the main advantages of the Gibbs sampler is its ability to allow for data augmentation oflatent variables (Tanner and Wong 1987) In our context this amounts to augmenting the individual-specific risk aversion and risk type namely i rii1

n as additional parametersWe can write the model as follows

(22) ln i xi i

(23) ln ri xi vi

(24) choicei 1 if ri ri ()0 if ri ri ()

(25) claimsi Poissoni ti

(26) i

vi

iid

N00

2 r

r r2

Let

2 r

r r2 X

x1

xn

0

0 x1

xn

y r and ui i

vi

The set of parameters for which we want to have a posterior distribution is given by uii1

n The prior specifies that are independent of uii1n have a conventional

diffuse prior We adopt a hierarchical prior for uii1N

(27) uii 1n

iid

N0

(28) 1 Wishart2a Q

782 THE AMERICAN ECONOMIC REVIEW JUNE 2007

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 20: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

deviations of these computed quantities over theiterations of the Gibbs sampler35 The impliedrisk aversion of the mean individual is 00019which is about seven times greater than theback-of-the-envelope calculation presented inthe previous section As we assume a lognormal

distribution and estimate a relatively high dis-persion coefficient r the estimates also implysignificantly lower values for the median levelof risk aversion As shown later the qualitativepattern of these results is quite robust acrossspecifications

Table 5 summarizes our findings regardingthe level and dispersion of risk aversion Itpresents two ways to interpret the estimates weobtained from the benchmark model as well ascomparisons to specifications with constant ab-solute risk aversion (CARA) utility and with

35 Note that these estimated quantities are unconditionalIn computing these quantities we integrate over the distri-bution of observable characteristics in the data so onecannot compute these estimates from the estimated param-eter directly

TABLE 5mdashRISK-AVERSION ESTIMATES

Specificationa Absolute risk aversionb Interpretationc Relative risk aversiond

Back-of-the-envelope 10 103 9070 1484Benchmark model

Mean individual 67 103 5605 972225th percentile 23 106 9998 003Median individual 26 105 9974 03775th percentile 29 104 9714 42790th percentile 27 103 7834 390295th percentile 99 103 4937 14327

CARA utilityMean individual 31 103 7651 4436Median individual 34 105 9966 050

Learning modelMean individual 42 103 6886 6140Median individual 56 106 9995 008

Comparable estimatesGertner (1993) 31 104 9699 479Metrick (1995) 66 105 9934 102Holt and Laury (2002)e 32 102 2096 86575Sydnor (2006) 20 103 8329 5395

a The table summarizes the results with respect to the level of risk aversion ldquoBack-of-the-enveloperdquo refers to thecalculation we report in the beginning of Section III ldquobenchmark modelrdquo refers to the results from the benchmark model(Table 4) ldquoCARA utilityrdquo refers to a specification of a CARA utility function and ldquolearning modelrdquo refers to a specificationin which individuals do not know their risk types perfectly (see Section IIID) The last four rows are the closest comparableresults available in the literature

b The second column presents the point estimates for the coefficient of absolute risk aversion converted to $US1 unitsFor the comparable estimates this is given by their estimate of a representative CARA utility maximizer For all otherspecifications this is given by computing the unconditional mean and median in the population using the figures we reportin the mean and median columns of risk aversion in Table 6 (multiplied by 352 the average exchange rate during the periodto convert to US dollars)

c To interpret the absolute risk aversion estimates (ARA) we translate them into x u(w) 1frasl2 u(w 100) 1frasl2 u(w x) That is we report x such that an individual with the estimated ARA is indifferent about participating in a 50ndash50lottery of gaining 100 US dollars and losing x US dollars Note that since our estimate is of absolute risk aversion the quantityx is independent of w To be consistent with the specification we use a quadratic utility function for the back-of-the-envelopebenchmark and learning models and use a CARA utility function for the others

d The last column attempts to translate the ARA estimates into relative risk aversion We follow the literature and do soby multiplying the ARA estimate by average annual income We use the average annual income (after tax) in Israel in 1995(51168 NIS from the Israeli Census) for all our specifications and we use average disposable income in the United Statesin 1987 ($15437) for Gertner (1993) and Metrick (1995) For Holt and Laury (2002) and Sydnor (2006) we use a similarfigure for 2002 ($26974)

e Holt and Laury (2002) do not report a comparable estimate The estimate we provide above is based on estimating aCARA utility model for the 18 subjects in their experiment who participated in the ldquo90rdquo treatment which involved stakescomparable to our setting For these subjects we assume a CARA utility function and a lognormal distribution of theircoefficient of absolute risk aversion The table reports the point estimate of the mean from this distribution

764 THE AMERICAN ECONOMIC REVIEW JUNE 2007

incomplete-information (discussed later) and toother comparable figures in the literature Ourbenchmark estimate suggests that an averagequadratic utility maximizer36 will be indifferentabout participating in a 50ndash50 lottery in whichhe gains $100 or loses $5605 A CARA spec-ification suggests a significantly lower value foraverage risk aversion making the mean individ-ual indifferent about a 50ndash50 lottery of gaining$100 or losing $7651 The results from theincomplete information model for the meanlevel of risk aversion are in between these esti-mates All the results suggest that although themean individual exhibits a significant level ofrisk aversion heterogeneity in risk preferencesis important and the median individual is al-most risk neutral with respect to lotteries of$100 magnitude Although the mean is alwaysgreater than the median under the lognormaldistribution the large difference we find is notimposed In principle we could have obtained ahigh level of risk aversion with less heteroge-neity thereby leading to a smaller differencebetween the mean and the median (the esti-mated distribution of risk types is an example)

Let us briefly discuss the relevance of thecomparison to Gertner (1993) and Metrick(1995) There are two ways one can reconcilethe differences between the estimates Firstboth these papers measure risk aversion fortelevision game show participants these arehighly selected groups in a rather ldquorisk-friendlyrdquo environment37 Second the magni-tudes of the stakes are higher The showparticipants bet on several thousand dollars andmore while our average individual risks muchlower stakes in the range of $100 Thus thedifference in the results may be due to the issuesraised in Rabin (2000) regarding the compara-bility of behavior across different contexts andbet sizes We discuss this further in Section IV

A different way to quantify our estimate is byreporting them in relative terms There is noconsensus in the literature as to the relevantwealth that is taken into account in such deci-

sions Therefore for comparability we followthe closest papers in this respect (eg Gertner1993) and use annual income as the relevantwealth We multiply the estimated coefficient ofabsolute risk aversion by the average annualincome in Israel during the observation periodUnder the (questionable) assumption that an-nual income is a good proxy for the relevantwealth at the time of decision making thisproduct would be a proxy for the coefficient ofrelative risk aversion As Table 5 indicates ourbenchmark specification results in an impliedcoefficient of relative risk aversion of about 97A CARA specification results in a lower coef-ficient of 44 On the other hand the medianestimate for relative risk aversion is well belowone Thus the widely used estimate of a lowsingle-digit coefficient of relative risk aversionfalls somewhere between the median and themean and between the median and the seventy-fifth percentile of the risk aversion distribution

The Relationship between Unobserved Riskand Unobserved Risk AversionmdashTable 4 al-lows us to make observations about the relation-ship between risk and risk aversion We firstdiscuss the relative importance of unobservedheterogeneity of both dimensions In the popu-lation we study unobserved heterogeneity inrisk aversion (r) is much greater than unob-served heterogeneity in risk () This is trueboth in absolute terms (315 compared to 015respectively) and after normalizing by the cor-responding mean level38 using the coefficientof variation as a measure of dispersion (027compared to 01 respectively) It is also true forthe overall unconditional dispersion This couldindicate that selection on risk aversion is moreimportant in our data than adverse selection39 Theright metric to use for such statements is not

36 For such an individual the second-order Taylor ex-pansion we use in Section IIA is exact

37 We suspect that individuals who participate in televi-sion game shows are more adventuresome than the generalpopulation Moreover knowing that the audience mightwish to see them keep betting is likely to further encourageparticipants to take risks

38 All covariates are measured in deviations from theirsample mean so the estimated constant in each equation isthe estimated mean of the left-hand-side variable

39 An additional observation is that given our estimatesobservables account for slightly less than 50 percent of thevariation in ln(ri) but for almost 65 percent of the variation inln(i ) This may seem surprising given the finding that disper-sion in risk aversion is more important and thus should be thefocus of insurance companies However this finding is con-sistent with the conventional wisdom that insurance companiesspend much effort and resources on collecting information thathelps in risk classification but only little effort on informationthat predicts willingness to pay

765VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

entirely clear however as one should projectthese estimated variances onto the same scaleof say willingness to pay or profits Profits forexample are affected directly by risk but not byrisk aversion so the comparison above could bemisleading Therefore we relegate the discus-sion of this issue to Section IIIF where weshow that even when we look at pricing andprofits heterogeneity in risk aversion is moreimportant

Table 4 also indicates a strong and signif-icant positive correlation of 084 between un-observed risk aversion and unobserved riskThis result may be surprising because it isnatural to think that risk aversion with respectto financial decisions is likely to be associatedwith a greater tendency to take precautionsand therefore with lower risk Indeed a recentpaper by Finkelstein and McGarry (2006)supports such intuition by documenting anegative correlation between risk aversionand risk in the market for long-term careinsurance (see also Israel 2005 for the autoinsurance market in Illinois) Our marketmight however be special in ways that couldproduce a positive correlation First in con-trast to most insurance markets where apolicyholderrsquos risk depends on the policy-holderrsquos precautions but not on the precau-tions of others accident risk in the autoinsurance market is a result of an interactionbetween onersquos driving habits and those ofother drivers Second the correlation coeffi-cient may be highly sensitive to the particularway we measure risk and risk aversion Thereare many unobserved omitted factors that arelikely to be related to both dimensions Theintensity of vehicle use for example mightbe an important determinant of risk If indi-viduals who are more risk averse also drivemore miles per year a positive correlationbetween risk and risk aversion could emergeThus our results caution against assumingthat risk and risk aversion are always nega-tively correlated Whether this is the case maydepend on the characteristics of the particularmarket one studies and on the particular mea-sure for risk Indeed one can use estimatedannual mileage to control for one omittedvariable that may potentially work to producea positive correlation between risk aversionand risk Despite its partial coverage in thedata and being considered (by the company)

as unreliable40 controlling for annual mileagereported by policyholders reduces the esti-mated correlation coefficient to 06841 Weview this result as consistent with the possi-bility that underlying unobserved factors thataffect risk play an important role in generat-ing the estimated positive correlation betweenrisk and risk aversion A third explanation forthe positive estimated correlation is the dis-tributional assumption As discussed in Sec-tion IIC the correlation coefficient isprobably the parameter that is most sensitiveto these assumptions Indeed as discussedlater when we change our assumptions aboutthe Poisson process and use an alternativedistribution with extremely thin tails the es-timated correlation coefficient reverses signsWe of course view the assumptions of thebenchmark model as more appropriate andtherefore maintain the view that the data sug-gest a positive correlation Finally note thatwhile the correlation parameter we estimate ishigh the implied unconditional correlationbetween risk and risk aversion is less than025 across all reported specifications This isbecause the coefficients on the same covariate(for example the size of the car or whetherthe car is used for business) often affect riskand risk aversion in opposite directions andbecause of the logarithmic transformation

C Stability of the Risk Aversion Coefficients

Estimating risk preferences is motivated bythe idea that the same (or similar) risk aversionparameter may explain risky decisions acrossmultiple contexts This idea is at the heart of thecurrent debate we mention in the introductionregarding the empirical relevance of expectedutility and whether the standard construct of a

40 Insurance companies typically do not use these self-reported mileage estimates as they are considered unreli-able While companies could verify these estimates at thetime of a claim such reports are hard to enforce Anindividual can always claim that her ex ante estimate waslower than it turned out to be Indeed the estimated elas-ticity of risk with respect to estimated mileage is 006(0012) which seems low suggesting a bias downwardpotentially due to ldquoerrors in variablesrdquo bias

41 The full results from this specification are providedin the online Appendix Other results from this specifi-cation are similar to those we report for the benchmarkspecification

766 THE AMERICAN ECONOMIC REVIEW JUNE 2007

single risk aversion parameter that appliesacross many types of decisions is the right wayto think about consumer behavior To the bestof our knowledge there is no direct empiricalevidence on this issue While we defer much ofthis discussion to the concluding section weverify below that at least over a limited set ofchoices in which we observe the same individ-uals the estimated risk preferences help to ex-plain choices over time and across differentcontexts

Stability across ContextsmdashIdeally onewould like to show that the estimated risk pref-erences are stable across multiple lines of insur-ance and perhaps even across other riskydecisions individuals face Absent data on suchchoices we provide evidence that the estimatedrisk aversion coefficients help in predictingother related risky choices Individuals in ourdata had to make three additional coveragechoices in addition to the main policy choicethat we have analyzed so far They had tochoose whether to purchase coverage for caraudio equipment (bought by 498 percent of thesample of new policyholders) for towing ser-vices (bought by 929 percent) and for wind-shield damage (bought by 954 percent) Thesetypes of coverage are sold by the same companybut are administered by a third party so we donot have information about their realizationTherefore we cannot perform a similar exerciseto the one we perform for the deductible choiceOf course one should not expect these addi-tional coverage choices to be perfectly corre-lated with the deductible choice Even if thesame risk preferences are an important factorfor all of these decisions variation in risk acrosscoverage is also important For example own-ership of an expensive car audio system is likelyto affect the purchase of audio coverage inde-pendent of risk preferences Similarly ownersof old cars may value more coverage for towingservices Risk preferences however should en-ter into all of these coverage choices and there-fore we expect these additional types ofcoverages to be positively correlated with thechoice of a low deductible

We coded these three additional coveragechoices as three dummy variables and use themas additional covariates to verify that they canhelp explain unobserved risk aversion Firstwe add these variables to the probit regression

reported in column 3 of Table 3 They all havea positive statistically and economicallysignificant power in predicting the deductiblechoice42 To verify that these reported correla-tions are not driven by correlation in risks wealso estimate the benchmark model with theseadditional variables as covariates The esti-mated coefficients on the three additional cov-erage choices are positive and two of the threeare significant in both the risk and risk aversionequations they are 0053 (0019) 00002 (0021)and 0057 (0023) in the risk equation and0354 (0064) 0123 (0122) and 0813 (0174)in the risk aversion equation for audio towingand windshield coverage respectively (standarddeviations in parentheses)43 This suggests thatthe estimated risk preferences help in explainingmultiple coverage choices across these relatedcontexts

Stability over TimemdashWe now provide evi-dence that the estimated risk preferences arealso stable over time All the individuals whodecide to renew their policy with the companyafter it expires are at least in principle free tochange their deductible choice However anoverwhelming majority of individuals (morethan 97 percent) do not change their deductiblechoices when they renew Even individuals whoinitially chose the rarely chosen ldquohighrdquo andldquovery highrdquo deductible levels typically do notchange their choices Therefore it is also notsurprising that estimating the model on the firstdeductible choice of individuals and the seconddeductible choice of the same individuals yieldssimilar results as shown at the end of the nextsection Of course changes in deductiblechoices do not necessarily imply changes inrisk preferences Risk may also change overtime and may drive such changes At thesame time while persistence in deductiblechoices over time is consistent with stablerisk preferences it may also be driven bymany other factors For example it is reason-able to think that individuals do not devote

42 They are estimated to increase on average the prob-ability of choosing a low deductible by 0019 (819) 0023(527) and 0041 (758) for audio towing and windshieldcoverage respectively (z stat in parentheses)

43 The full results from this specification are provided in theonline Appendix Other results from this specification aresimilar to those we report for the benchmark specification

767VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

the same amount of thought to their renewaldecision as to their initial choice leading tothe observed persistence This caveat is themain reason why we focus most of the analysisonly on new policies

D Robustness

This section discusses the robustness of themain results Results from all specifications wemention in this section are summarized in Table6 The full results from these specifications arerelegated to the online Appendix

The von NeumannndashMorgenstern Utility Func-tionmdashTo derive the benchmark model we as-sume a negligible third derivative of the vNMutility function This is attractive as it allowsus to summarize risk preferences by a one-dimensional parameter r [u13(w)u(w)]There is however a large literature (eg MilesS Kimball and N Gregory Mankiw 1989) em-phasizing the importance of a (positive) third

derivative of the utility function which leads toprecautionary saving It is therefore importantto address the sensitivity of the results to thisassumption

By allowing a third-order Taylor approxima-tion of equation (4) we obtain

(15) r u13w

uw

p

d 1

d

uw

uw

dh2 dh dl dl

2

6d

which reduces to equation (7) when u(w) 0This illustrates that a positive third derivativeprovides an additional (precautionary) incentiveto insure It also shows that in order to fullydescribe preferences in the presence of a thirdderivative a one-dimensional parameter doesnot suffice one needs to know both r [u13(w)u(w)] and [u(w)u(w)] This makesit less attractive for estimation

TABLE 6mdashROBUSTNESS

Specificationa Sample Obs

Claim risk () Absolute risk aversion (r)

Corr(r ) Mean Median Std dev Mean Median Std dev

Baseline estimatesBenchmark model All new customers 105800 0220 0217 0048 19 103 73 106 0020 0207 0839

The vNM utility functionCARA utility All new customers 105800 0219 0217 0048 87 104 b 98 106 b 0009b 0201 0826

The claim generating processBenchmark model No multiple claims 103260 0182 0180 0040 20 103 28 105 0018 0135 0547Thinner-tail risk distribution All new customers 105800 0205c 0171c 0155c 17 103 19 106 0020 0076 0916

The distribution of risk aversionLower-bound procedure All new customers 105800 mdash mdash mdash 37 104 0 0002 mdash mdash

Incomplete information about riskBenchmark model Experienced drivers 82966 0214 0211 0051 21 103 83 106 0021 0200 0761Benchmark model Inexperienced drivers 22834 0230 0220 0073 30 103 12 107 0032 0186 0572Learning model All new customers 105800 0204 0191 0084 12 103 16 106 0016 0200 0772

Sample SelectionBenchmark model First two years 45739 0244 0235 0066 31 103 26 105 0026 0225 0699Benchmark model Last three years 60061 0203 0201 0043 16 103 34 107 0021 0113 0611Benchmark model Referred by a friend 26434 0213 0205 0065 30 103 84 107 0031 0155 0480Benchmark model Referred by advertising 79366 0219 0216 0051 21 103 76 106 0022 0212 0806Benchmark model Non-stayers 48387 0226 0240 0057 23 103 77 107 0026 0149 0848Benchmark model Stayers 1st choice 57413 0190 0182 0057 29 103 29 105 0024 0152 0463Benchmark model Stayers 2nd choice 57413 0208 0200 0065 30 103 16 105 0026 0211 0637

Note The table presents the key figures from various specifications and subsamples tracing the order they are presented in Section IIID Full results (in the format

of Table 4) from all these specifications are available in the online Appendixa ldquoBenchmark modelrdquo refers to the benchmark specification estimated on various subsamples (the first row replicates the estimates from Table 4) The other

specifications are slightly different and are all described in more detail in the corresponding parts of Section IIIDb The interpretation of r in the CARA model takes a slightly different quantitative meaning when applied to noninfinitesimal lotteries (such as the approximately

$100 stakes we analyze) This is due to the positive third derivative of the CARA utility function compared to the benchmark model in which we assume a small

third derivative Thus these numbers are not fully comparable to the corresponding figures in the other specificationsc The interpretation of in the thinner-tail distribution we estimate is slightly different from the standard Poisson rate which is assumed in the other specifications

Thus these numbers are not fully comparable to the corresponding figures in the other specifications

768 THE AMERICAN ECONOMIC REVIEW JUNE 2007

We can allow for a positive third derivativewithout expanding the dimensionality of pref-erences by imposing a parametric functionalform on the utility function Essentially suchparametric form imposes a relationship between[u13(w)u(w)] and [u(w)u(w)] Two standardforms are those that exhibit CARA and thosethat exhibit constant relative risk aversion(CRRA) CRRA requires us to make additionalassumptions about the relevant (and unob-served) wealth level of each individual makingit less attractive The CARA case is imple-mented below

With CARA utility we substitute u(w) exp(rw) in equation (4) and rearrange toobtain

(16) rp

exprdh exprdl

This equation defines the indifference set Unlikethe benchmark model there is no closed-formrepresentation for ri() a property that makesestimation significantly slower Due to the precau-tionary saving incentive arising from the thirdderivative we can use a lower level of absoluterisk aversion to rationalize the low deductiblechoice given In other words the CARA indif-ference set is flatter in comparison to the bench-mark case depicted in Figure 2 Thus the CARAspecification will generally lead to lower estimatesof the coefficient of absolute risk aversion asshown in Table 6 The mean and dispersion ofestimated absolute risk aversion (but not the me-dian) are smaller by about a factor of two Thegeneral qualitative results however remain thesame As shown in Table 5 the mean individual isquite risk averse with respect to lotteries of $100magnitude the median is close to risk neutral withrespect to such lotteries and heterogeneity in riskaversion is important

The Claim-Generating ProcessmdashIn thebenchmark model we assume that claims aregenerated by a Poisson model This assumptionis both analytically attractive (it is important inderiving equation (4)) and in our view a rea-sonable approximation of the data-generatingprocess for claims The Poisson distribution hasbeen widely used in economics to model acci-dent counts (eg Nancy L Rose 1990 Dionneand Vanasse 1992 Ronald Michener and CarlaTighe 1992) In using this distribution re-

searchers have followed the long history of theuse of Poisson by actuaries and insurance spe-cialists going back to Ladislaus Bortkiewicz(1898)

An important restriction of the Poisson dis-tribution is that its mean and variance are thesame Although some economic studies con-firmed this assumption for particular accidentdata (eg Christopher M Auld et al 2001) it isoften the case that this restriction is falsifiedThe most common deviation from the Poissonrestriction is that of fat tails ie variance that ishigher than the mean This led researchers touse a negative binomial distribution to accom-modate this regularity by introducing a secondparameter which delinks the relationship be-tween the mean and the variance One naturalinterpretation of the fat tails is that of unob-served heterogeneity and the negative binomialdistribution can be viewed as a Gamma mixtureof Poisson processes Consistent with this viewwe assume that the claim-generating processfollows a Poisson process at the individuallevel but allows unobserved heterogeneity inrisk and estimates a lognormal mixture of Pois-son processes which is similar to a negativebinomial The dispersion parameter we esti-mate is a free parameter which is identi-fied by the fatness of the tails of the claimdistribution

While one would like to allow fat tails of theaggregate claim distribution one may criticizethe assumption of a Poisson distribution at theindividual level for having tails that may be ldquotoofatrdquo Recently Jaap H Abbring et al (2003) andMark Israel (2004) provided evidence for neg-ative state dependence in data from auto insur-ance claims similar to the ones we useControlling for heterogeneity across individu-als these papers show that a second claim isless likely than a first This may happen due toexperience rating or perhaps to more carefuldriving and less intense use of a car after anaccident The Poisson distribution assumes thatthe second accident is just as likely so negativestate dependence may suggest thinner tails atthe individual level

To verify that the main results are not sensi-tive to this restriction of the Poisson model weperform two tests First we estimate the bench-mark model on a sample that includes onlyindividuals with one or no claims The model isstill identified using variation in policy durations

769VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

The estimates of the level of risk aversion andits dispersion across individuals remain similarto those of the benchmark model (Table 6) Theestimates of risk and the correlation with riskaversion are significantly lower But this isexpected we selected out of the sample thehigh-risk individuals

As a further test for the sensitivity of theresults to possible negative state dependencewe estimated the model with a different distri-bution at the individual level which gives riseto much thinner tails We could not find anyknown distribution of count variables that givesrise to tails thinner than Poissonrsquos so we madeone up In particular for a given Poisson ratelet pn be the probability of observing n occur-rences The distribution we take to the data issuch that

(17) Prn pn2

yenm0 pm2

Thus this distribution makes the probability ofmultiple claims much lower probably muchmore so than any realistic negative state depen-dence in the data Essentially such a distribu-tion makes the econometrician view individualswith multiple claims as high-risk individualswith much more certainty as it requires muchmore ldquobad luckrdquo for low-risk individuals tohave multiple claims The estimates for thelevel of risk aversion and its dispersion arealmost the same as in the benchmark model(Table 6) The rest of the summary figures aredifferent The difference in the level of risk isnot informative the interpretation of isslightly different given the change in the un-derlying distribution The dispersion in ismuch higher This is a direct result of the thin-tail assumption The dispersion of is identifiedby the tails of the claim distribution at theaggregate level With thinner tails imposed onthe individual distribution more heterogeneityis needed to match the observed tails at theaggregate Finally the correlation coefficient inthis specification changes signs and is now neg-ative and close to 1 This is consistent withour discussion in Section IIC which emphasizesthat the identification of the correlation coeffi-cient is closely tied to the structural assump-tions Once these are changed the estimatedcorrelation coefficient would change too Most

importantly however the mean and dispersionof risk aversion are stable

The Distribution of Absolute Risk Aver-sionmdashIn the benchmark model we assume thatthe coefficient of absolute risk aversion is lognor-mally distributed across individuals Since onlyfew previous studies focused on heterogeneity inrisk preferences there is not much existing evi-dence regarding the distribution of risk prefer-ences The only evidence we are aware of is theexperimental results presented by Steffen Andersenet al (2005) which show a skewed distributionwith a fat right tail which is qualitatively consis-tent with the lognormal distribution we assume

An additional advantage of the normality as-sumption is computational It provides a closed-form conditional distribution allowing us to useonly univariate (rather than bivariate) draws inthe estimation procedure significantly reducingthe computational burden One may be con-cerned about the sensitivity of the results to thisdistributional assumption For example it maydrive the result that the median level of riskaversion is much lower than the mean

Incorporating alternative distributional as-sumptions significantly complicates and slowsthe estimation procedure44 As an alternativewe develop a procedure that we believe pro-vides some guidance as to the sensitivity of theresults to this distributional assumption Thedisadvantage of the procedure is that it cannotaccount for adverse selection Since we foundthat adverse selection is not that important thisexercise is informative The exercise conveysthat the main qualitative results are not drivenby the (fat) tails of the lognormal distributionwe impose Rather they are driven by the highfraction of individuals who chose a low deduct-ible despite being of low risk The model im-plies that such individuals must have a fairlyhigh level of risk aversion

The exercise uses a Gibbs sampler to estimatethe lognormal distribution of i given the co-variates and the observed number of claims Ineach iteration of this Gibbs sampler conditional

44 A simple discrete-type distribution of risk aversion isnot well identified with the given data This is due to thenature of the exercise The distribution is identified bythe binary deductible choice and by the ldquolocalrdquo variation inthe pricing menu This variation is not enough to pin downthe discrete types accurately

770 THE AMERICAN ECONOMIC REVIEW JUNE 2007

on the data and the most recent draws for theindividual irsquos we compute lower bounds forthe level and dispersion of risk aversion (inaddition to generating the draws for ri from theconditional lognormal distribution) To deter-mine the lower bound for the level of risk aver-sion we compute

(18) r i i 0 if choicei 0

max0 pi

idi 113d i

if choicei 1

Namely we make ri as low as possible giventhe assumptions of the model We assume thatindividuals who chose a regular deductible arerisk neutral while individuals who chose a lowdeductible are just indifferent between the twodeductible levels (unless i is high enough inwhich case they are also assumed to be riskneutral) We then compute the average over therirsquos To compute the lower bound for the dis-persion we search for values of rirsquos that areconsistent with the assumptions of the modeland the observed data and that minimize thevariance of ri This turns out to be a simplesearch procedure that is linear in the number ofindividuals

The lower bound of the level of risk aversionthat we obtain from this exercise is 368 104

NIS1 (with standard deviation of 638 106)This level is five times lower than the analogouspoint estimate reported in Table 4 This trans-lates into indifference about participating in a50ndash50 lottery of gaining $100 and losing $885Thus it still suggests a significant level of riskaversion for lotteries of $100 magnitude Simi-larly the conversion to relative risk aversion asin Table 5 implies a relative risk aversion co-efficient of 1883 The result for the lowerbound of the dispersion is 189 103 (withstandard deviation of 816 105) which is tentimes lower than the benchmark estimate Thusthe coefficient of variation for absolute riskaversion declines only by two suggesting ahigh degree of heterogeneity Finally oneshould note that these bounds involve extremeassumptions and are computed separately andtherefore cannot be binding at the same timeThus the ldquocorrectrdquo estimates for both quantities

are likely to be higher closer to the results wereport for the benchmark model

Incomplete Information about RiskmdashThroughout we assume that individuals haveperfect information about their individual-specific objective risk type i This is a strongerassumption than we need Because expectedutility is linear in probabilities it suffices thatindividualsrsquo expected risk rate is equal to theirobjective risk rate ie i E(iIi) i wherei is a random variable representing individualirsquos perceived risk rate and Ii is individual irsquosinformation at the time of the coverage choiceNamely individuals could be uncertain abouttheir risk type but their point estimate has to becorrect

There are several channels through whichincomplete information may operate Let usconsider two such cases First suppose thatindividuals are correct but only on average iethat i i i where E(i) 0 The intuitionfor this case is similar to an ldquoerrors in variablesrdquomodel and in principle will result in an evenless important role for adverse selection Giventhat we find a relatively small effect of adverseselection this bias will not change this conclu-sion This may be even more pronounced ifcorr(i i) 0 which reflects a reasonableassumption of ldquoreversion to the meanrdquo ie thatan individualrsquos estimate of his risk type is someweighted average between his true risk type andthe average risk type of individuals who aresimilar (on observables) to him The conclusionmay go in the other way only if the mistakes goin the other direction according to which indi-viduals who are riskier than average believe thatthey are even more risky than they truly areThis we believe is less plausible

As a test for the sensitivity of the results tothe complete information assumption we per-form two exercises The first exercise estimatesthe benchmark model separately for experi-enced drivers and inexperienced drivers wherewe define an individual to be an experienceddriver if he has ten years or more driving expe-rience This is a conservative definition com-pared to those used in the literature (Chiapporiand Salanie 2000 Cohen 2005) The underlyingassumption consistent with evidence providedin Cohen (2005) is that drivers learn about theirown risk types as they gain more driving expe-rience but that most of this learning occurs

771VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

within the first ten years of driving Thus ourcomplete information model is a more appeal-ing description of experienced drivers Theresults for experienced drivers are almost iden-tical to the results from the benchmark specifi-cation (Table 6) suggesting that the mainresults are not driven by the fraction of individ-uals who are less likely to know their risk typesThe results for inexperienced drivers show asimilar pattern and magnitude Consistent withthe learning hypothesis however they show alarger dispersion in risk preferences which maybe due to incomplete information and there-fore to more random choices of deductiblelevels

The second exercise we perform is a morestructural version of the learning story whichallows even experienced drivers to have incom-plete information regarding their risk types Wedo so by estimating a different specification ofthe model that assumes that individuals areBayesian and update their information abouttheir own (stable) risk types over time usingonly information about the number of claimsthey make each year While we do not observecomplete claims histories of individuals wecan simulate such histories and integrate overthese simulations Thus individualsrsquo informa-tion would be related to their true type andwould be more precise with longer driving his-tories The full modeling and estimation detailsof this specification are provided in the Appen-dix We view this model as an extreme versionof incomplete information as there are manyother sources through which individuals maylearn about their own types and thereby havebetter information about their types than whatwe estimate them to have While the resultsimply that the levels of risk aversion and heter-ogeneity are lower than the benchmark esti-mates (Table 6) the order of magnitude andqualitative pattern are quite similar suggestingthat the main qualitative findings are robust tothe information structure

Sample SelectionmdashThere are two differentways to think about the contract selection pro-cess One possibility is that individuals firstselect an insurer based on advertisement wordof mouth or access to an agent Then individ-uals select the insurance contract from amongseveral contracts the selected insurer offersThis selection process is consistent with the way

the company executives view their business45

Another possibility is that individuals first col-lect information about all available insurancecontracts and then choose their most preferredone According to the industryrsquos conventionalwisdom individuals do not shop much acrossinsurers and therefore this selection processseems less important

The unique characteristics of direct insurersmay attract individuals who are more likely toexperiment with new ways of doing businessand may therefore be less risk averse than thegeneral population In Table 7 we compare thedemographics of our sample of policyholderswith those of the general Israeli populationThis comparison reflects a similar intuitioncompared with the general population our av-erage policyholder is slightly younger moreeducated more likely to be male and less likelyto be married or an immigrant This direction ofselection may also apply to unobserved riskpreferences thereby making our policyholderson average less risk averse than a representa-tive individual This suggests that the level ofrisk aversion that we find may be viewed as alower bound on the level of risk aversion in thegeneral Israeli population

One potential way to model sample selectionis to allow for an additional outside option to beselected For the vast majority of individuals weobserve the outside option is to purchase sim-ilar insurance from competing insurance agen-cies Unfortunately data on the structure ofcompeting contracts their prices and the waythey vary with individual characteristics are un-available This makes us uncomfortable to try tomodel sample selection as results from anysuch model will be driven by our assumptionsrather than by meaningful variation in the dataThe results are still meaningful for two reasonsFirst this is a large population accounting forabout 7 percent of all drivers in Israel Secondto the extent that our estimates suggest higherlevels of risk aversion than previously esti-mated and that the sample selection is likely to

45 As discussed in Section I the company was the firstdirect insurance provider and it offered significantly lowerpremiums than those offered by competitors due to signif-icant cost advantage In Section I we also discuss theliterature which emphasizes that choice of direct insurers isdriven primarily by nonmonetary ldquoamenitiesrdquo

772 THE AMERICAN ECONOMIC REVIEW JUNE 2007

bias these estimates downward the results arestill informative

To assess the magnitude of sample selectionin driving the main results we perform severaltests First we estimate the benchmark modelseparately for the first two years of the compa-nyrsquos operation during which the company wasthe only company to sell insurance directly andfor the remaining three years after additionaldirect insurers entered the market Second weestimate the benchmark model separately forindividuals who were referred to the companyby word of mouth and for those who heardabout the company through advertisement (pri-marily on television) The latter may havesearched more and might be more price sensi-tive Third we estimate the benchmark modelseparately for individuals who renewed theirpolicy with the company and for those who didnot renew It seems likely that individuals whodid not renew are less selected as most of themswitch back to regular insurers who insure themajority of the population Fourth we estimatethe model for the second deductible choicemade by those individuals who renew It couldbe argued that switching to other companies is

costly so outside options are not as important indriving deductible choices for those who renewAll these results are summarized in Table 6 andshow some important differences between thetwo groups within each pair of subsamples prob-ably reflecting selection For all groups howeverthe qualitative pattern of the results and the orderof magnitude and dispersion of risk aversion aresimilar to those of the full sample This is sugges-tive that correcting for sample selection is unlikelyto change the qualitative results

E Caveats

Moral HazardmdashThroughout our analysiswe abstract from moral hazard ie we assumethat i can vary across individuals but is invari-ant to the coverage choice There are two typesof moral hazard that may play a role in thiscontext First individuals with less coveragemay take greater precaution and drive morecarefully thereby reducing their claim risk rateSecond conditional on a claim event peoplewith higher deductibles are less likely to file aclaim there exists a range of claims for which

TABLE 7mdashREPRESENTATIVENESS

Variable Sampleb Populationc Car ownersd

Agea 4114 (1237) 4255 (1801) 4511 (1413)Female 0316 0518 0367Family Single 0143 0233 0067

Married 0780 0651 0838Divorced 0057 0043 0043Widower 0020 0074 0052

Education Elementary 0029 0329 0266High school 0433 0384 0334Technical 0100 0131 0165College 0438 0155 0234

Emigrant 0335 0445 0447

Obs 105800 723615 255435

a For the age variable the only continuous variable in the table we provide both the mean and the standard deviation (inparentheses)

b The figures are derived from Table 1 The family and education variables are renormalized so they add up to one weignore those individuals for whom we do not have family status or education level This is particularly relevant for theeducation variable which is not available for about half of the sample it seems likely that unreported education levels arenot random but associated with lower levels of education This may help in explaining at least some of the gap in educationlevels across the columns

c This column is based on a random sample of the Israeli population as of 1995 We use only adult population ieindividuals who are 18 years of age or older

d This column is based on a subsample of the population sample The data provide information about car ownership onlyat the household level not at the individual level Thus we define an individual as a car owner if (a) the household owns atleast one car and the individual is the head of the household or (b) the household owns at least two cars and the individualis the spouse of the head of the household

773VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

filing is profitable only under a low deductible(the literature often refers to this second effectas ldquoex post moral hazardrdquo)

To the extent that moral hazard exists abstract-ing from it will likely bias our estimates of riskaversion downward To see this note that adjust-ing behavior will help individuals to self-insureagainst uninsured costs Similarly ex post moralhazard would reduce the value of a low deduct-ible as in the event of a claim the gain from alower deductible would sometimes be less thanthe difference between the two deductible levelsBoth these effects will make a low deductible lessattractive requiring individuals to be even morerisk averse than we estimate in order to purchasemore coverage Below we discuss why in ourview abstracting from moral hazard is a reason-able approximation in this setting

It seems reasonable to conjecture that ceterisparibus insured individuals will drive less care-fully than uninsured ones It may also seemreasonable that the existence of a deductiblemay make individuals more careful about smalldamages to their car However when all choicesinclude a deductible and deductibles are similarin their magnitude it seems less likely thatdriving or care behavior will be affected46 Fi-nally to separately identify moral hazard onewould need another dimension of the data overwhich risk types remain fixed but coveragechoices vary exogenously (Israel 2004 Jaap HAbbring et al forthcoming)

We rely on the data to justify why we abstractfrom the second potential effect that of ex postmoral hazard Figure 3 presents data on theclaim amounts and shows that about 99 percentof the claims filed by policyholders with low-deductible policies were for amounts greaterthan the higher deductible level If the distribu-tion of amounts of potential claims does notvary with the deductible choice and if individ-uals file a claim for any loss that exceeds their

deductible level this suggests that 99 percent ofthe claims would have been filed under eitherdeductible choice making the assumption toabstract from moral hazard not very restrictive

Individuals however may choose not to filea claim even when the claim amount exceedsthe deductible level This may happen due toexperience rating which increases future insur-ance premiums These dynamic effects do notdepend on the deductible level at the time of theclaim so they simply enter in an additive wayUsing our data on individuals who renew theirpolicies with the company we can assess howbig the dynamic effects are These data showthat the price effect of a claim lasts for threeyears and is highest when an individual files hissecond claim within a year In such a case hewould face about a 20 percent increase in hisinsurance premium in the subsequent year 10percent in the year after and 5 percent in thethird year after the claim The regular premiumis about twice the regular deductible amount soan upper bound for the dynamic costs is 70percent of the regular deductible In most casesthe actual dynamic costs are much lower thanthis upper bound (the dynamic costs of say thefirst claim within a year are close to zero) Inaddition an individual can opt out of the con-tract and switch to a different insurance pro-vider This is likely to reduce his dynamic costsbecause in Israel unlike in the United Statesand many other countries there is no publicrecord of past claims Therefore insurance pro-viders can take full advantage of past recordsonly for their past customers For this reasonnew customers will of course face higher pre-miums than existing ones but the premium in-crease would not be as high as it would havebeen with the old insurance provider This isdue to the presence of ldquoinnocentrdquo new custom-ers who are pooled together with the switchers(Cohen 2003)47 Using 70 percent as a conser-vative upper bound Figure 3 shows that about93 percent of those claims filed by individualswith a low deductible would have also been46 This assumption is consistent with Cohen and Einav

(2003) who find no evidence for behavioral response tochanges in seat belt laws The following anecdotal obser-vation also supports this In an informal survey we con-ducted among our colleagues all of them were aware of adeductible in their auto insurance policy but fewer than 20percent knew its level This does not imply that 80 percentof our colleagues did not pay attention to their deductiblechoice at the time the choice was made It does implyhowever that their driving or care behavior cannot dependon the deductible level

47 New customers may voluntarily report their claimshistory to their new insurance provider Voluntary disclo-sure of past claims is as may be expected not truthful Ourdata suggest an unconditional claim rate of 02453 in oursample population Our data on claims history as voluntar-ily disclosed by the same individuals suggest a claim rate of00604 which is four times lower

774 THE AMERICAN ECONOMIC REVIEW JUNE 2007

filed with a regular deductible While this is notnegligible it applies for only a small fraction ofindividuals For the vast majority of them the99 percent discussed above is a more appropri-ate benchmark Therefore ex post moral hazardis unlikely to play a major role in this settingand one can abstract from the loss distributionand focus on claim rates as we do in thispaper48

Additional Cost of an AccidentmdashOur modelassumes that in the event of an accident theonly incurred costs are those associated with thedeductible payment In practice however other

transaction costs may be incurred such as thetime spent for appraisal of the damage the costsassociated with renting a replacement car for theduration of a repair etc Such costs could bereadily incorporated into the model To illus-trate we assume that these costs are known inadvance and are given by a constant c (whichcould in principle vary with each individual)Since c will not vary with the chosen level ofdeductible it will not affect the value d andwill enter the empirical model only through itseffect on d In particular equation (7) willchange to

(19) r

p

d 1

d c

and everything else will remain the same

48 We would not be as comfortable with this statementfor the choice of high and very high deductibles which areat much higher levels This is one additional reason to focusonly on the choice between low and regular deductibles

FIGURE 3 CLAIM DISTRIBUTIONS

Notes This figure plots kernel densities of the claim amounts estimated separately depending on the deductible choice Forease of comparison we normalize the claim amounts by the level of the regular deductible (ie the normalization is invariantto the deductible choice) and truncate the distribution at ten (the truncated part which includes a fat tail outside of the figureaccounts for about 25 percent of the distribution and is roughly similar for both deductible choices) The thick line presentsthe distribution of the claim amounts for individuals who chose a low deductible while the thin line does the same for thosewho chose a regular deductible Clearly both distributions are truncated from below at the deductible level The figure showsthat the distributions are fairly similar Assuming that the claim amount distribution is the same the area below the thickerline between 06 and 1 is the fraction of claims that would fall between the two deductible levels and therefore (absentdynamic incentives) would be filed only if a low deductible were chosen This area (between the two dotted vertical lines)amounts to 13 percent implying that the potential bias arising from restricting attention to claim rate (and abstracting fromthe claim distribution) is quite limited As we discuss in the text dynamic incentives due to experience rating may increasethe costs of filing a claim shifting the region in which the deductible choice matters to the right an upper bound to these costsis about 70 percent of the regular deductible covering an area (between the two dashed vertical lines) that integrates to morethan 7 percent Note however that these dynamic incentives are a very conservative upper bound they apply to less than 15percent of the individuals and do not account for the exit option which significantly reduces these dynamic costs

775VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

This implies that in principle such costs willhave no effect on the results of the counterfac-tual exercise we present later The costs willhowever affect the interpretation of the esti-mates of risk aversion In particular instead ofthe distribution of r we will now be estimatingthe distribution of r(d c)d so the reportedestimates of the coefficient of absolute riskaversion will be biased upward The magnitudeof the bias depends on the size of this transac-tion cost c compared to the average deductibled If c is relatively small the bias is negligibleIf however c is as big as the (average) deduct-ible level all our reported estimates of the levelof risk aversion should be divided by two (butthe coefficients on observables which are semi-elasticities will not change) The intuitionwould be similar but more involved if c variesacross individuals but not proportionally to d

Data about such transaction costs are ofcourse not available The following back-of-the-envelope exercise may provide some guidanceas to the magnitude of such costs We collecteddata from the Israeli police about the annual num-bers of accidents49 accidents with fatalities andaccidents with severe injuries in Israel for theyears 1996ndash1999 We then divided these numbersby our estimate of the total number of auto insur-ance claims in Israel (161859)50 We obtain that159 percent of the claims involve reported acci-dents 22 percent involve accidents with severeinjuries and 03 percent involve fatal accidentsThus the majority of claims reflect small unre-ported accidents perhaps suggesting that theseadditional costs of a claim are often not too large

Deviations from Expected Utility TheorymdashThroughout the paper we restrict attention toexpected utility maximizers Despite much ev-idence in the literature against some of the pre-dictions of expected utility theory it still seemsto us the most natural benchmark to specify andone that facilitates comparison to previous stud-ies We note that expected utility theory is as-sumed it is not and cannot be tested within our

framework Given our cross-sectional analysiswhich in principle allows flexible forms ofunobserved heterogeneity in risk preferencesthere are no testable restrictions imposed byexpected utility theory We should also note thatmuch (but not all) of the documented evidenceagainst expected utility theory arises with ex-treme risk probabilities which are close to zeroor one Our data (and our estimates) are basedon risk probabilities that are all in the range of010 to 035 Over this range expected utilityseems to perform better Finally it is importantto stress two points First at the conceptuallevel it is straightforward to use an alternativetheory of decisions under uncertainty If con-ditional on objective risk individuals vary in asingle dimension the same conceptual modeland empirical strategy can be applied All oneneeds to do is to specify the parameter overwhich decisions vary and construct an indiffer-ence set in the space of the specified parameterand (objective) risk types similar to the onepresented in Figure 2 Second any alternativemodel of decisions under uncertainty would re-quire us to take an even stronger view regardingthe parameterized objective function For exam-ple prospect theory (Daniel Kahneman andAmos Tversky 1979) would require us to pa-rameterize not only the curvature of individu-alsrsquo utility functions but also their referencepoints for which there is no natural choice inour context Similar issues arise if we try toapply decision weights (Tversky and PeterWakker 1995) or measures of overconfidencewith respect to driving ability

F Implications for Profits and Pricing

We now look at how firm profits vary withalternative pricing schemes This exercise isinteresting for several reasons First althoughwe do not use supply-side information for esti-mation it shows how one may incorporate suchinformation in estimation Second it illustratesthe conceptual trade-off faced by a monopolistthat operates in a market with adverse selectionAlthough the conceptual trade-off betweenhigher demand and worse selection is wellknown and has been extensively analyzed in thetheoretical literature quantifying it is importantto understand its empirical relevance Finallywe argued earlier that unobserved heterogeneityin risk aversion seems more important than un-

49 An accident is counted in this measure if it was re-ported to the police and a police officer arrived at the scene

50 This is estimated by taking the number of activepolicies at the end of our sample (45871) dividing it by ourbest guess for the share of the market the company had atthe time (7 percent) and multiplying it by the estimatedclaim rate in our data as computed in Table 2 (0245)

776 THE AMERICAN ECONOMIC REVIEW JUNE 2007

observed heterogeneity in risk The current ex-ercise shows that this conclusion also translatesto pricing and profits

Throughout this section we hold fixed thedistribution of risk and risk aversion the firmfaces Since we have little information about thedeterminants of overall demand faced by thefirm and hence how restrictive it is to hold thedistribution fixed we make the simplifying as-sumption that individuals make their choicessequentially They first choose the insuranceprovider by observing only the price of theregular deductible Once they decide to buy apolicy from the insurer they choose the deduct-ible level This seems a reasonable approxima-tion as the regular deductible is the one alwaysadvertised and initially quoted while the otheroptions are revealed only once the potentialcustomer and the insurance sales person ldquogetinto detailsrdquo Consistent with this assumptionwe assume that the regular premium and de-ductible are dictated by competitive conditionsand we focus on the choice of the low deduct-ible and its associated premium

From the companyrsquos standpoint each indi-vidual can be represented by a random draw of(i ri) from the conditional (on observables)distribution of risk and risk aversion

(20) ln i

ln ri Nxi

xi

2 r

r r2

When analyzing the optimal menu to offer suchan individual the company is assumed to berisk neutral and to maximize expected profitsSuppose the company offered only the regulardeductible and premium (dh ph) Let the ex-pected profits from this strategy be 0 Givenour assumptions we proceed by analyzing howthe firmrsquos profits are affected by a choice of anadditional low deductible option (dl pl) withdl dh and pl ph It is easy to use a changein variables and analyze the choice of d dh dl and p pl ph Expected profits arenow given by

(21) maxdp

0 Prri rii d pp

d Eiri rii d p

The trade-off in the companyrsquos decision isstraightforward Each new customer who chooses

the low combination pays an additional p up-front but saves d for each accident she is in-volved in This translates into two effects thatenter the companyrsquos decision problem The first issimilar to a standard pricing problem higher(lower) price difference (deductible difference)p (d) leads to a higher markup (on those indi-viduals who select the low deductible) but tolower quantity (or probability of purchase) asfewer individuals elect to choose the low deduct-ible This effect enters the profit function throughD(d p) Pr(ri ri(i d p)) The secondcomposition effect arises because of adverse se-lection As the price of the low deductible in-creases those individuals who still elect to choosethe low combination are ceteris paribus thosewith higher risk This effect enters through (dp) E(iri ri(i d p)) Its magnitude andsign depend on the relative heterogeneity of i andri and on the correlation between them Sinceneither D(d p) nor (d p) has a closed-form solution we analyze this decision problemgraphically where D(d p) and (d p) arenumerically computed using simulations from thejoint distribution

We illustrate our analysis by using the meanindividual in the data whose expected ln(i)and ln(ri) are 154 and 1181 respectively(based on Table 4) Such an individual has tochoose between a regular contract of ( phdh) (3189 1595) (in NIS) and a lowcontract of ( pl dl) (3381 957) ie (pd) (191 638) Below we discuss severaladditional counterfactual cases First we con-sider a case with a negative correlation be-tween risk and risk aversion (with the samemagnitude ie 084) Second we con-sider cases when the company ignores unob-served heterogeneity in one of the dimensionsie it views individuals as a draw from theestimated marginal distribution on one dimen-sion with the other dimension known and fixedat its estimated mean We do this exercise foreach dimension separately

To get intuition for the different effects Figure2 presents the estimated distribution in the spaceof (i ri) A small increase (decrease) in p (d)shifts the indifference setup to the right therebymaking some marginal individuals who were pre-viously just to the right of it switch to choosingthe regular deductible The demand trade-off isjust the comparison between the marginal loss ofthe company from all the marginal individuals

777VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

who no longer buy higher coverage with thehigher profits made from the inframarginal indi-viduals who still elect to choose higher coverageFigure 2 also helps in illustrating the effect ofadverse selection and the importance of the cor-relation coefficient As the menu shifts to the rightthe positive correlation implies that the marginalindividuals have higher risk than the average Thismeans that ldquolosingrdquo them (namely havingthem buy less coverage) is not as costly forthe insurance company as such individualsare on average more adversely selected Anegative correlation for example would havemade these marginal individuals more valuablethereby decreasing the incentive to increase pricesor deductibles from the current levels

Figure 4 presents the effects of pricing onprofits by varying the low-deductible levelkeeping the premium charged for it fixed at theobserved price (of p 191) It implies that the

current low deductible benefit of 638 NIS re-sults in additional annual profits of about 368NIS per customer This is about 037 percent oftotal operating profits per customer which isabout 1000 NIS Note however that after sub-tracting the administrative and claim-handlingcosts associated with each customer and claimthe relative magnitude of this effect will behigher Note also that the estimates imply thatthe current low-deductible level is suboptimalBy setting a smaller low-deductible benefit ofd 355 NIS (ie increasing the current lowdeductible by 283 NIS) additional profits canbe increased to 659 NIS51 Of course the find-

51 There is no reason of course to limit the choice of thecompany to only one additional deductible level Moredegrees of freedom in choosing the pricing menu will leadto higher profits

FIGURE 4 COUNTERFACTUALSmdashPROFITS

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) We plot the additional profits fromoffering a low deductible as a function of the attractiveness of a low deductible d dh dl fixing its price at the observedlevel The thick solid line presents the counterfactual profits as implied by the estimated benchmark model The other threecurves illustrate how the profits change in response to changes in the assumptions when the correlation between riskand risk aversion is negative (thin solid line) when there is no heterogeneity in risk aversion (dot-dashed line) and whenthere is no heterogeneity in risk (dashed line) The maxima (argmax) of the four curves respectively are 659 (355)714 (583) 0 and 704 (500) The dotted vertical line represents the observed level of d (638) which implies that theadditional profits from the observed low deductible are 368 NIS per policy

778 THE AMERICAN ECONOMIC REVIEW JUNE 2007

ing that the current pricing is suboptimal mayalso be due to a limitation of the model

Consistent with the intuition discussed aboveFigure 4 also shows that the incentive to in-crease prices (or lower the deductible) is higherwith positive correlation When the correlationbetween risk and risk aversion is negative theoptimal low deductible is lower It also showsthat ignoring either dimension of unobservedheterogeneity has an important effect on pric-ing decisions However while ignoring het-erogeneity in risk does not change thequalitative pattern by much ignoring hetero-geneity in risk aversion completely changesthe pricing analysis In fact given the esti-mated coefficients and the observed prices ifindividuals vary only in risk offering a lowdeductible does not increase profits due to ad-verse selection

Figure 5 breaks down the effect on profits bypresenting the pricing effect on the demand forlow deductible D(d p) and on the compo-sition effect (d p) The former is simplygenerated by the distribution of certainty equiv-alents implied by the joint distribution of i andri (Landsberger and Meilijson 1999) It is S-shaped due to the assumption of lognormal dis-tribution The shape of the composition effect isdriven by the relative variance of i and ri andby the correlation coefficient As the estimatesimply that most of the variation in certaintyequivalents is driven by variation in ri thestrong positive correlation implies that the com-position effect is monotonically decreasing inthe deductible level As the low-deductible op-tion becomes more favorable more peoplechoose it with the most risky individuals choos-ing it first The effect of the deductible level onthe composition effect is dramatically differentwhen the correlation between risk and risk aver-sion is negative When this is the case theobserved relationship between the deductiblelevel and the composition effect is mostly re-versed This is because the effect of risk aver-sion dominates that of adverse selection due toits higher variance

IV Concluding Remarks

The paper makes two separate contributionsFirst from a methodological standpoint we layout a conceptual framework through which one

can formulate and estimate a demand system forindividually customized contracts The key datarequirements for this approach are contractchoices individual choice sets and ex post riskrealizations Since such data may be available inmany other contexts the methodological frame-work may be useful to uncover structural pa-rameters in such settings As an example onecould consider annuity data and use guaranteeperiod choices and mortality data to identifyheterogeneity in risk (mortality) and in prefer-ences for wealth after death (Einav Finkelsteinand Paul Schrimpf 2006) Similarly one couldconsider loan data and use down-paymentchoices and default data to identify heterogene-ity in risk (default) and in liquidity (Will Ad-ams Einav and Jonathan D Levin 2006)Second from an economic standpoint we pro-vide a new set of estimates for the degree andheterogeneity of (absolute) risk aversion and itsrelationship with risk We discuss these below

While our estimates of risk aversion help topredict other related insurance decisions it isnatural to ask to what extent these parametersare relevant in other contexts This is essentiallyan empirical question which can be answeredonly by estimating risk aversion parameters fora variety of bet sizes and in a variety of con-texts Since isolating risk preferences in manycontexts is hard such exercises are rare leavingus with no definite answer for the scope ofmarkets for which our estimates may be rele-vant On one hand Rabin (2000) and Rabin andThaler (2001) argue that different decisions inlife are taken in different contexts and thereforemay be subject to different parameters in theutility function On the other hand classicaltheory suggests that each individual has onevalue function over her lifetime wealth so allrisky decisions take into account the same valuefunction and are therefore subject to the samerisk preferences Our view is somewhere inbetween We are more comfortable with extrap-olation of our risk aversion estimates to setupsthat are closer to the auto insurance marketcontext in which these parameters are esti-mated To assess ldquoclosenessrdquo it is important toconsider various factors over which contextsmay differ Such factors may include bet size aswell as informational and behavioral effectssuch as default options framing and rarity of theevents Since we use claim data to identify risktype we leave everything else to be interpreted as

779VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

FIGURE 5 COUNTERFACTUALSmdashSELECTION

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) Parallel to Figure 4 we breakdown the effects on profits to the share of consumers who choose a low deductible (bottom panel) and to the expected riskof this group (top panel) This is presented by the thick solid line for the estimates of the benchmark model As in Figure 4 wealso present these effects for three additional cases when the correlation between risk and risk aversion is negative (thin solidline) when there is no heterogeneity in risk aversion (dot-dashed line) and when there is no heterogeneity in risk (dashedline) The dotted vertical lines represent the observed level of d (638) for which the share of low deductible is 16 percentand their expected risk is 0264 This may be compared with the corresponding figures in Table 2A of 178 and 0309respectively Note however that the figure presents estimated quantities for the average individual in the data while Table2A presents the average quantities in the data so one should not expect the numbers to fit perfectly

780 THE AMERICAN ECONOMIC REVIEW JUNE 2007

risk aversion As an example overconfidence willbe captured by a lower level of estimated riskaversion so if overconfidence is more importantin auto insurance than in health insurance whenextrapolated to health insurance individuals maybehave as if they are more risk averse than weestimate them to be

With this caveat in mind let us discuss ourfour main findings First we find large hetero-geneity in risk preferences across individualsThis heterogeneity is important in various con-texts (a) it cautions against using pure cross-sectional variation to test expected utilitytheory (b) it may create strong selection ofparticipants into particular markets this maymake participants in voluntary markets quitedifferent in their risk preferences from those inmarkets in which participation is either manda-tory or driven by other factors and (c) it maymake the marginal individual quite differentfrom the average one Models in macroeconom-ics and finance which often use a representativeindividual framework may not be able to cap-ture and account for such differences

Our second set of findings concerns the wayrisk aversion relates to observable characteris-tics Our finding that women are more riskaverse than men has been documented in othersettings The finding that risk preferences ex-hibit a U-shaped pattern over the life cycle maybe interesting to explore further in other con-texts Other findings suggest that the estimatedcoefficient of risk aversion increases with ob-servables that are related to income and wealthAs we agree with the widely held belief of thedecreasing absolute risk aversion property ourpreferred interpretation for this finding is thatwealth and income may be endogenous gener-ating the estimated cross-sectional relationshipWhile lower risk aversion may be associatedwith higher propensity to become an entrepre-neur and thereby have higher wealth it mayalso be associated with lower propensity to saveor invest in education affecting wealth the otherway Therefore one important message of thisfinding is that accounting for heterogeneity inpreferences may be important as representativeconsumer models may provide misleading in-terpretations for otherwise natural results

The last two sets of findings concern therelationship between risk and preferences Sincerisk is particular to the context in which it ismeasured these findings may be sensitive to the

market context and may change once risk takesother forms Even within the auto insurancemarket it is important how risk is measuredMoreover risk in the auto insurance marketmay be conceptually different from risk in othermarkets In many markets risk is independentacross individuals In auto insurance howevermuch of the risk depends on coordinationamong drivers and therefore may be more re-lated to relative not absolute characteristicsFor this reason our finding of positive correla-tion between risk and risk aversion can coexistwith findings of negative correlation in othercontexts (Israel 2005 Finkelstein and McGarry2006) The positive correlation we find is alsoconsistent with the fact that the bivariate probittest in our data provides evidence for adverseselection (Cohen 2005) while similar reduced-form tests in other contexts do not

Finally we find that unobserved heterogene-ity in risk preferences is more important thanheterogeneity in risk This may be driven by thecasual evidence that insurance companies exertmuch effort and resources in collecting con-sumer data which are informative about riskclassification but not about preferences52 Weillustrate the empirical importance of our find-ings for the analysis of optimal contracts in autoinsurance The presence of more than one di-mension of unobserved heterogeneity may dra-matically change the nature of these contractsTheory is still not fully developed for suchmultidimensional screening problems as it typ-ically requires a small number of types (Lands-berger and Meilijson 1999) restricts the twodimensions to be independent of each other(Jean-Charles Rochet and Lars A Stole 2002)or assumes that the number of instrumentsavailable to the monopolist is not smaller thanthe dimension of unobserved heterogeneity(Steven Matthews and John Moore 1987 Rich-ard Arnott and Joseph E Stiglitz 1988)53 MarkArmstrong (1999) may be the closest theoreticalwork to the framework suggested here It cannot

52 This choice of data collection efforts may be justifiedif it is easier for such firms to price discriminate based onrisk but harder to price discriminate based on preferencesA common belief is that without cost-based (ie risk-based) justification for prices price discrimination may leadto consumer backlash

53 See also Smart (2000) Villeneuve (2003) and Jullienet al (2007) for related theoretical results

781VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

be directly applied however as it uses simpli-fying linearity assumptions which would behard to impose in the current context Our re-sults indicate that many applications can benefitfrom extending the theory to include the more

general case such as the one analyzed hereSuch a theory may also serve as a guide forusing supply-side restrictions in similar con-texts Our counterfactual analysis is a very pre-liminary start in this direction

APPENDIX A DESCRIPTION OF THE GIBBS SAMPLER

In this appendix we describe the setup of the Gibbs sampler that we use to estimate the modelOne of the main advantages of the Gibbs sampler is its ability to allow for data augmentation oflatent variables (Tanner and Wong 1987) In our context this amounts to augmenting the individual-specific risk aversion and risk type namely i rii1

n as additional parametersWe can write the model as follows

(22) ln i xi i

(23) ln ri xi vi

(24) choicei 1 if ri ri ()0 if ri ri ()

(25) claimsi Poissoni ti

(26) i

vi

iid

N00

2 r

r r2

Let

2 r

r r2 X

x1

xn

0

0 x1

xn

y r and ui i

vi

The set of parameters for which we want to have a posterior distribution is given by uii1

n The prior specifies that are independent of uii1n have a conventional

diffuse prior We adopt a hierarchical prior for uii1N

(27) uii 1n

iid

N0

(28) 1 Wishart2a Q

782 THE AMERICAN ECONOMIC REVIEW JUNE 2007

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 21: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

incomplete-information (discussed later) and toother comparable figures in the literature Ourbenchmark estimate suggests that an averagequadratic utility maximizer36 will be indifferentabout participating in a 50ndash50 lottery in whichhe gains $100 or loses $5605 A CARA spec-ification suggests a significantly lower value foraverage risk aversion making the mean individ-ual indifferent about a 50ndash50 lottery of gaining$100 or losing $7651 The results from theincomplete information model for the meanlevel of risk aversion are in between these esti-mates All the results suggest that although themean individual exhibits a significant level ofrisk aversion heterogeneity in risk preferencesis important and the median individual is al-most risk neutral with respect to lotteries of$100 magnitude Although the mean is alwaysgreater than the median under the lognormaldistribution the large difference we find is notimposed In principle we could have obtained ahigh level of risk aversion with less heteroge-neity thereby leading to a smaller differencebetween the mean and the median (the esti-mated distribution of risk types is an example)

Let us briefly discuss the relevance of thecomparison to Gertner (1993) and Metrick(1995) There are two ways one can reconcilethe differences between the estimates Firstboth these papers measure risk aversion fortelevision game show participants these arehighly selected groups in a rather ldquorisk-friendlyrdquo environment37 Second the magni-tudes of the stakes are higher The showparticipants bet on several thousand dollars andmore while our average individual risks muchlower stakes in the range of $100 Thus thedifference in the results may be due to the issuesraised in Rabin (2000) regarding the compara-bility of behavior across different contexts andbet sizes We discuss this further in Section IV

A different way to quantify our estimate is byreporting them in relative terms There is noconsensus in the literature as to the relevantwealth that is taken into account in such deci-

sions Therefore for comparability we followthe closest papers in this respect (eg Gertner1993) and use annual income as the relevantwealth We multiply the estimated coefficient ofabsolute risk aversion by the average annualincome in Israel during the observation periodUnder the (questionable) assumption that an-nual income is a good proxy for the relevantwealth at the time of decision making thisproduct would be a proxy for the coefficient ofrelative risk aversion As Table 5 indicates ourbenchmark specification results in an impliedcoefficient of relative risk aversion of about 97A CARA specification results in a lower coef-ficient of 44 On the other hand the medianestimate for relative risk aversion is well belowone Thus the widely used estimate of a lowsingle-digit coefficient of relative risk aversionfalls somewhere between the median and themean and between the median and the seventy-fifth percentile of the risk aversion distribution

The Relationship between Unobserved Riskand Unobserved Risk AversionmdashTable 4 al-lows us to make observations about the relation-ship between risk and risk aversion We firstdiscuss the relative importance of unobservedheterogeneity of both dimensions In the popu-lation we study unobserved heterogeneity inrisk aversion (r) is much greater than unob-served heterogeneity in risk () This is trueboth in absolute terms (315 compared to 015respectively) and after normalizing by the cor-responding mean level38 using the coefficientof variation as a measure of dispersion (027compared to 01 respectively) It is also true forthe overall unconditional dispersion This couldindicate that selection on risk aversion is moreimportant in our data than adverse selection39 Theright metric to use for such statements is not

36 For such an individual the second-order Taylor ex-pansion we use in Section IIA is exact

37 We suspect that individuals who participate in televi-sion game shows are more adventuresome than the generalpopulation Moreover knowing that the audience mightwish to see them keep betting is likely to further encourageparticipants to take risks

38 All covariates are measured in deviations from theirsample mean so the estimated constant in each equation isthe estimated mean of the left-hand-side variable

39 An additional observation is that given our estimatesobservables account for slightly less than 50 percent of thevariation in ln(ri) but for almost 65 percent of the variation inln(i ) This may seem surprising given the finding that disper-sion in risk aversion is more important and thus should be thefocus of insurance companies However this finding is con-sistent with the conventional wisdom that insurance companiesspend much effort and resources on collecting information thathelps in risk classification but only little effort on informationthat predicts willingness to pay

765VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

entirely clear however as one should projectthese estimated variances onto the same scaleof say willingness to pay or profits Profits forexample are affected directly by risk but not byrisk aversion so the comparison above could bemisleading Therefore we relegate the discus-sion of this issue to Section IIIF where weshow that even when we look at pricing andprofits heterogeneity in risk aversion is moreimportant

Table 4 also indicates a strong and signif-icant positive correlation of 084 between un-observed risk aversion and unobserved riskThis result may be surprising because it isnatural to think that risk aversion with respectto financial decisions is likely to be associatedwith a greater tendency to take precautionsand therefore with lower risk Indeed a recentpaper by Finkelstein and McGarry (2006)supports such intuition by documenting anegative correlation between risk aversionand risk in the market for long-term careinsurance (see also Israel 2005 for the autoinsurance market in Illinois) Our marketmight however be special in ways that couldproduce a positive correlation First in con-trast to most insurance markets where apolicyholderrsquos risk depends on the policy-holderrsquos precautions but not on the precau-tions of others accident risk in the autoinsurance market is a result of an interactionbetween onersquos driving habits and those ofother drivers Second the correlation coeffi-cient may be highly sensitive to the particularway we measure risk and risk aversion Thereare many unobserved omitted factors that arelikely to be related to both dimensions Theintensity of vehicle use for example mightbe an important determinant of risk If indi-viduals who are more risk averse also drivemore miles per year a positive correlationbetween risk and risk aversion could emergeThus our results caution against assumingthat risk and risk aversion are always nega-tively correlated Whether this is the case maydepend on the characteristics of the particularmarket one studies and on the particular mea-sure for risk Indeed one can use estimatedannual mileage to control for one omittedvariable that may potentially work to producea positive correlation between risk aversionand risk Despite its partial coverage in thedata and being considered (by the company)

as unreliable40 controlling for annual mileagereported by policyholders reduces the esti-mated correlation coefficient to 06841 Weview this result as consistent with the possi-bility that underlying unobserved factors thataffect risk play an important role in generat-ing the estimated positive correlation betweenrisk and risk aversion A third explanation forthe positive estimated correlation is the dis-tributional assumption As discussed in Sec-tion IIC the correlation coefficient isprobably the parameter that is most sensitiveto these assumptions Indeed as discussedlater when we change our assumptions aboutthe Poisson process and use an alternativedistribution with extremely thin tails the es-timated correlation coefficient reverses signsWe of course view the assumptions of thebenchmark model as more appropriate andtherefore maintain the view that the data sug-gest a positive correlation Finally note thatwhile the correlation parameter we estimate ishigh the implied unconditional correlationbetween risk and risk aversion is less than025 across all reported specifications This isbecause the coefficients on the same covariate(for example the size of the car or whetherthe car is used for business) often affect riskand risk aversion in opposite directions andbecause of the logarithmic transformation

C Stability of the Risk Aversion Coefficients

Estimating risk preferences is motivated bythe idea that the same (or similar) risk aversionparameter may explain risky decisions acrossmultiple contexts This idea is at the heart of thecurrent debate we mention in the introductionregarding the empirical relevance of expectedutility and whether the standard construct of a

40 Insurance companies typically do not use these self-reported mileage estimates as they are considered unreli-able While companies could verify these estimates at thetime of a claim such reports are hard to enforce Anindividual can always claim that her ex ante estimate waslower than it turned out to be Indeed the estimated elas-ticity of risk with respect to estimated mileage is 006(0012) which seems low suggesting a bias downwardpotentially due to ldquoerrors in variablesrdquo bias

41 The full results from this specification are providedin the online Appendix Other results from this specifi-cation are similar to those we report for the benchmarkspecification

766 THE AMERICAN ECONOMIC REVIEW JUNE 2007

single risk aversion parameter that appliesacross many types of decisions is the right wayto think about consumer behavior To the bestof our knowledge there is no direct empiricalevidence on this issue While we defer much ofthis discussion to the concluding section weverify below that at least over a limited set ofchoices in which we observe the same individ-uals the estimated risk preferences help to ex-plain choices over time and across differentcontexts

Stability across ContextsmdashIdeally onewould like to show that the estimated risk pref-erences are stable across multiple lines of insur-ance and perhaps even across other riskydecisions individuals face Absent data on suchchoices we provide evidence that the estimatedrisk aversion coefficients help in predictingother related risky choices Individuals in ourdata had to make three additional coveragechoices in addition to the main policy choicethat we have analyzed so far They had tochoose whether to purchase coverage for caraudio equipment (bought by 498 percent of thesample of new policyholders) for towing ser-vices (bought by 929 percent) and for wind-shield damage (bought by 954 percent) Thesetypes of coverage are sold by the same companybut are administered by a third party so we donot have information about their realizationTherefore we cannot perform a similar exerciseto the one we perform for the deductible choiceOf course one should not expect these addi-tional coverage choices to be perfectly corre-lated with the deductible choice Even if thesame risk preferences are an important factorfor all of these decisions variation in risk acrosscoverage is also important For example own-ership of an expensive car audio system is likelyto affect the purchase of audio coverage inde-pendent of risk preferences Similarly ownersof old cars may value more coverage for towingservices Risk preferences however should en-ter into all of these coverage choices and there-fore we expect these additional types ofcoverages to be positively correlated with thechoice of a low deductible

We coded these three additional coveragechoices as three dummy variables and use themas additional covariates to verify that they canhelp explain unobserved risk aversion Firstwe add these variables to the probit regression

reported in column 3 of Table 3 They all havea positive statistically and economicallysignificant power in predicting the deductiblechoice42 To verify that these reported correla-tions are not driven by correlation in risks wealso estimate the benchmark model with theseadditional variables as covariates The esti-mated coefficients on the three additional cov-erage choices are positive and two of the threeare significant in both the risk and risk aversionequations they are 0053 (0019) 00002 (0021)and 0057 (0023) in the risk equation and0354 (0064) 0123 (0122) and 0813 (0174)in the risk aversion equation for audio towingand windshield coverage respectively (standarddeviations in parentheses)43 This suggests thatthe estimated risk preferences help in explainingmultiple coverage choices across these relatedcontexts

Stability over TimemdashWe now provide evi-dence that the estimated risk preferences arealso stable over time All the individuals whodecide to renew their policy with the companyafter it expires are at least in principle free tochange their deductible choice However anoverwhelming majority of individuals (morethan 97 percent) do not change their deductiblechoices when they renew Even individuals whoinitially chose the rarely chosen ldquohighrdquo andldquovery highrdquo deductible levels typically do notchange their choices Therefore it is also notsurprising that estimating the model on the firstdeductible choice of individuals and the seconddeductible choice of the same individuals yieldssimilar results as shown at the end of the nextsection Of course changes in deductiblechoices do not necessarily imply changes inrisk preferences Risk may also change overtime and may drive such changes At thesame time while persistence in deductiblechoices over time is consistent with stablerisk preferences it may also be driven bymany other factors For example it is reason-able to think that individuals do not devote

42 They are estimated to increase on average the prob-ability of choosing a low deductible by 0019 (819) 0023(527) and 0041 (758) for audio towing and windshieldcoverage respectively (z stat in parentheses)

43 The full results from this specification are provided in theonline Appendix Other results from this specification aresimilar to those we report for the benchmark specification

767VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

the same amount of thought to their renewaldecision as to their initial choice leading tothe observed persistence This caveat is themain reason why we focus most of the analysisonly on new policies

D Robustness

This section discusses the robustness of themain results Results from all specifications wemention in this section are summarized in Table6 The full results from these specifications arerelegated to the online Appendix

The von NeumannndashMorgenstern Utility Func-tionmdashTo derive the benchmark model we as-sume a negligible third derivative of the vNMutility function This is attractive as it allowsus to summarize risk preferences by a one-dimensional parameter r [u13(w)u(w)]There is however a large literature (eg MilesS Kimball and N Gregory Mankiw 1989) em-phasizing the importance of a (positive) third

derivative of the utility function which leads toprecautionary saving It is therefore importantto address the sensitivity of the results to thisassumption

By allowing a third-order Taylor approxima-tion of equation (4) we obtain

(15) r u13w

uw

p

d 1

d

uw

uw

dh2 dh dl dl

2

6d

which reduces to equation (7) when u(w) 0This illustrates that a positive third derivativeprovides an additional (precautionary) incentiveto insure It also shows that in order to fullydescribe preferences in the presence of a thirdderivative a one-dimensional parameter doesnot suffice one needs to know both r [u13(w)u(w)] and [u(w)u(w)] This makesit less attractive for estimation

TABLE 6mdashROBUSTNESS

Specificationa Sample Obs

Claim risk () Absolute risk aversion (r)

Corr(r ) Mean Median Std dev Mean Median Std dev

Baseline estimatesBenchmark model All new customers 105800 0220 0217 0048 19 103 73 106 0020 0207 0839

The vNM utility functionCARA utility All new customers 105800 0219 0217 0048 87 104 b 98 106 b 0009b 0201 0826

The claim generating processBenchmark model No multiple claims 103260 0182 0180 0040 20 103 28 105 0018 0135 0547Thinner-tail risk distribution All new customers 105800 0205c 0171c 0155c 17 103 19 106 0020 0076 0916

The distribution of risk aversionLower-bound procedure All new customers 105800 mdash mdash mdash 37 104 0 0002 mdash mdash

Incomplete information about riskBenchmark model Experienced drivers 82966 0214 0211 0051 21 103 83 106 0021 0200 0761Benchmark model Inexperienced drivers 22834 0230 0220 0073 30 103 12 107 0032 0186 0572Learning model All new customers 105800 0204 0191 0084 12 103 16 106 0016 0200 0772

Sample SelectionBenchmark model First two years 45739 0244 0235 0066 31 103 26 105 0026 0225 0699Benchmark model Last three years 60061 0203 0201 0043 16 103 34 107 0021 0113 0611Benchmark model Referred by a friend 26434 0213 0205 0065 30 103 84 107 0031 0155 0480Benchmark model Referred by advertising 79366 0219 0216 0051 21 103 76 106 0022 0212 0806Benchmark model Non-stayers 48387 0226 0240 0057 23 103 77 107 0026 0149 0848Benchmark model Stayers 1st choice 57413 0190 0182 0057 29 103 29 105 0024 0152 0463Benchmark model Stayers 2nd choice 57413 0208 0200 0065 30 103 16 105 0026 0211 0637

Note The table presents the key figures from various specifications and subsamples tracing the order they are presented in Section IIID Full results (in the format

of Table 4) from all these specifications are available in the online Appendixa ldquoBenchmark modelrdquo refers to the benchmark specification estimated on various subsamples (the first row replicates the estimates from Table 4) The other

specifications are slightly different and are all described in more detail in the corresponding parts of Section IIIDb The interpretation of r in the CARA model takes a slightly different quantitative meaning when applied to noninfinitesimal lotteries (such as the approximately

$100 stakes we analyze) This is due to the positive third derivative of the CARA utility function compared to the benchmark model in which we assume a small

third derivative Thus these numbers are not fully comparable to the corresponding figures in the other specificationsc The interpretation of in the thinner-tail distribution we estimate is slightly different from the standard Poisson rate which is assumed in the other specifications

Thus these numbers are not fully comparable to the corresponding figures in the other specifications

768 THE AMERICAN ECONOMIC REVIEW JUNE 2007

We can allow for a positive third derivativewithout expanding the dimensionality of pref-erences by imposing a parametric functionalform on the utility function Essentially suchparametric form imposes a relationship between[u13(w)u(w)] and [u(w)u(w)] Two standardforms are those that exhibit CARA and thosethat exhibit constant relative risk aversion(CRRA) CRRA requires us to make additionalassumptions about the relevant (and unob-served) wealth level of each individual makingit less attractive The CARA case is imple-mented below

With CARA utility we substitute u(w) exp(rw) in equation (4) and rearrange toobtain

(16) rp

exprdh exprdl

This equation defines the indifference set Unlikethe benchmark model there is no closed-formrepresentation for ri() a property that makesestimation significantly slower Due to the precau-tionary saving incentive arising from the thirdderivative we can use a lower level of absoluterisk aversion to rationalize the low deductiblechoice given In other words the CARA indif-ference set is flatter in comparison to the bench-mark case depicted in Figure 2 Thus the CARAspecification will generally lead to lower estimatesof the coefficient of absolute risk aversion asshown in Table 6 The mean and dispersion ofestimated absolute risk aversion (but not the me-dian) are smaller by about a factor of two Thegeneral qualitative results however remain thesame As shown in Table 5 the mean individual isquite risk averse with respect to lotteries of $100magnitude the median is close to risk neutral withrespect to such lotteries and heterogeneity in riskaversion is important

The Claim-Generating ProcessmdashIn thebenchmark model we assume that claims aregenerated by a Poisson model This assumptionis both analytically attractive (it is important inderiving equation (4)) and in our view a rea-sonable approximation of the data-generatingprocess for claims The Poisson distribution hasbeen widely used in economics to model acci-dent counts (eg Nancy L Rose 1990 Dionneand Vanasse 1992 Ronald Michener and CarlaTighe 1992) In using this distribution re-

searchers have followed the long history of theuse of Poisson by actuaries and insurance spe-cialists going back to Ladislaus Bortkiewicz(1898)

An important restriction of the Poisson dis-tribution is that its mean and variance are thesame Although some economic studies con-firmed this assumption for particular accidentdata (eg Christopher M Auld et al 2001) it isoften the case that this restriction is falsifiedThe most common deviation from the Poissonrestriction is that of fat tails ie variance that ishigher than the mean This led researchers touse a negative binomial distribution to accom-modate this regularity by introducing a secondparameter which delinks the relationship be-tween the mean and the variance One naturalinterpretation of the fat tails is that of unob-served heterogeneity and the negative binomialdistribution can be viewed as a Gamma mixtureof Poisson processes Consistent with this viewwe assume that the claim-generating processfollows a Poisson process at the individuallevel but allows unobserved heterogeneity inrisk and estimates a lognormal mixture of Pois-son processes which is similar to a negativebinomial The dispersion parameter we esti-mate is a free parameter which is identi-fied by the fatness of the tails of the claimdistribution

While one would like to allow fat tails of theaggregate claim distribution one may criticizethe assumption of a Poisson distribution at theindividual level for having tails that may be ldquotoofatrdquo Recently Jaap H Abbring et al (2003) andMark Israel (2004) provided evidence for neg-ative state dependence in data from auto insur-ance claims similar to the ones we useControlling for heterogeneity across individu-als these papers show that a second claim isless likely than a first This may happen due toexperience rating or perhaps to more carefuldriving and less intense use of a car after anaccident The Poisson distribution assumes thatthe second accident is just as likely so negativestate dependence may suggest thinner tails atthe individual level

To verify that the main results are not sensi-tive to this restriction of the Poisson model weperform two tests First we estimate the bench-mark model on a sample that includes onlyindividuals with one or no claims The model isstill identified using variation in policy durations

769VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

The estimates of the level of risk aversion andits dispersion across individuals remain similarto those of the benchmark model (Table 6) Theestimates of risk and the correlation with riskaversion are significantly lower But this isexpected we selected out of the sample thehigh-risk individuals

As a further test for the sensitivity of theresults to possible negative state dependencewe estimated the model with a different distri-bution at the individual level which gives riseto much thinner tails We could not find anyknown distribution of count variables that givesrise to tails thinner than Poissonrsquos so we madeone up In particular for a given Poisson ratelet pn be the probability of observing n occur-rences The distribution we take to the data issuch that

(17) Prn pn2

yenm0 pm2

Thus this distribution makes the probability ofmultiple claims much lower probably muchmore so than any realistic negative state depen-dence in the data Essentially such a distribu-tion makes the econometrician view individualswith multiple claims as high-risk individualswith much more certainty as it requires muchmore ldquobad luckrdquo for low-risk individuals tohave multiple claims The estimates for thelevel of risk aversion and its dispersion arealmost the same as in the benchmark model(Table 6) The rest of the summary figures aredifferent The difference in the level of risk isnot informative the interpretation of isslightly different given the change in the un-derlying distribution The dispersion in ismuch higher This is a direct result of the thin-tail assumption The dispersion of is identifiedby the tails of the claim distribution at theaggregate level With thinner tails imposed onthe individual distribution more heterogeneityis needed to match the observed tails at theaggregate Finally the correlation coefficient inthis specification changes signs and is now neg-ative and close to 1 This is consistent withour discussion in Section IIC which emphasizesthat the identification of the correlation coeffi-cient is closely tied to the structural assump-tions Once these are changed the estimatedcorrelation coefficient would change too Most

importantly however the mean and dispersionof risk aversion are stable

The Distribution of Absolute Risk Aver-sionmdashIn the benchmark model we assume thatthe coefficient of absolute risk aversion is lognor-mally distributed across individuals Since onlyfew previous studies focused on heterogeneity inrisk preferences there is not much existing evi-dence regarding the distribution of risk prefer-ences The only evidence we are aware of is theexperimental results presented by Steffen Andersenet al (2005) which show a skewed distributionwith a fat right tail which is qualitatively consis-tent with the lognormal distribution we assume

An additional advantage of the normality as-sumption is computational It provides a closed-form conditional distribution allowing us to useonly univariate (rather than bivariate) draws inthe estimation procedure significantly reducingthe computational burden One may be con-cerned about the sensitivity of the results to thisdistributional assumption For example it maydrive the result that the median level of riskaversion is much lower than the mean

Incorporating alternative distributional as-sumptions significantly complicates and slowsthe estimation procedure44 As an alternativewe develop a procedure that we believe pro-vides some guidance as to the sensitivity of theresults to this distributional assumption Thedisadvantage of the procedure is that it cannotaccount for adverse selection Since we foundthat adverse selection is not that important thisexercise is informative The exercise conveysthat the main qualitative results are not drivenby the (fat) tails of the lognormal distributionwe impose Rather they are driven by the highfraction of individuals who chose a low deduct-ible despite being of low risk The model im-plies that such individuals must have a fairlyhigh level of risk aversion

The exercise uses a Gibbs sampler to estimatethe lognormal distribution of i given the co-variates and the observed number of claims Ineach iteration of this Gibbs sampler conditional

44 A simple discrete-type distribution of risk aversion isnot well identified with the given data This is due to thenature of the exercise The distribution is identified bythe binary deductible choice and by the ldquolocalrdquo variation inthe pricing menu This variation is not enough to pin downthe discrete types accurately

770 THE AMERICAN ECONOMIC REVIEW JUNE 2007

on the data and the most recent draws for theindividual irsquos we compute lower bounds forthe level and dispersion of risk aversion (inaddition to generating the draws for ri from theconditional lognormal distribution) To deter-mine the lower bound for the level of risk aver-sion we compute

(18) r i i 0 if choicei 0

max0 pi

idi 113d i

if choicei 1

Namely we make ri as low as possible giventhe assumptions of the model We assume thatindividuals who chose a regular deductible arerisk neutral while individuals who chose a lowdeductible are just indifferent between the twodeductible levels (unless i is high enough inwhich case they are also assumed to be riskneutral) We then compute the average over therirsquos To compute the lower bound for the dis-persion we search for values of rirsquos that areconsistent with the assumptions of the modeland the observed data and that minimize thevariance of ri This turns out to be a simplesearch procedure that is linear in the number ofindividuals

The lower bound of the level of risk aversionthat we obtain from this exercise is 368 104

NIS1 (with standard deviation of 638 106)This level is five times lower than the analogouspoint estimate reported in Table 4 This trans-lates into indifference about participating in a50ndash50 lottery of gaining $100 and losing $885Thus it still suggests a significant level of riskaversion for lotteries of $100 magnitude Simi-larly the conversion to relative risk aversion asin Table 5 implies a relative risk aversion co-efficient of 1883 The result for the lowerbound of the dispersion is 189 103 (withstandard deviation of 816 105) which is tentimes lower than the benchmark estimate Thusthe coefficient of variation for absolute riskaversion declines only by two suggesting ahigh degree of heterogeneity Finally oneshould note that these bounds involve extremeassumptions and are computed separately andtherefore cannot be binding at the same timeThus the ldquocorrectrdquo estimates for both quantities

are likely to be higher closer to the results wereport for the benchmark model

Incomplete Information about RiskmdashThroughout we assume that individuals haveperfect information about their individual-specific objective risk type i This is a strongerassumption than we need Because expectedutility is linear in probabilities it suffices thatindividualsrsquo expected risk rate is equal to theirobjective risk rate ie i E(iIi) i wherei is a random variable representing individualirsquos perceived risk rate and Ii is individual irsquosinformation at the time of the coverage choiceNamely individuals could be uncertain abouttheir risk type but their point estimate has to becorrect

There are several channels through whichincomplete information may operate Let usconsider two such cases First suppose thatindividuals are correct but only on average iethat i i i where E(i) 0 The intuitionfor this case is similar to an ldquoerrors in variablesrdquomodel and in principle will result in an evenless important role for adverse selection Giventhat we find a relatively small effect of adverseselection this bias will not change this conclu-sion This may be even more pronounced ifcorr(i i) 0 which reflects a reasonableassumption of ldquoreversion to the meanrdquo ie thatan individualrsquos estimate of his risk type is someweighted average between his true risk type andthe average risk type of individuals who aresimilar (on observables) to him The conclusionmay go in the other way only if the mistakes goin the other direction according to which indi-viduals who are riskier than average believe thatthey are even more risky than they truly areThis we believe is less plausible

As a test for the sensitivity of the results tothe complete information assumption we per-form two exercises The first exercise estimatesthe benchmark model separately for experi-enced drivers and inexperienced drivers wherewe define an individual to be an experienceddriver if he has ten years or more driving expe-rience This is a conservative definition com-pared to those used in the literature (Chiapporiand Salanie 2000 Cohen 2005) The underlyingassumption consistent with evidence providedin Cohen (2005) is that drivers learn about theirown risk types as they gain more driving expe-rience but that most of this learning occurs

771VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

within the first ten years of driving Thus ourcomplete information model is a more appeal-ing description of experienced drivers Theresults for experienced drivers are almost iden-tical to the results from the benchmark specifi-cation (Table 6) suggesting that the mainresults are not driven by the fraction of individ-uals who are less likely to know their risk typesThe results for inexperienced drivers show asimilar pattern and magnitude Consistent withthe learning hypothesis however they show alarger dispersion in risk preferences which maybe due to incomplete information and there-fore to more random choices of deductiblelevels

The second exercise we perform is a morestructural version of the learning story whichallows even experienced drivers to have incom-plete information regarding their risk types Wedo so by estimating a different specification ofthe model that assumes that individuals areBayesian and update their information abouttheir own (stable) risk types over time usingonly information about the number of claimsthey make each year While we do not observecomplete claims histories of individuals wecan simulate such histories and integrate overthese simulations Thus individualsrsquo informa-tion would be related to their true type andwould be more precise with longer driving his-tories The full modeling and estimation detailsof this specification are provided in the Appen-dix We view this model as an extreme versionof incomplete information as there are manyother sources through which individuals maylearn about their own types and thereby havebetter information about their types than whatwe estimate them to have While the resultsimply that the levels of risk aversion and heter-ogeneity are lower than the benchmark esti-mates (Table 6) the order of magnitude andqualitative pattern are quite similar suggestingthat the main qualitative findings are robust tothe information structure

Sample SelectionmdashThere are two differentways to think about the contract selection pro-cess One possibility is that individuals firstselect an insurer based on advertisement wordof mouth or access to an agent Then individ-uals select the insurance contract from amongseveral contracts the selected insurer offersThis selection process is consistent with the way

the company executives view their business45

Another possibility is that individuals first col-lect information about all available insurancecontracts and then choose their most preferredone According to the industryrsquos conventionalwisdom individuals do not shop much acrossinsurers and therefore this selection processseems less important

The unique characteristics of direct insurersmay attract individuals who are more likely toexperiment with new ways of doing businessand may therefore be less risk averse than thegeneral population In Table 7 we compare thedemographics of our sample of policyholderswith those of the general Israeli populationThis comparison reflects a similar intuitioncompared with the general population our av-erage policyholder is slightly younger moreeducated more likely to be male and less likelyto be married or an immigrant This direction ofselection may also apply to unobserved riskpreferences thereby making our policyholderson average less risk averse than a representa-tive individual This suggests that the level ofrisk aversion that we find may be viewed as alower bound on the level of risk aversion in thegeneral Israeli population

One potential way to model sample selectionis to allow for an additional outside option to beselected For the vast majority of individuals weobserve the outside option is to purchase sim-ilar insurance from competing insurance agen-cies Unfortunately data on the structure ofcompeting contracts their prices and the waythey vary with individual characteristics are un-available This makes us uncomfortable to try tomodel sample selection as results from anysuch model will be driven by our assumptionsrather than by meaningful variation in the dataThe results are still meaningful for two reasonsFirst this is a large population accounting forabout 7 percent of all drivers in Israel Secondto the extent that our estimates suggest higherlevels of risk aversion than previously esti-mated and that the sample selection is likely to

45 As discussed in Section I the company was the firstdirect insurance provider and it offered significantly lowerpremiums than those offered by competitors due to signif-icant cost advantage In Section I we also discuss theliterature which emphasizes that choice of direct insurers isdriven primarily by nonmonetary ldquoamenitiesrdquo

772 THE AMERICAN ECONOMIC REVIEW JUNE 2007

bias these estimates downward the results arestill informative

To assess the magnitude of sample selectionin driving the main results we perform severaltests First we estimate the benchmark modelseparately for the first two years of the compa-nyrsquos operation during which the company wasthe only company to sell insurance directly andfor the remaining three years after additionaldirect insurers entered the market Second weestimate the benchmark model separately forindividuals who were referred to the companyby word of mouth and for those who heardabout the company through advertisement (pri-marily on television) The latter may havesearched more and might be more price sensi-tive Third we estimate the benchmark modelseparately for individuals who renewed theirpolicy with the company and for those who didnot renew It seems likely that individuals whodid not renew are less selected as most of themswitch back to regular insurers who insure themajority of the population Fourth we estimatethe model for the second deductible choicemade by those individuals who renew It couldbe argued that switching to other companies is

costly so outside options are not as important indriving deductible choices for those who renewAll these results are summarized in Table 6 andshow some important differences between thetwo groups within each pair of subsamples prob-ably reflecting selection For all groups howeverthe qualitative pattern of the results and the orderof magnitude and dispersion of risk aversion aresimilar to those of the full sample This is sugges-tive that correcting for sample selection is unlikelyto change the qualitative results

E Caveats

Moral HazardmdashThroughout our analysiswe abstract from moral hazard ie we assumethat i can vary across individuals but is invari-ant to the coverage choice There are two typesof moral hazard that may play a role in thiscontext First individuals with less coveragemay take greater precaution and drive morecarefully thereby reducing their claim risk rateSecond conditional on a claim event peoplewith higher deductibles are less likely to file aclaim there exists a range of claims for which

TABLE 7mdashREPRESENTATIVENESS

Variable Sampleb Populationc Car ownersd

Agea 4114 (1237) 4255 (1801) 4511 (1413)Female 0316 0518 0367Family Single 0143 0233 0067

Married 0780 0651 0838Divorced 0057 0043 0043Widower 0020 0074 0052

Education Elementary 0029 0329 0266High school 0433 0384 0334Technical 0100 0131 0165College 0438 0155 0234

Emigrant 0335 0445 0447

Obs 105800 723615 255435

a For the age variable the only continuous variable in the table we provide both the mean and the standard deviation (inparentheses)

b The figures are derived from Table 1 The family and education variables are renormalized so they add up to one weignore those individuals for whom we do not have family status or education level This is particularly relevant for theeducation variable which is not available for about half of the sample it seems likely that unreported education levels arenot random but associated with lower levels of education This may help in explaining at least some of the gap in educationlevels across the columns

c This column is based on a random sample of the Israeli population as of 1995 We use only adult population ieindividuals who are 18 years of age or older

d This column is based on a subsample of the population sample The data provide information about car ownership onlyat the household level not at the individual level Thus we define an individual as a car owner if (a) the household owns atleast one car and the individual is the head of the household or (b) the household owns at least two cars and the individualis the spouse of the head of the household

773VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

filing is profitable only under a low deductible(the literature often refers to this second effectas ldquoex post moral hazardrdquo)

To the extent that moral hazard exists abstract-ing from it will likely bias our estimates of riskaversion downward To see this note that adjust-ing behavior will help individuals to self-insureagainst uninsured costs Similarly ex post moralhazard would reduce the value of a low deduct-ible as in the event of a claim the gain from alower deductible would sometimes be less thanthe difference between the two deductible levelsBoth these effects will make a low deductible lessattractive requiring individuals to be even morerisk averse than we estimate in order to purchasemore coverage Below we discuss why in ourview abstracting from moral hazard is a reason-able approximation in this setting

It seems reasonable to conjecture that ceterisparibus insured individuals will drive less care-fully than uninsured ones It may also seemreasonable that the existence of a deductiblemay make individuals more careful about smalldamages to their car However when all choicesinclude a deductible and deductibles are similarin their magnitude it seems less likely thatdriving or care behavior will be affected46 Fi-nally to separately identify moral hazard onewould need another dimension of the data overwhich risk types remain fixed but coveragechoices vary exogenously (Israel 2004 Jaap HAbbring et al forthcoming)

We rely on the data to justify why we abstractfrom the second potential effect that of ex postmoral hazard Figure 3 presents data on theclaim amounts and shows that about 99 percentof the claims filed by policyholders with low-deductible policies were for amounts greaterthan the higher deductible level If the distribu-tion of amounts of potential claims does notvary with the deductible choice and if individ-uals file a claim for any loss that exceeds their

deductible level this suggests that 99 percent ofthe claims would have been filed under eitherdeductible choice making the assumption toabstract from moral hazard not very restrictive

Individuals however may choose not to filea claim even when the claim amount exceedsthe deductible level This may happen due toexperience rating which increases future insur-ance premiums These dynamic effects do notdepend on the deductible level at the time of theclaim so they simply enter in an additive wayUsing our data on individuals who renew theirpolicies with the company we can assess howbig the dynamic effects are These data showthat the price effect of a claim lasts for threeyears and is highest when an individual files hissecond claim within a year In such a case hewould face about a 20 percent increase in hisinsurance premium in the subsequent year 10percent in the year after and 5 percent in thethird year after the claim The regular premiumis about twice the regular deductible amount soan upper bound for the dynamic costs is 70percent of the regular deductible In most casesthe actual dynamic costs are much lower thanthis upper bound (the dynamic costs of say thefirst claim within a year are close to zero) Inaddition an individual can opt out of the con-tract and switch to a different insurance pro-vider This is likely to reduce his dynamic costsbecause in Israel unlike in the United Statesand many other countries there is no publicrecord of past claims Therefore insurance pro-viders can take full advantage of past recordsonly for their past customers For this reasonnew customers will of course face higher pre-miums than existing ones but the premium in-crease would not be as high as it would havebeen with the old insurance provider This isdue to the presence of ldquoinnocentrdquo new custom-ers who are pooled together with the switchers(Cohen 2003)47 Using 70 percent as a conser-vative upper bound Figure 3 shows that about93 percent of those claims filed by individualswith a low deductible would have also been46 This assumption is consistent with Cohen and Einav

(2003) who find no evidence for behavioral response tochanges in seat belt laws The following anecdotal obser-vation also supports this In an informal survey we con-ducted among our colleagues all of them were aware of adeductible in their auto insurance policy but fewer than 20percent knew its level This does not imply that 80 percentof our colleagues did not pay attention to their deductiblechoice at the time the choice was made It does implyhowever that their driving or care behavior cannot dependon the deductible level

47 New customers may voluntarily report their claimshistory to their new insurance provider Voluntary disclo-sure of past claims is as may be expected not truthful Ourdata suggest an unconditional claim rate of 02453 in oursample population Our data on claims history as voluntar-ily disclosed by the same individuals suggest a claim rate of00604 which is four times lower

774 THE AMERICAN ECONOMIC REVIEW JUNE 2007

filed with a regular deductible While this is notnegligible it applies for only a small fraction ofindividuals For the vast majority of them the99 percent discussed above is a more appropri-ate benchmark Therefore ex post moral hazardis unlikely to play a major role in this settingand one can abstract from the loss distributionand focus on claim rates as we do in thispaper48

Additional Cost of an AccidentmdashOur modelassumes that in the event of an accident theonly incurred costs are those associated with thedeductible payment In practice however other

transaction costs may be incurred such as thetime spent for appraisal of the damage the costsassociated with renting a replacement car for theduration of a repair etc Such costs could bereadily incorporated into the model To illus-trate we assume that these costs are known inadvance and are given by a constant c (whichcould in principle vary with each individual)Since c will not vary with the chosen level ofdeductible it will not affect the value d andwill enter the empirical model only through itseffect on d In particular equation (7) willchange to

(19) r

p

d 1

d c

and everything else will remain the same

48 We would not be as comfortable with this statementfor the choice of high and very high deductibles which areat much higher levels This is one additional reason to focusonly on the choice between low and regular deductibles

FIGURE 3 CLAIM DISTRIBUTIONS

Notes This figure plots kernel densities of the claim amounts estimated separately depending on the deductible choice Forease of comparison we normalize the claim amounts by the level of the regular deductible (ie the normalization is invariantto the deductible choice) and truncate the distribution at ten (the truncated part which includes a fat tail outside of the figureaccounts for about 25 percent of the distribution and is roughly similar for both deductible choices) The thick line presentsthe distribution of the claim amounts for individuals who chose a low deductible while the thin line does the same for thosewho chose a regular deductible Clearly both distributions are truncated from below at the deductible level The figure showsthat the distributions are fairly similar Assuming that the claim amount distribution is the same the area below the thickerline between 06 and 1 is the fraction of claims that would fall between the two deductible levels and therefore (absentdynamic incentives) would be filed only if a low deductible were chosen This area (between the two dotted vertical lines)amounts to 13 percent implying that the potential bias arising from restricting attention to claim rate (and abstracting fromthe claim distribution) is quite limited As we discuss in the text dynamic incentives due to experience rating may increasethe costs of filing a claim shifting the region in which the deductible choice matters to the right an upper bound to these costsis about 70 percent of the regular deductible covering an area (between the two dashed vertical lines) that integrates to morethan 7 percent Note however that these dynamic incentives are a very conservative upper bound they apply to less than 15percent of the individuals and do not account for the exit option which significantly reduces these dynamic costs

775VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

This implies that in principle such costs willhave no effect on the results of the counterfac-tual exercise we present later The costs willhowever affect the interpretation of the esti-mates of risk aversion In particular instead ofthe distribution of r we will now be estimatingthe distribution of r(d c)d so the reportedestimates of the coefficient of absolute riskaversion will be biased upward The magnitudeof the bias depends on the size of this transac-tion cost c compared to the average deductibled If c is relatively small the bias is negligibleIf however c is as big as the (average) deduct-ible level all our reported estimates of the levelof risk aversion should be divided by two (butthe coefficients on observables which are semi-elasticities will not change) The intuitionwould be similar but more involved if c variesacross individuals but not proportionally to d

Data about such transaction costs are ofcourse not available The following back-of-the-envelope exercise may provide some guidanceas to the magnitude of such costs We collecteddata from the Israeli police about the annual num-bers of accidents49 accidents with fatalities andaccidents with severe injuries in Israel for theyears 1996ndash1999 We then divided these numbersby our estimate of the total number of auto insur-ance claims in Israel (161859)50 We obtain that159 percent of the claims involve reported acci-dents 22 percent involve accidents with severeinjuries and 03 percent involve fatal accidentsThus the majority of claims reflect small unre-ported accidents perhaps suggesting that theseadditional costs of a claim are often not too large

Deviations from Expected Utility TheorymdashThroughout the paper we restrict attention toexpected utility maximizers Despite much ev-idence in the literature against some of the pre-dictions of expected utility theory it still seemsto us the most natural benchmark to specify andone that facilitates comparison to previous stud-ies We note that expected utility theory is as-sumed it is not and cannot be tested within our

framework Given our cross-sectional analysiswhich in principle allows flexible forms ofunobserved heterogeneity in risk preferencesthere are no testable restrictions imposed byexpected utility theory We should also note thatmuch (but not all) of the documented evidenceagainst expected utility theory arises with ex-treme risk probabilities which are close to zeroor one Our data (and our estimates) are basedon risk probabilities that are all in the range of010 to 035 Over this range expected utilityseems to perform better Finally it is importantto stress two points First at the conceptuallevel it is straightforward to use an alternativetheory of decisions under uncertainty If con-ditional on objective risk individuals vary in asingle dimension the same conceptual modeland empirical strategy can be applied All oneneeds to do is to specify the parameter overwhich decisions vary and construct an indiffer-ence set in the space of the specified parameterand (objective) risk types similar to the onepresented in Figure 2 Second any alternativemodel of decisions under uncertainty would re-quire us to take an even stronger view regardingthe parameterized objective function For exam-ple prospect theory (Daniel Kahneman andAmos Tversky 1979) would require us to pa-rameterize not only the curvature of individu-alsrsquo utility functions but also their referencepoints for which there is no natural choice inour context Similar issues arise if we try toapply decision weights (Tversky and PeterWakker 1995) or measures of overconfidencewith respect to driving ability

F Implications for Profits and Pricing

We now look at how firm profits vary withalternative pricing schemes This exercise isinteresting for several reasons First althoughwe do not use supply-side information for esti-mation it shows how one may incorporate suchinformation in estimation Second it illustratesthe conceptual trade-off faced by a monopolistthat operates in a market with adverse selectionAlthough the conceptual trade-off betweenhigher demand and worse selection is wellknown and has been extensively analyzed in thetheoretical literature quantifying it is importantto understand its empirical relevance Finallywe argued earlier that unobserved heterogeneityin risk aversion seems more important than un-

49 An accident is counted in this measure if it was re-ported to the police and a police officer arrived at the scene

50 This is estimated by taking the number of activepolicies at the end of our sample (45871) dividing it by ourbest guess for the share of the market the company had atthe time (7 percent) and multiplying it by the estimatedclaim rate in our data as computed in Table 2 (0245)

776 THE AMERICAN ECONOMIC REVIEW JUNE 2007

observed heterogeneity in risk The current ex-ercise shows that this conclusion also translatesto pricing and profits

Throughout this section we hold fixed thedistribution of risk and risk aversion the firmfaces Since we have little information about thedeterminants of overall demand faced by thefirm and hence how restrictive it is to hold thedistribution fixed we make the simplifying as-sumption that individuals make their choicessequentially They first choose the insuranceprovider by observing only the price of theregular deductible Once they decide to buy apolicy from the insurer they choose the deduct-ible level This seems a reasonable approxima-tion as the regular deductible is the one alwaysadvertised and initially quoted while the otheroptions are revealed only once the potentialcustomer and the insurance sales person ldquogetinto detailsrdquo Consistent with this assumptionwe assume that the regular premium and de-ductible are dictated by competitive conditionsand we focus on the choice of the low deduct-ible and its associated premium

From the companyrsquos standpoint each indi-vidual can be represented by a random draw of(i ri) from the conditional (on observables)distribution of risk and risk aversion

(20) ln i

ln ri Nxi

xi

2 r

r r2

When analyzing the optimal menu to offer suchan individual the company is assumed to berisk neutral and to maximize expected profitsSuppose the company offered only the regulardeductible and premium (dh ph) Let the ex-pected profits from this strategy be 0 Givenour assumptions we proceed by analyzing howthe firmrsquos profits are affected by a choice of anadditional low deductible option (dl pl) withdl dh and pl ph It is easy to use a changein variables and analyze the choice of d dh dl and p pl ph Expected profits arenow given by

(21) maxdp

0 Prri rii d pp

d Eiri rii d p

The trade-off in the companyrsquos decision isstraightforward Each new customer who chooses

the low combination pays an additional p up-front but saves d for each accident she is in-volved in This translates into two effects thatenter the companyrsquos decision problem The first issimilar to a standard pricing problem higher(lower) price difference (deductible difference)p (d) leads to a higher markup (on those indi-viduals who select the low deductible) but tolower quantity (or probability of purchase) asfewer individuals elect to choose the low deduct-ible This effect enters the profit function throughD(d p) Pr(ri ri(i d p)) The secondcomposition effect arises because of adverse se-lection As the price of the low deductible in-creases those individuals who still elect to choosethe low combination are ceteris paribus thosewith higher risk This effect enters through (dp) E(iri ri(i d p)) Its magnitude andsign depend on the relative heterogeneity of i andri and on the correlation between them Sinceneither D(d p) nor (d p) has a closed-form solution we analyze this decision problemgraphically where D(d p) and (d p) arenumerically computed using simulations from thejoint distribution

We illustrate our analysis by using the meanindividual in the data whose expected ln(i)and ln(ri) are 154 and 1181 respectively(based on Table 4) Such an individual has tochoose between a regular contract of ( phdh) (3189 1595) (in NIS) and a lowcontract of ( pl dl) (3381 957) ie (pd) (191 638) Below we discuss severaladditional counterfactual cases First we con-sider a case with a negative correlation be-tween risk and risk aversion (with the samemagnitude ie 084) Second we con-sider cases when the company ignores unob-served heterogeneity in one of the dimensionsie it views individuals as a draw from theestimated marginal distribution on one dimen-sion with the other dimension known and fixedat its estimated mean We do this exercise foreach dimension separately

To get intuition for the different effects Figure2 presents the estimated distribution in the spaceof (i ri) A small increase (decrease) in p (d)shifts the indifference setup to the right therebymaking some marginal individuals who were pre-viously just to the right of it switch to choosingthe regular deductible The demand trade-off isjust the comparison between the marginal loss ofthe company from all the marginal individuals

777VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

who no longer buy higher coverage with thehigher profits made from the inframarginal indi-viduals who still elect to choose higher coverageFigure 2 also helps in illustrating the effect ofadverse selection and the importance of the cor-relation coefficient As the menu shifts to the rightthe positive correlation implies that the marginalindividuals have higher risk than the average Thismeans that ldquolosingrdquo them (namely havingthem buy less coverage) is not as costly forthe insurance company as such individualsare on average more adversely selected Anegative correlation for example would havemade these marginal individuals more valuablethereby decreasing the incentive to increase pricesor deductibles from the current levels

Figure 4 presents the effects of pricing onprofits by varying the low-deductible levelkeeping the premium charged for it fixed at theobserved price (of p 191) It implies that the

current low deductible benefit of 638 NIS re-sults in additional annual profits of about 368NIS per customer This is about 037 percent oftotal operating profits per customer which isabout 1000 NIS Note however that after sub-tracting the administrative and claim-handlingcosts associated with each customer and claimthe relative magnitude of this effect will behigher Note also that the estimates imply thatthe current low-deductible level is suboptimalBy setting a smaller low-deductible benefit ofd 355 NIS (ie increasing the current lowdeductible by 283 NIS) additional profits canbe increased to 659 NIS51 Of course the find-

51 There is no reason of course to limit the choice of thecompany to only one additional deductible level Moredegrees of freedom in choosing the pricing menu will leadto higher profits

FIGURE 4 COUNTERFACTUALSmdashPROFITS

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) We plot the additional profits fromoffering a low deductible as a function of the attractiveness of a low deductible d dh dl fixing its price at the observedlevel The thick solid line presents the counterfactual profits as implied by the estimated benchmark model The other threecurves illustrate how the profits change in response to changes in the assumptions when the correlation between riskand risk aversion is negative (thin solid line) when there is no heterogeneity in risk aversion (dot-dashed line) and whenthere is no heterogeneity in risk (dashed line) The maxima (argmax) of the four curves respectively are 659 (355)714 (583) 0 and 704 (500) The dotted vertical line represents the observed level of d (638) which implies that theadditional profits from the observed low deductible are 368 NIS per policy

778 THE AMERICAN ECONOMIC REVIEW JUNE 2007

ing that the current pricing is suboptimal mayalso be due to a limitation of the model

Consistent with the intuition discussed aboveFigure 4 also shows that the incentive to in-crease prices (or lower the deductible) is higherwith positive correlation When the correlationbetween risk and risk aversion is negative theoptimal low deductible is lower It also showsthat ignoring either dimension of unobservedheterogeneity has an important effect on pric-ing decisions However while ignoring het-erogeneity in risk does not change thequalitative pattern by much ignoring hetero-geneity in risk aversion completely changesthe pricing analysis In fact given the esti-mated coefficients and the observed prices ifindividuals vary only in risk offering a lowdeductible does not increase profits due to ad-verse selection

Figure 5 breaks down the effect on profits bypresenting the pricing effect on the demand forlow deductible D(d p) and on the compo-sition effect (d p) The former is simplygenerated by the distribution of certainty equiv-alents implied by the joint distribution of i andri (Landsberger and Meilijson 1999) It is S-shaped due to the assumption of lognormal dis-tribution The shape of the composition effect isdriven by the relative variance of i and ri andby the correlation coefficient As the estimatesimply that most of the variation in certaintyequivalents is driven by variation in ri thestrong positive correlation implies that the com-position effect is monotonically decreasing inthe deductible level As the low-deductible op-tion becomes more favorable more peoplechoose it with the most risky individuals choos-ing it first The effect of the deductible level onthe composition effect is dramatically differentwhen the correlation between risk and risk aver-sion is negative When this is the case theobserved relationship between the deductiblelevel and the composition effect is mostly re-versed This is because the effect of risk aver-sion dominates that of adverse selection due toits higher variance

IV Concluding Remarks

The paper makes two separate contributionsFirst from a methodological standpoint we layout a conceptual framework through which one

can formulate and estimate a demand system forindividually customized contracts The key datarequirements for this approach are contractchoices individual choice sets and ex post riskrealizations Since such data may be available inmany other contexts the methodological frame-work may be useful to uncover structural pa-rameters in such settings As an example onecould consider annuity data and use guaranteeperiod choices and mortality data to identifyheterogeneity in risk (mortality) and in prefer-ences for wealth after death (Einav Finkelsteinand Paul Schrimpf 2006) Similarly one couldconsider loan data and use down-paymentchoices and default data to identify heterogene-ity in risk (default) and in liquidity (Will Ad-ams Einav and Jonathan D Levin 2006)Second from an economic standpoint we pro-vide a new set of estimates for the degree andheterogeneity of (absolute) risk aversion and itsrelationship with risk We discuss these below

While our estimates of risk aversion help topredict other related insurance decisions it isnatural to ask to what extent these parametersare relevant in other contexts This is essentiallyan empirical question which can be answeredonly by estimating risk aversion parameters fora variety of bet sizes and in a variety of con-texts Since isolating risk preferences in manycontexts is hard such exercises are rare leavingus with no definite answer for the scope ofmarkets for which our estimates may be rele-vant On one hand Rabin (2000) and Rabin andThaler (2001) argue that different decisions inlife are taken in different contexts and thereforemay be subject to different parameters in theutility function On the other hand classicaltheory suggests that each individual has onevalue function over her lifetime wealth so allrisky decisions take into account the same valuefunction and are therefore subject to the samerisk preferences Our view is somewhere inbetween We are more comfortable with extrap-olation of our risk aversion estimates to setupsthat are closer to the auto insurance marketcontext in which these parameters are esti-mated To assess ldquoclosenessrdquo it is important toconsider various factors over which contextsmay differ Such factors may include bet size aswell as informational and behavioral effectssuch as default options framing and rarity of theevents Since we use claim data to identify risktype we leave everything else to be interpreted as

779VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

FIGURE 5 COUNTERFACTUALSmdashSELECTION

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) Parallel to Figure 4 we breakdown the effects on profits to the share of consumers who choose a low deductible (bottom panel) and to the expected riskof this group (top panel) This is presented by the thick solid line for the estimates of the benchmark model As in Figure 4 wealso present these effects for three additional cases when the correlation between risk and risk aversion is negative (thin solidline) when there is no heterogeneity in risk aversion (dot-dashed line) and when there is no heterogeneity in risk (dashedline) The dotted vertical lines represent the observed level of d (638) for which the share of low deductible is 16 percentand their expected risk is 0264 This may be compared with the corresponding figures in Table 2A of 178 and 0309respectively Note however that the figure presents estimated quantities for the average individual in the data while Table2A presents the average quantities in the data so one should not expect the numbers to fit perfectly

780 THE AMERICAN ECONOMIC REVIEW JUNE 2007

risk aversion As an example overconfidence willbe captured by a lower level of estimated riskaversion so if overconfidence is more importantin auto insurance than in health insurance whenextrapolated to health insurance individuals maybehave as if they are more risk averse than weestimate them to be

With this caveat in mind let us discuss ourfour main findings First we find large hetero-geneity in risk preferences across individualsThis heterogeneity is important in various con-texts (a) it cautions against using pure cross-sectional variation to test expected utilitytheory (b) it may create strong selection ofparticipants into particular markets this maymake participants in voluntary markets quitedifferent in their risk preferences from those inmarkets in which participation is either manda-tory or driven by other factors and (c) it maymake the marginal individual quite differentfrom the average one Models in macroeconom-ics and finance which often use a representativeindividual framework may not be able to cap-ture and account for such differences

Our second set of findings concerns the wayrisk aversion relates to observable characteris-tics Our finding that women are more riskaverse than men has been documented in othersettings The finding that risk preferences ex-hibit a U-shaped pattern over the life cycle maybe interesting to explore further in other con-texts Other findings suggest that the estimatedcoefficient of risk aversion increases with ob-servables that are related to income and wealthAs we agree with the widely held belief of thedecreasing absolute risk aversion property ourpreferred interpretation for this finding is thatwealth and income may be endogenous gener-ating the estimated cross-sectional relationshipWhile lower risk aversion may be associatedwith higher propensity to become an entrepre-neur and thereby have higher wealth it mayalso be associated with lower propensity to saveor invest in education affecting wealth the otherway Therefore one important message of thisfinding is that accounting for heterogeneity inpreferences may be important as representativeconsumer models may provide misleading in-terpretations for otherwise natural results

The last two sets of findings concern therelationship between risk and preferences Sincerisk is particular to the context in which it ismeasured these findings may be sensitive to the

market context and may change once risk takesother forms Even within the auto insurancemarket it is important how risk is measuredMoreover risk in the auto insurance marketmay be conceptually different from risk in othermarkets In many markets risk is independentacross individuals In auto insurance howevermuch of the risk depends on coordinationamong drivers and therefore may be more re-lated to relative not absolute characteristicsFor this reason our finding of positive correla-tion between risk and risk aversion can coexistwith findings of negative correlation in othercontexts (Israel 2005 Finkelstein and McGarry2006) The positive correlation we find is alsoconsistent with the fact that the bivariate probittest in our data provides evidence for adverseselection (Cohen 2005) while similar reduced-form tests in other contexts do not

Finally we find that unobserved heterogene-ity in risk preferences is more important thanheterogeneity in risk This may be driven by thecasual evidence that insurance companies exertmuch effort and resources in collecting con-sumer data which are informative about riskclassification but not about preferences52 Weillustrate the empirical importance of our find-ings for the analysis of optimal contracts in autoinsurance The presence of more than one di-mension of unobserved heterogeneity may dra-matically change the nature of these contractsTheory is still not fully developed for suchmultidimensional screening problems as it typ-ically requires a small number of types (Lands-berger and Meilijson 1999) restricts the twodimensions to be independent of each other(Jean-Charles Rochet and Lars A Stole 2002)or assumes that the number of instrumentsavailable to the monopolist is not smaller thanthe dimension of unobserved heterogeneity(Steven Matthews and John Moore 1987 Rich-ard Arnott and Joseph E Stiglitz 1988)53 MarkArmstrong (1999) may be the closest theoreticalwork to the framework suggested here It cannot

52 This choice of data collection efforts may be justifiedif it is easier for such firms to price discriminate based onrisk but harder to price discriminate based on preferencesA common belief is that without cost-based (ie risk-based) justification for prices price discrimination may leadto consumer backlash

53 See also Smart (2000) Villeneuve (2003) and Jullienet al (2007) for related theoretical results

781VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

be directly applied however as it uses simpli-fying linearity assumptions which would behard to impose in the current context Our re-sults indicate that many applications can benefitfrom extending the theory to include the more

general case such as the one analyzed hereSuch a theory may also serve as a guide forusing supply-side restrictions in similar con-texts Our counterfactual analysis is a very pre-liminary start in this direction

APPENDIX A DESCRIPTION OF THE GIBBS SAMPLER

In this appendix we describe the setup of the Gibbs sampler that we use to estimate the modelOne of the main advantages of the Gibbs sampler is its ability to allow for data augmentation oflatent variables (Tanner and Wong 1987) In our context this amounts to augmenting the individual-specific risk aversion and risk type namely i rii1

n as additional parametersWe can write the model as follows

(22) ln i xi i

(23) ln ri xi vi

(24) choicei 1 if ri ri ()0 if ri ri ()

(25) claimsi Poissoni ti

(26) i

vi

iid

N00

2 r

r r2

Let

2 r

r r2 X

x1

xn

0

0 x1

xn

y r and ui i

vi

The set of parameters for which we want to have a posterior distribution is given by uii1

n The prior specifies that are independent of uii1n have a conventional

diffuse prior We adopt a hierarchical prior for uii1N

(27) uii 1n

iid

N0

(28) 1 Wishart2a Q

782 THE AMERICAN ECONOMIC REVIEW JUNE 2007

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 22: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

entirely clear however as one should projectthese estimated variances onto the same scaleof say willingness to pay or profits Profits forexample are affected directly by risk but not byrisk aversion so the comparison above could bemisleading Therefore we relegate the discus-sion of this issue to Section IIIF where weshow that even when we look at pricing andprofits heterogeneity in risk aversion is moreimportant

Table 4 also indicates a strong and signif-icant positive correlation of 084 between un-observed risk aversion and unobserved riskThis result may be surprising because it isnatural to think that risk aversion with respectto financial decisions is likely to be associatedwith a greater tendency to take precautionsand therefore with lower risk Indeed a recentpaper by Finkelstein and McGarry (2006)supports such intuition by documenting anegative correlation between risk aversionand risk in the market for long-term careinsurance (see also Israel 2005 for the autoinsurance market in Illinois) Our marketmight however be special in ways that couldproduce a positive correlation First in con-trast to most insurance markets where apolicyholderrsquos risk depends on the policy-holderrsquos precautions but not on the precau-tions of others accident risk in the autoinsurance market is a result of an interactionbetween onersquos driving habits and those ofother drivers Second the correlation coeffi-cient may be highly sensitive to the particularway we measure risk and risk aversion Thereare many unobserved omitted factors that arelikely to be related to both dimensions Theintensity of vehicle use for example mightbe an important determinant of risk If indi-viduals who are more risk averse also drivemore miles per year a positive correlationbetween risk and risk aversion could emergeThus our results caution against assumingthat risk and risk aversion are always nega-tively correlated Whether this is the case maydepend on the characteristics of the particularmarket one studies and on the particular mea-sure for risk Indeed one can use estimatedannual mileage to control for one omittedvariable that may potentially work to producea positive correlation between risk aversionand risk Despite its partial coverage in thedata and being considered (by the company)

as unreliable40 controlling for annual mileagereported by policyholders reduces the esti-mated correlation coefficient to 06841 Weview this result as consistent with the possi-bility that underlying unobserved factors thataffect risk play an important role in generat-ing the estimated positive correlation betweenrisk and risk aversion A third explanation forthe positive estimated correlation is the dis-tributional assumption As discussed in Sec-tion IIC the correlation coefficient isprobably the parameter that is most sensitiveto these assumptions Indeed as discussedlater when we change our assumptions aboutthe Poisson process and use an alternativedistribution with extremely thin tails the es-timated correlation coefficient reverses signsWe of course view the assumptions of thebenchmark model as more appropriate andtherefore maintain the view that the data sug-gest a positive correlation Finally note thatwhile the correlation parameter we estimate ishigh the implied unconditional correlationbetween risk and risk aversion is less than025 across all reported specifications This isbecause the coefficients on the same covariate(for example the size of the car or whetherthe car is used for business) often affect riskand risk aversion in opposite directions andbecause of the logarithmic transformation

C Stability of the Risk Aversion Coefficients

Estimating risk preferences is motivated bythe idea that the same (or similar) risk aversionparameter may explain risky decisions acrossmultiple contexts This idea is at the heart of thecurrent debate we mention in the introductionregarding the empirical relevance of expectedutility and whether the standard construct of a

40 Insurance companies typically do not use these self-reported mileage estimates as they are considered unreli-able While companies could verify these estimates at thetime of a claim such reports are hard to enforce Anindividual can always claim that her ex ante estimate waslower than it turned out to be Indeed the estimated elas-ticity of risk with respect to estimated mileage is 006(0012) which seems low suggesting a bias downwardpotentially due to ldquoerrors in variablesrdquo bias

41 The full results from this specification are providedin the online Appendix Other results from this specifi-cation are similar to those we report for the benchmarkspecification

766 THE AMERICAN ECONOMIC REVIEW JUNE 2007

single risk aversion parameter that appliesacross many types of decisions is the right wayto think about consumer behavior To the bestof our knowledge there is no direct empiricalevidence on this issue While we defer much ofthis discussion to the concluding section weverify below that at least over a limited set ofchoices in which we observe the same individ-uals the estimated risk preferences help to ex-plain choices over time and across differentcontexts

Stability across ContextsmdashIdeally onewould like to show that the estimated risk pref-erences are stable across multiple lines of insur-ance and perhaps even across other riskydecisions individuals face Absent data on suchchoices we provide evidence that the estimatedrisk aversion coefficients help in predictingother related risky choices Individuals in ourdata had to make three additional coveragechoices in addition to the main policy choicethat we have analyzed so far They had tochoose whether to purchase coverage for caraudio equipment (bought by 498 percent of thesample of new policyholders) for towing ser-vices (bought by 929 percent) and for wind-shield damage (bought by 954 percent) Thesetypes of coverage are sold by the same companybut are administered by a third party so we donot have information about their realizationTherefore we cannot perform a similar exerciseto the one we perform for the deductible choiceOf course one should not expect these addi-tional coverage choices to be perfectly corre-lated with the deductible choice Even if thesame risk preferences are an important factorfor all of these decisions variation in risk acrosscoverage is also important For example own-ership of an expensive car audio system is likelyto affect the purchase of audio coverage inde-pendent of risk preferences Similarly ownersof old cars may value more coverage for towingservices Risk preferences however should en-ter into all of these coverage choices and there-fore we expect these additional types ofcoverages to be positively correlated with thechoice of a low deductible

We coded these three additional coveragechoices as three dummy variables and use themas additional covariates to verify that they canhelp explain unobserved risk aversion Firstwe add these variables to the probit regression

reported in column 3 of Table 3 They all havea positive statistically and economicallysignificant power in predicting the deductiblechoice42 To verify that these reported correla-tions are not driven by correlation in risks wealso estimate the benchmark model with theseadditional variables as covariates The esti-mated coefficients on the three additional cov-erage choices are positive and two of the threeare significant in both the risk and risk aversionequations they are 0053 (0019) 00002 (0021)and 0057 (0023) in the risk equation and0354 (0064) 0123 (0122) and 0813 (0174)in the risk aversion equation for audio towingand windshield coverage respectively (standarddeviations in parentheses)43 This suggests thatthe estimated risk preferences help in explainingmultiple coverage choices across these relatedcontexts

Stability over TimemdashWe now provide evi-dence that the estimated risk preferences arealso stable over time All the individuals whodecide to renew their policy with the companyafter it expires are at least in principle free tochange their deductible choice However anoverwhelming majority of individuals (morethan 97 percent) do not change their deductiblechoices when they renew Even individuals whoinitially chose the rarely chosen ldquohighrdquo andldquovery highrdquo deductible levels typically do notchange their choices Therefore it is also notsurprising that estimating the model on the firstdeductible choice of individuals and the seconddeductible choice of the same individuals yieldssimilar results as shown at the end of the nextsection Of course changes in deductiblechoices do not necessarily imply changes inrisk preferences Risk may also change overtime and may drive such changes At thesame time while persistence in deductiblechoices over time is consistent with stablerisk preferences it may also be driven bymany other factors For example it is reason-able to think that individuals do not devote

42 They are estimated to increase on average the prob-ability of choosing a low deductible by 0019 (819) 0023(527) and 0041 (758) for audio towing and windshieldcoverage respectively (z stat in parentheses)

43 The full results from this specification are provided in theonline Appendix Other results from this specification aresimilar to those we report for the benchmark specification

767VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

the same amount of thought to their renewaldecision as to their initial choice leading tothe observed persistence This caveat is themain reason why we focus most of the analysisonly on new policies

D Robustness

This section discusses the robustness of themain results Results from all specifications wemention in this section are summarized in Table6 The full results from these specifications arerelegated to the online Appendix

The von NeumannndashMorgenstern Utility Func-tionmdashTo derive the benchmark model we as-sume a negligible third derivative of the vNMutility function This is attractive as it allowsus to summarize risk preferences by a one-dimensional parameter r [u13(w)u(w)]There is however a large literature (eg MilesS Kimball and N Gregory Mankiw 1989) em-phasizing the importance of a (positive) third

derivative of the utility function which leads toprecautionary saving It is therefore importantto address the sensitivity of the results to thisassumption

By allowing a third-order Taylor approxima-tion of equation (4) we obtain

(15) r u13w

uw

p

d 1

d

uw

uw

dh2 dh dl dl

2

6d

which reduces to equation (7) when u(w) 0This illustrates that a positive third derivativeprovides an additional (precautionary) incentiveto insure It also shows that in order to fullydescribe preferences in the presence of a thirdderivative a one-dimensional parameter doesnot suffice one needs to know both r [u13(w)u(w)] and [u(w)u(w)] This makesit less attractive for estimation

TABLE 6mdashROBUSTNESS

Specificationa Sample Obs

Claim risk () Absolute risk aversion (r)

Corr(r ) Mean Median Std dev Mean Median Std dev

Baseline estimatesBenchmark model All new customers 105800 0220 0217 0048 19 103 73 106 0020 0207 0839

The vNM utility functionCARA utility All new customers 105800 0219 0217 0048 87 104 b 98 106 b 0009b 0201 0826

The claim generating processBenchmark model No multiple claims 103260 0182 0180 0040 20 103 28 105 0018 0135 0547Thinner-tail risk distribution All new customers 105800 0205c 0171c 0155c 17 103 19 106 0020 0076 0916

The distribution of risk aversionLower-bound procedure All new customers 105800 mdash mdash mdash 37 104 0 0002 mdash mdash

Incomplete information about riskBenchmark model Experienced drivers 82966 0214 0211 0051 21 103 83 106 0021 0200 0761Benchmark model Inexperienced drivers 22834 0230 0220 0073 30 103 12 107 0032 0186 0572Learning model All new customers 105800 0204 0191 0084 12 103 16 106 0016 0200 0772

Sample SelectionBenchmark model First two years 45739 0244 0235 0066 31 103 26 105 0026 0225 0699Benchmark model Last three years 60061 0203 0201 0043 16 103 34 107 0021 0113 0611Benchmark model Referred by a friend 26434 0213 0205 0065 30 103 84 107 0031 0155 0480Benchmark model Referred by advertising 79366 0219 0216 0051 21 103 76 106 0022 0212 0806Benchmark model Non-stayers 48387 0226 0240 0057 23 103 77 107 0026 0149 0848Benchmark model Stayers 1st choice 57413 0190 0182 0057 29 103 29 105 0024 0152 0463Benchmark model Stayers 2nd choice 57413 0208 0200 0065 30 103 16 105 0026 0211 0637

Note The table presents the key figures from various specifications and subsamples tracing the order they are presented in Section IIID Full results (in the format

of Table 4) from all these specifications are available in the online Appendixa ldquoBenchmark modelrdquo refers to the benchmark specification estimated on various subsamples (the first row replicates the estimates from Table 4) The other

specifications are slightly different and are all described in more detail in the corresponding parts of Section IIIDb The interpretation of r in the CARA model takes a slightly different quantitative meaning when applied to noninfinitesimal lotteries (such as the approximately

$100 stakes we analyze) This is due to the positive third derivative of the CARA utility function compared to the benchmark model in which we assume a small

third derivative Thus these numbers are not fully comparable to the corresponding figures in the other specificationsc The interpretation of in the thinner-tail distribution we estimate is slightly different from the standard Poisson rate which is assumed in the other specifications

Thus these numbers are not fully comparable to the corresponding figures in the other specifications

768 THE AMERICAN ECONOMIC REVIEW JUNE 2007

We can allow for a positive third derivativewithout expanding the dimensionality of pref-erences by imposing a parametric functionalform on the utility function Essentially suchparametric form imposes a relationship between[u13(w)u(w)] and [u(w)u(w)] Two standardforms are those that exhibit CARA and thosethat exhibit constant relative risk aversion(CRRA) CRRA requires us to make additionalassumptions about the relevant (and unob-served) wealth level of each individual makingit less attractive The CARA case is imple-mented below

With CARA utility we substitute u(w) exp(rw) in equation (4) and rearrange toobtain

(16) rp

exprdh exprdl

This equation defines the indifference set Unlikethe benchmark model there is no closed-formrepresentation for ri() a property that makesestimation significantly slower Due to the precau-tionary saving incentive arising from the thirdderivative we can use a lower level of absoluterisk aversion to rationalize the low deductiblechoice given In other words the CARA indif-ference set is flatter in comparison to the bench-mark case depicted in Figure 2 Thus the CARAspecification will generally lead to lower estimatesof the coefficient of absolute risk aversion asshown in Table 6 The mean and dispersion ofestimated absolute risk aversion (but not the me-dian) are smaller by about a factor of two Thegeneral qualitative results however remain thesame As shown in Table 5 the mean individual isquite risk averse with respect to lotteries of $100magnitude the median is close to risk neutral withrespect to such lotteries and heterogeneity in riskaversion is important

The Claim-Generating ProcessmdashIn thebenchmark model we assume that claims aregenerated by a Poisson model This assumptionis both analytically attractive (it is important inderiving equation (4)) and in our view a rea-sonable approximation of the data-generatingprocess for claims The Poisson distribution hasbeen widely used in economics to model acci-dent counts (eg Nancy L Rose 1990 Dionneand Vanasse 1992 Ronald Michener and CarlaTighe 1992) In using this distribution re-

searchers have followed the long history of theuse of Poisson by actuaries and insurance spe-cialists going back to Ladislaus Bortkiewicz(1898)

An important restriction of the Poisson dis-tribution is that its mean and variance are thesame Although some economic studies con-firmed this assumption for particular accidentdata (eg Christopher M Auld et al 2001) it isoften the case that this restriction is falsifiedThe most common deviation from the Poissonrestriction is that of fat tails ie variance that ishigher than the mean This led researchers touse a negative binomial distribution to accom-modate this regularity by introducing a secondparameter which delinks the relationship be-tween the mean and the variance One naturalinterpretation of the fat tails is that of unob-served heterogeneity and the negative binomialdistribution can be viewed as a Gamma mixtureof Poisson processes Consistent with this viewwe assume that the claim-generating processfollows a Poisson process at the individuallevel but allows unobserved heterogeneity inrisk and estimates a lognormal mixture of Pois-son processes which is similar to a negativebinomial The dispersion parameter we esti-mate is a free parameter which is identi-fied by the fatness of the tails of the claimdistribution

While one would like to allow fat tails of theaggregate claim distribution one may criticizethe assumption of a Poisson distribution at theindividual level for having tails that may be ldquotoofatrdquo Recently Jaap H Abbring et al (2003) andMark Israel (2004) provided evidence for neg-ative state dependence in data from auto insur-ance claims similar to the ones we useControlling for heterogeneity across individu-als these papers show that a second claim isless likely than a first This may happen due toexperience rating or perhaps to more carefuldriving and less intense use of a car after anaccident The Poisson distribution assumes thatthe second accident is just as likely so negativestate dependence may suggest thinner tails atthe individual level

To verify that the main results are not sensi-tive to this restriction of the Poisson model weperform two tests First we estimate the bench-mark model on a sample that includes onlyindividuals with one or no claims The model isstill identified using variation in policy durations

769VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

The estimates of the level of risk aversion andits dispersion across individuals remain similarto those of the benchmark model (Table 6) Theestimates of risk and the correlation with riskaversion are significantly lower But this isexpected we selected out of the sample thehigh-risk individuals

As a further test for the sensitivity of theresults to possible negative state dependencewe estimated the model with a different distri-bution at the individual level which gives riseto much thinner tails We could not find anyknown distribution of count variables that givesrise to tails thinner than Poissonrsquos so we madeone up In particular for a given Poisson ratelet pn be the probability of observing n occur-rences The distribution we take to the data issuch that

(17) Prn pn2

yenm0 pm2

Thus this distribution makes the probability ofmultiple claims much lower probably muchmore so than any realistic negative state depen-dence in the data Essentially such a distribu-tion makes the econometrician view individualswith multiple claims as high-risk individualswith much more certainty as it requires muchmore ldquobad luckrdquo for low-risk individuals tohave multiple claims The estimates for thelevel of risk aversion and its dispersion arealmost the same as in the benchmark model(Table 6) The rest of the summary figures aredifferent The difference in the level of risk isnot informative the interpretation of isslightly different given the change in the un-derlying distribution The dispersion in ismuch higher This is a direct result of the thin-tail assumption The dispersion of is identifiedby the tails of the claim distribution at theaggregate level With thinner tails imposed onthe individual distribution more heterogeneityis needed to match the observed tails at theaggregate Finally the correlation coefficient inthis specification changes signs and is now neg-ative and close to 1 This is consistent withour discussion in Section IIC which emphasizesthat the identification of the correlation coeffi-cient is closely tied to the structural assump-tions Once these are changed the estimatedcorrelation coefficient would change too Most

importantly however the mean and dispersionof risk aversion are stable

The Distribution of Absolute Risk Aver-sionmdashIn the benchmark model we assume thatthe coefficient of absolute risk aversion is lognor-mally distributed across individuals Since onlyfew previous studies focused on heterogeneity inrisk preferences there is not much existing evi-dence regarding the distribution of risk prefer-ences The only evidence we are aware of is theexperimental results presented by Steffen Andersenet al (2005) which show a skewed distributionwith a fat right tail which is qualitatively consis-tent with the lognormal distribution we assume

An additional advantage of the normality as-sumption is computational It provides a closed-form conditional distribution allowing us to useonly univariate (rather than bivariate) draws inthe estimation procedure significantly reducingthe computational burden One may be con-cerned about the sensitivity of the results to thisdistributional assumption For example it maydrive the result that the median level of riskaversion is much lower than the mean

Incorporating alternative distributional as-sumptions significantly complicates and slowsthe estimation procedure44 As an alternativewe develop a procedure that we believe pro-vides some guidance as to the sensitivity of theresults to this distributional assumption Thedisadvantage of the procedure is that it cannotaccount for adverse selection Since we foundthat adverse selection is not that important thisexercise is informative The exercise conveysthat the main qualitative results are not drivenby the (fat) tails of the lognormal distributionwe impose Rather they are driven by the highfraction of individuals who chose a low deduct-ible despite being of low risk The model im-plies that such individuals must have a fairlyhigh level of risk aversion

The exercise uses a Gibbs sampler to estimatethe lognormal distribution of i given the co-variates and the observed number of claims Ineach iteration of this Gibbs sampler conditional

44 A simple discrete-type distribution of risk aversion isnot well identified with the given data This is due to thenature of the exercise The distribution is identified bythe binary deductible choice and by the ldquolocalrdquo variation inthe pricing menu This variation is not enough to pin downthe discrete types accurately

770 THE AMERICAN ECONOMIC REVIEW JUNE 2007

on the data and the most recent draws for theindividual irsquos we compute lower bounds forthe level and dispersion of risk aversion (inaddition to generating the draws for ri from theconditional lognormal distribution) To deter-mine the lower bound for the level of risk aver-sion we compute

(18) r i i 0 if choicei 0

max0 pi

idi 113d i

if choicei 1

Namely we make ri as low as possible giventhe assumptions of the model We assume thatindividuals who chose a regular deductible arerisk neutral while individuals who chose a lowdeductible are just indifferent between the twodeductible levels (unless i is high enough inwhich case they are also assumed to be riskneutral) We then compute the average over therirsquos To compute the lower bound for the dis-persion we search for values of rirsquos that areconsistent with the assumptions of the modeland the observed data and that minimize thevariance of ri This turns out to be a simplesearch procedure that is linear in the number ofindividuals

The lower bound of the level of risk aversionthat we obtain from this exercise is 368 104

NIS1 (with standard deviation of 638 106)This level is five times lower than the analogouspoint estimate reported in Table 4 This trans-lates into indifference about participating in a50ndash50 lottery of gaining $100 and losing $885Thus it still suggests a significant level of riskaversion for lotteries of $100 magnitude Simi-larly the conversion to relative risk aversion asin Table 5 implies a relative risk aversion co-efficient of 1883 The result for the lowerbound of the dispersion is 189 103 (withstandard deviation of 816 105) which is tentimes lower than the benchmark estimate Thusthe coefficient of variation for absolute riskaversion declines only by two suggesting ahigh degree of heterogeneity Finally oneshould note that these bounds involve extremeassumptions and are computed separately andtherefore cannot be binding at the same timeThus the ldquocorrectrdquo estimates for both quantities

are likely to be higher closer to the results wereport for the benchmark model

Incomplete Information about RiskmdashThroughout we assume that individuals haveperfect information about their individual-specific objective risk type i This is a strongerassumption than we need Because expectedutility is linear in probabilities it suffices thatindividualsrsquo expected risk rate is equal to theirobjective risk rate ie i E(iIi) i wherei is a random variable representing individualirsquos perceived risk rate and Ii is individual irsquosinformation at the time of the coverage choiceNamely individuals could be uncertain abouttheir risk type but their point estimate has to becorrect

There are several channels through whichincomplete information may operate Let usconsider two such cases First suppose thatindividuals are correct but only on average iethat i i i where E(i) 0 The intuitionfor this case is similar to an ldquoerrors in variablesrdquomodel and in principle will result in an evenless important role for adverse selection Giventhat we find a relatively small effect of adverseselection this bias will not change this conclu-sion This may be even more pronounced ifcorr(i i) 0 which reflects a reasonableassumption of ldquoreversion to the meanrdquo ie thatan individualrsquos estimate of his risk type is someweighted average between his true risk type andthe average risk type of individuals who aresimilar (on observables) to him The conclusionmay go in the other way only if the mistakes goin the other direction according to which indi-viduals who are riskier than average believe thatthey are even more risky than they truly areThis we believe is less plausible

As a test for the sensitivity of the results tothe complete information assumption we per-form two exercises The first exercise estimatesthe benchmark model separately for experi-enced drivers and inexperienced drivers wherewe define an individual to be an experienceddriver if he has ten years or more driving expe-rience This is a conservative definition com-pared to those used in the literature (Chiapporiand Salanie 2000 Cohen 2005) The underlyingassumption consistent with evidence providedin Cohen (2005) is that drivers learn about theirown risk types as they gain more driving expe-rience but that most of this learning occurs

771VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

within the first ten years of driving Thus ourcomplete information model is a more appeal-ing description of experienced drivers Theresults for experienced drivers are almost iden-tical to the results from the benchmark specifi-cation (Table 6) suggesting that the mainresults are not driven by the fraction of individ-uals who are less likely to know their risk typesThe results for inexperienced drivers show asimilar pattern and magnitude Consistent withthe learning hypothesis however they show alarger dispersion in risk preferences which maybe due to incomplete information and there-fore to more random choices of deductiblelevels

The second exercise we perform is a morestructural version of the learning story whichallows even experienced drivers to have incom-plete information regarding their risk types Wedo so by estimating a different specification ofthe model that assumes that individuals areBayesian and update their information abouttheir own (stable) risk types over time usingonly information about the number of claimsthey make each year While we do not observecomplete claims histories of individuals wecan simulate such histories and integrate overthese simulations Thus individualsrsquo informa-tion would be related to their true type andwould be more precise with longer driving his-tories The full modeling and estimation detailsof this specification are provided in the Appen-dix We view this model as an extreme versionof incomplete information as there are manyother sources through which individuals maylearn about their own types and thereby havebetter information about their types than whatwe estimate them to have While the resultsimply that the levels of risk aversion and heter-ogeneity are lower than the benchmark esti-mates (Table 6) the order of magnitude andqualitative pattern are quite similar suggestingthat the main qualitative findings are robust tothe information structure

Sample SelectionmdashThere are two differentways to think about the contract selection pro-cess One possibility is that individuals firstselect an insurer based on advertisement wordof mouth or access to an agent Then individ-uals select the insurance contract from amongseveral contracts the selected insurer offersThis selection process is consistent with the way

the company executives view their business45

Another possibility is that individuals first col-lect information about all available insurancecontracts and then choose their most preferredone According to the industryrsquos conventionalwisdom individuals do not shop much acrossinsurers and therefore this selection processseems less important

The unique characteristics of direct insurersmay attract individuals who are more likely toexperiment with new ways of doing businessand may therefore be less risk averse than thegeneral population In Table 7 we compare thedemographics of our sample of policyholderswith those of the general Israeli populationThis comparison reflects a similar intuitioncompared with the general population our av-erage policyholder is slightly younger moreeducated more likely to be male and less likelyto be married or an immigrant This direction ofselection may also apply to unobserved riskpreferences thereby making our policyholderson average less risk averse than a representa-tive individual This suggests that the level ofrisk aversion that we find may be viewed as alower bound on the level of risk aversion in thegeneral Israeli population

One potential way to model sample selectionis to allow for an additional outside option to beselected For the vast majority of individuals weobserve the outside option is to purchase sim-ilar insurance from competing insurance agen-cies Unfortunately data on the structure ofcompeting contracts their prices and the waythey vary with individual characteristics are un-available This makes us uncomfortable to try tomodel sample selection as results from anysuch model will be driven by our assumptionsrather than by meaningful variation in the dataThe results are still meaningful for two reasonsFirst this is a large population accounting forabout 7 percent of all drivers in Israel Secondto the extent that our estimates suggest higherlevels of risk aversion than previously esti-mated and that the sample selection is likely to

45 As discussed in Section I the company was the firstdirect insurance provider and it offered significantly lowerpremiums than those offered by competitors due to signif-icant cost advantage In Section I we also discuss theliterature which emphasizes that choice of direct insurers isdriven primarily by nonmonetary ldquoamenitiesrdquo

772 THE AMERICAN ECONOMIC REVIEW JUNE 2007

bias these estimates downward the results arestill informative

To assess the magnitude of sample selectionin driving the main results we perform severaltests First we estimate the benchmark modelseparately for the first two years of the compa-nyrsquos operation during which the company wasthe only company to sell insurance directly andfor the remaining three years after additionaldirect insurers entered the market Second weestimate the benchmark model separately forindividuals who were referred to the companyby word of mouth and for those who heardabout the company through advertisement (pri-marily on television) The latter may havesearched more and might be more price sensi-tive Third we estimate the benchmark modelseparately for individuals who renewed theirpolicy with the company and for those who didnot renew It seems likely that individuals whodid not renew are less selected as most of themswitch back to regular insurers who insure themajority of the population Fourth we estimatethe model for the second deductible choicemade by those individuals who renew It couldbe argued that switching to other companies is

costly so outside options are not as important indriving deductible choices for those who renewAll these results are summarized in Table 6 andshow some important differences between thetwo groups within each pair of subsamples prob-ably reflecting selection For all groups howeverthe qualitative pattern of the results and the orderof magnitude and dispersion of risk aversion aresimilar to those of the full sample This is sugges-tive that correcting for sample selection is unlikelyto change the qualitative results

E Caveats

Moral HazardmdashThroughout our analysiswe abstract from moral hazard ie we assumethat i can vary across individuals but is invari-ant to the coverage choice There are two typesof moral hazard that may play a role in thiscontext First individuals with less coveragemay take greater precaution and drive morecarefully thereby reducing their claim risk rateSecond conditional on a claim event peoplewith higher deductibles are less likely to file aclaim there exists a range of claims for which

TABLE 7mdashREPRESENTATIVENESS

Variable Sampleb Populationc Car ownersd

Agea 4114 (1237) 4255 (1801) 4511 (1413)Female 0316 0518 0367Family Single 0143 0233 0067

Married 0780 0651 0838Divorced 0057 0043 0043Widower 0020 0074 0052

Education Elementary 0029 0329 0266High school 0433 0384 0334Technical 0100 0131 0165College 0438 0155 0234

Emigrant 0335 0445 0447

Obs 105800 723615 255435

a For the age variable the only continuous variable in the table we provide both the mean and the standard deviation (inparentheses)

b The figures are derived from Table 1 The family and education variables are renormalized so they add up to one weignore those individuals for whom we do not have family status or education level This is particularly relevant for theeducation variable which is not available for about half of the sample it seems likely that unreported education levels arenot random but associated with lower levels of education This may help in explaining at least some of the gap in educationlevels across the columns

c This column is based on a random sample of the Israeli population as of 1995 We use only adult population ieindividuals who are 18 years of age or older

d This column is based on a subsample of the population sample The data provide information about car ownership onlyat the household level not at the individual level Thus we define an individual as a car owner if (a) the household owns atleast one car and the individual is the head of the household or (b) the household owns at least two cars and the individualis the spouse of the head of the household

773VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

filing is profitable only under a low deductible(the literature often refers to this second effectas ldquoex post moral hazardrdquo)

To the extent that moral hazard exists abstract-ing from it will likely bias our estimates of riskaversion downward To see this note that adjust-ing behavior will help individuals to self-insureagainst uninsured costs Similarly ex post moralhazard would reduce the value of a low deduct-ible as in the event of a claim the gain from alower deductible would sometimes be less thanthe difference between the two deductible levelsBoth these effects will make a low deductible lessattractive requiring individuals to be even morerisk averse than we estimate in order to purchasemore coverage Below we discuss why in ourview abstracting from moral hazard is a reason-able approximation in this setting

It seems reasonable to conjecture that ceterisparibus insured individuals will drive less care-fully than uninsured ones It may also seemreasonable that the existence of a deductiblemay make individuals more careful about smalldamages to their car However when all choicesinclude a deductible and deductibles are similarin their magnitude it seems less likely thatdriving or care behavior will be affected46 Fi-nally to separately identify moral hazard onewould need another dimension of the data overwhich risk types remain fixed but coveragechoices vary exogenously (Israel 2004 Jaap HAbbring et al forthcoming)

We rely on the data to justify why we abstractfrom the second potential effect that of ex postmoral hazard Figure 3 presents data on theclaim amounts and shows that about 99 percentof the claims filed by policyholders with low-deductible policies were for amounts greaterthan the higher deductible level If the distribu-tion of amounts of potential claims does notvary with the deductible choice and if individ-uals file a claim for any loss that exceeds their

deductible level this suggests that 99 percent ofthe claims would have been filed under eitherdeductible choice making the assumption toabstract from moral hazard not very restrictive

Individuals however may choose not to filea claim even when the claim amount exceedsthe deductible level This may happen due toexperience rating which increases future insur-ance premiums These dynamic effects do notdepend on the deductible level at the time of theclaim so they simply enter in an additive wayUsing our data on individuals who renew theirpolicies with the company we can assess howbig the dynamic effects are These data showthat the price effect of a claim lasts for threeyears and is highest when an individual files hissecond claim within a year In such a case hewould face about a 20 percent increase in hisinsurance premium in the subsequent year 10percent in the year after and 5 percent in thethird year after the claim The regular premiumis about twice the regular deductible amount soan upper bound for the dynamic costs is 70percent of the regular deductible In most casesthe actual dynamic costs are much lower thanthis upper bound (the dynamic costs of say thefirst claim within a year are close to zero) Inaddition an individual can opt out of the con-tract and switch to a different insurance pro-vider This is likely to reduce his dynamic costsbecause in Israel unlike in the United Statesand many other countries there is no publicrecord of past claims Therefore insurance pro-viders can take full advantage of past recordsonly for their past customers For this reasonnew customers will of course face higher pre-miums than existing ones but the premium in-crease would not be as high as it would havebeen with the old insurance provider This isdue to the presence of ldquoinnocentrdquo new custom-ers who are pooled together with the switchers(Cohen 2003)47 Using 70 percent as a conser-vative upper bound Figure 3 shows that about93 percent of those claims filed by individualswith a low deductible would have also been46 This assumption is consistent with Cohen and Einav

(2003) who find no evidence for behavioral response tochanges in seat belt laws The following anecdotal obser-vation also supports this In an informal survey we con-ducted among our colleagues all of them were aware of adeductible in their auto insurance policy but fewer than 20percent knew its level This does not imply that 80 percentof our colleagues did not pay attention to their deductiblechoice at the time the choice was made It does implyhowever that their driving or care behavior cannot dependon the deductible level

47 New customers may voluntarily report their claimshistory to their new insurance provider Voluntary disclo-sure of past claims is as may be expected not truthful Ourdata suggest an unconditional claim rate of 02453 in oursample population Our data on claims history as voluntar-ily disclosed by the same individuals suggest a claim rate of00604 which is four times lower

774 THE AMERICAN ECONOMIC REVIEW JUNE 2007

filed with a regular deductible While this is notnegligible it applies for only a small fraction ofindividuals For the vast majority of them the99 percent discussed above is a more appropri-ate benchmark Therefore ex post moral hazardis unlikely to play a major role in this settingand one can abstract from the loss distributionand focus on claim rates as we do in thispaper48

Additional Cost of an AccidentmdashOur modelassumes that in the event of an accident theonly incurred costs are those associated with thedeductible payment In practice however other

transaction costs may be incurred such as thetime spent for appraisal of the damage the costsassociated with renting a replacement car for theduration of a repair etc Such costs could bereadily incorporated into the model To illus-trate we assume that these costs are known inadvance and are given by a constant c (whichcould in principle vary with each individual)Since c will not vary with the chosen level ofdeductible it will not affect the value d andwill enter the empirical model only through itseffect on d In particular equation (7) willchange to

(19) r

p

d 1

d c

and everything else will remain the same

48 We would not be as comfortable with this statementfor the choice of high and very high deductibles which areat much higher levels This is one additional reason to focusonly on the choice between low and regular deductibles

FIGURE 3 CLAIM DISTRIBUTIONS

Notes This figure plots kernel densities of the claim amounts estimated separately depending on the deductible choice Forease of comparison we normalize the claim amounts by the level of the regular deductible (ie the normalization is invariantto the deductible choice) and truncate the distribution at ten (the truncated part which includes a fat tail outside of the figureaccounts for about 25 percent of the distribution and is roughly similar for both deductible choices) The thick line presentsthe distribution of the claim amounts for individuals who chose a low deductible while the thin line does the same for thosewho chose a regular deductible Clearly both distributions are truncated from below at the deductible level The figure showsthat the distributions are fairly similar Assuming that the claim amount distribution is the same the area below the thickerline between 06 and 1 is the fraction of claims that would fall between the two deductible levels and therefore (absentdynamic incentives) would be filed only if a low deductible were chosen This area (between the two dotted vertical lines)amounts to 13 percent implying that the potential bias arising from restricting attention to claim rate (and abstracting fromthe claim distribution) is quite limited As we discuss in the text dynamic incentives due to experience rating may increasethe costs of filing a claim shifting the region in which the deductible choice matters to the right an upper bound to these costsis about 70 percent of the regular deductible covering an area (between the two dashed vertical lines) that integrates to morethan 7 percent Note however that these dynamic incentives are a very conservative upper bound they apply to less than 15percent of the individuals and do not account for the exit option which significantly reduces these dynamic costs

775VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

This implies that in principle such costs willhave no effect on the results of the counterfac-tual exercise we present later The costs willhowever affect the interpretation of the esti-mates of risk aversion In particular instead ofthe distribution of r we will now be estimatingthe distribution of r(d c)d so the reportedestimates of the coefficient of absolute riskaversion will be biased upward The magnitudeof the bias depends on the size of this transac-tion cost c compared to the average deductibled If c is relatively small the bias is negligibleIf however c is as big as the (average) deduct-ible level all our reported estimates of the levelof risk aversion should be divided by two (butthe coefficients on observables which are semi-elasticities will not change) The intuitionwould be similar but more involved if c variesacross individuals but not proportionally to d

Data about such transaction costs are ofcourse not available The following back-of-the-envelope exercise may provide some guidanceas to the magnitude of such costs We collecteddata from the Israeli police about the annual num-bers of accidents49 accidents with fatalities andaccidents with severe injuries in Israel for theyears 1996ndash1999 We then divided these numbersby our estimate of the total number of auto insur-ance claims in Israel (161859)50 We obtain that159 percent of the claims involve reported acci-dents 22 percent involve accidents with severeinjuries and 03 percent involve fatal accidentsThus the majority of claims reflect small unre-ported accidents perhaps suggesting that theseadditional costs of a claim are often not too large

Deviations from Expected Utility TheorymdashThroughout the paper we restrict attention toexpected utility maximizers Despite much ev-idence in the literature against some of the pre-dictions of expected utility theory it still seemsto us the most natural benchmark to specify andone that facilitates comparison to previous stud-ies We note that expected utility theory is as-sumed it is not and cannot be tested within our

framework Given our cross-sectional analysiswhich in principle allows flexible forms ofunobserved heterogeneity in risk preferencesthere are no testable restrictions imposed byexpected utility theory We should also note thatmuch (but not all) of the documented evidenceagainst expected utility theory arises with ex-treme risk probabilities which are close to zeroor one Our data (and our estimates) are basedon risk probabilities that are all in the range of010 to 035 Over this range expected utilityseems to perform better Finally it is importantto stress two points First at the conceptuallevel it is straightforward to use an alternativetheory of decisions under uncertainty If con-ditional on objective risk individuals vary in asingle dimension the same conceptual modeland empirical strategy can be applied All oneneeds to do is to specify the parameter overwhich decisions vary and construct an indiffer-ence set in the space of the specified parameterand (objective) risk types similar to the onepresented in Figure 2 Second any alternativemodel of decisions under uncertainty would re-quire us to take an even stronger view regardingthe parameterized objective function For exam-ple prospect theory (Daniel Kahneman andAmos Tversky 1979) would require us to pa-rameterize not only the curvature of individu-alsrsquo utility functions but also their referencepoints for which there is no natural choice inour context Similar issues arise if we try toapply decision weights (Tversky and PeterWakker 1995) or measures of overconfidencewith respect to driving ability

F Implications for Profits and Pricing

We now look at how firm profits vary withalternative pricing schemes This exercise isinteresting for several reasons First althoughwe do not use supply-side information for esti-mation it shows how one may incorporate suchinformation in estimation Second it illustratesthe conceptual trade-off faced by a monopolistthat operates in a market with adverse selectionAlthough the conceptual trade-off betweenhigher demand and worse selection is wellknown and has been extensively analyzed in thetheoretical literature quantifying it is importantto understand its empirical relevance Finallywe argued earlier that unobserved heterogeneityin risk aversion seems more important than un-

49 An accident is counted in this measure if it was re-ported to the police and a police officer arrived at the scene

50 This is estimated by taking the number of activepolicies at the end of our sample (45871) dividing it by ourbest guess for the share of the market the company had atthe time (7 percent) and multiplying it by the estimatedclaim rate in our data as computed in Table 2 (0245)

776 THE AMERICAN ECONOMIC REVIEW JUNE 2007

observed heterogeneity in risk The current ex-ercise shows that this conclusion also translatesto pricing and profits

Throughout this section we hold fixed thedistribution of risk and risk aversion the firmfaces Since we have little information about thedeterminants of overall demand faced by thefirm and hence how restrictive it is to hold thedistribution fixed we make the simplifying as-sumption that individuals make their choicessequentially They first choose the insuranceprovider by observing only the price of theregular deductible Once they decide to buy apolicy from the insurer they choose the deduct-ible level This seems a reasonable approxima-tion as the regular deductible is the one alwaysadvertised and initially quoted while the otheroptions are revealed only once the potentialcustomer and the insurance sales person ldquogetinto detailsrdquo Consistent with this assumptionwe assume that the regular premium and de-ductible are dictated by competitive conditionsand we focus on the choice of the low deduct-ible and its associated premium

From the companyrsquos standpoint each indi-vidual can be represented by a random draw of(i ri) from the conditional (on observables)distribution of risk and risk aversion

(20) ln i

ln ri Nxi

xi

2 r

r r2

When analyzing the optimal menu to offer suchan individual the company is assumed to berisk neutral and to maximize expected profitsSuppose the company offered only the regulardeductible and premium (dh ph) Let the ex-pected profits from this strategy be 0 Givenour assumptions we proceed by analyzing howthe firmrsquos profits are affected by a choice of anadditional low deductible option (dl pl) withdl dh and pl ph It is easy to use a changein variables and analyze the choice of d dh dl and p pl ph Expected profits arenow given by

(21) maxdp

0 Prri rii d pp

d Eiri rii d p

The trade-off in the companyrsquos decision isstraightforward Each new customer who chooses

the low combination pays an additional p up-front but saves d for each accident she is in-volved in This translates into two effects thatenter the companyrsquos decision problem The first issimilar to a standard pricing problem higher(lower) price difference (deductible difference)p (d) leads to a higher markup (on those indi-viduals who select the low deductible) but tolower quantity (or probability of purchase) asfewer individuals elect to choose the low deduct-ible This effect enters the profit function throughD(d p) Pr(ri ri(i d p)) The secondcomposition effect arises because of adverse se-lection As the price of the low deductible in-creases those individuals who still elect to choosethe low combination are ceteris paribus thosewith higher risk This effect enters through (dp) E(iri ri(i d p)) Its magnitude andsign depend on the relative heterogeneity of i andri and on the correlation between them Sinceneither D(d p) nor (d p) has a closed-form solution we analyze this decision problemgraphically where D(d p) and (d p) arenumerically computed using simulations from thejoint distribution

We illustrate our analysis by using the meanindividual in the data whose expected ln(i)and ln(ri) are 154 and 1181 respectively(based on Table 4) Such an individual has tochoose between a regular contract of ( phdh) (3189 1595) (in NIS) and a lowcontract of ( pl dl) (3381 957) ie (pd) (191 638) Below we discuss severaladditional counterfactual cases First we con-sider a case with a negative correlation be-tween risk and risk aversion (with the samemagnitude ie 084) Second we con-sider cases when the company ignores unob-served heterogeneity in one of the dimensionsie it views individuals as a draw from theestimated marginal distribution on one dimen-sion with the other dimension known and fixedat its estimated mean We do this exercise foreach dimension separately

To get intuition for the different effects Figure2 presents the estimated distribution in the spaceof (i ri) A small increase (decrease) in p (d)shifts the indifference setup to the right therebymaking some marginal individuals who were pre-viously just to the right of it switch to choosingthe regular deductible The demand trade-off isjust the comparison between the marginal loss ofthe company from all the marginal individuals

777VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

who no longer buy higher coverage with thehigher profits made from the inframarginal indi-viduals who still elect to choose higher coverageFigure 2 also helps in illustrating the effect ofadverse selection and the importance of the cor-relation coefficient As the menu shifts to the rightthe positive correlation implies that the marginalindividuals have higher risk than the average Thismeans that ldquolosingrdquo them (namely havingthem buy less coverage) is not as costly forthe insurance company as such individualsare on average more adversely selected Anegative correlation for example would havemade these marginal individuals more valuablethereby decreasing the incentive to increase pricesor deductibles from the current levels

Figure 4 presents the effects of pricing onprofits by varying the low-deductible levelkeeping the premium charged for it fixed at theobserved price (of p 191) It implies that the

current low deductible benefit of 638 NIS re-sults in additional annual profits of about 368NIS per customer This is about 037 percent oftotal operating profits per customer which isabout 1000 NIS Note however that after sub-tracting the administrative and claim-handlingcosts associated with each customer and claimthe relative magnitude of this effect will behigher Note also that the estimates imply thatthe current low-deductible level is suboptimalBy setting a smaller low-deductible benefit ofd 355 NIS (ie increasing the current lowdeductible by 283 NIS) additional profits canbe increased to 659 NIS51 Of course the find-

51 There is no reason of course to limit the choice of thecompany to only one additional deductible level Moredegrees of freedom in choosing the pricing menu will leadto higher profits

FIGURE 4 COUNTERFACTUALSmdashPROFITS

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) We plot the additional profits fromoffering a low deductible as a function of the attractiveness of a low deductible d dh dl fixing its price at the observedlevel The thick solid line presents the counterfactual profits as implied by the estimated benchmark model The other threecurves illustrate how the profits change in response to changes in the assumptions when the correlation between riskand risk aversion is negative (thin solid line) when there is no heterogeneity in risk aversion (dot-dashed line) and whenthere is no heterogeneity in risk (dashed line) The maxima (argmax) of the four curves respectively are 659 (355)714 (583) 0 and 704 (500) The dotted vertical line represents the observed level of d (638) which implies that theadditional profits from the observed low deductible are 368 NIS per policy

778 THE AMERICAN ECONOMIC REVIEW JUNE 2007

ing that the current pricing is suboptimal mayalso be due to a limitation of the model

Consistent with the intuition discussed aboveFigure 4 also shows that the incentive to in-crease prices (or lower the deductible) is higherwith positive correlation When the correlationbetween risk and risk aversion is negative theoptimal low deductible is lower It also showsthat ignoring either dimension of unobservedheterogeneity has an important effect on pric-ing decisions However while ignoring het-erogeneity in risk does not change thequalitative pattern by much ignoring hetero-geneity in risk aversion completely changesthe pricing analysis In fact given the esti-mated coefficients and the observed prices ifindividuals vary only in risk offering a lowdeductible does not increase profits due to ad-verse selection

Figure 5 breaks down the effect on profits bypresenting the pricing effect on the demand forlow deductible D(d p) and on the compo-sition effect (d p) The former is simplygenerated by the distribution of certainty equiv-alents implied by the joint distribution of i andri (Landsberger and Meilijson 1999) It is S-shaped due to the assumption of lognormal dis-tribution The shape of the composition effect isdriven by the relative variance of i and ri andby the correlation coefficient As the estimatesimply that most of the variation in certaintyequivalents is driven by variation in ri thestrong positive correlation implies that the com-position effect is monotonically decreasing inthe deductible level As the low-deductible op-tion becomes more favorable more peoplechoose it with the most risky individuals choos-ing it first The effect of the deductible level onthe composition effect is dramatically differentwhen the correlation between risk and risk aver-sion is negative When this is the case theobserved relationship between the deductiblelevel and the composition effect is mostly re-versed This is because the effect of risk aver-sion dominates that of adverse selection due toits higher variance

IV Concluding Remarks

The paper makes two separate contributionsFirst from a methodological standpoint we layout a conceptual framework through which one

can formulate and estimate a demand system forindividually customized contracts The key datarequirements for this approach are contractchoices individual choice sets and ex post riskrealizations Since such data may be available inmany other contexts the methodological frame-work may be useful to uncover structural pa-rameters in such settings As an example onecould consider annuity data and use guaranteeperiod choices and mortality data to identifyheterogeneity in risk (mortality) and in prefer-ences for wealth after death (Einav Finkelsteinand Paul Schrimpf 2006) Similarly one couldconsider loan data and use down-paymentchoices and default data to identify heterogene-ity in risk (default) and in liquidity (Will Ad-ams Einav and Jonathan D Levin 2006)Second from an economic standpoint we pro-vide a new set of estimates for the degree andheterogeneity of (absolute) risk aversion and itsrelationship with risk We discuss these below

While our estimates of risk aversion help topredict other related insurance decisions it isnatural to ask to what extent these parametersare relevant in other contexts This is essentiallyan empirical question which can be answeredonly by estimating risk aversion parameters fora variety of bet sizes and in a variety of con-texts Since isolating risk preferences in manycontexts is hard such exercises are rare leavingus with no definite answer for the scope ofmarkets for which our estimates may be rele-vant On one hand Rabin (2000) and Rabin andThaler (2001) argue that different decisions inlife are taken in different contexts and thereforemay be subject to different parameters in theutility function On the other hand classicaltheory suggests that each individual has onevalue function over her lifetime wealth so allrisky decisions take into account the same valuefunction and are therefore subject to the samerisk preferences Our view is somewhere inbetween We are more comfortable with extrap-olation of our risk aversion estimates to setupsthat are closer to the auto insurance marketcontext in which these parameters are esti-mated To assess ldquoclosenessrdquo it is important toconsider various factors over which contextsmay differ Such factors may include bet size aswell as informational and behavioral effectssuch as default options framing and rarity of theevents Since we use claim data to identify risktype we leave everything else to be interpreted as

779VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

FIGURE 5 COUNTERFACTUALSmdashSELECTION

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) Parallel to Figure 4 we breakdown the effects on profits to the share of consumers who choose a low deductible (bottom panel) and to the expected riskof this group (top panel) This is presented by the thick solid line for the estimates of the benchmark model As in Figure 4 wealso present these effects for three additional cases when the correlation between risk and risk aversion is negative (thin solidline) when there is no heterogeneity in risk aversion (dot-dashed line) and when there is no heterogeneity in risk (dashedline) The dotted vertical lines represent the observed level of d (638) for which the share of low deductible is 16 percentand their expected risk is 0264 This may be compared with the corresponding figures in Table 2A of 178 and 0309respectively Note however that the figure presents estimated quantities for the average individual in the data while Table2A presents the average quantities in the data so one should not expect the numbers to fit perfectly

780 THE AMERICAN ECONOMIC REVIEW JUNE 2007

risk aversion As an example overconfidence willbe captured by a lower level of estimated riskaversion so if overconfidence is more importantin auto insurance than in health insurance whenextrapolated to health insurance individuals maybehave as if they are more risk averse than weestimate them to be

With this caveat in mind let us discuss ourfour main findings First we find large hetero-geneity in risk preferences across individualsThis heterogeneity is important in various con-texts (a) it cautions against using pure cross-sectional variation to test expected utilitytheory (b) it may create strong selection ofparticipants into particular markets this maymake participants in voluntary markets quitedifferent in their risk preferences from those inmarkets in which participation is either manda-tory or driven by other factors and (c) it maymake the marginal individual quite differentfrom the average one Models in macroeconom-ics and finance which often use a representativeindividual framework may not be able to cap-ture and account for such differences

Our second set of findings concerns the wayrisk aversion relates to observable characteris-tics Our finding that women are more riskaverse than men has been documented in othersettings The finding that risk preferences ex-hibit a U-shaped pattern over the life cycle maybe interesting to explore further in other con-texts Other findings suggest that the estimatedcoefficient of risk aversion increases with ob-servables that are related to income and wealthAs we agree with the widely held belief of thedecreasing absolute risk aversion property ourpreferred interpretation for this finding is thatwealth and income may be endogenous gener-ating the estimated cross-sectional relationshipWhile lower risk aversion may be associatedwith higher propensity to become an entrepre-neur and thereby have higher wealth it mayalso be associated with lower propensity to saveor invest in education affecting wealth the otherway Therefore one important message of thisfinding is that accounting for heterogeneity inpreferences may be important as representativeconsumer models may provide misleading in-terpretations for otherwise natural results

The last two sets of findings concern therelationship between risk and preferences Sincerisk is particular to the context in which it ismeasured these findings may be sensitive to the

market context and may change once risk takesother forms Even within the auto insurancemarket it is important how risk is measuredMoreover risk in the auto insurance marketmay be conceptually different from risk in othermarkets In many markets risk is independentacross individuals In auto insurance howevermuch of the risk depends on coordinationamong drivers and therefore may be more re-lated to relative not absolute characteristicsFor this reason our finding of positive correla-tion between risk and risk aversion can coexistwith findings of negative correlation in othercontexts (Israel 2005 Finkelstein and McGarry2006) The positive correlation we find is alsoconsistent with the fact that the bivariate probittest in our data provides evidence for adverseselection (Cohen 2005) while similar reduced-form tests in other contexts do not

Finally we find that unobserved heterogene-ity in risk preferences is more important thanheterogeneity in risk This may be driven by thecasual evidence that insurance companies exertmuch effort and resources in collecting con-sumer data which are informative about riskclassification but not about preferences52 Weillustrate the empirical importance of our find-ings for the analysis of optimal contracts in autoinsurance The presence of more than one di-mension of unobserved heterogeneity may dra-matically change the nature of these contractsTheory is still not fully developed for suchmultidimensional screening problems as it typ-ically requires a small number of types (Lands-berger and Meilijson 1999) restricts the twodimensions to be independent of each other(Jean-Charles Rochet and Lars A Stole 2002)or assumes that the number of instrumentsavailable to the monopolist is not smaller thanthe dimension of unobserved heterogeneity(Steven Matthews and John Moore 1987 Rich-ard Arnott and Joseph E Stiglitz 1988)53 MarkArmstrong (1999) may be the closest theoreticalwork to the framework suggested here It cannot

52 This choice of data collection efforts may be justifiedif it is easier for such firms to price discriminate based onrisk but harder to price discriminate based on preferencesA common belief is that without cost-based (ie risk-based) justification for prices price discrimination may leadto consumer backlash

53 See also Smart (2000) Villeneuve (2003) and Jullienet al (2007) for related theoretical results

781VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

be directly applied however as it uses simpli-fying linearity assumptions which would behard to impose in the current context Our re-sults indicate that many applications can benefitfrom extending the theory to include the more

general case such as the one analyzed hereSuch a theory may also serve as a guide forusing supply-side restrictions in similar con-texts Our counterfactual analysis is a very pre-liminary start in this direction

APPENDIX A DESCRIPTION OF THE GIBBS SAMPLER

In this appendix we describe the setup of the Gibbs sampler that we use to estimate the modelOne of the main advantages of the Gibbs sampler is its ability to allow for data augmentation oflatent variables (Tanner and Wong 1987) In our context this amounts to augmenting the individual-specific risk aversion and risk type namely i rii1

n as additional parametersWe can write the model as follows

(22) ln i xi i

(23) ln ri xi vi

(24) choicei 1 if ri ri ()0 if ri ri ()

(25) claimsi Poissoni ti

(26) i

vi

iid

N00

2 r

r r2

Let

2 r

r r2 X

x1

xn

0

0 x1

xn

y r and ui i

vi

The set of parameters for which we want to have a posterior distribution is given by uii1

n The prior specifies that are independent of uii1n have a conventional

diffuse prior We adopt a hierarchical prior for uii1N

(27) uii 1n

iid

N0

(28) 1 Wishart2a Q

782 THE AMERICAN ECONOMIC REVIEW JUNE 2007

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 23: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

single risk aversion parameter that appliesacross many types of decisions is the right wayto think about consumer behavior To the bestof our knowledge there is no direct empiricalevidence on this issue While we defer much ofthis discussion to the concluding section weverify below that at least over a limited set ofchoices in which we observe the same individ-uals the estimated risk preferences help to ex-plain choices over time and across differentcontexts

Stability across ContextsmdashIdeally onewould like to show that the estimated risk pref-erences are stable across multiple lines of insur-ance and perhaps even across other riskydecisions individuals face Absent data on suchchoices we provide evidence that the estimatedrisk aversion coefficients help in predictingother related risky choices Individuals in ourdata had to make three additional coveragechoices in addition to the main policy choicethat we have analyzed so far They had tochoose whether to purchase coverage for caraudio equipment (bought by 498 percent of thesample of new policyholders) for towing ser-vices (bought by 929 percent) and for wind-shield damage (bought by 954 percent) Thesetypes of coverage are sold by the same companybut are administered by a third party so we donot have information about their realizationTherefore we cannot perform a similar exerciseto the one we perform for the deductible choiceOf course one should not expect these addi-tional coverage choices to be perfectly corre-lated with the deductible choice Even if thesame risk preferences are an important factorfor all of these decisions variation in risk acrosscoverage is also important For example own-ership of an expensive car audio system is likelyto affect the purchase of audio coverage inde-pendent of risk preferences Similarly ownersof old cars may value more coverage for towingservices Risk preferences however should en-ter into all of these coverage choices and there-fore we expect these additional types ofcoverages to be positively correlated with thechoice of a low deductible

We coded these three additional coveragechoices as three dummy variables and use themas additional covariates to verify that they canhelp explain unobserved risk aversion Firstwe add these variables to the probit regression

reported in column 3 of Table 3 They all havea positive statistically and economicallysignificant power in predicting the deductiblechoice42 To verify that these reported correla-tions are not driven by correlation in risks wealso estimate the benchmark model with theseadditional variables as covariates The esti-mated coefficients on the three additional cov-erage choices are positive and two of the threeare significant in both the risk and risk aversionequations they are 0053 (0019) 00002 (0021)and 0057 (0023) in the risk equation and0354 (0064) 0123 (0122) and 0813 (0174)in the risk aversion equation for audio towingand windshield coverage respectively (standarddeviations in parentheses)43 This suggests thatthe estimated risk preferences help in explainingmultiple coverage choices across these relatedcontexts

Stability over TimemdashWe now provide evi-dence that the estimated risk preferences arealso stable over time All the individuals whodecide to renew their policy with the companyafter it expires are at least in principle free tochange their deductible choice However anoverwhelming majority of individuals (morethan 97 percent) do not change their deductiblechoices when they renew Even individuals whoinitially chose the rarely chosen ldquohighrdquo andldquovery highrdquo deductible levels typically do notchange their choices Therefore it is also notsurprising that estimating the model on the firstdeductible choice of individuals and the seconddeductible choice of the same individuals yieldssimilar results as shown at the end of the nextsection Of course changes in deductiblechoices do not necessarily imply changes inrisk preferences Risk may also change overtime and may drive such changes At thesame time while persistence in deductiblechoices over time is consistent with stablerisk preferences it may also be driven bymany other factors For example it is reason-able to think that individuals do not devote

42 They are estimated to increase on average the prob-ability of choosing a low deductible by 0019 (819) 0023(527) and 0041 (758) for audio towing and windshieldcoverage respectively (z stat in parentheses)

43 The full results from this specification are provided in theonline Appendix Other results from this specification aresimilar to those we report for the benchmark specification

767VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

the same amount of thought to their renewaldecision as to their initial choice leading tothe observed persistence This caveat is themain reason why we focus most of the analysisonly on new policies

D Robustness

This section discusses the robustness of themain results Results from all specifications wemention in this section are summarized in Table6 The full results from these specifications arerelegated to the online Appendix

The von NeumannndashMorgenstern Utility Func-tionmdashTo derive the benchmark model we as-sume a negligible third derivative of the vNMutility function This is attractive as it allowsus to summarize risk preferences by a one-dimensional parameter r [u13(w)u(w)]There is however a large literature (eg MilesS Kimball and N Gregory Mankiw 1989) em-phasizing the importance of a (positive) third

derivative of the utility function which leads toprecautionary saving It is therefore importantto address the sensitivity of the results to thisassumption

By allowing a third-order Taylor approxima-tion of equation (4) we obtain

(15) r u13w

uw

p

d 1

d

uw

uw

dh2 dh dl dl

2

6d

which reduces to equation (7) when u(w) 0This illustrates that a positive third derivativeprovides an additional (precautionary) incentiveto insure It also shows that in order to fullydescribe preferences in the presence of a thirdderivative a one-dimensional parameter doesnot suffice one needs to know both r [u13(w)u(w)] and [u(w)u(w)] This makesit less attractive for estimation

TABLE 6mdashROBUSTNESS

Specificationa Sample Obs

Claim risk () Absolute risk aversion (r)

Corr(r ) Mean Median Std dev Mean Median Std dev

Baseline estimatesBenchmark model All new customers 105800 0220 0217 0048 19 103 73 106 0020 0207 0839

The vNM utility functionCARA utility All new customers 105800 0219 0217 0048 87 104 b 98 106 b 0009b 0201 0826

The claim generating processBenchmark model No multiple claims 103260 0182 0180 0040 20 103 28 105 0018 0135 0547Thinner-tail risk distribution All new customers 105800 0205c 0171c 0155c 17 103 19 106 0020 0076 0916

The distribution of risk aversionLower-bound procedure All new customers 105800 mdash mdash mdash 37 104 0 0002 mdash mdash

Incomplete information about riskBenchmark model Experienced drivers 82966 0214 0211 0051 21 103 83 106 0021 0200 0761Benchmark model Inexperienced drivers 22834 0230 0220 0073 30 103 12 107 0032 0186 0572Learning model All new customers 105800 0204 0191 0084 12 103 16 106 0016 0200 0772

Sample SelectionBenchmark model First two years 45739 0244 0235 0066 31 103 26 105 0026 0225 0699Benchmark model Last three years 60061 0203 0201 0043 16 103 34 107 0021 0113 0611Benchmark model Referred by a friend 26434 0213 0205 0065 30 103 84 107 0031 0155 0480Benchmark model Referred by advertising 79366 0219 0216 0051 21 103 76 106 0022 0212 0806Benchmark model Non-stayers 48387 0226 0240 0057 23 103 77 107 0026 0149 0848Benchmark model Stayers 1st choice 57413 0190 0182 0057 29 103 29 105 0024 0152 0463Benchmark model Stayers 2nd choice 57413 0208 0200 0065 30 103 16 105 0026 0211 0637

Note The table presents the key figures from various specifications and subsamples tracing the order they are presented in Section IIID Full results (in the format

of Table 4) from all these specifications are available in the online Appendixa ldquoBenchmark modelrdquo refers to the benchmark specification estimated on various subsamples (the first row replicates the estimates from Table 4) The other

specifications are slightly different and are all described in more detail in the corresponding parts of Section IIIDb The interpretation of r in the CARA model takes a slightly different quantitative meaning when applied to noninfinitesimal lotteries (such as the approximately

$100 stakes we analyze) This is due to the positive third derivative of the CARA utility function compared to the benchmark model in which we assume a small

third derivative Thus these numbers are not fully comparable to the corresponding figures in the other specificationsc The interpretation of in the thinner-tail distribution we estimate is slightly different from the standard Poisson rate which is assumed in the other specifications

Thus these numbers are not fully comparable to the corresponding figures in the other specifications

768 THE AMERICAN ECONOMIC REVIEW JUNE 2007

We can allow for a positive third derivativewithout expanding the dimensionality of pref-erences by imposing a parametric functionalform on the utility function Essentially suchparametric form imposes a relationship between[u13(w)u(w)] and [u(w)u(w)] Two standardforms are those that exhibit CARA and thosethat exhibit constant relative risk aversion(CRRA) CRRA requires us to make additionalassumptions about the relevant (and unob-served) wealth level of each individual makingit less attractive The CARA case is imple-mented below

With CARA utility we substitute u(w) exp(rw) in equation (4) and rearrange toobtain

(16) rp

exprdh exprdl

This equation defines the indifference set Unlikethe benchmark model there is no closed-formrepresentation for ri() a property that makesestimation significantly slower Due to the precau-tionary saving incentive arising from the thirdderivative we can use a lower level of absoluterisk aversion to rationalize the low deductiblechoice given In other words the CARA indif-ference set is flatter in comparison to the bench-mark case depicted in Figure 2 Thus the CARAspecification will generally lead to lower estimatesof the coefficient of absolute risk aversion asshown in Table 6 The mean and dispersion ofestimated absolute risk aversion (but not the me-dian) are smaller by about a factor of two Thegeneral qualitative results however remain thesame As shown in Table 5 the mean individual isquite risk averse with respect to lotteries of $100magnitude the median is close to risk neutral withrespect to such lotteries and heterogeneity in riskaversion is important

The Claim-Generating ProcessmdashIn thebenchmark model we assume that claims aregenerated by a Poisson model This assumptionis both analytically attractive (it is important inderiving equation (4)) and in our view a rea-sonable approximation of the data-generatingprocess for claims The Poisson distribution hasbeen widely used in economics to model acci-dent counts (eg Nancy L Rose 1990 Dionneand Vanasse 1992 Ronald Michener and CarlaTighe 1992) In using this distribution re-

searchers have followed the long history of theuse of Poisson by actuaries and insurance spe-cialists going back to Ladislaus Bortkiewicz(1898)

An important restriction of the Poisson dis-tribution is that its mean and variance are thesame Although some economic studies con-firmed this assumption for particular accidentdata (eg Christopher M Auld et al 2001) it isoften the case that this restriction is falsifiedThe most common deviation from the Poissonrestriction is that of fat tails ie variance that ishigher than the mean This led researchers touse a negative binomial distribution to accom-modate this regularity by introducing a secondparameter which delinks the relationship be-tween the mean and the variance One naturalinterpretation of the fat tails is that of unob-served heterogeneity and the negative binomialdistribution can be viewed as a Gamma mixtureof Poisson processes Consistent with this viewwe assume that the claim-generating processfollows a Poisson process at the individuallevel but allows unobserved heterogeneity inrisk and estimates a lognormal mixture of Pois-son processes which is similar to a negativebinomial The dispersion parameter we esti-mate is a free parameter which is identi-fied by the fatness of the tails of the claimdistribution

While one would like to allow fat tails of theaggregate claim distribution one may criticizethe assumption of a Poisson distribution at theindividual level for having tails that may be ldquotoofatrdquo Recently Jaap H Abbring et al (2003) andMark Israel (2004) provided evidence for neg-ative state dependence in data from auto insur-ance claims similar to the ones we useControlling for heterogeneity across individu-als these papers show that a second claim isless likely than a first This may happen due toexperience rating or perhaps to more carefuldriving and less intense use of a car after anaccident The Poisson distribution assumes thatthe second accident is just as likely so negativestate dependence may suggest thinner tails atthe individual level

To verify that the main results are not sensi-tive to this restriction of the Poisson model weperform two tests First we estimate the bench-mark model on a sample that includes onlyindividuals with one or no claims The model isstill identified using variation in policy durations

769VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

The estimates of the level of risk aversion andits dispersion across individuals remain similarto those of the benchmark model (Table 6) Theestimates of risk and the correlation with riskaversion are significantly lower But this isexpected we selected out of the sample thehigh-risk individuals

As a further test for the sensitivity of theresults to possible negative state dependencewe estimated the model with a different distri-bution at the individual level which gives riseto much thinner tails We could not find anyknown distribution of count variables that givesrise to tails thinner than Poissonrsquos so we madeone up In particular for a given Poisson ratelet pn be the probability of observing n occur-rences The distribution we take to the data issuch that

(17) Prn pn2

yenm0 pm2

Thus this distribution makes the probability ofmultiple claims much lower probably muchmore so than any realistic negative state depen-dence in the data Essentially such a distribu-tion makes the econometrician view individualswith multiple claims as high-risk individualswith much more certainty as it requires muchmore ldquobad luckrdquo for low-risk individuals tohave multiple claims The estimates for thelevel of risk aversion and its dispersion arealmost the same as in the benchmark model(Table 6) The rest of the summary figures aredifferent The difference in the level of risk isnot informative the interpretation of isslightly different given the change in the un-derlying distribution The dispersion in ismuch higher This is a direct result of the thin-tail assumption The dispersion of is identifiedby the tails of the claim distribution at theaggregate level With thinner tails imposed onthe individual distribution more heterogeneityis needed to match the observed tails at theaggregate Finally the correlation coefficient inthis specification changes signs and is now neg-ative and close to 1 This is consistent withour discussion in Section IIC which emphasizesthat the identification of the correlation coeffi-cient is closely tied to the structural assump-tions Once these are changed the estimatedcorrelation coefficient would change too Most

importantly however the mean and dispersionof risk aversion are stable

The Distribution of Absolute Risk Aver-sionmdashIn the benchmark model we assume thatthe coefficient of absolute risk aversion is lognor-mally distributed across individuals Since onlyfew previous studies focused on heterogeneity inrisk preferences there is not much existing evi-dence regarding the distribution of risk prefer-ences The only evidence we are aware of is theexperimental results presented by Steffen Andersenet al (2005) which show a skewed distributionwith a fat right tail which is qualitatively consis-tent with the lognormal distribution we assume

An additional advantage of the normality as-sumption is computational It provides a closed-form conditional distribution allowing us to useonly univariate (rather than bivariate) draws inthe estimation procedure significantly reducingthe computational burden One may be con-cerned about the sensitivity of the results to thisdistributional assumption For example it maydrive the result that the median level of riskaversion is much lower than the mean

Incorporating alternative distributional as-sumptions significantly complicates and slowsthe estimation procedure44 As an alternativewe develop a procedure that we believe pro-vides some guidance as to the sensitivity of theresults to this distributional assumption Thedisadvantage of the procedure is that it cannotaccount for adverse selection Since we foundthat adverse selection is not that important thisexercise is informative The exercise conveysthat the main qualitative results are not drivenby the (fat) tails of the lognormal distributionwe impose Rather they are driven by the highfraction of individuals who chose a low deduct-ible despite being of low risk The model im-plies that such individuals must have a fairlyhigh level of risk aversion

The exercise uses a Gibbs sampler to estimatethe lognormal distribution of i given the co-variates and the observed number of claims Ineach iteration of this Gibbs sampler conditional

44 A simple discrete-type distribution of risk aversion isnot well identified with the given data This is due to thenature of the exercise The distribution is identified bythe binary deductible choice and by the ldquolocalrdquo variation inthe pricing menu This variation is not enough to pin downthe discrete types accurately

770 THE AMERICAN ECONOMIC REVIEW JUNE 2007

on the data and the most recent draws for theindividual irsquos we compute lower bounds forthe level and dispersion of risk aversion (inaddition to generating the draws for ri from theconditional lognormal distribution) To deter-mine the lower bound for the level of risk aver-sion we compute

(18) r i i 0 if choicei 0

max0 pi

idi 113d i

if choicei 1

Namely we make ri as low as possible giventhe assumptions of the model We assume thatindividuals who chose a regular deductible arerisk neutral while individuals who chose a lowdeductible are just indifferent between the twodeductible levels (unless i is high enough inwhich case they are also assumed to be riskneutral) We then compute the average over therirsquos To compute the lower bound for the dis-persion we search for values of rirsquos that areconsistent with the assumptions of the modeland the observed data and that minimize thevariance of ri This turns out to be a simplesearch procedure that is linear in the number ofindividuals

The lower bound of the level of risk aversionthat we obtain from this exercise is 368 104

NIS1 (with standard deviation of 638 106)This level is five times lower than the analogouspoint estimate reported in Table 4 This trans-lates into indifference about participating in a50ndash50 lottery of gaining $100 and losing $885Thus it still suggests a significant level of riskaversion for lotteries of $100 magnitude Simi-larly the conversion to relative risk aversion asin Table 5 implies a relative risk aversion co-efficient of 1883 The result for the lowerbound of the dispersion is 189 103 (withstandard deviation of 816 105) which is tentimes lower than the benchmark estimate Thusthe coefficient of variation for absolute riskaversion declines only by two suggesting ahigh degree of heterogeneity Finally oneshould note that these bounds involve extremeassumptions and are computed separately andtherefore cannot be binding at the same timeThus the ldquocorrectrdquo estimates for both quantities

are likely to be higher closer to the results wereport for the benchmark model

Incomplete Information about RiskmdashThroughout we assume that individuals haveperfect information about their individual-specific objective risk type i This is a strongerassumption than we need Because expectedutility is linear in probabilities it suffices thatindividualsrsquo expected risk rate is equal to theirobjective risk rate ie i E(iIi) i wherei is a random variable representing individualirsquos perceived risk rate and Ii is individual irsquosinformation at the time of the coverage choiceNamely individuals could be uncertain abouttheir risk type but their point estimate has to becorrect

There are several channels through whichincomplete information may operate Let usconsider two such cases First suppose thatindividuals are correct but only on average iethat i i i where E(i) 0 The intuitionfor this case is similar to an ldquoerrors in variablesrdquomodel and in principle will result in an evenless important role for adverse selection Giventhat we find a relatively small effect of adverseselection this bias will not change this conclu-sion This may be even more pronounced ifcorr(i i) 0 which reflects a reasonableassumption of ldquoreversion to the meanrdquo ie thatan individualrsquos estimate of his risk type is someweighted average between his true risk type andthe average risk type of individuals who aresimilar (on observables) to him The conclusionmay go in the other way only if the mistakes goin the other direction according to which indi-viduals who are riskier than average believe thatthey are even more risky than they truly areThis we believe is less plausible

As a test for the sensitivity of the results tothe complete information assumption we per-form two exercises The first exercise estimatesthe benchmark model separately for experi-enced drivers and inexperienced drivers wherewe define an individual to be an experienceddriver if he has ten years or more driving expe-rience This is a conservative definition com-pared to those used in the literature (Chiapporiand Salanie 2000 Cohen 2005) The underlyingassumption consistent with evidence providedin Cohen (2005) is that drivers learn about theirown risk types as they gain more driving expe-rience but that most of this learning occurs

771VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

within the first ten years of driving Thus ourcomplete information model is a more appeal-ing description of experienced drivers Theresults for experienced drivers are almost iden-tical to the results from the benchmark specifi-cation (Table 6) suggesting that the mainresults are not driven by the fraction of individ-uals who are less likely to know their risk typesThe results for inexperienced drivers show asimilar pattern and magnitude Consistent withthe learning hypothesis however they show alarger dispersion in risk preferences which maybe due to incomplete information and there-fore to more random choices of deductiblelevels

The second exercise we perform is a morestructural version of the learning story whichallows even experienced drivers to have incom-plete information regarding their risk types Wedo so by estimating a different specification ofthe model that assumes that individuals areBayesian and update their information abouttheir own (stable) risk types over time usingonly information about the number of claimsthey make each year While we do not observecomplete claims histories of individuals wecan simulate such histories and integrate overthese simulations Thus individualsrsquo informa-tion would be related to their true type andwould be more precise with longer driving his-tories The full modeling and estimation detailsof this specification are provided in the Appen-dix We view this model as an extreme versionof incomplete information as there are manyother sources through which individuals maylearn about their own types and thereby havebetter information about their types than whatwe estimate them to have While the resultsimply that the levels of risk aversion and heter-ogeneity are lower than the benchmark esti-mates (Table 6) the order of magnitude andqualitative pattern are quite similar suggestingthat the main qualitative findings are robust tothe information structure

Sample SelectionmdashThere are two differentways to think about the contract selection pro-cess One possibility is that individuals firstselect an insurer based on advertisement wordof mouth or access to an agent Then individ-uals select the insurance contract from amongseveral contracts the selected insurer offersThis selection process is consistent with the way

the company executives view their business45

Another possibility is that individuals first col-lect information about all available insurancecontracts and then choose their most preferredone According to the industryrsquos conventionalwisdom individuals do not shop much acrossinsurers and therefore this selection processseems less important

The unique characteristics of direct insurersmay attract individuals who are more likely toexperiment with new ways of doing businessand may therefore be less risk averse than thegeneral population In Table 7 we compare thedemographics of our sample of policyholderswith those of the general Israeli populationThis comparison reflects a similar intuitioncompared with the general population our av-erage policyholder is slightly younger moreeducated more likely to be male and less likelyto be married or an immigrant This direction ofselection may also apply to unobserved riskpreferences thereby making our policyholderson average less risk averse than a representa-tive individual This suggests that the level ofrisk aversion that we find may be viewed as alower bound on the level of risk aversion in thegeneral Israeli population

One potential way to model sample selectionis to allow for an additional outside option to beselected For the vast majority of individuals weobserve the outside option is to purchase sim-ilar insurance from competing insurance agen-cies Unfortunately data on the structure ofcompeting contracts their prices and the waythey vary with individual characteristics are un-available This makes us uncomfortable to try tomodel sample selection as results from anysuch model will be driven by our assumptionsrather than by meaningful variation in the dataThe results are still meaningful for two reasonsFirst this is a large population accounting forabout 7 percent of all drivers in Israel Secondto the extent that our estimates suggest higherlevels of risk aversion than previously esti-mated and that the sample selection is likely to

45 As discussed in Section I the company was the firstdirect insurance provider and it offered significantly lowerpremiums than those offered by competitors due to signif-icant cost advantage In Section I we also discuss theliterature which emphasizes that choice of direct insurers isdriven primarily by nonmonetary ldquoamenitiesrdquo

772 THE AMERICAN ECONOMIC REVIEW JUNE 2007

bias these estimates downward the results arestill informative

To assess the magnitude of sample selectionin driving the main results we perform severaltests First we estimate the benchmark modelseparately for the first two years of the compa-nyrsquos operation during which the company wasthe only company to sell insurance directly andfor the remaining three years after additionaldirect insurers entered the market Second weestimate the benchmark model separately forindividuals who were referred to the companyby word of mouth and for those who heardabout the company through advertisement (pri-marily on television) The latter may havesearched more and might be more price sensi-tive Third we estimate the benchmark modelseparately for individuals who renewed theirpolicy with the company and for those who didnot renew It seems likely that individuals whodid not renew are less selected as most of themswitch back to regular insurers who insure themajority of the population Fourth we estimatethe model for the second deductible choicemade by those individuals who renew It couldbe argued that switching to other companies is

costly so outside options are not as important indriving deductible choices for those who renewAll these results are summarized in Table 6 andshow some important differences between thetwo groups within each pair of subsamples prob-ably reflecting selection For all groups howeverthe qualitative pattern of the results and the orderof magnitude and dispersion of risk aversion aresimilar to those of the full sample This is sugges-tive that correcting for sample selection is unlikelyto change the qualitative results

E Caveats

Moral HazardmdashThroughout our analysiswe abstract from moral hazard ie we assumethat i can vary across individuals but is invari-ant to the coverage choice There are two typesof moral hazard that may play a role in thiscontext First individuals with less coveragemay take greater precaution and drive morecarefully thereby reducing their claim risk rateSecond conditional on a claim event peoplewith higher deductibles are less likely to file aclaim there exists a range of claims for which

TABLE 7mdashREPRESENTATIVENESS

Variable Sampleb Populationc Car ownersd

Agea 4114 (1237) 4255 (1801) 4511 (1413)Female 0316 0518 0367Family Single 0143 0233 0067

Married 0780 0651 0838Divorced 0057 0043 0043Widower 0020 0074 0052

Education Elementary 0029 0329 0266High school 0433 0384 0334Technical 0100 0131 0165College 0438 0155 0234

Emigrant 0335 0445 0447

Obs 105800 723615 255435

a For the age variable the only continuous variable in the table we provide both the mean and the standard deviation (inparentheses)

b The figures are derived from Table 1 The family and education variables are renormalized so they add up to one weignore those individuals for whom we do not have family status or education level This is particularly relevant for theeducation variable which is not available for about half of the sample it seems likely that unreported education levels arenot random but associated with lower levels of education This may help in explaining at least some of the gap in educationlevels across the columns

c This column is based on a random sample of the Israeli population as of 1995 We use only adult population ieindividuals who are 18 years of age or older

d This column is based on a subsample of the population sample The data provide information about car ownership onlyat the household level not at the individual level Thus we define an individual as a car owner if (a) the household owns atleast one car and the individual is the head of the household or (b) the household owns at least two cars and the individualis the spouse of the head of the household

773VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

filing is profitable only under a low deductible(the literature often refers to this second effectas ldquoex post moral hazardrdquo)

To the extent that moral hazard exists abstract-ing from it will likely bias our estimates of riskaversion downward To see this note that adjust-ing behavior will help individuals to self-insureagainst uninsured costs Similarly ex post moralhazard would reduce the value of a low deduct-ible as in the event of a claim the gain from alower deductible would sometimes be less thanthe difference between the two deductible levelsBoth these effects will make a low deductible lessattractive requiring individuals to be even morerisk averse than we estimate in order to purchasemore coverage Below we discuss why in ourview abstracting from moral hazard is a reason-able approximation in this setting

It seems reasonable to conjecture that ceterisparibus insured individuals will drive less care-fully than uninsured ones It may also seemreasonable that the existence of a deductiblemay make individuals more careful about smalldamages to their car However when all choicesinclude a deductible and deductibles are similarin their magnitude it seems less likely thatdriving or care behavior will be affected46 Fi-nally to separately identify moral hazard onewould need another dimension of the data overwhich risk types remain fixed but coveragechoices vary exogenously (Israel 2004 Jaap HAbbring et al forthcoming)

We rely on the data to justify why we abstractfrom the second potential effect that of ex postmoral hazard Figure 3 presents data on theclaim amounts and shows that about 99 percentof the claims filed by policyholders with low-deductible policies were for amounts greaterthan the higher deductible level If the distribu-tion of amounts of potential claims does notvary with the deductible choice and if individ-uals file a claim for any loss that exceeds their

deductible level this suggests that 99 percent ofthe claims would have been filed under eitherdeductible choice making the assumption toabstract from moral hazard not very restrictive

Individuals however may choose not to filea claim even when the claim amount exceedsthe deductible level This may happen due toexperience rating which increases future insur-ance premiums These dynamic effects do notdepend on the deductible level at the time of theclaim so they simply enter in an additive wayUsing our data on individuals who renew theirpolicies with the company we can assess howbig the dynamic effects are These data showthat the price effect of a claim lasts for threeyears and is highest when an individual files hissecond claim within a year In such a case hewould face about a 20 percent increase in hisinsurance premium in the subsequent year 10percent in the year after and 5 percent in thethird year after the claim The regular premiumis about twice the regular deductible amount soan upper bound for the dynamic costs is 70percent of the regular deductible In most casesthe actual dynamic costs are much lower thanthis upper bound (the dynamic costs of say thefirst claim within a year are close to zero) Inaddition an individual can opt out of the con-tract and switch to a different insurance pro-vider This is likely to reduce his dynamic costsbecause in Israel unlike in the United Statesand many other countries there is no publicrecord of past claims Therefore insurance pro-viders can take full advantage of past recordsonly for their past customers For this reasonnew customers will of course face higher pre-miums than existing ones but the premium in-crease would not be as high as it would havebeen with the old insurance provider This isdue to the presence of ldquoinnocentrdquo new custom-ers who are pooled together with the switchers(Cohen 2003)47 Using 70 percent as a conser-vative upper bound Figure 3 shows that about93 percent of those claims filed by individualswith a low deductible would have also been46 This assumption is consistent with Cohen and Einav

(2003) who find no evidence for behavioral response tochanges in seat belt laws The following anecdotal obser-vation also supports this In an informal survey we con-ducted among our colleagues all of them were aware of adeductible in their auto insurance policy but fewer than 20percent knew its level This does not imply that 80 percentof our colleagues did not pay attention to their deductiblechoice at the time the choice was made It does implyhowever that their driving or care behavior cannot dependon the deductible level

47 New customers may voluntarily report their claimshistory to their new insurance provider Voluntary disclo-sure of past claims is as may be expected not truthful Ourdata suggest an unconditional claim rate of 02453 in oursample population Our data on claims history as voluntar-ily disclosed by the same individuals suggest a claim rate of00604 which is four times lower

774 THE AMERICAN ECONOMIC REVIEW JUNE 2007

filed with a regular deductible While this is notnegligible it applies for only a small fraction ofindividuals For the vast majority of them the99 percent discussed above is a more appropri-ate benchmark Therefore ex post moral hazardis unlikely to play a major role in this settingand one can abstract from the loss distributionand focus on claim rates as we do in thispaper48

Additional Cost of an AccidentmdashOur modelassumes that in the event of an accident theonly incurred costs are those associated with thedeductible payment In practice however other

transaction costs may be incurred such as thetime spent for appraisal of the damage the costsassociated with renting a replacement car for theduration of a repair etc Such costs could bereadily incorporated into the model To illus-trate we assume that these costs are known inadvance and are given by a constant c (whichcould in principle vary with each individual)Since c will not vary with the chosen level ofdeductible it will not affect the value d andwill enter the empirical model only through itseffect on d In particular equation (7) willchange to

(19) r

p

d 1

d c

and everything else will remain the same

48 We would not be as comfortable with this statementfor the choice of high and very high deductibles which areat much higher levels This is one additional reason to focusonly on the choice between low and regular deductibles

FIGURE 3 CLAIM DISTRIBUTIONS

Notes This figure plots kernel densities of the claim amounts estimated separately depending on the deductible choice Forease of comparison we normalize the claim amounts by the level of the regular deductible (ie the normalization is invariantto the deductible choice) and truncate the distribution at ten (the truncated part which includes a fat tail outside of the figureaccounts for about 25 percent of the distribution and is roughly similar for both deductible choices) The thick line presentsthe distribution of the claim amounts for individuals who chose a low deductible while the thin line does the same for thosewho chose a regular deductible Clearly both distributions are truncated from below at the deductible level The figure showsthat the distributions are fairly similar Assuming that the claim amount distribution is the same the area below the thickerline between 06 and 1 is the fraction of claims that would fall between the two deductible levels and therefore (absentdynamic incentives) would be filed only if a low deductible were chosen This area (between the two dotted vertical lines)amounts to 13 percent implying that the potential bias arising from restricting attention to claim rate (and abstracting fromthe claim distribution) is quite limited As we discuss in the text dynamic incentives due to experience rating may increasethe costs of filing a claim shifting the region in which the deductible choice matters to the right an upper bound to these costsis about 70 percent of the regular deductible covering an area (between the two dashed vertical lines) that integrates to morethan 7 percent Note however that these dynamic incentives are a very conservative upper bound they apply to less than 15percent of the individuals and do not account for the exit option which significantly reduces these dynamic costs

775VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

This implies that in principle such costs willhave no effect on the results of the counterfac-tual exercise we present later The costs willhowever affect the interpretation of the esti-mates of risk aversion In particular instead ofthe distribution of r we will now be estimatingthe distribution of r(d c)d so the reportedestimates of the coefficient of absolute riskaversion will be biased upward The magnitudeof the bias depends on the size of this transac-tion cost c compared to the average deductibled If c is relatively small the bias is negligibleIf however c is as big as the (average) deduct-ible level all our reported estimates of the levelof risk aversion should be divided by two (butthe coefficients on observables which are semi-elasticities will not change) The intuitionwould be similar but more involved if c variesacross individuals but not proportionally to d

Data about such transaction costs are ofcourse not available The following back-of-the-envelope exercise may provide some guidanceas to the magnitude of such costs We collecteddata from the Israeli police about the annual num-bers of accidents49 accidents with fatalities andaccidents with severe injuries in Israel for theyears 1996ndash1999 We then divided these numbersby our estimate of the total number of auto insur-ance claims in Israel (161859)50 We obtain that159 percent of the claims involve reported acci-dents 22 percent involve accidents with severeinjuries and 03 percent involve fatal accidentsThus the majority of claims reflect small unre-ported accidents perhaps suggesting that theseadditional costs of a claim are often not too large

Deviations from Expected Utility TheorymdashThroughout the paper we restrict attention toexpected utility maximizers Despite much ev-idence in the literature against some of the pre-dictions of expected utility theory it still seemsto us the most natural benchmark to specify andone that facilitates comparison to previous stud-ies We note that expected utility theory is as-sumed it is not and cannot be tested within our

framework Given our cross-sectional analysiswhich in principle allows flexible forms ofunobserved heterogeneity in risk preferencesthere are no testable restrictions imposed byexpected utility theory We should also note thatmuch (but not all) of the documented evidenceagainst expected utility theory arises with ex-treme risk probabilities which are close to zeroor one Our data (and our estimates) are basedon risk probabilities that are all in the range of010 to 035 Over this range expected utilityseems to perform better Finally it is importantto stress two points First at the conceptuallevel it is straightforward to use an alternativetheory of decisions under uncertainty If con-ditional on objective risk individuals vary in asingle dimension the same conceptual modeland empirical strategy can be applied All oneneeds to do is to specify the parameter overwhich decisions vary and construct an indiffer-ence set in the space of the specified parameterand (objective) risk types similar to the onepresented in Figure 2 Second any alternativemodel of decisions under uncertainty would re-quire us to take an even stronger view regardingthe parameterized objective function For exam-ple prospect theory (Daniel Kahneman andAmos Tversky 1979) would require us to pa-rameterize not only the curvature of individu-alsrsquo utility functions but also their referencepoints for which there is no natural choice inour context Similar issues arise if we try toapply decision weights (Tversky and PeterWakker 1995) or measures of overconfidencewith respect to driving ability

F Implications for Profits and Pricing

We now look at how firm profits vary withalternative pricing schemes This exercise isinteresting for several reasons First althoughwe do not use supply-side information for esti-mation it shows how one may incorporate suchinformation in estimation Second it illustratesthe conceptual trade-off faced by a monopolistthat operates in a market with adverse selectionAlthough the conceptual trade-off betweenhigher demand and worse selection is wellknown and has been extensively analyzed in thetheoretical literature quantifying it is importantto understand its empirical relevance Finallywe argued earlier that unobserved heterogeneityin risk aversion seems more important than un-

49 An accident is counted in this measure if it was re-ported to the police and a police officer arrived at the scene

50 This is estimated by taking the number of activepolicies at the end of our sample (45871) dividing it by ourbest guess for the share of the market the company had atthe time (7 percent) and multiplying it by the estimatedclaim rate in our data as computed in Table 2 (0245)

776 THE AMERICAN ECONOMIC REVIEW JUNE 2007

observed heterogeneity in risk The current ex-ercise shows that this conclusion also translatesto pricing and profits

Throughout this section we hold fixed thedistribution of risk and risk aversion the firmfaces Since we have little information about thedeterminants of overall demand faced by thefirm and hence how restrictive it is to hold thedistribution fixed we make the simplifying as-sumption that individuals make their choicessequentially They first choose the insuranceprovider by observing only the price of theregular deductible Once they decide to buy apolicy from the insurer they choose the deduct-ible level This seems a reasonable approxima-tion as the regular deductible is the one alwaysadvertised and initially quoted while the otheroptions are revealed only once the potentialcustomer and the insurance sales person ldquogetinto detailsrdquo Consistent with this assumptionwe assume that the regular premium and de-ductible are dictated by competitive conditionsand we focus on the choice of the low deduct-ible and its associated premium

From the companyrsquos standpoint each indi-vidual can be represented by a random draw of(i ri) from the conditional (on observables)distribution of risk and risk aversion

(20) ln i

ln ri Nxi

xi

2 r

r r2

When analyzing the optimal menu to offer suchan individual the company is assumed to berisk neutral and to maximize expected profitsSuppose the company offered only the regulardeductible and premium (dh ph) Let the ex-pected profits from this strategy be 0 Givenour assumptions we proceed by analyzing howthe firmrsquos profits are affected by a choice of anadditional low deductible option (dl pl) withdl dh and pl ph It is easy to use a changein variables and analyze the choice of d dh dl and p pl ph Expected profits arenow given by

(21) maxdp

0 Prri rii d pp

d Eiri rii d p

The trade-off in the companyrsquos decision isstraightforward Each new customer who chooses

the low combination pays an additional p up-front but saves d for each accident she is in-volved in This translates into two effects thatenter the companyrsquos decision problem The first issimilar to a standard pricing problem higher(lower) price difference (deductible difference)p (d) leads to a higher markup (on those indi-viduals who select the low deductible) but tolower quantity (or probability of purchase) asfewer individuals elect to choose the low deduct-ible This effect enters the profit function throughD(d p) Pr(ri ri(i d p)) The secondcomposition effect arises because of adverse se-lection As the price of the low deductible in-creases those individuals who still elect to choosethe low combination are ceteris paribus thosewith higher risk This effect enters through (dp) E(iri ri(i d p)) Its magnitude andsign depend on the relative heterogeneity of i andri and on the correlation between them Sinceneither D(d p) nor (d p) has a closed-form solution we analyze this decision problemgraphically where D(d p) and (d p) arenumerically computed using simulations from thejoint distribution

We illustrate our analysis by using the meanindividual in the data whose expected ln(i)and ln(ri) are 154 and 1181 respectively(based on Table 4) Such an individual has tochoose between a regular contract of ( phdh) (3189 1595) (in NIS) and a lowcontract of ( pl dl) (3381 957) ie (pd) (191 638) Below we discuss severaladditional counterfactual cases First we con-sider a case with a negative correlation be-tween risk and risk aversion (with the samemagnitude ie 084) Second we con-sider cases when the company ignores unob-served heterogeneity in one of the dimensionsie it views individuals as a draw from theestimated marginal distribution on one dimen-sion with the other dimension known and fixedat its estimated mean We do this exercise foreach dimension separately

To get intuition for the different effects Figure2 presents the estimated distribution in the spaceof (i ri) A small increase (decrease) in p (d)shifts the indifference setup to the right therebymaking some marginal individuals who were pre-viously just to the right of it switch to choosingthe regular deductible The demand trade-off isjust the comparison between the marginal loss ofthe company from all the marginal individuals

777VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

who no longer buy higher coverage with thehigher profits made from the inframarginal indi-viduals who still elect to choose higher coverageFigure 2 also helps in illustrating the effect ofadverse selection and the importance of the cor-relation coefficient As the menu shifts to the rightthe positive correlation implies that the marginalindividuals have higher risk than the average Thismeans that ldquolosingrdquo them (namely havingthem buy less coverage) is not as costly forthe insurance company as such individualsare on average more adversely selected Anegative correlation for example would havemade these marginal individuals more valuablethereby decreasing the incentive to increase pricesor deductibles from the current levels

Figure 4 presents the effects of pricing onprofits by varying the low-deductible levelkeeping the premium charged for it fixed at theobserved price (of p 191) It implies that the

current low deductible benefit of 638 NIS re-sults in additional annual profits of about 368NIS per customer This is about 037 percent oftotal operating profits per customer which isabout 1000 NIS Note however that after sub-tracting the administrative and claim-handlingcosts associated with each customer and claimthe relative magnitude of this effect will behigher Note also that the estimates imply thatthe current low-deductible level is suboptimalBy setting a smaller low-deductible benefit ofd 355 NIS (ie increasing the current lowdeductible by 283 NIS) additional profits canbe increased to 659 NIS51 Of course the find-

51 There is no reason of course to limit the choice of thecompany to only one additional deductible level Moredegrees of freedom in choosing the pricing menu will leadto higher profits

FIGURE 4 COUNTERFACTUALSmdashPROFITS

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) We plot the additional profits fromoffering a low deductible as a function of the attractiveness of a low deductible d dh dl fixing its price at the observedlevel The thick solid line presents the counterfactual profits as implied by the estimated benchmark model The other threecurves illustrate how the profits change in response to changes in the assumptions when the correlation between riskand risk aversion is negative (thin solid line) when there is no heterogeneity in risk aversion (dot-dashed line) and whenthere is no heterogeneity in risk (dashed line) The maxima (argmax) of the four curves respectively are 659 (355)714 (583) 0 and 704 (500) The dotted vertical line represents the observed level of d (638) which implies that theadditional profits from the observed low deductible are 368 NIS per policy

778 THE AMERICAN ECONOMIC REVIEW JUNE 2007

ing that the current pricing is suboptimal mayalso be due to a limitation of the model

Consistent with the intuition discussed aboveFigure 4 also shows that the incentive to in-crease prices (or lower the deductible) is higherwith positive correlation When the correlationbetween risk and risk aversion is negative theoptimal low deductible is lower It also showsthat ignoring either dimension of unobservedheterogeneity has an important effect on pric-ing decisions However while ignoring het-erogeneity in risk does not change thequalitative pattern by much ignoring hetero-geneity in risk aversion completely changesthe pricing analysis In fact given the esti-mated coefficients and the observed prices ifindividuals vary only in risk offering a lowdeductible does not increase profits due to ad-verse selection

Figure 5 breaks down the effect on profits bypresenting the pricing effect on the demand forlow deductible D(d p) and on the compo-sition effect (d p) The former is simplygenerated by the distribution of certainty equiv-alents implied by the joint distribution of i andri (Landsberger and Meilijson 1999) It is S-shaped due to the assumption of lognormal dis-tribution The shape of the composition effect isdriven by the relative variance of i and ri andby the correlation coefficient As the estimatesimply that most of the variation in certaintyequivalents is driven by variation in ri thestrong positive correlation implies that the com-position effect is monotonically decreasing inthe deductible level As the low-deductible op-tion becomes more favorable more peoplechoose it with the most risky individuals choos-ing it first The effect of the deductible level onthe composition effect is dramatically differentwhen the correlation between risk and risk aver-sion is negative When this is the case theobserved relationship between the deductiblelevel and the composition effect is mostly re-versed This is because the effect of risk aver-sion dominates that of adverse selection due toits higher variance

IV Concluding Remarks

The paper makes two separate contributionsFirst from a methodological standpoint we layout a conceptual framework through which one

can formulate and estimate a demand system forindividually customized contracts The key datarequirements for this approach are contractchoices individual choice sets and ex post riskrealizations Since such data may be available inmany other contexts the methodological frame-work may be useful to uncover structural pa-rameters in such settings As an example onecould consider annuity data and use guaranteeperiod choices and mortality data to identifyheterogeneity in risk (mortality) and in prefer-ences for wealth after death (Einav Finkelsteinand Paul Schrimpf 2006) Similarly one couldconsider loan data and use down-paymentchoices and default data to identify heterogene-ity in risk (default) and in liquidity (Will Ad-ams Einav and Jonathan D Levin 2006)Second from an economic standpoint we pro-vide a new set of estimates for the degree andheterogeneity of (absolute) risk aversion and itsrelationship with risk We discuss these below

While our estimates of risk aversion help topredict other related insurance decisions it isnatural to ask to what extent these parametersare relevant in other contexts This is essentiallyan empirical question which can be answeredonly by estimating risk aversion parameters fora variety of bet sizes and in a variety of con-texts Since isolating risk preferences in manycontexts is hard such exercises are rare leavingus with no definite answer for the scope ofmarkets for which our estimates may be rele-vant On one hand Rabin (2000) and Rabin andThaler (2001) argue that different decisions inlife are taken in different contexts and thereforemay be subject to different parameters in theutility function On the other hand classicaltheory suggests that each individual has onevalue function over her lifetime wealth so allrisky decisions take into account the same valuefunction and are therefore subject to the samerisk preferences Our view is somewhere inbetween We are more comfortable with extrap-olation of our risk aversion estimates to setupsthat are closer to the auto insurance marketcontext in which these parameters are esti-mated To assess ldquoclosenessrdquo it is important toconsider various factors over which contextsmay differ Such factors may include bet size aswell as informational and behavioral effectssuch as default options framing and rarity of theevents Since we use claim data to identify risktype we leave everything else to be interpreted as

779VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

FIGURE 5 COUNTERFACTUALSmdashSELECTION

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) Parallel to Figure 4 we breakdown the effects on profits to the share of consumers who choose a low deductible (bottom panel) and to the expected riskof this group (top panel) This is presented by the thick solid line for the estimates of the benchmark model As in Figure 4 wealso present these effects for three additional cases when the correlation between risk and risk aversion is negative (thin solidline) when there is no heterogeneity in risk aversion (dot-dashed line) and when there is no heterogeneity in risk (dashedline) The dotted vertical lines represent the observed level of d (638) for which the share of low deductible is 16 percentand their expected risk is 0264 This may be compared with the corresponding figures in Table 2A of 178 and 0309respectively Note however that the figure presents estimated quantities for the average individual in the data while Table2A presents the average quantities in the data so one should not expect the numbers to fit perfectly

780 THE AMERICAN ECONOMIC REVIEW JUNE 2007

risk aversion As an example overconfidence willbe captured by a lower level of estimated riskaversion so if overconfidence is more importantin auto insurance than in health insurance whenextrapolated to health insurance individuals maybehave as if they are more risk averse than weestimate them to be

With this caveat in mind let us discuss ourfour main findings First we find large hetero-geneity in risk preferences across individualsThis heterogeneity is important in various con-texts (a) it cautions against using pure cross-sectional variation to test expected utilitytheory (b) it may create strong selection ofparticipants into particular markets this maymake participants in voluntary markets quitedifferent in their risk preferences from those inmarkets in which participation is either manda-tory or driven by other factors and (c) it maymake the marginal individual quite differentfrom the average one Models in macroeconom-ics and finance which often use a representativeindividual framework may not be able to cap-ture and account for such differences

Our second set of findings concerns the wayrisk aversion relates to observable characteris-tics Our finding that women are more riskaverse than men has been documented in othersettings The finding that risk preferences ex-hibit a U-shaped pattern over the life cycle maybe interesting to explore further in other con-texts Other findings suggest that the estimatedcoefficient of risk aversion increases with ob-servables that are related to income and wealthAs we agree with the widely held belief of thedecreasing absolute risk aversion property ourpreferred interpretation for this finding is thatwealth and income may be endogenous gener-ating the estimated cross-sectional relationshipWhile lower risk aversion may be associatedwith higher propensity to become an entrepre-neur and thereby have higher wealth it mayalso be associated with lower propensity to saveor invest in education affecting wealth the otherway Therefore one important message of thisfinding is that accounting for heterogeneity inpreferences may be important as representativeconsumer models may provide misleading in-terpretations for otherwise natural results

The last two sets of findings concern therelationship between risk and preferences Sincerisk is particular to the context in which it ismeasured these findings may be sensitive to the

market context and may change once risk takesother forms Even within the auto insurancemarket it is important how risk is measuredMoreover risk in the auto insurance marketmay be conceptually different from risk in othermarkets In many markets risk is independentacross individuals In auto insurance howevermuch of the risk depends on coordinationamong drivers and therefore may be more re-lated to relative not absolute characteristicsFor this reason our finding of positive correla-tion between risk and risk aversion can coexistwith findings of negative correlation in othercontexts (Israel 2005 Finkelstein and McGarry2006) The positive correlation we find is alsoconsistent with the fact that the bivariate probittest in our data provides evidence for adverseselection (Cohen 2005) while similar reduced-form tests in other contexts do not

Finally we find that unobserved heterogene-ity in risk preferences is more important thanheterogeneity in risk This may be driven by thecasual evidence that insurance companies exertmuch effort and resources in collecting con-sumer data which are informative about riskclassification but not about preferences52 Weillustrate the empirical importance of our find-ings for the analysis of optimal contracts in autoinsurance The presence of more than one di-mension of unobserved heterogeneity may dra-matically change the nature of these contractsTheory is still not fully developed for suchmultidimensional screening problems as it typ-ically requires a small number of types (Lands-berger and Meilijson 1999) restricts the twodimensions to be independent of each other(Jean-Charles Rochet and Lars A Stole 2002)or assumes that the number of instrumentsavailable to the monopolist is not smaller thanthe dimension of unobserved heterogeneity(Steven Matthews and John Moore 1987 Rich-ard Arnott and Joseph E Stiglitz 1988)53 MarkArmstrong (1999) may be the closest theoreticalwork to the framework suggested here It cannot

52 This choice of data collection efforts may be justifiedif it is easier for such firms to price discriminate based onrisk but harder to price discriminate based on preferencesA common belief is that without cost-based (ie risk-based) justification for prices price discrimination may leadto consumer backlash

53 See also Smart (2000) Villeneuve (2003) and Jullienet al (2007) for related theoretical results

781VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

be directly applied however as it uses simpli-fying linearity assumptions which would behard to impose in the current context Our re-sults indicate that many applications can benefitfrom extending the theory to include the more

general case such as the one analyzed hereSuch a theory may also serve as a guide forusing supply-side restrictions in similar con-texts Our counterfactual analysis is a very pre-liminary start in this direction

APPENDIX A DESCRIPTION OF THE GIBBS SAMPLER

In this appendix we describe the setup of the Gibbs sampler that we use to estimate the modelOne of the main advantages of the Gibbs sampler is its ability to allow for data augmentation oflatent variables (Tanner and Wong 1987) In our context this amounts to augmenting the individual-specific risk aversion and risk type namely i rii1

n as additional parametersWe can write the model as follows

(22) ln i xi i

(23) ln ri xi vi

(24) choicei 1 if ri ri ()0 if ri ri ()

(25) claimsi Poissoni ti

(26) i

vi

iid

N00

2 r

r r2

Let

2 r

r r2 X

x1

xn

0

0 x1

xn

y r and ui i

vi

The set of parameters for which we want to have a posterior distribution is given by uii1

n The prior specifies that are independent of uii1n have a conventional

diffuse prior We adopt a hierarchical prior for uii1N

(27) uii 1n

iid

N0

(28) 1 Wishart2a Q

782 THE AMERICAN ECONOMIC REVIEW JUNE 2007

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 24: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

the same amount of thought to their renewaldecision as to their initial choice leading tothe observed persistence This caveat is themain reason why we focus most of the analysisonly on new policies

D Robustness

This section discusses the robustness of themain results Results from all specifications wemention in this section are summarized in Table6 The full results from these specifications arerelegated to the online Appendix

The von NeumannndashMorgenstern Utility Func-tionmdashTo derive the benchmark model we as-sume a negligible third derivative of the vNMutility function This is attractive as it allowsus to summarize risk preferences by a one-dimensional parameter r [u13(w)u(w)]There is however a large literature (eg MilesS Kimball and N Gregory Mankiw 1989) em-phasizing the importance of a (positive) third

derivative of the utility function which leads toprecautionary saving It is therefore importantto address the sensitivity of the results to thisassumption

By allowing a third-order Taylor approxima-tion of equation (4) we obtain

(15) r u13w

uw

p

d 1

d

uw

uw

dh2 dh dl dl

2

6d

which reduces to equation (7) when u(w) 0This illustrates that a positive third derivativeprovides an additional (precautionary) incentiveto insure It also shows that in order to fullydescribe preferences in the presence of a thirdderivative a one-dimensional parameter doesnot suffice one needs to know both r [u13(w)u(w)] and [u(w)u(w)] This makesit less attractive for estimation

TABLE 6mdashROBUSTNESS

Specificationa Sample Obs

Claim risk () Absolute risk aversion (r)

Corr(r ) Mean Median Std dev Mean Median Std dev

Baseline estimatesBenchmark model All new customers 105800 0220 0217 0048 19 103 73 106 0020 0207 0839

The vNM utility functionCARA utility All new customers 105800 0219 0217 0048 87 104 b 98 106 b 0009b 0201 0826

The claim generating processBenchmark model No multiple claims 103260 0182 0180 0040 20 103 28 105 0018 0135 0547Thinner-tail risk distribution All new customers 105800 0205c 0171c 0155c 17 103 19 106 0020 0076 0916

The distribution of risk aversionLower-bound procedure All new customers 105800 mdash mdash mdash 37 104 0 0002 mdash mdash

Incomplete information about riskBenchmark model Experienced drivers 82966 0214 0211 0051 21 103 83 106 0021 0200 0761Benchmark model Inexperienced drivers 22834 0230 0220 0073 30 103 12 107 0032 0186 0572Learning model All new customers 105800 0204 0191 0084 12 103 16 106 0016 0200 0772

Sample SelectionBenchmark model First two years 45739 0244 0235 0066 31 103 26 105 0026 0225 0699Benchmark model Last three years 60061 0203 0201 0043 16 103 34 107 0021 0113 0611Benchmark model Referred by a friend 26434 0213 0205 0065 30 103 84 107 0031 0155 0480Benchmark model Referred by advertising 79366 0219 0216 0051 21 103 76 106 0022 0212 0806Benchmark model Non-stayers 48387 0226 0240 0057 23 103 77 107 0026 0149 0848Benchmark model Stayers 1st choice 57413 0190 0182 0057 29 103 29 105 0024 0152 0463Benchmark model Stayers 2nd choice 57413 0208 0200 0065 30 103 16 105 0026 0211 0637

Note The table presents the key figures from various specifications and subsamples tracing the order they are presented in Section IIID Full results (in the format

of Table 4) from all these specifications are available in the online Appendixa ldquoBenchmark modelrdquo refers to the benchmark specification estimated on various subsamples (the first row replicates the estimates from Table 4) The other

specifications are slightly different and are all described in more detail in the corresponding parts of Section IIIDb The interpretation of r in the CARA model takes a slightly different quantitative meaning when applied to noninfinitesimal lotteries (such as the approximately

$100 stakes we analyze) This is due to the positive third derivative of the CARA utility function compared to the benchmark model in which we assume a small

third derivative Thus these numbers are not fully comparable to the corresponding figures in the other specificationsc The interpretation of in the thinner-tail distribution we estimate is slightly different from the standard Poisson rate which is assumed in the other specifications

Thus these numbers are not fully comparable to the corresponding figures in the other specifications

768 THE AMERICAN ECONOMIC REVIEW JUNE 2007

We can allow for a positive third derivativewithout expanding the dimensionality of pref-erences by imposing a parametric functionalform on the utility function Essentially suchparametric form imposes a relationship between[u13(w)u(w)] and [u(w)u(w)] Two standardforms are those that exhibit CARA and thosethat exhibit constant relative risk aversion(CRRA) CRRA requires us to make additionalassumptions about the relevant (and unob-served) wealth level of each individual makingit less attractive The CARA case is imple-mented below

With CARA utility we substitute u(w) exp(rw) in equation (4) and rearrange toobtain

(16) rp

exprdh exprdl

This equation defines the indifference set Unlikethe benchmark model there is no closed-formrepresentation for ri() a property that makesestimation significantly slower Due to the precau-tionary saving incentive arising from the thirdderivative we can use a lower level of absoluterisk aversion to rationalize the low deductiblechoice given In other words the CARA indif-ference set is flatter in comparison to the bench-mark case depicted in Figure 2 Thus the CARAspecification will generally lead to lower estimatesof the coefficient of absolute risk aversion asshown in Table 6 The mean and dispersion ofestimated absolute risk aversion (but not the me-dian) are smaller by about a factor of two Thegeneral qualitative results however remain thesame As shown in Table 5 the mean individual isquite risk averse with respect to lotteries of $100magnitude the median is close to risk neutral withrespect to such lotteries and heterogeneity in riskaversion is important

The Claim-Generating ProcessmdashIn thebenchmark model we assume that claims aregenerated by a Poisson model This assumptionis both analytically attractive (it is important inderiving equation (4)) and in our view a rea-sonable approximation of the data-generatingprocess for claims The Poisson distribution hasbeen widely used in economics to model acci-dent counts (eg Nancy L Rose 1990 Dionneand Vanasse 1992 Ronald Michener and CarlaTighe 1992) In using this distribution re-

searchers have followed the long history of theuse of Poisson by actuaries and insurance spe-cialists going back to Ladislaus Bortkiewicz(1898)

An important restriction of the Poisson dis-tribution is that its mean and variance are thesame Although some economic studies con-firmed this assumption for particular accidentdata (eg Christopher M Auld et al 2001) it isoften the case that this restriction is falsifiedThe most common deviation from the Poissonrestriction is that of fat tails ie variance that ishigher than the mean This led researchers touse a negative binomial distribution to accom-modate this regularity by introducing a secondparameter which delinks the relationship be-tween the mean and the variance One naturalinterpretation of the fat tails is that of unob-served heterogeneity and the negative binomialdistribution can be viewed as a Gamma mixtureof Poisson processes Consistent with this viewwe assume that the claim-generating processfollows a Poisson process at the individuallevel but allows unobserved heterogeneity inrisk and estimates a lognormal mixture of Pois-son processes which is similar to a negativebinomial The dispersion parameter we esti-mate is a free parameter which is identi-fied by the fatness of the tails of the claimdistribution

While one would like to allow fat tails of theaggregate claim distribution one may criticizethe assumption of a Poisson distribution at theindividual level for having tails that may be ldquotoofatrdquo Recently Jaap H Abbring et al (2003) andMark Israel (2004) provided evidence for neg-ative state dependence in data from auto insur-ance claims similar to the ones we useControlling for heterogeneity across individu-als these papers show that a second claim isless likely than a first This may happen due toexperience rating or perhaps to more carefuldriving and less intense use of a car after anaccident The Poisson distribution assumes thatthe second accident is just as likely so negativestate dependence may suggest thinner tails atthe individual level

To verify that the main results are not sensi-tive to this restriction of the Poisson model weperform two tests First we estimate the bench-mark model on a sample that includes onlyindividuals with one or no claims The model isstill identified using variation in policy durations

769VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

The estimates of the level of risk aversion andits dispersion across individuals remain similarto those of the benchmark model (Table 6) Theestimates of risk and the correlation with riskaversion are significantly lower But this isexpected we selected out of the sample thehigh-risk individuals

As a further test for the sensitivity of theresults to possible negative state dependencewe estimated the model with a different distri-bution at the individual level which gives riseto much thinner tails We could not find anyknown distribution of count variables that givesrise to tails thinner than Poissonrsquos so we madeone up In particular for a given Poisson ratelet pn be the probability of observing n occur-rences The distribution we take to the data issuch that

(17) Prn pn2

yenm0 pm2

Thus this distribution makes the probability ofmultiple claims much lower probably muchmore so than any realistic negative state depen-dence in the data Essentially such a distribu-tion makes the econometrician view individualswith multiple claims as high-risk individualswith much more certainty as it requires muchmore ldquobad luckrdquo for low-risk individuals tohave multiple claims The estimates for thelevel of risk aversion and its dispersion arealmost the same as in the benchmark model(Table 6) The rest of the summary figures aredifferent The difference in the level of risk isnot informative the interpretation of isslightly different given the change in the un-derlying distribution The dispersion in ismuch higher This is a direct result of the thin-tail assumption The dispersion of is identifiedby the tails of the claim distribution at theaggregate level With thinner tails imposed onthe individual distribution more heterogeneityis needed to match the observed tails at theaggregate Finally the correlation coefficient inthis specification changes signs and is now neg-ative and close to 1 This is consistent withour discussion in Section IIC which emphasizesthat the identification of the correlation coeffi-cient is closely tied to the structural assump-tions Once these are changed the estimatedcorrelation coefficient would change too Most

importantly however the mean and dispersionof risk aversion are stable

The Distribution of Absolute Risk Aver-sionmdashIn the benchmark model we assume thatthe coefficient of absolute risk aversion is lognor-mally distributed across individuals Since onlyfew previous studies focused on heterogeneity inrisk preferences there is not much existing evi-dence regarding the distribution of risk prefer-ences The only evidence we are aware of is theexperimental results presented by Steffen Andersenet al (2005) which show a skewed distributionwith a fat right tail which is qualitatively consis-tent with the lognormal distribution we assume

An additional advantage of the normality as-sumption is computational It provides a closed-form conditional distribution allowing us to useonly univariate (rather than bivariate) draws inthe estimation procedure significantly reducingthe computational burden One may be con-cerned about the sensitivity of the results to thisdistributional assumption For example it maydrive the result that the median level of riskaversion is much lower than the mean

Incorporating alternative distributional as-sumptions significantly complicates and slowsthe estimation procedure44 As an alternativewe develop a procedure that we believe pro-vides some guidance as to the sensitivity of theresults to this distributional assumption Thedisadvantage of the procedure is that it cannotaccount for adverse selection Since we foundthat adverse selection is not that important thisexercise is informative The exercise conveysthat the main qualitative results are not drivenby the (fat) tails of the lognormal distributionwe impose Rather they are driven by the highfraction of individuals who chose a low deduct-ible despite being of low risk The model im-plies that such individuals must have a fairlyhigh level of risk aversion

The exercise uses a Gibbs sampler to estimatethe lognormal distribution of i given the co-variates and the observed number of claims Ineach iteration of this Gibbs sampler conditional

44 A simple discrete-type distribution of risk aversion isnot well identified with the given data This is due to thenature of the exercise The distribution is identified bythe binary deductible choice and by the ldquolocalrdquo variation inthe pricing menu This variation is not enough to pin downthe discrete types accurately

770 THE AMERICAN ECONOMIC REVIEW JUNE 2007

on the data and the most recent draws for theindividual irsquos we compute lower bounds forthe level and dispersion of risk aversion (inaddition to generating the draws for ri from theconditional lognormal distribution) To deter-mine the lower bound for the level of risk aver-sion we compute

(18) r i i 0 if choicei 0

max0 pi

idi 113d i

if choicei 1

Namely we make ri as low as possible giventhe assumptions of the model We assume thatindividuals who chose a regular deductible arerisk neutral while individuals who chose a lowdeductible are just indifferent between the twodeductible levels (unless i is high enough inwhich case they are also assumed to be riskneutral) We then compute the average over therirsquos To compute the lower bound for the dis-persion we search for values of rirsquos that areconsistent with the assumptions of the modeland the observed data and that minimize thevariance of ri This turns out to be a simplesearch procedure that is linear in the number ofindividuals

The lower bound of the level of risk aversionthat we obtain from this exercise is 368 104

NIS1 (with standard deviation of 638 106)This level is five times lower than the analogouspoint estimate reported in Table 4 This trans-lates into indifference about participating in a50ndash50 lottery of gaining $100 and losing $885Thus it still suggests a significant level of riskaversion for lotteries of $100 magnitude Simi-larly the conversion to relative risk aversion asin Table 5 implies a relative risk aversion co-efficient of 1883 The result for the lowerbound of the dispersion is 189 103 (withstandard deviation of 816 105) which is tentimes lower than the benchmark estimate Thusthe coefficient of variation for absolute riskaversion declines only by two suggesting ahigh degree of heterogeneity Finally oneshould note that these bounds involve extremeassumptions and are computed separately andtherefore cannot be binding at the same timeThus the ldquocorrectrdquo estimates for both quantities

are likely to be higher closer to the results wereport for the benchmark model

Incomplete Information about RiskmdashThroughout we assume that individuals haveperfect information about their individual-specific objective risk type i This is a strongerassumption than we need Because expectedutility is linear in probabilities it suffices thatindividualsrsquo expected risk rate is equal to theirobjective risk rate ie i E(iIi) i wherei is a random variable representing individualirsquos perceived risk rate and Ii is individual irsquosinformation at the time of the coverage choiceNamely individuals could be uncertain abouttheir risk type but their point estimate has to becorrect

There are several channels through whichincomplete information may operate Let usconsider two such cases First suppose thatindividuals are correct but only on average iethat i i i where E(i) 0 The intuitionfor this case is similar to an ldquoerrors in variablesrdquomodel and in principle will result in an evenless important role for adverse selection Giventhat we find a relatively small effect of adverseselection this bias will not change this conclu-sion This may be even more pronounced ifcorr(i i) 0 which reflects a reasonableassumption of ldquoreversion to the meanrdquo ie thatan individualrsquos estimate of his risk type is someweighted average between his true risk type andthe average risk type of individuals who aresimilar (on observables) to him The conclusionmay go in the other way only if the mistakes goin the other direction according to which indi-viduals who are riskier than average believe thatthey are even more risky than they truly areThis we believe is less plausible

As a test for the sensitivity of the results tothe complete information assumption we per-form two exercises The first exercise estimatesthe benchmark model separately for experi-enced drivers and inexperienced drivers wherewe define an individual to be an experienceddriver if he has ten years or more driving expe-rience This is a conservative definition com-pared to those used in the literature (Chiapporiand Salanie 2000 Cohen 2005) The underlyingassumption consistent with evidence providedin Cohen (2005) is that drivers learn about theirown risk types as they gain more driving expe-rience but that most of this learning occurs

771VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

within the first ten years of driving Thus ourcomplete information model is a more appeal-ing description of experienced drivers Theresults for experienced drivers are almost iden-tical to the results from the benchmark specifi-cation (Table 6) suggesting that the mainresults are not driven by the fraction of individ-uals who are less likely to know their risk typesThe results for inexperienced drivers show asimilar pattern and magnitude Consistent withthe learning hypothesis however they show alarger dispersion in risk preferences which maybe due to incomplete information and there-fore to more random choices of deductiblelevels

The second exercise we perform is a morestructural version of the learning story whichallows even experienced drivers to have incom-plete information regarding their risk types Wedo so by estimating a different specification ofthe model that assumes that individuals areBayesian and update their information abouttheir own (stable) risk types over time usingonly information about the number of claimsthey make each year While we do not observecomplete claims histories of individuals wecan simulate such histories and integrate overthese simulations Thus individualsrsquo informa-tion would be related to their true type andwould be more precise with longer driving his-tories The full modeling and estimation detailsof this specification are provided in the Appen-dix We view this model as an extreme versionof incomplete information as there are manyother sources through which individuals maylearn about their own types and thereby havebetter information about their types than whatwe estimate them to have While the resultsimply that the levels of risk aversion and heter-ogeneity are lower than the benchmark esti-mates (Table 6) the order of magnitude andqualitative pattern are quite similar suggestingthat the main qualitative findings are robust tothe information structure

Sample SelectionmdashThere are two differentways to think about the contract selection pro-cess One possibility is that individuals firstselect an insurer based on advertisement wordof mouth or access to an agent Then individ-uals select the insurance contract from amongseveral contracts the selected insurer offersThis selection process is consistent with the way

the company executives view their business45

Another possibility is that individuals first col-lect information about all available insurancecontracts and then choose their most preferredone According to the industryrsquos conventionalwisdom individuals do not shop much acrossinsurers and therefore this selection processseems less important

The unique characteristics of direct insurersmay attract individuals who are more likely toexperiment with new ways of doing businessand may therefore be less risk averse than thegeneral population In Table 7 we compare thedemographics of our sample of policyholderswith those of the general Israeli populationThis comparison reflects a similar intuitioncompared with the general population our av-erage policyholder is slightly younger moreeducated more likely to be male and less likelyto be married or an immigrant This direction ofselection may also apply to unobserved riskpreferences thereby making our policyholderson average less risk averse than a representa-tive individual This suggests that the level ofrisk aversion that we find may be viewed as alower bound on the level of risk aversion in thegeneral Israeli population

One potential way to model sample selectionis to allow for an additional outside option to beselected For the vast majority of individuals weobserve the outside option is to purchase sim-ilar insurance from competing insurance agen-cies Unfortunately data on the structure ofcompeting contracts their prices and the waythey vary with individual characteristics are un-available This makes us uncomfortable to try tomodel sample selection as results from anysuch model will be driven by our assumptionsrather than by meaningful variation in the dataThe results are still meaningful for two reasonsFirst this is a large population accounting forabout 7 percent of all drivers in Israel Secondto the extent that our estimates suggest higherlevels of risk aversion than previously esti-mated and that the sample selection is likely to

45 As discussed in Section I the company was the firstdirect insurance provider and it offered significantly lowerpremiums than those offered by competitors due to signif-icant cost advantage In Section I we also discuss theliterature which emphasizes that choice of direct insurers isdriven primarily by nonmonetary ldquoamenitiesrdquo

772 THE AMERICAN ECONOMIC REVIEW JUNE 2007

bias these estimates downward the results arestill informative

To assess the magnitude of sample selectionin driving the main results we perform severaltests First we estimate the benchmark modelseparately for the first two years of the compa-nyrsquos operation during which the company wasthe only company to sell insurance directly andfor the remaining three years after additionaldirect insurers entered the market Second weestimate the benchmark model separately forindividuals who were referred to the companyby word of mouth and for those who heardabout the company through advertisement (pri-marily on television) The latter may havesearched more and might be more price sensi-tive Third we estimate the benchmark modelseparately for individuals who renewed theirpolicy with the company and for those who didnot renew It seems likely that individuals whodid not renew are less selected as most of themswitch back to regular insurers who insure themajority of the population Fourth we estimatethe model for the second deductible choicemade by those individuals who renew It couldbe argued that switching to other companies is

costly so outside options are not as important indriving deductible choices for those who renewAll these results are summarized in Table 6 andshow some important differences between thetwo groups within each pair of subsamples prob-ably reflecting selection For all groups howeverthe qualitative pattern of the results and the orderof magnitude and dispersion of risk aversion aresimilar to those of the full sample This is sugges-tive that correcting for sample selection is unlikelyto change the qualitative results

E Caveats

Moral HazardmdashThroughout our analysiswe abstract from moral hazard ie we assumethat i can vary across individuals but is invari-ant to the coverage choice There are two typesof moral hazard that may play a role in thiscontext First individuals with less coveragemay take greater precaution and drive morecarefully thereby reducing their claim risk rateSecond conditional on a claim event peoplewith higher deductibles are less likely to file aclaim there exists a range of claims for which

TABLE 7mdashREPRESENTATIVENESS

Variable Sampleb Populationc Car ownersd

Agea 4114 (1237) 4255 (1801) 4511 (1413)Female 0316 0518 0367Family Single 0143 0233 0067

Married 0780 0651 0838Divorced 0057 0043 0043Widower 0020 0074 0052

Education Elementary 0029 0329 0266High school 0433 0384 0334Technical 0100 0131 0165College 0438 0155 0234

Emigrant 0335 0445 0447

Obs 105800 723615 255435

a For the age variable the only continuous variable in the table we provide both the mean and the standard deviation (inparentheses)

b The figures are derived from Table 1 The family and education variables are renormalized so they add up to one weignore those individuals for whom we do not have family status or education level This is particularly relevant for theeducation variable which is not available for about half of the sample it seems likely that unreported education levels arenot random but associated with lower levels of education This may help in explaining at least some of the gap in educationlevels across the columns

c This column is based on a random sample of the Israeli population as of 1995 We use only adult population ieindividuals who are 18 years of age or older

d This column is based on a subsample of the population sample The data provide information about car ownership onlyat the household level not at the individual level Thus we define an individual as a car owner if (a) the household owns atleast one car and the individual is the head of the household or (b) the household owns at least two cars and the individualis the spouse of the head of the household

773VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

filing is profitable only under a low deductible(the literature often refers to this second effectas ldquoex post moral hazardrdquo)

To the extent that moral hazard exists abstract-ing from it will likely bias our estimates of riskaversion downward To see this note that adjust-ing behavior will help individuals to self-insureagainst uninsured costs Similarly ex post moralhazard would reduce the value of a low deduct-ible as in the event of a claim the gain from alower deductible would sometimes be less thanthe difference between the two deductible levelsBoth these effects will make a low deductible lessattractive requiring individuals to be even morerisk averse than we estimate in order to purchasemore coverage Below we discuss why in ourview abstracting from moral hazard is a reason-able approximation in this setting

It seems reasonable to conjecture that ceterisparibus insured individuals will drive less care-fully than uninsured ones It may also seemreasonable that the existence of a deductiblemay make individuals more careful about smalldamages to their car However when all choicesinclude a deductible and deductibles are similarin their magnitude it seems less likely thatdriving or care behavior will be affected46 Fi-nally to separately identify moral hazard onewould need another dimension of the data overwhich risk types remain fixed but coveragechoices vary exogenously (Israel 2004 Jaap HAbbring et al forthcoming)

We rely on the data to justify why we abstractfrom the second potential effect that of ex postmoral hazard Figure 3 presents data on theclaim amounts and shows that about 99 percentof the claims filed by policyholders with low-deductible policies were for amounts greaterthan the higher deductible level If the distribu-tion of amounts of potential claims does notvary with the deductible choice and if individ-uals file a claim for any loss that exceeds their

deductible level this suggests that 99 percent ofthe claims would have been filed under eitherdeductible choice making the assumption toabstract from moral hazard not very restrictive

Individuals however may choose not to filea claim even when the claim amount exceedsthe deductible level This may happen due toexperience rating which increases future insur-ance premiums These dynamic effects do notdepend on the deductible level at the time of theclaim so they simply enter in an additive wayUsing our data on individuals who renew theirpolicies with the company we can assess howbig the dynamic effects are These data showthat the price effect of a claim lasts for threeyears and is highest when an individual files hissecond claim within a year In such a case hewould face about a 20 percent increase in hisinsurance premium in the subsequent year 10percent in the year after and 5 percent in thethird year after the claim The regular premiumis about twice the regular deductible amount soan upper bound for the dynamic costs is 70percent of the regular deductible In most casesthe actual dynamic costs are much lower thanthis upper bound (the dynamic costs of say thefirst claim within a year are close to zero) Inaddition an individual can opt out of the con-tract and switch to a different insurance pro-vider This is likely to reduce his dynamic costsbecause in Israel unlike in the United Statesand many other countries there is no publicrecord of past claims Therefore insurance pro-viders can take full advantage of past recordsonly for their past customers For this reasonnew customers will of course face higher pre-miums than existing ones but the premium in-crease would not be as high as it would havebeen with the old insurance provider This isdue to the presence of ldquoinnocentrdquo new custom-ers who are pooled together with the switchers(Cohen 2003)47 Using 70 percent as a conser-vative upper bound Figure 3 shows that about93 percent of those claims filed by individualswith a low deductible would have also been46 This assumption is consistent with Cohen and Einav

(2003) who find no evidence for behavioral response tochanges in seat belt laws The following anecdotal obser-vation also supports this In an informal survey we con-ducted among our colleagues all of them were aware of adeductible in their auto insurance policy but fewer than 20percent knew its level This does not imply that 80 percentof our colleagues did not pay attention to their deductiblechoice at the time the choice was made It does implyhowever that their driving or care behavior cannot dependon the deductible level

47 New customers may voluntarily report their claimshistory to their new insurance provider Voluntary disclo-sure of past claims is as may be expected not truthful Ourdata suggest an unconditional claim rate of 02453 in oursample population Our data on claims history as voluntar-ily disclosed by the same individuals suggest a claim rate of00604 which is four times lower

774 THE AMERICAN ECONOMIC REVIEW JUNE 2007

filed with a regular deductible While this is notnegligible it applies for only a small fraction ofindividuals For the vast majority of them the99 percent discussed above is a more appropri-ate benchmark Therefore ex post moral hazardis unlikely to play a major role in this settingand one can abstract from the loss distributionand focus on claim rates as we do in thispaper48

Additional Cost of an AccidentmdashOur modelassumes that in the event of an accident theonly incurred costs are those associated with thedeductible payment In practice however other

transaction costs may be incurred such as thetime spent for appraisal of the damage the costsassociated with renting a replacement car for theduration of a repair etc Such costs could bereadily incorporated into the model To illus-trate we assume that these costs are known inadvance and are given by a constant c (whichcould in principle vary with each individual)Since c will not vary with the chosen level ofdeductible it will not affect the value d andwill enter the empirical model only through itseffect on d In particular equation (7) willchange to

(19) r

p

d 1

d c

and everything else will remain the same

48 We would not be as comfortable with this statementfor the choice of high and very high deductibles which areat much higher levels This is one additional reason to focusonly on the choice between low and regular deductibles

FIGURE 3 CLAIM DISTRIBUTIONS

Notes This figure plots kernel densities of the claim amounts estimated separately depending on the deductible choice Forease of comparison we normalize the claim amounts by the level of the regular deductible (ie the normalization is invariantto the deductible choice) and truncate the distribution at ten (the truncated part which includes a fat tail outside of the figureaccounts for about 25 percent of the distribution and is roughly similar for both deductible choices) The thick line presentsthe distribution of the claim amounts for individuals who chose a low deductible while the thin line does the same for thosewho chose a regular deductible Clearly both distributions are truncated from below at the deductible level The figure showsthat the distributions are fairly similar Assuming that the claim amount distribution is the same the area below the thickerline between 06 and 1 is the fraction of claims that would fall between the two deductible levels and therefore (absentdynamic incentives) would be filed only if a low deductible were chosen This area (between the two dotted vertical lines)amounts to 13 percent implying that the potential bias arising from restricting attention to claim rate (and abstracting fromthe claim distribution) is quite limited As we discuss in the text dynamic incentives due to experience rating may increasethe costs of filing a claim shifting the region in which the deductible choice matters to the right an upper bound to these costsis about 70 percent of the regular deductible covering an area (between the two dashed vertical lines) that integrates to morethan 7 percent Note however that these dynamic incentives are a very conservative upper bound they apply to less than 15percent of the individuals and do not account for the exit option which significantly reduces these dynamic costs

775VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

This implies that in principle such costs willhave no effect on the results of the counterfac-tual exercise we present later The costs willhowever affect the interpretation of the esti-mates of risk aversion In particular instead ofthe distribution of r we will now be estimatingthe distribution of r(d c)d so the reportedestimates of the coefficient of absolute riskaversion will be biased upward The magnitudeof the bias depends on the size of this transac-tion cost c compared to the average deductibled If c is relatively small the bias is negligibleIf however c is as big as the (average) deduct-ible level all our reported estimates of the levelof risk aversion should be divided by two (butthe coefficients on observables which are semi-elasticities will not change) The intuitionwould be similar but more involved if c variesacross individuals but not proportionally to d

Data about such transaction costs are ofcourse not available The following back-of-the-envelope exercise may provide some guidanceas to the magnitude of such costs We collecteddata from the Israeli police about the annual num-bers of accidents49 accidents with fatalities andaccidents with severe injuries in Israel for theyears 1996ndash1999 We then divided these numbersby our estimate of the total number of auto insur-ance claims in Israel (161859)50 We obtain that159 percent of the claims involve reported acci-dents 22 percent involve accidents with severeinjuries and 03 percent involve fatal accidentsThus the majority of claims reflect small unre-ported accidents perhaps suggesting that theseadditional costs of a claim are often not too large

Deviations from Expected Utility TheorymdashThroughout the paper we restrict attention toexpected utility maximizers Despite much ev-idence in the literature against some of the pre-dictions of expected utility theory it still seemsto us the most natural benchmark to specify andone that facilitates comparison to previous stud-ies We note that expected utility theory is as-sumed it is not and cannot be tested within our

framework Given our cross-sectional analysiswhich in principle allows flexible forms ofunobserved heterogeneity in risk preferencesthere are no testable restrictions imposed byexpected utility theory We should also note thatmuch (but not all) of the documented evidenceagainst expected utility theory arises with ex-treme risk probabilities which are close to zeroor one Our data (and our estimates) are basedon risk probabilities that are all in the range of010 to 035 Over this range expected utilityseems to perform better Finally it is importantto stress two points First at the conceptuallevel it is straightforward to use an alternativetheory of decisions under uncertainty If con-ditional on objective risk individuals vary in asingle dimension the same conceptual modeland empirical strategy can be applied All oneneeds to do is to specify the parameter overwhich decisions vary and construct an indiffer-ence set in the space of the specified parameterand (objective) risk types similar to the onepresented in Figure 2 Second any alternativemodel of decisions under uncertainty would re-quire us to take an even stronger view regardingthe parameterized objective function For exam-ple prospect theory (Daniel Kahneman andAmos Tversky 1979) would require us to pa-rameterize not only the curvature of individu-alsrsquo utility functions but also their referencepoints for which there is no natural choice inour context Similar issues arise if we try toapply decision weights (Tversky and PeterWakker 1995) or measures of overconfidencewith respect to driving ability

F Implications for Profits and Pricing

We now look at how firm profits vary withalternative pricing schemes This exercise isinteresting for several reasons First althoughwe do not use supply-side information for esti-mation it shows how one may incorporate suchinformation in estimation Second it illustratesthe conceptual trade-off faced by a monopolistthat operates in a market with adverse selectionAlthough the conceptual trade-off betweenhigher demand and worse selection is wellknown and has been extensively analyzed in thetheoretical literature quantifying it is importantto understand its empirical relevance Finallywe argued earlier that unobserved heterogeneityin risk aversion seems more important than un-

49 An accident is counted in this measure if it was re-ported to the police and a police officer arrived at the scene

50 This is estimated by taking the number of activepolicies at the end of our sample (45871) dividing it by ourbest guess for the share of the market the company had atthe time (7 percent) and multiplying it by the estimatedclaim rate in our data as computed in Table 2 (0245)

776 THE AMERICAN ECONOMIC REVIEW JUNE 2007

observed heterogeneity in risk The current ex-ercise shows that this conclusion also translatesto pricing and profits

Throughout this section we hold fixed thedistribution of risk and risk aversion the firmfaces Since we have little information about thedeterminants of overall demand faced by thefirm and hence how restrictive it is to hold thedistribution fixed we make the simplifying as-sumption that individuals make their choicessequentially They first choose the insuranceprovider by observing only the price of theregular deductible Once they decide to buy apolicy from the insurer they choose the deduct-ible level This seems a reasonable approxima-tion as the regular deductible is the one alwaysadvertised and initially quoted while the otheroptions are revealed only once the potentialcustomer and the insurance sales person ldquogetinto detailsrdquo Consistent with this assumptionwe assume that the regular premium and de-ductible are dictated by competitive conditionsand we focus on the choice of the low deduct-ible and its associated premium

From the companyrsquos standpoint each indi-vidual can be represented by a random draw of(i ri) from the conditional (on observables)distribution of risk and risk aversion

(20) ln i

ln ri Nxi

xi

2 r

r r2

When analyzing the optimal menu to offer suchan individual the company is assumed to berisk neutral and to maximize expected profitsSuppose the company offered only the regulardeductible and premium (dh ph) Let the ex-pected profits from this strategy be 0 Givenour assumptions we proceed by analyzing howthe firmrsquos profits are affected by a choice of anadditional low deductible option (dl pl) withdl dh and pl ph It is easy to use a changein variables and analyze the choice of d dh dl and p pl ph Expected profits arenow given by

(21) maxdp

0 Prri rii d pp

d Eiri rii d p

The trade-off in the companyrsquos decision isstraightforward Each new customer who chooses

the low combination pays an additional p up-front but saves d for each accident she is in-volved in This translates into two effects thatenter the companyrsquos decision problem The first issimilar to a standard pricing problem higher(lower) price difference (deductible difference)p (d) leads to a higher markup (on those indi-viduals who select the low deductible) but tolower quantity (or probability of purchase) asfewer individuals elect to choose the low deduct-ible This effect enters the profit function throughD(d p) Pr(ri ri(i d p)) The secondcomposition effect arises because of adverse se-lection As the price of the low deductible in-creases those individuals who still elect to choosethe low combination are ceteris paribus thosewith higher risk This effect enters through (dp) E(iri ri(i d p)) Its magnitude andsign depend on the relative heterogeneity of i andri and on the correlation between them Sinceneither D(d p) nor (d p) has a closed-form solution we analyze this decision problemgraphically where D(d p) and (d p) arenumerically computed using simulations from thejoint distribution

We illustrate our analysis by using the meanindividual in the data whose expected ln(i)and ln(ri) are 154 and 1181 respectively(based on Table 4) Such an individual has tochoose between a regular contract of ( phdh) (3189 1595) (in NIS) and a lowcontract of ( pl dl) (3381 957) ie (pd) (191 638) Below we discuss severaladditional counterfactual cases First we con-sider a case with a negative correlation be-tween risk and risk aversion (with the samemagnitude ie 084) Second we con-sider cases when the company ignores unob-served heterogeneity in one of the dimensionsie it views individuals as a draw from theestimated marginal distribution on one dimen-sion with the other dimension known and fixedat its estimated mean We do this exercise foreach dimension separately

To get intuition for the different effects Figure2 presents the estimated distribution in the spaceof (i ri) A small increase (decrease) in p (d)shifts the indifference setup to the right therebymaking some marginal individuals who were pre-viously just to the right of it switch to choosingthe regular deductible The demand trade-off isjust the comparison between the marginal loss ofthe company from all the marginal individuals

777VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

who no longer buy higher coverage with thehigher profits made from the inframarginal indi-viduals who still elect to choose higher coverageFigure 2 also helps in illustrating the effect ofadverse selection and the importance of the cor-relation coefficient As the menu shifts to the rightthe positive correlation implies that the marginalindividuals have higher risk than the average Thismeans that ldquolosingrdquo them (namely havingthem buy less coverage) is not as costly forthe insurance company as such individualsare on average more adversely selected Anegative correlation for example would havemade these marginal individuals more valuablethereby decreasing the incentive to increase pricesor deductibles from the current levels

Figure 4 presents the effects of pricing onprofits by varying the low-deductible levelkeeping the premium charged for it fixed at theobserved price (of p 191) It implies that the

current low deductible benefit of 638 NIS re-sults in additional annual profits of about 368NIS per customer This is about 037 percent oftotal operating profits per customer which isabout 1000 NIS Note however that after sub-tracting the administrative and claim-handlingcosts associated with each customer and claimthe relative magnitude of this effect will behigher Note also that the estimates imply thatthe current low-deductible level is suboptimalBy setting a smaller low-deductible benefit ofd 355 NIS (ie increasing the current lowdeductible by 283 NIS) additional profits canbe increased to 659 NIS51 Of course the find-

51 There is no reason of course to limit the choice of thecompany to only one additional deductible level Moredegrees of freedom in choosing the pricing menu will leadto higher profits

FIGURE 4 COUNTERFACTUALSmdashPROFITS

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) We plot the additional profits fromoffering a low deductible as a function of the attractiveness of a low deductible d dh dl fixing its price at the observedlevel The thick solid line presents the counterfactual profits as implied by the estimated benchmark model The other threecurves illustrate how the profits change in response to changes in the assumptions when the correlation between riskand risk aversion is negative (thin solid line) when there is no heterogeneity in risk aversion (dot-dashed line) and whenthere is no heterogeneity in risk (dashed line) The maxima (argmax) of the four curves respectively are 659 (355)714 (583) 0 and 704 (500) The dotted vertical line represents the observed level of d (638) which implies that theadditional profits from the observed low deductible are 368 NIS per policy

778 THE AMERICAN ECONOMIC REVIEW JUNE 2007

ing that the current pricing is suboptimal mayalso be due to a limitation of the model

Consistent with the intuition discussed aboveFigure 4 also shows that the incentive to in-crease prices (or lower the deductible) is higherwith positive correlation When the correlationbetween risk and risk aversion is negative theoptimal low deductible is lower It also showsthat ignoring either dimension of unobservedheterogeneity has an important effect on pric-ing decisions However while ignoring het-erogeneity in risk does not change thequalitative pattern by much ignoring hetero-geneity in risk aversion completely changesthe pricing analysis In fact given the esti-mated coefficients and the observed prices ifindividuals vary only in risk offering a lowdeductible does not increase profits due to ad-verse selection

Figure 5 breaks down the effect on profits bypresenting the pricing effect on the demand forlow deductible D(d p) and on the compo-sition effect (d p) The former is simplygenerated by the distribution of certainty equiv-alents implied by the joint distribution of i andri (Landsberger and Meilijson 1999) It is S-shaped due to the assumption of lognormal dis-tribution The shape of the composition effect isdriven by the relative variance of i and ri andby the correlation coefficient As the estimatesimply that most of the variation in certaintyequivalents is driven by variation in ri thestrong positive correlation implies that the com-position effect is monotonically decreasing inthe deductible level As the low-deductible op-tion becomes more favorable more peoplechoose it with the most risky individuals choos-ing it first The effect of the deductible level onthe composition effect is dramatically differentwhen the correlation between risk and risk aver-sion is negative When this is the case theobserved relationship between the deductiblelevel and the composition effect is mostly re-versed This is because the effect of risk aver-sion dominates that of adverse selection due toits higher variance

IV Concluding Remarks

The paper makes two separate contributionsFirst from a methodological standpoint we layout a conceptual framework through which one

can formulate and estimate a demand system forindividually customized contracts The key datarequirements for this approach are contractchoices individual choice sets and ex post riskrealizations Since such data may be available inmany other contexts the methodological frame-work may be useful to uncover structural pa-rameters in such settings As an example onecould consider annuity data and use guaranteeperiod choices and mortality data to identifyheterogeneity in risk (mortality) and in prefer-ences for wealth after death (Einav Finkelsteinand Paul Schrimpf 2006) Similarly one couldconsider loan data and use down-paymentchoices and default data to identify heterogene-ity in risk (default) and in liquidity (Will Ad-ams Einav and Jonathan D Levin 2006)Second from an economic standpoint we pro-vide a new set of estimates for the degree andheterogeneity of (absolute) risk aversion and itsrelationship with risk We discuss these below

While our estimates of risk aversion help topredict other related insurance decisions it isnatural to ask to what extent these parametersare relevant in other contexts This is essentiallyan empirical question which can be answeredonly by estimating risk aversion parameters fora variety of bet sizes and in a variety of con-texts Since isolating risk preferences in manycontexts is hard such exercises are rare leavingus with no definite answer for the scope ofmarkets for which our estimates may be rele-vant On one hand Rabin (2000) and Rabin andThaler (2001) argue that different decisions inlife are taken in different contexts and thereforemay be subject to different parameters in theutility function On the other hand classicaltheory suggests that each individual has onevalue function over her lifetime wealth so allrisky decisions take into account the same valuefunction and are therefore subject to the samerisk preferences Our view is somewhere inbetween We are more comfortable with extrap-olation of our risk aversion estimates to setupsthat are closer to the auto insurance marketcontext in which these parameters are esti-mated To assess ldquoclosenessrdquo it is important toconsider various factors over which contextsmay differ Such factors may include bet size aswell as informational and behavioral effectssuch as default options framing and rarity of theevents Since we use claim data to identify risktype we leave everything else to be interpreted as

779VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

FIGURE 5 COUNTERFACTUALSmdashSELECTION

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) Parallel to Figure 4 we breakdown the effects on profits to the share of consumers who choose a low deductible (bottom panel) and to the expected riskof this group (top panel) This is presented by the thick solid line for the estimates of the benchmark model As in Figure 4 wealso present these effects for three additional cases when the correlation between risk and risk aversion is negative (thin solidline) when there is no heterogeneity in risk aversion (dot-dashed line) and when there is no heterogeneity in risk (dashedline) The dotted vertical lines represent the observed level of d (638) for which the share of low deductible is 16 percentand their expected risk is 0264 This may be compared with the corresponding figures in Table 2A of 178 and 0309respectively Note however that the figure presents estimated quantities for the average individual in the data while Table2A presents the average quantities in the data so one should not expect the numbers to fit perfectly

780 THE AMERICAN ECONOMIC REVIEW JUNE 2007

risk aversion As an example overconfidence willbe captured by a lower level of estimated riskaversion so if overconfidence is more importantin auto insurance than in health insurance whenextrapolated to health insurance individuals maybehave as if they are more risk averse than weestimate them to be

With this caveat in mind let us discuss ourfour main findings First we find large hetero-geneity in risk preferences across individualsThis heterogeneity is important in various con-texts (a) it cautions against using pure cross-sectional variation to test expected utilitytheory (b) it may create strong selection ofparticipants into particular markets this maymake participants in voluntary markets quitedifferent in their risk preferences from those inmarkets in which participation is either manda-tory or driven by other factors and (c) it maymake the marginal individual quite differentfrom the average one Models in macroeconom-ics and finance which often use a representativeindividual framework may not be able to cap-ture and account for such differences

Our second set of findings concerns the wayrisk aversion relates to observable characteris-tics Our finding that women are more riskaverse than men has been documented in othersettings The finding that risk preferences ex-hibit a U-shaped pattern over the life cycle maybe interesting to explore further in other con-texts Other findings suggest that the estimatedcoefficient of risk aversion increases with ob-servables that are related to income and wealthAs we agree with the widely held belief of thedecreasing absolute risk aversion property ourpreferred interpretation for this finding is thatwealth and income may be endogenous gener-ating the estimated cross-sectional relationshipWhile lower risk aversion may be associatedwith higher propensity to become an entrepre-neur and thereby have higher wealth it mayalso be associated with lower propensity to saveor invest in education affecting wealth the otherway Therefore one important message of thisfinding is that accounting for heterogeneity inpreferences may be important as representativeconsumer models may provide misleading in-terpretations for otherwise natural results

The last two sets of findings concern therelationship between risk and preferences Sincerisk is particular to the context in which it ismeasured these findings may be sensitive to the

market context and may change once risk takesother forms Even within the auto insurancemarket it is important how risk is measuredMoreover risk in the auto insurance marketmay be conceptually different from risk in othermarkets In many markets risk is independentacross individuals In auto insurance howevermuch of the risk depends on coordinationamong drivers and therefore may be more re-lated to relative not absolute characteristicsFor this reason our finding of positive correla-tion between risk and risk aversion can coexistwith findings of negative correlation in othercontexts (Israel 2005 Finkelstein and McGarry2006) The positive correlation we find is alsoconsistent with the fact that the bivariate probittest in our data provides evidence for adverseselection (Cohen 2005) while similar reduced-form tests in other contexts do not

Finally we find that unobserved heterogene-ity in risk preferences is more important thanheterogeneity in risk This may be driven by thecasual evidence that insurance companies exertmuch effort and resources in collecting con-sumer data which are informative about riskclassification but not about preferences52 Weillustrate the empirical importance of our find-ings for the analysis of optimal contracts in autoinsurance The presence of more than one di-mension of unobserved heterogeneity may dra-matically change the nature of these contractsTheory is still not fully developed for suchmultidimensional screening problems as it typ-ically requires a small number of types (Lands-berger and Meilijson 1999) restricts the twodimensions to be independent of each other(Jean-Charles Rochet and Lars A Stole 2002)or assumes that the number of instrumentsavailable to the monopolist is not smaller thanthe dimension of unobserved heterogeneity(Steven Matthews and John Moore 1987 Rich-ard Arnott and Joseph E Stiglitz 1988)53 MarkArmstrong (1999) may be the closest theoreticalwork to the framework suggested here It cannot

52 This choice of data collection efforts may be justifiedif it is easier for such firms to price discriminate based onrisk but harder to price discriminate based on preferencesA common belief is that without cost-based (ie risk-based) justification for prices price discrimination may leadto consumer backlash

53 See also Smart (2000) Villeneuve (2003) and Jullienet al (2007) for related theoretical results

781VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

be directly applied however as it uses simpli-fying linearity assumptions which would behard to impose in the current context Our re-sults indicate that many applications can benefitfrom extending the theory to include the more

general case such as the one analyzed hereSuch a theory may also serve as a guide forusing supply-side restrictions in similar con-texts Our counterfactual analysis is a very pre-liminary start in this direction

APPENDIX A DESCRIPTION OF THE GIBBS SAMPLER

In this appendix we describe the setup of the Gibbs sampler that we use to estimate the modelOne of the main advantages of the Gibbs sampler is its ability to allow for data augmentation oflatent variables (Tanner and Wong 1987) In our context this amounts to augmenting the individual-specific risk aversion and risk type namely i rii1

n as additional parametersWe can write the model as follows

(22) ln i xi i

(23) ln ri xi vi

(24) choicei 1 if ri ri ()0 if ri ri ()

(25) claimsi Poissoni ti

(26) i

vi

iid

N00

2 r

r r2

Let

2 r

r r2 X

x1

xn

0

0 x1

xn

y r and ui i

vi

The set of parameters for which we want to have a posterior distribution is given by uii1

n The prior specifies that are independent of uii1n have a conventional

diffuse prior We adopt a hierarchical prior for uii1N

(27) uii 1n

iid

N0

(28) 1 Wishart2a Q

782 THE AMERICAN ECONOMIC REVIEW JUNE 2007

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 25: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

We can allow for a positive third derivativewithout expanding the dimensionality of pref-erences by imposing a parametric functionalform on the utility function Essentially suchparametric form imposes a relationship between[u13(w)u(w)] and [u(w)u(w)] Two standardforms are those that exhibit CARA and thosethat exhibit constant relative risk aversion(CRRA) CRRA requires us to make additionalassumptions about the relevant (and unob-served) wealth level of each individual makingit less attractive The CARA case is imple-mented below

With CARA utility we substitute u(w) exp(rw) in equation (4) and rearrange toobtain

(16) rp

exprdh exprdl

This equation defines the indifference set Unlikethe benchmark model there is no closed-formrepresentation for ri() a property that makesestimation significantly slower Due to the precau-tionary saving incentive arising from the thirdderivative we can use a lower level of absoluterisk aversion to rationalize the low deductiblechoice given In other words the CARA indif-ference set is flatter in comparison to the bench-mark case depicted in Figure 2 Thus the CARAspecification will generally lead to lower estimatesof the coefficient of absolute risk aversion asshown in Table 6 The mean and dispersion ofestimated absolute risk aversion (but not the me-dian) are smaller by about a factor of two Thegeneral qualitative results however remain thesame As shown in Table 5 the mean individual isquite risk averse with respect to lotteries of $100magnitude the median is close to risk neutral withrespect to such lotteries and heterogeneity in riskaversion is important

The Claim-Generating ProcessmdashIn thebenchmark model we assume that claims aregenerated by a Poisson model This assumptionis both analytically attractive (it is important inderiving equation (4)) and in our view a rea-sonable approximation of the data-generatingprocess for claims The Poisson distribution hasbeen widely used in economics to model acci-dent counts (eg Nancy L Rose 1990 Dionneand Vanasse 1992 Ronald Michener and CarlaTighe 1992) In using this distribution re-

searchers have followed the long history of theuse of Poisson by actuaries and insurance spe-cialists going back to Ladislaus Bortkiewicz(1898)

An important restriction of the Poisson dis-tribution is that its mean and variance are thesame Although some economic studies con-firmed this assumption for particular accidentdata (eg Christopher M Auld et al 2001) it isoften the case that this restriction is falsifiedThe most common deviation from the Poissonrestriction is that of fat tails ie variance that ishigher than the mean This led researchers touse a negative binomial distribution to accom-modate this regularity by introducing a secondparameter which delinks the relationship be-tween the mean and the variance One naturalinterpretation of the fat tails is that of unob-served heterogeneity and the negative binomialdistribution can be viewed as a Gamma mixtureof Poisson processes Consistent with this viewwe assume that the claim-generating processfollows a Poisson process at the individuallevel but allows unobserved heterogeneity inrisk and estimates a lognormal mixture of Pois-son processes which is similar to a negativebinomial The dispersion parameter we esti-mate is a free parameter which is identi-fied by the fatness of the tails of the claimdistribution

While one would like to allow fat tails of theaggregate claim distribution one may criticizethe assumption of a Poisson distribution at theindividual level for having tails that may be ldquotoofatrdquo Recently Jaap H Abbring et al (2003) andMark Israel (2004) provided evidence for neg-ative state dependence in data from auto insur-ance claims similar to the ones we useControlling for heterogeneity across individu-als these papers show that a second claim isless likely than a first This may happen due toexperience rating or perhaps to more carefuldriving and less intense use of a car after anaccident The Poisson distribution assumes thatthe second accident is just as likely so negativestate dependence may suggest thinner tails atthe individual level

To verify that the main results are not sensi-tive to this restriction of the Poisson model weperform two tests First we estimate the bench-mark model on a sample that includes onlyindividuals with one or no claims The model isstill identified using variation in policy durations

769VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

The estimates of the level of risk aversion andits dispersion across individuals remain similarto those of the benchmark model (Table 6) Theestimates of risk and the correlation with riskaversion are significantly lower But this isexpected we selected out of the sample thehigh-risk individuals

As a further test for the sensitivity of theresults to possible negative state dependencewe estimated the model with a different distri-bution at the individual level which gives riseto much thinner tails We could not find anyknown distribution of count variables that givesrise to tails thinner than Poissonrsquos so we madeone up In particular for a given Poisson ratelet pn be the probability of observing n occur-rences The distribution we take to the data issuch that

(17) Prn pn2

yenm0 pm2

Thus this distribution makes the probability ofmultiple claims much lower probably muchmore so than any realistic negative state depen-dence in the data Essentially such a distribu-tion makes the econometrician view individualswith multiple claims as high-risk individualswith much more certainty as it requires muchmore ldquobad luckrdquo for low-risk individuals tohave multiple claims The estimates for thelevel of risk aversion and its dispersion arealmost the same as in the benchmark model(Table 6) The rest of the summary figures aredifferent The difference in the level of risk isnot informative the interpretation of isslightly different given the change in the un-derlying distribution The dispersion in ismuch higher This is a direct result of the thin-tail assumption The dispersion of is identifiedby the tails of the claim distribution at theaggregate level With thinner tails imposed onthe individual distribution more heterogeneityis needed to match the observed tails at theaggregate Finally the correlation coefficient inthis specification changes signs and is now neg-ative and close to 1 This is consistent withour discussion in Section IIC which emphasizesthat the identification of the correlation coeffi-cient is closely tied to the structural assump-tions Once these are changed the estimatedcorrelation coefficient would change too Most

importantly however the mean and dispersionof risk aversion are stable

The Distribution of Absolute Risk Aver-sionmdashIn the benchmark model we assume thatthe coefficient of absolute risk aversion is lognor-mally distributed across individuals Since onlyfew previous studies focused on heterogeneity inrisk preferences there is not much existing evi-dence regarding the distribution of risk prefer-ences The only evidence we are aware of is theexperimental results presented by Steffen Andersenet al (2005) which show a skewed distributionwith a fat right tail which is qualitatively consis-tent with the lognormal distribution we assume

An additional advantage of the normality as-sumption is computational It provides a closed-form conditional distribution allowing us to useonly univariate (rather than bivariate) draws inthe estimation procedure significantly reducingthe computational burden One may be con-cerned about the sensitivity of the results to thisdistributional assumption For example it maydrive the result that the median level of riskaversion is much lower than the mean

Incorporating alternative distributional as-sumptions significantly complicates and slowsthe estimation procedure44 As an alternativewe develop a procedure that we believe pro-vides some guidance as to the sensitivity of theresults to this distributional assumption Thedisadvantage of the procedure is that it cannotaccount for adverse selection Since we foundthat adverse selection is not that important thisexercise is informative The exercise conveysthat the main qualitative results are not drivenby the (fat) tails of the lognormal distributionwe impose Rather they are driven by the highfraction of individuals who chose a low deduct-ible despite being of low risk The model im-plies that such individuals must have a fairlyhigh level of risk aversion

The exercise uses a Gibbs sampler to estimatethe lognormal distribution of i given the co-variates and the observed number of claims Ineach iteration of this Gibbs sampler conditional

44 A simple discrete-type distribution of risk aversion isnot well identified with the given data This is due to thenature of the exercise The distribution is identified bythe binary deductible choice and by the ldquolocalrdquo variation inthe pricing menu This variation is not enough to pin downthe discrete types accurately

770 THE AMERICAN ECONOMIC REVIEW JUNE 2007

on the data and the most recent draws for theindividual irsquos we compute lower bounds forthe level and dispersion of risk aversion (inaddition to generating the draws for ri from theconditional lognormal distribution) To deter-mine the lower bound for the level of risk aver-sion we compute

(18) r i i 0 if choicei 0

max0 pi

idi 113d i

if choicei 1

Namely we make ri as low as possible giventhe assumptions of the model We assume thatindividuals who chose a regular deductible arerisk neutral while individuals who chose a lowdeductible are just indifferent between the twodeductible levels (unless i is high enough inwhich case they are also assumed to be riskneutral) We then compute the average over therirsquos To compute the lower bound for the dis-persion we search for values of rirsquos that areconsistent with the assumptions of the modeland the observed data and that minimize thevariance of ri This turns out to be a simplesearch procedure that is linear in the number ofindividuals

The lower bound of the level of risk aversionthat we obtain from this exercise is 368 104

NIS1 (with standard deviation of 638 106)This level is five times lower than the analogouspoint estimate reported in Table 4 This trans-lates into indifference about participating in a50ndash50 lottery of gaining $100 and losing $885Thus it still suggests a significant level of riskaversion for lotteries of $100 magnitude Simi-larly the conversion to relative risk aversion asin Table 5 implies a relative risk aversion co-efficient of 1883 The result for the lowerbound of the dispersion is 189 103 (withstandard deviation of 816 105) which is tentimes lower than the benchmark estimate Thusthe coefficient of variation for absolute riskaversion declines only by two suggesting ahigh degree of heterogeneity Finally oneshould note that these bounds involve extremeassumptions and are computed separately andtherefore cannot be binding at the same timeThus the ldquocorrectrdquo estimates for both quantities

are likely to be higher closer to the results wereport for the benchmark model

Incomplete Information about RiskmdashThroughout we assume that individuals haveperfect information about their individual-specific objective risk type i This is a strongerassumption than we need Because expectedutility is linear in probabilities it suffices thatindividualsrsquo expected risk rate is equal to theirobjective risk rate ie i E(iIi) i wherei is a random variable representing individualirsquos perceived risk rate and Ii is individual irsquosinformation at the time of the coverage choiceNamely individuals could be uncertain abouttheir risk type but their point estimate has to becorrect

There are several channels through whichincomplete information may operate Let usconsider two such cases First suppose thatindividuals are correct but only on average iethat i i i where E(i) 0 The intuitionfor this case is similar to an ldquoerrors in variablesrdquomodel and in principle will result in an evenless important role for adverse selection Giventhat we find a relatively small effect of adverseselection this bias will not change this conclu-sion This may be even more pronounced ifcorr(i i) 0 which reflects a reasonableassumption of ldquoreversion to the meanrdquo ie thatan individualrsquos estimate of his risk type is someweighted average between his true risk type andthe average risk type of individuals who aresimilar (on observables) to him The conclusionmay go in the other way only if the mistakes goin the other direction according to which indi-viduals who are riskier than average believe thatthey are even more risky than they truly areThis we believe is less plausible

As a test for the sensitivity of the results tothe complete information assumption we per-form two exercises The first exercise estimatesthe benchmark model separately for experi-enced drivers and inexperienced drivers wherewe define an individual to be an experienceddriver if he has ten years or more driving expe-rience This is a conservative definition com-pared to those used in the literature (Chiapporiand Salanie 2000 Cohen 2005) The underlyingassumption consistent with evidence providedin Cohen (2005) is that drivers learn about theirown risk types as they gain more driving expe-rience but that most of this learning occurs

771VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

within the first ten years of driving Thus ourcomplete information model is a more appeal-ing description of experienced drivers Theresults for experienced drivers are almost iden-tical to the results from the benchmark specifi-cation (Table 6) suggesting that the mainresults are not driven by the fraction of individ-uals who are less likely to know their risk typesThe results for inexperienced drivers show asimilar pattern and magnitude Consistent withthe learning hypothesis however they show alarger dispersion in risk preferences which maybe due to incomplete information and there-fore to more random choices of deductiblelevels

The second exercise we perform is a morestructural version of the learning story whichallows even experienced drivers to have incom-plete information regarding their risk types Wedo so by estimating a different specification ofthe model that assumes that individuals areBayesian and update their information abouttheir own (stable) risk types over time usingonly information about the number of claimsthey make each year While we do not observecomplete claims histories of individuals wecan simulate such histories and integrate overthese simulations Thus individualsrsquo informa-tion would be related to their true type andwould be more precise with longer driving his-tories The full modeling and estimation detailsof this specification are provided in the Appen-dix We view this model as an extreme versionof incomplete information as there are manyother sources through which individuals maylearn about their own types and thereby havebetter information about their types than whatwe estimate them to have While the resultsimply that the levels of risk aversion and heter-ogeneity are lower than the benchmark esti-mates (Table 6) the order of magnitude andqualitative pattern are quite similar suggestingthat the main qualitative findings are robust tothe information structure

Sample SelectionmdashThere are two differentways to think about the contract selection pro-cess One possibility is that individuals firstselect an insurer based on advertisement wordof mouth or access to an agent Then individ-uals select the insurance contract from amongseveral contracts the selected insurer offersThis selection process is consistent with the way

the company executives view their business45

Another possibility is that individuals first col-lect information about all available insurancecontracts and then choose their most preferredone According to the industryrsquos conventionalwisdom individuals do not shop much acrossinsurers and therefore this selection processseems less important

The unique characteristics of direct insurersmay attract individuals who are more likely toexperiment with new ways of doing businessand may therefore be less risk averse than thegeneral population In Table 7 we compare thedemographics of our sample of policyholderswith those of the general Israeli populationThis comparison reflects a similar intuitioncompared with the general population our av-erage policyholder is slightly younger moreeducated more likely to be male and less likelyto be married or an immigrant This direction ofselection may also apply to unobserved riskpreferences thereby making our policyholderson average less risk averse than a representa-tive individual This suggests that the level ofrisk aversion that we find may be viewed as alower bound on the level of risk aversion in thegeneral Israeli population

One potential way to model sample selectionis to allow for an additional outside option to beselected For the vast majority of individuals weobserve the outside option is to purchase sim-ilar insurance from competing insurance agen-cies Unfortunately data on the structure ofcompeting contracts their prices and the waythey vary with individual characteristics are un-available This makes us uncomfortable to try tomodel sample selection as results from anysuch model will be driven by our assumptionsrather than by meaningful variation in the dataThe results are still meaningful for two reasonsFirst this is a large population accounting forabout 7 percent of all drivers in Israel Secondto the extent that our estimates suggest higherlevels of risk aversion than previously esti-mated and that the sample selection is likely to

45 As discussed in Section I the company was the firstdirect insurance provider and it offered significantly lowerpremiums than those offered by competitors due to signif-icant cost advantage In Section I we also discuss theliterature which emphasizes that choice of direct insurers isdriven primarily by nonmonetary ldquoamenitiesrdquo

772 THE AMERICAN ECONOMIC REVIEW JUNE 2007

bias these estimates downward the results arestill informative

To assess the magnitude of sample selectionin driving the main results we perform severaltests First we estimate the benchmark modelseparately for the first two years of the compa-nyrsquos operation during which the company wasthe only company to sell insurance directly andfor the remaining three years after additionaldirect insurers entered the market Second weestimate the benchmark model separately forindividuals who were referred to the companyby word of mouth and for those who heardabout the company through advertisement (pri-marily on television) The latter may havesearched more and might be more price sensi-tive Third we estimate the benchmark modelseparately for individuals who renewed theirpolicy with the company and for those who didnot renew It seems likely that individuals whodid not renew are less selected as most of themswitch back to regular insurers who insure themajority of the population Fourth we estimatethe model for the second deductible choicemade by those individuals who renew It couldbe argued that switching to other companies is

costly so outside options are not as important indriving deductible choices for those who renewAll these results are summarized in Table 6 andshow some important differences between thetwo groups within each pair of subsamples prob-ably reflecting selection For all groups howeverthe qualitative pattern of the results and the orderof magnitude and dispersion of risk aversion aresimilar to those of the full sample This is sugges-tive that correcting for sample selection is unlikelyto change the qualitative results

E Caveats

Moral HazardmdashThroughout our analysiswe abstract from moral hazard ie we assumethat i can vary across individuals but is invari-ant to the coverage choice There are two typesof moral hazard that may play a role in thiscontext First individuals with less coveragemay take greater precaution and drive morecarefully thereby reducing their claim risk rateSecond conditional on a claim event peoplewith higher deductibles are less likely to file aclaim there exists a range of claims for which

TABLE 7mdashREPRESENTATIVENESS

Variable Sampleb Populationc Car ownersd

Agea 4114 (1237) 4255 (1801) 4511 (1413)Female 0316 0518 0367Family Single 0143 0233 0067

Married 0780 0651 0838Divorced 0057 0043 0043Widower 0020 0074 0052

Education Elementary 0029 0329 0266High school 0433 0384 0334Technical 0100 0131 0165College 0438 0155 0234

Emigrant 0335 0445 0447

Obs 105800 723615 255435

a For the age variable the only continuous variable in the table we provide both the mean and the standard deviation (inparentheses)

b The figures are derived from Table 1 The family and education variables are renormalized so they add up to one weignore those individuals for whom we do not have family status or education level This is particularly relevant for theeducation variable which is not available for about half of the sample it seems likely that unreported education levels arenot random but associated with lower levels of education This may help in explaining at least some of the gap in educationlevels across the columns

c This column is based on a random sample of the Israeli population as of 1995 We use only adult population ieindividuals who are 18 years of age or older

d This column is based on a subsample of the population sample The data provide information about car ownership onlyat the household level not at the individual level Thus we define an individual as a car owner if (a) the household owns atleast one car and the individual is the head of the household or (b) the household owns at least two cars and the individualis the spouse of the head of the household

773VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

filing is profitable only under a low deductible(the literature often refers to this second effectas ldquoex post moral hazardrdquo)

To the extent that moral hazard exists abstract-ing from it will likely bias our estimates of riskaversion downward To see this note that adjust-ing behavior will help individuals to self-insureagainst uninsured costs Similarly ex post moralhazard would reduce the value of a low deduct-ible as in the event of a claim the gain from alower deductible would sometimes be less thanthe difference between the two deductible levelsBoth these effects will make a low deductible lessattractive requiring individuals to be even morerisk averse than we estimate in order to purchasemore coverage Below we discuss why in ourview abstracting from moral hazard is a reason-able approximation in this setting

It seems reasonable to conjecture that ceterisparibus insured individuals will drive less care-fully than uninsured ones It may also seemreasonable that the existence of a deductiblemay make individuals more careful about smalldamages to their car However when all choicesinclude a deductible and deductibles are similarin their magnitude it seems less likely thatdriving or care behavior will be affected46 Fi-nally to separately identify moral hazard onewould need another dimension of the data overwhich risk types remain fixed but coveragechoices vary exogenously (Israel 2004 Jaap HAbbring et al forthcoming)

We rely on the data to justify why we abstractfrom the second potential effect that of ex postmoral hazard Figure 3 presents data on theclaim amounts and shows that about 99 percentof the claims filed by policyholders with low-deductible policies were for amounts greaterthan the higher deductible level If the distribu-tion of amounts of potential claims does notvary with the deductible choice and if individ-uals file a claim for any loss that exceeds their

deductible level this suggests that 99 percent ofthe claims would have been filed under eitherdeductible choice making the assumption toabstract from moral hazard not very restrictive

Individuals however may choose not to filea claim even when the claim amount exceedsthe deductible level This may happen due toexperience rating which increases future insur-ance premiums These dynamic effects do notdepend on the deductible level at the time of theclaim so they simply enter in an additive wayUsing our data on individuals who renew theirpolicies with the company we can assess howbig the dynamic effects are These data showthat the price effect of a claim lasts for threeyears and is highest when an individual files hissecond claim within a year In such a case hewould face about a 20 percent increase in hisinsurance premium in the subsequent year 10percent in the year after and 5 percent in thethird year after the claim The regular premiumis about twice the regular deductible amount soan upper bound for the dynamic costs is 70percent of the regular deductible In most casesthe actual dynamic costs are much lower thanthis upper bound (the dynamic costs of say thefirst claim within a year are close to zero) Inaddition an individual can opt out of the con-tract and switch to a different insurance pro-vider This is likely to reduce his dynamic costsbecause in Israel unlike in the United Statesand many other countries there is no publicrecord of past claims Therefore insurance pro-viders can take full advantage of past recordsonly for their past customers For this reasonnew customers will of course face higher pre-miums than existing ones but the premium in-crease would not be as high as it would havebeen with the old insurance provider This isdue to the presence of ldquoinnocentrdquo new custom-ers who are pooled together with the switchers(Cohen 2003)47 Using 70 percent as a conser-vative upper bound Figure 3 shows that about93 percent of those claims filed by individualswith a low deductible would have also been46 This assumption is consistent with Cohen and Einav

(2003) who find no evidence for behavioral response tochanges in seat belt laws The following anecdotal obser-vation also supports this In an informal survey we con-ducted among our colleagues all of them were aware of adeductible in their auto insurance policy but fewer than 20percent knew its level This does not imply that 80 percentof our colleagues did not pay attention to their deductiblechoice at the time the choice was made It does implyhowever that their driving or care behavior cannot dependon the deductible level

47 New customers may voluntarily report their claimshistory to their new insurance provider Voluntary disclo-sure of past claims is as may be expected not truthful Ourdata suggest an unconditional claim rate of 02453 in oursample population Our data on claims history as voluntar-ily disclosed by the same individuals suggest a claim rate of00604 which is four times lower

774 THE AMERICAN ECONOMIC REVIEW JUNE 2007

filed with a regular deductible While this is notnegligible it applies for only a small fraction ofindividuals For the vast majority of them the99 percent discussed above is a more appropri-ate benchmark Therefore ex post moral hazardis unlikely to play a major role in this settingand one can abstract from the loss distributionand focus on claim rates as we do in thispaper48

Additional Cost of an AccidentmdashOur modelassumes that in the event of an accident theonly incurred costs are those associated with thedeductible payment In practice however other

transaction costs may be incurred such as thetime spent for appraisal of the damage the costsassociated with renting a replacement car for theduration of a repair etc Such costs could bereadily incorporated into the model To illus-trate we assume that these costs are known inadvance and are given by a constant c (whichcould in principle vary with each individual)Since c will not vary with the chosen level ofdeductible it will not affect the value d andwill enter the empirical model only through itseffect on d In particular equation (7) willchange to

(19) r

p

d 1

d c

and everything else will remain the same

48 We would not be as comfortable with this statementfor the choice of high and very high deductibles which areat much higher levels This is one additional reason to focusonly on the choice between low and regular deductibles

FIGURE 3 CLAIM DISTRIBUTIONS

Notes This figure plots kernel densities of the claim amounts estimated separately depending on the deductible choice Forease of comparison we normalize the claim amounts by the level of the regular deductible (ie the normalization is invariantto the deductible choice) and truncate the distribution at ten (the truncated part which includes a fat tail outside of the figureaccounts for about 25 percent of the distribution and is roughly similar for both deductible choices) The thick line presentsthe distribution of the claim amounts for individuals who chose a low deductible while the thin line does the same for thosewho chose a regular deductible Clearly both distributions are truncated from below at the deductible level The figure showsthat the distributions are fairly similar Assuming that the claim amount distribution is the same the area below the thickerline between 06 and 1 is the fraction of claims that would fall between the two deductible levels and therefore (absentdynamic incentives) would be filed only if a low deductible were chosen This area (between the two dotted vertical lines)amounts to 13 percent implying that the potential bias arising from restricting attention to claim rate (and abstracting fromthe claim distribution) is quite limited As we discuss in the text dynamic incentives due to experience rating may increasethe costs of filing a claim shifting the region in which the deductible choice matters to the right an upper bound to these costsis about 70 percent of the regular deductible covering an area (between the two dashed vertical lines) that integrates to morethan 7 percent Note however that these dynamic incentives are a very conservative upper bound they apply to less than 15percent of the individuals and do not account for the exit option which significantly reduces these dynamic costs

775VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

This implies that in principle such costs willhave no effect on the results of the counterfac-tual exercise we present later The costs willhowever affect the interpretation of the esti-mates of risk aversion In particular instead ofthe distribution of r we will now be estimatingthe distribution of r(d c)d so the reportedestimates of the coefficient of absolute riskaversion will be biased upward The magnitudeof the bias depends on the size of this transac-tion cost c compared to the average deductibled If c is relatively small the bias is negligibleIf however c is as big as the (average) deduct-ible level all our reported estimates of the levelof risk aversion should be divided by two (butthe coefficients on observables which are semi-elasticities will not change) The intuitionwould be similar but more involved if c variesacross individuals but not proportionally to d

Data about such transaction costs are ofcourse not available The following back-of-the-envelope exercise may provide some guidanceas to the magnitude of such costs We collecteddata from the Israeli police about the annual num-bers of accidents49 accidents with fatalities andaccidents with severe injuries in Israel for theyears 1996ndash1999 We then divided these numbersby our estimate of the total number of auto insur-ance claims in Israel (161859)50 We obtain that159 percent of the claims involve reported acci-dents 22 percent involve accidents with severeinjuries and 03 percent involve fatal accidentsThus the majority of claims reflect small unre-ported accidents perhaps suggesting that theseadditional costs of a claim are often not too large

Deviations from Expected Utility TheorymdashThroughout the paper we restrict attention toexpected utility maximizers Despite much ev-idence in the literature against some of the pre-dictions of expected utility theory it still seemsto us the most natural benchmark to specify andone that facilitates comparison to previous stud-ies We note that expected utility theory is as-sumed it is not and cannot be tested within our

framework Given our cross-sectional analysiswhich in principle allows flexible forms ofunobserved heterogeneity in risk preferencesthere are no testable restrictions imposed byexpected utility theory We should also note thatmuch (but not all) of the documented evidenceagainst expected utility theory arises with ex-treme risk probabilities which are close to zeroor one Our data (and our estimates) are basedon risk probabilities that are all in the range of010 to 035 Over this range expected utilityseems to perform better Finally it is importantto stress two points First at the conceptuallevel it is straightforward to use an alternativetheory of decisions under uncertainty If con-ditional on objective risk individuals vary in asingle dimension the same conceptual modeland empirical strategy can be applied All oneneeds to do is to specify the parameter overwhich decisions vary and construct an indiffer-ence set in the space of the specified parameterand (objective) risk types similar to the onepresented in Figure 2 Second any alternativemodel of decisions under uncertainty would re-quire us to take an even stronger view regardingthe parameterized objective function For exam-ple prospect theory (Daniel Kahneman andAmos Tversky 1979) would require us to pa-rameterize not only the curvature of individu-alsrsquo utility functions but also their referencepoints for which there is no natural choice inour context Similar issues arise if we try toapply decision weights (Tversky and PeterWakker 1995) or measures of overconfidencewith respect to driving ability

F Implications for Profits and Pricing

We now look at how firm profits vary withalternative pricing schemes This exercise isinteresting for several reasons First althoughwe do not use supply-side information for esti-mation it shows how one may incorporate suchinformation in estimation Second it illustratesthe conceptual trade-off faced by a monopolistthat operates in a market with adverse selectionAlthough the conceptual trade-off betweenhigher demand and worse selection is wellknown and has been extensively analyzed in thetheoretical literature quantifying it is importantto understand its empirical relevance Finallywe argued earlier that unobserved heterogeneityin risk aversion seems more important than un-

49 An accident is counted in this measure if it was re-ported to the police and a police officer arrived at the scene

50 This is estimated by taking the number of activepolicies at the end of our sample (45871) dividing it by ourbest guess for the share of the market the company had atthe time (7 percent) and multiplying it by the estimatedclaim rate in our data as computed in Table 2 (0245)

776 THE AMERICAN ECONOMIC REVIEW JUNE 2007

observed heterogeneity in risk The current ex-ercise shows that this conclusion also translatesto pricing and profits

Throughout this section we hold fixed thedistribution of risk and risk aversion the firmfaces Since we have little information about thedeterminants of overall demand faced by thefirm and hence how restrictive it is to hold thedistribution fixed we make the simplifying as-sumption that individuals make their choicessequentially They first choose the insuranceprovider by observing only the price of theregular deductible Once they decide to buy apolicy from the insurer they choose the deduct-ible level This seems a reasonable approxima-tion as the regular deductible is the one alwaysadvertised and initially quoted while the otheroptions are revealed only once the potentialcustomer and the insurance sales person ldquogetinto detailsrdquo Consistent with this assumptionwe assume that the regular premium and de-ductible are dictated by competitive conditionsand we focus on the choice of the low deduct-ible and its associated premium

From the companyrsquos standpoint each indi-vidual can be represented by a random draw of(i ri) from the conditional (on observables)distribution of risk and risk aversion

(20) ln i

ln ri Nxi

xi

2 r

r r2

When analyzing the optimal menu to offer suchan individual the company is assumed to berisk neutral and to maximize expected profitsSuppose the company offered only the regulardeductible and premium (dh ph) Let the ex-pected profits from this strategy be 0 Givenour assumptions we proceed by analyzing howthe firmrsquos profits are affected by a choice of anadditional low deductible option (dl pl) withdl dh and pl ph It is easy to use a changein variables and analyze the choice of d dh dl and p pl ph Expected profits arenow given by

(21) maxdp

0 Prri rii d pp

d Eiri rii d p

The trade-off in the companyrsquos decision isstraightforward Each new customer who chooses

the low combination pays an additional p up-front but saves d for each accident she is in-volved in This translates into two effects thatenter the companyrsquos decision problem The first issimilar to a standard pricing problem higher(lower) price difference (deductible difference)p (d) leads to a higher markup (on those indi-viduals who select the low deductible) but tolower quantity (or probability of purchase) asfewer individuals elect to choose the low deduct-ible This effect enters the profit function throughD(d p) Pr(ri ri(i d p)) The secondcomposition effect arises because of adverse se-lection As the price of the low deductible in-creases those individuals who still elect to choosethe low combination are ceteris paribus thosewith higher risk This effect enters through (dp) E(iri ri(i d p)) Its magnitude andsign depend on the relative heterogeneity of i andri and on the correlation between them Sinceneither D(d p) nor (d p) has a closed-form solution we analyze this decision problemgraphically where D(d p) and (d p) arenumerically computed using simulations from thejoint distribution

We illustrate our analysis by using the meanindividual in the data whose expected ln(i)and ln(ri) are 154 and 1181 respectively(based on Table 4) Such an individual has tochoose between a regular contract of ( phdh) (3189 1595) (in NIS) and a lowcontract of ( pl dl) (3381 957) ie (pd) (191 638) Below we discuss severaladditional counterfactual cases First we con-sider a case with a negative correlation be-tween risk and risk aversion (with the samemagnitude ie 084) Second we con-sider cases when the company ignores unob-served heterogeneity in one of the dimensionsie it views individuals as a draw from theestimated marginal distribution on one dimen-sion with the other dimension known and fixedat its estimated mean We do this exercise foreach dimension separately

To get intuition for the different effects Figure2 presents the estimated distribution in the spaceof (i ri) A small increase (decrease) in p (d)shifts the indifference setup to the right therebymaking some marginal individuals who were pre-viously just to the right of it switch to choosingthe regular deductible The demand trade-off isjust the comparison between the marginal loss ofthe company from all the marginal individuals

777VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

who no longer buy higher coverage with thehigher profits made from the inframarginal indi-viduals who still elect to choose higher coverageFigure 2 also helps in illustrating the effect ofadverse selection and the importance of the cor-relation coefficient As the menu shifts to the rightthe positive correlation implies that the marginalindividuals have higher risk than the average Thismeans that ldquolosingrdquo them (namely havingthem buy less coverage) is not as costly forthe insurance company as such individualsare on average more adversely selected Anegative correlation for example would havemade these marginal individuals more valuablethereby decreasing the incentive to increase pricesor deductibles from the current levels

Figure 4 presents the effects of pricing onprofits by varying the low-deductible levelkeeping the premium charged for it fixed at theobserved price (of p 191) It implies that the

current low deductible benefit of 638 NIS re-sults in additional annual profits of about 368NIS per customer This is about 037 percent oftotal operating profits per customer which isabout 1000 NIS Note however that after sub-tracting the administrative and claim-handlingcosts associated with each customer and claimthe relative magnitude of this effect will behigher Note also that the estimates imply thatthe current low-deductible level is suboptimalBy setting a smaller low-deductible benefit ofd 355 NIS (ie increasing the current lowdeductible by 283 NIS) additional profits canbe increased to 659 NIS51 Of course the find-

51 There is no reason of course to limit the choice of thecompany to only one additional deductible level Moredegrees of freedom in choosing the pricing menu will leadto higher profits

FIGURE 4 COUNTERFACTUALSmdashPROFITS

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) We plot the additional profits fromoffering a low deductible as a function of the attractiveness of a low deductible d dh dl fixing its price at the observedlevel The thick solid line presents the counterfactual profits as implied by the estimated benchmark model The other threecurves illustrate how the profits change in response to changes in the assumptions when the correlation between riskand risk aversion is negative (thin solid line) when there is no heterogeneity in risk aversion (dot-dashed line) and whenthere is no heterogeneity in risk (dashed line) The maxima (argmax) of the four curves respectively are 659 (355)714 (583) 0 and 704 (500) The dotted vertical line represents the observed level of d (638) which implies that theadditional profits from the observed low deductible are 368 NIS per policy

778 THE AMERICAN ECONOMIC REVIEW JUNE 2007

ing that the current pricing is suboptimal mayalso be due to a limitation of the model

Consistent with the intuition discussed aboveFigure 4 also shows that the incentive to in-crease prices (or lower the deductible) is higherwith positive correlation When the correlationbetween risk and risk aversion is negative theoptimal low deductible is lower It also showsthat ignoring either dimension of unobservedheterogeneity has an important effect on pric-ing decisions However while ignoring het-erogeneity in risk does not change thequalitative pattern by much ignoring hetero-geneity in risk aversion completely changesthe pricing analysis In fact given the esti-mated coefficients and the observed prices ifindividuals vary only in risk offering a lowdeductible does not increase profits due to ad-verse selection

Figure 5 breaks down the effect on profits bypresenting the pricing effect on the demand forlow deductible D(d p) and on the compo-sition effect (d p) The former is simplygenerated by the distribution of certainty equiv-alents implied by the joint distribution of i andri (Landsberger and Meilijson 1999) It is S-shaped due to the assumption of lognormal dis-tribution The shape of the composition effect isdriven by the relative variance of i and ri andby the correlation coefficient As the estimatesimply that most of the variation in certaintyequivalents is driven by variation in ri thestrong positive correlation implies that the com-position effect is monotonically decreasing inthe deductible level As the low-deductible op-tion becomes more favorable more peoplechoose it with the most risky individuals choos-ing it first The effect of the deductible level onthe composition effect is dramatically differentwhen the correlation between risk and risk aver-sion is negative When this is the case theobserved relationship between the deductiblelevel and the composition effect is mostly re-versed This is because the effect of risk aver-sion dominates that of adverse selection due toits higher variance

IV Concluding Remarks

The paper makes two separate contributionsFirst from a methodological standpoint we layout a conceptual framework through which one

can formulate and estimate a demand system forindividually customized contracts The key datarequirements for this approach are contractchoices individual choice sets and ex post riskrealizations Since such data may be available inmany other contexts the methodological frame-work may be useful to uncover structural pa-rameters in such settings As an example onecould consider annuity data and use guaranteeperiod choices and mortality data to identifyheterogeneity in risk (mortality) and in prefer-ences for wealth after death (Einav Finkelsteinand Paul Schrimpf 2006) Similarly one couldconsider loan data and use down-paymentchoices and default data to identify heterogene-ity in risk (default) and in liquidity (Will Ad-ams Einav and Jonathan D Levin 2006)Second from an economic standpoint we pro-vide a new set of estimates for the degree andheterogeneity of (absolute) risk aversion and itsrelationship with risk We discuss these below

While our estimates of risk aversion help topredict other related insurance decisions it isnatural to ask to what extent these parametersare relevant in other contexts This is essentiallyan empirical question which can be answeredonly by estimating risk aversion parameters fora variety of bet sizes and in a variety of con-texts Since isolating risk preferences in manycontexts is hard such exercises are rare leavingus with no definite answer for the scope ofmarkets for which our estimates may be rele-vant On one hand Rabin (2000) and Rabin andThaler (2001) argue that different decisions inlife are taken in different contexts and thereforemay be subject to different parameters in theutility function On the other hand classicaltheory suggests that each individual has onevalue function over her lifetime wealth so allrisky decisions take into account the same valuefunction and are therefore subject to the samerisk preferences Our view is somewhere inbetween We are more comfortable with extrap-olation of our risk aversion estimates to setupsthat are closer to the auto insurance marketcontext in which these parameters are esti-mated To assess ldquoclosenessrdquo it is important toconsider various factors over which contextsmay differ Such factors may include bet size aswell as informational and behavioral effectssuch as default options framing and rarity of theevents Since we use claim data to identify risktype we leave everything else to be interpreted as

779VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

FIGURE 5 COUNTERFACTUALSmdashSELECTION

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) Parallel to Figure 4 we breakdown the effects on profits to the share of consumers who choose a low deductible (bottom panel) and to the expected riskof this group (top panel) This is presented by the thick solid line for the estimates of the benchmark model As in Figure 4 wealso present these effects for three additional cases when the correlation between risk and risk aversion is negative (thin solidline) when there is no heterogeneity in risk aversion (dot-dashed line) and when there is no heterogeneity in risk (dashedline) The dotted vertical lines represent the observed level of d (638) for which the share of low deductible is 16 percentand their expected risk is 0264 This may be compared with the corresponding figures in Table 2A of 178 and 0309respectively Note however that the figure presents estimated quantities for the average individual in the data while Table2A presents the average quantities in the data so one should not expect the numbers to fit perfectly

780 THE AMERICAN ECONOMIC REVIEW JUNE 2007

risk aversion As an example overconfidence willbe captured by a lower level of estimated riskaversion so if overconfidence is more importantin auto insurance than in health insurance whenextrapolated to health insurance individuals maybehave as if they are more risk averse than weestimate them to be

With this caveat in mind let us discuss ourfour main findings First we find large hetero-geneity in risk preferences across individualsThis heterogeneity is important in various con-texts (a) it cautions against using pure cross-sectional variation to test expected utilitytheory (b) it may create strong selection ofparticipants into particular markets this maymake participants in voluntary markets quitedifferent in their risk preferences from those inmarkets in which participation is either manda-tory or driven by other factors and (c) it maymake the marginal individual quite differentfrom the average one Models in macroeconom-ics and finance which often use a representativeindividual framework may not be able to cap-ture and account for such differences

Our second set of findings concerns the wayrisk aversion relates to observable characteris-tics Our finding that women are more riskaverse than men has been documented in othersettings The finding that risk preferences ex-hibit a U-shaped pattern over the life cycle maybe interesting to explore further in other con-texts Other findings suggest that the estimatedcoefficient of risk aversion increases with ob-servables that are related to income and wealthAs we agree with the widely held belief of thedecreasing absolute risk aversion property ourpreferred interpretation for this finding is thatwealth and income may be endogenous gener-ating the estimated cross-sectional relationshipWhile lower risk aversion may be associatedwith higher propensity to become an entrepre-neur and thereby have higher wealth it mayalso be associated with lower propensity to saveor invest in education affecting wealth the otherway Therefore one important message of thisfinding is that accounting for heterogeneity inpreferences may be important as representativeconsumer models may provide misleading in-terpretations for otherwise natural results

The last two sets of findings concern therelationship between risk and preferences Sincerisk is particular to the context in which it ismeasured these findings may be sensitive to the

market context and may change once risk takesother forms Even within the auto insurancemarket it is important how risk is measuredMoreover risk in the auto insurance marketmay be conceptually different from risk in othermarkets In many markets risk is independentacross individuals In auto insurance howevermuch of the risk depends on coordinationamong drivers and therefore may be more re-lated to relative not absolute characteristicsFor this reason our finding of positive correla-tion between risk and risk aversion can coexistwith findings of negative correlation in othercontexts (Israel 2005 Finkelstein and McGarry2006) The positive correlation we find is alsoconsistent with the fact that the bivariate probittest in our data provides evidence for adverseselection (Cohen 2005) while similar reduced-form tests in other contexts do not

Finally we find that unobserved heterogene-ity in risk preferences is more important thanheterogeneity in risk This may be driven by thecasual evidence that insurance companies exertmuch effort and resources in collecting con-sumer data which are informative about riskclassification but not about preferences52 Weillustrate the empirical importance of our find-ings for the analysis of optimal contracts in autoinsurance The presence of more than one di-mension of unobserved heterogeneity may dra-matically change the nature of these contractsTheory is still not fully developed for suchmultidimensional screening problems as it typ-ically requires a small number of types (Lands-berger and Meilijson 1999) restricts the twodimensions to be independent of each other(Jean-Charles Rochet and Lars A Stole 2002)or assumes that the number of instrumentsavailable to the monopolist is not smaller thanthe dimension of unobserved heterogeneity(Steven Matthews and John Moore 1987 Rich-ard Arnott and Joseph E Stiglitz 1988)53 MarkArmstrong (1999) may be the closest theoreticalwork to the framework suggested here It cannot

52 This choice of data collection efforts may be justifiedif it is easier for such firms to price discriminate based onrisk but harder to price discriminate based on preferencesA common belief is that without cost-based (ie risk-based) justification for prices price discrimination may leadto consumer backlash

53 See also Smart (2000) Villeneuve (2003) and Jullienet al (2007) for related theoretical results

781VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

be directly applied however as it uses simpli-fying linearity assumptions which would behard to impose in the current context Our re-sults indicate that many applications can benefitfrom extending the theory to include the more

general case such as the one analyzed hereSuch a theory may also serve as a guide forusing supply-side restrictions in similar con-texts Our counterfactual analysis is a very pre-liminary start in this direction

APPENDIX A DESCRIPTION OF THE GIBBS SAMPLER

In this appendix we describe the setup of the Gibbs sampler that we use to estimate the modelOne of the main advantages of the Gibbs sampler is its ability to allow for data augmentation oflatent variables (Tanner and Wong 1987) In our context this amounts to augmenting the individual-specific risk aversion and risk type namely i rii1

n as additional parametersWe can write the model as follows

(22) ln i xi i

(23) ln ri xi vi

(24) choicei 1 if ri ri ()0 if ri ri ()

(25) claimsi Poissoni ti

(26) i

vi

iid

N00

2 r

r r2

Let

2 r

r r2 X

x1

xn

0

0 x1

xn

y r and ui i

vi

The set of parameters for which we want to have a posterior distribution is given by uii1

n The prior specifies that are independent of uii1n have a conventional

diffuse prior We adopt a hierarchical prior for uii1N

(27) uii 1n

iid

N0

(28) 1 Wishart2a Q

782 THE AMERICAN ECONOMIC REVIEW JUNE 2007

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 26: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

The estimates of the level of risk aversion andits dispersion across individuals remain similarto those of the benchmark model (Table 6) Theestimates of risk and the correlation with riskaversion are significantly lower But this isexpected we selected out of the sample thehigh-risk individuals

As a further test for the sensitivity of theresults to possible negative state dependencewe estimated the model with a different distri-bution at the individual level which gives riseto much thinner tails We could not find anyknown distribution of count variables that givesrise to tails thinner than Poissonrsquos so we madeone up In particular for a given Poisson ratelet pn be the probability of observing n occur-rences The distribution we take to the data issuch that

(17) Prn pn2

yenm0 pm2

Thus this distribution makes the probability ofmultiple claims much lower probably muchmore so than any realistic negative state depen-dence in the data Essentially such a distribu-tion makes the econometrician view individualswith multiple claims as high-risk individualswith much more certainty as it requires muchmore ldquobad luckrdquo for low-risk individuals tohave multiple claims The estimates for thelevel of risk aversion and its dispersion arealmost the same as in the benchmark model(Table 6) The rest of the summary figures aredifferent The difference in the level of risk isnot informative the interpretation of isslightly different given the change in the un-derlying distribution The dispersion in ismuch higher This is a direct result of the thin-tail assumption The dispersion of is identifiedby the tails of the claim distribution at theaggregate level With thinner tails imposed onthe individual distribution more heterogeneityis needed to match the observed tails at theaggregate Finally the correlation coefficient inthis specification changes signs and is now neg-ative and close to 1 This is consistent withour discussion in Section IIC which emphasizesthat the identification of the correlation coeffi-cient is closely tied to the structural assump-tions Once these are changed the estimatedcorrelation coefficient would change too Most

importantly however the mean and dispersionof risk aversion are stable

The Distribution of Absolute Risk Aver-sionmdashIn the benchmark model we assume thatthe coefficient of absolute risk aversion is lognor-mally distributed across individuals Since onlyfew previous studies focused on heterogeneity inrisk preferences there is not much existing evi-dence regarding the distribution of risk prefer-ences The only evidence we are aware of is theexperimental results presented by Steffen Andersenet al (2005) which show a skewed distributionwith a fat right tail which is qualitatively consis-tent with the lognormal distribution we assume

An additional advantage of the normality as-sumption is computational It provides a closed-form conditional distribution allowing us to useonly univariate (rather than bivariate) draws inthe estimation procedure significantly reducingthe computational burden One may be con-cerned about the sensitivity of the results to thisdistributional assumption For example it maydrive the result that the median level of riskaversion is much lower than the mean

Incorporating alternative distributional as-sumptions significantly complicates and slowsthe estimation procedure44 As an alternativewe develop a procedure that we believe pro-vides some guidance as to the sensitivity of theresults to this distributional assumption Thedisadvantage of the procedure is that it cannotaccount for adverse selection Since we foundthat adverse selection is not that important thisexercise is informative The exercise conveysthat the main qualitative results are not drivenby the (fat) tails of the lognormal distributionwe impose Rather they are driven by the highfraction of individuals who chose a low deduct-ible despite being of low risk The model im-plies that such individuals must have a fairlyhigh level of risk aversion

The exercise uses a Gibbs sampler to estimatethe lognormal distribution of i given the co-variates and the observed number of claims Ineach iteration of this Gibbs sampler conditional

44 A simple discrete-type distribution of risk aversion isnot well identified with the given data This is due to thenature of the exercise The distribution is identified bythe binary deductible choice and by the ldquolocalrdquo variation inthe pricing menu This variation is not enough to pin downthe discrete types accurately

770 THE AMERICAN ECONOMIC REVIEW JUNE 2007

on the data and the most recent draws for theindividual irsquos we compute lower bounds forthe level and dispersion of risk aversion (inaddition to generating the draws for ri from theconditional lognormal distribution) To deter-mine the lower bound for the level of risk aver-sion we compute

(18) r i i 0 if choicei 0

max0 pi

idi 113d i

if choicei 1

Namely we make ri as low as possible giventhe assumptions of the model We assume thatindividuals who chose a regular deductible arerisk neutral while individuals who chose a lowdeductible are just indifferent between the twodeductible levels (unless i is high enough inwhich case they are also assumed to be riskneutral) We then compute the average over therirsquos To compute the lower bound for the dis-persion we search for values of rirsquos that areconsistent with the assumptions of the modeland the observed data and that minimize thevariance of ri This turns out to be a simplesearch procedure that is linear in the number ofindividuals

The lower bound of the level of risk aversionthat we obtain from this exercise is 368 104

NIS1 (with standard deviation of 638 106)This level is five times lower than the analogouspoint estimate reported in Table 4 This trans-lates into indifference about participating in a50ndash50 lottery of gaining $100 and losing $885Thus it still suggests a significant level of riskaversion for lotteries of $100 magnitude Simi-larly the conversion to relative risk aversion asin Table 5 implies a relative risk aversion co-efficient of 1883 The result for the lowerbound of the dispersion is 189 103 (withstandard deviation of 816 105) which is tentimes lower than the benchmark estimate Thusthe coefficient of variation for absolute riskaversion declines only by two suggesting ahigh degree of heterogeneity Finally oneshould note that these bounds involve extremeassumptions and are computed separately andtherefore cannot be binding at the same timeThus the ldquocorrectrdquo estimates for both quantities

are likely to be higher closer to the results wereport for the benchmark model

Incomplete Information about RiskmdashThroughout we assume that individuals haveperfect information about their individual-specific objective risk type i This is a strongerassumption than we need Because expectedutility is linear in probabilities it suffices thatindividualsrsquo expected risk rate is equal to theirobjective risk rate ie i E(iIi) i wherei is a random variable representing individualirsquos perceived risk rate and Ii is individual irsquosinformation at the time of the coverage choiceNamely individuals could be uncertain abouttheir risk type but their point estimate has to becorrect

There are several channels through whichincomplete information may operate Let usconsider two such cases First suppose thatindividuals are correct but only on average iethat i i i where E(i) 0 The intuitionfor this case is similar to an ldquoerrors in variablesrdquomodel and in principle will result in an evenless important role for adverse selection Giventhat we find a relatively small effect of adverseselection this bias will not change this conclu-sion This may be even more pronounced ifcorr(i i) 0 which reflects a reasonableassumption of ldquoreversion to the meanrdquo ie thatan individualrsquos estimate of his risk type is someweighted average between his true risk type andthe average risk type of individuals who aresimilar (on observables) to him The conclusionmay go in the other way only if the mistakes goin the other direction according to which indi-viduals who are riskier than average believe thatthey are even more risky than they truly areThis we believe is less plausible

As a test for the sensitivity of the results tothe complete information assumption we per-form two exercises The first exercise estimatesthe benchmark model separately for experi-enced drivers and inexperienced drivers wherewe define an individual to be an experienceddriver if he has ten years or more driving expe-rience This is a conservative definition com-pared to those used in the literature (Chiapporiand Salanie 2000 Cohen 2005) The underlyingassumption consistent with evidence providedin Cohen (2005) is that drivers learn about theirown risk types as they gain more driving expe-rience but that most of this learning occurs

771VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

within the first ten years of driving Thus ourcomplete information model is a more appeal-ing description of experienced drivers Theresults for experienced drivers are almost iden-tical to the results from the benchmark specifi-cation (Table 6) suggesting that the mainresults are not driven by the fraction of individ-uals who are less likely to know their risk typesThe results for inexperienced drivers show asimilar pattern and magnitude Consistent withthe learning hypothesis however they show alarger dispersion in risk preferences which maybe due to incomplete information and there-fore to more random choices of deductiblelevels

The second exercise we perform is a morestructural version of the learning story whichallows even experienced drivers to have incom-plete information regarding their risk types Wedo so by estimating a different specification ofthe model that assumes that individuals areBayesian and update their information abouttheir own (stable) risk types over time usingonly information about the number of claimsthey make each year While we do not observecomplete claims histories of individuals wecan simulate such histories and integrate overthese simulations Thus individualsrsquo informa-tion would be related to their true type andwould be more precise with longer driving his-tories The full modeling and estimation detailsof this specification are provided in the Appen-dix We view this model as an extreme versionof incomplete information as there are manyother sources through which individuals maylearn about their own types and thereby havebetter information about their types than whatwe estimate them to have While the resultsimply that the levels of risk aversion and heter-ogeneity are lower than the benchmark esti-mates (Table 6) the order of magnitude andqualitative pattern are quite similar suggestingthat the main qualitative findings are robust tothe information structure

Sample SelectionmdashThere are two differentways to think about the contract selection pro-cess One possibility is that individuals firstselect an insurer based on advertisement wordof mouth or access to an agent Then individ-uals select the insurance contract from amongseveral contracts the selected insurer offersThis selection process is consistent with the way

the company executives view their business45

Another possibility is that individuals first col-lect information about all available insurancecontracts and then choose their most preferredone According to the industryrsquos conventionalwisdom individuals do not shop much acrossinsurers and therefore this selection processseems less important

The unique characteristics of direct insurersmay attract individuals who are more likely toexperiment with new ways of doing businessand may therefore be less risk averse than thegeneral population In Table 7 we compare thedemographics of our sample of policyholderswith those of the general Israeli populationThis comparison reflects a similar intuitioncompared with the general population our av-erage policyholder is slightly younger moreeducated more likely to be male and less likelyto be married or an immigrant This direction ofselection may also apply to unobserved riskpreferences thereby making our policyholderson average less risk averse than a representa-tive individual This suggests that the level ofrisk aversion that we find may be viewed as alower bound on the level of risk aversion in thegeneral Israeli population

One potential way to model sample selectionis to allow for an additional outside option to beselected For the vast majority of individuals weobserve the outside option is to purchase sim-ilar insurance from competing insurance agen-cies Unfortunately data on the structure ofcompeting contracts their prices and the waythey vary with individual characteristics are un-available This makes us uncomfortable to try tomodel sample selection as results from anysuch model will be driven by our assumptionsrather than by meaningful variation in the dataThe results are still meaningful for two reasonsFirst this is a large population accounting forabout 7 percent of all drivers in Israel Secondto the extent that our estimates suggest higherlevels of risk aversion than previously esti-mated and that the sample selection is likely to

45 As discussed in Section I the company was the firstdirect insurance provider and it offered significantly lowerpremiums than those offered by competitors due to signif-icant cost advantage In Section I we also discuss theliterature which emphasizes that choice of direct insurers isdriven primarily by nonmonetary ldquoamenitiesrdquo

772 THE AMERICAN ECONOMIC REVIEW JUNE 2007

bias these estimates downward the results arestill informative

To assess the magnitude of sample selectionin driving the main results we perform severaltests First we estimate the benchmark modelseparately for the first two years of the compa-nyrsquos operation during which the company wasthe only company to sell insurance directly andfor the remaining three years after additionaldirect insurers entered the market Second weestimate the benchmark model separately forindividuals who were referred to the companyby word of mouth and for those who heardabout the company through advertisement (pri-marily on television) The latter may havesearched more and might be more price sensi-tive Third we estimate the benchmark modelseparately for individuals who renewed theirpolicy with the company and for those who didnot renew It seems likely that individuals whodid not renew are less selected as most of themswitch back to regular insurers who insure themajority of the population Fourth we estimatethe model for the second deductible choicemade by those individuals who renew It couldbe argued that switching to other companies is

costly so outside options are not as important indriving deductible choices for those who renewAll these results are summarized in Table 6 andshow some important differences between thetwo groups within each pair of subsamples prob-ably reflecting selection For all groups howeverthe qualitative pattern of the results and the orderof magnitude and dispersion of risk aversion aresimilar to those of the full sample This is sugges-tive that correcting for sample selection is unlikelyto change the qualitative results

E Caveats

Moral HazardmdashThroughout our analysiswe abstract from moral hazard ie we assumethat i can vary across individuals but is invari-ant to the coverage choice There are two typesof moral hazard that may play a role in thiscontext First individuals with less coveragemay take greater precaution and drive morecarefully thereby reducing their claim risk rateSecond conditional on a claim event peoplewith higher deductibles are less likely to file aclaim there exists a range of claims for which

TABLE 7mdashREPRESENTATIVENESS

Variable Sampleb Populationc Car ownersd

Agea 4114 (1237) 4255 (1801) 4511 (1413)Female 0316 0518 0367Family Single 0143 0233 0067

Married 0780 0651 0838Divorced 0057 0043 0043Widower 0020 0074 0052

Education Elementary 0029 0329 0266High school 0433 0384 0334Technical 0100 0131 0165College 0438 0155 0234

Emigrant 0335 0445 0447

Obs 105800 723615 255435

a For the age variable the only continuous variable in the table we provide both the mean and the standard deviation (inparentheses)

b The figures are derived from Table 1 The family and education variables are renormalized so they add up to one weignore those individuals for whom we do not have family status or education level This is particularly relevant for theeducation variable which is not available for about half of the sample it seems likely that unreported education levels arenot random but associated with lower levels of education This may help in explaining at least some of the gap in educationlevels across the columns

c This column is based on a random sample of the Israeli population as of 1995 We use only adult population ieindividuals who are 18 years of age or older

d This column is based on a subsample of the population sample The data provide information about car ownership onlyat the household level not at the individual level Thus we define an individual as a car owner if (a) the household owns atleast one car and the individual is the head of the household or (b) the household owns at least two cars and the individualis the spouse of the head of the household

773VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

filing is profitable only under a low deductible(the literature often refers to this second effectas ldquoex post moral hazardrdquo)

To the extent that moral hazard exists abstract-ing from it will likely bias our estimates of riskaversion downward To see this note that adjust-ing behavior will help individuals to self-insureagainst uninsured costs Similarly ex post moralhazard would reduce the value of a low deduct-ible as in the event of a claim the gain from alower deductible would sometimes be less thanthe difference between the two deductible levelsBoth these effects will make a low deductible lessattractive requiring individuals to be even morerisk averse than we estimate in order to purchasemore coverage Below we discuss why in ourview abstracting from moral hazard is a reason-able approximation in this setting

It seems reasonable to conjecture that ceterisparibus insured individuals will drive less care-fully than uninsured ones It may also seemreasonable that the existence of a deductiblemay make individuals more careful about smalldamages to their car However when all choicesinclude a deductible and deductibles are similarin their magnitude it seems less likely thatdriving or care behavior will be affected46 Fi-nally to separately identify moral hazard onewould need another dimension of the data overwhich risk types remain fixed but coveragechoices vary exogenously (Israel 2004 Jaap HAbbring et al forthcoming)

We rely on the data to justify why we abstractfrom the second potential effect that of ex postmoral hazard Figure 3 presents data on theclaim amounts and shows that about 99 percentof the claims filed by policyholders with low-deductible policies were for amounts greaterthan the higher deductible level If the distribu-tion of amounts of potential claims does notvary with the deductible choice and if individ-uals file a claim for any loss that exceeds their

deductible level this suggests that 99 percent ofthe claims would have been filed under eitherdeductible choice making the assumption toabstract from moral hazard not very restrictive

Individuals however may choose not to filea claim even when the claim amount exceedsthe deductible level This may happen due toexperience rating which increases future insur-ance premiums These dynamic effects do notdepend on the deductible level at the time of theclaim so they simply enter in an additive wayUsing our data on individuals who renew theirpolicies with the company we can assess howbig the dynamic effects are These data showthat the price effect of a claim lasts for threeyears and is highest when an individual files hissecond claim within a year In such a case hewould face about a 20 percent increase in hisinsurance premium in the subsequent year 10percent in the year after and 5 percent in thethird year after the claim The regular premiumis about twice the regular deductible amount soan upper bound for the dynamic costs is 70percent of the regular deductible In most casesthe actual dynamic costs are much lower thanthis upper bound (the dynamic costs of say thefirst claim within a year are close to zero) Inaddition an individual can opt out of the con-tract and switch to a different insurance pro-vider This is likely to reduce his dynamic costsbecause in Israel unlike in the United Statesand many other countries there is no publicrecord of past claims Therefore insurance pro-viders can take full advantage of past recordsonly for their past customers For this reasonnew customers will of course face higher pre-miums than existing ones but the premium in-crease would not be as high as it would havebeen with the old insurance provider This isdue to the presence of ldquoinnocentrdquo new custom-ers who are pooled together with the switchers(Cohen 2003)47 Using 70 percent as a conser-vative upper bound Figure 3 shows that about93 percent of those claims filed by individualswith a low deductible would have also been46 This assumption is consistent with Cohen and Einav

(2003) who find no evidence for behavioral response tochanges in seat belt laws The following anecdotal obser-vation also supports this In an informal survey we con-ducted among our colleagues all of them were aware of adeductible in their auto insurance policy but fewer than 20percent knew its level This does not imply that 80 percentof our colleagues did not pay attention to their deductiblechoice at the time the choice was made It does implyhowever that their driving or care behavior cannot dependon the deductible level

47 New customers may voluntarily report their claimshistory to their new insurance provider Voluntary disclo-sure of past claims is as may be expected not truthful Ourdata suggest an unconditional claim rate of 02453 in oursample population Our data on claims history as voluntar-ily disclosed by the same individuals suggest a claim rate of00604 which is four times lower

774 THE AMERICAN ECONOMIC REVIEW JUNE 2007

filed with a regular deductible While this is notnegligible it applies for only a small fraction ofindividuals For the vast majority of them the99 percent discussed above is a more appropri-ate benchmark Therefore ex post moral hazardis unlikely to play a major role in this settingand one can abstract from the loss distributionand focus on claim rates as we do in thispaper48

Additional Cost of an AccidentmdashOur modelassumes that in the event of an accident theonly incurred costs are those associated with thedeductible payment In practice however other

transaction costs may be incurred such as thetime spent for appraisal of the damage the costsassociated with renting a replacement car for theduration of a repair etc Such costs could bereadily incorporated into the model To illus-trate we assume that these costs are known inadvance and are given by a constant c (whichcould in principle vary with each individual)Since c will not vary with the chosen level ofdeductible it will not affect the value d andwill enter the empirical model only through itseffect on d In particular equation (7) willchange to

(19) r

p

d 1

d c

and everything else will remain the same

48 We would not be as comfortable with this statementfor the choice of high and very high deductibles which areat much higher levels This is one additional reason to focusonly on the choice between low and regular deductibles

FIGURE 3 CLAIM DISTRIBUTIONS

Notes This figure plots kernel densities of the claim amounts estimated separately depending on the deductible choice Forease of comparison we normalize the claim amounts by the level of the regular deductible (ie the normalization is invariantto the deductible choice) and truncate the distribution at ten (the truncated part which includes a fat tail outside of the figureaccounts for about 25 percent of the distribution and is roughly similar for both deductible choices) The thick line presentsthe distribution of the claim amounts for individuals who chose a low deductible while the thin line does the same for thosewho chose a regular deductible Clearly both distributions are truncated from below at the deductible level The figure showsthat the distributions are fairly similar Assuming that the claim amount distribution is the same the area below the thickerline between 06 and 1 is the fraction of claims that would fall between the two deductible levels and therefore (absentdynamic incentives) would be filed only if a low deductible were chosen This area (between the two dotted vertical lines)amounts to 13 percent implying that the potential bias arising from restricting attention to claim rate (and abstracting fromthe claim distribution) is quite limited As we discuss in the text dynamic incentives due to experience rating may increasethe costs of filing a claim shifting the region in which the deductible choice matters to the right an upper bound to these costsis about 70 percent of the regular deductible covering an area (between the two dashed vertical lines) that integrates to morethan 7 percent Note however that these dynamic incentives are a very conservative upper bound they apply to less than 15percent of the individuals and do not account for the exit option which significantly reduces these dynamic costs

775VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

This implies that in principle such costs willhave no effect on the results of the counterfac-tual exercise we present later The costs willhowever affect the interpretation of the esti-mates of risk aversion In particular instead ofthe distribution of r we will now be estimatingthe distribution of r(d c)d so the reportedestimates of the coefficient of absolute riskaversion will be biased upward The magnitudeof the bias depends on the size of this transac-tion cost c compared to the average deductibled If c is relatively small the bias is negligibleIf however c is as big as the (average) deduct-ible level all our reported estimates of the levelof risk aversion should be divided by two (butthe coefficients on observables which are semi-elasticities will not change) The intuitionwould be similar but more involved if c variesacross individuals but not proportionally to d

Data about such transaction costs are ofcourse not available The following back-of-the-envelope exercise may provide some guidanceas to the magnitude of such costs We collecteddata from the Israeli police about the annual num-bers of accidents49 accidents with fatalities andaccidents with severe injuries in Israel for theyears 1996ndash1999 We then divided these numbersby our estimate of the total number of auto insur-ance claims in Israel (161859)50 We obtain that159 percent of the claims involve reported acci-dents 22 percent involve accidents with severeinjuries and 03 percent involve fatal accidentsThus the majority of claims reflect small unre-ported accidents perhaps suggesting that theseadditional costs of a claim are often not too large

Deviations from Expected Utility TheorymdashThroughout the paper we restrict attention toexpected utility maximizers Despite much ev-idence in the literature against some of the pre-dictions of expected utility theory it still seemsto us the most natural benchmark to specify andone that facilitates comparison to previous stud-ies We note that expected utility theory is as-sumed it is not and cannot be tested within our

framework Given our cross-sectional analysiswhich in principle allows flexible forms ofunobserved heterogeneity in risk preferencesthere are no testable restrictions imposed byexpected utility theory We should also note thatmuch (but not all) of the documented evidenceagainst expected utility theory arises with ex-treme risk probabilities which are close to zeroor one Our data (and our estimates) are basedon risk probabilities that are all in the range of010 to 035 Over this range expected utilityseems to perform better Finally it is importantto stress two points First at the conceptuallevel it is straightforward to use an alternativetheory of decisions under uncertainty If con-ditional on objective risk individuals vary in asingle dimension the same conceptual modeland empirical strategy can be applied All oneneeds to do is to specify the parameter overwhich decisions vary and construct an indiffer-ence set in the space of the specified parameterand (objective) risk types similar to the onepresented in Figure 2 Second any alternativemodel of decisions under uncertainty would re-quire us to take an even stronger view regardingthe parameterized objective function For exam-ple prospect theory (Daniel Kahneman andAmos Tversky 1979) would require us to pa-rameterize not only the curvature of individu-alsrsquo utility functions but also their referencepoints for which there is no natural choice inour context Similar issues arise if we try toapply decision weights (Tversky and PeterWakker 1995) or measures of overconfidencewith respect to driving ability

F Implications for Profits and Pricing

We now look at how firm profits vary withalternative pricing schemes This exercise isinteresting for several reasons First althoughwe do not use supply-side information for esti-mation it shows how one may incorporate suchinformation in estimation Second it illustratesthe conceptual trade-off faced by a monopolistthat operates in a market with adverse selectionAlthough the conceptual trade-off betweenhigher demand and worse selection is wellknown and has been extensively analyzed in thetheoretical literature quantifying it is importantto understand its empirical relevance Finallywe argued earlier that unobserved heterogeneityin risk aversion seems more important than un-

49 An accident is counted in this measure if it was re-ported to the police and a police officer arrived at the scene

50 This is estimated by taking the number of activepolicies at the end of our sample (45871) dividing it by ourbest guess for the share of the market the company had atthe time (7 percent) and multiplying it by the estimatedclaim rate in our data as computed in Table 2 (0245)

776 THE AMERICAN ECONOMIC REVIEW JUNE 2007

observed heterogeneity in risk The current ex-ercise shows that this conclusion also translatesto pricing and profits

Throughout this section we hold fixed thedistribution of risk and risk aversion the firmfaces Since we have little information about thedeterminants of overall demand faced by thefirm and hence how restrictive it is to hold thedistribution fixed we make the simplifying as-sumption that individuals make their choicessequentially They first choose the insuranceprovider by observing only the price of theregular deductible Once they decide to buy apolicy from the insurer they choose the deduct-ible level This seems a reasonable approxima-tion as the regular deductible is the one alwaysadvertised and initially quoted while the otheroptions are revealed only once the potentialcustomer and the insurance sales person ldquogetinto detailsrdquo Consistent with this assumptionwe assume that the regular premium and de-ductible are dictated by competitive conditionsand we focus on the choice of the low deduct-ible and its associated premium

From the companyrsquos standpoint each indi-vidual can be represented by a random draw of(i ri) from the conditional (on observables)distribution of risk and risk aversion

(20) ln i

ln ri Nxi

xi

2 r

r r2

When analyzing the optimal menu to offer suchan individual the company is assumed to berisk neutral and to maximize expected profitsSuppose the company offered only the regulardeductible and premium (dh ph) Let the ex-pected profits from this strategy be 0 Givenour assumptions we proceed by analyzing howthe firmrsquos profits are affected by a choice of anadditional low deductible option (dl pl) withdl dh and pl ph It is easy to use a changein variables and analyze the choice of d dh dl and p pl ph Expected profits arenow given by

(21) maxdp

0 Prri rii d pp

d Eiri rii d p

The trade-off in the companyrsquos decision isstraightforward Each new customer who chooses

the low combination pays an additional p up-front but saves d for each accident she is in-volved in This translates into two effects thatenter the companyrsquos decision problem The first issimilar to a standard pricing problem higher(lower) price difference (deductible difference)p (d) leads to a higher markup (on those indi-viduals who select the low deductible) but tolower quantity (or probability of purchase) asfewer individuals elect to choose the low deduct-ible This effect enters the profit function throughD(d p) Pr(ri ri(i d p)) The secondcomposition effect arises because of adverse se-lection As the price of the low deductible in-creases those individuals who still elect to choosethe low combination are ceteris paribus thosewith higher risk This effect enters through (dp) E(iri ri(i d p)) Its magnitude andsign depend on the relative heterogeneity of i andri and on the correlation between them Sinceneither D(d p) nor (d p) has a closed-form solution we analyze this decision problemgraphically where D(d p) and (d p) arenumerically computed using simulations from thejoint distribution

We illustrate our analysis by using the meanindividual in the data whose expected ln(i)and ln(ri) are 154 and 1181 respectively(based on Table 4) Such an individual has tochoose between a regular contract of ( phdh) (3189 1595) (in NIS) and a lowcontract of ( pl dl) (3381 957) ie (pd) (191 638) Below we discuss severaladditional counterfactual cases First we con-sider a case with a negative correlation be-tween risk and risk aversion (with the samemagnitude ie 084) Second we con-sider cases when the company ignores unob-served heterogeneity in one of the dimensionsie it views individuals as a draw from theestimated marginal distribution on one dimen-sion with the other dimension known and fixedat its estimated mean We do this exercise foreach dimension separately

To get intuition for the different effects Figure2 presents the estimated distribution in the spaceof (i ri) A small increase (decrease) in p (d)shifts the indifference setup to the right therebymaking some marginal individuals who were pre-viously just to the right of it switch to choosingthe regular deductible The demand trade-off isjust the comparison between the marginal loss ofthe company from all the marginal individuals

777VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

who no longer buy higher coverage with thehigher profits made from the inframarginal indi-viduals who still elect to choose higher coverageFigure 2 also helps in illustrating the effect ofadverse selection and the importance of the cor-relation coefficient As the menu shifts to the rightthe positive correlation implies that the marginalindividuals have higher risk than the average Thismeans that ldquolosingrdquo them (namely havingthem buy less coverage) is not as costly forthe insurance company as such individualsare on average more adversely selected Anegative correlation for example would havemade these marginal individuals more valuablethereby decreasing the incentive to increase pricesor deductibles from the current levels

Figure 4 presents the effects of pricing onprofits by varying the low-deductible levelkeeping the premium charged for it fixed at theobserved price (of p 191) It implies that the

current low deductible benefit of 638 NIS re-sults in additional annual profits of about 368NIS per customer This is about 037 percent oftotal operating profits per customer which isabout 1000 NIS Note however that after sub-tracting the administrative and claim-handlingcosts associated with each customer and claimthe relative magnitude of this effect will behigher Note also that the estimates imply thatthe current low-deductible level is suboptimalBy setting a smaller low-deductible benefit ofd 355 NIS (ie increasing the current lowdeductible by 283 NIS) additional profits canbe increased to 659 NIS51 Of course the find-

51 There is no reason of course to limit the choice of thecompany to only one additional deductible level Moredegrees of freedom in choosing the pricing menu will leadto higher profits

FIGURE 4 COUNTERFACTUALSmdashPROFITS

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) We plot the additional profits fromoffering a low deductible as a function of the attractiveness of a low deductible d dh dl fixing its price at the observedlevel The thick solid line presents the counterfactual profits as implied by the estimated benchmark model The other threecurves illustrate how the profits change in response to changes in the assumptions when the correlation between riskand risk aversion is negative (thin solid line) when there is no heterogeneity in risk aversion (dot-dashed line) and whenthere is no heterogeneity in risk (dashed line) The maxima (argmax) of the four curves respectively are 659 (355)714 (583) 0 and 704 (500) The dotted vertical line represents the observed level of d (638) which implies that theadditional profits from the observed low deductible are 368 NIS per policy

778 THE AMERICAN ECONOMIC REVIEW JUNE 2007

ing that the current pricing is suboptimal mayalso be due to a limitation of the model

Consistent with the intuition discussed aboveFigure 4 also shows that the incentive to in-crease prices (or lower the deductible) is higherwith positive correlation When the correlationbetween risk and risk aversion is negative theoptimal low deductible is lower It also showsthat ignoring either dimension of unobservedheterogeneity has an important effect on pric-ing decisions However while ignoring het-erogeneity in risk does not change thequalitative pattern by much ignoring hetero-geneity in risk aversion completely changesthe pricing analysis In fact given the esti-mated coefficients and the observed prices ifindividuals vary only in risk offering a lowdeductible does not increase profits due to ad-verse selection

Figure 5 breaks down the effect on profits bypresenting the pricing effect on the demand forlow deductible D(d p) and on the compo-sition effect (d p) The former is simplygenerated by the distribution of certainty equiv-alents implied by the joint distribution of i andri (Landsberger and Meilijson 1999) It is S-shaped due to the assumption of lognormal dis-tribution The shape of the composition effect isdriven by the relative variance of i and ri andby the correlation coefficient As the estimatesimply that most of the variation in certaintyequivalents is driven by variation in ri thestrong positive correlation implies that the com-position effect is monotonically decreasing inthe deductible level As the low-deductible op-tion becomes more favorable more peoplechoose it with the most risky individuals choos-ing it first The effect of the deductible level onthe composition effect is dramatically differentwhen the correlation between risk and risk aver-sion is negative When this is the case theobserved relationship between the deductiblelevel and the composition effect is mostly re-versed This is because the effect of risk aver-sion dominates that of adverse selection due toits higher variance

IV Concluding Remarks

The paper makes two separate contributionsFirst from a methodological standpoint we layout a conceptual framework through which one

can formulate and estimate a demand system forindividually customized contracts The key datarequirements for this approach are contractchoices individual choice sets and ex post riskrealizations Since such data may be available inmany other contexts the methodological frame-work may be useful to uncover structural pa-rameters in such settings As an example onecould consider annuity data and use guaranteeperiod choices and mortality data to identifyheterogeneity in risk (mortality) and in prefer-ences for wealth after death (Einav Finkelsteinand Paul Schrimpf 2006) Similarly one couldconsider loan data and use down-paymentchoices and default data to identify heterogene-ity in risk (default) and in liquidity (Will Ad-ams Einav and Jonathan D Levin 2006)Second from an economic standpoint we pro-vide a new set of estimates for the degree andheterogeneity of (absolute) risk aversion and itsrelationship with risk We discuss these below

While our estimates of risk aversion help topredict other related insurance decisions it isnatural to ask to what extent these parametersare relevant in other contexts This is essentiallyan empirical question which can be answeredonly by estimating risk aversion parameters fora variety of bet sizes and in a variety of con-texts Since isolating risk preferences in manycontexts is hard such exercises are rare leavingus with no definite answer for the scope ofmarkets for which our estimates may be rele-vant On one hand Rabin (2000) and Rabin andThaler (2001) argue that different decisions inlife are taken in different contexts and thereforemay be subject to different parameters in theutility function On the other hand classicaltheory suggests that each individual has onevalue function over her lifetime wealth so allrisky decisions take into account the same valuefunction and are therefore subject to the samerisk preferences Our view is somewhere inbetween We are more comfortable with extrap-olation of our risk aversion estimates to setupsthat are closer to the auto insurance marketcontext in which these parameters are esti-mated To assess ldquoclosenessrdquo it is important toconsider various factors over which contextsmay differ Such factors may include bet size aswell as informational and behavioral effectssuch as default options framing and rarity of theevents Since we use claim data to identify risktype we leave everything else to be interpreted as

779VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

FIGURE 5 COUNTERFACTUALSmdashSELECTION

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) Parallel to Figure 4 we breakdown the effects on profits to the share of consumers who choose a low deductible (bottom panel) and to the expected riskof this group (top panel) This is presented by the thick solid line for the estimates of the benchmark model As in Figure 4 wealso present these effects for three additional cases when the correlation between risk and risk aversion is negative (thin solidline) when there is no heterogeneity in risk aversion (dot-dashed line) and when there is no heterogeneity in risk (dashedline) The dotted vertical lines represent the observed level of d (638) for which the share of low deductible is 16 percentand their expected risk is 0264 This may be compared with the corresponding figures in Table 2A of 178 and 0309respectively Note however that the figure presents estimated quantities for the average individual in the data while Table2A presents the average quantities in the data so one should not expect the numbers to fit perfectly

780 THE AMERICAN ECONOMIC REVIEW JUNE 2007

risk aversion As an example overconfidence willbe captured by a lower level of estimated riskaversion so if overconfidence is more importantin auto insurance than in health insurance whenextrapolated to health insurance individuals maybehave as if they are more risk averse than weestimate them to be

With this caveat in mind let us discuss ourfour main findings First we find large hetero-geneity in risk preferences across individualsThis heterogeneity is important in various con-texts (a) it cautions against using pure cross-sectional variation to test expected utilitytheory (b) it may create strong selection ofparticipants into particular markets this maymake participants in voluntary markets quitedifferent in their risk preferences from those inmarkets in which participation is either manda-tory or driven by other factors and (c) it maymake the marginal individual quite differentfrom the average one Models in macroeconom-ics and finance which often use a representativeindividual framework may not be able to cap-ture and account for such differences

Our second set of findings concerns the wayrisk aversion relates to observable characteris-tics Our finding that women are more riskaverse than men has been documented in othersettings The finding that risk preferences ex-hibit a U-shaped pattern over the life cycle maybe interesting to explore further in other con-texts Other findings suggest that the estimatedcoefficient of risk aversion increases with ob-servables that are related to income and wealthAs we agree with the widely held belief of thedecreasing absolute risk aversion property ourpreferred interpretation for this finding is thatwealth and income may be endogenous gener-ating the estimated cross-sectional relationshipWhile lower risk aversion may be associatedwith higher propensity to become an entrepre-neur and thereby have higher wealth it mayalso be associated with lower propensity to saveor invest in education affecting wealth the otherway Therefore one important message of thisfinding is that accounting for heterogeneity inpreferences may be important as representativeconsumer models may provide misleading in-terpretations for otherwise natural results

The last two sets of findings concern therelationship between risk and preferences Sincerisk is particular to the context in which it ismeasured these findings may be sensitive to the

market context and may change once risk takesother forms Even within the auto insurancemarket it is important how risk is measuredMoreover risk in the auto insurance marketmay be conceptually different from risk in othermarkets In many markets risk is independentacross individuals In auto insurance howevermuch of the risk depends on coordinationamong drivers and therefore may be more re-lated to relative not absolute characteristicsFor this reason our finding of positive correla-tion between risk and risk aversion can coexistwith findings of negative correlation in othercontexts (Israel 2005 Finkelstein and McGarry2006) The positive correlation we find is alsoconsistent with the fact that the bivariate probittest in our data provides evidence for adverseselection (Cohen 2005) while similar reduced-form tests in other contexts do not

Finally we find that unobserved heterogene-ity in risk preferences is more important thanheterogeneity in risk This may be driven by thecasual evidence that insurance companies exertmuch effort and resources in collecting con-sumer data which are informative about riskclassification but not about preferences52 Weillustrate the empirical importance of our find-ings for the analysis of optimal contracts in autoinsurance The presence of more than one di-mension of unobserved heterogeneity may dra-matically change the nature of these contractsTheory is still not fully developed for suchmultidimensional screening problems as it typ-ically requires a small number of types (Lands-berger and Meilijson 1999) restricts the twodimensions to be independent of each other(Jean-Charles Rochet and Lars A Stole 2002)or assumes that the number of instrumentsavailable to the monopolist is not smaller thanthe dimension of unobserved heterogeneity(Steven Matthews and John Moore 1987 Rich-ard Arnott and Joseph E Stiglitz 1988)53 MarkArmstrong (1999) may be the closest theoreticalwork to the framework suggested here It cannot

52 This choice of data collection efforts may be justifiedif it is easier for such firms to price discriminate based onrisk but harder to price discriminate based on preferencesA common belief is that without cost-based (ie risk-based) justification for prices price discrimination may leadto consumer backlash

53 See also Smart (2000) Villeneuve (2003) and Jullienet al (2007) for related theoretical results

781VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

be directly applied however as it uses simpli-fying linearity assumptions which would behard to impose in the current context Our re-sults indicate that many applications can benefitfrom extending the theory to include the more

general case such as the one analyzed hereSuch a theory may also serve as a guide forusing supply-side restrictions in similar con-texts Our counterfactual analysis is a very pre-liminary start in this direction

APPENDIX A DESCRIPTION OF THE GIBBS SAMPLER

In this appendix we describe the setup of the Gibbs sampler that we use to estimate the modelOne of the main advantages of the Gibbs sampler is its ability to allow for data augmentation oflatent variables (Tanner and Wong 1987) In our context this amounts to augmenting the individual-specific risk aversion and risk type namely i rii1

n as additional parametersWe can write the model as follows

(22) ln i xi i

(23) ln ri xi vi

(24) choicei 1 if ri ri ()0 if ri ri ()

(25) claimsi Poissoni ti

(26) i

vi

iid

N00

2 r

r r2

Let

2 r

r r2 X

x1

xn

0

0 x1

xn

y r and ui i

vi

The set of parameters for which we want to have a posterior distribution is given by uii1

n The prior specifies that are independent of uii1n have a conventional

diffuse prior We adopt a hierarchical prior for uii1N

(27) uii 1n

iid

N0

(28) 1 Wishart2a Q

782 THE AMERICAN ECONOMIC REVIEW JUNE 2007

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 27: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

on the data and the most recent draws for theindividual irsquos we compute lower bounds forthe level and dispersion of risk aversion (inaddition to generating the draws for ri from theconditional lognormal distribution) To deter-mine the lower bound for the level of risk aver-sion we compute

(18) r i i 0 if choicei 0

max0 pi

idi 113d i

if choicei 1

Namely we make ri as low as possible giventhe assumptions of the model We assume thatindividuals who chose a regular deductible arerisk neutral while individuals who chose a lowdeductible are just indifferent between the twodeductible levels (unless i is high enough inwhich case they are also assumed to be riskneutral) We then compute the average over therirsquos To compute the lower bound for the dis-persion we search for values of rirsquos that areconsistent with the assumptions of the modeland the observed data and that minimize thevariance of ri This turns out to be a simplesearch procedure that is linear in the number ofindividuals

The lower bound of the level of risk aversionthat we obtain from this exercise is 368 104

NIS1 (with standard deviation of 638 106)This level is five times lower than the analogouspoint estimate reported in Table 4 This trans-lates into indifference about participating in a50ndash50 lottery of gaining $100 and losing $885Thus it still suggests a significant level of riskaversion for lotteries of $100 magnitude Simi-larly the conversion to relative risk aversion asin Table 5 implies a relative risk aversion co-efficient of 1883 The result for the lowerbound of the dispersion is 189 103 (withstandard deviation of 816 105) which is tentimes lower than the benchmark estimate Thusthe coefficient of variation for absolute riskaversion declines only by two suggesting ahigh degree of heterogeneity Finally oneshould note that these bounds involve extremeassumptions and are computed separately andtherefore cannot be binding at the same timeThus the ldquocorrectrdquo estimates for both quantities

are likely to be higher closer to the results wereport for the benchmark model

Incomplete Information about RiskmdashThroughout we assume that individuals haveperfect information about their individual-specific objective risk type i This is a strongerassumption than we need Because expectedutility is linear in probabilities it suffices thatindividualsrsquo expected risk rate is equal to theirobjective risk rate ie i E(iIi) i wherei is a random variable representing individualirsquos perceived risk rate and Ii is individual irsquosinformation at the time of the coverage choiceNamely individuals could be uncertain abouttheir risk type but their point estimate has to becorrect

There are several channels through whichincomplete information may operate Let usconsider two such cases First suppose thatindividuals are correct but only on average iethat i i i where E(i) 0 The intuitionfor this case is similar to an ldquoerrors in variablesrdquomodel and in principle will result in an evenless important role for adverse selection Giventhat we find a relatively small effect of adverseselection this bias will not change this conclu-sion This may be even more pronounced ifcorr(i i) 0 which reflects a reasonableassumption of ldquoreversion to the meanrdquo ie thatan individualrsquos estimate of his risk type is someweighted average between his true risk type andthe average risk type of individuals who aresimilar (on observables) to him The conclusionmay go in the other way only if the mistakes goin the other direction according to which indi-viduals who are riskier than average believe thatthey are even more risky than they truly areThis we believe is less plausible

As a test for the sensitivity of the results tothe complete information assumption we per-form two exercises The first exercise estimatesthe benchmark model separately for experi-enced drivers and inexperienced drivers wherewe define an individual to be an experienceddriver if he has ten years or more driving expe-rience This is a conservative definition com-pared to those used in the literature (Chiapporiand Salanie 2000 Cohen 2005) The underlyingassumption consistent with evidence providedin Cohen (2005) is that drivers learn about theirown risk types as they gain more driving expe-rience but that most of this learning occurs

771VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

within the first ten years of driving Thus ourcomplete information model is a more appeal-ing description of experienced drivers Theresults for experienced drivers are almost iden-tical to the results from the benchmark specifi-cation (Table 6) suggesting that the mainresults are not driven by the fraction of individ-uals who are less likely to know their risk typesThe results for inexperienced drivers show asimilar pattern and magnitude Consistent withthe learning hypothesis however they show alarger dispersion in risk preferences which maybe due to incomplete information and there-fore to more random choices of deductiblelevels

The second exercise we perform is a morestructural version of the learning story whichallows even experienced drivers to have incom-plete information regarding their risk types Wedo so by estimating a different specification ofthe model that assumes that individuals areBayesian and update their information abouttheir own (stable) risk types over time usingonly information about the number of claimsthey make each year While we do not observecomplete claims histories of individuals wecan simulate such histories and integrate overthese simulations Thus individualsrsquo informa-tion would be related to their true type andwould be more precise with longer driving his-tories The full modeling and estimation detailsof this specification are provided in the Appen-dix We view this model as an extreme versionof incomplete information as there are manyother sources through which individuals maylearn about their own types and thereby havebetter information about their types than whatwe estimate them to have While the resultsimply that the levels of risk aversion and heter-ogeneity are lower than the benchmark esti-mates (Table 6) the order of magnitude andqualitative pattern are quite similar suggestingthat the main qualitative findings are robust tothe information structure

Sample SelectionmdashThere are two differentways to think about the contract selection pro-cess One possibility is that individuals firstselect an insurer based on advertisement wordof mouth or access to an agent Then individ-uals select the insurance contract from amongseveral contracts the selected insurer offersThis selection process is consistent with the way

the company executives view their business45

Another possibility is that individuals first col-lect information about all available insurancecontracts and then choose their most preferredone According to the industryrsquos conventionalwisdom individuals do not shop much acrossinsurers and therefore this selection processseems less important

The unique characteristics of direct insurersmay attract individuals who are more likely toexperiment with new ways of doing businessand may therefore be less risk averse than thegeneral population In Table 7 we compare thedemographics of our sample of policyholderswith those of the general Israeli populationThis comparison reflects a similar intuitioncompared with the general population our av-erage policyholder is slightly younger moreeducated more likely to be male and less likelyto be married or an immigrant This direction ofselection may also apply to unobserved riskpreferences thereby making our policyholderson average less risk averse than a representa-tive individual This suggests that the level ofrisk aversion that we find may be viewed as alower bound on the level of risk aversion in thegeneral Israeli population

One potential way to model sample selectionis to allow for an additional outside option to beselected For the vast majority of individuals weobserve the outside option is to purchase sim-ilar insurance from competing insurance agen-cies Unfortunately data on the structure ofcompeting contracts their prices and the waythey vary with individual characteristics are un-available This makes us uncomfortable to try tomodel sample selection as results from anysuch model will be driven by our assumptionsrather than by meaningful variation in the dataThe results are still meaningful for two reasonsFirst this is a large population accounting forabout 7 percent of all drivers in Israel Secondto the extent that our estimates suggest higherlevels of risk aversion than previously esti-mated and that the sample selection is likely to

45 As discussed in Section I the company was the firstdirect insurance provider and it offered significantly lowerpremiums than those offered by competitors due to signif-icant cost advantage In Section I we also discuss theliterature which emphasizes that choice of direct insurers isdriven primarily by nonmonetary ldquoamenitiesrdquo

772 THE AMERICAN ECONOMIC REVIEW JUNE 2007

bias these estimates downward the results arestill informative

To assess the magnitude of sample selectionin driving the main results we perform severaltests First we estimate the benchmark modelseparately for the first two years of the compa-nyrsquos operation during which the company wasthe only company to sell insurance directly andfor the remaining three years after additionaldirect insurers entered the market Second weestimate the benchmark model separately forindividuals who were referred to the companyby word of mouth and for those who heardabout the company through advertisement (pri-marily on television) The latter may havesearched more and might be more price sensi-tive Third we estimate the benchmark modelseparately for individuals who renewed theirpolicy with the company and for those who didnot renew It seems likely that individuals whodid not renew are less selected as most of themswitch back to regular insurers who insure themajority of the population Fourth we estimatethe model for the second deductible choicemade by those individuals who renew It couldbe argued that switching to other companies is

costly so outside options are not as important indriving deductible choices for those who renewAll these results are summarized in Table 6 andshow some important differences between thetwo groups within each pair of subsamples prob-ably reflecting selection For all groups howeverthe qualitative pattern of the results and the orderof magnitude and dispersion of risk aversion aresimilar to those of the full sample This is sugges-tive that correcting for sample selection is unlikelyto change the qualitative results

E Caveats

Moral HazardmdashThroughout our analysiswe abstract from moral hazard ie we assumethat i can vary across individuals but is invari-ant to the coverage choice There are two typesof moral hazard that may play a role in thiscontext First individuals with less coveragemay take greater precaution and drive morecarefully thereby reducing their claim risk rateSecond conditional on a claim event peoplewith higher deductibles are less likely to file aclaim there exists a range of claims for which

TABLE 7mdashREPRESENTATIVENESS

Variable Sampleb Populationc Car ownersd

Agea 4114 (1237) 4255 (1801) 4511 (1413)Female 0316 0518 0367Family Single 0143 0233 0067

Married 0780 0651 0838Divorced 0057 0043 0043Widower 0020 0074 0052

Education Elementary 0029 0329 0266High school 0433 0384 0334Technical 0100 0131 0165College 0438 0155 0234

Emigrant 0335 0445 0447

Obs 105800 723615 255435

a For the age variable the only continuous variable in the table we provide both the mean and the standard deviation (inparentheses)

b The figures are derived from Table 1 The family and education variables are renormalized so they add up to one weignore those individuals for whom we do not have family status or education level This is particularly relevant for theeducation variable which is not available for about half of the sample it seems likely that unreported education levels arenot random but associated with lower levels of education This may help in explaining at least some of the gap in educationlevels across the columns

c This column is based on a random sample of the Israeli population as of 1995 We use only adult population ieindividuals who are 18 years of age or older

d This column is based on a subsample of the population sample The data provide information about car ownership onlyat the household level not at the individual level Thus we define an individual as a car owner if (a) the household owns atleast one car and the individual is the head of the household or (b) the household owns at least two cars and the individualis the spouse of the head of the household

773VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

filing is profitable only under a low deductible(the literature often refers to this second effectas ldquoex post moral hazardrdquo)

To the extent that moral hazard exists abstract-ing from it will likely bias our estimates of riskaversion downward To see this note that adjust-ing behavior will help individuals to self-insureagainst uninsured costs Similarly ex post moralhazard would reduce the value of a low deduct-ible as in the event of a claim the gain from alower deductible would sometimes be less thanthe difference between the two deductible levelsBoth these effects will make a low deductible lessattractive requiring individuals to be even morerisk averse than we estimate in order to purchasemore coverage Below we discuss why in ourview abstracting from moral hazard is a reason-able approximation in this setting

It seems reasonable to conjecture that ceterisparibus insured individuals will drive less care-fully than uninsured ones It may also seemreasonable that the existence of a deductiblemay make individuals more careful about smalldamages to their car However when all choicesinclude a deductible and deductibles are similarin their magnitude it seems less likely thatdriving or care behavior will be affected46 Fi-nally to separately identify moral hazard onewould need another dimension of the data overwhich risk types remain fixed but coveragechoices vary exogenously (Israel 2004 Jaap HAbbring et al forthcoming)

We rely on the data to justify why we abstractfrom the second potential effect that of ex postmoral hazard Figure 3 presents data on theclaim amounts and shows that about 99 percentof the claims filed by policyholders with low-deductible policies were for amounts greaterthan the higher deductible level If the distribu-tion of amounts of potential claims does notvary with the deductible choice and if individ-uals file a claim for any loss that exceeds their

deductible level this suggests that 99 percent ofthe claims would have been filed under eitherdeductible choice making the assumption toabstract from moral hazard not very restrictive

Individuals however may choose not to filea claim even when the claim amount exceedsthe deductible level This may happen due toexperience rating which increases future insur-ance premiums These dynamic effects do notdepend on the deductible level at the time of theclaim so they simply enter in an additive wayUsing our data on individuals who renew theirpolicies with the company we can assess howbig the dynamic effects are These data showthat the price effect of a claim lasts for threeyears and is highest when an individual files hissecond claim within a year In such a case hewould face about a 20 percent increase in hisinsurance premium in the subsequent year 10percent in the year after and 5 percent in thethird year after the claim The regular premiumis about twice the regular deductible amount soan upper bound for the dynamic costs is 70percent of the regular deductible In most casesthe actual dynamic costs are much lower thanthis upper bound (the dynamic costs of say thefirst claim within a year are close to zero) Inaddition an individual can opt out of the con-tract and switch to a different insurance pro-vider This is likely to reduce his dynamic costsbecause in Israel unlike in the United Statesand many other countries there is no publicrecord of past claims Therefore insurance pro-viders can take full advantage of past recordsonly for their past customers For this reasonnew customers will of course face higher pre-miums than existing ones but the premium in-crease would not be as high as it would havebeen with the old insurance provider This isdue to the presence of ldquoinnocentrdquo new custom-ers who are pooled together with the switchers(Cohen 2003)47 Using 70 percent as a conser-vative upper bound Figure 3 shows that about93 percent of those claims filed by individualswith a low deductible would have also been46 This assumption is consistent with Cohen and Einav

(2003) who find no evidence for behavioral response tochanges in seat belt laws The following anecdotal obser-vation also supports this In an informal survey we con-ducted among our colleagues all of them were aware of adeductible in their auto insurance policy but fewer than 20percent knew its level This does not imply that 80 percentof our colleagues did not pay attention to their deductiblechoice at the time the choice was made It does implyhowever that their driving or care behavior cannot dependon the deductible level

47 New customers may voluntarily report their claimshistory to their new insurance provider Voluntary disclo-sure of past claims is as may be expected not truthful Ourdata suggest an unconditional claim rate of 02453 in oursample population Our data on claims history as voluntar-ily disclosed by the same individuals suggest a claim rate of00604 which is four times lower

774 THE AMERICAN ECONOMIC REVIEW JUNE 2007

filed with a regular deductible While this is notnegligible it applies for only a small fraction ofindividuals For the vast majority of them the99 percent discussed above is a more appropri-ate benchmark Therefore ex post moral hazardis unlikely to play a major role in this settingand one can abstract from the loss distributionand focus on claim rates as we do in thispaper48

Additional Cost of an AccidentmdashOur modelassumes that in the event of an accident theonly incurred costs are those associated with thedeductible payment In practice however other

transaction costs may be incurred such as thetime spent for appraisal of the damage the costsassociated with renting a replacement car for theduration of a repair etc Such costs could bereadily incorporated into the model To illus-trate we assume that these costs are known inadvance and are given by a constant c (whichcould in principle vary with each individual)Since c will not vary with the chosen level ofdeductible it will not affect the value d andwill enter the empirical model only through itseffect on d In particular equation (7) willchange to

(19) r

p

d 1

d c

and everything else will remain the same

48 We would not be as comfortable with this statementfor the choice of high and very high deductibles which areat much higher levels This is one additional reason to focusonly on the choice between low and regular deductibles

FIGURE 3 CLAIM DISTRIBUTIONS

Notes This figure plots kernel densities of the claim amounts estimated separately depending on the deductible choice Forease of comparison we normalize the claim amounts by the level of the regular deductible (ie the normalization is invariantto the deductible choice) and truncate the distribution at ten (the truncated part which includes a fat tail outside of the figureaccounts for about 25 percent of the distribution and is roughly similar for both deductible choices) The thick line presentsthe distribution of the claim amounts for individuals who chose a low deductible while the thin line does the same for thosewho chose a regular deductible Clearly both distributions are truncated from below at the deductible level The figure showsthat the distributions are fairly similar Assuming that the claim amount distribution is the same the area below the thickerline between 06 and 1 is the fraction of claims that would fall between the two deductible levels and therefore (absentdynamic incentives) would be filed only if a low deductible were chosen This area (between the two dotted vertical lines)amounts to 13 percent implying that the potential bias arising from restricting attention to claim rate (and abstracting fromthe claim distribution) is quite limited As we discuss in the text dynamic incentives due to experience rating may increasethe costs of filing a claim shifting the region in which the deductible choice matters to the right an upper bound to these costsis about 70 percent of the regular deductible covering an area (between the two dashed vertical lines) that integrates to morethan 7 percent Note however that these dynamic incentives are a very conservative upper bound they apply to less than 15percent of the individuals and do not account for the exit option which significantly reduces these dynamic costs

775VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

This implies that in principle such costs willhave no effect on the results of the counterfac-tual exercise we present later The costs willhowever affect the interpretation of the esti-mates of risk aversion In particular instead ofthe distribution of r we will now be estimatingthe distribution of r(d c)d so the reportedestimates of the coefficient of absolute riskaversion will be biased upward The magnitudeof the bias depends on the size of this transac-tion cost c compared to the average deductibled If c is relatively small the bias is negligibleIf however c is as big as the (average) deduct-ible level all our reported estimates of the levelof risk aversion should be divided by two (butthe coefficients on observables which are semi-elasticities will not change) The intuitionwould be similar but more involved if c variesacross individuals but not proportionally to d

Data about such transaction costs are ofcourse not available The following back-of-the-envelope exercise may provide some guidanceas to the magnitude of such costs We collecteddata from the Israeli police about the annual num-bers of accidents49 accidents with fatalities andaccidents with severe injuries in Israel for theyears 1996ndash1999 We then divided these numbersby our estimate of the total number of auto insur-ance claims in Israel (161859)50 We obtain that159 percent of the claims involve reported acci-dents 22 percent involve accidents with severeinjuries and 03 percent involve fatal accidentsThus the majority of claims reflect small unre-ported accidents perhaps suggesting that theseadditional costs of a claim are often not too large

Deviations from Expected Utility TheorymdashThroughout the paper we restrict attention toexpected utility maximizers Despite much ev-idence in the literature against some of the pre-dictions of expected utility theory it still seemsto us the most natural benchmark to specify andone that facilitates comparison to previous stud-ies We note that expected utility theory is as-sumed it is not and cannot be tested within our

framework Given our cross-sectional analysiswhich in principle allows flexible forms ofunobserved heterogeneity in risk preferencesthere are no testable restrictions imposed byexpected utility theory We should also note thatmuch (but not all) of the documented evidenceagainst expected utility theory arises with ex-treme risk probabilities which are close to zeroor one Our data (and our estimates) are basedon risk probabilities that are all in the range of010 to 035 Over this range expected utilityseems to perform better Finally it is importantto stress two points First at the conceptuallevel it is straightforward to use an alternativetheory of decisions under uncertainty If con-ditional on objective risk individuals vary in asingle dimension the same conceptual modeland empirical strategy can be applied All oneneeds to do is to specify the parameter overwhich decisions vary and construct an indiffer-ence set in the space of the specified parameterand (objective) risk types similar to the onepresented in Figure 2 Second any alternativemodel of decisions under uncertainty would re-quire us to take an even stronger view regardingthe parameterized objective function For exam-ple prospect theory (Daniel Kahneman andAmos Tversky 1979) would require us to pa-rameterize not only the curvature of individu-alsrsquo utility functions but also their referencepoints for which there is no natural choice inour context Similar issues arise if we try toapply decision weights (Tversky and PeterWakker 1995) or measures of overconfidencewith respect to driving ability

F Implications for Profits and Pricing

We now look at how firm profits vary withalternative pricing schemes This exercise isinteresting for several reasons First althoughwe do not use supply-side information for esti-mation it shows how one may incorporate suchinformation in estimation Second it illustratesthe conceptual trade-off faced by a monopolistthat operates in a market with adverse selectionAlthough the conceptual trade-off betweenhigher demand and worse selection is wellknown and has been extensively analyzed in thetheoretical literature quantifying it is importantto understand its empirical relevance Finallywe argued earlier that unobserved heterogeneityin risk aversion seems more important than un-

49 An accident is counted in this measure if it was re-ported to the police and a police officer arrived at the scene

50 This is estimated by taking the number of activepolicies at the end of our sample (45871) dividing it by ourbest guess for the share of the market the company had atthe time (7 percent) and multiplying it by the estimatedclaim rate in our data as computed in Table 2 (0245)

776 THE AMERICAN ECONOMIC REVIEW JUNE 2007

observed heterogeneity in risk The current ex-ercise shows that this conclusion also translatesto pricing and profits

Throughout this section we hold fixed thedistribution of risk and risk aversion the firmfaces Since we have little information about thedeterminants of overall demand faced by thefirm and hence how restrictive it is to hold thedistribution fixed we make the simplifying as-sumption that individuals make their choicessequentially They first choose the insuranceprovider by observing only the price of theregular deductible Once they decide to buy apolicy from the insurer they choose the deduct-ible level This seems a reasonable approxima-tion as the regular deductible is the one alwaysadvertised and initially quoted while the otheroptions are revealed only once the potentialcustomer and the insurance sales person ldquogetinto detailsrdquo Consistent with this assumptionwe assume that the regular premium and de-ductible are dictated by competitive conditionsand we focus on the choice of the low deduct-ible and its associated premium

From the companyrsquos standpoint each indi-vidual can be represented by a random draw of(i ri) from the conditional (on observables)distribution of risk and risk aversion

(20) ln i

ln ri Nxi

xi

2 r

r r2

When analyzing the optimal menu to offer suchan individual the company is assumed to berisk neutral and to maximize expected profitsSuppose the company offered only the regulardeductible and premium (dh ph) Let the ex-pected profits from this strategy be 0 Givenour assumptions we proceed by analyzing howthe firmrsquos profits are affected by a choice of anadditional low deductible option (dl pl) withdl dh and pl ph It is easy to use a changein variables and analyze the choice of d dh dl and p pl ph Expected profits arenow given by

(21) maxdp

0 Prri rii d pp

d Eiri rii d p

The trade-off in the companyrsquos decision isstraightforward Each new customer who chooses

the low combination pays an additional p up-front but saves d for each accident she is in-volved in This translates into two effects thatenter the companyrsquos decision problem The first issimilar to a standard pricing problem higher(lower) price difference (deductible difference)p (d) leads to a higher markup (on those indi-viduals who select the low deductible) but tolower quantity (or probability of purchase) asfewer individuals elect to choose the low deduct-ible This effect enters the profit function throughD(d p) Pr(ri ri(i d p)) The secondcomposition effect arises because of adverse se-lection As the price of the low deductible in-creases those individuals who still elect to choosethe low combination are ceteris paribus thosewith higher risk This effect enters through (dp) E(iri ri(i d p)) Its magnitude andsign depend on the relative heterogeneity of i andri and on the correlation between them Sinceneither D(d p) nor (d p) has a closed-form solution we analyze this decision problemgraphically where D(d p) and (d p) arenumerically computed using simulations from thejoint distribution

We illustrate our analysis by using the meanindividual in the data whose expected ln(i)and ln(ri) are 154 and 1181 respectively(based on Table 4) Such an individual has tochoose between a regular contract of ( phdh) (3189 1595) (in NIS) and a lowcontract of ( pl dl) (3381 957) ie (pd) (191 638) Below we discuss severaladditional counterfactual cases First we con-sider a case with a negative correlation be-tween risk and risk aversion (with the samemagnitude ie 084) Second we con-sider cases when the company ignores unob-served heterogeneity in one of the dimensionsie it views individuals as a draw from theestimated marginal distribution on one dimen-sion with the other dimension known and fixedat its estimated mean We do this exercise foreach dimension separately

To get intuition for the different effects Figure2 presents the estimated distribution in the spaceof (i ri) A small increase (decrease) in p (d)shifts the indifference setup to the right therebymaking some marginal individuals who were pre-viously just to the right of it switch to choosingthe regular deductible The demand trade-off isjust the comparison between the marginal loss ofthe company from all the marginal individuals

777VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

who no longer buy higher coverage with thehigher profits made from the inframarginal indi-viduals who still elect to choose higher coverageFigure 2 also helps in illustrating the effect ofadverse selection and the importance of the cor-relation coefficient As the menu shifts to the rightthe positive correlation implies that the marginalindividuals have higher risk than the average Thismeans that ldquolosingrdquo them (namely havingthem buy less coverage) is not as costly forthe insurance company as such individualsare on average more adversely selected Anegative correlation for example would havemade these marginal individuals more valuablethereby decreasing the incentive to increase pricesor deductibles from the current levels

Figure 4 presents the effects of pricing onprofits by varying the low-deductible levelkeeping the premium charged for it fixed at theobserved price (of p 191) It implies that the

current low deductible benefit of 638 NIS re-sults in additional annual profits of about 368NIS per customer This is about 037 percent oftotal operating profits per customer which isabout 1000 NIS Note however that after sub-tracting the administrative and claim-handlingcosts associated with each customer and claimthe relative magnitude of this effect will behigher Note also that the estimates imply thatthe current low-deductible level is suboptimalBy setting a smaller low-deductible benefit ofd 355 NIS (ie increasing the current lowdeductible by 283 NIS) additional profits canbe increased to 659 NIS51 Of course the find-

51 There is no reason of course to limit the choice of thecompany to only one additional deductible level Moredegrees of freedom in choosing the pricing menu will leadto higher profits

FIGURE 4 COUNTERFACTUALSmdashPROFITS

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) We plot the additional profits fromoffering a low deductible as a function of the attractiveness of a low deductible d dh dl fixing its price at the observedlevel The thick solid line presents the counterfactual profits as implied by the estimated benchmark model The other threecurves illustrate how the profits change in response to changes in the assumptions when the correlation between riskand risk aversion is negative (thin solid line) when there is no heterogeneity in risk aversion (dot-dashed line) and whenthere is no heterogeneity in risk (dashed line) The maxima (argmax) of the four curves respectively are 659 (355)714 (583) 0 and 704 (500) The dotted vertical line represents the observed level of d (638) which implies that theadditional profits from the observed low deductible are 368 NIS per policy

778 THE AMERICAN ECONOMIC REVIEW JUNE 2007

ing that the current pricing is suboptimal mayalso be due to a limitation of the model

Consistent with the intuition discussed aboveFigure 4 also shows that the incentive to in-crease prices (or lower the deductible) is higherwith positive correlation When the correlationbetween risk and risk aversion is negative theoptimal low deductible is lower It also showsthat ignoring either dimension of unobservedheterogeneity has an important effect on pric-ing decisions However while ignoring het-erogeneity in risk does not change thequalitative pattern by much ignoring hetero-geneity in risk aversion completely changesthe pricing analysis In fact given the esti-mated coefficients and the observed prices ifindividuals vary only in risk offering a lowdeductible does not increase profits due to ad-verse selection

Figure 5 breaks down the effect on profits bypresenting the pricing effect on the demand forlow deductible D(d p) and on the compo-sition effect (d p) The former is simplygenerated by the distribution of certainty equiv-alents implied by the joint distribution of i andri (Landsberger and Meilijson 1999) It is S-shaped due to the assumption of lognormal dis-tribution The shape of the composition effect isdriven by the relative variance of i and ri andby the correlation coefficient As the estimatesimply that most of the variation in certaintyequivalents is driven by variation in ri thestrong positive correlation implies that the com-position effect is monotonically decreasing inthe deductible level As the low-deductible op-tion becomes more favorable more peoplechoose it with the most risky individuals choos-ing it first The effect of the deductible level onthe composition effect is dramatically differentwhen the correlation between risk and risk aver-sion is negative When this is the case theobserved relationship between the deductiblelevel and the composition effect is mostly re-versed This is because the effect of risk aver-sion dominates that of adverse selection due toits higher variance

IV Concluding Remarks

The paper makes two separate contributionsFirst from a methodological standpoint we layout a conceptual framework through which one

can formulate and estimate a demand system forindividually customized contracts The key datarequirements for this approach are contractchoices individual choice sets and ex post riskrealizations Since such data may be available inmany other contexts the methodological frame-work may be useful to uncover structural pa-rameters in such settings As an example onecould consider annuity data and use guaranteeperiod choices and mortality data to identifyheterogeneity in risk (mortality) and in prefer-ences for wealth after death (Einav Finkelsteinand Paul Schrimpf 2006) Similarly one couldconsider loan data and use down-paymentchoices and default data to identify heterogene-ity in risk (default) and in liquidity (Will Ad-ams Einav and Jonathan D Levin 2006)Second from an economic standpoint we pro-vide a new set of estimates for the degree andheterogeneity of (absolute) risk aversion and itsrelationship with risk We discuss these below

While our estimates of risk aversion help topredict other related insurance decisions it isnatural to ask to what extent these parametersare relevant in other contexts This is essentiallyan empirical question which can be answeredonly by estimating risk aversion parameters fora variety of bet sizes and in a variety of con-texts Since isolating risk preferences in manycontexts is hard such exercises are rare leavingus with no definite answer for the scope ofmarkets for which our estimates may be rele-vant On one hand Rabin (2000) and Rabin andThaler (2001) argue that different decisions inlife are taken in different contexts and thereforemay be subject to different parameters in theutility function On the other hand classicaltheory suggests that each individual has onevalue function over her lifetime wealth so allrisky decisions take into account the same valuefunction and are therefore subject to the samerisk preferences Our view is somewhere inbetween We are more comfortable with extrap-olation of our risk aversion estimates to setupsthat are closer to the auto insurance marketcontext in which these parameters are esti-mated To assess ldquoclosenessrdquo it is important toconsider various factors over which contextsmay differ Such factors may include bet size aswell as informational and behavioral effectssuch as default options framing and rarity of theevents Since we use claim data to identify risktype we leave everything else to be interpreted as

779VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

FIGURE 5 COUNTERFACTUALSmdashSELECTION

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) Parallel to Figure 4 we breakdown the effects on profits to the share of consumers who choose a low deductible (bottom panel) and to the expected riskof this group (top panel) This is presented by the thick solid line for the estimates of the benchmark model As in Figure 4 wealso present these effects for three additional cases when the correlation between risk and risk aversion is negative (thin solidline) when there is no heterogeneity in risk aversion (dot-dashed line) and when there is no heterogeneity in risk (dashedline) The dotted vertical lines represent the observed level of d (638) for which the share of low deductible is 16 percentand their expected risk is 0264 This may be compared with the corresponding figures in Table 2A of 178 and 0309respectively Note however that the figure presents estimated quantities for the average individual in the data while Table2A presents the average quantities in the data so one should not expect the numbers to fit perfectly

780 THE AMERICAN ECONOMIC REVIEW JUNE 2007

risk aversion As an example overconfidence willbe captured by a lower level of estimated riskaversion so if overconfidence is more importantin auto insurance than in health insurance whenextrapolated to health insurance individuals maybehave as if they are more risk averse than weestimate them to be

With this caveat in mind let us discuss ourfour main findings First we find large hetero-geneity in risk preferences across individualsThis heterogeneity is important in various con-texts (a) it cautions against using pure cross-sectional variation to test expected utilitytheory (b) it may create strong selection ofparticipants into particular markets this maymake participants in voluntary markets quitedifferent in their risk preferences from those inmarkets in which participation is either manda-tory or driven by other factors and (c) it maymake the marginal individual quite differentfrom the average one Models in macroeconom-ics and finance which often use a representativeindividual framework may not be able to cap-ture and account for such differences

Our second set of findings concerns the wayrisk aversion relates to observable characteris-tics Our finding that women are more riskaverse than men has been documented in othersettings The finding that risk preferences ex-hibit a U-shaped pattern over the life cycle maybe interesting to explore further in other con-texts Other findings suggest that the estimatedcoefficient of risk aversion increases with ob-servables that are related to income and wealthAs we agree with the widely held belief of thedecreasing absolute risk aversion property ourpreferred interpretation for this finding is thatwealth and income may be endogenous gener-ating the estimated cross-sectional relationshipWhile lower risk aversion may be associatedwith higher propensity to become an entrepre-neur and thereby have higher wealth it mayalso be associated with lower propensity to saveor invest in education affecting wealth the otherway Therefore one important message of thisfinding is that accounting for heterogeneity inpreferences may be important as representativeconsumer models may provide misleading in-terpretations for otherwise natural results

The last two sets of findings concern therelationship between risk and preferences Sincerisk is particular to the context in which it ismeasured these findings may be sensitive to the

market context and may change once risk takesother forms Even within the auto insurancemarket it is important how risk is measuredMoreover risk in the auto insurance marketmay be conceptually different from risk in othermarkets In many markets risk is independentacross individuals In auto insurance howevermuch of the risk depends on coordinationamong drivers and therefore may be more re-lated to relative not absolute characteristicsFor this reason our finding of positive correla-tion between risk and risk aversion can coexistwith findings of negative correlation in othercontexts (Israel 2005 Finkelstein and McGarry2006) The positive correlation we find is alsoconsistent with the fact that the bivariate probittest in our data provides evidence for adverseselection (Cohen 2005) while similar reduced-form tests in other contexts do not

Finally we find that unobserved heterogene-ity in risk preferences is more important thanheterogeneity in risk This may be driven by thecasual evidence that insurance companies exertmuch effort and resources in collecting con-sumer data which are informative about riskclassification but not about preferences52 Weillustrate the empirical importance of our find-ings for the analysis of optimal contracts in autoinsurance The presence of more than one di-mension of unobserved heterogeneity may dra-matically change the nature of these contractsTheory is still not fully developed for suchmultidimensional screening problems as it typ-ically requires a small number of types (Lands-berger and Meilijson 1999) restricts the twodimensions to be independent of each other(Jean-Charles Rochet and Lars A Stole 2002)or assumes that the number of instrumentsavailable to the monopolist is not smaller thanthe dimension of unobserved heterogeneity(Steven Matthews and John Moore 1987 Rich-ard Arnott and Joseph E Stiglitz 1988)53 MarkArmstrong (1999) may be the closest theoreticalwork to the framework suggested here It cannot

52 This choice of data collection efforts may be justifiedif it is easier for such firms to price discriminate based onrisk but harder to price discriminate based on preferencesA common belief is that without cost-based (ie risk-based) justification for prices price discrimination may leadto consumer backlash

53 See also Smart (2000) Villeneuve (2003) and Jullienet al (2007) for related theoretical results

781VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

be directly applied however as it uses simpli-fying linearity assumptions which would behard to impose in the current context Our re-sults indicate that many applications can benefitfrom extending the theory to include the more

general case such as the one analyzed hereSuch a theory may also serve as a guide forusing supply-side restrictions in similar con-texts Our counterfactual analysis is a very pre-liminary start in this direction

APPENDIX A DESCRIPTION OF THE GIBBS SAMPLER

In this appendix we describe the setup of the Gibbs sampler that we use to estimate the modelOne of the main advantages of the Gibbs sampler is its ability to allow for data augmentation oflatent variables (Tanner and Wong 1987) In our context this amounts to augmenting the individual-specific risk aversion and risk type namely i rii1

n as additional parametersWe can write the model as follows

(22) ln i xi i

(23) ln ri xi vi

(24) choicei 1 if ri ri ()0 if ri ri ()

(25) claimsi Poissoni ti

(26) i

vi

iid

N00

2 r

r r2

Let

2 r

r r2 X

x1

xn

0

0 x1

xn

y r and ui i

vi

The set of parameters for which we want to have a posterior distribution is given by uii1

n The prior specifies that are independent of uii1n have a conventional

diffuse prior We adopt a hierarchical prior for uii1N

(27) uii 1n

iid

N0

(28) 1 Wishart2a Q

782 THE AMERICAN ECONOMIC REVIEW JUNE 2007

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 28: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

within the first ten years of driving Thus ourcomplete information model is a more appeal-ing description of experienced drivers Theresults for experienced drivers are almost iden-tical to the results from the benchmark specifi-cation (Table 6) suggesting that the mainresults are not driven by the fraction of individ-uals who are less likely to know their risk typesThe results for inexperienced drivers show asimilar pattern and magnitude Consistent withthe learning hypothesis however they show alarger dispersion in risk preferences which maybe due to incomplete information and there-fore to more random choices of deductiblelevels

The second exercise we perform is a morestructural version of the learning story whichallows even experienced drivers to have incom-plete information regarding their risk types Wedo so by estimating a different specification ofthe model that assumes that individuals areBayesian and update their information abouttheir own (stable) risk types over time usingonly information about the number of claimsthey make each year While we do not observecomplete claims histories of individuals wecan simulate such histories and integrate overthese simulations Thus individualsrsquo informa-tion would be related to their true type andwould be more precise with longer driving his-tories The full modeling and estimation detailsof this specification are provided in the Appen-dix We view this model as an extreme versionof incomplete information as there are manyother sources through which individuals maylearn about their own types and thereby havebetter information about their types than whatwe estimate them to have While the resultsimply that the levels of risk aversion and heter-ogeneity are lower than the benchmark esti-mates (Table 6) the order of magnitude andqualitative pattern are quite similar suggestingthat the main qualitative findings are robust tothe information structure

Sample SelectionmdashThere are two differentways to think about the contract selection pro-cess One possibility is that individuals firstselect an insurer based on advertisement wordof mouth or access to an agent Then individ-uals select the insurance contract from amongseveral contracts the selected insurer offersThis selection process is consistent with the way

the company executives view their business45

Another possibility is that individuals first col-lect information about all available insurancecontracts and then choose their most preferredone According to the industryrsquos conventionalwisdom individuals do not shop much acrossinsurers and therefore this selection processseems less important

The unique characteristics of direct insurersmay attract individuals who are more likely toexperiment with new ways of doing businessand may therefore be less risk averse than thegeneral population In Table 7 we compare thedemographics of our sample of policyholderswith those of the general Israeli populationThis comparison reflects a similar intuitioncompared with the general population our av-erage policyholder is slightly younger moreeducated more likely to be male and less likelyto be married or an immigrant This direction ofselection may also apply to unobserved riskpreferences thereby making our policyholderson average less risk averse than a representa-tive individual This suggests that the level ofrisk aversion that we find may be viewed as alower bound on the level of risk aversion in thegeneral Israeli population

One potential way to model sample selectionis to allow for an additional outside option to beselected For the vast majority of individuals weobserve the outside option is to purchase sim-ilar insurance from competing insurance agen-cies Unfortunately data on the structure ofcompeting contracts their prices and the waythey vary with individual characteristics are un-available This makes us uncomfortable to try tomodel sample selection as results from anysuch model will be driven by our assumptionsrather than by meaningful variation in the dataThe results are still meaningful for two reasonsFirst this is a large population accounting forabout 7 percent of all drivers in Israel Secondto the extent that our estimates suggest higherlevels of risk aversion than previously esti-mated and that the sample selection is likely to

45 As discussed in Section I the company was the firstdirect insurance provider and it offered significantly lowerpremiums than those offered by competitors due to signif-icant cost advantage In Section I we also discuss theliterature which emphasizes that choice of direct insurers isdriven primarily by nonmonetary ldquoamenitiesrdquo

772 THE AMERICAN ECONOMIC REVIEW JUNE 2007

bias these estimates downward the results arestill informative

To assess the magnitude of sample selectionin driving the main results we perform severaltests First we estimate the benchmark modelseparately for the first two years of the compa-nyrsquos operation during which the company wasthe only company to sell insurance directly andfor the remaining three years after additionaldirect insurers entered the market Second weestimate the benchmark model separately forindividuals who were referred to the companyby word of mouth and for those who heardabout the company through advertisement (pri-marily on television) The latter may havesearched more and might be more price sensi-tive Third we estimate the benchmark modelseparately for individuals who renewed theirpolicy with the company and for those who didnot renew It seems likely that individuals whodid not renew are less selected as most of themswitch back to regular insurers who insure themajority of the population Fourth we estimatethe model for the second deductible choicemade by those individuals who renew It couldbe argued that switching to other companies is

costly so outside options are not as important indriving deductible choices for those who renewAll these results are summarized in Table 6 andshow some important differences between thetwo groups within each pair of subsamples prob-ably reflecting selection For all groups howeverthe qualitative pattern of the results and the orderof magnitude and dispersion of risk aversion aresimilar to those of the full sample This is sugges-tive that correcting for sample selection is unlikelyto change the qualitative results

E Caveats

Moral HazardmdashThroughout our analysiswe abstract from moral hazard ie we assumethat i can vary across individuals but is invari-ant to the coverage choice There are two typesof moral hazard that may play a role in thiscontext First individuals with less coveragemay take greater precaution and drive morecarefully thereby reducing their claim risk rateSecond conditional on a claim event peoplewith higher deductibles are less likely to file aclaim there exists a range of claims for which

TABLE 7mdashREPRESENTATIVENESS

Variable Sampleb Populationc Car ownersd

Agea 4114 (1237) 4255 (1801) 4511 (1413)Female 0316 0518 0367Family Single 0143 0233 0067

Married 0780 0651 0838Divorced 0057 0043 0043Widower 0020 0074 0052

Education Elementary 0029 0329 0266High school 0433 0384 0334Technical 0100 0131 0165College 0438 0155 0234

Emigrant 0335 0445 0447

Obs 105800 723615 255435

a For the age variable the only continuous variable in the table we provide both the mean and the standard deviation (inparentheses)

b The figures are derived from Table 1 The family and education variables are renormalized so they add up to one weignore those individuals for whom we do not have family status or education level This is particularly relevant for theeducation variable which is not available for about half of the sample it seems likely that unreported education levels arenot random but associated with lower levels of education This may help in explaining at least some of the gap in educationlevels across the columns

c This column is based on a random sample of the Israeli population as of 1995 We use only adult population ieindividuals who are 18 years of age or older

d This column is based on a subsample of the population sample The data provide information about car ownership onlyat the household level not at the individual level Thus we define an individual as a car owner if (a) the household owns atleast one car and the individual is the head of the household or (b) the household owns at least two cars and the individualis the spouse of the head of the household

773VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

filing is profitable only under a low deductible(the literature often refers to this second effectas ldquoex post moral hazardrdquo)

To the extent that moral hazard exists abstract-ing from it will likely bias our estimates of riskaversion downward To see this note that adjust-ing behavior will help individuals to self-insureagainst uninsured costs Similarly ex post moralhazard would reduce the value of a low deduct-ible as in the event of a claim the gain from alower deductible would sometimes be less thanthe difference between the two deductible levelsBoth these effects will make a low deductible lessattractive requiring individuals to be even morerisk averse than we estimate in order to purchasemore coverage Below we discuss why in ourview abstracting from moral hazard is a reason-able approximation in this setting

It seems reasonable to conjecture that ceterisparibus insured individuals will drive less care-fully than uninsured ones It may also seemreasonable that the existence of a deductiblemay make individuals more careful about smalldamages to their car However when all choicesinclude a deductible and deductibles are similarin their magnitude it seems less likely thatdriving or care behavior will be affected46 Fi-nally to separately identify moral hazard onewould need another dimension of the data overwhich risk types remain fixed but coveragechoices vary exogenously (Israel 2004 Jaap HAbbring et al forthcoming)

We rely on the data to justify why we abstractfrom the second potential effect that of ex postmoral hazard Figure 3 presents data on theclaim amounts and shows that about 99 percentof the claims filed by policyholders with low-deductible policies were for amounts greaterthan the higher deductible level If the distribu-tion of amounts of potential claims does notvary with the deductible choice and if individ-uals file a claim for any loss that exceeds their

deductible level this suggests that 99 percent ofthe claims would have been filed under eitherdeductible choice making the assumption toabstract from moral hazard not very restrictive

Individuals however may choose not to filea claim even when the claim amount exceedsthe deductible level This may happen due toexperience rating which increases future insur-ance premiums These dynamic effects do notdepend on the deductible level at the time of theclaim so they simply enter in an additive wayUsing our data on individuals who renew theirpolicies with the company we can assess howbig the dynamic effects are These data showthat the price effect of a claim lasts for threeyears and is highest when an individual files hissecond claim within a year In such a case hewould face about a 20 percent increase in hisinsurance premium in the subsequent year 10percent in the year after and 5 percent in thethird year after the claim The regular premiumis about twice the regular deductible amount soan upper bound for the dynamic costs is 70percent of the regular deductible In most casesthe actual dynamic costs are much lower thanthis upper bound (the dynamic costs of say thefirst claim within a year are close to zero) Inaddition an individual can opt out of the con-tract and switch to a different insurance pro-vider This is likely to reduce his dynamic costsbecause in Israel unlike in the United Statesand many other countries there is no publicrecord of past claims Therefore insurance pro-viders can take full advantage of past recordsonly for their past customers For this reasonnew customers will of course face higher pre-miums than existing ones but the premium in-crease would not be as high as it would havebeen with the old insurance provider This isdue to the presence of ldquoinnocentrdquo new custom-ers who are pooled together with the switchers(Cohen 2003)47 Using 70 percent as a conser-vative upper bound Figure 3 shows that about93 percent of those claims filed by individualswith a low deductible would have also been46 This assumption is consistent with Cohen and Einav

(2003) who find no evidence for behavioral response tochanges in seat belt laws The following anecdotal obser-vation also supports this In an informal survey we con-ducted among our colleagues all of them were aware of adeductible in their auto insurance policy but fewer than 20percent knew its level This does not imply that 80 percentof our colleagues did not pay attention to their deductiblechoice at the time the choice was made It does implyhowever that their driving or care behavior cannot dependon the deductible level

47 New customers may voluntarily report their claimshistory to their new insurance provider Voluntary disclo-sure of past claims is as may be expected not truthful Ourdata suggest an unconditional claim rate of 02453 in oursample population Our data on claims history as voluntar-ily disclosed by the same individuals suggest a claim rate of00604 which is four times lower

774 THE AMERICAN ECONOMIC REVIEW JUNE 2007

filed with a regular deductible While this is notnegligible it applies for only a small fraction ofindividuals For the vast majority of them the99 percent discussed above is a more appropri-ate benchmark Therefore ex post moral hazardis unlikely to play a major role in this settingand one can abstract from the loss distributionand focus on claim rates as we do in thispaper48

Additional Cost of an AccidentmdashOur modelassumes that in the event of an accident theonly incurred costs are those associated with thedeductible payment In practice however other

transaction costs may be incurred such as thetime spent for appraisal of the damage the costsassociated with renting a replacement car for theduration of a repair etc Such costs could bereadily incorporated into the model To illus-trate we assume that these costs are known inadvance and are given by a constant c (whichcould in principle vary with each individual)Since c will not vary with the chosen level ofdeductible it will not affect the value d andwill enter the empirical model only through itseffect on d In particular equation (7) willchange to

(19) r

p

d 1

d c

and everything else will remain the same

48 We would not be as comfortable with this statementfor the choice of high and very high deductibles which areat much higher levels This is one additional reason to focusonly on the choice between low and regular deductibles

FIGURE 3 CLAIM DISTRIBUTIONS

Notes This figure plots kernel densities of the claim amounts estimated separately depending on the deductible choice Forease of comparison we normalize the claim amounts by the level of the regular deductible (ie the normalization is invariantto the deductible choice) and truncate the distribution at ten (the truncated part which includes a fat tail outside of the figureaccounts for about 25 percent of the distribution and is roughly similar for both deductible choices) The thick line presentsthe distribution of the claim amounts for individuals who chose a low deductible while the thin line does the same for thosewho chose a regular deductible Clearly both distributions are truncated from below at the deductible level The figure showsthat the distributions are fairly similar Assuming that the claim amount distribution is the same the area below the thickerline between 06 and 1 is the fraction of claims that would fall between the two deductible levels and therefore (absentdynamic incentives) would be filed only if a low deductible were chosen This area (between the two dotted vertical lines)amounts to 13 percent implying that the potential bias arising from restricting attention to claim rate (and abstracting fromthe claim distribution) is quite limited As we discuss in the text dynamic incentives due to experience rating may increasethe costs of filing a claim shifting the region in which the deductible choice matters to the right an upper bound to these costsis about 70 percent of the regular deductible covering an area (between the two dashed vertical lines) that integrates to morethan 7 percent Note however that these dynamic incentives are a very conservative upper bound they apply to less than 15percent of the individuals and do not account for the exit option which significantly reduces these dynamic costs

775VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

This implies that in principle such costs willhave no effect on the results of the counterfac-tual exercise we present later The costs willhowever affect the interpretation of the esti-mates of risk aversion In particular instead ofthe distribution of r we will now be estimatingthe distribution of r(d c)d so the reportedestimates of the coefficient of absolute riskaversion will be biased upward The magnitudeof the bias depends on the size of this transac-tion cost c compared to the average deductibled If c is relatively small the bias is negligibleIf however c is as big as the (average) deduct-ible level all our reported estimates of the levelof risk aversion should be divided by two (butthe coefficients on observables which are semi-elasticities will not change) The intuitionwould be similar but more involved if c variesacross individuals but not proportionally to d

Data about such transaction costs are ofcourse not available The following back-of-the-envelope exercise may provide some guidanceas to the magnitude of such costs We collecteddata from the Israeli police about the annual num-bers of accidents49 accidents with fatalities andaccidents with severe injuries in Israel for theyears 1996ndash1999 We then divided these numbersby our estimate of the total number of auto insur-ance claims in Israel (161859)50 We obtain that159 percent of the claims involve reported acci-dents 22 percent involve accidents with severeinjuries and 03 percent involve fatal accidentsThus the majority of claims reflect small unre-ported accidents perhaps suggesting that theseadditional costs of a claim are often not too large

Deviations from Expected Utility TheorymdashThroughout the paper we restrict attention toexpected utility maximizers Despite much ev-idence in the literature against some of the pre-dictions of expected utility theory it still seemsto us the most natural benchmark to specify andone that facilitates comparison to previous stud-ies We note that expected utility theory is as-sumed it is not and cannot be tested within our

framework Given our cross-sectional analysiswhich in principle allows flexible forms ofunobserved heterogeneity in risk preferencesthere are no testable restrictions imposed byexpected utility theory We should also note thatmuch (but not all) of the documented evidenceagainst expected utility theory arises with ex-treme risk probabilities which are close to zeroor one Our data (and our estimates) are basedon risk probabilities that are all in the range of010 to 035 Over this range expected utilityseems to perform better Finally it is importantto stress two points First at the conceptuallevel it is straightforward to use an alternativetheory of decisions under uncertainty If con-ditional on objective risk individuals vary in asingle dimension the same conceptual modeland empirical strategy can be applied All oneneeds to do is to specify the parameter overwhich decisions vary and construct an indiffer-ence set in the space of the specified parameterand (objective) risk types similar to the onepresented in Figure 2 Second any alternativemodel of decisions under uncertainty would re-quire us to take an even stronger view regardingthe parameterized objective function For exam-ple prospect theory (Daniel Kahneman andAmos Tversky 1979) would require us to pa-rameterize not only the curvature of individu-alsrsquo utility functions but also their referencepoints for which there is no natural choice inour context Similar issues arise if we try toapply decision weights (Tversky and PeterWakker 1995) or measures of overconfidencewith respect to driving ability

F Implications for Profits and Pricing

We now look at how firm profits vary withalternative pricing schemes This exercise isinteresting for several reasons First althoughwe do not use supply-side information for esti-mation it shows how one may incorporate suchinformation in estimation Second it illustratesthe conceptual trade-off faced by a monopolistthat operates in a market with adverse selectionAlthough the conceptual trade-off betweenhigher demand and worse selection is wellknown and has been extensively analyzed in thetheoretical literature quantifying it is importantto understand its empirical relevance Finallywe argued earlier that unobserved heterogeneityin risk aversion seems more important than un-

49 An accident is counted in this measure if it was re-ported to the police and a police officer arrived at the scene

50 This is estimated by taking the number of activepolicies at the end of our sample (45871) dividing it by ourbest guess for the share of the market the company had atthe time (7 percent) and multiplying it by the estimatedclaim rate in our data as computed in Table 2 (0245)

776 THE AMERICAN ECONOMIC REVIEW JUNE 2007

observed heterogeneity in risk The current ex-ercise shows that this conclusion also translatesto pricing and profits

Throughout this section we hold fixed thedistribution of risk and risk aversion the firmfaces Since we have little information about thedeterminants of overall demand faced by thefirm and hence how restrictive it is to hold thedistribution fixed we make the simplifying as-sumption that individuals make their choicessequentially They first choose the insuranceprovider by observing only the price of theregular deductible Once they decide to buy apolicy from the insurer they choose the deduct-ible level This seems a reasonable approxima-tion as the regular deductible is the one alwaysadvertised and initially quoted while the otheroptions are revealed only once the potentialcustomer and the insurance sales person ldquogetinto detailsrdquo Consistent with this assumptionwe assume that the regular premium and de-ductible are dictated by competitive conditionsand we focus on the choice of the low deduct-ible and its associated premium

From the companyrsquos standpoint each indi-vidual can be represented by a random draw of(i ri) from the conditional (on observables)distribution of risk and risk aversion

(20) ln i

ln ri Nxi

xi

2 r

r r2

When analyzing the optimal menu to offer suchan individual the company is assumed to berisk neutral and to maximize expected profitsSuppose the company offered only the regulardeductible and premium (dh ph) Let the ex-pected profits from this strategy be 0 Givenour assumptions we proceed by analyzing howthe firmrsquos profits are affected by a choice of anadditional low deductible option (dl pl) withdl dh and pl ph It is easy to use a changein variables and analyze the choice of d dh dl and p pl ph Expected profits arenow given by

(21) maxdp

0 Prri rii d pp

d Eiri rii d p

The trade-off in the companyrsquos decision isstraightforward Each new customer who chooses

the low combination pays an additional p up-front but saves d for each accident she is in-volved in This translates into two effects thatenter the companyrsquos decision problem The first issimilar to a standard pricing problem higher(lower) price difference (deductible difference)p (d) leads to a higher markup (on those indi-viduals who select the low deductible) but tolower quantity (or probability of purchase) asfewer individuals elect to choose the low deduct-ible This effect enters the profit function throughD(d p) Pr(ri ri(i d p)) The secondcomposition effect arises because of adverse se-lection As the price of the low deductible in-creases those individuals who still elect to choosethe low combination are ceteris paribus thosewith higher risk This effect enters through (dp) E(iri ri(i d p)) Its magnitude andsign depend on the relative heterogeneity of i andri and on the correlation between them Sinceneither D(d p) nor (d p) has a closed-form solution we analyze this decision problemgraphically where D(d p) and (d p) arenumerically computed using simulations from thejoint distribution

We illustrate our analysis by using the meanindividual in the data whose expected ln(i)and ln(ri) are 154 and 1181 respectively(based on Table 4) Such an individual has tochoose between a regular contract of ( phdh) (3189 1595) (in NIS) and a lowcontract of ( pl dl) (3381 957) ie (pd) (191 638) Below we discuss severaladditional counterfactual cases First we con-sider a case with a negative correlation be-tween risk and risk aversion (with the samemagnitude ie 084) Second we con-sider cases when the company ignores unob-served heterogeneity in one of the dimensionsie it views individuals as a draw from theestimated marginal distribution on one dimen-sion with the other dimension known and fixedat its estimated mean We do this exercise foreach dimension separately

To get intuition for the different effects Figure2 presents the estimated distribution in the spaceof (i ri) A small increase (decrease) in p (d)shifts the indifference setup to the right therebymaking some marginal individuals who were pre-viously just to the right of it switch to choosingthe regular deductible The demand trade-off isjust the comparison between the marginal loss ofthe company from all the marginal individuals

777VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

who no longer buy higher coverage with thehigher profits made from the inframarginal indi-viduals who still elect to choose higher coverageFigure 2 also helps in illustrating the effect ofadverse selection and the importance of the cor-relation coefficient As the menu shifts to the rightthe positive correlation implies that the marginalindividuals have higher risk than the average Thismeans that ldquolosingrdquo them (namely havingthem buy less coverage) is not as costly forthe insurance company as such individualsare on average more adversely selected Anegative correlation for example would havemade these marginal individuals more valuablethereby decreasing the incentive to increase pricesor deductibles from the current levels

Figure 4 presents the effects of pricing onprofits by varying the low-deductible levelkeeping the premium charged for it fixed at theobserved price (of p 191) It implies that the

current low deductible benefit of 638 NIS re-sults in additional annual profits of about 368NIS per customer This is about 037 percent oftotal operating profits per customer which isabout 1000 NIS Note however that after sub-tracting the administrative and claim-handlingcosts associated with each customer and claimthe relative magnitude of this effect will behigher Note also that the estimates imply thatthe current low-deductible level is suboptimalBy setting a smaller low-deductible benefit ofd 355 NIS (ie increasing the current lowdeductible by 283 NIS) additional profits canbe increased to 659 NIS51 Of course the find-

51 There is no reason of course to limit the choice of thecompany to only one additional deductible level Moredegrees of freedom in choosing the pricing menu will leadto higher profits

FIGURE 4 COUNTERFACTUALSmdashPROFITS

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) We plot the additional profits fromoffering a low deductible as a function of the attractiveness of a low deductible d dh dl fixing its price at the observedlevel The thick solid line presents the counterfactual profits as implied by the estimated benchmark model The other threecurves illustrate how the profits change in response to changes in the assumptions when the correlation between riskand risk aversion is negative (thin solid line) when there is no heterogeneity in risk aversion (dot-dashed line) and whenthere is no heterogeneity in risk (dashed line) The maxima (argmax) of the four curves respectively are 659 (355)714 (583) 0 and 704 (500) The dotted vertical line represents the observed level of d (638) which implies that theadditional profits from the observed low deductible are 368 NIS per policy

778 THE AMERICAN ECONOMIC REVIEW JUNE 2007

ing that the current pricing is suboptimal mayalso be due to a limitation of the model

Consistent with the intuition discussed aboveFigure 4 also shows that the incentive to in-crease prices (or lower the deductible) is higherwith positive correlation When the correlationbetween risk and risk aversion is negative theoptimal low deductible is lower It also showsthat ignoring either dimension of unobservedheterogeneity has an important effect on pric-ing decisions However while ignoring het-erogeneity in risk does not change thequalitative pattern by much ignoring hetero-geneity in risk aversion completely changesthe pricing analysis In fact given the esti-mated coefficients and the observed prices ifindividuals vary only in risk offering a lowdeductible does not increase profits due to ad-verse selection

Figure 5 breaks down the effect on profits bypresenting the pricing effect on the demand forlow deductible D(d p) and on the compo-sition effect (d p) The former is simplygenerated by the distribution of certainty equiv-alents implied by the joint distribution of i andri (Landsberger and Meilijson 1999) It is S-shaped due to the assumption of lognormal dis-tribution The shape of the composition effect isdriven by the relative variance of i and ri andby the correlation coefficient As the estimatesimply that most of the variation in certaintyequivalents is driven by variation in ri thestrong positive correlation implies that the com-position effect is monotonically decreasing inthe deductible level As the low-deductible op-tion becomes more favorable more peoplechoose it with the most risky individuals choos-ing it first The effect of the deductible level onthe composition effect is dramatically differentwhen the correlation between risk and risk aver-sion is negative When this is the case theobserved relationship between the deductiblelevel and the composition effect is mostly re-versed This is because the effect of risk aver-sion dominates that of adverse selection due toits higher variance

IV Concluding Remarks

The paper makes two separate contributionsFirst from a methodological standpoint we layout a conceptual framework through which one

can formulate and estimate a demand system forindividually customized contracts The key datarequirements for this approach are contractchoices individual choice sets and ex post riskrealizations Since such data may be available inmany other contexts the methodological frame-work may be useful to uncover structural pa-rameters in such settings As an example onecould consider annuity data and use guaranteeperiod choices and mortality data to identifyheterogeneity in risk (mortality) and in prefer-ences for wealth after death (Einav Finkelsteinand Paul Schrimpf 2006) Similarly one couldconsider loan data and use down-paymentchoices and default data to identify heterogene-ity in risk (default) and in liquidity (Will Ad-ams Einav and Jonathan D Levin 2006)Second from an economic standpoint we pro-vide a new set of estimates for the degree andheterogeneity of (absolute) risk aversion and itsrelationship with risk We discuss these below

While our estimates of risk aversion help topredict other related insurance decisions it isnatural to ask to what extent these parametersare relevant in other contexts This is essentiallyan empirical question which can be answeredonly by estimating risk aversion parameters fora variety of bet sizes and in a variety of con-texts Since isolating risk preferences in manycontexts is hard such exercises are rare leavingus with no definite answer for the scope ofmarkets for which our estimates may be rele-vant On one hand Rabin (2000) and Rabin andThaler (2001) argue that different decisions inlife are taken in different contexts and thereforemay be subject to different parameters in theutility function On the other hand classicaltheory suggests that each individual has onevalue function over her lifetime wealth so allrisky decisions take into account the same valuefunction and are therefore subject to the samerisk preferences Our view is somewhere inbetween We are more comfortable with extrap-olation of our risk aversion estimates to setupsthat are closer to the auto insurance marketcontext in which these parameters are esti-mated To assess ldquoclosenessrdquo it is important toconsider various factors over which contextsmay differ Such factors may include bet size aswell as informational and behavioral effectssuch as default options framing and rarity of theevents Since we use claim data to identify risktype we leave everything else to be interpreted as

779VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

FIGURE 5 COUNTERFACTUALSmdashSELECTION

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) Parallel to Figure 4 we breakdown the effects on profits to the share of consumers who choose a low deductible (bottom panel) and to the expected riskof this group (top panel) This is presented by the thick solid line for the estimates of the benchmark model As in Figure 4 wealso present these effects for three additional cases when the correlation between risk and risk aversion is negative (thin solidline) when there is no heterogeneity in risk aversion (dot-dashed line) and when there is no heterogeneity in risk (dashedline) The dotted vertical lines represent the observed level of d (638) for which the share of low deductible is 16 percentand their expected risk is 0264 This may be compared with the corresponding figures in Table 2A of 178 and 0309respectively Note however that the figure presents estimated quantities for the average individual in the data while Table2A presents the average quantities in the data so one should not expect the numbers to fit perfectly

780 THE AMERICAN ECONOMIC REVIEW JUNE 2007

risk aversion As an example overconfidence willbe captured by a lower level of estimated riskaversion so if overconfidence is more importantin auto insurance than in health insurance whenextrapolated to health insurance individuals maybehave as if they are more risk averse than weestimate them to be

With this caveat in mind let us discuss ourfour main findings First we find large hetero-geneity in risk preferences across individualsThis heterogeneity is important in various con-texts (a) it cautions against using pure cross-sectional variation to test expected utilitytheory (b) it may create strong selection ofparticipants into particular markets this maymake participants in voluntary markets quitedifferent in their risk preferences from those inmarkets in which participation is either manda-tory or driven by other factors and (c) it maymake the marginal individual quite differentfrom the average one Models in macroeconom-ics and finance which often use a representativeindividual framework may not be able to cap-ture and account for such differences

Our second set of findings concerns the wayrisk aversion relates to observable characteris-tics Our finding that women are more riskaverse than men has been documented in othersettings The finding that risk preferences ex-hibit a U-shaped pattern over the life cycle maybe interesting to explore further in other con-texts Other findings suggest that the estimatedcoefficient of risk aversion increases with ob-servables that are related to income and wealthAs we agree with the widely held belief of thedecreasing absolute risk aversion property ourpreferred interpretation for this finding is thatwealth and income may be endogenous gener-ating the estimated cross-sectional relationshipWhile lower risk aversion may be associatedwith higher propensity to become an entrepre-neur and thereby have higher wealth it mayalso be associated with lower propensity to saveor invest in education affecting wealth the otherway Therefore one important message of thisfinding is that accounting for heterogeneity inpreferences may be important as representativeconsumer models may provide misleading in-terpretations for otherwise natural results

The last two sets of findings concern therelationship between risk and preferences Sincerisk is particular to the context in which it ismeasured these findings may be sensitive to the

market context and may change once risk takesother forms Even within the auto insurancemarket it is important how risk is measuredMoreover risk in the auto insurance marketmay be conceptually different from risk in othermarkets In many markets risk is independentacross individuals In auto insurance howevermuch of the risk depends on coordinationamong drivers and therefore may be more re-lated to relative not absolute characteristicsFor this reason our finding of positive correla-tion between risk and risk aversion can coexistwith findings of negative correlation in othercontexts (Israel 2005 Finkelstein and McGarry2006) The positive correlation we find is alsoconsistent with the fact that the bivariate probittest in our data provides evidence for adverseselection (Cohen 2005) while similar reduced-form tests in other contexts do not

Finally we find that unobserved heterogene-ity in risk preferences is more important thanheterogeneity in risk This may be driven by thecasual evidence that insurance companies exertmuch effort and resources in collecting con-sumer data which are informative about riskclassification but not about preferences52 Weillustrate the empirical importance of our find-ings for the analysis of optimal contracts in autoinsurance The presence of more than one di-mension of unobserved heterogeneity may dra-matically change the nature of these contractsTheory is still not fully developed for suchmultidimensional screening problems as it typ-ically requires a small number of types (Lands-berger and Meilijson 1999) restricts the twodimensions to be independent of each other(Jean-Charles Rochet and Lars A Stole 2002)or assumes that the number of instrumentsavailable to the monopolist is not smaller thanthe dimension of unobserved heterogeneity(Steven Matthews and John Moore 1987 Rich-ard Arnott and Joseph E Stiglitz 1988)53 MarkArmstrong (1999) may be the closest theoreticalwork to the framework suggested here It cannot

52 This choice of data collection efforts may be justifiedif it is easier for such firms to price discriminate based onrisk but harder to price discriminate based on preferencesA common belief is that without cost-based (ie risk-based) justification for prices price discrimination may leadto consumer backlash

53 See also Smart (2000) Villeneuve (2003) and Jullienet al (2007) for related theoretical results

781VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

be directly applied however as it uses simpli-fying linearity assumptions which would behard to impose in the current context Our re-sults indicate that many applications can benefitfrom extending the theory to include the more

general case such as the one analyzed hereSuch a theory may also serve as a guide forusing supply-side restrictions in similar con-texts Our counterfactual analysis is a very pre-liminary start in this direction

APPENDIX A DESCRIPTION OF THE GIBBS SAMPLER

In this appendix we describe the setup of the Gibbs sampler that we use to estimate the modelOne of the main advantages of the Gibbs sampler is its ability to allow for data augmentation oflatent variables (Tanner and Wong 1987) In our context this amounts to augmenting the individual-specific risk aversion and risk type namely i rii1

n as additional parametersWe can write the model as follows

(22) ln i xi i

(23) ln ri xi vi

(24) choicei 1 if ri ri ()0 if ri ri ()

(25) claimsi Poissoni ti

(26) i

vi

iid

N00

2 r

r r2

Let

2 r

r r2 X

x1

xn

0

0 x1

xn

y r and ui i

vi

The set of parameters for which we want to have a posterior distribution is given by uii1

n The prior specifies that are independent of uii1n have a conventional

diffuse prior We adopt a hierarchical prior for uii1N

(27) uii 1n

iid

N0

(28) 1 Wishart2a Q

782 THE AMERICAN ECONOMIC REVIEW JUNE 2007

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 29: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

bias these estimates downward the results arestill informative

To assess the magnitude of sample selectionin driving the main results we perform severaltests First we estimate the benchmark modelseparately for the first two years of the compa-nyrsquos operation during which the company wasthe only company to sell insurance directly andfor the remaining three years after additionaldirect insurers entered the market Second weestimate the benchmark model separately forindividuals who were referred to the companyby word of mouth and for those who heardabout the company through advertisement (pri-marily on television) The latter may havesearched more and might be more price sensi-tive Third we estimate the benchmark modelseparately for individuals who renewed theirpolicy with the company and for those who didnot renew It seems likely that individuals whodid not renew are less selected as most of themswitch back to regular insurers who insure themajority of the population Fourth we estimatethe model for the second deductible choicemade by those individuals who renew It couldbe argued that switching to other companies is

costly so outside options are not as important indriving deductible choices for those who renewAll these results are summarized in Table 6 andshow some important differences between thetwo groups within each pair of subsamples prob-ably reflecting selection For all groups howeverthe qualitative pattern of the results and the orderof magnitude and dispersion of risk aversion aresimilar to those of the full sample This is sugges-tive that correcting for sample selection is unlikelyto change the qualitative results

E Caveats

Moral HazardmdashThroughout our analysiswe abstract from moral hazard ie we assumethat i can vary across individuals but is invari-ant to the coverage choice There are two typesof moral hazard that may play a role in thiscontext First individuals with less coveragemay take greater precaution and drive morecarefully thereby reducing their claim risk rateSecond conditional on a claim event peoplewith higher deductibles are less likely to file aclaim there exists a range of claims for which

TABLE 7mdashREPRESENTATIVENESS

Variable Sampleb Populationc Car ownersd

Agea 4114 (1237) 4255 (1801) 4511 (1413)Female 0316 0518 0367Family Single 0143 0233 0067

Married 0780 0651 0838Divorced 0057 0043 0043Widower 0020 0074 0052

Education Elementary 0029 0329 0266High school 0433 0384 0334Technical 0100 0131 0165College 0438 0155 0234

Emigrant 0335 0445 0447

Obs 105800 723615 255435

a For the age variable the only continuous variable in the table we provide both the mean and the standard deviation (inparentheses)

b The figures are derived from Table 1 The family and education variables are renormalized so they add up to one weignore those individuals for whom we do not have family status or education level This is particularly relevant for theeducation variable which is not available for about half of the sample it seems likely that unreported education levels arenot random but associated with lower levels of education This may help in explaining at least some of the gap in educationlevels across the columns

c This column is based on a random sample of the Israeli population as of 1995 We use only adult population ieindividuals who are 18 years of age or older

d This column is based on a subsample of the population sample The data provide information about car ownership onlyat the household level not at the individual level Thus we define an individual as a car owner if (a) the household owns atleast one car and the individual is the head of the household or (b) the household owns at least two cars and the individualis the spouse of the head of the household

773VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

filing is profitable only under a low deductible(the literature often refers to this second effectas ldquoex post moral hazardrdquo)

To the extent that moral hazard exists abstract-ing from it will likely bias our estimates of riskaversion downward To see this note that adjust-ing behavior will help individuals to self-insureagainst uninsured costs Similarly ex post moralhazard would reduce the value of a low deduct-ible as in the event of a claim the gain from alower deductible would sometimes be less thanthe difference between the two deductible levelsBoth these effects will make a low deductible lessattractive requiring individuals to be even morerisk averse than we estimate in order to purchasemore coverage Below we discuss why in ourview abstracting from moral hazard is a reason-able approximation in this setting

It seems reasonable to conjecture that ceterisparibus insured individuals will drive less care-fully than uninsured ones It may also seemreasonable that the existence of a deductiblemay make individuals more careful about smalldamages to their car However when all choicesinclude a deductible and deductibles are similarin their magnitude it seems less likely thatdriving or care behavior will be affected46 Fi-nally to separately identify moral hazard onewould need another dimension of the data overwhich risk types remain fixed but coveragechoices vary exogenously (Israel 2004 Jaap HAbbring et al forthcoming)

We rely on the data to justify why we abstractfrom the second potential effect that of ex postmoral hazard Figure 3 presents data on theclaim amounts and shows that about 99 percentof the claims filed by policyholders with low-deductible policies were for amounts greaterthan the higher deductible level If the distribu-tion of amounts of potential claims does notvary with the deductible choice and if individ-uals file a claim for any loss that exceeds their

deductible level this suggests that 99 percent ofthe claims would have been filed under eitherdeductible choice making the assumption toabstract from moral hazard not very restrictive

Individuals however may choose not to filea claim even when the claim amount exceedsthe deductible level This may happen due toexperience rating which increases future insur-ance premiums These dynamic effects do notdepend on the deductible level at the time of theclaim so they simply enter in an additive wayUsing our data on individuals who renew theirpolicies with the company we can assess howbig the dynamic effects are These data showthat the price effect of a claim lasts for threeyears and is highest when an individual files hissecond claim within a year In such a case hewould face about a 20 percent increase in hisinsurance premium in the subsequent year 10percent in the year after and 5 percent in thethird year after the claim The regular premiumis about twice the regular deductible amount soan upper bound for the dynamic costs is 70percent of the regular deductible In most casesthe actual dynamic costs are much lower thanthis upper bound (the dynamic costs of say thefirst claim within a year are close to zero) Inaddition an individual can opt out of the con-tract and switch to a different insurance pro-vider This is likely to reduce his dynamic costsbecause in Israel unlike in the United Statesand many other countries there is no publicrecord of past claims Therefore insurance pro-viders can take full advantage of past recordsonly for their past customers For this reasonnew customers will of course face higher pre-miums than existing ones but the premium in-crease would not be as high as it would havebeen with the old insurance provider This isdue to the presence of ldquoinnocentrdquo new custom-ers who are pooled together with the switchers(Cohen 2003)47 Using 70 percent as a conser-vative upper bound Figure 3 shows that about93 percent of those claims filed by individualswith a low deductible would have also been46 This assumption is consistent with Cohen and Einav

(2003) who find no evidence for behavioral response tochanges in seat belt laws The following anecdotal obser-vation also supports this In an informal survey we con-ducted among our colleagues all of them were aware of adeductible in their auto insurance policy but fewer than 20percent knew its level This does not imply that 80 percentof our colleagues did not pay attention to their deductiblechoice at the time the choice was made It does implyhowever that their driving or care behavior cannot dependon the deductible level

47 New customers may voluntarily report their claimshistory to their new insurance provider Voluntary disclo-sure of past claims is as may be expected not truthful Ourdata suggest an unconditional claim rate of 02453 in oursample population Our data on claims history as voluntar-ily disclosed by the same individuals suggest a claim rate of00604 which is four times lower

774 THE AMERICAN ECONOMIC REVIEW JUNE 2007

filed with a regular deductible While this is notnegligible it applies for only a small fraction ofindividuals For the vast majority of them the99 percent discussed above is a more appropri-ate benchmark Therefore ex post moral hazardis unlikely to play a major role in this settingand one can abstract from the loss distributionand focus on claim rates as we do in thispaper48

Additional Cost of an AccidentmdashOur modelassumes that in the event of an accident theonly incurred costs are those associated with thedeductible payment In practice however other

transaction costs may be incurred such as thetime spent for appraisal of the damage the costsassociated with renting a replacement car for theduration of a repair etc Such costs could bereadily incorporated into the model To illus-trate we assume that these costs are known inadvance and are given by a constant c (whichcould in principle vary with each individual)Since c will not vary with the chosen level ofdeductible it will not affect the value d andwill enter the empirical model only through itseffect on d In particular equation (7) willchange to

(19) r

p

d 1

d c

and everything else will remain the same

48 We would not be as comfortable with this statementfor the choice of high and very high deductibles which areat much higher levels This is one additional reason to focusonly on the choice between low and regular deductibles

FIGURE 3 CLAIM DISTRIBUTIONS

Notes This figure plots kernel densities of the claim amounts estimated separately depending on the deductible choice Forease of comparison we normalize the claim amounts by the level of the regular deductible (ie the normalization is invariantto the deductible choice) and truncate the distribution at ten (the truncated part which includes a fat tail outside of the figureaccounts for about 25 percent of the distribution and is roughly similar for both deductible choices) The thick line presentsthe distribution of the claim amounts for individuals who chose a low deductible while the thin line does the same for thosewho chose a regular deductible Clearly both distributions are truncated from below at the deductible level The figure showsthat the distributions are fairly similar Assuming that the claim amount distribution is the same the area below the thickerline between 06 and 1 is the fraction of claims that would fall between the two deductible levels and therefore (absentdynamic incentives) would be filed only if a low deductible were chosen This area (between the two dotted vertical lines)amounts to 13 percent implying that the potential bias arising from restricting attention to claim rate (and abstracting fromthe claim distribution) is quite limited As we discuss in the text dynamic incentives due to experience rating may increasethe costs of filing a claim shifting the region in which the deductible choice matters to the right an upper bound to these costsis about 70 percent of the regular deductible covering an area (between the two dashed vertical lines) that integrates to morethan 7 percent Note however that these dynamic incentives are a very conservative upper bound they apply to less than 15percent of the individuals and do not account for the exit option which significantly reduces these dynamic costs

775VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

This implies that in principle such costs willhave no effect on the results of the counterfac-tual exercise we present later The costs willhowever affect the interpretation of the esti-mates of risk aversion In particular instead ofthe distribution of r we will now be estimatingthe distribution of r(d c)d so the reportedestimates of the coefficient of absolute riskaversion will be biased upward The magnitudeof the bias depends on the size of this transac-tion cost c compared to the average deductibled If c is relatively small the bias is negligibleIf however c is as big as the (average) deduct-ible level all our reported estimates of the levelof risk aversion should be divided by two (butthe coefficients on observables which are semi-elasticities will not change) The intuitionwould be similar but more involved if c variesacross individuals but not proportionally to d

Data about such transaction costs are ofcourse not available The following back-of-the-envelope exercise may provide some guidanceas to the magnitude of such costs We collecteddata from the Israeli police about the annual num-bers of accidents49 accidents with fatalities andaccidents with severe injuries in Israel for theyears 1996ndash1999 We then divided these numbersby our estimate of the total number of auto insur-ance claims in Israel (161859)50 We obtain that159 percent of the claims involve reported acci-dents 22 percent involve accidents with severeinjuries and 03 percent involve fatal accidentsThus the majority of claims reflect small unre-ported accidents perhaps suggesting that theseadditional costs of a claim are often not too large

Deviations from Expected Utility TheorymdashThroughout the paper we restrict attention toexpected utility maximizers Despite much ev-idence in the literature against some of the pre-dictions of expected utility theory it still seemsto us the most natural benchmark to specify andone that facilitates comparison to previous stud-ies We note that expected utility theory is as-sumed it is not and cannot be tested within our

framework Given our cross-sectional analysiswhich in principle allows flexible forms ofunobserved heterogeneity in risk preferencesthere are no testable restrictions imposed byexpected utility theory We should also note thatmuch (but not all) of the documented evidenceagainst expected utility theory arises with ex-treme risk probabilities which are close to zeroor one Our data (and our estimates) are basedon risk probabilities that are all in the range of010 to 035 Over this range expected utilityseems to perform better Finally it is importantto stress two points First at the conceptuallevel it is straightforward to use an alternativetheory of decisions under uncertainty If con-ditional on objective risk individuals vary in asingle dimension the same conceptual modeland empirical strategy can be applied All oneneeds to do is to specify the parameter overwhich decisions vary and construct an indiffer-ence set in the space of the specified parameterand (objective) risk types similar to the onepresented in Figure 2 Second any alternativemodel of decisions under uncertainty would re-quire us to take an even stronger view regardingthe parameterized objective function For exam-ple prospect theory (Daniel Kahneman andAmos Tversky 1979) would require us to pa-rameterize not only the curvature of individu-alsrsquo utility functions but also their referencepoints for which there is no natural choice inour context Similar issues arise if we try toapply decision weights (Tversky and PeterWakker 1995) or measures of overconfidencewith respect to driving ability

F Implications for Profits and Pricing

We now look at how firm profits vary withalternative pricing schemes This exercise isinteresting for several reasons First althoughwe do not use supply-side information for esti-mation it shows how one may incorporate suchinformation in estimation Second it illustratesthe conceptual trade-off faced by a monopolistthat operates in a market with adverse selectionAlthough the conceptual trade-off betweenhigher demand and worse selection is wellknown and has been extensively analyzed in thetheoretical literature quantifying it is importantto understand its empirical relevance Finallywe argued earlier that unobserved heterogeneityin risk aversion seems more important than un-

49 An accident is counted in this measure if it was re-ported to the police and a police officer arrived at the scene

50 This is estimated by taking the number of activepolicies at the end of our sample (45871) dividing it by ourbest guess for the share of the market the company had atthe time (7 percent) and multiplying it by the estimatedclaim rate in our data as computed in Table 2 (0245)

776 THE AMERICAN ECONOMIC REVIEW JUNE 2007

observed heterogeneity in risk The current ex-ercise shows that this conclusion also translatesto pricing and profits

Throughout this section we hold fixed thedistribution of risk and risk aversion the firmfaces Since we have little information about thedeterminants of overall demand faced by thefirm and hence how restrictive it is to hold thedistribution fixed we make the simplifying as-sumption that individuals make their choicessequentially They first choose the insuranceprovider by observing only the price of theregular deductible Once they decide to buy apolicy from the insurer they choose the deduct-ible level This seems a reasonable approxima-tion as the regular deductible is the one alwaysadvertised and initially quoted while the otheroptions are revealed only once the potentialcustomer and the insurance sales person ldquogetinto detailsrdquo Consistent with this assumptionwe assume that the regular premium and de-ductible are dictated by competitive conditionsand we focus on the choice of the low deduct-ible and its associated premium

From the companyrsquos standpoint each indi-vidual can be represented by a random draw of(i ri) from the conditional (on observables)distribution of risk and risk aversion

(20) ln i

ln ri Nxi

xi

2 r

r r2

When analyzing the optimal menu to offer suchan individual the company is assumed to berisk neutral and to maximize expected profitsSuppose the company offered only the regulardeductible and premium (dh ph) Let the ex-pected profits from this strategy be 0 Givenour assumptions we proceed by analyzing howthe firmrsquos profits are affected by a choice of anadditional low deductible option (dl pl) withdl dh and pl ph It is easy to use a changein variables and analyze the choice of d dh dl and p pl ph Expected profits arenow given by

(21) maxdp

0 Prri rii d pp

d Eiri rii d p

The trade-off in the companyrsquos decision isstraightforward Each new customer who chooses

the low combination pays an additional p up-front but saves d for each accident she is in-volved in This translates into two effects thatenter the companyrsquos decision problem The first issimilar to a standard pricing problem higher(lower) price difference (deductible difference)p (d) leads to a higher markup (on those indi-viduals who select the low deductible) but tolower quantity (or probability of purchase) asfewer individuals elect to choose the low deduct-ible This effect enters the profit function throughD(d p) Pr(ri ri(i d p)) The secondcomposition effect arises because of adverse se-lection As the price of the low deductible in-creases those individuals who still elect to choosethe low combination are ceteris paribus thosewith higher risk This effect enters through (dp) E(iri ri(i d p)) Its magnitude andsign depend on the relative heterogeneity of i andri and on the correlation between them Sinceneither D(d p) nor (d p) has a closed-form solution we analyze this decision problemgraphically where D(d p) and (d p) arenumerically computed using simulations from thejoint distribution

We illustrate our analysis by using the meanindividual in the data whose expected ln(i)and ln(ri) are 154 and 1181 respectively(based on Table 4) Such an individual has tochoose between a regular contract of ( phdh) (3189 1595) (in NIS) and a lowcontract of ( pl dl) (3381 957) ie (pd) (191 638) Below we discuss severaladditional counterfactual cases First we con-sider a case with a negative correlation be-tween risk and risk aversion (with the samemagnitude ie 084) Second we con-sider cases when the company ignores unob-served heterogeneity in one of the dimensionsie it views individuals as a draw from theestimated marginal distribution on one dimen-sion with the other dimension known and fixedat its estimated mean We do this exercise foreach dimension separately

To get intuition for the different effects Figure2 presents the estimated distribution in the spaceof (i ri) A small increase (decrease) in p (d)shifts the indifference setup to the right therebymaking some marginal individuals who were pre-viously just to the right of it switch to choosingthe regular deductible The demand trade-off isjust the comparison between the marginal loss ofthe company from all the marginal individuals

777VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

who no longer buy higher coverage with thehigher profits made from the inframarginal indi-viduals who still elect to choose higher coverageFigure 2 also helps in illustrating the effect ofadverse selection and the importance of the cor-relation coefficient As the menu shifts to the rightthe positive correlation implies that the marginalindividuals have higher risk than the average Thismeans that ldquolosingrdquo them (namely havingthem buy less coverage) is not as costly forthe insurance company as such individualsare on average more adversely selected Anegative correlation for example would havemade these marginal individuals more valuablethereby decreasing the incentive to increase pricesor deductibles from the current levels

Figure 4 presents the effects of pricing onprofits by varying the low-deductible levelkeeping the premium charged for it fixed at theobserved price (of p 191) It implies that the

current low deductible benefit of 638 NIS re-sults in additional annual profits of about 368NIS per customer This is about 037 percent oftotal operating profits per customer which isabout 1000 NIS Note however that after sub-tracting the administrative and claim-handlingcosts associated with each customer and claimthe relative magnitude of this effect will behigher Note also that the estimates imply thatthe current low-deductible level is suboptimalBy setting a smaller low-deductible benefit ofd 355 NIS (ie increasing the current lowdeductible by 283 NIS) additional profits canbe increased to 659 NIS51 Of course the find-

51 There is no reason of course to limit the choice of thecompany to only one additional deductible level Moredegrees of freedom in choosing the pricing menu will leadto higher profits

FIGURE 4 COUNTERFACTUALSmdashPROFITS

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) We plot the additional profits fromoffering a low deductible as a function of the attractiveness of a low deductible d dh dl fixing its price at the observedlevel The thick solid line presents the counterfactual profits as implied by the estimated benchmark model The other threecurves illustrate how the profits change in response to changes in the assumptions when the correlation between riskand risk aversion is negative (thin solid line) when there is no heterogeneity in risk aversion (dot-dashed line) and whenthere is no heterogeneity in risk (dashed line) The maxima (argmax) of the four curves respectively are 659 (355)714 (583) 0 and 704 (500) The dotted vertical line represents the observed level of d (638) which implies that theadditional profits from the observed low deductible are 368 NIS per policy

778 THE AMERICAN ECONOMIC REVIEW JUNE 2007

ing that the current pricing is suboptimal mayalso be due to a limitation of the model

Consistent with the intuition discussed aboveFigure 4 also shows that the incentive to in-crease prices (or lower the deductible) is higherwith positive correlation When the correlationbetween risk and risk aversion is negative theoptimal low deductible is lower It also showsthat ignoring either dimension of unobservedheterogeneity has an important effect on pric-ing decisions However while ignoring het-erogeneity in risk does not change thequalitative pattern by much ignoring hetero-geneity in risk aversion completely changesthe pricing analysis In fact given the esti-mated coefficients and the observed prices ifindividuals vary only in risk offering a lowdeductible does not increase profits due to ad-verse selection

Figure 5 breaks down the effect on profits bypresenting the pricing effect on the demand forlow deductible D(d p) and on the compo-sition effect (d p) The former is simplygenerated by the distribution of certainty equiv-alents implied by the joint distribution of i andri (Landsberger and Meilijson 1999) It is S-shaped due to the assumption of lognormal dis-tribution The shape of the composition effect isdriven by the relative variance of i and ri andby the correlation coefficient As the estimatesimply that most of the variation in certaintyequivalents is driven by variation in ri thestrong positive correlation implies that the com-position effect is monotonically decreasing inthe deductible level As the low-deductible op-tion becomes more favorable more peoplechoose it with the most risky individuals choos-ing it first The effect of the deductible level onthe composition effect is dramatically differentwhen the correlation between risk and risk aver-sion is negative When this is the case theobserved relationship between the deductiblelevel and the composition effect is mostly re-versed This is because the effect of risk aver-sion dominates that of adverse selection due toits higher variance

IV Concluding Remarks

The paper makes two separate contributionsFirst from a methodological standpoint we layout a conceptual framework through which one

can formulate and estimate a demand system forindividually customized contracts The key datarequirements for this approach are contractchoices individual choice sets and ex post riskrealizations Since such data may be available inmany other contexts the methodological frame-work may be useful to uncover structural pa-rameters in such settings As an example onecould consider annuity data and use guaranteeperiod choices and mortality data to identifyheterogeneity in risk (mortality) and in prefer-ences for wealth after death (Einav Finkelsteinand Paul Schrimpf 2006) Similarly one couldconsider loan data and use down-paymentchoices and default data to identify heterogene-ity in risk (default) and in liquidity (Will Ad-ams Einav and Jonathan D Levin 2006)Second from an economic standpoint we pro-vide a new set of estimates for the degree andheterogeneity of (absolute) risk aversion and itsrelationship with risk We discuss these below

While our estimates of risk aversion help topredict other related insurance decisions it isnatural to ask to what extent these parametersare relevant in other contexts This is essentiallyan empirical question which can be answeredonly by estimating risk aversion parameters fora variety of bet sizes and in a variety of con-texts Since isolating risk preferences in manycontexts is hard such exercises are rare leavingus with no definite answer for the scope ofmarkets for which our estimates may be rele-vant On one hand Rabin (2000) and Rabin andThaler (2001) argue that different decisions inlife are taken in different contexts and thereforemay be subject to different parameters in theutility function On the other hand classicaltheory suggests that each individual has onevalue function over her lifetime wealth so allrisky decisions take into account the same valuefunction and are therefore subject to the samerisk preferences Our view is somewhere inbetween We are more comfortable with extrap-olation of our risk aversion estimates to setupsthat are closer to the auto insurance marketcontext in which these parameters are esti-mated To assess ldquoclosenessrdquo it is important toconsider various factors over which contextsmay differ Such factors may include bet size aswell as informational and behavioral effectssuch as default options framing and rarity of theevents Since we use claim data to identify risktype we leave everything else to be interpreted as

779VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

FIGURE 5 COUNTERFACTUALSmdashSELECTION

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) Parallel to Figure 4 we breakdown the effects on profits to the share of consumers who choose a low deductible (bottom panel) and to the expected riskof this group (top panel) This is presented by the thick solid line for the estimates of the benchmark model As in Figure 4 wealso present these effects for three additional cases when the correlation between risk and risk aversion is negative (thin solidline) when there is no heterogeneity in risk aversion (dot-dashed line) and when there is no heterogeneity in risk (dashedline) The dotted vertical lines represent the observed level of d (638) for which the share of low deductible is 16 percentand their expected risk is 0264 This may be compared with the corresponding figures in Table 2A of 178 and 0309respectively Note however that the figure presents estimated quantities for the average individual in the data while Table2A presents the average quantities in the data so one should not expect the numbers to fit perfectly

780 THE AMERICAN ECONOMIC REVIEW JUNE 2007

risk aversion As an example overconfidence willbe captured by a lower level of estimated riskaversion so if overconfidence is more importantin auto insurance than in health insurance whenextrapolated to health insurance individuals maybehave as if they are more risk averse than weestimate them to be

With this caveat in mind let us discuss ourfour main findings First we find large hetero-geneity in risk preferences across individualsThis heterogeneity is important in various con-texts (a) it cautions against using pure cross-sectional variation to test expected utilitytheory (b) it may create strong selection ofparticipants into particular markets this maymake participants in voluntary markets quitedifferent in their risk preferences from those inmarkets in which participation is either manda-tory or driven by other factors and (c) it maymake the marginal individual quite differentfrom the average one Models in macroeconom-ics and finance which often use a representativeindividual framework may not be able to cap-ture and account for such differences

Our second set of findings concerns the wayrisk aversion relates to observable characteris-tics Our finding that women are more riskaverse than men has been documented in othersettings The finding that risk preferences ex-hibit a U-shaped pattern over the life cycle maybe interesting to explore further in other con-texts Other findings suggest that the estimatedcoefficient of risk aversion increases with ob-servables that are related to income and wealthAs we agree with the widely held belief of thedecreasing absolute risk aversion property ourpreferred interpretation for this finding is thatwealth and income may be endogenous gener-ating the estimated cross-sectional relationshipWhile lower risk aversion may be associatedwith higher propensity to become an entrepre-neur and thereby have higher wealth it mayalso be associated with lower propensity to saveor invest in education affecting wealth the otherway Therefore one important message of thisfinding is that accounting for heterogeneity inpreferences may be important as representativeconsumer models may provide misleading in-terpretations for otherwise natural results

The last two sets of findings concern therelationship between risk and preferences Sincerisk is particular to the context in which it ismeasured these findings may be sensitive to the

market context and may change once risk takesother forms Even within the auto insurancemarket it is important how risk is measuredMoreover risk in the auto insurance marketmay be conceptually different from risk in othermarkets In many markets risk is independentacross individuals In auto insurance howevermuch of the risk depends on coordinationamong drivers and therefore may be more re-lated to relative not absolute characteristicsFor this reason our finding of positive correla-tion between risk and risk aversion can coexistwith findings of negative correlation in othercontexts (Israel 2005 Finkelstein and McGarry2006) The positive correlation we find is alsoconsistent with the fact that the bivariate probittest in our data provides evidence for adverseselection (Cohen 2005) while similar reduced-form tests in other contexts do not

Finally we find that unobserved heterogene-ity in risk preferences is more important thanheterogeneity in risk This may be driven by thecasual evidence that insurance companies exertmuch effort and resources in collecting con-sumer data which are informative about riskclassification but not about preferences52 Weillustrate the empirical importance of our find-ings for the analysis of optimal contracts in autoinsurance The presence of more than one di-mension of unobserved heterogeneity may dra-matically change the nature of these contractsTheory is still not fully developed for suchmultidimensional screening problems as it typ-ically requires a small number of types (Lands-berger and Meilijson 1999) restricts the twodimensions to be independent of each other(Jean-Charles Rochet and Lars A Stole 2002)or assumes that the number of instrumentsavailable to the monopolist is not smaller thanthe dimension of unobserved heterogeneity(Steven Matthews and John Moore 1987 Rich-ard Arnott and Joseph E Stiglitz 1988)53 MarkArmstrong (1999) may be the closest theoreticalwork to the framework suggested here It cannot

52 This choice of data collection efforts may be justifiedif it is easier for such firms to price discriminate based onrisk but harder to price discriminate based on preferencesA common belief is that without cost-based (ie risk-based) justification for prices price discrimination may leadto consumer backlash

53 See also Smart (2000) Villeneuve (2003) and Jullienet al (2007) for related theoretical results

781VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

be directly applied however as it uses simpli-fying linearity assumptions which would behard to impose in the current context Our re-sults indicate that many applications can benefitfrom extending the theory to include the more

general case such as the one analyzed hereSuch a theory may also serve as a guide forusing supply-side restrictions in similar con-texts Our counterfactual analysis is a very pre-liminary start in this direction

APPENDIX A DESCRIPTION OF THE GIBBS SAMPLER

In this appendix we describe the setup of the Gibbs sampler that we use to estimate the modelOne of the main advantages of the Gibbs sampler is its ability to allow for data augmentation oflatent variables (Tanner and Wong 1987) In our context this amounts to augmenting the individual-specific risk aversion and risk type namely i rii1

n as additional parametersWe can write the model as follows

(22) ln i xi i

(23) ln ri xi vi

(24) choicei 1 if ri ri ()0 if ri ri ()

(25) claimsi Poissoni ti

(26) i

vi

iid

N00

2 r

r r2

Let

2 r

r r2 X

x1

xn

0

0 x1

xn

y r and ui i

vi

The set of parameters for which we want to have a posterior distribution is given by uii1

n The prior specifies that are independent of uii1n have a conventional

diffuse prior We adopt a hierarchical prior for uii1N

(27) uii 1n

iid

N0

(28) 1 Wishart2a Q

782 THE AMERICAN ECONOMIC REVIEW JUNE 2007

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 30: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

filing is profitable only under a low deductible(the literature often refers to this second effectas ldquoex post moral hazardrdquo)

To the extent that moral hazard exists abstract-ing from it will likely bias our estimates of riskaversion downward To see this note that adjust-ing behavior will help individuals to self-insureagainst uninsured costs Similarly ex post moralhazard would reduce the value of a low deduct-ible as in the event of a claim the gain from alower deductible would sometimes be less thanthe difference between the two deductible levelsBoth these effects will make a low deductible lessattractive requiring individuals to be even morerisk averse than we estimate in order to purchasemore coverage Below we discuss why in ourview abstracting from moral hazard is a reason-able approximation in this setting

It seems reasonable to conjecture that ceterisparibus insured individuals will drive less care-fully than uninsured ones It may also seemreasonable that the existence of a deductiblemay make individuals more careful about smalldamages to their car However when all choicesinclude a deductible and deductibles are similarin their magnitude it seems less likely thatdriving or care behavior will be affected46 Fi-nally to separately identify moral hazard onewould need another dimension of the data overwhich risk types remain fixed but coveragechoices vary exogenously (Israel 2004 Jaap HAbbring et al forthcoming)

We rely on the data to justify why we abstractfrom the second potential effect that of ex postmoral hazard Figure 3 presents data on theclaim amounts and shows that about 99 percentof the claims filed by policyholders with low-deductible policies were for amounts greaterthan the higher deductible level If the distribu-tion of amounts of potential claims does notvary with the deductible choice and if individ-uals file a claim for any loss that exceeds their

deductible level this suggests that 99 percent ofthe claims would have been filed under eitherdeductible choice making the assumption toabstract from moral hazard not very restrictive

Individuals however may choose not to filea claim even when the claim amount exceedsthe deductible level This may happen due toexperience rating which increases future insur-ance premiums These dynamic effects do notdepend on the deductible level at the time of theclaim so they simply enter in an additive wayUsing our data on individuals who renew theirpolicies with the company we can assess howbig the dynamic effects are These data showthat the price effect of a claim lasts for threeyears and is highest when an individual files hissecond claim within a year In such a case hewould face about a 20 percent increase in hisinsurance premium in the subsequent year 10percent in the year after and 5 percent in thethird year after the claim The regular premiumis about twice the regular deductible amount soan upper bound for the dynamic costs is 70percent of the regular deductible In most casesthe actual dynamic costs are much lower thanthis upper bound (the dynamic costs of say thefirst claim within a year are close to zero) Inaddition an individual can opt out of the con-tract and switch to a different insurance pro-vider This is likely to reduce his dynamic costsbecause in Israel unlike in the United Statesand many other countries there is no publicrecord of past claims Therefore insurance pro-viders can take full advantage of past recordsonly for their past customers For this reasonnew customers will of course face higher pre-miums than existing ones but the premium in-crease would not be as high as it would havebeen with the old insurance provider This isdue to the presence of ldquoinnocentrdquo new custom-ers who are pooled together with the switchers(Cohen 2003)47 Using 70 percent as a conser-vative upper bound Figure 3 shows that about93 percent of those claims filed by individualswith a low deductible would have also been46 This assumption is consistent with Cohen and Einav

(2003) who find no evidence for behavioral response tochanges in seat belt laws The following anecdotal obser-vation also supports this In an informal survey we con-ducted among our colleagues all of them were aware of adeductible in their auto insurance policy but fewer than 20percent knew its level This does not imply that 80 percentof our colleagues did not pay attention to their deductiblechoice at the time the choice was made It does implyhowever that their driving or care behavior cannot dependon the deductible level

47 New customers may voluntarily report their claimshistory to their new insurance provider Voluntary disclo-sure of past claims is as may be expected not truthful Ourdata suggest an unconditional claim rate of 02453 in oursample population Our data on claims history as voluntar-ily disclosed by the same individuals suggest a claim rate of00604 which is four times lower

774 THE AMERICAN ECONOMIC REVIEW JUNE 2007

filed with a regular deductible While this is notnegligible it applies for only a small fraction ofindividuals For the vast majority of them the99 percent discussed above is a more appropri-ate benchmark Therefore ex post moral hazardis unlikely to play a major role in this settingand one can abstract from the loss distributionand focus on claim rates as we do in thispaper48

Additional Cost of an AccidentmdashOur modelassumes that in the event of an accident theonly incurred costs are those associated with thedeductible payment In practice however other

transaction costs may be incurred such as thetime spent for appraisal of the damage the costsassociated with renting a replacement car for theduration of a repair etc Such costs could bereadily incorporated into the model To illus-trate we assume that these costs are known inadvance and are given by a constant c (whichcould in principle vary with each individual)Since c will not vary with the chosen level ofdeductible it will not affect the value d andwill enter the empirical model only through itseffect on d In particular equation (7) willchange to

(19) r

p

d 1

d c

and everything else will remain the same

48 We would not be as comfortable with this statementfor the choice of high and very high deductibles which areat much higher levels This is one additional reason to focusonly on the choice between low and regular deductibles

FIGURE 3 CLAIM DISTRIBUTIONS

Notes This figure plots kernel densities of the claim amounts estimated separately depending on the deductible choice Forease of comparison we normalize the claim amounts by the level of the regular deductible (ie the normalization is invariantto the deductible choice) and truncate the distribution at ten (the truncated part which includes a fat tail outside of the figureaccounts for about 25 percent of the distribution and is roughly similar for both deductible choices) The thick line presentsthe distribution of the claim amounts for individuals who chose a low deductible while the thin line does the same for thosewho chose a regular deductible Clearly both distributions are truncated from below at the deductible level The figure showsthat the distributions are fairly similar Assuming that the claim amount distribution is the same the area below the thickerline between 06 and 1 is the fraction of claims that would fall between the two deductible levels and therefore (absentdynamic incentives) would be filed only if a low deductible were chosen This area (between the two dotted vertical lines)amounts to 13 percent implying that the potential bias arising from restricting attention to claim rate (and abstracting fromthe claim distribution) is quite limited As we discuss in the text dynamic incentives due to experience rating may increasethe costs of filing a claim shifting the region in which the deductible choice matters to the right an upper bound to these costsis about 70 percent of the regular deductible covering an area (between the two dashed vertical lines) that integrates to morethan 7 percent Note however that these dynamic incentives are a very conservative upper bound they apply to less than 15percent of the individuals and do not account for the exit option which significantly reduces these dynamic costs

775VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

This implies that in principle such costs willhave no effect on the results of the counterfac-tual exercise we present later The costs willhowever affect the interpretation of the esti-mates of risk aversion In particular instead ofthe distribution of r we will now be estimatingthe distribution of r(d c)d so the reportedestimates of the coefficient of absolute riskaversion will be biased upward The magnitudeof the bias depends on the size of this transac-tion cost c compared to the average deductibled If c is relatively small the bias is negligibleIf however c is as big as the (average) deduct-ible level all our reported estimates of the levelof risk aversion should be divided by two (butthe coefficients on observables which are semi-elasticities will not change) The intuitionwould be similar but more involved if c variesacross individuals but not proportionally to d

Data about such transaction costs are ofcourse not available The following back-of-the-envelope exercise may provide some guidanceas to the magnitude of such costs We collecteddata from the Israeli police about the annual num-bers of accidents49 accidents with fatalities andaccidents with severe injuries in Israel for theyears 1996ndash1999 We then divided these numbersby our estimate of the total number of auto insur-ance claims in Israel (161859)50 We obtain that159 percent of the claims involve reported acci-dents 22 percent involve accidents with severeinjuries and 03 percent involve fatal accidentsThus the majority of claims reflect small unre-ported accidents perhaps suggesting that theseadditional costs of a claim are often not too large

Deviations from Expected Utility TheorymdashThroughout the paper we restrict attention toexpected utility maximizers Despite much ev-idence in the literature against some of the pre-dictions of expected utility theory it still seemsto us the most natural benchmark to specify andone that facilitates comparison to previous stud-ies We note that expected utility theory is as-sumed it is not and cannot be tested within our

framework Given our cross-sectional analysiswhich in principle allows flexible forms ofunobserved heterogeneity in risk preferencesthere are no testable restrictions imposed byexpected utility theory We should also note thatmuch (but not all) of the documented evidenceagainst expected utility theory arises with ex-treme risk probabilities which are close to zeroor one Our data (and our estimates) are basedon risk probabilities that are all in the range of010 to 035 Over this range expected utilityseems to perform better Finally it is importantto stress two points First at the conceptuallevel it is straightforward to use an alternativetheory of decisions under uncertainty If con-ditional on objective risk individuals vary in asingle dimension the same conceptual modeland empirical strategy can be applied All oneneeds to do is to specify the parameter overwhich decisions vary and construct an indiffer-ence set in the space of the specified parameterand (objective) risk types similar to the onepresented in Figure 2 Second any alternativemodel of decisions under uncertainty would re-quire us to take an even stronger view regardingthe parameterized objective function For exam-ple prospect theory (Daniel Kahneman andAmos Tversky 1979) would require us to pa-rameterize not only the curvature of individu-alsrsquo utility functions but also their referencepoints for which there is no natural choice inour context Similar issues arise if we try toapply decision weights (Tversky and PeterWakker 1995) or measures of overconfidencewith respect to driving ability

F Implications for Profits and Pricing

We now look at how firm profits vary withalternative pricing schemes This exercise isinteresting for several reasons First althoughwe do not use supply-side information for esti-mation it shows how one may incorporate suchinformation in estimation Second it illustratesthe conceptual trade-off faced by a monopolistthat operates in a market with adverse selectionAlthough the conceptual trade-off betweenhigher demand and worse selection is wellknown and has been extensively analyzed in thetheoretical literature quantifying it is importantto understand its empirical relevance Finallywe argued earlier that unobserved heterogeneityin risk aversion seems more important than un-

49 An accident is counted in this measure if it was re-ported to the police and a police officer arrived at the scene

50 This is estimated by taking the number of activepolicies at the end of our sample (45871) dividing it by ourbest guess for the share of the market the company had atthe time (7 percent) and multiplying it by the estimatedclaim rate in our data as computed in Table 2 (0245)

776 THE AMERICAN ECONOMIC REVIEW JUNE 2007

observed heterogeneity in risk The current ex-ercise shows that this conclusion also translatesto pricing and profits

Throughout this section we hold fixed thedistribution of risk and risk aversion the firmfaces Since we have little information about thedeterminants of overall demand faced by thefirm and hence how restrictive it is to hold thedistribution fixed we make the simplifying as-sumption that individuals make their choicessequentially They first choose the insuranceprovider by observing only the price of theregular deductible Once they decide to buy apolicy from the insurer they choose the deduct-ible level This seems a reasonable approxima-tion as the regular deductible is the one alwaysadvertised and initially quoted while the otheroptions are revealed only once the potentialcustomer and the insurance sales person ldquogetinto detailsrdquo Consistent with this assumptionwe assume that the regular premium and de-ductible are dictated by competitive conditionsand we focus on the choice of the low deduct-ible and its associated premium

From the companyrsquos standpoint each indi-vidual can be represented by a random draw of(i ri) from the conditional (on observables)distribution of risk and risk aversion

(20) ln i

ln ri Nxi

xi

2 r

r r2

When analyzing the optimal menu to offer suchan individual the company is assumed to berisk neutral and to maximize expected profitsSuppose the company offered only the regulardeductible and premium (dh ph) Let the ex-pected profits from this strategy be 0 Givenour assumptions we proceed by analyzing howthe firmrsquos profits are affected by a choice of anadditional low deductible option (dl pl) withdl dh and pl ph It is easy to use a changein variables and analyze the choice of d dh dl and p pl ph Expected profits arenow given by

(21) maxdp

0 Prri rii d pp

d Eiri rii d p

The trade-off in the companyrsquos decision isstraightforward Each new customer who chooses

the low combination pays an additional p up-front but saves d for each accident she is in-volved in This translates into two effects thatenter the companyrsquos decision problem The first issimilar to a standard pricing problem higher(lower) price difference (deductible difference)p (d) leads to a higher markup (on those indi-viduals who select the low deductible) but tolower quantity (or probability of purchase) asfewer individuals elect to choose the low deduct-ible This effect enters the profit function throughD(d p) Pr(ri ri(i d p)) The secondcomposition effect arises because of adverse se-lection As the price of the low deductible in-creases those individuals who still elect to choosethe low combination are ceteris paribus thosewith higher risk This effect enters through (dp) E(iri ri(i d p)) Its magnitude andsign depend on the relative heterogeneity of i andri and on the correlation between them Sinceneither D(d p) nor (d p) has a closed-form solution we analyze this decision problemgraphically where D(d p) and (d p) arenumerically computed using simulations from thejoint distribution

We illustrate our analysis by using the meanindividual in the data whose expected ln(i)and ln(ri) are 154 and 1181 respectively(based on Table 4) Such an individual has tochoose between a regular contract of ( phdh) (3189 1595) (in NIS) and a lowcontract of ( pl dl) (3381 957) ie (pd) (191 638) Below we discuss severaladditional counterfactual cases First we con-sider a case with a negative correlation be-tween risk and risk aversion (with the samemagnitude ie 084) Second we con-sider cases when the company ignores unob-served heterogeneity in one of the dimensionsie it views individuals as a draw from theestimated marginal distribution on one dimen-sion with the other dimension known and fixedat its estimated mean We do this exercise foreach dimension separately

To get intuition for the different effects Figure2 presents the estimated distribution in the spaceof (i ri) A small increase (decrease) in p (d)shifts the indifference setup to the right therebymaking some marginal individuals who were pre-viously just to the right of it switch to choosingthe regular deductible The demand trade-off isjust the comparison between the marginal loss ofthe company from all the marginal individuals

777VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

who no longer buy higher coverage with thehigher profits made from the inframarginal indi-viduals who still elect to choose higher coverageFigure 2 also helps in illustrating the effect ofadverse selection and the importance of the cor-relation coefficient As the menu shifts to the rightthe positive correlation implies that the marginalindividuals have higher risk than the average Thismeans that ldquolosingrdquo them (namely havingthem buy less coverage) is not as costly forthe insurance company as such individualsare on average more adversely selected Anegative correlation for example would havemade these marginal individuals more valuablethereby decreasing the incentive to increase pricesor deductibles from the current levels

Figure 4 presents the effects of pricing onprofits by varying the low-deductible levelkeeping the premium charged for it fixed at theobserved price (of p 191) It implies that the

current low deductible benefit of 638 NIS re-sults in additional annual profits of about 368NIS per customer This is about 037 percent oftotal operating profits per customer which isabout 1000 NIS Note however that after sub-tracting the administrative and claim-handlingcosts associated with each customer and claimthe relative magnitude of this effect will behigher Note also that the estimates imply thatthe current low-deductible level is suboptimalBy setting a smaller low-deductible benefit ofd 355 NIS (ie increasing the current lowdeductible by 283 NIS) additional profits canbe increased to 659 NIS51 Of course the find-

51 There is no reason of course to limit the choice of thecompany to only one additional deductible level Moredegrees of freedom in choosing the pricing menu will leadto higher profits

FIGURE 4 COUNTERFACTUALSmdashPROFITS

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) We plot the additional profits fromoffering a low deductible as a function of the attractiveness of a low deductible d dh dl fixing its price at the observedlevel The thick solid line presents the counterfactual profits as implied by the estimated benchmark model The other threecurves illustrate how the profits change in response to changes in the assumptions when the correlation between riskand risk aversion is negative (thin solid line) when there is no heterogeneity in risk aversion (dot-dashed line) and whenthere is no heterogeneity in risk (dashed line) The maxima (argmax) of the four curves respectively are 659 (355)714 (583) 0 and 704 (500) The dotted vertical line represents the observed level of d (638) which implies that theadditional profits from the observed low deductible are 368 NIS per policy

778 THE AMERICAN ECONOMIC REVIEW JUNE 2007

ing that the current pricing is suboptimal mayalso be due to a limitation of the model

Consistent with the intuition discussed aboveFigure 4 also shows that the incentive to in-crease prices (or lower the deductible) is higherwith positive correlation When the correlationbetween risk and risk aversion is negative theoptimal low deductible is lower It also showsthat ignoring either dimension of unobservedheterogeneity has an important effect on pric-ing decisions However while ignoring het-erogeneity in risk does not change thequalitative pattern by much ignoring hetero-geneity in risk aversion completely changesthe pricing analysis In fact given the esti-mated coefficients and the observed prices ifindividuals vary only in risk offering a lowdeductible does not increase profits due to ad-verse selection

Figure 5 breaks down the effect on profits bypresenting the pricing effect on the demand forlow deductible D(d p) and on the compo-sition effect (d p) The former is simplygenerated by the distribution of certainty equiv-alents implied by the joint distribution of i andri (Landsberger and Meilijson 1999) It is S-shaped due to the assumption of lognormal dis-tribution The shape of the composition effect isdriven by the relative variance of i and ri andby the correlation coefficient As the estimatesimply that most of the variation in certaintyequivalents is driven by variation in ri thestrong positive correlation implies that the com-position effect is monotonically decreasing inthe deductible level As the low-deductible op-tion becomes more favorable more peoplechoose it with the most risky individuals choos-ing it first The effect of the deductible level onthe composition effect is dramatically differentwhen the correlation between risk and risk aver-sion is negative When this is the case theobserved relationship between the deductiblelevel and the composition effect is mostly re-versed This is because the effect of risk aver-sion dominates that of adverse selection due toits higher variance

IV Concluding Remarks

The paper makes two separate contributionsFirst from a methodological standpoint we layout a conceptual framework through which one

can formulate and estimate a demand system forindividually customized contracts The key datarequirements for this approach are contractchoices individual choice sets and ex post riskrealizations Since such data may be available inmany other contexts the methodological frame-work may be useful to uncover structural pa-rameters in such settings As an example onecould consider annuity data and use guaranteeperiod choices and mortality data to identifyheterogeneity in risk (mortality) and in prefer-ences for wealth after death (Einav Finkelsteinand Paul Schrimpf 2006) Similarly one couldconsider loan data and use down-paymentchoices and default data to identify heterogene-ity in risk (default) and in liquidity (Will Ad-ams Einav and Jonathan D Levin 2006)Second from an economic standpoint we pro-vide a new set of estimates for the degree andheterogeneity of (absolute) risk aversion and itsrelationship with risk We discuss these below

While our estimates of risk aversion help topredict other related insurance decisions it isnatural to ask to what extent these parametersare relevant in other contexts This is essentiallyan empirical question which can be answeredonly by estimating risk aversion parameters fora variety of bet sizes and in a variety of con-texts Since isolating risk preferences in manycontexts is hard such exercises are rare leavingus with no definite answer for the scope ofmarkets for which our estimates may be rele-vant On one hand Rabin (2000) and Rabin andThaler (2001) argue that different decisions inlife are taken in different contexts and thereforemay be subject to different parameters in theutility function On the other hand classicaltheory suggests that each individual has onevalue function over her lifetime wealth so allrisky decisions take into account the same valuefunction and are therefore subject to the samerisk preferences Our view is somewhere inbetween We are more comfortable with extrap-olation of our risk aversion estimates to setupsthat are closer to the auto insurance marketcontext in which these parameters are esti-mated To assess ldquoclosenessrdquo it is important toconsider various factors over which contextsmay differ Such factors may include bet size aswell as informational and behavioral effectssuch as default options framing and rarity of theevents Since we use claim data to identify risktype we leave everything else to be interpreted as

779VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

FIGURE 5 COUNTERFACTUALSmdashSELECTION

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) Parallel to Figure 4 we breakdown the effects on profits to the share of consumers who choose a low deductible (bottom panel) and to the expected riskof this group (top panel) This is presented by the thick solid line for the estimates of the benchmark model As in Figure 4 wealso present these effects for three additional cases when the correlation between risk and risk aversion is negative (thin solidline) when there is no heterogeneity in risk aversion (dot-dashed line) and when there is no heterogeneity in risk (dashedline) The dotted vertical lines represent the observed level of d (638) for which the share of low deductible is 16 percentand their expected risk is 0264 This may be compared with the corresponding figures in Table 2A of 178 and 0309respectively Note however that the figure presents estimated quantities for the average individual in the data while Table2A presents the average quantities in the data so one should not expect the numbers to fit perfectly

780 THE AMERICAN ECONOMIC REVIEW JUNE 2007

risk aversion As an example overconfidence willbe captured by a lower level of estimated riskaversion so if overconfidence is more importantin auto insurance than in health insurance whenextrapolated to health insurance individuals maybehave as if they are more risk averse than weestimate them to be

With this caveat in mind let us discuss ourfour main findings First we find large hetero-geneity in risk preferences across individualsThis heterogeneity is important in various con-texts (a) it cautions against using pure cross-sectional variation to test expected utilitytheory (b) it may create strong selection ofparticipants into particular markets this maymake participants in voluntary markets quitedifferent in their risk preferences from those inmarkets in which participation is either manda-tory or driven by other factors and (c) it maymake the marginal individual quite differentfrom the average one Models in macroeconom-ics and finance which often use a representativeindividual framework may not be able to cap-ture and account for such differences

Our second set of findings concerns the wayrisk aversion relates to observable characteris-tics Our finding that women are more riskaverse than men has been documented in othersettings The finding that risk preferences ex-hibit a U-shaped pattern over the life cycle maybe interesting to explore further in other con-texts Other findings suggest that the estimatedcoefficient of risk aversion increases with ob-servables that are related to income and wealthAs we agree with the widely held belief of thedecreasing absolute risk aversion property ourpreferred interpretation for this finding is thatwealth and income may be endogenous gener-ating the estimated cross-sectional relationshipWhile lower risk aversion may be associatedwith higher propensity to become an entrepre-neur and thereby have higher wealth it mayalso be associated with lower propensity to saveor invest in education affecting wealth the otherway Therefore one important message of thisfinding is that accounting for heterogeneity inpreferences may be important as representativeconsumer models may provide misleading in-terpretations for otherwise natural results

The last two sets of findings concern therelationship between risk and preferences Sincerisk is particular to the context in which it ismeasured these findings may be sensitive to the

market context and may change once risk takesother forms Even within the auto insurancemarket it is important how risk is measuredMoreover risk in the auto insurance marketmay be conceptually different from risk in othermarkets In many markets risk is independentacross individuals In auto insurance howevermuch of the risk depends on coordinationamong drivers and therefore may be more re-lated to relative not absolute characteristicsFor this reason our finding of positive correla-tion between risk and risk aversion can coexistwith findings of negative correlation in othercontexts (Israel 2005 Finkelstein and McGarry2006) The positive correlation we find is alsoconsistent with the fact that the bivariate probittest in our data provides evidence for adverseselection (Cohen 2005) while similar reduced-form tests in other contexts do not

Finally we find that unobserved heterogene-ity in risk preferences is more important thanheterogeneity in risk This may be driven by thecasual evidence that insurance companies exertmuch effort and resources in collecting con-sumer data which are informative about riskclassification but not about preferences52 Weillustrate the empirical importance of our find-ings for the analysis of optimal contracts in autoinsurance The presence of more than one di-mension of unobserved heterogeneity may dra-matically change the nature of these contractsTheory is still not fully developed for suchmultidimensional screening problems as it typ-ically requires a small number of types (Lands-berger and Meilijson 1999) restricts the twodimensions to be independent of each other(Jean-Charles Rochet and Lars A Stole 2002)or assumes that the number of instrumentsavailable to the monopolist is not smaller thanthe dimension of unobserved heterogeneity(Steven Matthews and John Moore 1987 Rich-ard Arnott and Joseph E Stiglitz 1988)53 MarkArmstrong (1999) may be the closest theoreticalwork to the framework suggested here It cannot

52 This choice of data collection efforts may be justifiedif it is easier for such firms to price discriminate based onrisk but harder to price discriminate based on preferencesA common belief is that without cost-based (ie risk-based) justification for prices price discrimination may leadto consumer backlash

53 See also Smart (2000) Villeneuve (2003) and Jullienet al (2007) for related theoretical results

781VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

be directly applied however as it uses simpli-fying linearity assumptions which would behard to impose in the current context Our re-sults indicate that many applications can benefitfrom extending the theory to include the more

general case such as the one analyzed hereSuch a theory may also serve as a guide forusing supply-side restrictions in similar con-texts Our counterfactual analysis is a very pre-liminary start in this direction

APPENDIX A DESCRIPTION OF THE GIBBS SAMPLER

In this appendix we describe the setup of the Gibbs sampler that we use to estimate the modelOne of the main advantages of the Gibbs sampler is its ability to allow for data augmentation oflatent variables (Tanner and Wong 1987) In our context this amounts to augmenting the individual-specific risk aversion and risk type namely i rii1

n as additional parametersWe can write the model as follows

(22) ln i xi i

(23) ln ri xi vi

(24) choicei 1 if ri ri ()0 if ri ri ()

(25) claimsi Poissoni ti

(26) i

vi

iid

N00

2 r

r r2

Let

2 r

r r2 X

x1

xn

0

0 x1

xn

y r and ui i

vi

The set of parameters for which we want to have a posterior distribution is given by uii1

n The prior specifies that are independent of uii1n have a conventional

diffuse prior We adopt a hierarchical prior for uii1N

(27) uii 1n

iid

N0

(28) 1 Wishart2a Q

782 THE AMERICAN ECONOMIC REVIEW JUNE 2007

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 31: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

filed with a regular deductible While this is notnegligible it applies for only a small fraction ofindividuals For the vast majority of them the99 percent discussed above is a more appropri-ate benchmark Therefore ex post moral hazardis unlikely to play a major role in this settingand one can abstract from the loss distributionand focus on claim rates as we do in thispaper48

Additional Cost of an AccidentmdashOur modelassumes that in the event of an accident theonly incurred costs are those associated with thedeductible payment In practice however other

transaction costs may be incurred such as thetime spent for appraisal of the damage the costsassociated with renting a replacement car for theduration of a repair etc Such costs could bereadily incorporated into the model To illus-trate we assume that these costs are known inadvance and are given by a constant c (whichcould in principle vary with each individual)Since c will not vary with the chosen level ofdeductible it will not affect the value d andwill enter the empirical model only through itseffect on d In particular equation (7) willchange to

(19) r

p

d 1

d c

and everything else will remain the same

48 We would not be as comfortable with this statementfor the choice of high and very high deductibles which areat much higher levels This is one additional reason to focusonly on the choice between low and regular deductibles

FIGURE 3 CLAIM DISTRIBUTIONS

Notes This figure plots kernel densities of the claim amounts estimated separately depending on the deductible choice Forease of comparison we normalize the claim amounts by the level of the regular deductible (ie the normalization is invariantto the deductible choice) and truncate the distribution at ten (the truncated part which includes a fat tail outside of the figureaccounts for about 25 percent of the distribution and is roughly similar for both deductible choices) The thick line presentsthe distribution of the claim amounts for individuals who chose a low deductible while the thin line does the same for thosewho chose a regular deductible Clearly both distributions are truncated from below at the deductible level The figure showsthat the distributions are fairly similar Assuming that the claim amount distribution is the same the area below the thickerline between 06 and 1 is the fraction of claims that would fall between the two deductible levels and therefore (absentdynamic incentives) would be filed only if a low deductible were chosen This area (between the two dotted vertical lines)amounts to 13 percent implying that the potential bias arising from restricting attention to claim rate (and abstracting fromthe claim distribution) is quite limited As we discuss in the text dynamic incentives due to experience rating may increasethe costs of filing a claim shifting the region in which the deductible choice matters to the right an upper bound to these costsis about 70 percent of the regular deductible covering an area (between the two dashed vertical lines) that integrates to morethan 7 percent Note however that these dynamic incentives are a very conservative upper bound they apply to less than 15percent of the individuals and do not account for the exit option which significantly reduces these dynamic costs

775VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

This implies that in principle such costs willhave no effect on the results of the counterfac-tual exercise we present later The costs willhowever affect the interpretation of the esti-mates of risk aversion In particular instead ofthe distribution of r we will now be estimatingthe distribution of r(d c)d so the reportedestimates of the coefficient of absolute riskaversion will be biased upward The magnitudeof the bias depends on the size of this transac-tion cost c compared to the average deductibled If c is relatively small the bias is negligibleIf however c is as big as the (average) deduct-ible level all our reported estimates of the levelof risk aversion should be divided by two (butthe coefficients on observables which are semi-elasticities will not change) The intuitionwould be similar but more involved if c variesacross individuals but not proportionally to d

Data about such transaction costs are ofcourse not available The following back-of-the-envelope exercise may provide some guidanceas to the magnitude of such costs We collecteddata from the Israeli police about the annual num-bers of accidents49 accidents with fatalities andaccidents with severe injuries in Israel for theyears 1996ndash1999 We then divided these numbersby our estimate of the total number of auto insur-ance claims in Israel (161859)50 We obtain that159 percent of the claims involve reported acci-dents 22 percent involve accidents with severeinjuries and 03 percent involve fatal accidentsThus the majority of claims reflect small unre-ported accidents perhaps suggesting that theseadditional costs of a claim are often not too large

Deviations from Expected Utility TheorymdashThroughout the paper we restrict attention toexpected utility maximizers Despite much ev-idence in the literature against some of the pre-dictions of expected utility theory it still seemsto us the most natural benchmark to specify andone that facilitates comparison to previous stud-ies We note that expected utility theory is as-sumed it is not and cannot be tested within our

framework Given our cross-sectional analysiswhich in principle allows flexible forms ofunobserved heterogeneity in risk preferencesthere are no testable restrictions imposed byexpected utility theory We should also note thatmuch (but not all) of the documented evidenceagainst expected utility theory arises with ex-treme risk probabilities which are close to zeroor one Our data (and our estimates) are basedon risk probabilities that are all in the range of010 to 035 Over this range expected utilityseems to perform better Finally it is importantto stress two points First at the conceptuallevel it is straightforward to use an alternativetheory of decisions under uncertainty If con-ditional on objective risk individuals vary in asingle dimension the same conceptual modeland empirical strategy can be applied All oneneeds to do is to specify the parameter overwhich decisions vary and construct an indiffer-ence set in the space of the specified parameterand (objective) risk types similar to the onepresented in Figure 2 Second any alternativemodel of decisions under uncertainty would re-quire us to take an even stronger view regardingthe parameterized objective function For exam-ple prospect theory (Daniel Kahneman andAmos Tversky 1979) would require us to pa-rameterize not only the curvature of individu-alsrsquo utility functions but also their referencepoints for which there is no natural choice inour context Similar issues arise if we try toapply decision weights (Tversky and PeterWakker 1995) or measures of overconfidencewith respect to driving ability

F Implications for Profits and Pricing

We now look at how firm profits vary withalternative pricing schemes This exercise isinteresting for several reasons First althoughwe do not use supply-side information for esti-mation it shows how one may incorporate suchinformation in estimation Second it illustratesthe conceptual trade-off faced by a monopolistthat operates in a market with adverse selectionAlthough the conceptual trade-off betweenhigher demand and worse selection is wellknown and has been extensively analyzed in thetheoretical literature quantifying it is importantto understand its empirical relevance Finallywe argued earlier that unobserved heterogeneityin risk aversion seems more important than un-

49 An accident is counted in this measure if it was re-ported to the police and a police officer arrived at the scene

50 This is estimated by taking the number of activepolicies at the end of our sample (45871) dividing it by ourbest guess for the share of the market the company had atthe time (7 percent) and multiplying it by the estimatedclaim rate in our data as computed in Table 2 (0245)

776 THE AMERICAN ECONOMIC REVIEW JUNE 2007

observed heterogeneity in risk The current ex-ercise shows that this conclusion also translatesto pricing and profits

Throughout this section we hold fixed thedistribution of risk and risk aversion the firmfaces Since we have little information about thedeterminants of overall demand faced by thefirm and hence how restrictive it is to hold thedistribution fixed we make the simplifying as-sumption that individuals make their choicessequentially They first choose the insuranceprovider by observing only the price of theregular deductible Once they decide to buy apolicy from the insurer they choose the deduct-ible level This seems a reasonable approxima-tion as the regular deductible is the one alwaysadvertised and initially quoted while the otheroptions are revealed only once the potentialcustomer and the insurance sales person ldquogetinto detailsrdquo Consistent with this assumptionwe assume that the regular premium and de-ductible are dictated by competitive conditionsand we focus on the choice of the low deduct-ible and its associated premium

From the companyrsquos standpoint each indi-vidual can be represented by a random draw of(i ri) from the conditional (on observables)distribution of risk and risk aversion

(20) ln i

ln ri Nxi

xi

2 r

r r2

When analyzing the optimal menu to offer suchan individual the company is assumed to berisk neutral and to maximize expected profitsSuppose the company offered only the regulardeductible and premium (dh ph) Let the ex-pected profits from this strategy be 0 Givenour assumptions we proceed by analyzing howthe firmrsquos profits are affected by a choice of anadditional low deductible option (dl pl) withdl dh and pl ph It is easy to use a changein variables and analyze the choice of d dh dl and p pl ph Expected profits arenow given by

(21) maxdp

0 Prri rii d pp

d Eiri rii d p

The trade-off in the companyrsquos decision isstraightforward Each new customer who chooses

the low combination pays an additional p up-front but saves d for each accident she is in-volved in This translates into two effects thatenter the companyrsquos decision problem The first issimilar to a standard pricing problem higher(lower) price difference (deductible difference)p (d) leads to a higher markup (on those indi-viduals who select the low deductible) but tolower quantity (or probability of purchase) asfewer individuals elect to choose the low deduct-ible This effect enters the profit function throughD(d p) Pr(ri ri(i d p)) The secondcomposition effect arises because of adverse se-lection As the price of the low deductible in-creases those individuals who still elect to choosethe low combination are ceteris paribus thosewith higher risk This effect enters through (dp) E(iri ri(i d p)) Its magnitude andsign depend on the relative heterogeneity of i andri and on the correlation between them Sinceneither D(d p) nor (d p) has a closed-form solution we analyze this decision problemgraphically where D(d p) and (d p) arenumerically computed using simulations from thejoint distribution

We illustrate our analysis by using the meanindividual in the data whose expected ln(i)and ln(ri) are 154 and 1181 respectively(based on Table 4) Such an individual has tochoose between a regular contract of ( phdh) (3189 1595) (in NIS) and a lowcontract of ( pl dl) (3381 957) ie (pd) (191 638) Below we discuss severaladditional counterfactual cases First we con-sider a case with a negative correlation be-tween risk and risk aversion (with the samemagnitude ie 084) Second we con-sider cases when the company ignores unob-served heterogeneity in one of the dimensionsie it views individuals as a draw from theestimated marginal distribution on one dimen-sion with the other dimension known and fixedat its estimated mean We do this exercise foreach dimension separately

To get intuition for the different effects Figure2 presents the estimated distribution in the spaceof (i ri) A small increase (decrease) in p (d)shifts the indifference setup to the right therebymaking some marginal individuals who were pre-viously just to the right of it switch to choosingthe regular deductible The demand trade-off isjust the comparison between the marginal loss ofthe company from all the marginal individuals

777VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

who no longer buy higher coverage with thehigher profits made from the inframarginal indi-viduals who still elect to choose higher coverageFigure 2 also helps in illustrating the effect ofadverse selection and the importance of the cor-relation coefficient As the menu shifts to the rightthe positive correlation implies that the marginalindividuals have higher risk than the average Thismeans that ldquolosingrdquo them (namely havingthem buy less coverage) is not as costly forthe insurance company as such individualsare on average more adversely selected Anegative correlation for example would havemade these marginal individuals more valuablethereby decreasing the incentive to increase pricesor deductibles from the current levels

Figure 4 presents the effects of pricing onprofits by varying the low-deductible levelkeeping the premium charged for it fixed at theobserved price (of p 191) It implies that the

current low deductible benefit of 638 NIS re-sults in additional annual profits of about 368NIS per customer This is about 037 percent oftotal operating profits per customer which isabout 1000 NIS Note however that after sub-tracting the administrative and claim-handlingcosts associated with each customer and claimthe relative magnitude of this effect will behigher Note also that the estimates imply thatthe current low-deductible level is suboptimalBy setting a smaller low-deductible benefit ofd 355 NIS (ie increasing the current lowdeductible by 283 NIS) additional profits canbe increased to 659 NIS51 Of course the find-

51 There is no reason of course to limit the choice of thecompany to only one additional deductible level Moredegrees of freedom in choosing the pricing menu will leadto higher profits

FIGURE 4 COUNTERFACTUALSmdashPROFITS

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) We plot the additional profits fromoffering a low deductible as a function of the attractiveness of a low deductible d dh dl fixing its price at the observedlevel The thick solid line presents the counterfactual profits as implied by the estimated benchmark model The other threecurves illustrate how the profits change in response to changes in the assumptions when the correlation between riskand risk aversion is negative (thin solid line) when there is no heterogeneity in risk aversion (dot-dashed line) and whenthere is no heterogeneity in risk (dashed line) The maxima (argmax) of the four curves respectively are 659 (355)714 (583) 0 and 704 (500) The dotted vertical line represents the observed level of d (638) which implies that theadditional profits from the observed low deductible are 368 NIS per policy

778 THE AMERICAN ECONOMIC REVIEW JUNE 2007

ing that the current pricing is suboptimal mayalso be due to a limitation of the model

Consistent with the intuition discussed aboveFigure 4 also shows that the incentive to in-crease prices (or lower the deductible) is higherwith positive correlation When the correlationbetween risk and risk aversion is negative theoptimal low deductible is lower It also showsthat ignoring either dimension of unobservedheterogeneity has an important effect on pric-ing decisions However while ignoring het-erogeneity in risk does not change thequalitative pattern by much ignoring hetero-geneity in risk aversion completely changesthe pricing analysis In fact given the esti-mated coefficients and the observed prices ifindividuals vary only in risk offering a lowdeductible does not increase profits due to ad-verse selection

Figure 5 breaks down the effect on profits bypresenting the pricing effect on the demand forlow deductible D(d p) and on the compo-sition effect (d p) The former is simplygenerated by the distribution of certainty equiv-alents implied by the joint distribution of i andri (Landsberger and Meilijson 1999) It is S-shaped due to the assumption of lognormal dis-tribution The shape of the composition effect isdriven by the relative variance of i and ri andby the correlation coefficient As the estimatesimply that most of the variation in certaintyequivalents is driven by variation in ri thestrong positive correlation implies that the com-position effect is monotonically decreasing inthe deductible level As the low-deductible op-tion becomes more favorable more peoplechoose it with the most risky individuals choos-ing it first The effect of the deductible level onthe composition effect is dramatically differentwhen the correlation between risk and risk aver-sion is negative When this is the case theobserved relationship between the deductiblelevel and the composition effect is mostly re-versed This is because the effect of risk aver-sion dominates that of adverse selection due toits higher variance

IV Concluding Remarks

The paper makes two separate contributionsFirst from a methodological standpoint we layout a conceptual framework through which one

can formulate and estimate a demand system forindividually customized contracts The key datarequirements for this approach are contractchoices individual choice sets and ex post riskrealizations Since such data may be available inmany other contexts the methodological frame-work may be useful to uncover structural pa-rameters in such settings As an example onecould consider annuity data and use guaranteeperiod choices and mortality data to identifyheterogeneity in risk (mortality) and in prefer-ences for wealth after death (Einav Finkelsteinand Paul Schrimpf 2006) Similarly one couldconsider loan data and use down-paymentchoices and default data to identify heterogene-ity in risk (default) and in liquidity (Will Ad-ams Einav and Jonathan D Levin 2006)Second from an economic standpoint we pro-vide a new set of estimates for the degree andheterogeneity of (absolute) risk aversion and itsrelationship with risk We discuss these below

While our estimates of risk aversion help topredict other related insurance decisions it isnatural to ask to what extent these parametersare relevant in other contexts This is essentiallyan empirical question which can be answeredonly by estimating risk aversion parameters fora variety of bet sizes and in a variety of con-texts Since isolating risk preferences in manycontexts is hard such exercises are rare leavingus with no definite answer for the scope ofmarkets for which our estimates may be rele-vant On one hand Rabin (2000) and Rabin andThaler (2001) argue that different decisions inlife are taken in different contexts and thereforemay be subject to different parameters in theutility function On the other hand classicaltheory suggests that each individual has onevalue function over her lifetime wealth so allrisky decisions take into account the same valuefunction and are therefore subject to the samerisk preferences Our view is somewhere inbetween We are more comfortable with extrap-olation of our risk aversion estimates to setupsthat are closer to the auto insurance marketcontext in which these parameters are esti-mated To assess ldquoclosenessrdquo it is important toconsider various factors over which contextsmay differ Such factors may include bet size aswell as informational and behavioral effectssuch as default options framing and rarity of theevents Since we use claim data to identify risktype we leave everything else to be interpreted as

779VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

FIGURE 5 COUNTERFACTUALSmdashSELECTION

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) Parallel to Figure 4 we breakdown the effects on profits to the share of consumers who choose a low deductible (bottom panel) and to the expected riskof this group (top panel) This is presented by the thick solid line for the estimates of the benchmark model As in Figure 4 wealso present these effects for three additional cases when the correlation between risk and risk aversion is negative (thin solidline) when there is no heterogeneity in risk aversion (dot-dashed line) and when there is no heterogeneity in risk (dashedline) The dotted vertical lines represent the observed level of d (638) for which the share of low deductible is 16 percentand their expected risk is 0264 This may be compared with the corresponding figures in Table 2A of 178 and 0309respectively Note however that the figure presents estimated quantities for the average individual in the data while Table2A presents the average quantities in the data so one should not expect the numbers to fit perfectly

780 THE AMERICAN ECONOMIC REVIEW JUNE 2007

risk aversion As an example overconfidence willbe captured by a lower level of estimated riskaversion so if overconfidence is more importantin auto insurance than in health insurance whenextrapolated to health insurance individuals maybehave as if they are more risk averse than weestimate them to be

With this caveat in mind let us discuss ourfour main findings First we find large hetero-geneity in risk preferences across individualsThis heterogeneity is important in various con-texts (a) it cautions against using pure cross-sectional variation to test expected utilitytheory (b) it may create strong selection ofparticipants into particular markets this maymake participants in voluntary markets quitedifferent in their risk preferences from those inmarkets in which participation is either manda-tory or driven by other factors and (c) it maymake the marginal individual quite differentfrom the average one Models in macroeconom-ics and finance which often use a representativeindividual framework may not be able to cap-ture and account for such differences

Our second set of findings concerns the wayrisk aversion relates to observable characteris-tics Our finding that women are more riskaverse than men has been documented in othersettings The finding that risk preferences ex-hibit a U-shaped pattern over the life cycle maybe interesting to explore further in other con-texts Other findings suggest that the estimatedcoefficient of risk aversion increases with ob-servables that are related to income and wealthAs we agree with the widely held belief of thedecreasing absolute risk aversion property ourpreferred interpretation for this finding is thatwealth and income may be endogenous gener-ating the estimated cross-sectional relationshipWhile lower risk aversion may be associatedwith higher propensity to become an entrepre-neur and thereby have higher wealth it mayalso be associated with lower propensity to saveor invest in education affecting wealth the otherway Therefore one important message of thisfinding is that accounting for heterogeneity inpreferences may be important as representativeconsumer models may provide misleading in-terpretations for otherwise natural results

The last two sets of findings concern therelationship between risk and preferences Sincerisk is particular to the context in which it ismeasured these findings may be sensitive to the

market context and may change once risk takesother forms Even within the auto insurancemarket it is important how risk is measuredMoreover risk in the auto insurance marketmay be conceptually different from risk in othermarkets In many markets risk is independentacross individuals In auto insurance howevermuch of the risk depends on coordinationamong drivers and therefore may be more re-lated to relative not absolute characteristicsFor this reason our finding of positive correla-tion between risk and risk aversion can coexistwith findings of negative correlation in othercontexts (Israel 2005 Finkelstein and McGarry2006) The positive correlation we find is alsoconsistent with the fact that the bivariate probittest in our data provides evidence for adverseselection (Cohen 2005) while similar reduced-form tests in other contexts do not

Finally we find that unobserved heterogene-ity in risk preferences is more important thanheterogeneity in risk This may be driven by thecasual evidence that insurance companies exertmuch effort and resources in collecting con-sumer data which are informative about riskclassification but not about preferences52 Weillustrate the empirical importance of our find-ings for the analysis of optimal contracts in autoinsurance The presence of more than one di-mension of unobserved heterogeneity may dra-matically change the nature of these contractsTheory is still not fully developed for suchmultidimensional screening problems as it typ-ically requires a small number of types (Lands-berger and Meilijson 1999) restricts the twodimensions to be independent of each other(Jean-Charles Rochet and Lars A Stole 2002)or assumes that the number of instrumentsavailable to the monopolist is not smaller thanthe dimension of unobserved heterogeneity(Steven Matthews and John Moore 1987 Rich-ard Arnott and Joseph E Stiglitz 1988)53 MarkArmstrong (1999) may be the closest theoreticalwork to the framework suggested here It cannot

52 This choice of data collection efforts may be justifiedif it is easier for such firms to price discriminate based onrisk but harder to price discriminate based on preferencesA common belief is that without cost-based (ie risk-based) justification for prices price discrimination may leadto consumer backlash

53 See also Smart (2000) Villeneuve (2003) and Jullienet al (2007) for related theoretical results

781VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

be directly applied however as it uses simpli-fying linearity assumptions which would behard to impose in the current context Our re-sults indicate that many applications can benefitfrom extending the theory to include the more

general case such as the one analyzed hereSuch a theory may also serve as a guide forusing supply-side restrictions in similar con-texts Our counterfactual analysis is a very pre-liminary start in this direction

APPENDIX A DESCRIPTION OF THE GIBBS SAMPLER

In this appendix we describe the setup of the Gibbs sampler that we use to estimate the modelOne of the main advantages of the Gibbs sampler is its ability to allow for data augmentation oflatent variables (Tanner and Wong 1987) In our context this amounts to augmenting the individual-specific risk aversion and risk type namely i rii1

n as additional parametersWe can write the model as follows

(22) ln i xi i

(23) ln ri xi vi

(24) choicei 1 if ri ri ()0 if ri ri ()

(25) claimsi Poissoni ti

(26) i

vi

iid

N00

2 r

r r2

Let

2 r

r r2 X

x1

xn

0

0 x1

xn

y r and ui i

vi

The set of parameters for which we want to have a posterior distribution is given by uii1

n The prior specifies that are independent of uii1n have a conventional

diffuse prior We adopt a hierarchical prior for uii1N

(27) uii 1n

iid

N0

(28) 1 Wishart2a Q

782 THE AMERICAN ECONOMIC REVIEW JUNE 2007

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 32: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

This implies that in principle such costs willhave no effect on the results of the counterfac-tual exercise we present later The costs willhowever affect the interpretation of the esti-mates of risk aversion In particular instead ofthe distribution of r we will now be estimatingthe distribution of r(d c)d so the reportedestimates of the coefficient of absolute riskaversion will be biased upward The magnitudeof the bias depends on the size of this transac-tion cost c compared to the average deductibled If c is relatively small the bias is negligibleIf however c is as big as the (average) deduct-ible level all our reported estimates of the levelof risk aversion should be divided by two (butthe coefficients on observables which are semi-elasticities will not change) The intuitionwould be similar but more involved if c variesacross individuals but not proportionally to d

Data about such transaction costs are ofcourse not available The following back-of-the-envelope exercise may provide some guidanceas to the magnitude of such costs We collecteddata from the Israeli police about the annual num-bers of accidents49 accidents with fatalities andaccidents with severe injuries in Israel for theyears 1996ndash1999 We then divided these numbersby our estimate of the total number of auto insur-ance claims in Israel (161859)50 We obtain that159 percent of the claims involve reported acci-dents 22 percent involve accidents with severeinjuries and 03 percent involve fatal accidentsThus the majority of claims reflect small unre-ported accidents perhaps suggesting that theseadditional costs of a claim are often not too large

Deviations from Expected Utility TheorymdashThroughout the paper we restrict attention toexpected utility maximizers Despite much ev-idence in the literature against some of the pre-dictions of expected utility theory it still seemsto us the most natural benchmark to specify andone that facilitates comparison to previous stud-ies We note that expected utility theory is as-sumed it is not and cannot be tested within our

framework Given our cross-sectional analysiswhich in principle allows flexible forms ofunobserved heterogeneity in risk preferencesthere are no testable restrictions imposed byexpected utility theory We should also note thatmuch (but not all) of the documented evidenceagainst expected utility theory arises with ex-treme risk probabilities which are close to zeroor one Our data (and our estimates) are basedon risk probabilities that are all in the range of010 to 035 Over this range expected utilityseems to perform better Finally it is importantto stress two points First at the conceptuallevel it is straightforward to use an alternativetheory of decisions under uncertainty If con-ditional on objective risk individuals vary in asingle dimension the same conceptual modeland empirical strategy can be applied All oneneeds to do is to specify the parameter overwhich decisions vary and construct an indiffer-ence set in the space of the specified parameterand (objective) risk types similar to the onepresented in Figure 2 Second any alternativemodel of decisions under uncertainty would re-quire us to take an even stronger view regardingthe parameterized objective function For exam-ple prospect theory (Daniel Kahneman andAmos Tversky 1979) would require us to pa-rameterize not only the curvature of individu-alsrsquo utility functions but also their referencepoints for which there is no natural choice inour context Similar issues arise if we try toapply decision weights (Tversky and PeterWakker 1995) or measures of overconfidencewith respect to driving ability

F Implications for Profits and Pricing

We now look at how firm profits vary withalternative pricing schemes This exercise isinteresting for several reasons First althoughwe do not use supply-side information for esti-mation it shows how one may incorporate suchinformation in estimation Second it illustratesthe conceptual trade-off faced by a monopolistthat operates in a market with adverse selectionAlthough the conceptual trade-off betweenhigher demand and worse selection is wellknown and has been extensively analyzed in thetheoretical literature quantifying it is importantto understand its empirical relevance Finallywe argued earlier that unobserved heterogeneityin risk aversion seems more important than un-

49 An accident is counted in this measure if it was re-ported to the police and a police officer arrived at the scene

50 This is estimated by taking the number of activepolicies at the end of our sample (45871) dividing it by ourbest guess for the share of the market the company had atthe time (7 percent) and multiplying it by the estimatedclaim rate in our data as computed in Table 2 (0245)

776 THE AMERICAN ECONOMIC REVIEW JUNE 2007

observed heterogeneity in risk The current ex-ercise shows that this conclusion also translatesto pricing and profits

Throughout this section we hold fixed thedistribution of risk and risk aversion the firmfaces Since we have little information about thedeterminants of overall demand faced by thefirm and hence how restrictive it is to hold thedistribution fixed we make the simplifying as-sumption that individuals make their choicessequentially They first choose the insuranceprovider by observing only the price of theregular deductible Once they decide to buy apolicy from the insurer they choose the deduct-ible level This seems a reasonable approxima-tion as the regular deductible is the one alwaysadvertised and initially quoted while the otheroptions are revealed only once the potentialcustomer and the insurance sales person ldquogetinto detailsrdquo Consistent with this assumptionwe assume that the regular premium and de-ductible are dictated by competitive conditionsand we focus on the choice of the low deduct-ible and its associated premium

From the companyrsquos standpoint each indi-vidual can be represented by a random draw of(i ri) from the conditional (on observables)distribution of risk and risk aversion

(20) ln i

ln ri Nxi

xi

2 r

r r2

When analyzing the optimal menu to offer suchan individual the company is assumed to berisk neutral and to maximize expected profitsSuppose the company offered only the regulardeductible and premium (dh ph) Let the ex-pected profits from this strategy be 0 Givenour assumptions we proceed by analyzing howthe firmrsquos profits are affected by a choice of anadditional low deductible option (dl pl) withdl dh and pl ph It is easy to use a changein variables and analyze the choice of d dh dl and p pl ph Expected profits arenow given by

(21) maxdp

0 Prri rii d pp

d Eiri rii d p

The trade-off in the companyrsquos decision isstraightforward Each new customer who chooses

the low combination pays an additional p up-front but saves d for each accident she is in-volved in This translates into two effects thatenter the companyrsquos decision problem The first issimilar to a standard pricing problem higher(lower) price difference (deductible difference)p (d) leads to a higher markup (on those indi-viduals who select the low deductible) but tolower quantity (or probability of purchase) asfewer individuals elect to choose the low deduct-ible This effect enters the profit function throughD(d p) Pr(ri ri(i d p)) The secondcomposition effect arises because of adverse se-lection As the price of the low deductible in-creases those individuals who still elect to choosethe low combination are ceteris paribus thosewith higher risk This effect enters through (dp) E(iri ri(i d p)) Its magnitude andsign depend on the relative heterogeneity of i andri and on the correlation between them Sinceneither D(d p) nor (d p) has a closed-form solution we analyze this decision problemgraphically where D(d p) and (d p) arenumerically computed using simulations from thejoint distribution

We illustrate our analysis by using the meanindividual in the data whose expected ln(i)and ln(ri) are 154 and 1181 respectively(based on Table 4) Such an individual has tochoose between a regular contract of ( phdh) (3189 1595) (in NIS) and a lowcontract of ( pl dl) (3381 957) ie (pd) (191 638) Below we discuss severaladditional counterfactual cases First we con-sider a case with a negative correlation be-tween risk and risk aversion (with the samemagnitude ie 084) Second we con-sider cases when the company ignores unob-served heterogeneity in one of the dimensionsie it views individuals as a draw from theestimated marginal distribution on one dimen-sion with the other dimension known and fixedat its estimated mean We do this exercise foreach dimension separately

To get intuition for the different effects Figure2 presents the estimated distribution in the spaceof (i ri) A small increase (decrease) in p (d)shifts the indifference setup to the right therebymaking some marginal individuals who were pre-viously just to the right of it switch to choosingthe regular deductible The demand trade-off isjust the comparison between the marginal loss ofthe company from all the marginal individuals

777VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

who no longer buy higher coverage with thehigher profits made from the inframarginal indi-viduals who still elect to choose higher coverageFigure 2 also helps in illustrating the effect ofadverse selection and the importance of the cor-relation coefficient As the menu shifts to the rightthe positive correlation implies that the marginalindividuals have higher risk than the average Thismeans that ldquolosingrdquo them (namely havingthem buy less coverage) is not as costly forthe insurance company as such individualsare on average more adversely selected Anegative correlation for example would havemade these marginal individuals more valuablethereby decreasing the incentive to increase pricesor deductibles from the current levels

Figure 4 presents the effects of pricing onprofits by varying the low-deductible levelkeeping the premium charged for it fixed at theobserved price (of p 191) It implies that the

current low deductible benefit of 638 NIS re-sults in additional annual profits of about 368NIS per customer This is about 037 percent oftotal operating profits per customer which isabout 1000 NIS Note however that after sub-tracting the administrative and claim-handlingcosts associated with each customer and claimthe relative magnitude of this effect will behigher Note also that the estimates imply thatthe current low-deductible level is suboptimalBy setting a smaller low-deductible benefit ofd 355 NIS (ie increasing the current lowdeductible by 283 NIS) additional profits canbe increased to 659 NIS51 Of course the find-

51 There is no reason of course to limit the choice of thecompany to only one additional deductible level Moredegrees of freedom in choosing the pricing menu will leadto higher profits

FIGURE 4 COUNTERFACTUALSmdashPROFITS

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) We plot the additional profits fromoffering a low deductible as a function of the attractiveness of a low deductible d dh dl fixing its price at the observedlevel The thick solid line presents the counterfactual profits as implied by the estimated benchmark model The other threecurves illustrate how the profits change in response to changes in the assumptions when the correlation between riskand risk aversion is negative (thin solid line) when there is no heterogeneity in risk aversion (dot-dashed line) and whenthere is no heterogeneity in risk (dashed line) The maxima (argmax) of the four curves respectively are 659 (355)714 (583) 0 and 704 (500) The dotted vertical line represents the observed level of d (638) which implies that theadditional profits from the observed low deductible are 368 NIS per policy

778 THE AMERICAN ECONOMIC REVIEW JUNE 2007

ing that the current pricing is suboptimal mayalso be due to a limitation of the model

Consistent with the intuition discussed aboveFigure 4 also shows that the incentive to in-crease prices (or lower the deductible) is higherwith positive correlation When the correlationbetween risk and risk aversion is negative theoptimal low deductible is lower It also showsthat ignoring either dimension of unobservedheterogeneity has an important effect on pric-ing decisions However while ignoring het-erogeneity in risk does not change thequalitative pattern by much ignoring hetero-geneity in risk aversion completely changesthe pricing analysis In fact given the esti-mated coefficients and the observed prices ifindividuals vary only in risk offering a lowdeductible does not increase profits due to ad-verse selection

Figure 5 breaks down the effect on profits bypresenting the pricing effect on the demand forlow deductible D(d p) and on the compo-sition effect (d p) The former is simplygenerated by the distribution of certainty equiv-alents implied by the joint distribution of i andri (Landsberger and Meilijson 1999) It is S-shaped due to the assumption of lognormal dis-tribution The shape of the composition effect isdriven by the relative variance of i and ri andby the correlation coefficient As the estimatesimply that most of the variation in certaintyequivalents is driven by variation in ri thestrong positive correlation implies that the com-position effect is monotonically decreasing inthe deductible level As the low-deductible op-tion becomes more favorable more peoplechoose it with the most risky individuals choos-ing it first The effect of the deductible level onthe composition effect is dramatically differentwhen the correlation between risk and risk aver-sion is negative When this is the case theobserved relationship between the deductiblelevel and the composition effect is mostly re-versed This is because the effect of risk aver-sion dominates that of adverse selection due toits higher variance

IV Concluding Remarks

The paper makes two separate contributionsFirst from a methodological standpoint we layout a conceptual framework through which one

can formulate and estimate a demand system forindividually customized contracts The key datarequirements for this approach are contractchoices individual choice sets and ex post riskrealizations Since such data may be available inmany other contexts the methodological frame-work may be useful to uncover structural pa-rameters in such settings As an example onecould consider annuity data and use guaranteeperiod choices and mortality data to identifyheterogeneity in risk (mortality) and in prefer-ences for wealth after death (Einav Finkelsteinand Paul Schrimpf 2006) Similarly one couldconsider loan data and use down-paymentchoices and default data to identify heterogene-ity in risk (default) and in liquidity (Will Ad-ams Einav and Jonathan D Levin 2006)Second from an economic standpoint we pro-vide a new set of estimates for the degree andheterogeneity of (absolute) risk aversion and itsrelationship with risk We discuss these below

While our estimates of risk aversion help topredict other related insurance decisions it isnatural to ask to what extent these parametersare relevant in other contexts This is essentiallyan empirical question which can be answeredonly by estimating risk aversion parameters fora variety of bet sizes and in a variety of con-texts Since isolating risk preferences in manycontexts is hard such exercises are rare leavingus with no definite answer for the scope ofmarkets for which our estimates may be rele-vant On one hand Rabin (2000) and Rabin andThaler (2001) argue that different decisions inlife are taken in different contexts and thereforemay be subject to different parameters in theutility function On the other hand classicaltheory suggests that each individual has onevalue function over her lifetime wealth so allrisky decisions take into account the same valuefunction and are therefore subject to the samerisk preferences Our view is somewhere inbetween We are more comfortable with extrap-olation of our risk aversion estimates to setupsthat are closer to the auto insurance marketcontext in which these parameters are esti-mated To assess ldquoclosenessrdquo it is important toconsider various factors over which contextsmay differ Such factors may include bet size aswell as informational and behavioral effectssuch as default options framing and rarity of theevents Since we use claim data to identify risktype we leave everything else to be interpreted as

779VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

FIGURE 5 COUNTERFACTUALSmdashSELECTION

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) Parallel to Figure 4 we breakdown the effects on profits to the share of consumers who choose a low deductible (bottom panel) and to the expected riskof this group (top panel) This is presented by the thick solid line for the estimates of the benchmark model As in Figure 4 wealso present these effects for three additional cases when the correlation between risk and risk aversion is negative (thin solidline) when there is no heterogeneity in risk aversion (dot-dashed line) and when there is no heterogeneity in risk (dashedline) The dotted vertical lines represent the observed level of d (638) for which the share of low deductible is 16 percentand their expected risk is 0264 This may be compared with the corresponding figures in Table 2A of 178 and 0309respectively Note however that the figure presents estimated quantities for the average individual in the data while Table2A presents the average quantities in the data so one should not expect the numbers to fit perfectly

780 THE AMERICAN ECONOMIC REVIEW JUNE 2007

risk aversion As an example overconfidence willbe captured by a lower level of estimated riskaversion so if overconfidence is more importantin auto insurance than in health insurance whenextrapolated to health insurance individuals maybehave as if they are more risk averse than weestimate them to be

With this caveat in mind let us discuss ourfour main findings First we find large hetero-geneity in risk preferences across individualsThis heterogeneity is important in various con-texts (a) it cautions against using pure cross-sectional variation to test expected utilitytheory (b) it may create strong selection ofparticipants into particular markets this maymake participants in voluntary markets quitedifferent in their risk preferences from those inmarkets in which participation is either manda-tory or driven by other factors and (c) it maymake the marginal individual quite differentfrom the average one Models in macroeconom-ics and finance which often use a representativeindividual framework may not be able to cap-ture and account for such differences

Our second set of findings concerns the wayrisk aversion relates to observable characteris-tics Our finding that women are more riskaverse than men has been documented in othersettings The finding that risk preferences ex-hibit a U-shaped pattern over the life cycle maybe interesting to explore further in other con-texts Other findings suggest that the estimatedcoefficient of risk aversion increases with ob-servables that are related to income and wealthAs we agree with the widely held belief of thedecreasing absolute risk aversion property ourpreferred interpretation for this finding is thatwealth and income may be endogenous gener-ating the estimated cross-sectional relationshipWhile lower risk aversion may be associatedwith higher propensity to become an entrepre-neur and thereby have higher wealth it mayalso be associated with lower propensity to saveor invest in education affecting wealth the otherway Therefore one important message of thisfinding is that accounting for heterogeneity inpreferences may be important as representativeconsumer models may provide misleading in-terpretations for otherwise natural results

The last two sets of findings concern therelationship between risk and preferences Sincerisk is particular to the context in which it ismeasured these findings may be sensitive to the

market context and may change once risk takesother forms Even within the auto insurancemarket it is important how risk is measuredMoreover risk in the auto insurance marketmay be conceptually different from risk in othermarkets In many markets risk is independentacross individuals In auto insurance howevermuch of the risk depends on coordinationamong drivers and therefore may be more re-lated to relative not absolute characteristicsFor this reason our finding of positive correla-tion between risk and risk aversion can coexistwith findings of negative correlation in othercontexts (Israel 2005 Finkelstein and McGarry2006) The positive correlation we find is alsoconsistent with the fact that the bivariate probittest in our data provides evidence for adverseselection (Cohen 2005) while similar reduced-form tests in other contexts do not

Finally we find that unobserved heterogene-ity in risk preferences is more important thanheterogeneity in risk This may be driven by thecasual evidence that insurance companies exertmuch effort and resources in collecting con-sumer data which are informative about riskclassification but not about preferences52 Weillustrate the empirical importance of our find-ings for the analysis of optimal contracts in autoinsurance The presence of more than one di-mension of unobserved heterogeneity may dra-matically change the nature of these contractsTheory is still not fully developed for suchmultidimensional screening problems as it typ-ically requires a small number of types (Lands-berger and Meilijson 1999) restricts the twodimensions to be independent of each other(Jean-Charles Rochet and Lars A Stole 2002)or assumes that the number of instrumentsavailable to the monopolist is not smaller thanthe dimension of unobserved heterogeneity(Steven Matthews and John Moore 1987 Rich-ard Arnott and Joseph E Stiglitz 1988)53 MarkArmstrong (1999) may be the closest theoreticalwork to the framework suggested here It cannot

52 This choice of data collection efforts may be justifiedif it is easier for such firms to price discriminate based onrisk but harder to price discriminate based on preferencesA common belief is that without cost-based (ie risk-based) justification for prices price discrimination may leadto consumer backlash

53 See also Smart (2000) Villeneuve (2003) and Jullienet al (2007) for related theoretical results

781VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

be directly applied however as it uses simpli-fying linearity assumptions which would behard to impose in the current context Our re-sults indicate that many applications can benefitfrom extending the theory to include the more

general case such as the one analyzed hereSuch a theory may also serve as a guide forusing supply-side restrictions in similar con-texts Our counterfactual analysis is a very pre-liminary start in this direction

APPENDIX A DESCRIPTION OF THE GIBBS SAMPLER

In this appendix we describe the setup of the Gibbs sampler that we use to estimate the modelOne of the main advantages of the Gibbs sampler is its ability to allow for data augmentation oflatent variables (Tanner and Wong 1987) In our context this amounts to augmenting the individual-specific risk aversion and risk type namely i rii1

n as additional parametersWe can write the model as follows

(22) ln i xi i

(23) ln ri xi vi

(24) choicei 1 if ri ri ()0 if ri ri ()

(25) claimsi Poissoni ti

(26) i

vi

iid

N00

2 r

r r2

Let

2 r

r r2 X

x1

xn

0

0 x1

xn

y r and ui i

vi

The set of parameters for which we want to have a posterior distribution is given by uii1

n The prior specifies that are independent of uii1n have a conventional

diffuse prior We adopt a hierarchical prior for uii1N

(27) uii 1n

iid

N0

(28) 1 Wishart2a Q

782 THE AMERICAN ECONOMIC REVIEW JUNE 2007

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 33: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

observed heterogeneity in risk The current ex-ercise shows that this conclusion also translatesto pricing and profits

Throughout this section we hold fixed thedistribution of risk and risk aversion the firmfaces Since we have little information about thedeterminants of overall demand faced by thefirm and hence how restrictive it is to hold thedistribution fixed we make the simplifying as-sumption that individuals make their choicessequentially They first choose the insuranceprovider by observing only the price of theregular deductible Once they decide to buy apolicy from the insurer they choose the deduct-ible level This seems a reasonable approxima-tion as the regular deductible is the one alwaysadvertised and initially quoted while the otheroptions are revealed only once the potentialcustomer and the insurance sales person ldquogetinto detailsrdquo Consistent with this assumptionwe assume that the regular premium and de-ductible are dictated by competitive conditionsand we focus on the choice of the low deduct-ible and its associated premium

From the companyrsquos standpoint each indi-vidual can be represented by a random draw of(i ri) from the conditional (on observables)distribution of risk and risk aversion

(20) ln i

ln ri Nxi

xi

2 r

r r2

When analyzing the optimal menu to offer suchan individual the company is assumed to berisk neutral and to maximize expected profitsSuppose the company offered only the regulardeductible and premium (dh ph) Let the ex-pected profits from this strategy be 0 Givenour assumptions we proceed by analyzing howthe firmrsquos profits are affected by a choice of anadditional low deductible option (dl pl) withdl dh and pl ph It is easy to use a changein variables and analyze the choice of d dh dl and p pl ph Expected profits arenow given by

(21) maxdp

0 Prri rii d pp

d Eiri rii d p

The trade-off in the companyrsquos decision isstraightforward Each new customer who chooses

the low combination pays an additional p up-front but saves d for each accident she is in-volved in This translates into two effects thatenter the companyrsquos decision problem The first issimilar to a standard pricing problem higher(lower) price difference (deductible difference)p (d) leads to a higher markup (on those indi-viduals who select the low deductible) but tolower quantity (or probability of purchase) asfewer individuals elect to choose the low deduct-ible This effect enters the profit function throughD(d p) Pr(ri ri(i d p)) The secondcomposition effect arises because of adverse se-lection As the price of the low deductible in-creases those individuals who still elect to choosethe low combination are ceteris paribus thosewith higher risk This effect enters through (dp) E(iri ri(i d p)) Its magnitude andsign depend on the relative heterogeneity of i andri and on the correlation between them Sinceneither D(d p) nor (d p) has a closed-form solution we analyze this decision problemgraphically where D(d p) and (d p) arenumerically computed using simulations from thejoint distribution

We illustrate our analysis by using the meanindividual in the data whose expected ln(i)and ln(ri) are 154 and 1181 respectively(based on Table 4) Such an individual has tochoose between a regular contract of ( phdh) (3189 1595) (in NIS) and a lowcontract of ( pl dl) (3381 957) ie (pd) (191 638) Below we discuss severaladditional counterfactual cases First we con-sider a case with a negative correlation be-tween risk and risk aversion (with the samemagnitude ie 084) Second we con-sider cases when the company ignores unob-served heterogeneity in one of the dimensionsie it views individuals as a draw from theestimated marginal distribution on one dimen-sion with the other dimension known and fixedat its estimated mean We do this exercise foreach dimension separately

To get intuition for the different effects Figure2 presents the estimated distribution in the spaceof (i ri) A small increase (decrease) in p (d)shifts the indifference setup to the right therebymaking some marginal individuals who were pre-viously just to the right of it switch to choosingthe regular deductible The demand trade-off isjust the comparison between the marginal loss ofthe company from all the marginal individuals

777VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

who no longer buy higher coverage with thehigher profits made from the inframarginal indi-viduals who still elect to choose higher coverageFigure 2 also helps in illustrating the effect ofadverse selection and the importance of the cor-relation coefficient As the menu shifts to the rightthe positive correlation implies that the marginalindividuals have higher risk than the average Thismeans that ldquolosingrdquo them (namely havingthem buy less coverage) is not as costly forthe insurance company as such individualsare on average more adversely selected Anegative correlation for example would havemade these marginal individuals more valuablethereby decreasing the incentive to increase pricesor deductibles from the current levels

Figure 4 presents the effects of pricing onprofits by varying the low-deductible levelkeeping the premium charged for it fixed at theobserved price (of p 191) It implies that the

current low deductible benefit of 638 NIS re-sults in additional annual profits of about 368NIS per customer This is about 037 percent oftotal operating profits per customer which isabout 1000 NIS Note however that after sub-tracting the administrative and claim-handlingcosts associated with each customer and claimthe relative magnitude of this effect will behigher Note also that the estimates imply thatthe current low-deductible level is suboptimalBy setting a smaller low-deductible benefit ofd 355 NIS (ie increasing the current lowdeductible by 283 NIS) additional profits canbe increased to 659 NIS51 Of course the find-

51 There is no reason of course to limit the choice of thecompany to only one additional deductible level Moredegrees of freedom in choosing the pricing menu will leadto higher profits

FIGURE 4 COUNTERFACTUALSmdashPROFITS

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) We plot the additional profits fromoffering a low deductible as a function of the attractiveness of a low deductible d dh dl fixing its price at the observedlevel The thick solid line presents the counterfactual profits as implied by the estimated benchmark model The other threecurves illustrate how the profits change in response to changes in the assumptions when the correlation between riskand risk aversion is negative (thin solid line) when there is no heterogeneity in risk aversion (dot-dashed line) and whenthere is no heterogeneity in risk (dashed line) The maxima (argmax) of the four curves respectively are 659 (355)714 (583) 0 and 704 (500) The dotted vertical line represents the observed level of d (638) which implies that theadditional profits from the observed low deductible are 368 NIS per policy

778 THE AMERICAN ECONOMIC REVIEW JUNE 2007

ing that the current pricing is suboptimal mayalso be due to a limitation of the model

Consistent with the intuition discussed aboveFigure 4 also shows that the incentive to in-crease prices (or lower the deductible) is higherwith positive correlation When the correlationbetween risk and risk aversion is negative theoptimal low deductible is lower It also showsthat ignoring either dimension of unobservedheterogeneity has an important effect on pric-ing decisions However while ignoring het-erogeneity in risk does not change thequalitative pattern by much ignoring hetero-geneity in risk aversion completely changesthe pricing analysis In fact given the esti-mated coefficients and the observed prices ifindividuals vary only in risk offering a lowdeductible does not increase profits due to ad-verse selection

Figure 5 breaks down the effect on profits bypresenting the pricing effect on the demand forlow deductible D(d p) and on the compo-sition effect (d p) The former is simplygenerated by the distribution of certainty equiv-alents implied by the joint distribution of i andri (Landsberger and Meilijson 1999) It is S-shaped due to the assumption of lognormal dis-tribution The shape of the composition effect isdriven by the relative variance of i and ri andby the correlation coefficient As the estimatesimply that most of the variation in certaintyequivalents is driven by variation in ri thestrong positive correlation implies that the com-position effect is monotonically decreasing inthe deductible level As the low-deductible op-tion becomes more favorable more peoplechoose it with the most risky individuals choos-ing it first The effect of the deductible level onthe composition effect is dramatically differentwhen the correlation between risk and risk aver-sion is negative When this is the case theobserved relationship between the deductiblelevel and the composition effect is mostly re-versed This is because the effect of risk aver-sion dominates that of adverse selection due toits higher variance

IV Concluding Remarks

The paper makes two separate contributionsFirst from a methodological standpoint we layout a conceptual framework through which one

can formulate and estimate a demand system forindividually customized contracts The key datarequirements for this approach are contractchoices individual choice sets and ex post riskrealizations Since such data may be available inmany other contexts the methodological frame-work may be useful to uncover structural pa-rameters in such settings As an example onecould consider annuity data and use guaranteeperiod choices and mortality data to identifyheterogeneity in risk (mortality) and in prefer-ences for wealth after death (Einav Finkelsteinand Paul Schrimpf 2006) Similarly one couldconsider loan data and use down-paymentchoices and default data to identify heterogene-ity in risk (default) and in liquidity (Will Ad-ams Einav and Jonathan D Levin 2006)Second from an economic standpoint we pro-vide a new set of estimates for the degree andheterogeneity of (absolute) risk aversion and itsrelationship with risk We discuss these below

While our estimates of risk aversion help topredict other related insurance decisions it isnatural to ask to what extent these parametersare relevant in other contexts This is essentiallyan empirical question which can be answeredonly by estimating risk aversion parameters fora variety of bet sizes and in a variety of con-texts Since isolating risk preferences in manycontexts is hard such exercises are rare leavingus with no definite answer for the scope ofmarkets for which our estimates may be rele-vant On one hand Rabin (2000) and Rabin andThaler (2001) argue that different decisions inlife are taken in different contexts and thereforemay be subject to different parameters in theutility function On the other hand classicaltheory suggests that each individual has onevalue function over her lifetime wealth so allrisky decisions take into account the same valuefunction and are therefore subject to the samerisk preferences Our view is somewhere inbetween We are more comfortable with extrap-olation of our risk aversion estimates to setupsthat are closer to the auto insurance marketcontext in which these parameters are esti-mated To assess ldquoclosenessrdquo it is important toconsider various factors over which contextsmay differ Such factors may include bet size aswell as informational and behavioral effectssuch as default options framing and rarity of theevents Since we use claim data to identify risktype we leave everything else to be interpreted as

779VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

FIGURE 5 COUNTERFACTUALSmdashSELECTION

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) Parallel to Figure 4 we breakdown the effects on profits to the share of consumers who choose a low deductible (bottom panel) and to the expected riskof this group (top panel) This is presented by the thick solid line for the estimates of the benchmark model As in Figure 4 wealso present these effects for three additional cases when the correlation between risk and risk aversion is negative (thin solidline) when there is no heterogeneity in risk aversion (dot-dashed line) and when there is no heterogeneity in risk (dashedline) The dotted vertical lines represent the observed level of d (638) for which the share of low deductible is 16 percentand their expected risk is 0264 This may be compared with the corresponding figures in Table 2A of 178 and 0309respectively Note however that the figure presents estimated quantities for the average individual in the data while Table2A presents the average quantities in the data so one should not expect the numbers to fit perfectly

780 THE AMERICAN ECONOMIC REVIEW JUNE 2007

risk aversion As an example overconfidence willbe captured by a lower level of estimated riskaversion so if overconfidence is more importantin auto insurance than in health insurance whenextrapolated to health insurance individuals maybehave as if they are more risk averse than weestimate them to be

With this caveat in mind let us discuss ourfour main findings First we find large hetero-geneity in risk preferences across individualsThis heterogeneity is important in various con-texts (a) it cautions against using pure cross-sectional variation to test expected utilitytheory (b) it may create strong selection ofparticipants into particular markets this maymake participants in voluntary markets quitedifferent in their risk preferences from those inmarkets in which participation is either manda-tory or driven by other factors and (c) it maymake the marginal individual quite differentfrom the average one Models in macroeconom-ics and finance which often use a representativeindividual framework may not be able to cap-ture and account for such differences

Our second set of findings concerns the wayrisk aversion relates to observable characteris-tics Our finding that women are more riskaverse than men has been documented in othersettings The finding that risk preferences ex-hibit a U-shaped pattern over the life cycle maybe interesting to explore further in other con-texts Other findings suggest that the estimatedcoefficient of risk aversion increases with ob-servables that are related to income and wealthAs we agree with the widely held belief of thedecreasing absolute risk aversion property ourpreferred interpretation for this finding is thatwealth and income may be endogenous gener-ating the estimated cross-sectional relationshipWhile lower risk aversion may be associatedwith higher propensity to become an entrepre-neur and thereby have higher wealth it mayalso be associated with lower propensity to saveor invest in education affecting wealth the otherway Therefore one important message of thisfinding is that accounting for heterogeneity inpreferences may be important as representativeconsumer models may provide misleading in-terpretations for otherwise natural results

The last two sets of findings concern therelationship between risk and preferences Sincerisk is particular to the context in which it ismeasured these findings may be sensitive to the

market context and may change once risk takesother forms Even within the auto insurancemarket it is important how risk is measuredMoreover risk in the auto insurance marketmay be conceptually different from risk in othermarkets In many markets risk is independentacross individuals In auto insurance howevermuch of the risk depends on coordinationamong drivers and therefore may be more re-lated to relative not absolute characteristicsFor this reason our finding of positive correla-tion between risk and risk aversion can coexistwith findings of negative correlation in othercontexts (Israel 2005 Finkelstein and McGarry2006) The positive correlation we find is alsoconsistent with the fact that the bivariate probittest in our data provides evidence for adverseselection (Cohen 2005) while similar reduced-form tests in other contexts do not

Finally we find that unobserved heterogene-ity in risk preferences is more important thanheterogeneity in risk This may be driven by thecasual evidence that insurance companies exertmuch effort and resources in collecting con-sumer data which are informative about riskclassification but not about preferences52 Weillustrate the empirical importance of our find-ings for the analysis of optimal contracts in autoinsurance The presence of more than one di-mension of unobserved heterogeneity may dra-matically change the nature of these contractsTheory is still not fully developed for suchmultidimensional screening problems as it typ-ically requires a small number of types (Lands-berger and Meilijson 1999) restricts the twodimensions to be independent of each other(Jean-Charles Rochet and Lars A Stole 2002)or assumes that the number of instrumentsavailable to the monopolist is not smaller thanthe dimension of unobserved heterogeneity(Steven Matthews and John Moore 1987 Rich-ard Arnott and Joseph E Stiglitz 1988)53 MarkArmstrong (1999) may be the closest theoreticalwork to the framework suggested here It cannot

52 This choice of data collection efforts may be justifiedif it is easier for such firms to price discriminate based onrisk but harder to price discriminate based on preferencesA common belief is that without cost-based (ie risk-based) justification for prices price discrimination may leadto consumer backlash

53 See also Smart (2000) Villeneuve (2003) and Jullienet al (2007) for related theoretical results

781VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

be directly applied however as it uses simpli-fying linearity assumptions which would behard to impose in the current context Our re-sults indicate that many applications can benefitfrom extending the theory to include the more

general case such as the one analyzed hereSuch a theory may also serve as a guide forusing supply-side restrictions in similar con-texts Our counterfactual analysis is a very pre-liminary start in this direction

APPENDIX A DESCRIPTION OF THE GIBBS SAMPLER

In this appendix we describe the setup of the Gibbs sampler that we use to estimate the modelOne of the main advantages of the Gibbs sampler is its ability to allow for data augmentation oflatent variables (Tanner and Wong 1987) In our context this amounts to augmenting the individual-specific risk aversion and risk type namely i rii1

n as additional parametersWe can write the model as follows

(22) ln i xi i

(23) ln ri xi vi

(24) choicei 1 if ri ri ()0 if ri ri ()

(25) claimsi Poissoni ti

(26) i

vi

iid

N00

2 r

r r2

Let

2 r

r r2 X

x1

xn

0

0 x1

xn

y r and ui i

vi

The set of parameters for which we want to have a posterior distribution is given by uii1

n The prior specifies that are independent of uii1n have a conventional

diffuse prior We adopt a hierarchical prior for uii1N

(27) uii 1n

iid

N0

(28) 1 Wishart2a Q

782 THE AMERICAN ECONOMIC REVIEW JUNE 2007

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 34: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

who no longer buy higher coverage with thehigher profits made from the inframarginal indi-viduals who still elect to choose higher coverageFigure 2 also helps in illustrating the effect ofadverse selection and the importance of the cor-relation coefficient As the menu shifts to the rightthe positive correlation implies that the marginalindividuals have higher risk than the average Thismeans that ldquolosingrdquo them (namely havingthem buy less coverage) is not as costly forthe insurance company as such individualsare on average more adversely selected Anegative correlation for example would havemade these marginal individuals more valuablethereby decreasing the incentive to increase pricesor deductibles from the current levels

Figure 4 presents the effects of pricing onprofits by varying the low-deductible levelkeeping the premium charged for it fixed at theobserved price (of p 191) It implies that the

current low deductible benefit of 638 NIS re-sults in additional annual profits of about 368NIS per customer This is about 037 percent oftotal operating profits per customer which isabout 1000 NIS Note however that after sub-tracting the administrative and claim-handlingcosts associated with each customer and claimthe relative magnitude of this effect will behigher Note also that the estimates imply thatthe current low-deductible level is suboptimalBy setting a smaller low-deductible benefit ofd 355 NIS (ie increasing the current lowdeductible by 283 NIS) additional profits canbe increased to 659 NIS51 Of course the find-

51 There is no reason of course to limit the choice of thecompany to only one additional deductible level Moredegrees of freedom in choosing the pricing menu will leadto higher profits

FIGURE 4 COUNTERFACTUALSmdashPROFITS

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) We plot the additional profits fromoffering a low deductible as a function of the attractiveness of a low deductible d dh dl fixing its price at the observedlevel The thick solid line presents the counterfactual profits as implied by the estimated benchmark model The other threecurves illustrate how the profits change in response to changes in the assumptions when the correlation between riskand risk aversion is negative (thin solid line) when there is no heterogeneity in risk aversion (dot-dashed line) and whenthere is no heterogeneity in risk (dashed line) The maxima (argmax) of the four curves respectively are 659 (355)714 (583) 0 and 704 (500) The dotted vertical line represents the observed level of d (638) which implies that theadditional profits from the observed low deductible are 368 NIS per policy

778 THE AMERICAN ECONOMIC REVIEW JUNE 2007

ing that the current pricing is suboptimal mayalso be due to a limitation of the model

Consistent with the intuition discussed aboveFigure 4 also shows that the incentive to in-crease prices (or lower the deductible) is higherwith positive correlation When the correlationbetween risk and risk aversion is negative theoptimal low deductible is lower It also showsthat ignoring either dimension of unobservedheterogeneity has an important effect on pric-ing decisions However while ignoring het-erogeneity in risk does not change thequalitative pattern by much ignoring hetero-geneity in risk aversion completely changesthe pricing analysis In fact given the esti-mated coefficients and the observed prices ifindividuals vary only in risk offering a lowdeductible does not increase profits due to ad-verse selection

Figure 5 breaks down the effect on profits bypresenting the pricing effect on the demand forlow deductible D(d p) and on the compo-sition effect (d p) The former is simplygenerated by the distribution of certainty equiv-alents implied by the joint distribution of i andri (Landsberger and Meilijson 1999) It is S-shaped due to the assumption of lognormal dis-tribution The shape of the composition effect isdriven by the relative variance of i and ri andby the correlation coefficient As the estimatesimply that most of the variation in certaintyequivalents is driven by variation in ri thestrong positive correlation implies that the com-position effect is monotonically decreasing inthe deductible level As the low-deductible op-tion becomes more favorable more peoplechoose it with the most risky individuals choos-ing it first The effect of the deductible level onthe composition effect is dramatically differentwhen the correlation between risk and risk aver-sion is negative When this is the case theobserved relationship between the deductiblelevel and the composition effect is mostly re-versed This is because the effect of risk aver-sion dominates that of adverse selection due toits higher variance

IV Concluding Remarks

The paper makes two separate contributionsFirst from a methodological standpoint we layout a conceptual framework through which one

can formulate and estimate a demand system forindividually customized contracts The key datarequirements for this approach are contractchoices individual choice sets and ex post riskrealizations Since such data may be available inmany other contexts the methodological frame-work may be useful to uncover structural pa-rameters in such settings As an example onecould consider annuity data and use guaranteeperiod choices and mortality data to identifyheterogeneity in risk (mortality) and in prefer-ences for wealth after death (Einav Finkelsteinand Paul Schrimpf 2006) Similarly one couldconsider loan data and use down-paymentchoices and default data to identify heterogene-ity in risk (default) and in liquidity (Will Ad-ams Einav and Jonathan D Levin 2006)Second from an economic standpoint we pro-vide a new set of estimates for the degree andheterogeneity of (absolute) risk aversion and itsrelationship with risk We discuss these below

While our estimates of risk aversion help topredict other related insurance decisions it isnatural to ask to what extent these parametersare relevant in other contexts This is essentiallyan empirical question which can be answeredonly by estimating risk aversion parameters fora variety of bet sizes and in a variety of con-texts Since isolating risk preferences in manycontexts is hard such exercises are rare leavingus with no definite answer for the scope ofmarkets for which our estimates may be rele-vant On one hand Rabin (2000) and Rabin andThaler (2001) argue that different decisions inlife are taken in different contexts and thereforemay be subject to different parameters in theutility function On the other hand classicaltheory suggests that each individual has onevalue function over her lifetime wealth so allrisky decisions take into account the same valuefunction and are therefore subject to the samerisk preferences Our view is somewhere inbetween We are more comfortable with extrap-olation of our risk aversion estimates to setupsthat are closer to the auto insurance marketcontext in which these parameters are esti-mated To assess ldquoclosenessrdquo it is important toconsider various factors over which contextsmay differ Such factors may include bet size aswell as informational and behavioral effectssuch as default options framing and rarity of theevents Since we use claim data to identify risktype we leave everything else to be interpreted as

779VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

FIGURE 5 COUNTERFACTUALSmdashSELECTION

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) Parallel to Figure 4 we breakdown the effects on profits to the share of consumers who choose a low deductible (bottom panel) and to the expected riskof this group (top panel) This is presented by the thick solid line for the estimates of the benchmark model As in Figure 4 wealso present these effects for three additional cases when the correlation between risk and risk aversion is negative (thin solidline) when there is no heterogeneity in risk aversion (dot-dashed line) and when there is no heterogeneity in risk (dashedline) The dotted vertical lines represent the observed level of d (638) for which the share of low deductible is 16 percentand their expected risk is 0264 This may be compared with the corresponding figures in Table 2A of 178 and 0309respectively Note however that the figure presents estimated quantities for the average individual in the data while Table2A presents the average quantities in the data so one should not expect the numbers to fit perfectly

780 THE AMERICAN ECONOMIC REVIEW JUNE 2007

risk aversion As an example overconfidence willbe captured by a lower level of estimated riskaversion so if overconfidence is more importantin auto insurance than in health insurance whenextrapolated to health insurance individuals maybehave as if they are more risk averse than weestimate them to be

With this caveat in mind let us discuss ourfour main findings First we find large hetero-geneity in risk preferences across individualsThis heterogeneity is important in various con-texts (a) it cautions against using pure cross-sectional variation to test expected utilitytheory (b) it may create strong selection ofparticipants into particular markets this maymake participants in voluntary markets quitedifferent in their risk preferences from those inmarkets in which participation is either manda-tory or driven by other factors and (c) it maymake the marginal individual quite differentfrom the average one Models in macroeconom-ics and finance which often use a representativeindividual framework may not be able to cap-ture and account for such differences

Our second set of findings concerns the wayrisk aversion relates to observable characteris-tics Our finding that women are more riskaverse than men has been documented in othersettings The finding that risk preferences ex-hibit a U-shaped pattern over the life cycle maybe interesting to explore further in other con-texts Other findings suggest that the estimatedcoefficient of risk aversion increases with ob-servables that are related to income and wealthAs we agree with the widely held belief of thedecreasing absolute risk aversion property ourpreferred interpretation for this finding is thatwealth and income may be endogenous gener-ating the estimated cross-sectional relationshipWhile lower risk aversion may be associatedwith higher propensity to become an entrepre-neur and thereby have higher wealth it mayalso be associated with lower propensity to saveor invest in education affecting wealth the otherway Therefore one important message of thisfinding is that accounting for heterogeneity inpreferences may be important as representativeconsumer models may provide misleading in-terpretations for otherwise natural results

The last two sets of findings concern therelationship between risk and preferences Sincerisk is particular to the context in which it ismeasured these findings may be sensitive to the

market context and may change once risk takesother forms Even within the auto insurancemarket it is important how risk is measuredMoreover risk in the auto insurance marketmay be conceptually different from risk in othermarkets In many markets risk is independentacross individuals In auto insurance howevermuch of the risk depends on coordinationamong drivers and therefore may be more re-lated to relative not absolute characteristicsFor this reason our finding of positive correla-tion between risk and risk aversion can coexistwith findings of negative correlation in othercontexts (Israel 2005 Finkelstein and McGarry2006) The positive correlation we find is alsoconsistent with the fact that the bivariate probittest in our data provides evidence for adverseselection (Cohen 2005) while similar reduced-form tests in other contexts do not

Finally we find that unobserved heterogene-ity in risk preferences is more important thanheterogeneity in risk This may be driven by thecasual evidence that insurance companies exertmuch effort and resources in collecting con-sumer data which are informative about riskclassification but not about preferences52 Weillustrate the empirical importance of our find-ings for the analysis of optimal contracts in autoinsurance The presence of more than one di-mension of unobserved heterogeneity may dra-matically change the nature of these contractsTheory is still not fully developed for suchmultidimensional screening problems as it typ-ically requires a small number of types (Lands-berger and Meilijson 1999) restricts the twodimensions to be independent of each other(Jean-Charles Rochet and Lars A Stole 2002)or assumes that the number of instrumentsavailable to the monopolist is not smaller thanthe dimension of unobserved heterogeneity(Steven Matthews and John Moore 1987 Rich-ard Arnott and Joseph E Stiglitz 1988)53 MarkArmstrong (1999) may be the closest theoreticalwork to the framework suggested here It cannot

52 This choice of data collection efforts may be justifiedif it is easier for such firms to price discriminate based onrisk but harder to price discriminate based on preferencesA common belief is that without cost-based (ie risk-based) justification for prices price discrimination may leadto consumer backlash

53 See also Smart (2000) Villeneuve (2003) and Jullienet al (2007) for related theoretical results

781VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

be directly applied however as it uses simpli-fying linearity assumptions which would behard to impose in the current context Our re-sults indicate that many applications can benefitfrom extending the theory to include the more

general case such as the one analyzed hereSuch a theory may also serve as a guide forusing supply-side restrictions in similar con-texts Our counterfactual analysis is a very pre-liminary start in this direction

APPENDIX A DESCRIPTION OF THE GIBBS SAMPLER

In this appendix we describe the setup of the Gibbs sampler that we use to estimate the modelOne of the main advantages of the Gibbs sampler is its ability to allow for data augmentation oflatent variables (Tanner and Wong 1987) In our context this amounts to augmenting the individual-specific risk aversion and risk type namely i rii1

n as additional parametersWe can write the model as follows

(22) ln i xi i

(23) ln ri xi vi

(24) choicei 1 if ri ri ()0 if ri ri ()

(25) claimsi Poissoni ti

(26) i

vi

iid

N00

2 r

r r2

Let

2 r

r r2 X

x1

xn

0

0 x1

xn

y r and ui i

vi

The set of parameters for which we want to have a posterior distribution is given by uii1

n The prior specifies that are independent of uii1n have a conventional

diffuse prior We adopt a hierarchical prior for uii1N

(27) uii 1n

iid

N0

(28) 1 Wishart2a Q

782 THE AMERICAN ECONOMIC REVIEW JUNE 2007

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 35: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

ing that the current pricing is suboptimal mayalso be due to a limitation of the model

Consistent with the intuition discussed aboveFigure 4 also shows that the incentive to in-crease prices (or lower the deductible) is higherwith positive correlation When the correlationbetween risk and risk aversion is negative theoptimal low deductible is lower It also showsthat ignoring either dimension of unobservedheterogeneity has an important effect on pric-ing decisions However while ignoring het-erogeneity in risk does not change thequalitative pattern by much ignoring hetero-geneity in risk aversion completely changesthe pricing analysis In fact given the esti-mated coefficients and the observed prices ifindividuals vary only in risk offering a lowdeductible does not increase profits due to ad-verse selection

Figure 5 breaks down the effect on profits bypresenting the pricing effect on the demand forlow deductible D(d p) and on the compo-sition effect (d p) The former is simplygenerated by the distribution of certainty equiv-alents implied by the joint distribution of i andri (Landsberger and Meilijson 1999) It is S-shaped due to the assumption of lognormal dis-tribution The shape of the composition effect isdriven by the relative variance of i and ri andby the correlation coefficient As the estimatesimply that most of the variation in certaintyequivalents is driven by variation in ri thestrong positive correlation implies that the com-position effect is monotonically decreasing inthe deductible level As the low-deductible op-tion becomes more favorable more peoplechoose it with the most risky individuals choos-ing it first The effect of the deductible level onthe composition effect is dramatically differentwhen the correlation between risk and risk aver-sion is negative When this is the case theobserved relationship between the deductiblelevel and the composition effect is mostly re-versed This is because the effect of risk aver-sion dominates that of adverse selection due toits higher variance

IV Concluding Remarks

The paper makes two separate contributionsFirst from a methodological standpoint we layout a conceptual framework through which one

can formulate and estimate a demand system forindividually customized contracts The key datarequirements for this approach are contractchoices individual choice sets and ex post riskrealizations Since such data may be available inmany other contexts the methodological frame-work may be useful to uncover structural pa-rameters in such settings As an example onecould consider annuity data and use guaranteeperiod choices and mortality data to identifyheterogeneity in risk (mortality) and in prefer-ences for wealth after death (Einav Finkelsteinand Paul Schrimpf 2006) Similarly one couldconsider loan data and use down-paymentchoices and default data to identify heterogene-ity in risk (default) and in liquidity (Will Ad-ams Einav and Jonathan D Levin 2006)Second from an economic standpoint we pro-vide a new set of estimates for the degree andheterogeneity of (absolute) risk aversion and itsrelationship with risk We discuss these below

While our estimates of risk aversion help topredict other related insurance decisions it isnatural to ask to what extent these parametersare relevant in other contexts This is essentiallyan empirical question which can be answeredonly by estimating risk aversion parameters fora variety of bet sizes and in a variety of con-texts Since isolating risk preferences in manycontexts is hard such exercises are rare leavingus with no definite answer for the scope ofmarkets for which our estimates may be rele-vant On one hand Rabin (2000) and Rabin andThaler (2001) argue that different decisions inlife are taken in different contexts and thereforemay be subject to different parameters in theutility function On the other hand classicaltheory suggests that each individual has onevalue function over her lifetime wealth so allrisky decisions take into account the same valuefunction and are therefore subject to the samerisk preferences Our view is somewhere inbetween We are more comfortable with extrap-olation of our risk aversion estimates to setupsthat are closer to the auto insurance marketcontext in which these parameters are esti-mated To assess ldquoclosenessrdquo it is important toconsider various factors over which contextsmay differ Such factors may include bet size aswell as informational and behavioral effectssuch as default options framing and rarity of theevents Since we use claim data to identify risktype we leave everything else to be interpreted as

779VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

FIGURE 5 COUNTERFACTUALSmdashSELECTION

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) Parallel to Figure 4 we breakdown the effects on profits to the share of consumers who choose a low deductible (bottom panel) and to the expected riskof this group (top panel) This is presented by the thick solid line for the estimates of the benchmark model As in Figure 4 wealso present these effects for three additional cases when the correlation between risk and risk aversion is negative (thin solidline) when there is no heterogeneity in risk aversion (dot-dashed line) and when there is no heterogeneity in risk (dashedline) The dotted vertical lines represent the observed level of d (638) for which the share of low deductible is 16 percentand their expected risk is 0264 This may be compared with the corresponding figures in Table 2A of 178 and 0309respectively Note however that the figure presents estimated quantities for the average individual in the data while Table2A presents the average quantities in the data so one should not expect the numbers to fit perfectly

780 THE AMERICAN ECONOMIC REVIEW JUNE 2007

risk aversion As an example overconfidence willbe captured by a lower level of estimated riskaversion so if overconfidence is more importantin auto insurance than in health insurance whenextrapolated to health insurance individuals maybehave as if they are more risk averse than weestimate them to be

With this caveat in mind let us discuss ourfour main findings First we find large hetero-geneity in risk preferences across individualsThis heterogeneity is important in various con-texts (a) it cautions against using pure cross-sectional variation to test expected utilitytheory (b) it may create strong selection ofparticipants into particular markets this maymake participants in voluntary markets quitedifferent in their risk preferences from those inmarkets in which participation is either manda-tory or driven by other factors and (c) it maymake the marginal individual quite differentfrom the average one Models in macroeconom-ics and finance which often use a representativeindividual framework may not be able to cap-ture and account for such differences

Our second set of findings concerns the wayrisk aversion relates to observable characteris-tics Our finding that women are more riskaverse than men has been documented in othersettings The finding that risk preferences ex-hibit a U-shaped pattern over the life cycle maybe interesting to explore further in other con-texts Other findings suggest that the estimatedcoefficient of risk aversion increases with ob-servables that are related to income and wealthAs we agree with the widely held belief of thedecreasing absolute risk aversion property ourpreferred interpretation for this finding is thatwealth and income may be endogenous gener-ating the estimated cross-sectional relationshipWhile lower risk aversion may be associatedwith higher propensity to become an entrepre-neur and thereby have higher wealth it mayalso be associated with lower propensity to saveor invest in education affecting wealth the otherway Therefore one important message of thisfinding is that accounting for heterogeneity inpreferences may be important as representativeconsumer models may provide misleading in-terpretations for otherwise natural results

The last two sets of findings concern therelationship between risk and preferences Sincerisk is particular to the context in which it ismeasured these findings may be sensitive to the

market context and may change once risk takesother forms Even within the auto insurancemarket it is important how risk is measuredMoreover risk in the auto insurance marketmay be conceptually different from risk in othermarkets In many markets risk is independentacross individuals In auto insurance howevermuch of the risk depends on coordinationamong drivers and therefore may be more re-lated to relative not absolute characteristicsFor this reason our finding of positive correla-tion between risk and risk aversion can coexistwith findings of negative correlation in othercontexts (Israel 2005 Finkelstein and McGarry2006) The positive correlation we find is alsoconsistent with the fact that the bivariate probittest in our data provides evidence for adverseselection (Cohen 2005) while similar reduced-form tests in other contexts do not

Finally we find that unobserved heterogene-ity in risk preferences is more important thanheterogeneity in risk This may be driven by thecasual evidence that insurance companies exertmuch effort and resources in collecting con-sumer data which are informative about riskclassification but not about preferences52 Weillustrate the empirical importance of our find-ings for the analysis of optimal contracts in autoinsurance The presence of more than one di-mension of unobserved heterogeneity may dra-matically change the nature of these contractsTheory is still not fully developed for suchmultidimensional screening problems as it typ-ically requires a small number of types (Lands-berger and Meilijson 1999) restricts the twodimensions to be independent of each other(Jean-Charles Rochet and Lars A Stole 2002)or assumes that the number of instrumentsavailable to the monopolist is not smaller thanthe dimension of unobserved heterogeneity(Steven Matthews and John Moore 1987 Rich-ard Arnott and Joseph E Stiglitz 1988)53 MarkArmstrong (1999) may be the closest theoreticalwork to the framework suggested here It cannot

52 This choice of data collection efforts may be justifiedif it is easier for such firms to price discriminate based onrisk but harder to price discriminate based on preferencesA common belief is that without cost-based (ie risk-based) justification for prices price discrimination may leadto consumer backlash

53 See also Smart (2000) Villeneuve (2003) and Jullienet al (2007) for related theoretical results

781VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

be directly applied however as it uses simpli-fying linearity assumptions which would behard to impose in the current context Our re-sults indicate that many applications can benefitfrom extending the theory to include the more

general case such as the one analyzed hereSuch a theory may also serve as a guide forusing supply-side restrictions in similar con-texts Our counterfactual analysis is a very pre-liminary start in this direction

APPENDIX A DESCRIPTION OF THE GIBBS SAMPLER

In this appendix we describe the setup of the Gibbs sampler that we use to estimate the modelOne of the main advantages of the Gibbs sampler is its ability to allow for data augmentation oflatent variables (Tanner and Wong 1987) In our context this amounts to augmenting the individual-specific risk aversion and risk type namely i rii1

n as additional parametersWe can write the model as follows

(22) ln i xi i

(23) ln ri xi vi

(24) choicei 1 if ri ri ()0 if ri ri ()

(25) claimsi Poissoni ti

(26) i

vi

iid

N00

2 r

r r2

Let

2 r

r r2 X

x1

xn

0

0 x1

xn

y r and ui i

vi

The set of parameters for which we want to have a posterior distribution is given by uii1

n The prior specifies that are independent of uii1n have a conventional

diffuse prior We adopt a hierarchical prior for uii1N

(27) uii 1n

iid

N0

(28) 1 Wishart2a Q

782 THE AMERICAN ECONOMIC REVIEW JUNE 2007

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 36: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

FIGURE 5 COUNTERFACTUALSmdashSELECTION

Notes This figure illustrates the results from the counterfactual exercise (see Section IIIF) Parallel to Figure 4 we breakdown the effects on profits to the share of consumers who choose a low deductible (bottom panel) and to the expected riskof this group (top panel) This is presented by the thick solid line for the estimates of the benchmark model As in Figure 4 wealso present these effects for three additional cases when the correlation between risk and risk aversion is negative (thin solidline) when there is no heterogeneity in risk aversion (dot-dashed line) and when there is no heterogeneity in risk (dashedline) The dotted vertical lines represent the observed level of d (638) for which the share of low deductible is 16 percentand their expected risk is 0264 This may be compared with the corresponding figures in Table 2A of 178 and 0309respectively Note however that the figure presents estimated quantities for the average individual in the data while Table2A presents the average quantities in the data so one should not expect the numbers to fit perfectly

780 THE AMERICAN ECONOMIC REVIEW JUNE 2007

risk aversion As an example overconfidence willbe captured by a lower level of estimated riskaversion so if overconfidence is more importantin auto insurance than in health insurance whenextrapolated to health insurance individuals maybehave as if they are more risk averse than weestimate them to be

With this caveat in mind let us discuss ourfour main findings First we find large hetero-geneity in risk preferences across individualsThis heterogeneity is important in various con-texts (a) it cautions against using pure cross-sectional variation to test expected utilitytheory (b) it may create strong selection ofparticipants into particular markets this maymake participants in voluntary markets quitedifferent in their risk preferences from those inmarkets in which participation is either manda-tory or driven by other factors and (c) it maymake the marginal individual quite differentfrom the average one Models in macroeconom-ics and finance which often use a representativeindividual framework may not be able to cap-ture and account for such differences

Our second set of findings concerns the wayrisk aversion relates to observable characteris-tics Our finding that women are more riskaverse than men has been documented in othersettings The finding that risk preferences ex-hibit a U-shaped pattern over the life cycle maybe interesting to explore further in other con-texts Other findings suggest that the estimatedcoefficient of risk aversion increases with ob-servables that are related to income and wealthAs we agree with the widely held belief of thedecreasing absolute risk aversion property ourpreferred interpretation for this finding is thatwealth and income may be endogenous gener-ating the estimated cross-sectional relationshipWhile lower risk aversion may be associatedwith higher propensity to become an entrepre-neur and thereby have higher wealth it mayalso be associated with lower propensity to saveor invest in education affecting wealth the otherway Therefore one important message of thisfinding is that accounting for heterogeneity inpreferences may be important as representativeconsumer models may provide misleading in-terpretations for otherwise natural results

The last two sets of findings concern therelationship between risk and preferences Sincerisk is particular to the context in which it ismeasured these findings may be sensitive to the

market context and may change once risk takesother forms Even within the auto insurancemarket it is important how risk is measuredMoreover risk in the auto insurance marketmay be conceptually different from risk in othermarkets In many markets risk is independentacross individuals In auto insurance howevermuch of the risk depends on coordinationamong drivers and therefore may be more re-lated to relative not absolute characteristicsFor this reason our finding of positive correla-tion between risk and risk aversion can coexistwith findings of negative correlation in othercontexts (Israel 2005 Finkelstein and McGarry2006) The positive correlation we find is alsoconsistent with the fact that the bivariate probittest in our data provides evidence for adverseselection (Cohen 2005) while similar reduced-form tests in other contexts do not

Finally we find that unobserved heterogene-ity in risk preferences is more important thanheterogeneity in risk This may be driven by thecasual evidence that insurance companies exertmuch effort and resources in collecting con-sumer data which are informative about riskclassification but not about preferences52 Weillustrate the empirical importance of our find-ings for the analysis of optimal contracts in autoinsurance The presence of more than one di-mension of unobserved heterogeneity may dra-matically change the nature of these contractsTheory is still not fully developed for suchmultidimensional screening problems as it typ-ically requires a small number of types (Lands-berger and Meilijson 1999) restricts the twodimensions to be independent of each other(Jean-Charles Rochet and Lars A Stole 2002)or assumes that the number of instrumentsavailable to the monopolist is not smaller thanthe dimension of unobserved heterogeneity(Steven Matthews and John Moore 1987 Rich-ard Arnott and Joseph E Stiglitz 1988)53 MarkArmstrong (1999) may be the closest theoreticalwork to the framework suggested here It cannot

52 This choice of data collection efforts may be justifiedif it is easier for such firms to price discriminate based onrisk but harder to price discriminate based on preferencesA common belief is that without cost-based (ie risk-based) justification for prices price discrimination may leadto consumer backlash

53 See also Smart (2000) Villeneuve (2003) and Jullienet al (2007) for related theoretical results

781VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

be directly applied however as it uses simpli-fying linearity assumptions which would behard to impose in the current context Our re-sults indicate that many applications can benefitfrom extending the theory to include the more

general case such as the one analyzed hereSuch a theory may also serve as a guide forusing supply-side restrictions in similar con-texts Our counterfactual analysis is a very pre-liminary start in this direction

APPENDIX A DESCRIPTION OF THE GIBBS SAMPLER

In this appendix we describe the setup of the Gibbs sampler that we use to estimate the modelOne of the main advantages of the Gibbs sampler is its ability to allow for data augmentation oflatent variables (Tanner and Wong 1987) In our context this amounts to augmenting the individual-specific risk aversion and risk type namely i rii1

n as additional parametersWe can write the model as follows

(22) ln i xi i

(23) ln ri xi vi

(24) choicei 1 if ri ri ()0 if ri ri ()

(25) claimsi Poissoni ti

(26) i

vi

iid

N00

2 r

r r2

Let

2 r

r r2 X

x1

xn

0

0 x1

xn

y r and ui i

vi

The set of parameters for which we want to have a posterior distribution is given by uii1

n The prior specifies that are independent of uii1n have a conventional

diffuse prior We adopt a hierarchical prior for uii1N

(27) uii 1n

iid

N0

(28) 1 Wishart2a Q

782 THE AMERICAN ECONOMIC REVIEW JUNE 2007

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 37: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

risk aversion As an example overconfidence willbe captured by a lower level of estimated riskaversion so if overconfidence is more importantin auto insurance than in health insurance whenextrapolated to health insurance individuals maybehave as if they are more risk averse than weestimate them to be

With this caveat in mind let us discuss ourfour main findings First we find large hetero-geneity in risk preferences across individualsThis heterogeneity is important in various con-texts (a) it cautions against using pure cross-sectional variation to test expected utilitytheory (b) it may create strong selection ofparticipants into particular markets this maymake participants in voluntary markets quitedifferent in their risk preferences from those inmarkets in which participation is either manda-tory or driven by other factors and (c) it maymake the marginal individual quite differentfrom the average one Models in macroeconom-ics and finance which often use a representativeindividual framework may not be able to cap-ture and account for such differences

Our second set of findings concerns the wayrisk aversion relates to observable characteris-tics Our finding that women are more riskaverse than men has been documented in othersettings The finding that risk preferences ex-hibit a U-shaped pattern over the life cycle maybe interesting to explore further in other con-texts Other findings suggest that the estimatedcoefficient of risk aversion increases with ob-servables that are related to income and wealthAs we agree with the widely held belief of thedecreasing absolute risk aversion property ourpreferred interpretation for this finding is thatwealth and income may be endogenous gener-ating the estimated cross-sectional relationshipWhile lower risk aversion may be associatedwith higher propensity to become an entrepre-neur and thereby have higher wealth it mayalso be associated with lower propensity to saveor invest in education affecting wealth the otherway Therefore one important message of thisfinding is that accounting for heterogeneity inpreferences may be important as representativeconsumer models may provide misleading in-terpretations for otherwise natural results

The last two sets of findings concern therelationship between risk and preferences Sincerisk is particular to the context in which it ismeasured these findings may be sensitive to the

market context and may change once risk takesother forms Even within the auto insurancemarket it is important how risk is measuredMoreover risk in the auto insurance marketmay be conceptually different from risk in othermarkets In many markets risk is independentacross individuals In auto insurance howevermuch of the risk depends on coordinationamong drivers and therefore may be more re-lated to relative not absolute characteristicsFor this reason our finding of positive correla-tion between risk and risk aversion can coexistwith findings of negative correlation in othercontexts (Israel 2005 Finkelstein and McGarry2006) The positive correlation we find is alsoconsistent with the fact that the bivariate probittest in our data provides evidence for adverseselection (Cohen 2005) while similar reduced-form tests in other contexts do not

Finally we find that unobserved heterogene-ity in risk preferences is more important thanheterogeneity in risk This may be driven by thecasual evidence that insurance companies exertmuch effort and resources in collecting con-sumer data which are informative about riskclassification but not about preferences52 Weillustrate the empirical importance of our find-ings for the analysis of optimal contracts in autoinsurance The presence of more than one di-mension of unobserved heterogeneity may dra-matically change the nature of these contractsTheory is still not fully developed for suchmultidimensional screening problems as it typ-ically requires a small number of types (Lands-berger and Meilijson 1999) restricts the twodimensions to be independent of each other(Jean-Charles Rochet and Lars A Stole 2002)or assumes that the number of instrumentsavailable to the monopolist is not smaller thanthe dimension of unobserved heterogeneity(Steven Matthews and John Moore 1987 Rich-ard Arnott and Joseph E Stiglitz 1988)53 MarkArmstrong (1999) may be the closest theoreticalwork to the framework suggested here It cannot

52 This choice of data collection efforts may be justifiedif it is easier for such firms to price discriminate based onrisk but harder to price discriminate based on preferencesA common belief is that without cost-based (ie risk-based) justification for prices price discrimination may leadto consumer backlash

53 See also Smart (2000) Villeneuve (2003) and Jullienet al (2007) for related theoretical results

781VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

be directly applied however as it uses simpli-fying linearity assumptions which would behard to impose in the current context Our re-sults indicate that many applications can benefitfrom extending the theory to include the more

general case such as the one analyzed hereSuch a theory may also serve as a guide forusing supply-side restrictions in similar con-texts Our counterfactual analysis is a very pre-liminary start in this direction

APPENDIX A DESCRIPTION OF THE GIBBS SAMPLER

In this appendix we describe the setup of the Gibbs sampler that we use to estimate the modelOne of the main advantages of the Gibbs sampler is its ability to allow for data augmentation oflatent variables (Tanner and Wong 1987) In our context this amounts to augmenting the individual-specific risk aversion and risk type namely i rii1

n as additional parametersWe can write the model as follows

(22) ln i xi i

(23) ln ri xi vi

(24) choicei 1 if ri ri ()0 if ri ri ()

(25) claimsi Poissoni ti

(26) i

vi

iid

N00

2 r

r r2

Let

2 r

r r2 X

x1

xn

0

0 x1

xn

y r and ui i

vi

The set of parameters for which we want to have a posterior distribution is given by uii1

n The prior specifies that are independent of uii1n have a conventional

diffuse prior We adopt a hierarchical prior for uii1N

(27) uii 1n

iid

N0

(28) 1 Wishart2a Q

782 THE AMERICAN ECONOMIC REVIEW JUNE 2007

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 38: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

be directly applied however as it uses simpli-fying linearity assumptions which would behard to impose in the current context Our re-sults indicate that many applications can benefitfrom extending the theory to include the more

general case such as the one analyzed hereSuch a theory may also serve as a guide forusing supply-side restrictions in similar con-texts Our counterfactual analysis is a very pre-liminary start in this direction

APPENDIX A DESCRIPTION OF THE GIBBS SAMPLER

In this appendix we describe the setup of the Gibbs sampler that we use to estimate the modelOne of the main advantages of the Gibbs sampler is its ability to allow for data augmentation oflatent variables (Tanner and Wong 1987) In our context this amounts to augmenting the individual-specific risk aversion and risk type namely i rii1

n as additional parametersWe can write the model as follows

(22) ln i xi i

(23) ln ri xi vi

(24) choicei 1 if ri ri ()0 if ri ri ()

(25) claimsi Poissoni ti

(26) i

vi

iid

N00

2 r

r r2

Let

2 r

r r2 X

x1

xn

0

0 x1

xn

y r and ui i

vi

The set of parameters for which we want to have a posterior distribution is given by uii1

n The prior specifies that are independent of uii1n have a conventional

diffuse prior We adopt a hierarchical prior for uii1N

(27) uii 1n

iid

N0

(28) 1 Wishart2a Q

782 THE AMERICAN ECONOMIC REVIEW JUNE 2007

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 39: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

so conditional on all other parameters (and on the data which have no effect in this case) we have

(29) 1 uii 1n Wishart2a n k Q1

iuiui1

and

(30) uii 1n NXX1Xy 1 XX1

For 1 we use a convenient diffuse prior ie a 0 and Q1 0The part of the Gibbs sampler that is less standard in this case involves the sampling from the

conditional distribution of the augmented parameters uii1n Each individual is independent of the

others so conditional on the other parameters it does not depend on other individualsrsquo augmenteddata Thus all we need to describe is the conditional probability of ui Note that conditional on we have i ln i xi and vi ln ri xi so we can instead focus on sampling from theposterior distribution of i and ri These posterior distributions are

(31) Prri i data ln ri xi r

(ln i xi) r2(12) if choicei I(ri ri())

0 if choicei I(ri ri())

and

(32) Pri ri data

p(i claimsi ti) ln i xi

r(ln ri xi)

2(1 2) if choicei I(ri ri())

0 if choicei I(ri ri())

where p(x claims t) xclaimsexp(xt) is proportional to the probability density function of thePoisson distribution (x ) exp[(12)((x ))2] is proportional to the normal probabilitydensity function and I is an indicator function

The posterior for ln ri is a truncated normal for which we use a simple ldquoinvert cdfrdquo sampling(Devroye 1986)54 The posterior for ln i is less standard We use a ldquoslice samplerrdquo to do so (Damienet al 1999) The basic idea is to rewrite Pr(i) b0(i)b1(i)b2(i) where b0(i) is a truncatednormal distribution and b1(i) and b2(i) are defined below We can then augment the data with twoadditional variables ui

1 and ui2 which (conditional on i) are distributed uniformly on [0 b1(i)] and

[0 b2(i)] respectively Then we can write Pr(i ui1 ui

2) b0(i)b1(i)b2(i)[I(0 ui1 b1(i))

b1(i)][I(0 ui2 b2(i))b2(i)] b0(i)I(0 ui

1 b1(i))I(0 ui2 b2(i)) Using this form we

have that b1(ln i) iclaimsi (exp(ln i))

claimsi and b2(ln i) exp(iti) exp(tiexp(ln i))Because b1 and b2 are both monotone functions conditional on ui

1 and ui2 this just means that

54 Let F(x) be the cumulative distribution function The ldquoinvert cdfrdquo sampling draws from this distribution by drawing u froma uniform distribution on [0 1] and computing F1(u) In principle one can use the sampling procedure suggested by John Geweke(1991) which avoids computing F1 and therefore is more efficient It was easier however to vectorize the algorithm using Devroye(1986) The vectorization entails enormous computational benefits when coded in Matlab

783VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 40: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

b11(ui

1) ((ln ui1)claimsi) is a lower bound for ln i (for claimsi 0) and that b2

1(ui2) ln(ln

ui2) ln ti is an upper bound for ln i Thus we can just sample i from a truncated normal

distribution after we modify the bounds according to ui1 and ui

2

Gibbs Sampler for the Learning ModelmdashIn the end of Section IIID we add to the modelincomplete information of individuals about their own types Individualsrsquo types are fixed over theirlifetime and individuals are Bayesian and update their own type by their claim experience Sinceexpected utility is linear in claim probabilities only individualsrsquo ex post mean will affect theircoverage choices Before individuals obtain their driving license they believe that their risk type isa random draw from the observed population of drivers in the data Individuals are assumed to havea prior that follows a Gamma(13 ) distribution55 where 13 and are estimated from the dataIndividualsrsquo posterior mean is then given by [(ci 13)(li (1))] where c is the number of claimshistorically filed and li is the individualrsquos driving experience (license years) Let i denote theposterior mean The assumptions imply that i[li (1)] 13 is distributed Poisson(ili) The restof the model is as before with i used instead of i to explain the coverage choice Thus to implementit within the Gibbs sampler we augment i as well The conditional distribution for rii i is as beforewith i (rather than i) affecting the truncation point The conditional distribution for ii ri is a lineartransformation of a truncated Poisson with the truncation point being the same as the one used above fori Finally the conditional distribution for ii ri is of an unknown form Fortunately however theassumptions make it very similar to the one before with the following modifications First it is nottruncated Second i provides information on i In particular it can be written as

(33) Pri i ri data

pi i li 1

13 lipi claimsi ti ln i xi

r(ln ri xi)

2(1 2)

Because the first two elements follow a Poisson process however it is proportional to p(x claimst) xclaimsiciexp(x(ti li)) making it very similar to the form of the benchmark model

APPENDIX B VARIABLE DEFINITIONS

Below we describe the variables that may not be self-explanatory

EducationmdashldquoTechnicalrdquo education refers to post-high-school education which does not result ina college degree

EmigrantmdashA dummy variable that is equal to one if the individual was not born in Israel (Car) valuemdashCurrent estimated ldquoBlue Bookrdquo value of the car Car agemdashThe number of years the car has been in use Commercial carmdashA dummy variable that is equal to one if the car is defined as a commercial

vehicle (eg pickup truck) Engine sizemdashThe volume of the engine in cubic centimeters (cc) This is a measure of size and

power For modern cars 1 unit of horsepower is roughly equal to 15 to 17 cc depending on severalother variables such as weight

License yearsmdashNumber of years since the individual obtained a driving license Good drivermdashA dummy variable that is equal to one if the individual is classified as a good driver

55 Note that the gamma assumption is similar but not identical to the lognormal distribution we use for estimation Aswill be clear below the benefit of this slight internal inconsistency is very attractive computationally for the construction ofthe Gibbs sampler

784 THE AMERICAN ECONOMIC REVIEW JUNE 2007

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 41: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

The classification is made by the company based on the other observables and suggests that theindividual is likely to be a low-risk driver We do not know the exact functional form for thisclassification One can view this as an informative nonlinear functional form of the otherobservables already in the regressions

Any drivermdashA dummy variable that is equal to one if the policy stipulates that any driver candrive the car If it does not stipulate it the car is insured only if the policyholder (and sometimeshisher spouse) drives the car

Secondary carmdashA dummy variable that is equal to one if the car is not the main car in thehousehold

Business usemdashA dummy variable that is equal to one if the policyholder uses the car for business Estimated mileagemdashPredicted annual mileage (in kilometers) by the policyholder The company

does not use this variable for pricing as it is believed to be unreliable HistorymdashThe number of years (up to the required three) prior to the starting date of the policy for

which the policyholder reports hisher past claims history Claims historymdashThe number of claims per year for the policyholder over the three (or less) years

prior to the starting date of the policy Young drivermdashA dummy variable that is equal to one if the policy covers drivers who are below

the age of 25 In such cases the policyholder has to report the details of the young driver (whichmay be the policyholder or someone else) separately

Company yearmdashYear dummies that span our five-year observation period The first-year dummyis equal to one for policies started between 1111994 and 10311995 the second-year dummy isequal to 1 for policies started between 1111995 and 10311996 and so on

REFERENCES

Abbring Jaap H Pierre-Andre Chiappori JamesHeckman and Jean Pinquet ForthcomingldquoTesting for Moral Hazard on Dynamic Insur-ance Datardquo Journal of the European EconomicAssociation (Papers and Proceedings)

Abbring Jaap H Pierre-Andre Chiappori andJean Pinquet 2003 ldquoMoral Hazard and Dy-namic Insurance Datardquo Journal of the Euro-pean Economic Association 1(4) 767ndash820

Adams Will Liran Einav and Jonathan DLevin 2006 ldquoScreening in Consumer CreditMarketsrdquo Unpublished

Agarwall Sumit John C Driscoll Xavier Gabaixand David Laibson 2006 ldquoFinancial Mistakesover the Life Cyclerdquo Unpublished

Andersen Steffen Glenn W Harrison MortenIgel Lau and E Elisabet Rutstrom 2005ldquoPreference Heterogeneity in ExperimentsComparing the Field and Labrdquo Centre forEconomic and Business Research DiscussionPaper 2005ndash03

Armstrong Mark 1999 ldquoOptimal Regulationwith Unknown Demand and Cost FunctionsrdquoJournal of Economic Theory 84(2) 196ndash215

Arnott Richard and Joseph E Stiglitz 1988ldquoRandomization with Asymmetric Informa-tionrdquo RAND Journal of Economics 19(3)344ndash62

Auld M Christopher J C Herbert EmeryDaniel V Gordon and Douglas McClintock2001 ldquoThe Efficacy of Construction SiteSafety Inspectionsrdquo Journal of Labor Eco-nomics 19(4) 900ndash21

Barberis Nicholas Ming Huang and RichardH Thaler 2006 ldquoIndividual PreferencesMonetary Gambles and Stock Market Par-ticipation A Case of Narrow FramingrdquoAmerican Economic Review 96(4) 1069 ndash90

Barsky Robert B F Thomas Juster Miles SKimball and Matthew D Shapiro 1997ldquoPreference Parameters and Behavioral Het-erogeneity An Experimental Approach in theHealth and Retirement Studyrdquo QuarterlyJournal of Economics 112(2) 537ndash79

Beetsma Roel M W J and Peter C Schotman2001 ldquoMeasuring Risk Attitudes in a NaturalExperiment Data from the Television GameShow Lingordquo Economic Journal 111(474)821ndash48

Bombardini Matilde and Francesco Trebbi2005 ldquoRisk Aversion and Expected UtilityTheory A Field Experiment with Large andSmall Stakesrdquo httpwwweconubccambombardiniindexhtm

Bortkiewicz Ladislaus 1898 Das Gesetz derKlienen Zahlen (The Law of Small Numbers)Leipzig Teubner

785VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 42: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

Cardon James H and Igal Hendel 2001ldquoAsymmetric Information in Health Insur-ance Evidence from the National MedicalExpenditure Surveyrdquo RAND Journal of Eco-nomics 32(3) 408ndash27

Cawley John and Tomas Philipson 1999 ldquoAnEmpirical Examination of Information Barri-ers to Trade in Insurancerdquo American Eco-nomic Review 89(4) 827ndash46

Chetty Raj 2006 ldquoA New Method of Estimat-ing Risk Aversionrdquo American Economic Re-view 96(5) 1821ndash34

Chiappori Pierre-Andre Bruno Jullien BernardSalanie and Francois Salanie 2006 ldquoAsym-metric Information in Insurance GeneralTestable Implicationsrdquo RAND Journal ofEconomics 37(4)

Chiappori Pierre-Andre and Bernard Salanie2000 ldquoTesting for Asymmetric Informationin Insurance Marketsrdquo Journal of PoliticalEconomy 108(1) 56ndash78

Chiappori Pierre-Andre and Bernard Salanie2006 ldquoThe Microeconomics of InsuranceAn Empirical Perspectiverdquo Paper presentedat the annual meeting of the American Eco-nomic Association Boston

Choi Syngjoo Douglas Gale Ray Fisman and Sha-char Kariv 2006 ldquoSubstantive and ProceduralRationality in Decisions under Uncertaintyrdquohttpsocratesberkeleyedukariv

Cicchetti Charles J and Jeffrey A Dubin 1994ldquoA Microeconometric Analysis of Risk Aver-sion and the Decision to Self-Insurerdquo Jour-nal of Political Economy 102(1) 169ndash86

Cohen Alma 2003 ldquoProfits and Market Powerin Repeat Contracting Evidence from theInsurance Marketrdquo Unpublished

Cohen Alma 2005 ldquoAsymmetric Informationand Learning Evidence from the AutomobileInsurance Marketrdquo Review of Economics andStatistics 87(2) 197ndash207

Cohen Alma and Liran Einav 2003 ldquoThe Ef-fects of Mandatory Seat Belt Laws on Driv-ing Behavior and Traffic Fatalitiesrdquo Reviewof Economics and Statistics 85(4) 828ndash43

Cooper Russell and Beth Hayes 1987 ldquoMulti-Period Insurance Contractsrdquo InternationalJournal of Industrial Organization 5(2) 211ndash31

Cummins J David and Jack L VanDerhei1979 ldquoA Note on the Relative Efficiency ofProperty-Liability Insurance DistributionSystemsrdquo Bell Journal of Economics 10(2)708ndash19

Damien Paul John Wakefield and StephenWalker 1999ldquoGibbs Sampling for BayesianNon-Conjugate and Hierarchical Models byUsing Auxiliary Variablesrdquo Journal of theRoyal Statistical Society B 61(2) 331ndash44

DrsquoArcy Stephen P and Neil A Doherty 1990ldquoAdverse Selection Private Information andLowballing in Insurance Marketsrdquo Journalof Business 63(2) 145ndash64

De Meza David and David C Webb 2001 ldquoAd-vantageous Selection in Insurance MarketsrdquoRAND Journal of Economics 32(2) 249ndash62

Devroye Luc 1986 Non-Uniform RandomVariate Generation New York Springer-Verland

Dionne Georges and Neil A Doherty 1994ldquoAdverse Selection Commitment and Rene-gotiation Extension to and Evidence fromInsurance Marketsrdquo Journal of PoliticalEconomy 102(2) 209ndash35

Dionne Georges and Pierre Lasserre 1985ldquoAdverse Selection Repeated InsuranceContracts and Announcement Strategyrdquo Re-view of Economic Studies 52(4) 719ndash23

Dionne Georges and Charles Vanasse 1992ldquoAutomobile Insurance Ratemaking in thePresence of Asymmetrical InformationrdquoJournal of Applied Econometrics 7(2) 149ndash65

Donkers Bas Bertrand Melenberg and Arthurvan Soest 2001 ldquoEstimating Risk Attitudes Us-ing Lotteries A Large Sample Approachrdquo Jour-nal of Risk and Uncertainty 22(2) 165ndash95

Dreze Jacques H 1981 ldquoInferring Risk Toler-ance from Deductibles in Insurance Con-tractsrdquo Geneva Papers on Risk andInsurance 20(July) 48ndash52

Einav Liran Amy Finkelstein and PaulSchrimpf 2006 ldquoThe Welfare Cost of Asym-metric Information Evidence from the UKAnnuity Marketrdquo httpwwwstanfordeduleinav

Evans William N and W Kip Viscusi 1991ldquoEstimation of State-Dependent Utility Func-tions Using Survey Datardquo Review of Eco-nomics and Statistics 73(1) 94ndash104

Finkelstein Amy and Kathleen McGarry 2006ldquoMultiple Dimensions of Private Informa-tion Evidence from the Long-Term Care In-surance Marketrdquo American EconomicReview 96(4) 938ndash58

Finkelstein Amy and James Poterba 2004 ldquoAd-verse Selection in Insurance Markets Policy-holder Evidence from the UK Annuity

786 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 43: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

Marketrdquo Journal of Political Economy112(1) 183ndash208

Friend Irwin and Marshall E Blume 1975ldquoThe Demand for Risky Assetsrdquo AmericanEconomic Review 65(5) 900ndash22

Gertner Robert 1993 ldquoGame Shows and Eco-nomic Behavior Risk-Taking On lsquoCardSharksrsquordquo Quarterly Journal of Economics108(2) 507ndash21

Geweke John 1991 ldquoEfficient Simulation fromthe Multivariate Normal and Student-t Distri-butions Subject to Linear Constraintsrdquo InComputing Science and Statistics Proceed-ings of the 23rd Symposium on the Interfaceed E M Keramidas 571ndash78 Fairfax Inter-face Foundation of North America

Hartog Joop Ada Ferrer-i-Carbonell and NicoleJonker 2002 ldquoLinking Measured Risk Aver-sion to Individual Characteristicsrdquo Kyklos55(1) 3ndash26

Hendel Igal and Alessandro Lizzeri 2003 ldquoTheRole of Commitment in Dynamic ContractsEvidence from Life Insurancerdquo QuarterlyJournal of Economics 118(1) 299ndash327

Holt Charles A and Susan K Laury 2002 ldquoRiskAversion and Incentive Effectsrdquo AmericanEconomic Review 92(5) 1644ndash55

Israel Mark 2004 ldquoDo We Drive More Safelywhen Accidents Are More Expensive Identi-fying Moral Hazard from Experience RatingSchemesrdquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Israel Mark 2005 ldquoWhere Is All the HiddenInformation Hiding Evidence from Auto-mobile Insurance Panel Datardquo httpwwwkelloggnorthwesternedufacultyisraelhtmresearchhtm

Jullien Bruno and Bernard Salanie 2000 ldquoEs-timating Preferences under Risk The Case ofRacetrack Bettorsrdquo Journal of PoliticalEconomy 108(3) 503ndash30

Jullien Bruno Bernard Salanie and FrancoisSalanie 2007 ldquoScreening Risk-Averse Agentsunder Moral Hazard Single-Crossing and theCara Caserdquo Economic Theory 30(1) 151ndash69

Kachelmeier Steven J and Mohamed Shehata1992 ldquoExamining Risk Preferences under HighMonetary Incentives Experimental Evidencefrom the Peoplersquos Republic of Chinardquo Amer-ican Economic Review 82(5) 1120ndash41

Kahneman Daniel and Amos Tversky 1979ldquoProspect Theory An Analysis of Decisionunder Riskrdquo Econometrica 47(2) 263ndash91

Kimball Miles S and N Gregory Mankiw 1989ldquoPrecautionary Saving and the Timing ofTaxesrdquo Journal of Political Economy 97(4)863ndash79

Kocherlakota Narayana R 1996 ldquoThe EquityPremium Itrsquos Still a Puzzlerdquo Journal of Eco-nomic Literature 34(1) 42ndash71

Kunreuther Howard and Mark V Pauly 1985ldquoMarket Equilibrium with Private Knowl-edge An Insurance Examplerdquo Journal ofPublic Economics 26(3) 269ndash88

Landsberger Michael and Isaac Meilijson 1999ldquoA General Model of Insurance under AdverseSelectionrdquo Economic Theory 14(2) 331ndash52

Matthews Steven and John Moore 1987ldquoMonopoly Provision of Quality and War-ranties An Exploration in the Theory ofMultidimensional Screeningrdquo Economet-rica 55(2) 441ndash 67

Metrick Andrew 1995 ldquoA Natural Experimentin lsquoJeopardyrsquordquo American Economic Review85(1) 240ndash53

Michener Ron and Carla Tighe 1992 ldquoA PoissonRegression Model of Highway FatalitiesrdquoAmerican Economic Review 82(2) 452ndash56

Palacios-Huerta Ignacio and Roberto Serrano2006 ldquoRejecting Small Gambles under ExpectedUtilityrdquo Economics Letters 91(2) 250ndash59

Puelz Robert and Arthur Snow 1994 ldquoEvi-dence on Adverse Selection EquilibriumSignaling and Cross-Subsidization in the In-surance Marketrdquo Journal of Political Econ-omy 102(2) 236ndash57

Rabin Matthew 2000 ldquoRisk Aversion and Ex-pected-Utility Theory A Calibration Theo-remrdquo Econometrica 68(5) 1281ndash92

Rabin Matthew and Richard H Thaler 2001ldquoAnomalies Risk Aversionrdquo Journal of Eco-nomic Perspectives 15(1) 219ndash32

Rochet Jean-Charles and Lars A Stole 2002ldquoNonlinear Pricing with Random Participa-tionrdquo Review of Economic Studies 69(1)277ndash311

Rose Nancy L 1990 ldquoProfitability and ProductQuality Economic Determinants of AirlineSafety Performancerdquo Journal of PoliticalEconomy 98(5) 944ndash64

Rothschild Michael and Joseph E Stiglitz 1976ldquoEquilibrium in Competitive Insurance Mar-kets An Essay on the Economics of Imper-fect Informationrdquo Quarterly Journal ofEconomics 90(4) 630ndash49

Rubinstein Ariel 2001 ldquoComments on the Risk

787VOL 97 NO 3 COHEN AND EINAV ESTIMATING RISK PREFERENCES FROM DEDUCTIBLE CHOICE

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007

Page 44: Estimating Risk Preferences from Deductible Choicepublic.econ.duke.edu/.../socialinsurance/readings/Cohen_Einav07(2.1… · Estimating Risk Preferences from Deductible Choice By A

and Time Preferences in Economicsrdquo httparielrubinsteintauacil

Saha Atanu 1997 ldquoRisk Preference Estimationin the Nonlinear Mean Standard Deviation Ap-proachrdquo Economic Inquiry 35(4) 770ndash82

Smart Michael 2000 ldquoCompetitive InsuranceMarkets with Two Unobservablesrdquo Interna-tional Economic Review 41(1) 153ndash69

Smith Vernon L and James M Walker 1993ldquoRewards Experience and Decision Costs inFirst Price Auctionsrdquo Economic Inquiry31(2) 237ndash45

Sydnor Justin 2006 ldquoSweating the SmallStuffThe Demand for Low Deductibles inHomeowners Insurancerdquo httpswebfilesberkeleyedujrsydnor

Tanner Martin A and Wing Hung Wong 1987ldquoThe Calculation of Posterior Distributions

by Data Augmentationrdquo Journal of theAmerican Statistical Association 82(398)528ndash40

Tversky Amos and Peter Wakker 1995ldquoRiskAttitudes and Decision Weightsrdquo Economet-rica 63(6) 1255ndash80

Villeneuve Bertrand 2003 ldquoConcurrence etAntiselection Multidimensionnelle en Assur-ancerdquo Annales drsquoEconomie et de Statistique69(January-March) 119ndash42

Viscusi W Kip and William N Evans 1990ldquoUtility Functions That Depend on HealthStatus Estimates and Economic Implica-tionsrdquo American Economic Review 80(3)353ndash74

Watt Richard 2002 ldquoDefending Expected Util-ity Theory Commentrdquo Journal of EconomicPerspectives 16(2) 227ndash29

788 THE AMERICAN ECONOMIC REVIEW JUNE 2007