On the semantics and pragmatics of epistemic …ssmoss/Moss - Semantics and...Forthcoming in Semantics and Pragmatics. Penultimate version. On the semantics and pragmatics of epistemic

Forthcoming in Semantics and Pragmatics. Penultimate version.

On the semantics and pragmatics of epistemic vocabulary

Sarah [email protected]

There has been much recent debate over the correct semantics for epistemic vo-cabulary, i.e. expressions like the sentential operators in sentences such as:

(1) John might be in his office.

(2) John must be in his office.

(3) John is probably in his office.

(4) If John is in the building, he is in his office.

This paper explores a rich source of data for theories of this vocabulary. The debateover the viability of standard truth conditional theories has called attention to thedistinctive behavior of epistemic vocabulary in eavesdropping judgments, indicativesuppositions, and statements of disagreement and retraction. But extant accountsare not sufficiently sensitive to distinctive features of the way in which epistemicvocabulary interacts with other epistemic vocabulary. If we start by studying thebehavior of simple nested epistemic modals, we may naturally build a theory thatexplains the more complicated behavior of epistemic modals under disjunction andover indicative conditionals, and even the puzzling effects of embedding epistemicvocabulary in classically valid arguments. In §1, I make unifying observations aboutthe suggestive behavior of epistemic vocabulary in each of these contexts, extractingseveral desiderata for semantic and pragmatic theories.

1. Thanks to Fabrizio Cariani, Josh Dever, Cian Dorr, John Hawthorne, Eric Swanson, Brian Weatherson,and an anonymous referee for feedback on drafts of this paper. Thanks also to the University of ChicagoLinguistics and Philosophy Workshop, the University of Michigan Linguistics and Philosophy Work-shop, Ohio State University, and the 24th Semantics and Linguistics Theory Conference (SALT 24) forhelpful discussion.

In §2–3, I develop a semantics for epistemic vocabulary. This semantics constitutesa rather dramatic alternative to standard truth conditional theories, as it assigns setsof probability measures rather than sets of worlds as semantic values. I aim to demon-strate that what my theory lacks in conservatism is made up for by its strength. In§4, I argue that combined with a novel pragmatics, my semantic theory can accountfor the distinctive linguistic behavior observed in §1. The theory I defend therebyaddresses several challenges raised in recent literature. For instance, the theory an-swers concerns about epistemic modals under disjunction raised in Schroeder 2012.The theory explains why epistemic vocabulary produces invalid instances of classi-cally valid arguments, shedding light on important puzzles raised for constructivedilemma arguments in Kolodny & MacFarlane 2010 and modus tollens argumentsin Yalcin 2012b.

1. Data for a theory of epistemic vocabulary

A careful examination of the behavior of epistemic modals yields several desideratafor a theory of epistemic vocabulary. A few of these desiderata have been discussedelsewhere, usually as puzzles concerning epistemic modals. A number of the desider-ata make trouble for extant semantic theories. The literature on epistemic modals isso vast that it would be impractical to argue against every alternative to my preferredtheory here. For considerations of space, I set aside the possibility of resuscitatingthe standard truth conditional semantics for epistemic vocabulary, since persuasivearguments against that semantics have been discussed at length elsewhere.2 I pointout potential challenges for other prominent theories in passing, but the main focusof this paper is the exposition and development of a positive case for my own theory.

1.1. Nested epistemic vocabulary

Nested epistemic vocabulary occurs in many forms in ordinary conversation. Forexample, suppose Alice and Bob are both candidates for certain job positions. Wemay naturally talk about Bob using epistemic adjectives under epistemic operators:

(5) Alice is a likely hire, and Bob might be a likely hire.

(6) Alice is a possible hire, and Bob is probably also a possible hire.

2. For instance, see the implications of triviality results discussed in Edgington 1995, the discussion ofthe subject matter of indicative conditionals in Bennett 2003, the “speaker inclusion constraint” inEgan et al. 2005 and Weatherson 2008, the case of the missing car keys in Swanson 2006 and von

Fintel & Gillies 2011, the eavesdropping cases in Egan 2007, the discussion of embedding behaviorin Yalcin 2007, the discussion of inference patterns in Yalcin 2010, the discussion of assertability anddisagreement in Yalcin 2011, and the discussion of retraction and disputes in MacFarlane 2011.

2

And we could further spell out the above observations as follows:

(7) It is likely that we will hire Alice, and we might also be likely to hire Bob.

(8) We might hire Alice, and it is probably the case that we might hire Bob too.3

Both epistemic modals and epistemic comparative adjectives can occur in the scopeof indicative conditionals, and vice versa:

(9) If they did not hire Alice, they are more likely to have hired Bob than Carl.4

(10) It is more likely than not that the vase broke if he dropped it on concrete.

In addition, there are well-known examples of right-nested and left-nested indicatives:

(11) If a Republican wins the election, then if it’s not Reagan who wins it will beAnderson. (McGee 1985, 462)

(12) If the cup broke if it was dropped, it was fragile. (Gibbard 1981, 237)

And finally, there are attested uses of nested epistemic expressions occurring in shortsuccession:

(13) She could not but think [that] Wentworth was not in love with either. Theywere more in love with him; yet there it was not love. It was a little fever ofadmiration; but it might, probably must, end in love with some.5

(14) The time is now near at hand which must probably determine, whether Amer-icans are to be, Freemen, or Slaves.6

In wordy constructions such as (7) and (8) as well as condensed constructions suchas (13) and (14), we are intuitively using nested epistemic modals to say somethingdifferent from what we would use single modals to say. For example, intuitively (5)says something different about Bob than it says about Alice:

(5) Alice is a likely hire, and Bob might be a likely hire.

To take another simple example, (15) intuitively says something different about Bobfrom either (16) or (17):

(15) It is probably the case that Bob is a possible hire.

3. It cannot be taken for granted that both modals in these constructions are genuinely epistemic. However,in the next section of this paper, I give several arguments against the claim that one can always provideembedded modals with non-epistemic interpretations.

4. Hacquard & Wellwood 2012 give attested cases of epistemic vocabulary in indicative antecedents,while arguing that pragmatic considerations may limit the distribution of epistemic vocabulary in in-dicative antecedents and similar linguistic contexts.

5. Austen 1818, p.55; italics added.6. George Washington’s address to the Continental Army before the Battle of Long Island, 27 August 1776;

italics added.

3

(16) It is probably the case that Bob is a hire.

(17) Bob is a possible hire.

In particular, our judgments suggest that (15) is weaker than either (16) or (17). Be-lieving (16) is intuitively sufficient reason to bet at even odds that we will hire Bob,whereas merely believing (15) is not. Evidence for the semantic difference between(15) and (17) comes from direct intuitions about what we use these sentences to talkabout. In particular, nested epistemic modals are often used when you do not yethave some settled opinion on some question. If you say that Bob is a possible hire, itsounds as if you know that we might hire Bob. By contrast, if you merely say that itis probably the case that Bob is a possible hire, it sounds as if you have not yet settledon an opinion about Bob. Either Bob is a possible hire, or he isn’t, and you are moreinclined to side with the former opinion.

Relatedly, subjects sometimes report that they can easily make sense of nestedepistemic modals by imagining that the speaker has several sources of informationabout their prejacent, and she is not sure which source she should trust. For instance,suppose we survey several equally informed experts about whether we might hireBob. If most say that we might hire Bob and just a couple of experts disagree, thenit is natural to form the opinion that it is probably the case that we might hire Bob.And analogous generalizations hold for other uses of nested epistemic modals. Tocomment on the example (14) above: if you say that some battle must probably bedecisive, it sounds as if whatever settled opinion you may eventually have about theimportance of the battle, you will settle on an opinion according to which the battleis probably decisive. It is easy to make sense of this state by imagining that you haveseveral sources of information about whether the battle will be decisive, where eachsource agrees that the battle is at least more likely than not to be decisive.

According to naïve orthodoxy, when someone utters a declarative sentence, youshould add its content to your stock of full beliefs. But as theorists have developedalternatives to full belief models of mental states, many have argued that what we sayreflects what we think according to these more intricate models. For instance, somehave claimed that epistemic modals are used to communicate partial beliefs.7 At a firstglance, it may appear that sentences containing nested epistemic modals are used tocommunicate even more intricate mental states. In particular, according to imprecisecredence models, you are associated with multiple probability measures when you areunsettled as to how likely various propositions are, exactly as you might be when youare unsure what source of information you should trust. Rothschild 2012 argues that

7. See §2 for further discussion, and see Swanson 2012 for a recent catalog of relevant literature.

4

epistemic modals are used to communicate these sorts of imprecise credal states. Thetheory I develop does not model subjects as having imprecise credences. But whetheror not we adopt the sort of semantics Rothschild defends, it is important that ourtheory account for intuitive judgments that naturally lend themselves to that proposal.In other words, the above discussion highlights an important goal for any theoryof epistemic vocabulary. This is our first desideratum: our theory should explainwhy nested epistemic modals signal that different opinions about some subject are inplay. Relatedly, our theory should explain why we sometimes easily make sense ofembedded modals by imagining that a speaker bases her opinions on multiple sourcesof information.

A second desideratum for our theory of epistemic vocabulary is inspired by Yal-cin 2007. Yalcin points out that our theory of epistemic possibility modals shouldexplain why conjunctions of pairs of sentences such as (18) and (19) sound bad, andwhy such conjunctions continue to sound bad when embedded under indicative sup-position, as in (20) and (21):

Some detectives are discussing the identity of a certain masked murderer.

(18) It is not John.

(19) It might be John.

(20) #Suppose it is not John and it might be John.

(21) #If it is not John and it might be John. . .

Along the same lines, note that not only is it bad to assert (18) and (19) together,but it is difficult to imagine a single circumstance in which you could correctly uttereither of these sentences individually. If you would be correct in uttering (18) in somecircumstance, then it is difficult to imagine how you could simultaneously be just ascorrect in uttering (19).

In this last respect, (18) and (19) stand in striking contrast to a similar pair ofsentences, namely sentences that resemble (18) and (19), but where the embeddedsentence is replaced with a sentence containing epistemic vocabulary:

(22) It is not the case that it is probably John.

(23) It might be the case that it is probably John.

It is possible to imagine a single circumstance in which you could correctly utter either(22) or (23). For instance, suppose you simply cannot make up your mind about howlikely it is that the masked murderer is John. A few experts believe it is probably John,but a majority of experts believe it is probably Mary. In this case, you might correctly

5

use (22), insofar as you would side with the majority of experts if forced to chooseone suspect. But you might also correctly use (23), insofar as you refuse to simplyignore the minority expert opinion. Here different frames of mind are relevant toyour imagined utterances: (22) reflects your opinion after collating the advice of yourexpert advisors, while (23) reflects the fact that you are still not sure which expertsyou should trust. And of course, neither frame of mind vindicates the assertion ofboth sentences:

(24) #It is not the case that it is probably John and it might be the case that it isprobably John.8

These judgments yield a second desideratum for our theory of epistemic vocabulary:our theory should explain why in certain circumstances, we could correctly uttereither (22) or (23), though we could not correctly utter their conjunction.

A third desideratum comes from a final observation about nested modals, namelythat the strength of the outer modal often reflects the weight of your evidence and re-silience of your opinion about the prejacent of the inner modal. For example: supposethat Liem likes wearing green shirts. His dad Eric has observed the color of his shirton 800 consecutive days. Liem was wearing green on 500 of those days. His friendMadeleine has observed the color of his shirt on 8 consecutive days. Liem was wear-ing green on 5 of those days. Suppose that Eric and Madeleine have not yet seenwhat Liem is wearing today. Both Eric and Madeleine have .625 credence that Liemis wearing green, and both might guess that Liem is probably wearing green. But itseems more appropriate for Madeleine to assert (25) or (26), whereas Eric is intuitivelylicensed in asserting (27):

(25) It might be probable that Liem is wearing green.

(26) In fact, I’m fairly confident that he is probably wearing green.

(27) Liem is definitely likely to be wearing green.

The assertability of (27) tracks two differences between Eric and Madeleine. Eric baseshis credences about Liem on more evidence. In addition, his high credence that Liemis wearing green is more resilient. Joyce 2005 argues that in a number of evidentialsituations, “weight of evidence manifests itself in the resilience of credences in theface of new data” (166). In the above situation, both evidential weight and credalresilience are manifested in the strength of the modal that embeds (28):

(28) Liem is probably wearing green.

8. A less stilted but equally infelicitous version of the sentence: ‘John isn’t a probable killer and might bea probable killer’.

6

Suppose you have a relatively uninformed hunch that Liem is probably wearing green.In other words, suppose that your high credence that Liem is wearing green is notjustified by much evidence. Then you are intuitively licensed in asserting (25), butnot (27). As you acquire more and more evidence, your high credence that Liem iswearing green will become more and more resilient, and you may embed (28) understronger and stronger epistemic modals. Hence our third desideratum: our theory ofepistemic vocabulary should explain this intuitive connection between nested modals,evidential weight, and credal resilience.

All three of the above desiderata pose challenges for several extant theories ofepistemic modals. For example, consider the following standard dynamic semanticentries for epistemic possibility and necessity modals:9

c[♦φ] = {w ∈ c : c[φ] 6= ∅}

c[�φ] = c \ {w ∈ c : (c \ c[φ]) 6= ∅}

From these definitions, we can derive that c[�♦φ] = c[♦φ]. Hence according to thissemantics, any string of possibility and necessity modals is equivalent to its innermostmodal. Some dynamic semanticists explicitly embrace this result, claiming that “em-bedding an epistemic modal under another epistemic modal does not in general haveany interesting semantic effects” (Willer 2013, 12). The same result holds for a promi-nent competitor of the dynamic semantic proposal, namely the semantics defendedin Yalcin 2007. As Yalcin explains: “iterating epistemic possibility operators addsno value on this semantics. . . This may explain why iterating epistemic possibilitymodals generally does not sound right, and why, when it does, the truth-conditionsof the result typically seem equivalent to ♦φ. I will generally ignore iterated epistemicmodalities” (994). It is difficult to see how semantic proposals in this spirit could suc-cessfully explain the pervasive nature of nested modals, much less account for theirdistinctive behavior.

1.2. Against contextualist re-interpretations of nested epistemic vocabulary

The most substantive recent attempt at a more responsive semantics for nested epis-temic modals appears in Yalcin 2009, where Yalcin admits that sometimes nestedmodals do “allow for coherent interpretations not equivalent to corresponding ex-pression with the most narrow modal. The latter case is not provided for by theabove semantics. In such cases I would be inclined to appeal to tacit shifting of the

9. For canonical instances of semantic proposals along these lines, see Stalnaker 1970, Veltman 1996,Beaver 2001, von Fintel & Gillies 2008b, and Willer 2013.

7

information state parameter, akin to free indirect discourse” (21). For further elabo-ration, we are directed to the following passage in Yalcin 2007: “interpretation mayinvolve a tacit shift in the information parameter. . . to the target state of informationfor the context. Aside from Gricean considerations of charitable interpretation, it isnot obvious whether general principles are involved in the interpretation of such tacitshifts” (1013). It is difficult to know exactly what is intended by these brief sugges-tions, and hence my arguments so far may be understood as an invitation to developthese suggestions into a theory that satisfies the desiderata given above.

A natural development of these suggestions might say that in any sentence wherenested modals occur, the prejacent of the outer modal receives the same boring sort ofsemantic value as any simple declarative sentence. For instance, one might assimilatesentences such as (27) with sentences about particular probability functions, such as(29) or (30):

(27) It is almost certainly the case that Liem is probably wearing green.

(29) It is almost certainly the case that the objective chance that Liem is wearinggreen is high.

(30) It is almost certainly the case that my epistemic probability that Liem iswearing green is high.

However, there are many reasons to be skeptical of this approach. Recall that recentliterature has provided a host of reasons to reject the claim that the prejacent (28) isequivalent to some simple declarative sentence like (31) or (32):

(28) Liem is probably wearing green.

(31) The objective chance that Liem is wearing green is high.

(32) My epistemic probability that Liem is wearing green is high.

The crucial dialectical point to appreciate is that analogous concerns tell against theequivalence of these same sentences when they are embedded under epistemic vocab-ulary. For example, it is suspiciously difficult to say exactly what salient probabilityfunction (27) is talking about. In the case described above, Eric can utter (27). But hecannot utter (29), because Eric knows that the objective chance that Liem is wearinggreen is either 0 or 1, and Eric is not almost certain of the latter. Madeleine cannotutter (27). But she can utter (30), because she knows that her inductive evidence con-firms the claim that Liem is wearing green. Hence neither (29) nor (30) accuratelyparaphrases (27).

Furthermore, eavesdroppers may explicitly target the prejacent of (27) and cor-rectly evaluate it relative to their epistemic situation. For instance, if I have just seen

8

Liem wearing a red shirt and I overhear Eric utter (27), it would be pedantic butnevertheless acceptable for me to say:

(33) That isn’t almost certain; it’s just false. It’s not the case that Liem isprobably wearing green—he is wearing red.

A notorious dilemma for truth-conditional accounts replays itself here: if Eric wasusing ‘probably’ just to talk about his own evidential situation, then I am not licensedin saying ‘it’s false’ in judging the prejacent of (27). On the other hand, if Eric wasusing ‘probably’ to talk about some evidence that included my evidence, then he wasnot licensed in uttering (27) to begin with.10

In fact, nearly every argument against a uniform truth conditional theory of allepistemic modals yields an analogous argument against a uniform truth conditionaltheory of all embedded epistemic modals. Bennett 2003 may argue that any allegedparaphrases of (27) fail to capture its intuitive subject matter, for instance. Bennettargues that when someone utters an indicative conditional, “common sense and theRamsey test both clamour that [she] is not assuring me that her value for a certainconditional probability is high, but is assuring me of that high value. . . She aims toconvince me of that probability, not the proposition that it is her probability” (90).Yalcin 2011 adds that the reasons that I give in support of my utterance ‘it might beraining’ concern the first-order proposition that it is raining, rather than any contex-tually determined body of evidence. Both Bennett and Yalcin could complain that(27) intuitively concerns Liem, rather than any contextually determined body of evi-dence. Another challenge comes from Yalcin 2007. If embedded modals are alwaysinterpreted relative to some salient probability function, then we lack an explanationfor the infelicity of sentences such as:

(34) #Probably, it is raining and might not be raining.

(35) #It is unlikely that it is both raining and might not be raining.

(36) #It might be that it is both raining and might not be raining.

These judgments are not accommodated by expressivist, relativist, or dynamic theo-ries that resort to assigning simple semantic contents to embedded modal construc-tions.

In addition, it is worth noting that if we reinterpret the prejacent of (27) as havingstraightforward truth conditions, we are still left with the problem of interpreting(37-b) in the following dialogue:

10. This is just the first step in an involved dialectic. For further discussion of eavesdropping argu-ments against truth-conditional accounts of epistemic vocabulary, see Egan et al. 2005, Egan 2007,Hawthorne 2007, von Fintel & Gillies 2008a, Yalcin & Knobe 2010, and MacFarlane 2011.

9

(37) a. David: Is Liem probably wearing green?b. Eric: Almost certainly.

Familiar arguments challenge the claim that the unembedded (37-a) has straightfor-ward truth conditions. Furthermore, it is difficult to see why Gricean considerationsshould demand that we interpret (37-a) as containing free indirect discourse or atacitly shifted information parameter. Hence it seems we must find some way of in-terpreting (37-b) without appealing to such strategies. One would expect the resultingunderstanding of (37-b) to provide some similar understanding of (27), namely an al-ternative semantics that recognizes that ‘Liem is probably wearing green’ need notexpress a possible worlds content, whether it is embedded in a question or under fur-ther epistemic vocabulary. To sum up: it is not obvious that extant semantic theoriescan explain the behavior of nested epistemic modals. A natural way of developingpotential explanations on behalf of recent expressivist, relativist, and dynamic theo-ries meets with several challenges. Hence the behavior of nested epistemic modalsshould motivate us to look for alternative semantic theories.

1.3. Epistemic vocabulary under disjunction

A fourth desideratum for our theory of epistemic vocabulary is inspired by Schroeder

2012. Schroeder argues that a semantic theory should not predict that you can asserta disjunction only if you can assert one of its disjuncts, even in special cases wheredisjuncts are stipulated to be governed by wide-scope epistemic modals. Schroederpoints out several reasons why this prediction would be bad. Here is one example:

Last night Shieva calls me to express frustration with the paper that she is workingon, and tells me that if she hasn’t finished by this morning, she’s going to consulther magic 8-ball about whether to give up and follow its advice. Since I know thatmost of the answers on her magic 8-ball are positive, when I recall our conversationfrom last night, I conclude that either Shieva finished her paper by this morning,or she probably gave up. (21–2)

In this case, the speaker can correctly assert ‘Shieva finished or probably gave up’without being able to assert either disjunct. Similarly, you can correctly assert (38)about the result of throwing a fair die, without being able to assert either disjunct:

(38) It is less than four or probably even.

In this respect, disjunctions embedding epistemic vocabulary are just like ordinarydisjunctions of simple sentences. In fact, asserting a disjunction usually implicatesthat you are not in a position to assert either disjunct. There is something especiallypeculiar about disjunctions embedding epistemic vocabulary, though. Even if you

10

can deny one disjunct and you cannot assert the other, you may still be able to assertthe entire disjunction. For instance: you can assert (38) even though you can denythe second disjunct by itself, and you cannot assert the first. This does not hold fordisjunctions without epistemic vocabulary. If you can deny one half of a simple dis-junction, then disjunctive syllogism ordinarily proves that the remaining disjunct isequivalent to the entire disjunction, so one is not assertable without the other. Thisbrings us to our fourth desideratum: our theory should explain this surprising differ-ence between simple disjunctions and disjunctions containing epistemic vocabulary.

A semantics for ‘or’ is missing from Yalcin 2007, 2011, 2012b, and related pa-pers. Hence the relevant challenge for Yalcin is to state a semantics that predicts thebehavior just described.11 Substantially more progress has been made on disjunctionin the dynamic semantics literature. In fact, a number of dynamic accounts of disjunc-tion satisfy our fourth desideratum. According to these accounts, natural languagedisjunction is not commutative. Roughly speaking, the second half of a disjunction isnot interpreted relative to a global context, but rather relative to a local context thathas been updated with the negation of the first disjunct. This sort of account aims togive a uniform explanation of the local interpretation of ‘probably’ in (38) and localsatisfaction of licensing conditions for pronouns in disjunctions such as the followingfamous example from Roberts 1989:

(39) Either there is no bathroom in this house, or it is in a funny place.

Just as the licensing conditions for ‘it’ in (39) are satisfied in a local context where thefirst disjunct is false, values of contextual parameters in the second disjunct of (38) areprovided by a local context where the first disjunct is false. This explains why youmay assert (38) even when you can deny its second disjunct uttered in isolation. Thedisjunction is felicitous because its second disjunct is acceptable in all contexts wherethe negation of the first disjunct is given.

This dynamic account predicts that natural language disjunction is not commu-tative, and fans of this account often claim this predicted failure of commutativity asa benefit. For instance, they claim that a semantics for natural language disjunctionshould entail that (40) sounds bad even though (38) sounds fine:


(40) It is probably even or less than four.

11. Schroeder extrapolates a semantics for ‘or’ from Yalcin 2007 and criticizes that semantics for validating‘or’ exportation.

11

However, it is not clear that we should want our semantics to predict this differencebetween (38) and (40).12 For instance, there are a number of contexts in which (40)seems just as good as (38), namely contexts in which certain partitions of logical spaceare salient. Consider the following case:

Alice just rolled a fair die and hid it under a cup in front of me. I see a blue cupand a red cup. The die is under the blue cup if it landed on a four, five, or six. Thedie is under the red cup if it landed on a number less than four.

Bob offers me a pair of bets. For one dollar, he will sell me a bet that pays fivedollars if the die landed on an even number. For another dollar, he will sell me abet that pays five dollars if the die landed on a number less than four. I am veryrisk averse, and I do not always bet to maximize expected returns. But staringfirst at the blue cup and then at the red cup, I judge that I would be comfortableaccepting both bets, since, as I put it, “either it is probably even, or less than four.”

The circumstances of the above case call attention to a certain partition of logicalspace: either the die landed on a number less than four, or it landed on a highernumber. Against this background, my utterance of (40) seems perfectly correct.13

In fact, some disjunctions like (40) sound fine without heavy contextual cues. Forinstance, you can assert any of the following disjunctions, even if you can deny thefirst disjunct and cannot assert the second:

(41) It’s either unlikely he was being honest with you, or he just wanted you tothink that he was lying.

(42) The next United States president will either almost certainly attempt torepeal a lot of Barack Obama’s policies, or they will be a Democrat withmore liberal views than Obama has.

(43) John is probably playing baseball, or it has been raining all afternoon.

These disjunctions seem to mean the same thing regardless of the order in which theirdisjuncts are uttered. In fact, they might just as well be written with their disjunctsarranged in a circle, without detriment to our ability to understand or evaluate them.This yields a fifth desideratum for our theory of epistemic vocabulary: our theoryshould explain why disjunctions such as (40) sound infelicitous in some contexts andfelicitous in others. And our theory should explain why reversing disjunct order doesnot affect the interpretation of disjunctions in contexts where they sound felicitous.

12. The commutativity of disjunction is controversial even among advocates of dynamic semantic theo-ries. For instance, Schlenker 2009 and Rothschild 2011 both provide theories according to whichdisjunction is commutative; their accounts are sympathetic with my discussion of the fifth desideratum.

13. Some readers may find it difficult to evaluate the artificial speech described above, especially since thesalience of an objective chance function may introduce noise in our judgments. The essential point of thepresent discussion is that contextual cues may make certain readings of epistemic vocabulary available.See §4.5 for more natural illustrations and a more detailed defense of this point.

12

This fifth desideratum should give us pause before we endorse a semantic theorythat explicitly entails that natural language disjunction is not commutative. Further-more, the above dynamic explanation for why we can assert (38) seems insufficientlygeneral, since it does not explain why we can sometimes assert (40)–(43). The dy-namic proposal outlined above says that we can sometimes assert a disjunction like(38) when its second disjunct is deniable and its first disjunct is unassertable. But(40)–(43) are all sometimes assertable even when their first disjuncts are deniable andtheir second disjuncts are unassertable. According to the dynamic explanation, (38) isfelicitous because its second disjunct is acceptable in all contexts where the negationof the first disjunct is given. But for any of (40)–(43), the second disjunct is not accept-able even in contexts where the negation of the first disjunct is given. For example,the negation of the first disjunct of (40) is already given in an ordinary context wherea fair die is rolled, but the second disjunct of (40) is not acceptable in that context:


To sum up: several observations raise challenges for several extant dynamic semanticaccounts of the assertability of disjunctions. In particular, differences in the assertabil-ity of (38) and (40) seem sensitive to contextual factors, such as the salience of variousalternative sets. This should motivate us to doubt theories that derive differencesin assertability from context-insensitive semantic rules. Pragmatic theories are betterdesigned to account for the distinctive behavior of disjunctions embedding epistemicvocabulary.

1.4. Epistemic vocabulary over indicatives

A sixth desideratum for a theory of epistemic vocabulary is inspired by an examplein chapter 9 of Lycan 2001, which itself builds on a related discussion of subjunctiveconditionals in Slote 1978. Consider the following case:

Jill is standing on the roof of your office building. The local fire department occa-sionally hangs a net along the roof to protect workers doing construction. The netis strong enough to safely catch anyone who falls off the building. Just a few hoursago, you happened to notice that there was no net along the roof. As a result, youdo not believe that Jill is going to jump off the roof. Jill is a thrill-seeker who mightjump into a net for fun, but she definitely does not have a death wish. And withouta net, anyone who jumped off the roof would surely fall to the ground and die.

On the one hand, since you believe that there is no net along the roof, you are intu-itively justified in asserting:

(44) Probably, if Jill jumps off the building, she will die.

13

On the other hand, you are confident that Jill does not have a death wish. If you wereinformed that Jill jumped off the building, you would immediately conclude that thelocal fire department must have installed a net since you last checked the roof. Withthat information in the front of your mind, you are intuitively justified in denying (44)and asserting:

(45) Probably, if Jill jumps off the building, she will live.

To make these observations more vivid, suppose someone asks you whether there isa net along the roof of the building. They may well know that you promised the firedepartment that you wouldn’t go around telling people whether or not there was anet along the roof, but they may still persist in pestering you for information. It isintuitively fine for you to respond:

(46) I cannot answer your questions directly. But I can tell you this much: it isreally likely that if Jill jumps off this building, she will die.

On the other hand, suppose someone asks you whether you believe that Jill is suicidal.Again, they may well know that you promised Jill that you wouldn’t go around tellingpeople about her mental state, but they may persist in pestering you for information.Suppose that it is common ground that anyone suicidal would simply cut away anysafety net and jump off the building in question. It is intuitively fine for you torespond:

(47) I cannot answer your questions directly. But I can tell you this much: it isreally likely that if Jill jumps off this building, she will live.

Hence the assertability of (44) does not depend only on your opinions about Jill andthe net, which we may stipulate are the same when you utter (46) and (47). It must alsobe sensitive to some factor that varies between these contexts of utterance. As withmany other examples we have considered, you are considering different questions inthese different contexts, and which question you are considering seems relevant towhich utterances are felicitous. Suppose you are considering the question of whetherthere is a net along the roof. Then since you believe that there is probably no net,you may say that it is probably the case that if Jill jumps from the roof, she will die.Suppose you are considering the question of whether Jill is suicidal. Then since youbelieve that she is probably not suicidal, you may say that it is probably the case that ifJill jumps from the roof, she will live. The sixth desideratum: our theory of epistemicvocabulary should explain this variation in the assertability conditions of (44).

There is no obvious mechanism for explaining this variation in many extant theo-ries of epistemic vocabulary. The semantic values for ‘probably’ and ‘if’ given in Velt-

14

man 1996 and Yalcin 2012b do not depend on contextually determined parameters.An advocate of these semantic proposals might attribute variation in the interpreta-tion of (44) to scope ambiguity. At the level of logical form, ‘probably’ might takescope over the entire indicative conditional in (44) or just over its consequent. But thisdoes not seem like a plausible explanation of the behavior of (44), since context notonly affects our interpretation of (44), but also our interpretation of the unembeddedindicative conditional (48):


(48) If Jill jumps off the building, she will die.

The unembedded conditional is borderline assertable when we are focusing on whetherthere is a net along the roof, but definitely unassertable when we are focusing onwhether Jill is suicidal. These judgments suggest that the interpretation of the indica-tive itself depends on contextually determined parameters.

A related challenge arises when we embed sentences like (44) in indicative condi-tionals. If we are talking about whether there is a net, you can correctly assert:

(49) If it is probably the case that Jill will live if she jumps, then there is a net.

If we are talking about whether Jill is suicidal, you can correctly assert:

(50) It is probably the case that Jill will live if she jumps.

However, you can never correctly assert:

(51) There is a net.

These judgments make trouble for certain semantic theories. Several dynamic andexpressivist theories say something roughly like the following: you believe a sentencewhen your credal state accepts it. And an information state accepts a conditionalwhen the closest state that accepts its antecedent also accepts its consequent. Sinceyou believe (50), your actual credal state accepts the antecedent of (49). Hence youractual credal state is the credal state closest to yours that accepts that antecedent.Since you believe the conditional (49), we should conclude that your actual credalstate also accepts its consequent (51). But this conclusion seems clearly false.14

14. In order to keep my discussion as general as possible, I will not use this formula to construct objectionsfor particular theories. The interested reader should combine the discussion of attitude verbs in §7 ofYalcin 2007 with the semantics for ‘if’ and ‘probably’ in the appendix of Yalcin 2012b. For dynamictheories, combine the standard dynamic semantics for attitude verbs in Heim 1992 with the dynamicsemantics for ‘if’ and ‘probably’ developed in §4 of Gillies 2004, §10 of Gillies 2010, or the appendixof Yalcin 2012b, replacing “closest credal state to yours that accepts the antecedent” with “result ofupdating your credal state on the antecedent” in my discussion above.

15

The complex conditional (49) gives rise to our seventh desideratum: our theoryshould explain its assertability conditions. This is not a trivial endeavor. First, ourtheory must assign semantic contents to indicatives whose antecedents embed bothgraded epistemic vocabulary and other indicatives. Second, our theory must explainhow your beliefs can support asserting (49) in some contexts and (50) in others, with-out ever supporting (51). These facts intuitively depend on the context sensitivityof (49) and (50), and relevant contextual factors intuitively include facts about whatquestions are salient when each is uttered.

1.5. Epistemic vocabulary in classically valid arguments

The seventh desideratum also directs us toward one final category of useful obser-vations. If you believe both (49) and (50), it might seem that you could apply modusponens and infer that there is a net along the roof. But you are not licensed in believ-ing that there is a net along the roof. The final three desiderata concern instances ofclassically valid argument forms that seem invalid in virtue of containing epistemicvocabulary.

Suppose Carlos has rolled a fair die without telling us how it landed. A fair diehas three low numbers and three high numbers. Suppose we are considering thefollowing argument about the number Carlos rolled:

(52) a. If it is low, it is probably odd.b. It is not probably odd.c. Hence: it is not low.

This argument seems like an instance of modus tollens. But it also seems invalid. Thefirst premise seems correct, since 2 out of 3 of the low numbers are odd. The secondpremise seems correct, since it is just as likely that an even number was rolled as anodd number. But these premises do not justify our accepting the conclusion, since wehave no idea whether a low number was rolled. Several authors have made similarobservations about apparent instances of modus tollens containing epistemic modals.15

This raises a puzzle: should we say that (52) is not an instance of modus tollens, that(52) is valid, or that some instances of modus tollens are not valid? This brings us toour eighth desideratum: our theory of epistemic vocabulary should solve this puzzle.At a minimum, our theory should come equipped with a notion of consequence thatyields a verdict about whether (52) is valid. And whether or not it is valid, ourtheory should predict the apparent invalidity of instances of modus tollens containingepistemic vocabulary.

15. For related discussion, see Carroll 1894, Veltman 1985, Cantwell 2008, and especially Yalcin 2012b.

16

Here is another apparently invalid argument about the number rolled:

(53) a. If it is low, it is probably odd.b. If it is high, it is probably even.c. It is either low or high.d. Hence: either it is probably odd or probably even.

Kolodny & MacFarlane 2010 discuss similar arguments, including the following:

(54) a. Either the butler did it or the nephew did it.b. If the butler did it, the murder must have occurred in the morning.c. If the nephew did it, the murder must have occurred in the evening.d. Hence: either the murder must have occurred in the morning or it

must have occurred in the evening.

These arguments seem like instances of constructive dilemma. But they also seem in-valid. For instance, just as it seems incorrect to say that the number rolled is probablyeven, it seems incorrect to say it is probably odd. So in the absence of any special con-textual cues, it seems incorrect to say that the number rolled is either probably even orprobably odd. It is neither probably even nor probably odd, but just as likely to be oneor the other. This brings us to our ninth desideratum: our theory should say whether(53) is valid. And whether or not it is valid, our theory should predict the apparentinvalidity of instances of constructive dilemma containing epistemic vocabulary.

Similar problems arise not just for modus tollens and constructive dilemma, butalso for disjunctive syllogism:

(55) a. It is low or probably even.b. It is not probably even.c. Hence: it is not low.

And contraposition of indicative conditionals:

(56) a. If it is low, it is probably even.b. Hence: if it is not probably even, it is not low.

Furthermore, it seems entirely appropriate to give similar explanations for the appar-ent invalidity of these inferences. Kolodny & MacFarlane 2010 and Yalcin 2012b,for instance, defend semantic theories according to which each of the relevant infer-ence rules is literally invalid. In fact, Kolodny and MacFarlane go so far as to say thatmodus ponens itself is an invalid rule of inference.

Anyone rejecting classically valid inference rules bears the burden of explainingwhy we successfully use them in ordinary reasoning. The easiest way to discharge

17

this burden is by proving that the rules are indeed valid when restricted to premisesof a certain form. At a minimum, setting aside complications involving adverbs ofquantification, it seems our theory should predict that arguments are valid when theycontain no epistemic vocabulary at all. This condition raises an important question,namely exactly which arguments containing epistemic vocabulary are valid.

Kolodny & MacFarlane 2010 defend inferences involving conditionals whoseconsequents do not contain any epistemic vocabulary. However, some inferences in-volving conditionals whose consequents contain epistemic vocabulary are intuitivelyvalid as well. For instance, Yalcin 2012b suggests that the following inference isvalid:

(57) a. If the marble is big, then it might be red.b. It is not the case that it might be red.c. Hence: it is not big.

In addition, some probabilistic inference rules are intuitively valid, and some of thoserules govern indicatives with consequents embedding epistemic vocabulary. In fact,we just considered inferences of this sort in §1.4. The following inference licenses mysaying (58-c) when discussing whether there is a net along the roof:

(58) a. Probably, there is no net along the roof.b. If there is no net along the roof, then if Jill jumps, she will die.c. Hence: probably, if Jill jumps, she will die.

And the following licenses my saying (59-c) when discussing whether Jill is suicidal:

(59) a. Probably, Jill is not suicidal.b. If Jill is not suicidal, then if Jill jumps, she will live.c. Hence: probably, if Jill jumps, she will live.

This brings us to our tenth and final desideratum for a theory of epistemic vocabulary.Insofar as our theory says that standard inference rules are generally invalid, it shouldexplain why substantial classes of restricted rules appear to be genuinely valid. Inparticular, our theory should explain why (57), (58), and (59) are apparently valid,even though these inferences are riddled with epistemic vocabulary.

2. A basic semantics for epistemic vocabulary

Before stating specific semantic entries, it will be helpful to outline the basic idea ofthe semantic theory itself. Recall that in a certain context, you may correctly describethe outcome of rolling a fair die by saying:

18


The imagined context of (40) is somewhat contrived. In particular, the context iscontrived to make a certain partition of logical space especially salient. The partitionhas two elements: either the number rolled is low, or it is high. As a result, thereare also two kinds of salient credence distributions when you utter (40). First, youhave conditional credences, conditional on the partition propositions. For example,you have higher than .5 credence that the number rolled is even, conditional on itbeing high. Second, you have a credence distribution over the partition propositionsthemselves. For example, you have .5 credence that number rolled is high. In otherwords, there are various opinions you might have after learning some informationfrom the contextually salient partition. And on top of that, you have some opinionsabout the likelihood of each bit of information that you could learn.

A first pass at my semantics: the latter opinions are associated with highermodals, while the former are associated with embedded modals. For example, itwould sound fine for you to say (60) in the context mentioned above:

(60) It might well be that the number is probably even.

According to my semantics, that is roughly because you could learn some salientinformation—namely that the number rolled is high—confirming an opinion thatgives most of its credence to the number rolled being even. To take another exam-ple, suppose that you are torn between various ways of evaluating candidates for anacademic position. It is not clear how to weigh teaching experience against researchquality, for instance, and you are open to information that would decide this questionin different ways. In spite of your indecision, you might say:

(61) It must be the case that Bob is a possible hire.

According to my semantics, that is roughly because as far as your credences areconcerned, any salient information would support an opinion that gave at least somecredence to Bob being hired. Again, the embedded modal (‘possible’) is associatedwith your credences conditional on various propositions (about ways of evaluatingcandidates), while the higher modal (‘must’) is associated with your credences inthose propositions themselves.

According to a traditional account of assertion, an assertion is “something likea proposal” (cf. Stalnaker 2010, 152), namely the proposal that the content of theassertion be added to the propositions taken for granted in the conversation. In aparadigmatic case of assertion, you believe a proposition, you assert some sentencewith that proposition as its content, and as a result, I come to believe that same

19

proposition. This model of assertion fits well with a certain model of our mental life,according to which full beliefs are the opinions we have and the opinions we want toshare with each other. Meanwhile, theorists have developed alternate models of ourmental life in which degreed beliefs play a central role. It is natural to wonder whetherwe can update our account of assertion to fit these more sophisticated models.

The updated account: an assertion is like a proposal, not about a proposition thatyou should believe, but rather about a property that your credences should have. Itis still true that in a paradigmatic case of assertion, you have an opinion, you assertsome sentence with that opinion as its content, and as a result, I come to have thesame opinion. But the relevant opinions are degreed. In other words, having anopinion amounts to having credences with a certain property. The content of a declar-ative sentence is a property that credences can have. Formally, contents are sets ofprobability measures. In a paradigmatic case of assertion, when you assert a sentencewith a certain content, I come to have a credence distribution that is contained in thatcontent. For instance, you may assert a sentence whose content is the set of all mea-sures that assign probability greater than .5 to the proposition that it is raining. Onhearing your assertion, I will come to have more than .5 credence that it is raining.Following Swanson 2006, we may conceive of the content of a sentence as a constrainton credences, namely the constraint that my credences generally end up satisfying onhearing your assertion of that sentence.

Sentences containing epistemic vocabulary are context sensitive. In other words,which set of measures is the content of a sentence depends on what context you areusing the sentence in. In particular, context contributes partitions of logical space tothe semantic values of such sentences. The contextually determined partitions makethe contents of sentences more interesting. A second pass at the heart of my semantics:some asserted contents are straightforward constraints on credences, such as assign-ing greater than .5 credence to some particular proposition. But asserted contentscan also constrain your credences to have more fine-grained properties. In particu-lar, they can constrain the structure of your credences with respect to propositions innon-trivial contextually determined partitions. The content of a sentence containingnested epistemic modals will be a constraint having to do with your credences inthose propositions, and also with your credences conditional on those propositions.Higher modals correspond to the former sort of constraint, while embedded modalscorrespond to the latter.

20

2.1. A semantics for logical operators

In addition to formal semantic entries, it will be useful to have some shorthand forsaying what expressions mean. Let us say that your credences satisfy the constraintthat a certain proposition accepts that it is probably raining just in case it is probably rain-ing according to your credences conditional on that proposition, or in other words,just in case your conditional credences are contained in the content that it is probablyraining. If context determines a partition of logical space, we can quantify over themembers of that partition as if they were each identified with different people. Forinstance, given a contextually determined partition, let us say that your credences sat-isfy the shorthand constraint that someone accepts that it is probably raining just in casesome proposition in the partition accepts that it is probably raining. In general, letus say your credences satisfy the constraint that someone accepts a particular contentjust in case there is some proposition in the partition such that your credences giventhat proposition are contained in that particular content. Your credences satisfy theconstraint that everyone accepts a content just in case every proposition in the parti-tion is such that your credences given that proposition are contained in that content.And so on. Rather than always explicitly describing your credences conditional onpropositions in a contextually determined partition, we have a handy shorthand thatcaptures the sense in which your credences conditional on different partition elementsoften correspond to different states of opinion that you have not yourself decided be-tween. In a rough sense, one may imagine the shorthand expressions ‘someone’ and‘everyone’ as quantifying over different sides of yourself.16

Now for the semantics. In contrast with a number of extant theories, it is straight-forward to start with a semantics for all basic logical operators, including naturallanguage disjunction. For instance: your credences are contained in the content ofa disjunction just in case every proposition in the corresponding contextually deter-mined partition is such that your credences conditional on that proposition are con-tained in the content of one of the disjuncts. The semantic entries for ‘and’ and ‘not’are predictable variants. In shorthand:

‘S or T’ means that everyone accepts that S or accepts that T.

‘S and T’ means that everyone accepts that S and accepts that T.

16. In what follows, I often simplify my discussion by just talking about whether certain partition elementsaccept a certain constraint. It should be understood that strictly speaking, whether a proposition acceptsa constraint is relative to a measure, e.g. that your credences may satisfy the constraint that someoneaccepts that it is probably raining, while my credences fail to satisfy this same constraint.

21

‘not S’ means that no one accepts that S.17

In more formal vocabulary:

[[ori]]c = [λS . λT . {m : ∀p ∈ gc(i), m|p ∈ S or m|p ∈ T}]

[[andi]]c = [λS . λT . {m : ∀p ∈ gc(i), m|p ∈ S and m|p ∈ T}]

[[noti]]c = [λS . {m : ∀p ∈ gc(i), m|p /∈ S}]

A number of notes about the formal vocabulary are in order. The variable p rangesover sets of worlds, and m ranges over probability measures. The measure m|p is theresult of conditionalizing the measure m on the proposition p. Let us stipulate that Sis the semantic type of sets of measures. In the above entries, the variables S and Trange over values of type S. The logical operators ‘and’ and ‘or’ have semantic valuesof type 〈S, 〈S, S〉〉, whereas ‘not’ has a semantic value of type 〈S, S〉. For example, thecontent of a disjunction is a set of measures, as is the content of each disjunct.

Exactly which set of measures is the content of a disjunction depends on whatpartition context contributes to its content. Following Heim & Kratzer 1998, we saythat every context c determines an assignment function gc that specifies the values ofall contextually determined variables. The value gc(i) is the contextually determinedpartition relevant to the semantic entry spelled out above. The shorthand expression‘everyone’ corresponds to the formal expression ‘∀p ∈ gc(i)’ which quantifies overpropositions in that partition. In what follows, I use both shorthand and formalvocabulary, as the former allows me to make my arguments intuitive, while the latterallows me to make them precise.

In slightly less formal vocabulary, the semantic value of ‘S or T’ is the set ofmeasures m satisfying the following condition: for any proposition p in the relevantcontextually determined partition, m|p is either contained in the content of the firstdisjunct or in the content of the second disjunct. For example, recall that in somecontexts where you have equal credence in each possible outcome of rolling a fair die,it sounds okay for you to say:


As mentioned earlier, the sort of context that is hospitable for (40) makes a certainpartition salient: either the number rolled is low, or it is high. According to your

17. I use ‘not’ as shorthand for ‘it is not the case that’ and I treat this expression as an operator that occursjust before its argument, though ultimately one should allow many other expressions of sententialnegation at surface structure. The analogous claims hold for ‘might’, ‘must’, and ‘probably’.

22

credences conditional on it being low, the number is less than four. According toyour credences conditional on it being high, the number is probably even. Henceyour credences satisfy the content of (40), namely that everyone in the contextuallydetermined partition either accept that the number rolled is probably even or acceptthat it is less than four. In a nutshell: you believe (40), and that is why it sounds okayfor you to say it.

This explanation is incomplete as it stands. For starters, a complete explanationrequires identifying the content of each disjunct of (40) relative to the sort of contextin question, so that we may prove that your conditional credences are contained inthese contents. Appendix B.1 contains a complete explanation of why your credenceis in the content of (40), and §2.4 contains further commentary. Another clarificatorynote: the above semantic values are custom-made for logical operators embeddingepistemic vocabulary. The theory I develop assigns more traditional semantic val-ues to logical operators elsewhere. The careful reader will observe that accordingto this theory, logical operators embedding epistemic vocabulary act essentially likeepistemic vocabulary. This observation is implausible unless restricted to logical op-erators embedding epistemic vocabulary, so it is important to bear in mind that moretraditional semantic values for logical operators will be revived in §3.

2.2. A semantics for epistemic possibility and necessity modals

Here are shorthand semantic entries for epistemic possibility and necessity modals:

‘might S’ means that someone accepts that S.

‘must S’ means that everyone accepts that S.

In more formal vocabulary:

[[mighti]]c = [λS . {m : ∃p ∈ gc(i) such that m|p ∈ S}]

[[musti]]c = [λS . {m : ∀p ∈ gc(i), m|p ∈ S}]

Having expanded our lexicon, we can outline a semantics for some nested epistemicmodals. For example, (62) and (63) each mean that everyone accepts that someoneaccepts that we will hire Bob:

(62) It is definitely the case that Bob might be the best candidate for the job.

(63) It must be the case that Bob might be the best candidate for the job.

This shorthand calls attention to an important semantic feature: higher and lower

23

epistemic modals need not be associated with the same domain of quantification. Bothlogical operators and modals have indices. Assignment functions map expressionswith different indices to potentially different values. Hence unless expressions areco-indexed, context may contribute different partitions to their interpretation. Forexample, an utterance of (62) may contain modals that are not co-indexed:

(64) It is definitely1 the case that Bob might2 be the best candidate for the job.

The semantic value of (64) is as follows:

[[(64)]]c = {m : ∀p ∈ gc(1), m|p ∈ {m′ : ∃q ∈ gc(2) such that m′|q ∈ [[(65)]]c}},

where (65) is the prejacent of the inner modal in (64):

(65) Bob is the best candidate for the job.

For instance, in a context where (64) is uttered, it could be that the partition gc(1)contains propositions about what sorts of virtues matter when evaluating candidates,while the partition gc(2) contains propositions about which candidates have whatsorts of virtues. In that sort of context, your credences would satisfy (64) just in caseconditional on any proposition about what virtues matter, your credences satisfy thefollowing condition: conditional on some proposition about which candidates havewhich virtues, Bob is the best candidate for the job.

For those especially attentive to syntactic representation: strictly speaking, oursemantics could identify indexed variables as arguments of modals and logical op-erators, rather than indexing these expressions directly. For example, our formalsemantic entry for ‘must’ could be as follows, where v ranges over partitions:

[[must]]c = [λv . λS . {m : ∀p ∈ v, m|p ∈ S}]

In that case, (62) would contain two covert pronouns:

(66) It is definitely v1 the case that Bob might v2 be the best candidate for thejob.

Here the pronouns v1 and v2 denote partitions relative to contexts, according to thefamiliar semantics for referential pronouns, i.e. [[vi]]

c = gc(i). The resulting semanticvalue of ‘must vi’ matches the semantic value of ‘musti’ given above. The reader mayreplace expressions of the latter sort with their kosher substitutes throughout.18

18. For simplicity, I will sometimes talk as if the contextually supplied partition is the value of a covert pro-noun. But strictly speaking, I am neutral about the best syntactic implementation of my theory. Partee

1989 and Condoravdi & Gawron 1996 have given reasons to doubt that similar implicit arguments are

24

2.3. A small detour: advantages of constraining conditional credences

Recall from §1.1 that our use of nested epistemic modals fits naturally with the ideathat sentences constrain imprecise credal states. This idea should seem even morecompelling given all the shorthand just introduced. Suppose we model your mentalstate with a set of probability measures. In other words, suppose we model youas if you have an imaginary mental committee of subjects with precise credences.Then following Rothschild 2012, we could say that sentences constrain your mentalcommittee members, rather than your conditional credences. If a sentence demandsthat everyone accepts a content, for instance, that could just amount to demandingthat each committee member accept that content. In other words, my shorthandsemantic entries for ‘might’ and ‘must’ seem like apt translations of the followingalternative formal semantic entries:

[[might]] = [λS . {I : ∃m ∈ I such that m ∈ S}]

[[must]] = [λS . {I : ∀m ∈ I, m ∈ S}]

Here the variable m ranges over precise credal states, i.e. probability measures, whileS and I range over imprecise credal states, i.e. sets of probability measures. Thisproposal may appear to satisfy many desiderata given in §1. It is worthwhile toreflect on how my semantics differs from this proposal, and especially to notice thatthe imprecise credence proposal is deficient in two respects.

First, on the imprecise credence semantics stated above, embedding a sentenceunder ‘might’ or ‘must’ raises its semantic type. Each modal accepts sets of measuresas inputs and delivers sets of imprecise credal states as outputs. That means thata sentence with a wide-scope ‘might’ or ‘must’ has the wrong semantic type to beembedded under another epistemic modal—a bad result, given our pervasive use ofembedded modals. The most natural repair strategy requires that we model subjectsas having not just imprecise credences, but more complicated mental states. In fact,very complicated mental states are required, since subjects commonly embed epis-temic vocabulary under embedded epistemic vocabulary. For instance, recall that wehave no trouble understanding (49):


And deeper embeddings seem perfectly intelligible, as long as the context is richenough to supply the interpretations of relevant expressions. For instance, (49) soundsfine when you are trying to figure out whether there is a net along the roof of your

best analyzed as the values of covert pronouns, and I will not evaluate their arguments in this paper.

25

office building. Suppose that the local fire department occasionally puts a trampolineinstead of a net along the roof. Then we are not really licensed in saying that thereis a net along the roof, given just that it is probably the case that Jill will live if shejumps. Instead, we should say something more hedged:

(67) Probably, if it is probably the case that Jill will live if she jumps, then there isa net. (But it might be that there is a trampoline.)

In light of (49) and (67), it is hard to imagine a reason for ruling that embeddingsof epistemic vocabulary beyond a certain level of complexity are are semanticallyuninterpretable. In the absence of such a reason, our theory should deliver semanticvalues for embeddings of arbitrary complexity. Hence in order to repair the imprecisecredence proposal, we would have to model subjects as having not just sets of sets ofmeasures as mental states, but sets of sets of sets of measures, and so on. It is difficultto independently motivate such an arcane model of our mental life.

Second, semanticists like Rothschild must endorse even more complicated mod-els of mental states in order to give a semantics for graded modal vocabulary. It iseasy to imagine existential or universal quantification over members of an imaginarymental committee. But graded modals call for probability measures over committeemembers, and it is difficult to see how one could make sense of this added structurewithin the imprecise credence model without essentially describing subjects as havingprecise credences.

The semantics I defend offers a viable alternative in the neighborhood of theimprecise credence proposal. For starters, the semantics extends naturally to gradedmodals, without requiring that we represent subjects as having mental states morearcane than ordinary credences. As a result, even though it is fairly revisionary to saythat contents of sentences are sets of measures instead of sets of worlds, our model ofcontents can still be defended on the grounds that it simply reflects an independentlymotivated model of our mental life. In addition, according to our semantics, ‘might’,‘must’, and ‘probably’ are all type 〈S, S〉, and ‘if’ is type 〈S, 〈S, S〉〉. Hence complicatedsentences like (67) have well-defined semantic values.

Furthermore, our theory even has the resources to say why complicated sentenceslike (67) might nevertheless sound bad when uttered out of the blue. The same goesfor many sentences containing several referential pronouns. For instance, when ut-tered out of the blue, (68) sounds questionable at best:

(68) ?That made that do that to that.

In particular, sentences with several referential pronouns sound bad in isolation whenthere is a presumption that context will determine different denotations for different

26

pronouns. For instance, (68) sounds worse than (69), just as the nested epistemicvocabulary in (70) sounds worse than the repeated unembedded vocabulary in (71):

(68) ?That made that do that to that.

(69) It entered; it saw me; it squealed; and it fainted.

(70) ?Probably, it is probable that probably Jill will probably live.

(71) Jill will probably live; John will probably die; Janet will probably cry; andJoe will probably celebrate.

Context often determines different denotations for pronouns in sentences with nestedepistemic modals. As a result, a rich context is required for the simultaneous inter-pretation of the covert pronouns in sentences such as (67) and (70). Here again, incontrast with semantic injunctions against complicated embeddings, pragmatic ac-counts better fit the contours of our judgments about epistemic vocabulary.

2.4. A semantics for ‘probably’, ‘if’, and a covert type-shifting operator

The expression ‘probably’ has a more complicated semantic function than possibil-ity and necessity modals. The latter modals constrain your credences conditional onpropositions in a contextually determined partition. But as a graded modal, ‘proba-bly’ constrains your credences in members of the partition itself:

[[probablyi]]c = [λS . {m : m(

⋃{p ∈ gc(i) : m|p ∈ S}) > .5}]

In our shorthand: find the union of everyone that accepts that S. If you give thatproposition greater than .5 credence, then your credences are contained in the contentof ‘probably S’.19 For example, recall that if we are talking about whether Jill issuicidal, you can correctly assert:


The partition relevant to the interpretation of ‘probably’ in (50) contains two proposi-tions: either Jill is suicidal or she isn’t. Just one of these propositions accepts that Jillwill live if she jumps, namely the proposition that Jill isn’t suicidal.20 Since you give

19. This semantics follows Kratzer 1991 in taking ‘probably’ to indicate that something is more likely thannot. It is straightforward to adjust the definition so that ‘probably’ instead indicates likelihood above acontextually defined threshold. In a similar vein, it is straightforward to extend the lexicon of this paperto include other simple epistemic vocabulary, such as ‘unlikely’, ‘at least .3 likely’, ‘more likely than’,and comparative epistemic adjectives.

20. A reminder about our shorthand: your credences satisfy the constraint that a proposition accepts acontent just in case your credences conditional on that proposition are contained in that content.

27

more than .5 credence to that proposition, your credences are contained in the contentof (50), and that is roughly why it sounds okay for you to say it.

At this point, we can also give a more complete explanation of why the contentof (40) contains your credences about the outcome of rolling a fair die:

(40) It is probably2 even or1 less than four.

As mentioned earlier, the sort of context that is hospitable for (40) makes a certainpartition salient: either the number rolled is low, or it is high. A second partition isalso salient, namely the six possible outcomes of the rolling the die. The first partitiondetermines the content of ‘or’ and the second determines the content of ‘probably’.If you conditionalize your credences on the proposition that the number rolled islow, then you accept that the number is less than four. If you conditionalize yourcredences on the proposition that the number rolled is high, then you have equalcredence in each of the three high number outcomes. Hence you give more than .5conditional credence to the union of outcomes that accept the number rolled is even.That means your credences conditional on the number being high accept that thenumber is probably even. It follows from our semantics for ‘or’ that your credencesare in the content of (40), and that is roughly why it sounds okay for you to say it.

Indicative conditionals are semantically like graded modals, insofar as they alsoconstrain your credences in propositions in contextually determined partitions:

[[ifi]]c = [λS . λT . {m : m(

⋃{p ∈ gc(i) : m|p ∈ T}|⋃{p ∈ gc(i) : m|p ∈ S}) = 1}]

In other words, using our shorthand: find the union of everyone that accepts theantecedent of the conditional, and find the union of everyone that accepts the conse-quent. If you have full credence in the latter proposition conditional on the former,then your credences are contained in the content of the conditional itself.21

For example, consider the indicative conditional:

(72) If1 it is high, it is probably2 even.

The context of (72) makes a certain partition salient: either the number rolled is low, orit is high. The former proposition rejects the antecedent of the conditional, while thelatter accepts it. The former proposition also rejects the consequent of the conditional,while the latter accepts it. Hence you have full credence in the union of propositionsthat accept the consequent of (72), conditional on the union of propositions that accept

21. A disclaimer: this semantics is sufficient to address the motivating concerns of the present paper, but itis not my final word on indicative conditionals. I defend an alternative probabilistic semantics in Moss

2014, motivated by concerns that I have bracketed for ease of exposition here.

28

the antecedent. It follows from our semantics for ‘if’ that your credences are in thecontent of (72), and that is roughly why it sounds okay for you to say it.

There is one important respect in which our theory so far is incomplete. I havenot yet given a semantics for simple sentences such as:

(65) Bob is a hire.

(73) Jill jumps.

(74) The number rolled is high.

For instance, I have said certain partition propositions “accept that the number rolledis high” or “accept the antecedent of ‘if it is high, it is probably even’.” This isshorthand for a constraint on probability measures, namely that after conditionalizingon the partition proposition, the resulting measure is contained in the content of (74).Hence simple sentences like (74) must have sets of measures as their contents.

There is a natural way of associating simple sentences with sets of measures.According to standard truth conditional semantic theories, the content of a simplesentence is a set of worlds. According to my theory, the content of a simple sentenceis the set of measures that assign probability 1 to that set of worlds.22 This meansthat the theory need not start from scratch to deliver semantic values for referringexpressions, predicates, quantifiers, and so forth. Instead, a covert operator convertstraditional semantic values into alternative semantic values:

[[C]]c = [λp . {m : m(p) = 1}]

For example, the logical form of the sentence ‘Jill jumps’ is more accurately repre-sented as ‘C Jill jumps’. The semantic value of this sentence is a set of measures,namely {m : m({w . Jill jumps in w}) = 1}. Since simple sentences accompanied bythe covert operator C have sets of measures as semantic values, simple sentences canbe arguments of type 〈S, S〉 operators and type 〈S, 〈S, S〉〉 operators.

Furthermore, arguments of logical operators can include both simple sentencesand sentences containing epistemic vocabulary. For example, the logical form of (40)is more accurately represented as follows:

(40) [ probably2 [ C [ it is even ] ] ] or1 C [ it is less than four ].

This detail lets us finally give a complete explanation of why your credences arecontained in the content of (40) in the context described above. In our most recent

22. This content may seem inappropriate, since giving full credence to some proposition is a very strongconstraint. In short, I have made some assumptions in order to simplify my discussion, and refinementsof the theory in §3 address this worry. For a more thorough treatment of these issues, see chapter 2 ofMoss 2014.

29

explanation of this fact, we said that “if you conditionalize your credences on theproposition that the number rolled is low, then you accept that the number is lessthan four.” The more complete explanation replaces this with the following claim: ifyou conditionalize your credences on the proposition that the number rolled is low,then the resulting credence distribution has full credence that the number is less than four.Fans of gory detail should see Appendix B.1 for an explanation in formal vocabulary.

To sum up so far: I have introduced a semantics for eight expressions, includingbasic logical operators and epistemic vocabulary. According to this theory, there is asense in which logical operators are epistemic vocabulary. If they occur in the midstof epistemic modals, logical operators deliver constraints on credences that dependon what is accepted by propositions in contextually determined partitions. Assigningthe same sort of semantic values to logical operators and epistemic vocabulary helpsexplain the behavior of the latter. The way that ‘might’ and ‘must’ and ‘probably’interact with each other has a lot in common with the way they interact with logicaloperators. According to my theory, this is to be expected, as both are interactionsbetween different sorts of epistemic vocabulary.

3. A number of refinements and explanations

I have made three simplifying assumptions in developing the semantics in §2. Inorder to refine the semantics, I will identify these assumptions and say how they canbe removed. The first is about the standard effect of assertion, namely that whenyou hear an assertion with a certain content, you generally come to have credencescontained in that content. This claim abstracts away from lying, pretense, supposition,and so on. But more importantly, even in normal cases of assertion, your credencesdo not really come to be contained in asserted contents. The contents of sentencesare simply too strong to play that role. The content of a simple sentence is the set ofmeasures that assign probability 1 to some proposition. But it is arguably almost neverrational to have full credence in a proposition. Having full credence in a propositionmakes you bet on that proposition at arbitrarily risky odds, and makes your beliefin that proposition rationally unrevisable by conditionalizing on further evidence. Inother words, it makes you have blind faith in a proposition. Assertions rarely if everhave such a dramatic effect.

It might be possible to answer this complaint by saying that our theory governsideal cases, and that ideal communication really does make subjects have full cre-dence in asserted contents. But even this answer should be accompanied by somesuggestions about the effect of assertion in realistic cases. Here is one suggestion: as

30

far as the conversational record is concerned, an act of assertion is a proposal thatthe content of the assertion be accepted for conversational purposes. For example,suppose you assert that it is raining. Then it will sound bad for either of us to say oreven suppose that it might not be raining:

(75) a. Alice: Oh no. It is raining.b. Bob: #If it might not be raining, we should buy some sunglasses.

If your assertion is not challenged or retracted, it does seem that we accept its strongcontent for conversational purposes. Having accepted that content, Alice and Bob doresemble subjects who would accept bets at arbitrary odds, conversationally speaking,as they cannot even raise the possibility that it is not raining.23

In addition to affecting the conversational record, an assertion affects conversa-tional participants. An assertion does not exactly affect your credences, but somethingmore like your credences for practical purposes. For example, an assertion of (75-a) mayhave the effect that for practical purposes, it is just as if your credences are containedin its content—i.e. when it comes to your preferences and decisions, it is just as ifyou have full credence in the proposition that it is raining. This account of assertionis designed to mimic contemporary accounts of full belief according to which youbelieve a proposition when you can treat it as certain for practical purposes. For in-stance, according to Weatherson 2005, you believe a proposition roughly just in caseconditionalizing on that proposition changes none of your preferences over salientoptions.24 The analogous account of assertion says you accept an assertion just incase updating your credences on its content changes none of your preferences oversalient options. For example, you accept (75-a) just in case updating on the proposi-tion that it is raining changes none of your preferences over salient options. In otherwords, given the analogous account of full belief, you accept (75-a) just in case youbelieve that it is raining. This seems like exactly the right result, as assertions of sim-ple sentences are traditionally taken to constrain your full beliefs. To sum up: giventhe above accounts of full belief and assertion, you accept an assertion of a simplesentence just in case you believe its content. Even if our accounts of full belief andassertion must ultimately be modified, the latter will deliver intuitive results as longas it mirrors the former.

The second simplifying assumption made in §2 is that logical operators have justone semantic value each. In fact, my theory requires a serious and significant revision

23. This effect of assertion on the conversational record is elegantly explained by models on which thecontext set itself is fine-grained. For further discussion, see the context probabilism introduced in §8 ofYalcin 2007.

24. Cousins of this principle are defended by Williamson 2000, Ganson 2008, Fantl & McGrath 2010,and Schroeder & Ross 2014.

31

of this assumption, namely that logical operators have different types of semanticvalues, depending on whether they embed non-epistemic or epistemic vocabulary.For example, the semantic value of negation given in §2 must have a different type ofsemantic value of negation in simple sentences such as:

(76) John does not smoke.

For suppose (76) has the following logical form:

(77) not1 [ C John smokes ]

Then according to the semantics for ‘not’ in §2, the content of (76) contains yourcredences just in case there is no proposition in the relevant partition such that youhave full credence that John smokes, given that proposition. This is not at all what (76)intuitively means. For many partitions, it is very easy for your credences to satisfythis constraint, even if you have a relatively high credence that John smokes. It shouldintuitively be much harder for your credences to be contained in the content of (76).In fact, in light of our semantics for other sentences without epistemic vocabulary, thecontent of (76) should intuitively be the set of measures that assign probability 1 tothe proposition that John does not smoke.

The appropriate refinement of our semantics involves distinguishing between log-ical operators that embed epistemic vocabulary and logical operators that embed sim-ple sentences. A simple sentence actually has a set of worlds as its semantic value,which can serve as the argument of a covert type-raising operator. This covert oper-ator need not occur immediately above every simple sentence. In our refined seman-tics, logical operators can have sets of worlds as arguments. In addition to reinstatingtraditional semantic values for simple sentences, we reinstate traditional semantic val-ues for logical operators, adding these values to those introduced in §2. Hence logicaloperators have different semantic values in different linguistic contexts: traditionalvalues when their arguments are sets of worlds, and our §2 semantic values whentheir arguments are sets of measures. The logical form of ‘John does not smoke’ is(78) rather than (76):

(76) not1 [ C John smokes ]

(78) C [ not John smokes ]

The sentence under the covert operator has a set of worlds as its semantic value,namely the proposition that John does not smoke. Hence the content of (76) is the setof measures that assign probability 1 to that proposition, as desired.

This refinement of our semantics addresses several other potential problems as

32

well. For instance, suppose the logical form of (79) is given by (80):

(79) John smokes or Jill drinks.

(80) [ C John smokes ] or1 [ C Jill drinks ]

Then if the content of (79) contains your credences, there must be some contextuallydetermined partition such that conditional on each proposition in the partition, youeither have full credence that John smokes or full credence that Jill drinks. But in-tuitively you can utter a disjunction even if no such propositions would make yousure of either disjunct. In addition, our semantics should predict that the followinginference is valid:

(81) a. It is not the case that John does not smoke.b. Hence: John smokes.

And likewise for the following:

(82) a. It is not the case that both John smokes and Jill drinks.b. Hence: either John does not smoke or Jill does not drink.

However, from the premise that no one accepts that no one accepts that John smokes,we cannot generally infer that John smokes. From the premise that no one acceptsthat everyone accepts both that John smokes and Jill drinks, we cannot generally inferthat everyone either accepts: (a) that no one accepts that John smokes, or (b) thatno one accepts that Jill drinks. In other words, if the covert type-raising operator ‘C’occurs immediately above ‘John smokes’ and ‘Jill drinks’ in (81) and (82), the result-ing inferences are invalid. Hence our §2 semantics does not automatically validatedouble negation elimination or applications of De Morgan’s Laws, even restricted toinferences not containing any epistemic vocabulary.

The above refinement of our semantics validates instances of these inferenceswhere appropriate. For instance, the logical form of ‘John smokes or Jill drinks’ isgiven by (83):

(83) C [ John smokes or Jill drinks ]

The semantic value of (83) is the set of measures that assign probability 1 to theproposition that either John smokes or Jill drinks. This semantic value may containyour credences even if no salient information would make you sure of either disjunct.The logical form of the double negation elimination argument is not (84) but (85):

(84) a. not1 not2 C John smokesb. Hence: C John smokes

33

(85) a. C not not John smokesb. Hence: C John smokes

The logical form of the De Morgan’s Law argument is not (86) but (87):

(86) a. not1 [ C John smokes and2 C Jill drinks ]b. Hence: [ not3 C John smokes ] or4 [ not5 C Jill drinks ]

(87) a. C not [ John smokes and Jill drinks ]b. Hence: C [ [ not John smokes ] or [ not Jill drinks ] ]

It is not hard to verify that the latter inferences are valid, as desired.I should emphasize that on the semantics developed here, logical operators are

polymorphic. This claim constitutes a loss of theoretical parsimony, which some read-ers may count as a cost of my theory. Some may even judge that this cost is ultimatelytoo great to be outweighed by the benefits of the theory. However, several facts mayhelp mitigate this cost. For starters, it is a familiar observation that logical opera-tors can embed expressions of various semantic types; indeed, “virtually every majorcategory can be conjoined with ‘and’ and ‘or’” (Partee & Rooth 1983). The theoryI defend introduces semantic values with novel semantic types, such as sets of prob-ability measures and functions from sets of measures to sets of measures. In otherwords, the same sort of semantic type variation in the operators in (88) and (89) ispresent in (90) and (91):

(88) John is young, and Mary is young.

(89) John and Mary are young.

(90) It is probable that John is young and certain that he is handsome.

(91) It is probable but not certain that John is young.

A useful and familiar theory of (88) and (89) is that higher-type occurrences of logicaloperators are the product of type-raising (cf. Partee & Rooth 1983 for a canoni-cal early discussion of generalized conjunction and disjunction). This theory can beextended to (90) and (91) and other uses of logical operators embedding epistemicvocabulary.

In more detail: Partee & Rooth 1982 use a recursive definition to distinguish theconjoinable semantic types for which generalized logical operations are defined. Theirdefinition counts both sets of measures, and functions from sets of measures to sets ofmeasures, as conjoinable. Gazdar 1980 proposes simple recursive definitions of gen-eralized conjunction and disjunction, thereby unifying logical operations on differentcategories. For instance, the function denoted by ‘and’ always yields either the inter-

34

section of its arguments, or the function mapping each element to the intersection ofits image under those arguments. If the contextually supplied partition is the trivialpartition, then the definitions proposed by Gazdar generate exactly the same semanticvalues for logical operators as those assigned by my semantic theory. The semanticvalue of (90) is the intersection of the sets of measures denoted by each conjunct. Thesemantic value of ‘it is probable but not certain’ is roughly the function mapping eachconstraint to the intersection of the set of measures that count its prejacent as probableand the set of measures that count its prejacent as not certain.

In more complicated cases, the relevant type-shifting principles are more com-plicated than the recursive principles that Gazdar defines. But more complicatedtype-shifting principles are also not without precedent in the literature; as Partee

1986 notes, type-shifting principles are heterogenous (363). Partee observes that nom-inalization corresponds to a lexical rule relating the use of ‘blue’ as an adjective and‘blue’ as a proper noun. The situation with higher-type logical operators is similar, asthere is a common core of meaning shared by ‘and’ as a conjunction of propositionsand ‘and’ as a conjunction of sets of measures.

These classic discussions of generalized logical operators provide our theory withuseful precedents. They also provide a useful moral, namely that variation in semantictype is not necessarily as costly as unsystematic lexical ambiguity. Partee & Rooth

1983 argue that “the potential disadvantage of having multiple interpretations. . . isoffset by the processing strategy of trying the simplest type first” (13). This argumentapplies equally well given our semantics for logical operators embedding epistemicvocabulary. The semantic type of a logical operator still uniquely determines its se-mantic value, just as with the examples discussed by Partee and Rooth. The semanticsI defend does not introduce any unforced choices as to what ‘and’ means in some con-struction. This fact does not erase the cost of imputing variation in semantic type tothe logical operators, but it should make that cost easier to bear.

A couple of final notes on logical operators. First, we have identified a respectin which ‘if’ is distinctive among logical operators. Like ‘probably’ and epistemicpossibility and necessity modals, ‘if’ is thoroughly epistemic. There is no evidencefor a second reading of ‘if’ that accepts sets of worlds rather than sets of measuresas arguments. Second, there is also no evidence that logical operators have sets ofmeasures as arguments when they do not embed any overtly epistemic vocabulary.It is not just hard to hear readings riddled with covert type-raising operators, as in(84-a) and (86-a). They seem genuinely unavailable. In other words, semantic valuesof sentences are raised from sets of worlds to sets of measures only when forced. If asentence contains no epistemic vocabulary, then a single type-raising operator scopes

35

over that entire sentence, making it have the right sort of content for assertion. If asentence does contain epistemic vocabulary, then lower type-raising operators occuronly where they are required to make embedded sentences have the right sorts ofcontents to serve as arguments of that vocabulary. These facts may follow from moregeneral injunctions against unforced type-raising, perhaps along the lines of claimsdefended in Partee & Rooth 1983, though for reasons of space I shall leave thederivation of these facts as an open question for future investigation.

The third simplifying assumption made in §2 concerns the scope of my semanticsfor epistemic vocabulary. The assumption is that epistemic expressions have just onesemantic value each, namely those introduced in §2. In fact, sentences with epistemicvocabulary have multiple readings. Just like logical operators, epistemic expressionssometimes have exactly the sort of semantic values that traditional truth conditionalsemantic theories say they have. In such cases, epistemic expressions do not exhibitany of the behavior that motivates us to reject those semantic theories. For instance,recall that traditional theories come under fire from Yalcin 2007 for failing to predictthe infelicity of constructions like the following:

(92) Suppose that there might be snipers and there are not snipers.

(93) If there might be snipers and there are not snipers. . .

However, notice that the following constructions sound fine:

(94) Suppose that there might—for all you know—be snipers, and there are notsnipers.

(95) If there might—for all you know—be snipers, and there are not snipers. . .

Furthermore, in the presence of substantial contextual cues, (92) and (93) intuitivelymean the same thing as (94) and (95). For instance, imagine that you are in themilitary, and your instructor gives you the following advice on jungle warfare:25

There are a lot of deadly snipers in the jungle. Before you walk into an area wherethere are lots of high trees, if there might be snipers hiding in the branches, clearaway the foliage with flamethrowers. Do not worry about wasting equipment.Burn the foliage whenever there might be snipers. If there might be snipers andthere are not snipers, you will have wasted a flamethrower. But if there are snipersand you do not use that flamethrower, you will have wasted human lives.

In the context of this monologue, (93) sounds fine. In short, it sounds equivalent to(95). The arguments in Yalcin 2007 against the standard truth-conditional semanticsfor ‘might’ do not succeed in this context.

25. This military monologue is a variation on an example from Egan et al. 2005. The original exampleserves a different dialectical purpose, and embeds a simple ‘might’ sentence rather than a conjunction.

36

The same goes for other arguments against the standard semantics for epistemicmodals. For instance, imagine that after being trained with the above information,several soldiers enter a jungle warfare situation in which they have the followingradio conversation with their commanding officer:

(96) a. Soldier: Should we use flamethrowers to clear the branches?b. Commander: Is it the case that there might be snipers?c. Soldier: The scouts have not made a report, so there might be snipers.d. Commander: Then obviously you should be using your flamethrowers.

Furthermore, imagine that some military students are eavesdropping and judging thejungle soldiers as part of their basic training. The students may say (97), even if theyhave been informed that there are no snipers in the branches:

(97) They should use their flamethrowers, since there might be snipers.

In a similar spirit, the jungle soldiers may later defend themselves by saying that theywere right to use flamethrowers, since there might have been snipers. The soldiersmay stand by (96-c), even if they later find out that there were no snipers in thebranches. And finally, in contrast with some other uses of epistemic vocabulary, itis not at all suspiciously hard to identify the correct modal base for ‘might’ in (96-c).Intuitively, the soldiers are simply saying that for all they know, there are snipers inthe branches. A traditional contextualist semantics along the lines of Kratzer 1977

seems able to get the content of their utterance exactly right.26

It is important to get clear on the dialectical force of such examples. The failure ofanti-contextualist arguments when it comes to (96-c) does not demonstrate that con-textualist semantic theories are sufficient. In the same vein, the success of the samearguments elsewhere does not demonstrate that contextualist theories are unneces-sary. The proper reconciliation of our observations is that epistemic expressions havemultiple semantic values. In recent literature, facts about embeddings, eavesdrop-ping, and retraction have been understood as supporting anti-contextualist projects.But these facts are better understood as providing us with diagnostic tests. Someuses of epistemic vocabulary exhibit distinctive embedding, eavesdropping, and re-traction behavior. Some uses do not. The semantics developed here is a theory ofthe former, while standard accounts are theories of the latter. In that spirit, factsabout eavesdropping should play a role in the literature on epistemic vocabulary likethe role played by facts about projection behavior in the literature on presupposition

26. The contextualist-friendly uses of epistemic vocabulary include those that are “exocentric,” in the ter-minology introduced by Lasersohn 2005 and adapted to discussions of epistemic modals by Cappelen

& Hawthorne 2009 and others. For more examples in a similar spirit, see §3 of Dorr & Hawthorne

2012.

37

(cf. Karttunen 1973, Geurts 1999). In both cases, some distinctive behavior calls outfor some modification of a standard theory of content. And in both cases, the behav-ior itself is so distinctive that it may adequately function as partly constitutive of thesort of language that is best modeled by the modified theory. Just as we have the pro-jection test battery for presuppositional content, we may have a similar test batteryfor non-propositional content, useful for identifying exactly what uses of epistemicvocabulary the semantic theory in §2 is meant to describe.

In addition to removing three assumptions from §2, we may add pragmatic prin-ciples to our theory. The §2 semantics does not include any principles that distinguishthe order in which conjuncts or disjuncts are uttered. Hence supplementary princi-ples must explain phenomena that dynamic semantic theories aim to predict, such asthe similarity of the following sentences:


(98) It’s in a funny place, or there’s no bathroom in this house.

(99) John always feels hungry, and he goes to the movies almost every other day.

The alleged facts to be explained are that certain readings of these sentences are un-available: ‘probably’ in (40) cannot just be talking about situations where the numberrolled is high; ‘it’ in (98) cannot be referring to the bathroom in the house in question;and ‘always’ in (99) cannot just be talking about situations where John goes to themovies. Meanwhile, reversing the order of disjuncts or conjuncts makes each of thesereadings available. According to many dynamic semantic theories, that is becausethe semantic value of a disjunction depends on the effects of each disjunct on certainlocal contexts, where reversing the order of the disjuncts changes which local contextsare relevant to those effects. However, pragmatic principles provide strong alterna-tive explanations of the behavior of (98) and (99). Absent any special extra-linguisticcontext, (100) raises the salience of bathrooms more than ‘it’s in a funny place’ does,and (101) raises the salience of situations in which John goes to the movies more than‘John always feels hungry’ does:

(100) There’s no bathroom in this house. . .

(101) John goes to the movies almost every other day. . .

In fact, extra-linguistic context alone is rarely rich enough to determine the referentof ‘it’ when you say (98). But if you are rather desperately searching around anunfamiliar house while your child is making obvious signs of needing a bathroom,it may sound fine for someone to tell you (98) out of the blue. In the same vein,extra-linguistic context alone is rarely rich enough to determine that ‘always’ in (99)

38

is just talking about situations where John goes to the movies. But say we have justbeen debating whether frequent movie-goers become so accustomed to the smell ofpopcorn that it no longer makes them feel hungry when they go to the movies. Thenthe first conjunct of (99) can just be talking about situations where John goes to themovies, exactly as if it had occurred after the second conjunct instead of before.

The §2 semantics fits best with a similar pragmatic theory of (40). Talking aboutrolling a fair die makes a certain partition salient, namely the partition of possibleoutcomes of rolling the die. It is rare that extra-linguistic context alone is rich enoughto determine that some other partition is relevant to the interpretation of ‘or’ in (40).But it is not impossible, as illustrated by our example of the felicitous utterance of(40). In that example, extra-linguistic context alone makes another partition salient,namely the partition containing the proposition that the number rolled is low and theproposition that it is high. That is why your credences can be contained in the se-mantic value of (40), as long as you have a high conditional credence that the numberis even given that it is high. There are other ways of raising the salience of the samealternative partition, such as using the following sentence fragment:

(102) The number rolled is less than four, or. . .

By contrast, the following phrase does not raise the same partition to salience:

(103) The number rolled is probably even, or. . .

These facts about salience help explain why it is easier for context to contribute analternative partition to the interpretation of ‘or’ in sentences starting with (102) asopposed to (103), but also why it is not impossible for context to contribute an alter-native partition in the latter case. It seems inappropriate to promote this explanationto a semantic rule. First, it is difficult to see how a semantic rule could be defeatedas necessary in creatively constructed contexts, as pragmatic generalizations are. Sec-ond, our pragmatic theory follows from general principles about salience that havelittle to do with conjunction or disjunction. These concerns by no means settle thedebate, but they do support our pragmatic theory as a viable alternative to dynamicsemantic accounts of (40).27

A few additional definitions complete our semantic theory. Any context sensitiv-

27. Disjunctions like (40) may remind the reader of “modal splitting” cases introduced by Landman 1986.For comparison, it may be helpful to note that Landman does not argue that modal splitting accompa-nies any disjunction, but merely that sometimes “we have to assume that [some] sentence is added withmodal splitting to make sense of it” (205). For a contrasting defense of dynamic semantic theories ofdisjunction, see Dever 2012. In addition to certain salience facts, using ‘either’ may enable context tocontribute an alternative partition to the interpretation of a disjunction, possibly by inducing contrastivefocus (cf. Hendriks 2004). For reasons of space I shall leave this proposal as an open question for futureinvestigation.

39

ity enables equivocation in arguments, namely by evaluating earlier claims relativeto one context and later claims relative to another. In deciding whether an argumentis valid, we must stipulate that we are only concerned about whether its conclusionfollows from its premises when all are evaluated relative to a single context. Fur-thermore, it cannot be that a conclusion follows from some premises just in case anyworld contained in the latter is contained in the former, since contents are sets of mea-sures rather than sets of worlds. Alternative semantic theories call for an alternativenotion of logical consequence. A conclusion follows from some premises just in caseevery probability measure contained in the latter is contained in the former. Formally,an argument is valid in a context c just in case the semantic value of its conclusionin c contains the union of the semantic values of its premises in c. An argument isvalid simpliciter just in case it is valid in every context. Finally, it is important to notethat the proper objects of semantic evaluation are sentences containing indices. Forexample, the following sentences constitute one argument:

(104) a. If1 C low, probably2 C odd.b. Not3 probably2 C odd.c. Hence: not3 C low.

And the following sentences constitute a distinct argument:

(105) a. If1 C low, probably2 C odd.b. Not1 probably2 C odd.c. Hence: not1 C low.

These arguments sound just the same in English. But one may well be valid even ifthe other is not. This fact accounts for certain behavior of epistemic vocabulary inclassically valid arguments.

4. How our theory accounts for the behavior of epistemic vocabulary

Now for the payoff. The semantics in §2 and pragmatics in §3 account for the distinc-tive behavior of epistemic vocabulary described in §1.

4.1. Nested epistemic vocabulary

The semantics in §2 explains why nested epistemic modals signal that different opin-ions about some subject are in play. A sentence with nested epistemic modals con-strains your credences conditional on contextually determined propositions. Theseconditional credences are different opinions about some subject. For example, re-

40

member that you might say (64) if you are torn between various ways of evaluatingjob candidates:

(64) It is definitely1 the case that Bob might2 be the best candidate for the job.

As mentioned in §2.2, on the most natural reading of (64) in this context, the partitionused to interpret ‘definitely’ contains propositions about what virtues matter whenevaluating candidates. Just settling what virtues matter does not leave you entirelycertain which candidates are best for the job, since you may still be unsure whichcandidates have which virtues. The different opinions that are salient in the contextof (64) are credences conditional on propositions about what sorts of virtues matter.The content of (64) is the constraint that each opinion consider it possible that Bob isthe best candidate for the job, conditional on any proposition about which candidateshave which virtues.

It is easier to make sense of nested epistemic modals by imagining that each con-ditional credence distribution is the opinion of some expert, since it is easier to makesense of internal facts by imagining that they correspond with external ones. If someexperts most value teaching experience and others most value research quality, forinstance, conditional credences governed by (64) may simply match unconditionalcredences that actual experts have. The subject forms her credences by splitting thedifference between expert credences, weighting the credence of each expert accord-ing to her credence that they are trustworthy with respect to what sorts of virtuesmatter when evaluating candidates. In cases where there are no obvious facts aboutwhich imagined experts disagree—such as what sorts of virtues matter in evaluatingcandidates—it is harder to associate each expert with an element of some obvious par-tition associated with the outer modal. But it does seem that whenever we imaginea subject splitting the difference between various expert credences, we still visual-ize that compromise as if there are various propositions that she splits her credencebetween. In other words, we imagine the subject acting as if there is a sense of ‘trust-worthy’ according to which she should simply defer to the most trustworthy expert,and then simply splitting her credence between propositions about which expert ismost trustworthy. These propositions form an artificial partition, and we imagine thatpartition being contributed by context to the interpretation of the outer modal.

The semantics in §2 can also explain the behavior of contradictory nested epis-temic modals. For example: recall that you could correctly utter (60) after a fair diehas been placed under a cup according to whether the number rolled was low or high:

(60) It might be that the number rolled is probably even.

41

In the same circumstances, since you have just .5 credence that the die landed on aneven number, you could also correctly utter (106):

(106) It is not probably even.

But you could not utter the conjunction of these sentences:

(107) #It might be that the number rolled is probably even and not probably even.

It sounds fine to utter (60) and (106) separately because ‘might’ and ‘not’ are mostnaturally interpreted using different partitions. It is most natural to say (60) whenyou are thinking about which cup the die is under. The modal ‘might’ is interpretedusing the partition that the number rolled is either low or high. (60) means thatsomeone in that partition accepts that the number rolled is probably even, and yourcredences satisfy that constraint. It is most natural to say (106) when you are not mostconcerned with which cup the die is under, but rather with your all-things-consideredcredences about the die. The operator ‘not’ is interpreted using the trivial partition.(106) means that no one in that partition accepts that the number rolled is probablyeven, and your credences also satisfy that constraint.

This derivation demonstrates some nice features of the §2 semantics. First, ourintuitive feeling that someone is talking about all-things-considered credences corre-sponds to a specific feature of semantic interpretation, namely that context contributesa trivial partition to the interpretation of an epistemic modal. Second, we can say whythe content of epistemic vocabulary under negation often seems to be the complementof the content of that epistemic vocabulary. In the absence of defeating contextualcues, subjects commonly use epistemic expressions under negation to constrain all-things-considered credences. Hence ‘not’ is interpreted using the trivial partition,thereby denoting the simple operation of taking the complement of the content of theembedded sentence.

If you utter (106) and (107) together, by contrast, it is natural to interpret ‘might’and ‘not’ using the same partition. This follows from more general facts about the in-terpretation of contextual parameters. For instance, it is natural to hear the followingsentence as expressing a contradiction:

(108) #No one danced and everyone danced.

It is hard to hear ‘no one’ and ‘everyone’ as quantifying over different domains in(108), even if you are inclined to be charitable. For similar reasons, it is hard tointerpret ‘might’ and ‘not’ using different partitions in (107). The same goes for ouroriginal conjunction:

42

(24) #It is not the case that it is probably John and it might be the case that it isprobably John.

This sentence sounds bad when some experts believe that a certain masked murdereris probably John, but most experts believe it is probably Mary. In isolation, the firstconjunct expresses the constraint that you side with the majority in forming your all-things-considered credences. The second expresses the constraint that you not ignorethe possibility that the minority opinion is most trustworthy. But when the conjunctsare put together, it is hard to interpret them using such different partitions.

Finally, our theory can explain the intuitive connection between nested modals,evidential weight, and credal resilience. Recall that Eric can say (27) instead of (25)because his high credence that Liem is wearing green is resilient, and based on a lotof evidence:

(25) It might be the case that Liem is probably wearing green.

(27) It is almost certainly the case that Liem is probably wearing green.

Joyce 2005 suggests that evidential weight manifests itself in credal resilience whenyour credences are mediated by chance hypotheses. In particular, “the weight of evidencetends to stabilize [your] credence in a particular way: it stabilizes credences of chancehypotheses, while concentrating most of the credence on a small set of these hypothe-ses” (166). Eric’s credences about what Liem is wearing are mediated by propositionsrelevantly like chance hypotheses, namely claims about how much Liem likes wear-ing green. The more Liem likes wearing green, the higher the objective chance thathe will wear green on any given day. Having seen Liem wear green on 500 out of 800

days, Eric concentrates almost all of his credence on specific hypotheses about exactlyhow much Liem likes wearing green. That is why the weight of Eric’s evidence makeshis high credence that Liem is wearing green so resilient.

That is also why Eric can say (27). The sentence (27) is intuitively interpretedusing just these same hypotheses about how much Liem likes wearing green. Inshorthand: Eric gives almost all his credence to hypotheses that accept that Liem isprobably wearing green. According to our semantics, that is why his credences arecontained in the content of (27). Madeleine has much less evidence. She may givea majority of her credence to hypotheses that accept that Liem is probably wearinggreen. But she should still give considerable credence to a number of other hypothesescompatible with his wearing green on 5 out of 8 days. For instance, it is compatiblewith Madeleine’s evidence that Liem likes red more than green, but dumped all hisred shirts in the laundry just before the 8 days in question. Hence according to oursemantics, her credences are not contained in the content of (27). Having a lot of

43

evidence simultaneously makes Eric’s high credence that Liem is wearing green veryresilient, and licenses his embedding ‘Liem is probably wearing green’ under verystrong epistemic modals. In more generality: when your credences are mediatedby something like chance hypotheses, increasing evidential weight simultaneouslymakes your credences more resilient and licenses your embedding correspondingconstraints under stronger epistemic modals.

4.2. Epistemic vocabulary under disjunction

The theory in §2 and §3 explains why you can assert (38) even when you cannot assertits first disjunct and you can deny its second disjunct:


As discussed in §3, the first disjunct of (38) makes a certain partition salient, namelythat the number rolled is either low or high. In the resulting context, (38) has just thesame content as (40):


As discussed in §2.4 and Appendix B.1, your credences are contained in that content.That is why you can assert (38). This holds even though your credences are notcontained in the content of the first disjunct, since you do not have full credence inthe proposition that the number rolled is less than four. And it holds even thoughyour credences are contained in the content of the negation of the second disjunct,for reasons just reviewed in §4.1. This discussion highlights another nice feature ofour semantics: your credences may be contained in the content of a disjunction evenwhen they are not contained in the content of either disjunct, as long as all membersof the salient partition accept at least one of the disjunct constraints. This featurealleviates the central concern about ‘or’ exportation that Schroeder 2012 raises forthe semantics in Yalcin 2007.

In the meantime, disjunctive syllogism remains valid for disjunctions free of epis-temic vocabulary. For example, consider the disjunction:

(79) C [ John smokes or Jill drinks ]

If the content of (79) contains your credences, then you have full credence that eitherJohn smokes or Jill drinks. If you can also deny the second disjunct, you have fullcredence that Jill does not drink. From this it follows that you have full credence thatJohn smokes, and hence that you can assert the first disjunct by itself.

44

In addition, our §3 theory explains why disjunctions such as (40) sound felicitousin some contexts and infelicitous in others. In some contexts, the partition that thenumber rolled is low or high is salient. In those contexts, ‘or’ in (40) is interpretedusing that partition, and the disjunction sounds fine. In some contexts, the partitionthat the number rolled is low or high is not salient. In those contexts, ‘or’ is interpretedusing the trivial partition, and the disjunction sounds bad. In short, absent contextualcues, it is hard to know what non-trivial partition you might have in mind when yousay the first half of (40), just as it is hard to know what specific situations you mighthave in mind when you say the first half of (99):

(99) John always feels hungry, and he goes to the movies almost every other day.

To sum up: disjunct order makes a purely pragmatic contribution to the content ofa disjunction. If extra-linguistic context does not make a certain partition salient,using disjuncts in a certain order can have that effect. But if extra-linguistic contextalready makes that partition salient, changing around the order of the disjuncts hasno additional semantic effect. That is why reversing disjunct order does not affect theinterpretation of disjunctions like (40) in contexts where they are already felicitous.

4.3. Epistemic vocabulary over indicatives

The semantic theory given in §2 explains why you can assert (44) when we are talkingabout whether there is a net along the roof of your office building, but not when weare talking about whether Jill is suicidal:


The logical form of (44) is as follows:

(109) probably1 [ if2 [ C Jill jumps off the building ] [ C she will die ] ]

In shorthand: the content of (44) contains your credences just in case you give morethan .5 credence to the union of everyone in gc(1) who accepts that Jill will die ifshe jumps. Say we are talking about whether there is a net along the roof. Thengc(1) contains two propositions: that there is a net, and that there is no net. Thelatter proposition accepts that Jill will die if she jumps. And you give that propositionmore than .5 credence. Hence according to our semantics, the content of (44) containsyour credences. That is why you can assert (44).28 By contrast, say we are talkingabout whether Jill is suicidal. Then gc(1) contains different propositions: that Jill is

28. Similar reasoning explains the observation by Kaufmann 2004 that when certain partitions are salient,the credence that we assign to ‘if I pick a red ball, it will have a black spot’ fails to match our conditionalcredence in the consequent given the antecedent.

45

not suicidal, and that she is suicidal. The latter proposition accepts that Jill will dieif she jumps. But you do not give that proposition more than .5 credence. And theformer proposition does not accept that Jill will die if she jumps. Hence according toour semantics, the content of (44) does not contain your credences. That is why youcannot assert (44).

The semantic theory given in §2 also explains why you can correctly assert (49)when we are talking about whether there is a net along the roof:

(49) If1 it is probably2 the case that Jill will live if3 she jumps, then there is a net.

In shorthand: the content of (49) contains your credences just in case you have fullconditional credence in the union of everyone in gc(1) who accepts that there is a net,given the union of everyone in gc(1) who accepts that it is probably the case that Jillwill live if she jumps. Again, if we are talking about whether there is a net, gc(1)contains two propositions: that there is a net, and that there is no net. The formerproposition accepts that it is probably the case that Jill will live if she jumps. Thelatter does not. Since you have full conditional credence that there is a net, given thatthere is a net, your credences are indeed contained in the content of (49). That is whyyou can assert it when we are talking about whether there is a net along the roof.

The context dependence of (49) and (50) accounts for why you can assert eitherpremise separately but you cannot conclude (51):



(51) There is a net.

The content of each premise depends on whether we are talking about whether thereis a net along the roof, or about whether Jill is suicidal. Your credences are containedin the content of (49) in the former but not the latter contexts. And they are containedin the content of (50) in the latter but not the former contexts. There is no singlecontext in which both premises are assertable, and hence no context in which you arelicensed in concluding (51) on the basis of (49) and (50).

4.4. Epistemic vocabulary in classically valid arguments

Recall from §1.5 that the following argument seems invalid when we are reasoningabout the outcome of rolling a fair die:

(52) a. If it is low, it is probably odd.b. It is not probably odd.

46

c. Hence: it is not low.

As discussed in §4.1, sentences like (52-b) intuitively concern your all-things-consideredcredences, and so the negation operator in (52-b) is naturally interpreted using thetrivial partition. As discussed in §4.2, linguistic context can make non-trivial parti-tions salient. For instance, recall that ‘it is high or. . . ’ makes a certain partition salient,namely that the number rolled is either low or high. The construction ‘if it is low. . . ’makes the same partition salient, and so the conditional operator in (52-a) is natu-rally interpreted using this partition. Hence the most natural reading of (52) has thefollowing logical form:

(110) a. if1 [ C it is low ] [ probably2 C it is odd ]b. not3 [ probably2 C it is odd ]c. Hence: C not [ it is low ]

This argument is indeed invalid. There are contexts in which the semantic value ofits conclusion does not contain the disjunction of the semantic values of its premises.See Appendix B.2 for a formal proof.29

A similar diagnosis follows for our troublesome instance of constructive dilemma:

(53) a. If it is low, it is probably odd.b. If it is high, it is probably even.c. It is either low or high.d. Hence: either it is probably odd or probably even.

The conclusion of this argument sounds bad because ‘or’ in (53-d) is naturally inter-preted using the trivial partition, and you neither have greater than .5 credence thatthe number rolled is even, nor greater than .5 credence that it is odd. In other words,the most natural reading of (53) has the following logical form:

(111) a. if1 [ C it is low ] [ probably2 C it is odd ]b. if1 [ C it is high ] [ probably2 C it is even ]c. C [ it is low or it is high ]d. Hence: [ probably2 C it is odd ] or3 [ probably2 C it is even ]

29. As an editor helpfully points out, there seems to be a special problem with modus tollens that thereisn’t for modus ponens; it is easier to hear invalid interpretations of the former. This felt difference be-tween modus ponens and modus tollens corresponds to a theoretical difference between these inferences.In particular, the conclusion of a modus tollens inference such as (110-c) contains a negation operatorembedding a simple sentence. Standard injunctions against unforced type-raising predict that this oc-currence of ‘not’ will express propositional negation. Meanwhile, the constraint embedded in the secondpremise of (110) forces a higher-type interpretation of ‘not’ in that premise. Hence the determinationof the semantic type of ‘not’ in modus tollens inferences ensures that these inferences are given invalidinterpretations, whereas no similar mechanism leads us to hear modus ponens inferences as invalid.

47

This argument is invalid. See Appendix B.3 for a formal proof.These inferences share a couple of features that are responsible for their invalid-

ity. First of all, the major logical operators in these arguments are not co-indexed. Anon-trivial partition is used to interpret ‘if’ in the first premise of (52), while a triv-ial partition is used to interpret ‘not’ in the second. A non-trivial partition is usedto interpret ‘if’ in the first two premises of (53), while a trivial partition is used tointerpret ‘or’ in the conclusion. In other words, there is an equivocation in these argu-ments. The arguments are just as bad as arguments where context naturally suppliesdifferent domains of quantification to overt quantifiers, such as:

(112) a. Everyone failed the exam.b. Everyone who failed the exam took the exam.c. Hence: everyone took the exam.

Second of all, certain logical operators have different semantic values in the premisesand conclusions of these arguments. The operator ‘not’ has higher type in the secondpremise of (52) and lower type in its conclusion. The operator ‘or’ has lower type inthe third premise of (53) and higher type in its conclusion.

Although it is not directly relevant to our discussion of (52) and (53), it is worthmentioning a third feature that may be responsible for the invalidity of argumentscontaining epistemic vocabulary. I have occasionally assumed that probability mea-sures assign some probability to each member of contextually determined partitions.This simplifying assumption deserves much more discussion than I can give it in thispaper. A number of arguments become more complicated once we allow that contex-tually determined partitions may contain propositions in which you have absolutelyno credence. For example, consider the following alleged counterexample to modusponens from McGee 1985:

(113) a. If a Republican wins the election, then if it’s not Reagan who wins itwill be Anderson.

b. A Republican will win the election.c. Hence: if it’s not Reagan who wins, it will be Anderson.

It may be that (113-b) is assertable because for practical purposes, you have full cre-dence that Reagan will win the election. Then according to my theory, whether youcan assert (113-c) will depend partly on your credences conditional on some proposi-tion in which you have absolutely no credence, practically speaking. This complicatesour evaluation of the validity of (113).

If we restrict our attention to arguments without these troublesome features, wecan resurrect classical inference rules. Recall from §3 that sentences containing indices

48

are the proper objects of semantic evaluation. Hence strictly speaking, argumentscontaining indices are evaluated for validity. There is an entire family of modus tollensarguments. One such argument is invalid. But our acceptance of other instances ofmodus tollens is justified. The same goes for constructive dilemma arguments. In moreprecise terms: let us say that a context c is well-behaved with respect to a measurem and an argument A just in case for every index i on epistemic vocabulary in A,we have m(p) > 0 for all propositions p ∈ gc(i). Let us say that an argument A isquasi-valid just in case for every measure m and every context c that is well-behavedwith respect to m and A, if m is contained in the semantic values of the premises of Ain c, then m is contained in the semantic value of the conclusion of A in c. Then thefollowing argument schema is quasi-valid:

(114) a. if1 P, Qb. not1 Qc. Hence: not1 P

In other words, replacing each letter in the above schema with a sentence whosesemantic value is a set of measures always yields a quasi-valid argument. The samegoes for the following argument schema:

(115) a. if1 P, Qb. if1 R, Sc. P or1 Rd. Hence: Q or1 S

In addition, instances of chancy modus ponens are valid simpliciter:

(116) a. probably1 Pb. if1 P, Qc. Hence: probably1 Q

Thus our semantics vindicates our attachment to certain instances of modus tollens andconstructive dilemma, even instances riddled with epistemic vocabulary (cf. Appen-dices B.4 and B.5 for proofs). In addition, our semantics validates any instance ofchancy modus ponens (cf. Appendix B.6). As long as we are in a well-behaved context,we are perfectly justified in inferring according to the argument forms given above.

I noted above that ‘or’ in the conclusion of (53) is interpreted relative to the trivialpartition, and that this helped explain why the inference was invalid. It is instructiveto note that on my theory, ‘or’ in the conclusion of a constructive dilemma may not al-ways be interpreted relative to the trivial partition. In fact, sometimes context suppliesa non-trivial partition and informants actually accept the conclusions of constructive

49

dilemma inferences similar to (53).This phenomenon is already familiar in the case of constructive dilemma infer-

ences embedding deontic vocabulary. For example, suppose that a bunch of minersare trapped together in one of several shafts. We can either save all the miners byblocking the correct shaft or kill just one miner by blocking no shaft. Consider thefollowing inference:

(117) a. If the miners are in shaft A, we ought to block A.b. If the miners are in shaft B, we ought to block B.c. The miners are in shaft A or shaft B.d. So either we ought to block A or we ought to block B—we just don’t

know which!

There is an acceptable reading of (117-d) in this inference, on which it communicatesroughly that certain further evidence would either decisively recommend that weblock A or decisively recommend that we block B.

The same phenomenon is less familiar, but just as present with constructivedilemma inferences embedding epistemic vocabulary. For example, suppose that youhave taken a test to determine how likely it is that you have cancer, where the testproduced a colored line that is either red or green or blue. Red means that cancer isvery likely; green means that it is very unlikely; and blue means that the test failedto yield any useful information.30 The doctor tells you that the test did not produce ablue line. Then you may infer:

(118) a. If the test line was red, it is very likely that I have cancer.b. If the test line was green, it is very unlikely that I have cancer.c. The test line was red or green.d. So either it is very likely that I have cancer or very unlikely that I have

cancer—we just don’t know which!

Just as with (117-d), many informants hear an acceptable reading of the conclusion(118-d). On this reading, (118-d) communicates roughly that certain further evidencewould either strongly confirm that you have cancer or strongly disconfirm that youhave cancer. Happily, my theory naturally accommodates this reading. The ‘or’ in(118-d) may be interpreted relative to a contextually salient partition, namely the pos-sible test results. This reading of (118-d) will contain your credences after you updatethem on (118-c), since each remaining test result will confirm one of the disjuncts of(118-d). Hence (118-d) will sound acceptable in this context.

30. I am grateful to Josh Dever for suggesting that I discuss this sort of example here.

50

Of course, we cannot conclude that context will always supply some non-trivialpartition whenever an instance of constructive dilemma is uttered. In fact, if we re-move ‘we just don’t know which’ from (118-d) and instead say that red and green areequally likely test results, informants may start hearing the conclusion as unaccept-able. The point here is just that my theory accommodates some contextual variabilityin whether we hear constructive dilemma inferences as valid or invalid, and that thisis a benefit rather than a cost of my theory.

4.5. The scope of my theory and avenues of further research

In assessing the explanations in §4.1–4.4, readers may observe that my semantic the-ory yields predictions about concrete cases only in conjunction with supplementaryassumptions to the effect that certain contexts make certain partitions available as thevalues of certain pronouns. This may cause some readers to worry that the theoryitself does not generate strong enough predictions to constitute an interesting pieceof research. In response, it is worth noting that it would be unusual to demand thata contextualist theory come equipped with conditions for deriving the available val-ues of covert pronouns from facts about the context. To compare a salient example,compare the defense of contextualism in Kratzer 1981. Kratzer does not equip herreaders with hard and fast rules for determining what accessibility relations couldconstitute the modal base supplied by a particular context. It is not hard to under-stand why. The facts about context that determine the availability of pronoun valuesare so subtle that it may be literally impossible for any theorist to provide rules thatyield predictions about arbitrary examples that readers may construct themselves.

Even in the absence of such rules, however, it is possible to give strong argumentsin support of a contextualist theory of some vocabulary. In short, it is possible to give“non-constructive proofs” that the semantic value of an expression depends on certainsorts of contextual parameters. These arguments do not establish that my theorycorrectly predicts that a particular reading of a sentence is available in a particularcontext. Instead they present indirect evidence that we use epistemic vocabulary toexpress different constraints on our conditional credences in different contexts.

The indirect arguments supporting my theory highlight five features of sentencescontaining epistemic vocabulary. In brief: facts about partitions affect whether sen-tences containing epistemic vocabulary are acceptable in various contexts. In somecontexts, sentences containing epistemic vocabulary express multiple constraints onconditional credences. There are infelicitous reports of disagreement between speak-ers who utter apparently contradictory sentences in contexts where different parti-tions are salient. The behavior of contextually supplied partitions mimics attested

51

behavior of contextually supplied quantifier domains. Finally, sentences containingepistemic vocabulary exhibit binding effects where different partitions are relevant tothe interpretation of different values of the bound variable.

The first three of these five features are closely related. For starters, recall thatfacts about salient questions affect whether (44) or (45) is assertable in a context:


(45) Probably, if Jill jumps off the building, she will live.

(44) is assertable when someone is wondering whether there is a net along the roofof the building, and (45) is assertable when someone is wondering whether Jill issuicidal. This means that in contexts where it is not clear whether the first or secondquestion is at issue, sentences like (44) can have multiple readings. For instance, youmight say (120) to clarify which reading of (119) is intended in a particular context:

(119) Is it likely that Jill will die if she jumps off this building?

(120) Wait a minute. Are you just asking whether I think there is still a safety netalong the roof, or are you asking whether I think Jill may be ready to jumpfrom the roof even though—I am pretty sure—the net isn’t there anymore?

In the absence of clarification, though, you may use (44) or (45) to express consistentattitudes about Jill, namely because you may consistently believe both that there is nonet along the roof and that Jill would not jump unless there was a net. That is alsowhy it can sound bad to report someone who says (44) and someone who says (45) asdisagreeing about likelihood facts. They may have consistent beliefs and differ onlywith respect to which beliefs they would use (44) to report. In the same vein, insofaras you are not sure which answer to give to (119), it is not that you cannot make upyour mind about whether Jill will probably live if she jumps. In both cases, you areperfectly clear about the facts, and merely unclear about which facts you would beusing the conditional to report.

In addition to indicative conditionals, graded modal vocabulary may have differ-ent readings in different contexts. For example, suppose Alice, Bob, Casey, and Dylanare among your good friends. Alice and Bob are married, and Casey and Dylan aremarried. There is a party next door, and you are wondering whether your variousfriends are there. Alice and Casey are best friends with each other and often go toparties together. The same goes for Bob and Dylan. But none of them really enjoysgoing out with their best friend’s spouse, so all four friends seldom end up at thesame party together. In a normal context, your knowledge of these facts is enough tolicense your saying:

52

(121) If Alice is more likely than Bob to be at the party, then Casey is more likelythan Dylan to be there.

If you were to find out that Alice was almost certainly at the party, for instance, thenyou would conclude that Casey was almost certainly there, while Dylan was almostcertainly absent. But here is a twist: suppose that someone overhears you say (121)and replies:

(122) But what if the reason that Alice is more likely than Bob to be at the partyis that Bob has just recently been diagnosed with some serious illness?Then Casey and Dylan would both surely be present to provide Alice withemotional support.

After someone says (122), it is hard to continue affirming (121). The speaker of (122)raises the possibility that Alice is more likely than Bob to be at the party in virtueof Bob having some serious illness. Conditional on that possibility, you accept theantecedent of (121), reject its consequent, and reject the conditional itself. Henceyou no longer take yourself to be in a position to say (121). But at the same time,you may stand by your original reason for saying it. If you had to update yourcredences on the information that Alice is more likely than Bob to be at the party,your resulting credence that Casey was there would be higher than your credence thatDylan was there.31 In fact, you may even feel that it is somewhat unfair to criticizeyour original assertion by introducing an obscure possibility into the conversation.The more obscure the possibility, the more you may feel as if your interlocutor ischanging the subject rather than taking what you said at face value, though it maystill be hard to ignore possibilities once they have been introduced.

This discussion should sound familiar, namely because natural language quanti-fiers exhibit just the same behavior. In a normal context, you might say (123) only tohave someone reply with (124):

(123) There is nothing in the fridge.

(124) But what about condiments? Isn’t there mustard on the refrigerator door?

After someone says (124), it is hard to continue affirming (123), even though you maystand by your original reason for saying it. In fact, you may even feel that it is unfairto criticize your original assertion by introducing unanticipated refrigerator contents

31. In this example, (122) raises a possibility that is probabilistic in nature. The possibility in the antecedentof (121) is also probabilistic; if you update on that possibility, your credences are updated on a non-propositional constraint. This sort of learning is governed by generalizations of classic updating rules.See Diaconis & Zabell 1982 for an introduction to this method of maximum entropy updating, and seeYalcin 2007 and Moss 2013 for discussion of this sort of updating in the context of semantic theoriesof epistemic vocabulary.

53

into the conversation. The less anticipated the contents, the more you may feel as ifyour interlocutor is changing the subject; it feels inappropriate to challenge (123) onthe basis of dust or air molecules, for instance, though it may still be hard to contractthe domain of quantification once it has been expanded.

It is easy for my theory to explain these similarities between epistemic vocabularyand quantifiers, namely because on my theory, introducing potential probabilisticevidence just amounts to expanding a contextually supplied domain of quantification.The informal gloss of my semantics explicitly uses ‘everyone’ and other quantifiers tohighlight this feature of the semantic theory. (122) and similar sentences expand thedomain over which ‘everyone’ quantifies.

For example: when you utter (121), you may be considering several simple like-lihood assessments that you might end up with. For instance, you may get evidencethat Alice is more likely or equally likely or less likely to be at the party than Bob.The same likelihood assessments will say that Casey is more likely or equally likely orless likely to be at the party than Dylan, respectively. The indicative (121) is acceptedby each member of this contextually supplied partition, and that is why it sounds justfine. By contrast, uttering (122) introduces more complicated likelihood assessmentsinto the contextually supplied partition, including your credences conditional on theinformation that Alice is more likely to be at the party than Bob for reasons related toserious illness. (121) is not accepted by each member of this expanded partition, andthat is why it no longer sounds fine.

In addition to these asymmetries in context shifting, epistemic vocabulary sharesanother feature with paradigmatic implicit arguments, namely that both exhibit bind-ing effects. For example, many accept that a covert location argument in (125) isbound in sentences such as (126):

(125) The local bar is closed.

(126) Everywhere John goes, the local bar is closed.

In just the same way, quantifiers can bind covert partition arguments of sentences like(127), namely in sentences such as (128):

(127) Alice might be a probable hire.

(128) However you look at it, Alice might be a probable hire.

Suppose that Alice is a candidate for a job in our department. There are two waysof collecting information about the likelihood that we will hire her. First, Alice hasseveral colleagues, each of whom has expressed an informal opinion about whethershe is a probable hire. Second, we have collected several formal letters, each of which

54

contains an opinion about whether she is a probable hire. At least one colleaguesays that she is a probable hire, and at least one letter says that she is a probablehire. In this context, even if we are unsure whether informal or formal opinions aremore trustworthy, we may still accept (128). To spell out this reading: we accept(128) because regardless of whether we form our credences about Alice by collatinginformal or formal likelihood assessments, we give some credence to an assessmentaccording to which she is a probable hire. This reading is easy for my theory toexplain, since on my theory, ‘however you look at it’ can bind the covert partitionargument of ‘might’ in (128).

This behavior of ‘however you look at it’ is not without precedent. For instance,suppose that causation is essentially contrastive, and that ‘cause’ has a covert partitionargument.32 For instance, if the Indian government and a horrible drought are eachcausally relevant to a famine, then a contextually supplied partition of alternativesmay determine whether (129) or (130) is acceptable:

(129) The Indian government caused the famine.

(130) The drought caused the famine.

But now suppose that the Indian government was secretly depleting groundwaterresources in a way that caused the drought. Then the following will sound acceptable:

(131) However you look at it, the Indian government caused the famine.

There are multiple sets of causal alternatives relevant to (129), including sets of theo-ries about potential political causes and sets of theories about potential meteorologicalcauses. (131) is acceptable because any such set of alternatives will contain some the-ory that says that the Indian government is responsible for the famine.

It is just the same with (128): there are multiple sets of theories relevant to ourjudgment that Alice is a probable hire, including sets of informal opinions and setsof formal opinions. (128) is acceptable because any such set will contain some the-ory that says that Alice is a probable hire. To sum up: binding effects in sentenceslike (128) are evidence for a contextualist theory of epistemic vocabulary, where thebound argument of the higher modal ranges over various ways of generating likeli-hood assessments, i.e. various partitions of propositions about what evidence shouldbe trusted in assessing the embedded probabilistic claim.

The above arguments constitute indirect evidence for my theory. It is also possibleto argue for the theory more directly, namely by assessing substantive predictionsdelivered by my theory in conjunction with supplementary principles about available

32. See Mackie 1974 for a classic discussion of causal contrastivism and Schaffer 2005 for a more detaileddiscussion of the semantics of causal claims. The famine example is due to Hart & Honoré 1985.

55

values of partition pronouns. For starters, my theory entails that for many cases ofnested epistemic modals, if context supplies both the inner and outer modal with thesame simple partition, the resulting semantic value could be expressed using just onemodal. For example:

(132) It is unlikely that Alice is a probable hire.

Suppose we evaluate both ‘unlikely’ and ‘probable’ using the partition consisting ofthe claim that Alice is a hire, and the claim that she is not. According to my theory,the result is the set of measures that give less than .5 credence to the claim that Aliceis a hire. According to my theory, that set of measures is also the semantic value ofthe sentence:

(133) It is unlikely that Alice is a hire.

Hence general pragmatic reasoning should lead us to interpret at least one modal in(132) relative to a fine-grained partition. Against the background of Gricean adviceagainst unnecessary prolixity, we should charitably interpret (132) as having somemeaning that could not have been more succinctly expressed using (133). The samegoes for many nested modal constructions. To sum up: my theory predicts that nestedmodals are naturally interpreted as being about higher-order information.

A number of other useful results follow from facts about what questions are com-monly addressed by certain sentences. For instance, modals embedding a simpleprejacent are commonly used to address the question of whether or not that prejacentis the case. For example, the following sentences are commonly used to address thesimple question of whether John is guilty:

(134) John is probably guilty.

(135) John might be guilty.

(136) If John is guilty, then. . .

This fact helps explain the result that ‘probably’ is commonly used to express thesimple constraint that you give more than .5 credence to its prejacent. Consider anycontext where the proposition that John is guilty is a member of the contextuallysupplied partition, or even just a member of the algebra generated by the contextuallysupplied partition elements. In any such context, my semantics for ‘probably’ entailsthat you accept (134) just in case you give more than .5 credence to the propositionthat John is guilty.33

33. This result follows from the axiom of finite additivity, since for the purposes of this paper, I am settingaside cases where the contextually supplied partition has infinitely many elements.

56

By contrast, sometimes (134) is instead used to directly address more complicatedprobabilistic questions such as:

(137) Is it probable that John is guilty?

For example, suppose that you are sitting on a jury and you hear several expertstestify about the likelihood that John committed a certain murder. A large majorityof the experts say that John is probably innocent, but some experts insist that Johnis probably guilty. Suppose the judge then calls you to answer (137). There aremultiple reactions you may have. On the one hand, you may feel as if you do notknow the answer to the question, since you are not in a position to rule out the experttestimony of the minority. But on the other hand, you may feel as if you shouldsimply answer ‘no’ to the question, since you have greater than .5 credence that thelarge majority of experts have formed the right opinion about John. This multiplicityof readings of (137) reflects a multiplicity of underlying questions that the judge couldbe addressing. On the one hand, the judge may be interested in whether she shouldstrike the testimony of particular experts as misleading. On the other hand, she maysimply be wondering whether she should give more than .5 credence to the first-order claim that John is guilty. In the first-order context, the judge uses (137) toaddress the question of whether John is guilty, and an affirmative answer expressesa simple constraint. By contrast, in situations where speakers are explicitly assessingthe reliability of various pieces of evidence, ‘probably’ sentences are used to expressmore complicated constraints on credences.

The same sort of generalization helps explain the behavior of simple ‘might’ sen-tences. For example, ‘John might be guilty’ commonly raises the question of whetherJohn is guilty. Hence ‘John might be guilty’ not only relies on the contextually sup-plied partition for its semantic value, but affects that same partition. And as longas the algebra generated by the partition contains the claim that John is guilty, mytheory predicts that ‘John might be guilty’ will sound acceptable to any subject whohas some credence that John is guilty, since that subject must therefore end up givingsome credence to at least one partition element that entails that John is guilty.34

Finally, the same sort of generalization applies to antecedents of indicative con-ditionals. For example, it is hard to imagine someone uttering (138) without therebyaddressing the question of whether the number rolled is prime:

(138) If the number rolled is prime, it must be even.

For example, say that a fair die is rolled and placed under a red cup just in case it

34. I am grateful to an anonymous referee for suggesting that my theory could deliver this prediction.

57

comes up 2, and placed under a green cup otherwise. Conversations about this casemay address multiple questions, each corresponding to a different partition of logicalspace. (138) may address which cup the die is placed under. But in addition, (138)itself raises another question, namely whether a prime number was rolled. And aslong as this question is in the algebra generated by the contextually salient partition,my theory predicts that (138) will sound bad to anyone with equal credence in eachdie outcome.35 The same goes for many indicative conditionals, which commonlyraise the question of whether their antecedents are acceptable.

To sum up: simple epistemic sentences are commonly used to address whethertheir prejacent is the case. This holds for ‘probably’ sentences, ‘might’ sentences, andindicative conditionals. Further investigation should explore whether similar gener-alizations hold for other epistemic vocabulary, such as strong modals and epistemicadjectives. In the end, further research should also decide whether such generaliza-tions follow from semantic stipulations or from general pragmatic reasoning.

The choice between semantic and pragmatic explanations arises for other con-structions involving epistemic vocabulary as well. I will conclude this paper by ex-plaining some facts about strong modals that have been recently argued to be prag-matic in nature. Suppose that we are going to a party and wondering whether ourfriend Ted is already there. Consider the following minimal pair:

(139) Ted is there.

(140) Ted must be there.

It is not appropriate to utter or even merely suppose either of these sentences alongwith the negation of (139):

(141) # Suppose that Ted is not there and that he is there.

(142) # Suppose that Ted is not there and that he must be there.

This suggests that both (139) and (140) are semantically inconsistent with the negationof (139). In spite of this similarity, though, (140) feels less forceful than (139). Forinstance, ‘your keys must be in the drawer’ inspires less confidence than ‘your keysare in the drawer’ when we are looking for my keys. Furthermore, ‘must’ carries anevidential signal, namely that you have come to believe the prejacent as a result ofsome indirect inference. For example, (140) is inappropriate when you are staring ata party and see that Ted is present, but appropriate if you believe that Ted is at theparty on the basis of seeing his car parked outside.36

35. The unassertability of (138) follows from the fact that the resulting partition will contain elements thataccept that the number rolled is prime, but not that it must be even.

36. For further discussion of these observations, see von Fintel & Gillies 2010 and the earlier discussions

58

These similarities and differences between (139) and (140) are explained by mytheory of epistemic vocabulary. Recall that ‘John is probably guilty’ could address thesimple question of whether John is guilty, or a more complicated question about howlikely it is that John is guilty. In the same spirit, (139) and (140) could address thesimple question of whether Ted is at the party, or a more complicated question abouthow likely it is that Ted is at the party. The relevant observation for our purposes isthat compared with simple sentences, sentences containing epistemic vocabulary aremore likely to address more complicated questions about likelihoods. The statementthat Ted is at the party addresses the question of whether Ted is at the party, whilethe statement that Ted must be at the party intuitively addresses some question aboutthe likelihood that Ted is at the party, affirming that the likelihood is indeed veryhigh. This intuitive distinction neatly corresponds to a theoretical distinction: (139)prompts context to supply a coarse-grained partition to the evaluation of epistemicvocabulary, where the elements of the partition simply say whether Ted is at the party.(140) prompts context to supply a more fine-grained partition, where the elements ofthe partition correspond to evidence about whether Ted is at the party.

According to my semantics, (139) and (140) have the same effect on your cre-dences, namely that you only have credence in partition elements that accept that Tedis at the party. But since (140) prompts the introduction of a more fine-grained parti-tion, (140) demands that you are certain of the proposition that Ted is at the party invirtue of being certain of some union of evidence propositions, each of which definitely con-firms that Ted is at the party. Hence my theory explains why (139) and (140) are bothsemantically inconsistent with the claim that Ted is not at the party, namely becauseboth sentences demand that you give full credence to the proposition that Ted is atthe party. At the same time, though, the assertion of (140) is the result of one or moreinferences. The evidential component of ‘must’ corresponds to the confirmation of itsprejacent by one or more elements of the contextually supplied partition. Followingvon Fintel & Gillies 2010, my theory avoids misinterpreting this evidential featureas genuine semantic weakness. Rather, ‘must’ sentences inspire less confidence thansimple sentences because they highlight potential sources of doubt in their semanticvalues. The constraint expressed by a ‘must’ sentence may be rejected because youdoubt that the relevant evidence propositions support the constraint, or because youdoubt the evidence propositions themselves.

There are several other subjects that deserve investigation beyond the scope of thispaper. For instance, we should explain why your credences may be contained in thecontent of an indicative conditional, even if you have no credence in its antecedent.

they reference, including Karttunen 1972 and Groenendijk & Stokhof 1975.

59

That explanation may call for a more fine-grained model of your mental life, accord-ing to which you have primitive conditional credences. In addition, we have set asidecases where context contributes infinite partitions to the contents of sentences; suchcases will ultimately demand modifications of the theory developed here. Further-more, arguments in this paper do not settle exactly what it is to have a credence forpractical purposes, or exactly what sorts of attitudes are constrained by assertions.Further development of the theory should also discuss shared conversational featuressuch as the common ground, saying whether these features have some probabilis-tic structure and saying how they are affected by probabilistic assertions (cf. Yalcin

2012a for commentary and progress on this project). And finally, a more completetheory should give even stronger guidelines for how context determines which parti-tions are used for interpreting epistemic expressions, perhaps even connecting theseguidelines with independently motivated literature on questions under discussion (cf.Roberts 2012).

This paper is a progress report. The main goal of this paper is not to prove thatthe theory developed here is correct, nor that it is the only way to account for theobservations we started with. The goals are more modest, namely to characterizethe behavior of epistemic vocabulary and to develop a theory that makes sense ofthat behavior. The theoretical moral: epistemic vocabulary may concern not just youropinions about particular propositions, but more structured properties of your opin-ions. The empirical moral: context plays a role in determining exactly what structureis relevant. The semantic and pragmatic theories informed by these morals providea unified account of several distinctive features of epistemic vocabulary. In additionto the specific theories defended here, I hope that the more general theoretical andempirical morals informing these theories may prove to be useful springboards forfurther research.

60

A. Lexical entries

For ease of reference, here is the semantics for the fragment I have discussed:

[[ori]]c = [λS . λT . {m : ∀p ∈ gc(i), m|p ∈ S or m|p ∈ T}]

[[andi]]c = [λS . λT . {m : ∀p ∈ gc(i), m|p ∈ S and m|p ∈ T}]

[[noti]]c = [λS . {m : ∀p ∈ gc(i), m|p /∈ S}]

[[mighti]]c = [λS . {m : ∃p ∈ gc(i) such that m|p ∈ S}]

[[musti]]c = [λS . {m : ∀p ∈ gc(i), m|p ∈ S}]

[[probablyi]]c = [λS . {m : m(

⋃{p ∈ gc(i) : m|p ∈ S}) > .5}]

[[ifi]]c = [λS . λT . {m : m(

⋃{p ∈ gc(i) : m|p ∈ T}|⋃{p ∈ gc(i) : m|p ∈ S}) = 1}]

[[C]]c = [λp . {m : m(p) = 1}]

In addition, as discussed in §3, epistemic expressions and lower-type logical operatorsretain their traditional semantic values.

B. Derivations

In this section, I use ‘high’ as shorthand for ‘the die landed on an high number’, ‘one’for ‘the die landed on a one’, and so on. Let us expand the natural language fragmentunder discussion in this paper with lexical entries such as [[high]] = {w : high in w},[[one]] = {w : one in w}, and so on. Let c0 be a context that resolves the values ofreferential variables as follows:

gc0(1) = {[[low]], [[high]]}

gc0(2) = {[[one]], [[two]], [[three]], [[four]], [[five]], [[six]]}

gc0(3) = {>}

In §B.1–B.3, all semantic values are computed relative to c0. Superscripts are omittedfor readability. Let m0 be the credence distribution you should have regarding howa certain die landed, if you are certain it is fair. In other words: m0([[one]]) = 1/6,

61

m0([[two]]) = 1/6, and so on, and conditional on each proposition about how the dielanded, m0 assigns probability 1 to the correct proposition about its parity and size,counting numbers up to three as low and numbers above three as high. This credencedistribution will function as our countermodel for the invalidity proofs in §B.2–B.3.

B.1. An example of ‘probably’ under disjunction

We demonstrate that m0 is contained in the semantic value of (40):

(40) [ probably2 C it is even ] or1 [ C less than four ]

The precise semantic value of (40) is as follows:

[[(40)]] = {m : ∀p ∈ gc0 (1),

m|p ∈ [[probably2 C it is even]] or

m|p ∈ [[C it is less than four]]}

= {m : ∀p ∈ gc0 (1),

m|p ∈ {m′ : m′(⋃{p′ ∈ gc0 (2) : m′|p′ ∈ [[C it is even]]}) > .5} or

m|p ∈ [[C it is less than four]]}

= {m : ∀p ∈ gc0 (1),

m|p′ ∈ {m′ : m′(⋃{p′ ∈ gc0 (2) : m′|p ∈ {m′′ : m′′({w : it is even in w}) = 1}}) > .5} or

m|p ∈ {m′′′ : m′′′({w : it is less than four in w}) = 1}}

The following facts follow from the construction of m0:

m0|[[low]]([[less than four]]) = 1

m0|[[high]]|[[four]]([[even]]) = 1

m0|[[high]]|[[six]]([[even]]) = 1

And so we may conclude:

m0|[[low]] ∈ [[C it is less than four]]

m0|[[high]]|[[four]] ∈ [[C it is even]]

m0|[[high]]|[[six]] ∈ [[C it is even]]

Furthermore, m0|[[high]]([[four]]⋃[[six]]) = 2/3 > 1/2. As a result, since gc0(2) con-

tains both [[four]] and [[six]], we can conclude that m0|[[high]] ∈ [[probably2 C it is even]].Finally, since gc0(1) contains just [[low]] and [[high]], it follows that m0 is contained inthe content of the disjunction (40) itself.

62

B.2. The invalidity of modus tollens

We demonstrate that the following argument is invalid:

(143) a. if1 C low, probably2 C oddb. not3 probably2 C oddc. Hence: C not low

In what follows, we show that m0 ∈ [[(143-a)]], m0 ∈ [[(143-b)]], and m0 /∈ [[(143-c)]].First, note that [[(143-a)]] = {m : m(B|A) = 1}, where A and B are defined as follows:

A =⋃{p ∈ gc0(1) : m|p ∈ [[C low]]}

B =⋃{p ∈ gc0(1) : m|p ∈ [[probably2 C odd]]}

First we wish to test whether m0 ∈ [[(143-a)]]. The answer depends on m0(B|A).The proposition A is the following union of propositions:

A =⋃{p ∈ gc0(1) : m0|p ∈ [[C low]]}

=⋃{p ∈ {[[low]], [[high]]} : m0|p ∈ [[C low]]}

=⋃{p ∈ {[[low]], [[high]]} : m0|p ∈ {m′ : m′([[low]]) = 1}

Since (m0|[[low]])([[low]]) = 1, [[low]] is a member of the set of propositions whoseunion is A. But since (m0|[[high]])([[low]]) 6= 1, [[high]] is not a member of that set.Since [[low]] and [[high]] are the only candidate members of the set whose union is A,we may conclude that A = [[low]]. Similarly, B is the following union of propositions:

B =⋃{p ∈ gc0(1) : m0|p ∈ [[probably2 C odd]]}, and

=⋃{p ∈ {[[low]], [[high]]} : m0|p ∈ [[probably2 C odd]]}

=⋃{p ∈ {[[low]], [[high]]} :

m0|p ∈ {m′ : m′(⋃{p′ ∈ {[[one]], [[two]], [[three]], [[four]], [[five]], [[six]]} :

m′|p′ ∈ {m′′ : m′′([[odd]]) = 1}}) > 1/2}}

Since p′ = [[one]], [[three]] are among the values for which (m0|[[low]])|p′([[odd]]) = 1,and we have m0|[[low]]([[one]]

⋃[[three]]) = 2/3 > 1/2, we may conclude that [[low]] is

in the set of propositions whose union is B. Furthermore, since the values for which(m0|[[high]])|p′([[odd]]) = 1 include at most the values p′ = [[one]], [[three]], [[five]], andwe have m0|[[high]]([[one]]

⋃[[three]]

⋃[[five]]) = 1/3 ≯ 1/2, we may conclude that

[[high]] is not in the set of propositions whose union is B. Since [[low]] and [[high]]

63

are the only candidate members of the set whose union is B, we may conclude thatB = [[low]]. Finally, since m0([[low]]|[[low]]) = 1, it follows that m0 ∈ [[(143-a)]].

Second, we compute the semantic value [[(143-b)]]:

[[(143-b)]] = {m : ∀p ∈ gc0 (3), m|p /∈ [[probably2 C odd]]}

= {m : ∀p ∈ {>}, m|p /∈ [[probably2 C odd]]}

= {m : m /∈ [[probably2 C odd]]}

= {m : m /∈ {m′ : m′(⋃{p ∈ gc0 (2) : m′|p ∈ [[C odd]]}) > 1/2}}

= {m : m /∈ {m′ : m′(⋃{p ∈ gc0 (2) : m′|p([[odd]]) = 1}) > 1/2}}

= {m : m /∈ {m′ : m′(⋃{p ∈ {[[one]], [[two]], . . . , [[six]]} : m′|p([[odd]]) = 1}) > 1/2}}

Since p = [[one]], [[three]], [[five]] are the values for which m0|p([[odd]]) = 1, and wehave m0([[one]]

⋃[[three]]

⋃[[five]]) = 1/2 ≯ 1/2, we have that m0 /∈ [[probably2 C odd]],

and it follows that m0 ∈ [[(143-b)]].Third, [[(143-c)]] = {m : m([[not low]]) = 1} = {m : m({w : w /∈ [[low]]}) = 1}.

But m0({w : w /∈ [[low]]}) = 1/2 6= 1, so m0 /∈ [[(143-c)]]. Hence the argument from[[(143-a)]] and [[(143-b)]] to [[(143-c)]] is not valid in the context c0, and therefore it is notvalid simpliciter.

B.3. The invalidity of constructive dilemma

We demonstrate that the following argument is invalid:

(144) a. if1 C low, probably2 C oddb. if1 C high, probably2 C evenc. C [low or high]d. Hence: [ probably2 C odd ] or3 [ probably2 C even ]

As demonstrated in §B.2, we have m0 ∈ [[(144-a)]]. An analogous argument demon-strates that m0 ∈ [[(144-b)]], if we simply replace ‘low’, ‘high’, ‘odd’, ‘even’, ‘one’, ‘two’,‘three’, ‘four’, ‘five’, and ‘six’ as they occur in the argument with ‘high’, ‘low’, ‘even’,‘odd’, ‘four’, ‘five’, and ‘six’, ‘one’, ‘two’, and ‘three’, respectively. It just remains tobe shown that m0 ∈ [[(144-c)]] and m0 /∈ [[(144-d)]].

First, we compute the semantic value [[(144-c)]]:

[[(144-c)]] = {m : m([[low or high]]) = 1}= {m : m({w : w ∈ [[low or high]]}) = 1}= {m : m({w : w ∈ [[low]] or w ∈ [[high]]}) = 1}

64

Since m0({w : w ∈ [[low]] or w ∈ [[high]]}) = 1, it follows that m0 ∈ [[(144-c)]].Second, we compute the semantic value [[(144-d)]]:

[[(144-d)]] = {m : ∀p ∈ gc0(3), m|p ∈ [[probably2 C odd]] or m|p ∈ [[probably2 C even]]}

= {m : ∀p ∈ {>}, m|p ∈ [[probably2 C odd]] or m|p ∈ [[probably2 C even]]}

= {m : m ∈ [[probably2 C odd]] or m ∈ [[probably2 C even]]}

But recall from §B.2 that m0 /∈ [[probably2 C odd]], and an analogous argument demon-strates that m0 /∈ [[probably2 C even]]. From this it follows that m0 /∈ [[(144-d)]].

B.4. The quasi-validity of modus tollens

Formally, an argument A is quasi-valid just in case: for any measure m and any contextc such that for every index i on epistemic vocabulary in A we have m(p) > 0 for allp ∈ gc(i), if m is contained in the semantic values of the premises of A in c, then m iscontained in the semantic value of the conclusion of A in c.

We can demonstrate that the following argument schema is quasi-valid, i.e. thatreplacing each schematic letter with a sentence whose semantic value is a set of mea-sures always yields a quasi-valid argument:

(145) a. if1 P, Qb. not1 Qc. Hence: not1 P

Let m be an arbitrary measure contained in both [[(145-a)]] and [[(145-b)]], and let c bean arbitrary context such that m(p) > 0 for all p ∈ gc(1). Since m ∈ [[(145-a)]], weknow that m(Q|P) = 1, where we have:

P =⋃{p ∈ gc(1) : m|p ∈ [[P]]}

Q =⋃{p ∈ gc(1) : m|p ∈ [[Q]]}

Furthermore, since m ∈ [[(145-b)]], the set {p ∈ gc(1) : m|p ∈ [[Q]]} is empty. HenceQ = ⊥. For reductio, assume that m /∈ [[(145-c)]]. In other words, m /∈ {m : ∀p ∈gc(1), m|p /∈ [[P]]}. From this it follows that the set {p ∈ gc(1) : m|p ∈ [[P]]} is notempty, hence P 6= ⊥. Since m(p) > 0 for all p ∈ gc(1), we can infer that m(Q|P)is well-defined, and since Q = ⊥, we can infer that m(Q|P) = 0 6= 1. But thiscontradicts our assumption that m ∈ [[(145-a)]].

65

B.5. The quasi-validity of constructive dilemma

We can demonstrate that the following argument schema is quasi-valid:

(146) a. if1 P, Qb. if1 R, Sc. P or1 Rd. Hence: Q or1 S

Let m be an arbitrary measure contained in [[(146-a)]], [[(146-b)]], and [[(146-c)]], and letc be an arbitrary context such that m(p) > 0 for all p ∈ gc(1). Since m ∈ [[(146-a)]], weknow that m(Q|P) = 1, where P andQ are defined as in B.4. Hence m(P) = m(PQ).Furthermore, since m is a probability measure and gc(1) is a partition, we have:

m(P) = ∑p∈gc(1)

p⊆P

m(p) = ∑p∈gc(1)

p⊆Qp⊆P

m(p) + ∑p∈gc(1)

p*Qp⊆P

m(p) = m(PQ) + ∑p∈gc(1)

p*Qp⊆P

m(p)

From this we may conclude that there is no p ∈ gc(1) such that p * Q, p ⊆ P ,and m(p) > 0. Since m(p) > 0 for all p ∈ gc(1) by stipulation, it follows that for allp ∈ gc(1), if p ⊆ P then p ⊆ Q. By an analogous argument, the fact that m ∈ [[(146-b)]]entails that for all p ∈ gc(1), if p ⊆ R then p ⊆ S , where we have:

R =⋃{p ∈ gc(1) : m|p ∈ [[R]]}

S =⋃{p ∈ gc(1) : m|p ∈ [[S]]}

Since m ∈ [[(146-c)]], for all p ∈ gc(1), either p ⊆ P or p ⊆ R. This taken togetherwith the results just proved entails that for all p ∈ gc(1), either p ⊆ Q or p ⊆ S , fromwhich it follows that m ∈ [[(146-d)]], as desired.

B.6. The validity of chancy modus ponens

We can demonstrate that the following argument schema is valid:

(147) a. probably1 Pb. if1 P, Qc. Hence: probably1 Q

Let m be an arbitrary measure contained in [[(147-a)]] and [[(147-b)]], and let us defineP and Q as in Appendix B.4. Since m ∈ [[(147-a)]], we know that m(P) > 1/2.Furthermore, since m(P) > 0, and since m ∈ [[(147-b)]] entails that m(Q|P) = 1, we

66

can infer that m(PQ) = m(P). And finally, since m is a probability measure, we knowthat m(Q) ≥ m(PQ). To sum up: we have that m(Q) ≥ m(PQ) = m(P) > 1/2, fromwhich it follows that m ∈ [[(147-c)]].

67

References

Austen, Jane. 1818. Persuasion. Norton, New York. Ed. Patricia Meyer Spacks. NortonCritical Editions. 1st edition, printed in 1995.

Beaver, David. 2001. Presupposition and Assertion in Dynamic Semantics. CSLI Publica-tions, Stanford.

Bennett, Jonathan. 2003. A Philosophical Guide to Conditionals. Oxford UniversityPress, Oxford.

Cantwell, John. 2008. “Changing the Modal Context.” Theoria, vol. 74 (4): 331–51.

Cappelen, Herman & John Hawthorne. 2009. Relativism and Monadic Truth. OxfordUniversity Press, Oxford.

Carroll, Lewis. 1894. “A Logical Paradox.” Mind, vol. 3 (11): 436–40.

Condoravdi, Cleo & Jean Gawron. 1996. “The Context-Dependency of Implicit Ar-guments.” In Quantifiers, Deduction, and Context, Makoto Kanazawa, Christopher

Piñón & Henriëtte de Swart, editors, 1–32. CSLI Publications, Stanford.

Dever, Josh. 2012. “Must or Might.” Ms., University of Texas at Austin: https:

//webspace.utexas.edu/deverj/personal/papers/mustAPA.pdf.

Diaconis, Persi & Sandy L. Zabell. 1982. “Updating Subjective Probability.” Journalof the American Statistical Association, vol. 77: 822–30.

Dorr, Cian & John Hawthorne. 2012. “Embedding Epistemic Modals.” Mind, vol.122 (488): 867–913.

Edgington, Dorothy. 1995. “On Conditionals.” Mind, vol. 104: 235–329.

Egan, Andy. 2007. “Epistemic Modals, Relativism, and Assertion.” Philosophical Stud-ies, vol. 133 (1): 1–22.

Egan, Andy, John Hawthorne & Brian Weatherson. 2005. “Epistemic Modals inContext.” In Contextualism in Philosophy, Gerhard Preyer & Georg Peter, editors.Oxford University Press, Oxford.

Egan, Andy & Brian Weatherson, editors. 2011. Epistemic Modality. Oxford Univer-sity Press, Oxford.

Fantl, Jeremy & Matthew McGrath. 2010. Knowledge in an Uncertain World. OxfordUniversity Press, Oxford.

68

von Fintel, Kai & Anthony Gillies. 2008a. “CIA Leaks.” Philosophical Review, vol.117 (1): 77–98.

—. 2008b. “An Opinionated Guide to Epistemic Modality.” In Oxford Studies in Episte-mology, Tamar Szabó Gendler & John Hawthorne, editors, vol. 2, 32–62. OxfordUniversity Press, Oxford.

—. 2010. “‘Must’. . . Stay. . . Strong!” Natural Language Semantics, vol. 18 (4): 351–83.

—. 2011. “Might Made Right.” In Egan & Weatherson (2011), 108–30.

Ganson, Dorit. 2008. “Evidentialism and Pragmatic Constraints on Outright Belief.”Philosophical Studies, vol. 139 (3): 441–58.

Gazdar, Gerald. 1980. “A Cross-Categorial Semantics for Coordination.” Linguisticsand Philosophy, vol. 3 (3): 407–9.

Geurts, Bart. 1999. Presuppositions and Pronouns. Elsevier, Amsterdam.

Gibbard, Allan. 1981. “Two Recent Theories of Conditionals.” In Ifs: Conditionals,Belief, Decision, Chance, and Time, William L. Harper, Robert Stalnaker & Glenn

Pearce, editors, 211–47. D. Reidel Publishing Company, Dordrecht.

Gillies, Thony. 2004. “Epistemic Conditionals and Conditional Epistemics.” Noûs,vol. 38 (4): 585–616.

—. 2010. “Iffiness.” Semantics and Pragmatics, vol. 3 (4): 1–42.

Groenendijk, Jeroen & Martin Stokhof. 1975. “Modality and Conversational In-formation.” Theoretical Linguistics, vol. 2: 61–112.

Hacquard, Valentine & Alexis Wellwood. 2012. “Embedding Epistemic Modals inEnglish: A Corpus-Based Study.” Semantics and Pragmatics, vol. 5 (4): 1–29.

Hart, H. L. A. & Tony Honoré. 1985. Causation in the Law. Oxford University Press,Oxford, second edn.

Hawthorne, John. 2007. “Eavesdroppers and Epistemic Modals.” Philosophical Issues,vol. 17 (1): 92–101.

Heim, Irene. 1992. “Presupposition Projection and the Semantics of Attitude Verbs.”Journal of Semantics, vol. 9 (3): 183–221.

Heim, Irene & Angelika Kratzer. 1998. Semantics in Generative Grammar. BlackwellPublishers, Ltd., Malden, MA.

69

Hendriks, Petra. 2004. “Either, Both and Neither in Coordinate Structures.” In TheComposition of Meaning: From Lexeme to Discourse, Alice ter Meulen & Werner

Abraham, editors, 115–138. John Benjamins, Amsterdam.

Joyce, James. 2005. “How Probabilities Reflect Evidence.” Philosophical Perspectives,vol. 19: 153–78.

Karttunen, Lauri. 1972. “Possible and Must.” In Syntax and Semantics, J. Kimball,editor, vol. 1, 1–20. Academic Press, New York.

—. 1973. “Presuppositions of Compound Sentences.” Linguistic Inquiry, vol. 4 (2):169–93.

Kaufmann, Stefan. 2004. “Conditioning Against the Grain.” Journal of PhilosophicalLogic, vol. 33 (6): 583–606.

Kolodny, Niko & John MacFarlane. 2010. “Ifs and Oughts.” Journal of Philosophy,vol. 107 (3): 115–43.

Kratzer, Angelika. 1977. “What Must and Can Must and Can Mean.” Linguistics andPhilosophy, vol. 1 (3): 337–55.

—. 1981. “The Notional Category of Modality.” In Words, Worlds, and Contexts: NewApproaches in Word Semantics, Hans Jurgen Eikmeyer & Hannes Rieser, editors,38–74. W. de Gruyter, Berlin.

—. 1991. “Modality.” In Semantics: An International Handbook of Contemporary Research,Arnim von Stechow & Dieter Wunderlich, editors, 639–50. W. de Gruyter,Berlin.

Landman, Fred. 1986. “Conflicting Presuppositions and Modal Subordination.”Chicago Linguistics Society, vol. 22: 195–207.

Lasersohn, Peter. 2005. “Context Dependence, Disagreement, and Predicates ofPersonal Taste.” Linguistics and Philosophy, vol. 28 (6).

Lycan, William. 2001. Real Conditionals. Oxford University Press, Oxford.

MacFarlane, John. 2011. “Epistemic Modals Are Assessment-Sensitive.” In Egan &Weatherson (2011), 144–78.

Mackie, J. L. 1974. The Cement of the Universe: A Study of Causation. Oxford UniversityPress, Oxford.

70

McGee, Vann. 1985. “A Counterexample to Modus Ponens.” Journal of Philosophy,vol. 82 (9): 462–71.

Moss, Sarah. 2013. “Epistemology Formalized.” Philosophical Review, vol. 122 (1):1–43.

—. 2014. “Probability and Knowledge.” Book manuscript. Department of Philosophy,University of Michigan.

Partee, Barbara. 1986. “Noun Phrase Interpretation and Type-Shifting Principles.”In Partee (2004), 203–30.

—. 1989. “Binding Implicit Variables in Quantified Contexts.” In Partee (2004), 259–81.

—. 2004. Compositionality in Formal Semantics. Blackwell Publishers, Ltd., Oxford.

Partee, Barbara & Mats Rooth. 1982. “Conjunction, Type-Ambiguity, and WideScope ‘Or’.” Proceedings of the First West Coast Conference on Formal Linguistics, 1–10.

—. 1983. “Generalized Conjunction and Type-Ambiguity.” In Meaning, Use, and theInterpretation of Language, Rainer Bauerle, Christoph Schwarze & Arnim von

Stechow, editors, 361–83. de Gruyter, Berlin.

Roberts, Craige. 1989. “Modal Subordination and Pronominal Anaphora in Dis-course.” Linguistics and Philosophy, vol. 12 (6): 683–721.

—. 2012. “Information Structure: Towards an Integrated Formal Theory of Pragmat-ics.” Semantics and Pragmatics, vol. 5 (6): 1–69.

Rothschild, Daniel. 2011. “Explaining Presupposition Projection with Dynamic Se-mantics.” Semantics and Pragmatics, vol. 4 (3): 1–43.

—. 2012. “Expressing Credences.” Proceedings of the Aristotelian Society, vol. 112 (1):99–114.

Schaffer, Jonathan. 2005. “Contrastive Causation.” Philosophical Review, vol. 114 (3):297–328.

Schlenker, Philippe. 2009. “Local Contexts.” Semantics and Pragmatics, vol. 2 (3):1–78.

Schroeder, Mark. 2012. “Attitudes and Epistemics.” Ms., Department of Philosophy,University of Southern California.

71

Schroeder, Mark & Jake Ross. 2014. “Belief, Credence, and Pragmatic Encroach-ment.” Philosophy and Phenomenological Research, vol. 88 (2): 259–88.

Slote, M. 1978. “Time in Counterfactuals.” Philosophical Review, vol. 87: 3–27.

Stalnaker, Robert. 1970. “Pragmatics.” In Context and Content, 31–46. Oxford Uni-versity Press, Oxford.

—. 2010. “Responses to Stanley and Schlenker.” Philosophical Studies, vol. 151 (1):143–57.

Swanson, Eric. 2006. “Interactions with Context.” PhD. dissertation, Department ofLinguistics and Philosophy, MIT.

—. 2012. “The Application of Constraint Semantics to the Language of SubjectiveUncertainty.” Forthcoming in the Journal of Philosophical Logic.

Veltman, Frank. 1985. Logics for Conditionals. Ph.D. thesis, University of Amsterdam.

—. 1996. “Defaults in Update Semantics.” Journal of Philosophical Logic, vol. 25 (3):221–61.

Weatherson, Brian. 2005. “Can We Do Without Pragmatic Encroachment?” Philo-sophical Perspectives, vol. 19 (1): 417–43.

—. 2008. “Attitudes and Relativism.” Philosophical Perspectives, vol. 22 (1): 527–44.

Willer, Malte. 2013. “Dynamics of Epistemic Modality.” Philosophical Review, vol.122 (1): 45–92.

Williamson, Timothy. 2000. Knowledge and its Limits. Oxford University Press, Ox-ford.

Yalcin, Seth. 2007. “Epistemic Modals.” Mind, vol. 116 (464): 983–1026.

—. 2009. “More on Epistemic Modals.” Mind, vol. 118: 785–93.

—. 2010. “Probability Operators.” Philosophy Compass, vol. 5 (11): 916–937.

—. 2011. “Nonfactualism about Epistemic Modality.” In Egan & Weatherson (2011),295–332.

—. 2012a. “Bayesian Expressivism.” Proceedings of the Aristotelian Society, vol. 112 (2):123–60.

72

—. 2012b. “A Counterexample to Modus Tollens.” Journal of Philosophical Logic,vol. 41 (6): 1001–24.

Yalcin, Seth & Joshua Knobe. 2010. “Fat Tony Might Be Dead: An ExperimentalNote on Epistemic Modals.” Ms., Department of Philosophy, Yale University andUniversity of California, Berkeley.

73

On the semantics and pragmatics of epistemic …ssmoss/Moss - Semantics and...Forthcoming in Semantics and Pragmatics. Penultimate version. On the semantics and pragmatics of epistemic

Documents