Chapter 9 Reasoning and Decision Making
We like to think that we make important decisions rationally, logically, and without bias or error – but what if that’s not the case? Standing in the cereal aisle, for example you face dozens of choices. You reason that whole grain cereals are healthier, but the colorful box at eye level catches your attention. The “25% more free!” label seems like a good deal, but is it what you want? Your decision involves weighing nutrition against taste, cost against convenience, and logical analysis against marketing and nostalgia. Even this simple choice reveals the complex interplay between reasoning and decision making that governs daily life.
LEARNING OBJECTIVES
- Distinguish between deductive and inductive reasoning
- Explain why people make systematic errors in logical reasoning, including belief bias and confirmation bias
- Describe how heuristics help with reasoning but can also lead to biases
- Compare normative theories of decision making (how we should decide) with descriptive theories (how we actually decide)
- Understand how choice context affects decisions through framing effects, default effects, and other influences on our preferences
9.1 Reasoning
In reasoning, available information is taken into account in the form of premises. A conclusion is reached on the basis of these premises through a process of inference. The content of the conclusion goes beyond either one of the premises. To demonstrate, consider the following reasoning process you might use in that cereal aisle:
Premise 1: All cereals with whole grains are healthier than those without.
Premise 2: This cereal contains whole grains.
Conclusion: Therefore, this cereal is healthier than cereals without whole grains.
The conclusion in this example follows directly from the premises, but it includes information that is not explicitly stated in the premises. This is a typical feature of a process of reasoning. We will discuss the two major kinds of reasoning, inductive reasoning and deductive reasoning, which logically complement one another.
9.1.1 Deductive reasoning
Deductive reasoning is concerned with syllogisms in which the conclusion follows logically from the premises. Consider the following example about preparing for a vacation:
- Premise:If it is warm, one needs shorts and t-shirts.
- Premise: It is warm in Spain during summer.
- Conclusion: Therefore, one needs shorts and t-shirts in Spain.
In this example it is obvious that the premises are about rather general information and the resulting conclusion is about a more special case which can be inferred from the two premises. We will now differentiate between the two major kinds of syllogisms: categorical and conditional syllogisms.
Categorical syllogisms
In categorical syllogisms, the statements of the premises typically begin with “all”, “none” or “some” and the conclusion starts with “therefore,” “thus,” or “hence.” These kinds of syllogisms describe a relationship between two categories. In the example given above in the introduction of deductive reasoning these categories are Spain and the need for shorts and T-Shirts. Two different approaches serve the study of categorical syllogisms: the “normative approach” and the “descriptive approach”.
The normative approach
The normative approach to categorical syllogisms is based on logic, and deals with the problem of categorizing conclusions as either valid or invalid. “Valid” means that the conclusion follows logically from the premises whereas “invalid” means the contrary. Two basic principles and a method called Euler Circles have been developed to help make validity judgments. The first principle was created by Aristotle, and states “If the two premises are true, the conclusion of a valid syllogism must be true” (Goldstein, 2005). The second principle states “The validity of a syllogism is determined only by its form, not its content.” These two principles explain why the following syllogism is (surprisingly) valid:
Premise 1: All flowers are animals.
Premise 2: All animals can jump.
Conclusion: Therefore, all flowers can jump.
Even though it is quite obvious that the first premise is not true and further that the conclusion is not true, the whole syllogism is still valid. That is, when you apply formal logic to the syllogism in the example, the conclusion is valid.
It is possible to display a syllogism formally with symbols or letters and explain its relationship graphically with the help of diagrams. One way to demonstrate a premise graphically is to use Euler circles (pronounced “oyler”). Starting with a circle to represent the first premise and adding one or more circles for the second one (Figure 9.1), one can compare the constructed diagrams with the conclusion. The displayed syllogism in Figure 9.1 is obviously valid. The conclusion shows that everything that can jump contains animals which again contains flowers. This aligns with the two premises which point out that flowers are animals and thus are able to jump. Euler circles help represent such logic.

Figure 9.1: Euler Circles
The descriptive approach
The descriptive approach is concerned with estimating people’s ability to judge the validity of syllogisms and explaining errors people make. This psychological approach uses two methods in order to study people’s performance:
Method of evaluation: People are given two premises and a conclusion. Their task is to judge whether the syllogism is valid.
Method of production: Participants are given two premises. Their task is to develop a logically valid conclusion.
In addition to the form of a syllogism, the content can influence a person’s decision and cause the person to neglect logical thinking. The belief bias states that people tend to judge syllogisms with believable conclusions as valid, while they tend to judge syllogisms with unbelievable conclusions as invalid. Given a conclusion as like “Some bananas are pink”, hardly any participants would judge the syllogism as valid, even though it might be logically valid according to its premises (e.g. Some bananas are fruits. All fruits are pink.)
Conditional syllogisms
Another type of syllogism is called “conditional syllogism.” Just like the categorical syllogisms, they also have two premises and a conclusion. The difference is that the first premise has the form “If … then”. Syllogisms like this one are common in everyday life. Consider the following example:
Premise 1: If it is raining, you needs an umbrella.
Premise 2: It is raining.
Conclusion: Therefore, you need an umbrella.
Conditional syllogisms are typically given in the abstract form: “If p then q”, where “p” is called the “antecedent” and “q” the “consequent”.
Forms of conditional syllogisms
There are four major forms of conditional syllogisms: “modus ponens”, “modus tollens”, “denying the antecedent”, and “affirming the consequent”. These are illustrated in the table below (9.1) by means of the conditional syllogism above (i.e. If it is raining, you need an umbrella). The table indicates the premises, the resulting conclusions and whether the form is valid. The bottom row displays the how frequently people correctly identify the validity of the syllogisms.
Modus Ponens | Modus Tollens | Denying the Antecedent | Affirming the Consequent | |
---|---|---|---|---|
Description | The antecedent of the first premise is affirmed in the second premise. | The consequent of the first premise is negated in the second premise. | The antecedent of the first premise is negates in the second premise. | The antecedent of the first premise is affirmed in the second premise. |
Formal | If P then Q. P Therefore Q. | If P the Q. Not-Q Therefore Not-P. | If P then Q. Not-P Therefore Not-Q. | If P then Q. Q Therefore P. |
Example | If it is raining, you need an umbrella. It is raining. Therefore you need an umbrella. | If it is raining, you need an umbrella. You do not need an umbrella. Therefore it is not raining. | If it is raining, you need an umbrella. It is not raining. Therefore you do not need an umbrella. | If it is raining, you need an umbrella. You need an umbrella. Therefore it is raining. |
Validity | Valid | Valid | Invalid | Invalid |
Correct Judgements | 97 % correctly identify as valid. | 60% correctly identify as valid. | 40% correctly identify as invalid. | 40% correctly identify as invalid. |
As we can see, the validity of the syllogisms with valid conclusions is easier to judge correctly than the validity of the syllogisms with invalid conclusions. The conclusion in the instance of the modus ponens is apparently valid. In the example it is very clear that you need an umbrella if it is raining.
The validity of the modus tollens is more difficult to recognize. Referring to the example, if you do not need an umbrella it can’t be raining. The first premise says that if it is raining, she needs an umbrella. So the reason for not needing an umbrella is that it is not raining. Consequently, the conclusion is valid.
The validity of the remaining two kinds of conditional syllogisms is judged correctly by only 40% of people. If the method of denying the antecedent is applied, the second premise says that it is not raining. But from this fact it does not follow logically that you do not need an umbrella: you could need an umbrella for another reason, such as to shield from the sun.
Affirming the consequent in the case of the given example means that the second premise says that you need an umbrella, but again the reason for this can be circumstances apart from rain. So, it does not logically follow that it is raining. Therefore, the conclusion of this syllogism is invalid.
The four kinds of syllogisms have shown that it is not always easy to make correct judgments concerning the validity of the conclusions. The following passages will deal with other errors people make during the process of conditional reasoning.
The Wason Selection Task
The Wason Selection Task is a famous experiment which shows that people make more reasoning errors in abstract situations than when the situation is taken from real life (Wason, 1960).
In the abstract version of the Wason Selection Task, four cards are shown to the participants with a letter on one side and a number on the other (Figure 9.2). The task is to indicate the minimum number of cards that have to be turned over to test whether the following rule is observed: “If there is a vowel on one side then there is an even number on the other side.” 53% of participants selected the ‘E’ card which is correct, because turning this card over is necessary to test the truth of the rule. However another card still needs to be turned over. 64% indicated that the ‘4’ card has to be turned over which is not right. Only 4% of participants answered correctly that the ‘7’ card needs to be turned over in addition to the ‘E’. The correctness of turning over these two cards becomes more obvious if the same task is stated in terms of real-world items instead of vowels and numbers. One of the experiments for determining this was the beer/drinking-age problem used by Griggs & Cox (1982). This experiment is identical to the Wason Selection Task except that instead of numbers and letters on the cards, everyday terms (beer, soda and ages) were used (Figure 9.3). Griggs and Cox gave the following rule to participants: “If a person is drinking beer then he or she must be older than 21.” In this case 73% of participants answered correctly, that the cards with “beer” and “14 years” have to be turned over to test whether the rule is kept.

Figure 9.2: Original Wason Selection Task cards

Figure 9.3: Drinking-age Wason Selection Task
Why is the performance better in the case of real–world items?
There are two different approaches to explain why participants’ performance is significantly better in the case of the beer/drinking-age problem than in the abstract version of the Wason Selection Task: the permission schemas approach and the evolutionary approach.
The rule, “if a person is 21 years old or older then they are allowed to drink alcohol,” is well-known as an experience from everyday life. Based on a lifetime of learning rules in which one must satisfy some criteria for permission to perform a specific act, we have a “permission schema” already stored in long-term memory to think about such situations. Participants can apply this previously-learned permission schema to the Wason Selection Task for real–world items to improve participants’ performance. On the contrary such a permission schema from everyday life does not exist for the abstract version of the Wason Selection Task (Cheng & Holyoak, 1985; Griggs & Cox, 1983).
The evolutionary approach concerns the human ability of “cheater detection”. This approach states that an important aspect of human behavior across our evolutionary history is the ability for people to cooperate in a way that is mutually beneficial. As long as a person who receives a benefit also pays the relevant cost, everything works well in a social exchange. If someone cheats, however, and receives a benefit from others without paying the cost, problems arise. It is assumed that the ability to detect cheaters became a part of the human cognitive makeup during evolution. This cognitive ability improves the performance in the beer/drinking-age version of the Wason Selection Task as it allows people to detect a cheating person who does not behave according to the rule. Cheater-detection does not work in the case of the abstract version of the Wason Selection Task as vowels and numbers cannot behave in any way, much less cheat, and so the cheater detection mechanism is not activated (Cosmides, 1989; Gigerenzer & Hug, 1992).
9.1.2 Inductive reasoning
So far we have discussed deductive reasoning, which is reaching conclusions based on logical rules applied to a set of premises. However, many problems cannot be represented in a way that would make it possible to use these rules to come to a conclusion. Inductive reasoning is the process of making observations and applying those observations via generalization to a different problem. Therefore one infers from a special case to the general principle, which is just the opposite of the procedure of deductive reasoning (9.4). A good example of inductive reasoning is the following:
Premise: All crows you have ever seen are black. Conclusion: Therefore, you reason that all crows on earth are black.
In this example it is obvious that you infer from the simple observation about the crows they have seen to the general principle about all crows. Considering Figure 9.4, this means that you infer from the subset (yellow circle) to the whole (blue circle). As with this example, it is typical in inductive reasoning that the premises are believed to support the conclusion, but do not ensure the conclusion.

Figure 9.4: An example of inductive reasoning would be generalizing from the subset of crows you have seen to all crows.
Forms of inductive reasoning
The two different forms of inductive reasoning are “strong” and “weak” induction. The former indicates that the truth of the conclusion is very likely if the assumed premises are true. An example for this form of reasoning is the one given in the previous section. In this case it is obvious that the premise (“All crows you have ever seen are black”) gives good evidence for the conclusion (“All crows on earth are black”) to be true. Nevertheless it is still possible, although very unlikely, that not all crows are black.
On the contrary, conclusions reached by “weak induction” are supported by the premises in a relatively weak manner. In this approach, the truth of the premises makes the truth of the conclusion possible, but not likely. An example for this kind of reasoning is the following:
Premise: You always listen to music with your iPhone. Conclusion: Therefore, all music is only heard with iPhones.
In this instance the conclusion is obviously false. The information the premise contains is not very representative and although it is true, it does not give decisive evidence for the truth of the conclusion.
To sum it up, strong inductive reasoning yields conclusions which are very probable whereas the conclusions reached through weak inductive reasoning are unlikely to be true.
Reliability of conclusions
If the strength of the conclusion of an inductive argument has to be determined, three factors concerning the premises play a decisive role. The example of observations about crows (see previous sections) displays these factors:
When you observe crows not just in your hometown but in different countries, the number of observations increases. The representativeness improves if you observe crows at different times and places. The quality of evidence increases if you find scientific research showing that crow genes determine they can only be black.
Conclusions reached through a process of inductive reasoning are never definitely true, as no one has seen all crows on earth. It is possible, although very unlikely, that there is a green or brown exemplar. The three above factors contribute to the strength of an inductive argument. The stronger these factors are, the more reliable the conclusions reached through induction.
Processes and constraints
In the process of inductive reasoning people often make use of heuristics. These heuristics help people make judgments, but sometimes cause errors when the cues they are sensitive to are misleading. In the following sections, two of these heuristics, the availability heuristic and the representativeness heuristic, are explained. Subsequently, confirmation bias is introduced, which refers to when people rely too heavily on their own prior beliefs when assessing new evidence.
The availability heuristic
The availability heuristic refers to when people judge more memorable events (or traits, classes of objects, etc.) to be more frequent than less memorable events. In Kahneman and Tversky’s paper introducing the heuristic, they describe that “Availability is an ecologically valid clue for the judgment of frequency because, in general, frequent events are easier to recall or imagine than infrequent ones. However, availability is also affected by various factors which are unrelated to actual frequency” (Tversky & Kahneman, 1973). In other words, while an event’s memorability is often a good cue as to how frequent that event is, this cue is imperfect.
For example, in an experiment done by Sarah Lichtenstein and colleagues, participants were asked to choose from a list which causes of death occur most often. Because of the availability heuristic, people judged more “spectacular” causes like homicide or tornadoes to cause more deaths than less dramatic causes, like asthma. They propose that this is because dramatic causes of death are disproportionately publicized in media, resulting in easier recall for participants and thus erroneously higher estimates of frequency (Lichtenstein et al., 1978).
The representativeness heuristic
Similarly to the availability heuristic, the representativeness heuristic is used when people judge frequency. However in this case, people rely on a judgment of similarity instead of using memorability as a cue. For example, Tversky & Kahneman (1983) gave the following description to a group of participants:
“Linda is 31 years old, single, outspoken and very bright. She majored in philosophy. As a student, she was deeply concerned with issues of discrimination and social justice, and also participated in anti-nuclear demonstrations.”
They then asked participants to rank a series of descriptions in order of how likely they were to describe Linda. Most participants rated “Linda is a bank teller and is active in the feminist movement” as more likely than “Linda is a bank teller”. However, this is logically impossible, as the first description includes the second (Figure 9.5). Because the participants based their judgment on similarity between the description of Linda and being “active in the feminist movement”, rather than reasoning in line with the formal laws of logic, most committed this “conjunction fallacy”.
Like the availability heuristic though, the representativeness heuristic is often accurate and useful. Tversky and Kahneman write: “Representativeness tends to covary with frequency: Common instances and frequent events are generally more representative than unusual instances and rare events” (Tversky & Kahneman, 1983).

Figure 9.5: Feminist bank tellers.
Confirmation bias
Confirmation bias refers to a set of related phenomena wherein people seek or interpret evidence in a way that aligns with beliefs they already hold. This can take the form of overweighting evidence that supports one’s belief, underweighting evidence that opposes one’s belief, using an information search strategy that is more likely to reveal supportive evidence, remembering only evidence that aligns with one’s beliefs, and more (Nickerson, 1998b). It is important to note that with most forms of confirmation bias, psychologists are not referring to intentional, goal-directed reasoning, such as what a lawyer defending a case might engage in. Rather, these phenomena are typically understood to be unintentional, at least at a conscious level.
For example, if someone believes that black cats bring bad luck, they may be additionally inclined to notice or remember negative events that occurred after seeing a black cat.
Induction vs. deduction

Figure 9.6: Deductive and inductive reasoning.
The table below (9.2) summarizes the most prevalent properties and differences between deductive and inductive reasoning which are important to keep in mind.
Deductive Reasoning | Inductive Reasoning | |
---|---|---|
Premises | Stated as facts or general principles (“It is warm in the Summer of Spain.”) | Based on observations of specific cases (“All crows you have seen are black.”) |
Conclusion | Conclusion is more special than the information the premises provide. It is reached directly by applying logical rules to the premises. | Conclusion is more general than the information the premises provide. It is reached by generalizing the premises` information. |
Validity | If the premises are true, the conclusions must be true. | If the premises are true, the conclusion is probably true. |
Usage | More difficult to use (mainly in logical problems). One needs facts which are definitely true. | Used often in everyday life (fast and easy). Evidence is used instead of proved facts. |
9.2 Decision making
The psychological processes of decision making are critical in everyday life. Imagine deciding whether to carry an umbrella because of a forecasted “20% chance of rain” or choosing which of your streaming subscriptions to keep when money is tight.
There are two main ways to describe and analyze decision making. The “normative” approach describes how decision makers ought to make decisions. That is, it describes how people should behave if they were perfectly rational and had well-defined preferences. Here, “rational” typically means that one adheres to the laws of probability and formal logic. The “descriptive” approach to studying decision making describes how real-life decision makers actually make decisions, rather than how they should. A common finding among decision making researchers is that people do not typically follow what a perfectly rational actor would do, though this claim is contentious and widely debated within the field.
Theories of Decision Making
Expected Value Theory
In the earliest days of probability theory, theorists used Expected Value Theory (EVT) as a normative account. Under EVT, the value of an option is calculated as the probability that option will occur, multiplied by the quantitative value of that option. For example, if a casino offers a game in which there as an 80% chance of winning $10, the “expected value” is .8 x 10 = $8. Because EVT is normative, people should be willing to pay up to $7.99 in order to play that game, because the expected value ($8) is higher than the cost (< $8). However, EVT has several issues as a normative theory. One sizeable issue is that most decisions do not have quantitative outcomes. For example, when deciding whether to take an umbrella due to a 20% chance of rain, there is no clear number that expresses how good or bad it would be to get caught in the rain! A second sizeable issue is that some decisions can have infinitely large expected values but have infinitely small probabilities of occurring, and people are typically uninterested in taking those gambles, despite EVT suggesting they should. This was first demonstrated in a game called the St. Petersburg Paradox, described in box 9.4.1.
St. Petersburg Paradox
A classic paradox dating back to 1713, the St. Petersburg Paradox, describes a game in which a fair coin is flipped repeatedly until it lands on heads, at which point the game ends. The number of times the coin was flipped (including the final heads) is the variable k, and the player wins \(2^k\) dollars. How much would you be willing to pay to play this game? Most people are unwilling to pay more than ten dollars. However, you might notice that the expected value of this game is infinite, because hypothetically the coin could never land on heads, though it’s incredibly unlikely. Needless to say, a game with infinite expected value that most people would not pay ten dollars for presents a problem for Expected Value Theory.
Expected Utility Theory
Due to these issues, most modern theorists believe that Expected Utility Theory (EUT) is a better normative account of decision making. Under EUT, “utility” instead of “value” is multiplied by a probability. Utility is a more general term that refers to how subjectively good we believe an outcome to be, and “disutility” refers to how subjectively bad we believe an outcome to be. If you hate being caught in the rain, you might multiply the chance of rain (20%) by how much disutility rain gives you (perhaps -60 utils). Your expected utility from rain would be .2 x -60 = -12 utils. If carrying a heavy umbrella gives you -20 utils, you’d decide to skip the umbrella. EUT also accounts for the fact that people typically experience diminishing marginal utility. This is the observation that the utility of a gain is proportional to how much of that good you already have. For example, if you have $100 dollars and somebody gives you $5,000, you will experience much more gain in utility than if you have $100,000 and somebody gives you that same $5,000. Diminishing marginal utility helps resolve the issue with EVT demonstrated by the St. Petersburg Paradox because the extremely high potential values cease to be meaningful (if you have $100 billion, the next dollar you earn is meaningless).
Prospect Theory
By having participants in experiments report their willingness to play various gambles, Daniel Kahneman and Amos Tversky measured how people actually weigh value and probability, which they called Prospect Theory (Kahneman & Tversky, 1979). The major findings of Prospect Theory (PT), called the probability weighting function and the value function, are shown in Figure 9.7. The value function shows that, on average, people show diminishing marginal (dis)utility for large gains or large losses, and that losses typically have greater disutility than comparably sized gains have utility (hence the “kink” at zero). The probability weighting function shows that, on average, people tend to overweight small probabilities (i.e., behave as though they have higher probabilities than they actually do), and underweight large probabilities (i.e., behave as though they have lower probabilities than they actually do).

Figure 9.7: a) value weighting function and b) probability weighting function, both from Prospect Theory. v(.) refers to subjective utility, and w(.) refers to subjective probability.
Constructed Preferences
Contrary to theorists’ claim that rational decision makers always have well-defined preferences, real-life decision makers tend to make different selections depending on how choices are presented to them. Two common examples of this are default effects and framing effects.
Default Effects
The default effect refers to the tendency for decision makers to choose a pre-selected option, rather than select non-pre-selected options in a particular choice context. A classic example of this effect concerns organ donor rates across different countries. In some countries, such as Austria and France, citizens are enrolled to be organ donors by default. That is, unless a citizen actively selects to not be an organ donor, they will be presumed to be one. In other countries, such as the United States, citizens are presumed not to be organ donors unless they actively select to be one. In countries like Austria and France, organ donor rates are typically as high as 99% (E. J. Johnson & Goldstein, 2003). As of 2022, organ donor rates in the United States are approximately 54%. This large gap is partially due to the different default option in each country.
Default effects are thought to occur for three main reasons (Dinner et al., 2011; Jachimowicz et al., 2019). First, people are sometimes inattentive when making selections (Choi et al., 2002). Second, because people often perceive losses as worse than comparable gains (as discussed in the Prospect Theory section), the losses associated with switching away from a particular option may feel more significant than the gains associated with a new option. Finally, people often perceive defaults as implicit recommendations (McKenzie et al., 2006).
Framing Effects
Framing effects occur when logically equivalent descriptions of an option lead to different choices. There are a number of types of framing effects, with one of the simplest being attribute framing (Levin et al., 1998). For example, a doctor might describe that five years after undergoing a particular surgery, 90% of patients are still alive. A doctor could also describe that five years after that same surgery, 10% of patients had died, a logically equivalent description (one entails the other). In multiple experiments, people have been found to be much more willing to undergo riskier medical procedures when they are described in the “positive” frame (90%, here) than the “negative” frame (10%) (McNeil et al., 1982; Wilson et al., 1987). There is debate in the literature as to why framing effects occur, with varying perspectives on whether they are reasonable ways for decision makers to behave.
Are more options better?
People often have the intuition that more options are better, because there is a higher chance a decision maker can find something that suits their needs or preferences. Traditional economic theory agrees! However, psychological research reveals this is typically not the case. In one classic experiment, shoppers in a grocery store were asked to sample flavors of jam. Some shoppers were given 6 flavors to choose from, while others were given 24 flavors to choose from. Shoppers were far more likely to purchase a jar afterward if they had only 6 flavors to choose from. Subsequent studies suggest people are also more satisfied with their ultimate selections if they had fewer options to choose from (Iyengar & Lepper, 2000). A recent meta-analysis found that this “choice overload” tends to be a function of decision task difficulty, choosers’ uncertainty in their preferences, choice set complexity, and the decision goal (Chernev et al., 2015).
Key Takeaways
- Deductive reasoning describes when we draw a specific conclusion from general premises. Inductive reasoning describes the opposite process, where we draw general conclusions from specific premises.
- Normative and descriptive theories of decision making do not always align. People typically use heuristics (cognitive shortcuts) rather than formal logic and reasoning. These heuristics often help people arrive at sufficiently accurate conclusions, though they sometimes result in reasoning errors.
Exercises
- Imagine a gamble with a 20% probability of winning $50. How much money does Expected Value Theory suggest you should be willing to pay to try this gamble?
- Describe default effects and framing effects. How are those psychological phenomena divergent from what a perfectly “rational” decision maker would do?