TDS Archive

An archive of data science, data analytics, data engineering, machine learning, and artificial intelligence writing from the former Towards Data Science Medium publication.

Where Should We Eat? The Five Rules of Decision Analysis Explained

6 min readMay 16, 2022

--

Photo by Chad Montano on Unsplash

As a data scientist, I find the field of decision analysis an underrated tool in my work. When my passion for statistics and programming drags me deep into the weeds, decision analysis pulls my focus back to the big picture impact.

The core of decision analysis consists of five rules that unlock a powerful quantitative framework. These rules were initially formulated by Professor Ronald Howard and are outlined in his book, The Foundations of Decision Analysis [1].

These rules outline the “algebra” of decision-making. They provide the ability to capture subjective beliefs and preferences in a mathematically rigorous framework.

In this post, we will explore each of the rules of decision analysis to answer one of the great mysteries of the ages — where should we eat for dinner tonight?

The dinner decision

We are trying to decide which of the two following restaurants we want to eat at for dinner:

  1. Roulette’s Tacos: Roulette’s is a new (imaginary) taco place in Austin, TX, so you know it’s good. Their salmon tacos are stellar when they have them in stock. Unfortunately, the only other meal they serve is a salad, which is disappointing.
  2. Reliable Randy’s Pizzaria: Reliable Randy’s has a good pepperoni pizza. They run out of pepperoni occasionally, but they always have cheese pizza as a backup.

The rules

We will now break down this decision using the five rules of decision analysis. By the end, we will know where to eat tonight.

1. The Probability Rule

We must quantify all uncertainties in the decision with probabilities.

The first rule instructs us to assign probabilities to quantify any uncertainties in the decision. These probabilities are inherently Bayesian rather than frequentist because they capture the subjective beliefs of the decision-maker.

There are two uncertainties in the dinner decision:

  1. Whether Roulette’s Tacos will have tacos when we arrive.
  2. Whether Reliable Randy’s will have pepperoni when we arrive.

Based on past experiences (data), we believe that there is a 75% chance that Roulette’s tacos will have tacos and a 90% chance that Reliable Randy’s will have pepperoni.

The probabilistic assessment is where data science contributes to the decision analysis workflow. Perhaps we could use modeling techniques to analyze data to unlock more granular insights. For example, a good data scientist might compute a probability that accounts for the time of day we go to the restaurants. In fact, a core concept of decision analysis is the value of information (VOI), which quantifies the value of a data scientist’s work based on its ability to update the beliefs of a decision-maker. See here for a more detailed discussion of VoI.

In decision analysis, the role of data science is to inform the beliefs of the decsion-maker.

Based on our beliefs, we can represent the decision with the following decision tree:

Image by author. A tree summarizing the dinner decision. I really did photograph and eat all of the food items shown.

Notice that in decision trees, squares represent decisions, and circles represent lotteries.

The red square represents the decision regarding which restaurant we will choose. Once we choose a restaurant, we will face an uncertain lottery regarding which meals they will have available.

2. The Order Rule

We must order all possible outcomes based on our preferences.

In the dinner decision, there are four possible outcomes: tacos at Roulette’s, salad at Roulette’s, pepperoni pizza at Reliable Randy’s, and cheese pizza at Reliable Randy’s.

We must order the outcomes in a non-cyclical manner. For example, we cannot prefer tacos to salad, salad to cheese pizza, and cheese pizza to tacos because that results in a cycle of preferences.

In this case, ordering our preferences is straightforward. Tacos are the best option. After that, pepperoni pizza is our second choice, followed by cheese pizza. Salad is disappointing, so we will put it at the bottom. Our ordering is then:

  1. Tacos at Roulette’s
  2. Pepperoni Pizza at Reliable Randy’s
  3. Cheese Pizza at Reliable Randy’s
  4. Salad at Roulette’s

3. The Equivalence Rule

If we prefer A to B and B to C, there exists a probability p that makes us indifferent between B and a lottery with p chance of A and 1-p chance of C.

The equivalence rule allows us to equate each possible outcome to a lottery between the best and worst outcomes — tacos and salad. Let’s start with pepperoni pizza.

We need to determine the indifference probability, p, such that:

Image by author. We need to determine the lottery between tacos and salad that we view as equally desirable to a guaranteed pepperoni pizza.

We prefer tacos to pepperoni pizza and pepperoni pizza to salad. So naturally, there must be some amount of salad risk we’re willing to accept for a chance at delicious tacos instead of a sure-fire pepperoni pizza.

Consider two limiting cases:

  • If p=1, we would be indifferent between a guaranteed pepperoni pizza and guaranteed tacos. We’re not; we prefer tacos!
  • If p=0, we would be indifferent between a guaranteed pepperoni pizza and a guaranteed salad. We’re not; the pizza is much better!

So, p is somewhere between 0 and 1.

The indifference probability only describes our preferences. It is not calculated from data.

I am indifferent between a guaranteed pepperoni pizza and a lottery with an 80% chance of tacos and a 20% chance of salad. So, we will use p=80% for Reliable Randy’s pepperoni pizza.

We also need to determine our indifference probability for cheese pizza. Cheese pizza is almost as good as pepperoni pizza and still much better than salad. My indifference probability is 70%, so I agree with the following statement:

Image by author. I am indifferent between a guaranteed cheese pizza and the lottery between tacos and salad shown here.

4. The Substitution Rule

If we are indifferent between B and a lottery between A and C, we must be willing to substitute B for the lottery.

The substitution rule allows us to exchange pepperoni pizza and cheese pizza in the decision tree with the lotteries we determined in the last section. This procedure gives the following decision tree:

Image by author. The decision tree after substituting the pizzas for their equivalent lotteries.

The substitution is helpful because now our decision tree only has two possible outcomes — tacos and salad. Notice that the lotteries for Reliable Randy’s Pizzaria can be simplified using the law of total probability.

The total probability of tacos with the Reliable Randy’s branch is (0.90 x 0.80) +(0.10 x 0.70) = 79%. We find that the total probability of salad for the branch is 21% with the same procedure or by recognizing that the total probability must sum to 100%.

The simplification results in the following decision tree:

Image by author. The decision tree after simplifying the “Reliable Randy’s Pizzaria” branch using the law of total probability.

5. The Choice Rule

If we prefer A to C, and we face multiple lotteries between A and C, we must choose the lottery with the highest probability of A.

This rule is intuitive. We have reduced our decision tree to two lotteries between tacos and salad. We prefer tacos to salad, so we must choose the lottery with a higher taco probability, which is Reliable Randy’s Pizzaria.

Roulette’s Tacos is actually a lottery between tacos and salad with a 75% chance of tacos. From the rules of decision analysis, we determined that Reliable Randy’s Pizzaria is equivalent to a lottery between tacos and salad with a 79% chance of tacos.

It looks like we’re having pizza tonight!

Conclusions

In this post, we explored the five rules of decision analysis. These rules are the foundation of a powerful quantitative decision-making framework. Although the dinner example we explored is trivially simple, it helps build the intuition for more complex decisions.

The core of decision analysis is translating beliefs and preferences into rational actions.

Thank you for reading this post. If you are interested in more decision analysis and statistics content, follow me on Medium or LinkedIn.

References

  1. Howard, Ronald A. “The Foundations of Decision Analysis.” IEEE transactions on systems science and cybernetics 4.3 (1968): 211–219.

--

--

TDS Archive
TDS Archive

Published in TDS Archive

An archive of data science, data analytics, data engineering, machine learning, and artificial intelligence writing from the former Towards Data Science Medium publication.

Tyler Buffington, PhD
Tyler Buffington, PhD

Written by Tyler Buffington, PhD

Experienced data scientist specializing in causal inference, experimentation, and decision analysis.