Skip to main content

The Monty Hall Problem Solved, Once and For All

The first time I ever encountered the infamous Monty Hall Problem was 5 years ago on a video from one of my favorite YouTubers, Numberphile. The problem states:

Source: Wikipedia
"Suppose you're on a game show, and you're given the choice of three doors: Behind one door is a car; behind the others, goats. You pick a door, say No. 1, and the host, who knows what's behind the doors, opens another door, say No. 3, which has a goat. He then says to you, 'Do you want to pick door No. 2?' Is it to your advantage to switch your choice?"

The answer is a resounding yes, you are twice as likely to win by switching. However, the problem is seemingly paradoxical and to this day baffles many people - why does switching doors matter in the grand scheme? Through hundreds of YouTube comments on the original videos, I see 2 major misconceptions that I will layout and explain the best that I can. 

Misconception 1: The event of the door not opened/picked is independent of the door the host picked, thus the probability of winning by switching is $\frac{1}{2}$

This thought process is simply a misunderstanding of independence, which in of itself is a very unintuitive topic. We often think of the idea as synonymous to no correlation or association, but that definition doesn't quite fit. Independence implies no correlation but the converse is not true - 0 correlation does not imply independence! Thus, rather than an intuitive explanation, independence is best explained by the following formula: 

Events $A$ and $B$ are independent $\iff \mathbb{P}[A \wedge B] = \mathbb{P}[A] \cdot \mathbb{P}[B]$.

For any two events, $\mathbb{P}[A \wedge B] = \mathbb{P}[A | B] \cdot \mathbb{P}[B]$ and vice versa, (switching $A$ and $B$), so we can conclude if two events are independent, $\mathbb{P}[A | B] = \mathbb{P}[A]$

That's a lot of mathy-math terms to make me sound smart and credible. In English, this says that if two events are independent, their joint probability is the same as the product of their probability separately, and that the probability of a first event given that a second event is true is the same as the probability of the first event (intuitively, independence says one event happening shouldn't affect the probability of another!) Let's lay out an example to see why this is untrue. 

Label the 3 doors as follows: picked, shown, and switched. Let $A$ be the event that the switched door is the prize and $B$ be the event that the shown door is a goat. The probability of $A$ is $\frac{1}{3}$ because only one of the 3 doors are prizes. However, $\mathbb{P}[A | B]$ (Event $A$ given event $B$ is true) is not $\frac{1}{3}$, it is $\frac{1}{2}$ because given door $B$ is given to be a goat! Thus these events are clearly not independent.

Misconception 2: If you list out all the possible outcomes, there are 6 events in which you win and 6 events in which you lose, thus the probability of winning by switching is $\frac{1}{2}$

This claim is true if you only consider the number of possible events, so the claim baffled me for a long time as a high schooler. However, the key insight, which also helps solve the Monty Hall Problem, comes in the PROBABILITY of each event happening - the claim suggests that each of the 12 events as equally likely, which is untrue.

Source: Summer 2019 CS70 Lecture 15 slides by Elizabeth Yang
The diagram shown at the bottom right of the picture lays out all possibilities of the game by using the switching strategy - the first row (labeled $c_1$) contains all the possible outcomes of prize doors (door 1, 2, or 3), each occurring with a probability of $\frac{1}{3}$. The second row (labeled $c_2$) contains all the possible doors the contestant first chooses, again each occurring with a probability of $\frac{1}{3}$.

Here's the kicker - the last row ($c_3$) contains all the possible doors that host reveals that contains a goat. As we can see, if initially we choose the prize door, the host has a $\frac{1}{2}$ chance of revealing either of the goat doors (routes highlighted in yellow) and switching would yield a loss, whereas if we initially choose a goat door, we force the host to reveal the other goat door (routes highlighted in blue) and switching would yield a win! Thus, although the yellow routes and blue routes match in number of events, the probability of each route is different - blue routes happen with a probability of $\frac{1}{3} \cdot \frac{1}{3} \cdot 1 = \frac{1}{9}$ while yellow routes happen with a probability of $\frac{1}{3} \cdot \frac{1}{3} \cdot \frac{1}{2} = \frac{1}{18}$! There are 6 of each route, so the total probability of yellow routes is $\frac{1}{3}$ and the total probability of blue routes is $\frac{2}{3}$.

Thus, by switching doors every time you play the game, you win (blue route) with twice the probability of losing (yellow route). If the explanation is not convincing enough and you are really set on proving it wrong, try setting up and playing 100 games on your own, multiple times, switching every time - if you manage to consistently lose 50 games or more, call me :)

Comments

Popular posts from this blog

Jack of All Trades or Master of One?

What does it mean to be the best at something? Einstein. Mozart. Jordan. Aristotle. These are often the most recognized names in their respective field of work, looked up to by millions every day.  But out of the billions of people who have ever lived in this world, how many of us can reasonably expect to live up to their status and ability? Short answer: the vast majority can't, and won't ever. But that doesn't stop us from aspiring to be the best at whatever we pursue, including me. Why do some people appear more successful than others? A short formula for our ability to do something is as follows (note that this may not be comprehensive, or entirely accurate, but is meant for simplicity's sake): Ability = Talent x Efficient Work where "Efficient Work" can be further broken down into time spent x quality of work . Making lots of different mistakes and reflecting on them counts as quality work. Gazing at your phone every 5 minutes while practicin...

Class Review: Discrete Math and Probability Theory (CS70)

CS70 at UC Berkeley during Summer 2019 was the hardest class I've ever taken. It takes centuries of development in Discrete Mathematics and Probability Theory (some theories took decades to prove or refine) and shoves abstract concepts down your throat in a span of 8 weeks. French is beautiful I believe that there is no other class at Berkeley in which the disparity between the ability of the students in the course is so stark - students range from International Math Olympiad medalists who have years of rigorous contest math experience to beginning computer science students who hate math and just want to declare the major (a pity that this course is required for that purpose). I fall somewhere in the middle, a math-loving CS student with a bit of contest math background, but little experience in formal proof writing or anything beyond basic probability. The course is divided into two parts: Discrete Mathematics and Probability Theory (as can probably be inferred by my prev...