One week, it seems, isn’t enough to tell the difference conclusively between the first bidder on Contestants Row having a 25 percent chance of winning — winning one out of four times — or a 17 percent chance of winning — winning one out of six times. But we’re not limited to watching just the one week of The Price Is Right, at least in principle. Some more episodes might help us, and we can test how many episodes are needed to be confident that we can tell the difference. I won’t be clever about this. I have a tool — Octave — which makes it very easy to figure out whether it’s plausible for something which happens 1/4 of the time to turn up only 1/6 of the time in a set number of attempts, and I’ll just keep trying larger numbers of attempts until I’m satisfied. Sometimes the easiest way to solve a problem is to keep trying numbers until something works.
In two weeks (or any ten episodes, really, as talked about above), with 60 items up for bids, a 25 percent chance of winning suggests the first bidder should win 15 times. A 17 percent chance of winning would be a touch over 10 wins. The chance of 10 or fewer successes out of 60 attempts, with a 25 percent chance of success each time, is about 8.6 percent, still none too compelling.
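The keep-trying-larger-numbers approach can be sketched in a few lines. The post uses Octave; what follows is a Python stand-in, not the author's script. The function names, the 30-items-per-week figure (five episodes of six items), and the choice of "one sixth of the attempts, rounded down" as the win count to test are all assumptions made for illustration.

```python
from math import comb

def binom_cdf(k, n, p):
    """Probability of k or fewer successes in n independent attempts."""
    return sum(comb(n, i) * p**i * (1 - p)**(n - i) for i in range(k + 1))

def tail_for_weeks(weeks, p_fair=0.25, items_per_week=30):
    """Chance a fair 1-in-4 bidder wins only about 1 time in 6 over some weeks.

    items_per_week=30 assumes five episodes a week, six items per episode.
    """
    n = weeks * items_per_week
    k = n // 6  # wins we would see if the true rate were one in six
    return n, k, binom_cdf(k, n, p_fair)

# Keep trying larger numbers of attempts and watch the probability shrink.
for weeks in range(1, 9):
    n, k, prob = tail_for_weeks(weeks)
    print(f"{weeks} week(s): {n} items, P(at most {k} wins) = {prob:.3f}")
```

The two-week row is the 60-item case above, and should come out near the 8.6 percent quoted. The later rows show how many more weeks it takes before the probability drops under whatever threshold one finds convincing.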
Here we might turn to despair: 6,000 episodes — about 35 years of production — weren’t enough to give perfectly unambiguous answers about whether there were fewer clean sweeps than we expected. There were too few at the 5 percent significance level, but not too few at the 1 percent significance level. Do we really expect to do better with only 60 shows?
Continue reading “Proving Something With One Month’s Counting”
The last important idea missing before we can judge this problem about The Price Is Right clean sweeps of Contestants Row is the significance level. Whenever an experiment is run — whether it’s the classic probability class problems of flipping coins or rolling dice, or whether it’s watching 6,000 episodes of a game show to see whether any seat produces the most winners, or whether it’s counting the number of red traffic lights one gets during the commute — there are some outcomes which are reasonably likely, some which are unlikely, and some which are vanishingly improbable.
We have to decide that some outcomes have such a low probability of happening naturally that they represent something going on, and are not just the result of chance. How low that probability should be is our decision. There are some common dividing lines, but they’re common just because they represent numbers which human beings find to be nice round figures: five percent, one percent, half a percent, one-tenth of a percent. What significance level one picks depends on many factors, including what’s common in the field, how different the outcomes are expected to be, even what one can afford. Physicists looking for evidence of new subatomic particles have an extremely high standard before declaring something is definitely a new particle, but they can run particle detection experiments until they get such clear evidence.
To be fair, we ought to pick our significance level before we’ve worked out the probability of something happening, but this is the earliest I could discuss it with motivation for you to read about it. But if we take the five percent significance level, we already know that there’s a little more than a one and a half percent chance of there being as few clean sweeps as observed, which is well under five percent. The conclusion is obvious: all six winning contestants in an episode should have come from the same seat, over 6,000 episodes, more often than the one time Drew Carey claimed they had. We can start looking for explanations for why there should be this deficiency.
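The comparison against a significance level is short enough to write out. This is a Python sketch rather than the author's Octave; the function and variable names are made up for illustration. It takes the figures used in this series (about 6,000 episodes, a clean-sweep chance of about 1/1000, one sweep observed) and adds up the binomial chances of zero sweeps and of exactly one.

```python
def prob_at_most_one(n, p):
    """Chance of zero or one success in n independent attempts."""
    q = 1 - p
    return q**n + n * p * q**(n - 1)

n_episodes = 6000    # approximate number of episodes produced
p_sweep = 1 / 1000   # assumed chance of a clean sweep in one episode

prob = prob_at_most_one(n_episodes, p_sweep)
print(f"P(one clean sweep or fewer) = {prob:.4f}")

# Compare against two of the common dividing lines.
for alpha in (0.05, 0.01):
    verdict = "too few" if prob < alpha else "not too few"
    print(f"at the {alpha:.0%} level: {verdict} clean sweeps")
```

The same probability lands on different sides of the five percent and one percent lines, which is why the level ought to be chosen before the number is known.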
When last we talked about the “clean sweep” of winning contestants coming from the same one of the four seats in Contestants Row for all six Items Up For Bid on The Price Is Right, we had established the pieces needed if we suppose this to be a binomial distribution problem. That is, we suppose that any given episode has a probability, p, of successfully having all six contestants from the same seat, and a probability 1 – p of failing to have all six contestants from the same seat. There are N episodes, and we are interested in the chance of x of them being clean sweeps. From the production schedule we know the number of episodes N is about 6,000. We supposed the probability of a clean sweep to be about p = 1/1000, on the assumption that the chance of winning isn’t any better or worse for any contestant. The probability of there not being a clean sweep is then 1 – p = 999/1000. And we expected x = 6 clean sweeps, while Drew Carey claimed there had been only 1.
The chance of finding x successes out of N attempts, according to the binomial distribution, is the probability of any one particular combination of x successes and N – x failures, which is equal to p^x * (1 – p)^(N – x), times the number of ways there are to select x items out of N candidates. Either of those is easy enough to calculate, up to the point where we try calculating it. Let’s start out by supposing x to be the expected 6, and later we’ll look at it being 1 or other numbers.
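For the curious, here is how that product might be computed for x = 6, N = 6,000, and p = 1/1000. It is a Python sketch with made-up function names, not the author's Octave. The first version multiplies the pieces directly, leaning on Python's exact integers for the number of ways to choose x items out of N. The second works with logarithms, one common way around the very large and very small factors that make the direct product awkward on a calculator.

```python
from math import comb, exp, lgamma, log

def binom_pmf(x, n, p):
    """Chance of exactly x successes in n attempts, multiplied out directly."""
    return comb(n, x) * p**x * (1 - p)**(n - x)

def binom_pmf_log(x, n, p):
    """Same quantity, computed in log space to avoid huge intermediates."""
    # log of "n choose x", using lgamma(k + 1) = log(k!)
    log_ways = lgamma(n + 1) - lgamma(x + 1) - lgamma(n - x + 1)
    return exp(log_ways + x * log(p) + (n - x) * log(1 - p))

direct = binom_pmf(6, 6000, 1 / 1000)
via_logs = binom_pmf_log(6, 6000, 1 / 1000)
print(f"direct: {direct:.4f}   via logs: {via_logs:.4f}")
```

Both routes should agree to several decimal places. Swapping the 6 for a 1 gives the chance of exactly one clean sweep.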
Continue reading “A Simple Demonstration Which Does Not Clarify”