Why most statistics don’t mean what you think they do: Part II.

In Part I of this post, we started with a typical problem: which of two advertising campaigns was “better” in terms of generating more sales. Campaigns A and B were each tested for 20 days, during which time sales data was collected. The mean sales during Campaign A was $421 and the mean sales during Campaign B was $440.

Campaign B looks better on this evidence, doesn’t it? But suppose that instead of 20 days we ran the campaigns for only one day each, and that sales for A came to $421 and sales for B to $440. B still looks better, but our intuition tells us the evidence isn’t as strong, because the difference might be due to something other than the ad campaigns themselves. One day’s worth of data just isn’t enough to convince us that B is truly better. But is 20 days enough?

Maybe. How can we tell? This is where Statistics comes in, and it turns out to be no easy problem. But please stay with me, because failing to understand how to answer this question properly leads to the most common mistake made in statistics. If you routinely use statistical models to make decisions like this (“Which campaign should I go with?”, “Which drug is better?”, “Which product do customers really prefer?”), you’re probably making this mistake too.
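To make the intuition concrete, here is a minimal simulation sketch. The $50 daily standard deviation is made up, since the post does not give it; the question is simply how often two truly identical campaigns would differ by $19 or more over a 20-day test, by chance alone.

```python
import numpy as np

rng = np.random.default_rng(42)

# Hypothetical assumption: daily sales for BOTH campaigns come from the same
# normal distribution (mean $430, standard deviation $50), i.e. the campaigns
# are truly identical and any observed difference is noise.
n_days = 20
true_mean, true_sd = 430.0, 50.0

n_sims = 100_000
a = rng.normal(true_mean, true_sd, size=(n_sims, n_days))
b = rng.normal(true_mean, true_sd, size=(n_sims, n_days))

# How often do two identical campaigns differ by $19 or more in mean sales
# over a 20-day test, purely by chance?
diff = b.mean(axis=1) - a.mean(axis=1)
print((np.abs(diff) >= 19).mean())  # roughly 0.23 under these made-up numbers
```

With those assumed numbers, chance alone produces a gap that large about a quarter of the time, which is exactly why “is 20 days enough?” has no automatic answer.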

In Part I, we started by assuming that the (observable) sales data could be described by probability models. A probability model gives the chance that the data take any particular value. For example, we could calculate the probability that sales in Campaign A exceed $500. We usually write this in math symbols like this:

Pr(Sales in Campaign A > $500 | e)

Most of that formula should make sense to you, except for the right-hand side. The vertical bar, “|”, is the “given” bar: whatever appears to the right of it is accepted as true. The “e” stands for whatever evidence we have, or think is true. We can ignore that part for the moment, because what we really want to know is

Pr(Sales in B > Sales in A | data collected)

But that turns out to be a question that is impossible to answer using classical statistics!
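For contrast, here is a hedged sketch of how one might attack that question head-on with a simple Bayesian calculation. This is only one reading of the question (it compares the campaigns’ true mean sales, not the sales on individual days), and the daily standard deviations are again made up, because the post does not give the raw data.

```python
import numpy as np

rng = np.random.default_rng(1)

# Observed summaries from the post: 20 days each, mean A = $421, mean B = $440.
# The $50 daily standard deviations are NOT in the post; they are assumptions
# made so that the sketch is runnable.
n = 20
mean_a, sd_a = 421.0, 50.0
mean_b, sd_b = 440.0, 50.0

# Under a normal data model with vague (flat) priors, each campaign's true
# mean has a posterior that is a t distribution centred at the sample mean,
# with scale s / sqrt(n) and n - 1 degrees of freedom.
draws = 200_000
mu_a = mean_a + (sd_a / np.sqrt(n)) * rng.standard_t(n - 1, size=draws)
mu_b = mean_b + (sd_b / np.sqrt(n)) * rng.standard_t(n - 1, size=draws)

# One reading of "Pr(Sales in B > Sales in A | data collected)":
# the probability that B's true mean sales exceed A's.
print((mu_b > mu_a).mean())  # roughly 0.88 with these assumed numbers
```

Notice that even this direct answer is nowhere near certainty: under these made-up numbers, there is still a better than one-in-ten chance that A is the truly superior campaign.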

How to Exaggerate Your Results: Case study #2

The ad is a fairly typical one, now running on TV and also on Glad’s web site: roughly 70% of “consumers,” it tells us, preferred Glad’s bags over “the other leading brand.” Looks like a clear majority would rather buy Glad’s fine trash bag than some other, lesser, bag. Right?

Not exactly.

So what is the probability that a “consumer” would prefer a Glad bag? You’ll be forgiven if you said 70%. That is exactly what the advertiser wants you to think. But it is wrong, wrong, wrong. Why? Let’s parse the wording of the ad and see how you can learn to cheat from it.

The first notable phrase is “over the other leading brand.” This heavily implies, but of course does not absolutely prove, that Glad commissioned a market research firm to survey “consumers” about which trash bag they preferred. The best way to do this is simply to ask people, “What trash bag do you prefer?”

But evidently that is not what happened. Instead, the “consumer” was given a dichotomy: “Would you rather have Glad, or this other particular brand?” We have no idea what that other brand was, nor what was meant by “leading brand.” Do you suppose it’s possible that the advertiser gave in to temptation and chose, for his comparison bag, a truly crappy one? One that, in his opinion, is obviously inferior to Glad (but maybe cheaper)? It certainly is possible.
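To see how a head-to-head number can be inflated, here is a toy calculation with entirely made-up market shares. Nothing in it comes from Glad or the ad; it only illustrates the trick of picking a weak comparison.

```python
# Entirely hypothetical first-choice shares among the bags on the shelf.
first_choice = {"Glad": 0.35, "Brand X (weak comparison)": 0.10,
                "Brand Y": 0.30, "Brand Z": 0.25}

# Force a head-to-head against only Brand X. Everyone whose first choice is
# Glad picks Glad; suppose 70% of Brand Y and Brand Z loyalists also pick
# Glad, since Brand X is (by hypothesis) the crummier bag.
prefers_glad_over_x = first_choice["Glad"] + 0.7 * (first_choice["Brand Y"]
                                                    + first_choice["Brand Z"])

print(prefers_glad_over_x)   # 0.735 -- "over 70%" in the forced dichotomy
print(first_choice["Glad"])  # 0.35  -- the share whose first choice is Glad
```

In this made-up world, a perfectly honest head-to-head survey yields “70% prefer Glad” even though only about a third of consumers would actually reach for Glad first.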

So we already suspect that the 70% guess is off. But we’re not finished yet.