Statistics

Predictive Probability Resources, Why Frequentism Fails & Uncertainty

From the mailbag, reader I.W. writes:

Dear Professor,

I represent social science (but of a priori bent), and I recently got really hooked on frequentialism. After all, prof. Mises employed it as the only scientific idea of probability.

Then I saw your lecture on Youtube where you cast doubt on all this statistical method and I have to acknowledge I totally share your position.

For weeks, I am trying to discover the current status of good old frequentialism and how it relates to other concepts of probability.

My ultimate aim is to work out such an idea of probability that would tally finely with social science and accounts for the permetaing uncertainty there (as a side effect I would like to shed some light on the problem of risk and insurance, which Mises discusses too).

I do realize this letter is a bit chaotic but I was carried away by the lecture 🙂

Would you please refer me to the reading you mentioned at the end thereof?

I would most appreciate your help.

With kind regards,

[I.W.]

Well to the side of Truth and Beauty, I.W.!

A reminder to all. Please visit the Uncertainty book page here. Also, be sure to buy a copy of Uncertainty, too. Surprisingly, some people who make a living doing machine learning, AI, probability, statistics, and scientific modeling do not yet own a copy! It is a great mystery why not.

On the book page are a collection of articles where many fundamentals are discussed. There are also some reviews of Uncertainty of interest.

Next is the Class — Applied Statistics category tag. This is a collection of practical posts, some of which have R code so you can follow along. I mean to continue these, and have even done the code—but haven’t yet written the articles. Writing takes vastly more time than mere lectures.

Now as to frequentism, it is philosophically wrong, but it’s also easy to see why it’s adopted. It is philosophically wrong because of its insistence on actual infinities of (only) observable “events”. Consider that no probability can be known until the End of Time. Frequentism is defined only at the limit. Before the limit is only darkness.

At first it seems odd that people so readily believe in frequentism, since no actual infinity of events has ever been observed, thus no probabilities are known. The appeal is explained by noticing no one remembers the actual mathematical definition and acts as if probability is epistemic (some might say “logical”). Which it is: epistemic, that is.

It is true that once one assumes, based on subjective whim, what a frequentist probability is, then one can do lots of calculations. Think for instance of a binomial. Assume the unobservable-in-finite-time probability p is some fixed value, and it’s off the slide rule!

You might assume that you can estimate the value of this mysterious p based on some data. No.

Problem #1: the empiricist bias. Under frequentism, the only probabilities that can (in potentia) exist are empirical. Imagine limiting logic (or metaphysics) to only empirical examples!

Problem #2: Either the point estimate is absolutely certain, which nobody claims, or the confidence interval is a reasonable guess of the uncertainty of the estimate, which is false. And known and designed to be false under the theory! The only the sole the lone the one interpretation of the confidence interval allowed under frequentist theory is that the parameter is in the interval or it isn’t.

So why are estimates and confidence intervals taken to be of some value? Because, again, in practice, people treat them as if they were epistemic. Which they are. They are more-and-less good approximations to the right answers, which are found thinking everything is epistemic from the beginning. Strict frequentist thinking never survives beyond textbooks.

Of course, actual frequencies are of great use. They are informative. But observations are not probabilities. Frequencies inform probabilities.

Categories: Statistics

2 replies »

  1. Just a small note, one only owns a copy of the book if they bought the paper version. Amazon owns the Kindle version. They allow you to read it as long as you play nice and don’t tick them off.

    I suppose, for fun, we could figure out the probability of Amazon cutting off your Kindle reader and access to the books. What was that about confidence intervals and point estimates? No probabilities are actually known?? Oh, I am sooooo confused.

  2. Tools are used relative to a specific context. Arguing the merits & demerits of a tool independent of a particular context is next to meaningless.

    RE: Problem #1: the empiricist bias. Under frequentism, the only probabilities that can (in potentia) exist are empirical. Imagine limiting logic (or metaphysics) to only empirical examples!

    LIMITING metaphysics to only empirical examples would eliminate a whole lot of pseudoscience. …not to mention any number of quaint religious notions that depend on the suspension of critical thinking…

    Problem #2: Either the point estimate is absolutely certain, which nobody claims, or the confidence interval is a reasonable guess of the uncertainty of the estimate, which is false.

    THESE statements tell us something about the author:

    – An absolutely certain point estimate — For those of us who get out, or have, this is a well-known sign of cheating, of a “fix” and is a wonderful thing if one is on the receiving end of the payoff, or, at least picks up on the signs and avoids a game with a secret, but foreordained, outcome. Good negotiators, knowledgeable about their product area, are good at picking up such indicators of deceit/cover-up.

    – Confidence intervals very often DO reasonably reflect the uncertainty of estimates, and ARE often TRUE (not “false” as represented). This is exploited in manufacturing processes (when the observed results start drifting to & beyond the intervals is a sure sign the machinery needs adjustment). Not to mention Las Vegas casinos, which have fine-tuned their analysis of what those intervals are, and, have contrived the associated payouts (calculate the odds, then note the payoffs — the payoffs invariably reflect different odds from reality … but so many [most?, nearly all??] recreational gamblers believe the payoffs derive from the odds).

    The philosophical view might be likened to the “40,000 foot” perspective — a sweeping generalization that argues, independent of any specific context, why one tool is “better” than another.

    Tools are used relative to a specific context. Arguing the merits & demerits of a tool independent of a particular context is next to meaningless.

Leave a Reply

Your email address will not be published. Required fields are marked *