<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>William M. Briggs</title>
	<atom:link href="http://wmbriggs.com/blog/?feed=rss2" rel="self" type="application/rss+xml" />
	<link>http://wmbriggs.com/blog</link>
	<description>Statistician to the Stars!</description>
	<lastBuildDate>Wed, 19 Jun 2013 12:35:50 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.5.1</generator>
		<item>
		<title>All Probability Is Conditional: An Answer To Senn; Part IV</title>
		<link>http://wmbriggs.com/blog/?p=8379</link>
		<comments>http://wmbriggs.com/blog/?p=8379#comments</comments>
		<pubDate>Wed, 19 Jun 2013 12:35:50 +0000</pubDate>
		<dc:creator>Briggs</dc:creator>
				<category><![CDATA[Philosophy]]></category>
		<category><![CDATA[Statistics]]></category>

		<guid isPermaLink="false">http://wmbriggs.com/blog/?p=8379</guid>
		<description><![CDATA[Read Part III. Still with me? Hope so, because we&#8217;re only on the second page of Senn&#8217;s article (but don&#8217;t fret; we&#8217;ll be skipping most of it). Review: in logical-probability Bayes (as in all Aristotelian logic), we begin with a &#8230; <a class="more-link" href="http://wmbriggs.com/blog/?p=8379">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p><div class="wp-caption alignright" style="width: 330px"><img src="http://upload.wikimedia.org/wikipedia/commons/thumb/1/18/Bayes%27_Theorem_MMB_01.jpg/320px-Bayes%27_Theorem_MMB_01.jpg" width="320" height="205" class /><p class="wp-caption-text">A strictly incorrect way of writing Bayes&#8217;s theorem.</p></div><a href="http://wmbriggs.com/blog/?p=8368">Read Part III.</a></p>
<p>Still with me?   Hope so, because we&#8217;re only on the second page of Senn&#8217;s article (but don&#8217;t fret; we&#8217;ll be skipping most of it).</p>
<p>Review: in logical-probability Bayes (as in all Aristotelian logic), we begin with a list of premises (data, observations, evidence, or other synonymous term) and a proposition which is hoped to be related to the premises; from the premises we deduce the probability the proposition is true. Not all premises are sufficient to guarantee a numerical value, nor any value, nor any precise number: the probability could be stated merely in words, nonexistent, an interval, or a precise number.</p>
<p>Senn writes &#8220;we let <img src="//s0.wp.com/latex.php?latex=%5CPr%28A%29&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;Pr(A)" title="&#92;Pr(A)" class="latex" /> stand for the so-called marginal probability of an &#8216;event&#8217;, &#8216;statement&#8217; or &#8216;hypothesis&#8217; A and we let <img src="//s0.wp.com/latex.php?latex=%5CPr%28B%7CA%29&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;Pr(B|A)" title="&#92;Pr(B|A)" class="latex" /> stand for the conditional probability&#8230;of B <em>given</em> A.&#8221; (Note: I have edited the notation ever so slightly so that it will render well on a web page; I have not changed any meaning.)</p>
<p>As before, I already disagree.  There just is no such thing as unconditional probability, or probability without respect to any evidence, thus it never makes sense to write &#8220;<img src="//s0.wp.com/latex.php?latex=%5CPr%28A%29&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;Pr(A)" title="&#92;Pr(A)" class="latex" />.&#8221; We <em>can</em> write (say) <img src="//s0.wp.com/latex.php?latex=%5CPr%28A%7CE%29&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;Pr(A|E)" title="&#92;Pr(A|E)" class="latex" /> which is the probability of A (a proposition) given the evidence E (also a proposition, albeit possibly a complex one which includes data observations).  </p>
<p>Example: A = &#8220;A &#8217;6&#8242; shows&#8221; and E = &#8220;We have a Martian breen, etc.&#8221; (see the <a href="http://wmbriggs.com/blog/?p=8368">previous part</a> for an explanation).  But it makes no sense to ask, &#8220;What is the probability a &#8217;6&#8242; appears&#8221; without reference to something&#8212;whether it be a breen, a die, or something else.</p>
<p>Failure to recognize this creates another stumbling block in understanding probability. Probability is usually introduced as unconditional, and the complexity of conditioning follows some time later: but this is a mistake. There just is no such thing as unconditional probability.  Just as there is no unconditional truth (of a proposition, we always at least refer to our intuition or faith if not a list of premises.)  </p>
<p>Of course, inside a given problem, once we have E in hand, and it&#8217;s agreed to by all and always understood to be there, and for the simple ease of notation, there is no harm in writing <img src="//s0.wp.com/latex.php?latex=%5CPr%28A%29&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;Pr(A)" title="&#92;Pr(A)" class="latex" />.  But this should not be done until one is well used to logical probability, else it seems like probability is a <em>thing</em> and not a measure of knowledge. </p>
<p>Don&#8217;t think this is a big deal?  Oh, boy, is it ever, as we&#8217;ll see.</p>
<p>Senn introduces <img src="//s0.wp.com/latex.php?latex=H_i&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="H_i" title="H_i" class="latex" /> for a hypothesis, i.e. some proposition, indexed by i, as there may be more than one proposition which is true in some situation.  He adds a superscript T to indicate we believe <img src="//s0.wp.com/latex.php?latex=H_i&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="H_i" title="H_i" class="latex" /> is true, but I&#8217;ll skip this complication.  Whenever we see shorthand like <img src="//s0.wp.com/latex.php?latex=%5CPr%28H_i%7CE%29&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;Pr(H_i|E)" title="&#92;Pr(H_i|E)" class="latex" /> it means &#8220;The probability <img src="//s0.wp.com/latex.php?latex=H_i&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="H_i" title="H_i" class="latex" /> is true given E.&#8221;</p>
<p>Senn then writes (with my change in notation) &#8220;We suppose that we have some evidence E . If we are Bayesians we can assign a probability to any hypothesis <img src="//s0.wp.com/latex.php?latex=H_i&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="H_i" title="H_i" class="latex" /> and, indeed, to the conjunction of this truth, <img src="//s0.wp.com/latex.php?latex=H_i+%5C%26+E&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="H_i &#92;&amp; E" title="H_i &#92;&amp; E" class="latex" />, with evidence E.&#8221;</p>
<p>Subjective Bayesians would agree: logical probability Bayesians do not. Subjective Bayesians are as free with numerical probabilities as politicians are with other people&#8217;s money. Subjectivists &#8220;feel&#8221; probability is a matter of emotion because they fail to write down the conditioning premises.  They may arrive at a number (and bet using it: subjectivists are inveterate gamblers; at least in theory), but this does not mean the numbers they produce have any bearing on the probabilities unless it can be demonstrated their (unwritten) premises imply these (and no other) numerical values. There is much more to the errors subjectivists make, but given that they usually only make them in theory and not in many problems, where they usually agree with LPBs, we&#8217;ll let these go until another day.</p>
<p>LPBs are more Socratic and admit their ignorance.   The <a href="http://wmbriggs.com/blog/?p=7862">original series</a> proves the LBPs are right: we can not always discover a probability, and so we can not always discover a numerical value for a probability.  We can still manipulate the symbols until we get the <em>form</em> of an answer, but that doesn&#8217;t make the answer right. I believe Senn would agree with that.</p>
<p>Thus we can always write Bayes&#8217;s theorem, which Senn does:</p>
<p><img src="//s0.wp.com/latex.php?latex=%5CPr%28H_i+%7C+E%29+%3D+%5Cfrac%7B%5CPr%28H_i%29+%5CPr%28E+%7C+H_i+%29%7D%7B%5CPr%28E%29%7D&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;Pr(H_i | E) = &#92;frac{&#92;Pr(H_i) &#92;Pr(E | H_i )}{&#92;Pr(E)}" title="&#92;Pr(H_i | E) = &#92;frac{&#92;Pr(H_i) &#92;Pr(E | H_i )}{&#92;Pr(E)}" class="latex" />.</p>
<p>Spot the trouble?  What could it possibly mean to say <img src="//s0.wp.com/latex.php?latex=%5CPr%28E%29&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;Pr(E)" title="&#92;Pr(E)" class="latex" />? or even <img src="//s0.wp.com/latex.php?latex=%5CPr%28H_i%29&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;Pr(H_i)" title="&#92;Pr(H_i)" class="latex" />?   If we <em>knew</em> <img src="//s0.wp.com/latex.php?latex=%5CPr%28H_i%29&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;Pr(H_i)" title="&#92;Pr(H_i)" class="latex" /> then we would <em>know</em> <img src="//s0.wp.com/latex.php?latex=%5CPr%28H_i%29&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;Pr(H_i)" title="&#92;Pr(H_i)" class="latex" /> and thus we wouldn&#8217;t have to bother with any kind of experiment or other evidence or indeed anything.  We&#8217;d just write the answer down!  </p>
<p>We&#8217;re in just as much trouble with <img src="//s0.wp.com/latex.php?latex=%5CPr%28E%29&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;Pr(E)" title="&#92;Pr(E)" class="latex" />. How can we ask about the probability of evidence we just witnessed?  It has to be a probability of 1, right?, or it wouldn&#8217;t have happened! </p>
<p>The problem does not lie in Bayes&#8217;s theorem, which nobody disputes (how could they?), but in the way it is written and what the symbols mean.  Senn is right that Subjective Bayesians can (and do) say anything, but that doesn&#8217;t mean what they say has any bearing on reality (I&#8217;ll let you provide the politician comparison).</p>
<p>About that, more next time.</p>
<p><a href="https://twitter.com/mattstat" class="twitter-follow-button" data-show-count="false">Follow @mattstat</a><br />
<script>!function(d,s,id){var js,fjs=d.getElementsByTagName(s)[0];if(!d.getElementById(id)){js=d.createElement(s);js.id=id;js.src="//platform.twitter.com/widgets.js";fjs.parentNode.insertBefore(js,fjs);}}(document,"script","twitter-wjs");</script></p>
]]></content:encoded>
			<wfw:commentRss>http://wmbriggs.com/blog/?feed=rss2&#038;p=8379</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Probability Isn&#8217;t &#8220;Fair&#8221;: An Answer To Senn; Part III</title>
		<link>http://wmbriggs.com/blog/?p=8368</link>
		<comments>http://wmbriggs.com/blog/?p=8368#comments</comments>
		<pubDate>Tue, 18 Jun 2013 11:55:49 +0000</pubDate>
		<dc:creator>Briggs</dc:creator>
				<category><![CDATA[Philosophy]]></category>
		<category><![CDATA[Statistics]]></category>

		<guid isPermaLink="false">http://wmbriggs.com/blog/?p=8368</guid>
		<description><![CDATA[A quick reminder that we&#8217;re trying to unpack the meaning of the &#8220;is fair&#8221; in the proposition &#8220;This die is fair,&#8221; and trying to deduce the probability this proposition is true given (and only given) the evidence &#8220;This die has &#8230; <a class="more-link" href="http://wmbriggs.com/blog/?p=8368">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p><div class="wp-caption alignright" style="width: 165px"><img src="http://upload.wikimedia.org/wikipedia/commons/6/62/Dice01.jpg" width="155" height="219" class /><p class="wp-caption-text">Varying degrees of &#8220;fairness.&#8221;</p></div>A quick reminder that we&#8217;re trying to unpack the meaning of the &#8220;is fair&#8221; in the proposition &#8220;This die is fair,&#8221; and trying to deduce the probability this proposition is true given (and only given) the evidence &#8220;This die has been rolled five times and showed five &#8217;6&#8242;s.&#8221;  See the <a href=" http://wmbriggs.com/blog/?p=8362">previous installment</a> for why. </p>
<p>&#8220;Is fair&#8221; can take one of several definitions. Our predicament arises from not being clear which, and by mixing versions at different stages of the problem.</p>
<p>Meaning 1: In any finite number of tosses, the proportion of observed tosses will match the probabilities <em>deduced</em> from the first example; i.e., the observed proportions will show 1/6 &#8217;1&#8242;s, 1/6 &#8217;2&#8242;s, and so on, or whatever is closest to these if the number of tosses is not divisible by six. </p>
<p>Assuming Meaning 1, and given our evidence, we <em>deduce</em> the probability the proposition is true is 0; it is false. If the proposition were true, we should have seen some combination of five numbers with one missing (e.g. &#8217;6&#8242;, &#8217;3&#8242;, &#8217;5&#8242;, &#8217;1&#8242;, &#8217;4&#8242;); the missing could have been any number between &#8217;1&#8242; and &#8217;6.&#8217; (I keep the quotes around the outcomes to help us recall these are labels and not numbers.)</p>
<p>Meaning 2: In any finite number of tosses, the proportion of observed tosses will <em>approximately</em> equal the probabilities <em>deduced</em> from the first example; i.e., the proportions will approximately show 1/6 &#8217;1&#8242;s, 1/6 &#8217;2&#8242;s, and so on. </p>
<p>Assuming Meaning 2, and given our evidence, we deduce the probability the proposition is true is not calculable.  The probability is unknown&#8212;because &#8220;approximately&#8221; is not defined.  If &#8220;approximately&#8221; means (and I do not jest) &#8220;Leave me alone, I&#8217;m tired of playing dice&#8221; then the proposition is true, because the observed frequencies are more than close enough for somebody who doesn&#8217;t give a damn about dice.  If you fail to appreciate this example, you are in for tough times ahead; so pause here and make sure this sinks in.</p>
<p>If &#8220;approximately&#8221; means &#8220;not varying more than 5% from&#8221; then the proposition is deduced to be false because, of course, the observed proportions have differed by more than 5%. But if &#8220;approximately&#8221; means &#8220;not varying more than 90% from&#8221; then the proposition is deduced to be true, because the observed variations are within this bound.</p>
<p>Who gets to decide what &#8220;approximately&#8221; means?  Well, you do; as does Senn; as do I. Fights start over things like this.   What is <em>the</em> one and only definition of &#8220;approximately&#8221;?  There isn&#8217;t one! It depends on the situation.  As we saw, for some it could mean &#8220;Leave me alone&#8221;, for others, say casinos, it would be much tighter.  </p>
<p>Think this ambiguity bad?  It&#8217;s even worse than this.</p>
<p>Meaning 3: In any finite number of tosses greater than or equal to 6, the proportion of observed tosses will  equal the probabilities <em>deduced</em> from the first question; i.e., the proportions will be 1/6 &#8217;1&#8242;s, 1/6 &#8217;2&#8242;s, and so on, or whatever is closest to that if the number of tosses is not divisible by six. </p>
<p>Given this and our evidence, the proposition is not true or false (1 or 0) but somewhere in between because we haven&#8217;t yet reached the limit of 6 tosses.   Kind of. If the die were tossed just one more time (for 6 in total), then there is no way the observed proportions could equal the deduced probabilities.  The proposition would then be false. But the die <em>hasn&#8217;t been</em> tossed just one more time. It could be tossed 100 more times. Who knows?  But we still have the feeling, based on the observations, that the future tosses won&#8217;t bring the final proportions in line with the deduced probabilities (I keep repeating deduced to remind us these are not subjective guesses nor are they estimates).</p>
<p>Our evidence and assumed definition isn&#8217;t proof the proposition is false, especially if we consider it with respect to Meaning 4, which is the same as 3 but with &#8220;approximately&#8221; put in usual place.  Nor is the proposition true.  But we also don&#8217;t seem in a strong position to quantify the probability.  Nothing in the world wrong with that.  Not all probability is quantifiable. See the <a href="http://wmbriggs.com/blog/?p=7862">original series</a> for why.</p>
<p>If we insist on asking the original question, we&#8217;re left trying to understand what &#8220;is fair&#8221; could mean. We need to settle on a definite, unambiguous meaning before we can progress.  And then even if we do we&#8217;re going to be left will all kinds of nagging questions about real dice.</p>
<p>Real dice have weight distributed unevenly.  There&#8217;s no way to create perfect balance.  We can prove this easily: displaying the numbers, which are of different shape, creates an imbalance, however minuscule.  It might be possible to engineer a die down to the level of a quark, so that each side is precisely the same number of quarks across, and that the mass of the die is uniform at the Planck scale (except for the surface where the displays are).  In practice, for macro-scale dice, this is impossible.  But maybe some physicist will figure it out for some tiny thing.  Even then, he won&#8217;t be sure that the strings which comprise the quarks are the &#8220;same length&#8221; everywhere and uniformly (if that even makes sense to say). </p>
<p>But even supposing we have this toy, we have the problem of tossing it. How?  Onto what kind of surface?  From what height? How much spin? With what downward force?  In what gravitational field?  After all, if we want to discuss tossing a &#8220;fair&#8221; die, all these things have to be considered. Tossing is part of &#8220;fairness.&#8221;</p>
<p>It is at this point it dawns on us that we&#8217;re on a fool&#8217;s errand.  If the die were perfect, as we imagine (and as a logical die is in effect), and if the environmental conditions and forces were known precisely, then we&#8217;d know&#8212;before tossing&#8212;exactly what the outcome would be. Indeed, if the forces did not vary, the die would land the <em>same way</em> each time.  </p>
<p>Point is, just by our knowledge of physics we know that any real die and its tossing environment isn&#8217;t &#8220;fair&#8221; in any complete physical sense. There&#8217;s no point to the original question.  <em>No</em> real die (or its tosses) is &#8220;fair&#8221; in this sense. The proposition is contingent.</p>
<p>We&#8217;re asking the wrong question. What we really want to know is if the die is &#8220;fair enough&#8221;, and to answer that requires, as above, knowing what decisions we want to make regarding the die. </p>
<p>What we can do is to deduce the probability of seeing any arrangement of observations, either before seeing any observations whatsoever, or conditional on our initial knowledge of six-sided (logical) objects supplemented by a set of observations specified by the evidence.   (We do this using Bayes&#8217;s theorem: see the next Parts.)</p>
<p>In other words, we can then make statements like this, &#8220;Given our evidence about six-sided objects and the old observations, the probability of seeing departures of future observed proportions at least as great as X% from the deduced probabilities is Y.&#8221;   If Y exceeds a threshold, then we act as if the die is not &#8220;fair&#8221;, but if it is less than this threshold, we say it is.   The threshold varies depending on the application.  For the person sick to death of dice, X is unimportant and Y is quite low. Casinos want a small X and large Y for obvious reasons. </p>
<p>We&#8217;ll <em>never</em> have 100% certainty that any real die is &#8220;fair&#8221; in this (final) sense that Y = 0 (for vanishingly small X), because we knew before we started that question dealt with a contingent matter, and we are never 100% certain of contingent matters (though we can be 1 &#8211; &epsilon; certain).</p>
<p>And you&#8217;ll notice that nowhere did we confuse the observed proportions&#8212;i.e. the relative frequencies&#8212;as probabilities.   We <em>knew</em> the probabilities and used them to discern whether the relative frequencies were in line with the them; this is what we meant by &#8220;fairness.&#8221; </p>
<p>We have proved what we set out to show. That we don&#8217;t, at least for the kinds of examples that Senn provided, need two kinds of probability.  The one kind&#8212;probability as logic&#8212;was enough.</p>
<p>Yet there is still more to understand. Stick around!</p>
<p><strong>Update</strong>  We could also form statements like this: &#8220;Given our evidence and old observations, in the next n throws, there is probability Y of seeing X &#8217;1&#8242;s&#8221; and so forth.  In other words, this and the previous example are predictions, statements of uncertainty of the future (or of that which is as yet unseen). </p>
]]></content:encoded>
			<wfw:commentRss>http://wmbriggs.com/blog/?feed=rss2&#038;p=8368</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
		<item>
		<title>There Is Only One Kind Of Probability: An Answer To Senn; Part II</title>
		<link>http://wmbriggs.com/blog/?p=8362</link>
		<comments>http://wmbriggs.com/blog/?p=8362#comments</comments>
		<pubDate>Mon, 17 Jun 2013 12:08:03 +0000</pubDate>
		<dc:creator>Briggs</dc:creator>
				<category><![CDATA[Philosophy]]></category>
		<category><![CDATA[Statistics]]></category>

		<guid isPermaLink="false">http://wmbriggs.com/blog/?p=8362</guid>
		<description><![CDATA[Read Part I. Some of this material is explained in detail in this series. Just after the introduction, Senn starts his argument by claiming an &#8220;important distinction between two types of probabilities: direct and inverse.&#8221; The distinction is simply explained &#8230; <a class="more-link" href="http://wmbriggs.com/blog/?p=8362">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p><div class="wp-caption alignright" style="width: 330px"><img src="http://upload.wikimedia.org/wikipedia/commons/thumb/a/a5/6sided_dice.jpg/320px-6sided_dice.jpg" width="320" height="214" class /><p class="wp-caption-text">What are the chances a green die will land on top?</p></div><a href=" http://wmbriggs.com/blog/wp-admin/post.php?post=8346">Read Part I.</a>  Some of this material is explained in detail in <a href="http://wmbriggs.com/blog/?p=7862">this series</a>.</p>
<p>Just after the introduction, Senn starts his argument by claiming an &#8220;important distinction between two types of probabilities: direct and inverse.&#8221;</p>
<blockquote><p>
The distinction is simply explained by an example. The probability that five rolls of a fair die will show five sixes is an example of a direct probability&#8212;it is a probability from model to data. The probability that a die is fair given that it has been rolled five times to show five sixes is an inverse probability: it is a probability from data to model.
</p></blockquote>
<p>If we accept this distinction and example as written, we are already lost; all the standard confusions are there.  </p>
<p>If probability is all of one sort, then there is no distinction between &#8220;direct&#8221; and &#8220;inverse&#8221; kinds. Our candidate is logical probability, in which, as in just-plain-logic, there is only evidence (equivalently, premises), a proposition to be considered with respect to that evidence, and a probability this proposition is true deduced from the evidence. </p>
<p>Let&#8217;s begin by rewriting the examples. The evidence is what?  Trouble starts with the words &#8220;fair die&#8221;.  This is taken to mean that we have a real, physical, tangible object which <em>must</em>, when tossed, results in equal chance of any side face up.   This is asserted and not proved. It is a dictate.  It sets in the mind a view of an actual die, of the kind that cannot (or at least does not) exist.   Once this die is imagined, objections immediately arise: what if it <em>isn&#8217;t</em> &#8220;fair&#8221;? Can real dice be &#8220;fair&#8221;? What about imperfections?  The confusion between asserting a probability and wondering whether the asserted probability equals the &#8220;real&#8221; probability, i.e. the long-run frequency of tosses, is already ineradicable.  It becomes impossible to keep in mind what the real question is. </p>
<p>Start over rewriting all as a logical argument. &#8220;We have a six-sided (logical) object, just one side of which is labeled &#8217;1&#8242;, just one side of which is labeled &#8217;2&#8242;, and so on up to &#8217;6&#8242;, which when tossed must show just one of these sides.&#8221; No physical, real die is implied, though because of the ubiquity of dice-like examples, people usually think one is.  So if you find yourself unable to imagine a logical, i.e. non-physical, six-sided object, change it to a six-state Martian <em>bleen</em>, a device which is activated by tentacle and displays each time it is activated on a screen one and only one of the figures (translated into English) &#8217;1&#8242;, &#8217;2&#8242;, etc.  There is no hint&#8212;as in <em>no hint</em>&#8212;of the workings of this device.  All&#8212;as in <em>all</em>&#8212;we know is that the device when activated can show one of &#8217;1&#8242; through &#8217;6&#8242;; how it does so <em>is a mystery</em>. </p>
<p>I stress again (and again) that since there are no Martians, there are no bleens. Any imperfections we imagine in a bleen are our own creations and are not part of the evidence supplied. The key to LPB is that we must&#8212;as in <em>must</em>&#8212;use only the evidence supplied, and all of it, in our deductions of probability.  What is not directly implied from the given evidence must&#8212;as in, well, you get the idea&#8212;be ignored.</p>
<p>Now using the <a href="http://wmbriggs.com/blog/?p=7862">statistical syllogism</a> (which itself can be deduced from simpler principles), we deduce the probability a &#8217;6&#8242; shows on one activation of a bleen, just as we can deduce the probability of five &#8217;6&#8242; activations.   Or we can deduce anything which can happen in any (for now stick to finite) number of activations.  </p>
<p>We are done with the first example which ends with at a conditional probability; i.e. a probability deduced from given, fixed evidence.  All probability is likewise conditional.  If you think not, see the series linked above for examples, or see Part III tomorrow for more on this.</p>
<p>Notice that I do not use the word &#8220;model&#8221;. It isn&#8217;t needed.  Not here, and in far fewer cases than usually thought.</p>
<p>Senn&#8217;s second (&#8220;inverse&#8221;) example is also confusing. This asks the probability the following proposition is true: &#8220;<em>This</em> die is &#8216;fair&#8217;.&#8221; The only <em>written</em> evidence is &#8220;<em>This</em> die has been rolled five times and has showed five &#8217;6&#8242;s.&#8221;  That we are dealing with a real, physical die is implied from the words, but it is never stated.   But suppose this is wrong and Senn meant a logical die or a breen: then where would we be?</p>
<p>Right where we started. If this is the logical &#8220;die&#8221; or breen, then we start by <em>knowing</em> the chance each number is displayed is 1/6.  We end there, too. We have deduced &#8220;fairness.&#8221;  </p>
<p>So we must be talking of a physical, rea-life die.  Our task is to interpret this proposition with regard to the given observations.   </p>
<p>This evidence is easy and means just what it says: five rolls, five &#8217;6&#8242;s of some real die. The proposition is less clear. The subject makes sense: &#8220;This die&#8221; means some real, actual physical die.  The difficulty is with the verb: &#8220;is fair.&#8221;  </p>
<p>Ah, fairness. From youth we are told that there is nothing finer! Indeed, fairness is so fine that we discuss it next time.</p>
<p><a href="https://twitter.com/mattstat" class="twitter-follow-button" data-show-count="false">Follow @mattstat</a><br />
<script>!function(d,s,id){var js,fjs=d.getElementsByTagName(s)[0];if(!d.getElementById(id)){js=d.createElement(s);js.id=id;js.src="//platform.twitter.com/widgets.js";fjs.parentNode.insertBefore(js,fjs);}}(document,"script","twitter-wjs");</script></p>
]]></content:encoded>
			<wfw:commentRss>http://wmbriggs.com/blog/?feed=rss2&#038;p=8362</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Bayes Is More Than Probably Right: An Answer To Senn; Part I</title>
		<link>http://wmbriggs.com/blog/?p=8346</link>
		<comments>http://wmbriggs.com/blog/?p=8346#comments</comments>
		<pubDate>Sun, 16 Jun 2013 13:12:08 +0000</pubDate>
		<dc:creator>Briggs</dc:creator>
				<category><![CDATA[Philosophy]]></category>
		<category><![CDATA[Statistics]]></category>

		<guid isPermaLink="false">http://wmbriggs.com/blog/?p=8346</guid>
		<description><![CDATA[Stephen Senn very kindly answered a post I wrote on p-values (Unsignificant Statistics: Or Die P-Value, Die Die Die) by sending me his &#8220;You May Believe You Are a Bayesian But You Are Probably Wrong&#8221; (in Rationality, Markets and Morals). &#8230; <a class="more-link" href="http://wmbriggs.com/blog/?p=8346">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p><a href="http://errorstatistics.com/tag/stephen-senn/">Stephen Senn</a> very kindly answered a post I wrote on p-values (<a href="http://wmbriggs.com/blog/?p=8295">Unsignificant Statistics: Or Die P-Value, Die Die Die</a>) by sending me his &#8220;<a href="http://www.rmm-journal.de/downloads/Article_Senn.pdf">You May Believe You Are a Bayesian But You Are Probably Wrong</a>&#8221; (in <a href="http://www.rmm-journal.de/"><em>Rationality, Markets and Morals</em></a>).</p>
<p>Since I will be teaching at Cornell these two weeks, and the topics are the same, I will use part of this time to answer his paper in depth.</p>
<p>It would be best to start here <a href="http://wmbriggs.com/blog/?p=7862">Subjective <div id="attachment_8350" class="wp-caption alignright" style="width: 181px"><a href="http://wmbriggs.com/blog/wp-content/uploads/2013/06/senn.jpg"><img src="http://wmbriggs.com/blog/wp-content/uploads/2013/06/senn.jpg" alt="Stephen Senn" width="171" height="299" class="size-full wp-image-8350" /></a><p class="wp-caption-text">Stephen Senn</p></div>Versus Objective Bayes (Versus Frequentism): Part I</a>, since that series explains matters in greater detail.</p>
<p><strong>Probability</strong></p>
<p>Senn went wrong before he even began, with his title: &#8220;You May Believe You Are a Bayesian But You Are Probably Wrong.&#8221;  If you are only &#8220;probably wrong&#8221; about your belief then you also might be right.  And if you were certainly wrong, then we would have a proof which says so.  A proof is a string of deductions, i.e. a valid and sound argument, which begins with obviously true premises (agreed to by all) and ends at a proposition we must believe&#8212;even if we don&#8217;t want to.  </p>
<p>Senn does not have, nor does he claim to have, a proof which shows being a Bayesian is certainly wrong.  It is only his best guess that this philosophy is wrong. Probably wrong.  So here we are, already at probability.  What could Senn mean by his probabilistic statement &#8220;probably wrong&#8221;?  (Besides the pun, I mean.)  It can&#8217;t be any kind of frequentist statement, as in &#8220;I&#8217;ve collected a &#8216;random sample&#8217; of Bayesian philosophies, itself embedded in an infinite sequence of such philosophies, and the mean of this sample (considering errors in theory equal to zero) tends towards zero.&#8221;   That makes no kind of sense, as I&#8217;m sure Senn would agree, but it would have to if probability was frequentism.</p>
<p>Bayesian philosophy, at best, comes in a finite number of flavors.  It could be that some of these are false (I agree subjectivism, as it is usually understood, is), but in no way can we imagine any individual theory as being embedded in an infinite sequence of theories, which is required for frequentist theory to hold.  No: either we can prove each theory true or false, or our evidence is not (yet?) sufficient, and thus we are only probably sure each theory is true or false.  This sounds like a Bayesian statement, no? (If so, do we fail because of self-reference?  Well, no, because we can build this theory from simpler propositions.)  </p>
<p>It could be that Senn took a subjective Bayesian tack when he formed his title, or perhaps he took a logical probability, or objective, Bayesian one. (Incidentally, I&#8217;ll call this latter theory LPB for short.)  Or he could have meant some as yet unknown (or at least unidentified) theory. Whatever it was, it couldn&#8217;t have been frequentism, as shown. </p>
<p>His leading candidate is eclecticism (Senn is <em>not</em>  frequentist), which is one of two things. One is no belief at all. It means &#8220;I&#8217;ll do whatever I want whenever it seems good to me.&#8221; There is no theory here to disprove, nor prove. To say &#8220;I&#8217;m an eclectic&#8221; this way means &#8220;I don&#8217;t want to argue for anything, just against things.&#8221; Since we go nowhere engaging with this &#8220;theory&#8221;, we pass on to number two. This is to say, &#8220;I&#8217;ll take a little of that, some of this, and some of the other.&#8221; Here we have several sub-theories.  As such, this kind of eclecticism is actually a whole theory (the compilation of sub-theories) which might be true or false.  Thus Senn might have used Bayes for his title and he might use frequentism for (say) dice tosses.  </p>
<p>Senn recalls that Fisher himself was &#8220;skeptical&#8221; of attempts to unify probability. Hacking, another Big Cheese, in line with other well-aged curds, is of the same opinion.  Why should we have <em>a</em> theory? Why not many?  The obvious answer to this is that there is that which is true and that which is false and we should seek the truth. If it turns out a theory of probability works for all kinds of uncertainty, we&#8217;re stuck with it. If it must be that several theories are true, then we must accept them all.  But it&#8217;s wrong to use desire or suspicion as proof there are many and not one theory.</p>
<p>Senn himself proved that frequentism is out (and forever) as a complete theory of probability because it cannot handle propositions like his &#8220;probably wrong.&#8221;  But this isn&#8217;t proof that Bayes everywhere right; not yet.  Senn&#8217;s later examples might be sufficient to show all versions of Bayes are wrong, in which case some other theory must be true. </p>
<p>But we&#8217;ll have to see next time, because we&#8217;re already out of space, and because next topic isn&#8217;t simple. </p>
<p><a href="https://twitter.com/mattstat" class="twitter-follow-button" data-show-count="false">Follow @mattstat</a><br />
<script>!function(d,s,id){var js,fjs=d.getElementsByTagName(s)[0];if(!d.getElementById(id)){js=d.createElement(s);js.id=id;js.src="//platform.twitter.com/widgets.js";fjs.parentNode.insertBefore(js,fjs);}}(document,"script","twitter-wjs");</script></p>
]]></content:encoded>
			<wfw:commentRss>http://wmbriggs.com/blog/?feed=rss2&#038;p=8346</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
		<item>
		<title>Cornell Teaching Sojourn: Probability, Stats, &amp; R</title>
		<link>http://wmbriggs.com/blog/?p=8343</link>
		<comments>http://wmbriggs.com/blog/?p=8343#comments</comments>
		<pubDate>Sat, 15 Jun 2013 12:24:06 +0000</pubDate>
		<dc:creator>Briggs</dc:creator>
				<category><![CDATA[Statistics]]></category>

		<guid isPermaLink="false">http://wmbriggs.com/blog/?p=8343</guid>
		<description><![CDATA[Time for the annual migration to Ithaca via a well accoutered golden coach (complete with undergraduates feeding professors grapes grown at Cornell&#8217;s orchards). There I will linger for two weeks, ruling as benign and loving dictator over ILRST 5150, i.e. &#8230; <a class="more-link" href="http://wmbriggs.com/blog/?p=8343">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p><img src="http://www.ilr.cornell.edu/images/ILR_2line_b_4c.gif" width="264" height="75" class="alignright" />Time for the annual migration to Ithaca via a well accoutered <a href="http://transportation.fs.cornell.edu/coach/">golden coach</a> (complete with undergraduates feeding professors grapes grown at Cornell&#8217;s orchards). There I will linger for two weeks, ruling as benign and loving dictator over ILRST 5150, i.e. Statistical Research Methods in ILR&#8217;s <a href="http://www.ilr.cornell.edu/mpsnyc/index.html">MPS program</a>. </p>
<p>The class works by me holding forth with dulcet but brief pontifications followed by intense questioning of the students, as a cop might grill a suspect. &#8220;What did I just say? What in the dark-mattered universe do you think I meant by that? Have you signed up for the wine tour yet?&#8221;</p>
<p>The wine tour&#8212;completely unofficial and off the books&#8212;ends Week One with a journey to several Finger Lakes wineries to sample their wares.  To be cruelly honest, many of these are poor. If the wines aren&#8217;t sour and vinegary, they are so sweet you could stand a teaspoon up in them. One unbearable vineyard (the name of which is hidden in a riddle) produces nothing but pinkish paint thinner.  But everywhere the wines are wet and contain (among other chemicals) ethanol, which is welcome after five full days of statistics statistics statistics and with another week of the same to come. </p>
<p>(But there are dangers, too. At one stop on the wine trail, I was once nearly abducted by a bachelorette party and had to be rescued by one of my students.)</p>
<p>The class contains almost no math and certainly no memorization of formulas.  I figure the computer can do those things for you, and that time spent proving things mathematically removes time spent in understanding what probability is and learning the strengths and limitations of statistics.   As regular readers know, the latter are many, nefarious, and ubiquitous. </p>
<p>I have only one or two canned examples. The rest have to be provided by the students themselves. This eliminates having to figure out a whole new field and its data <em>and</em> how to describe its uncertainty.  Besides, textbook examples are far too neat, even coy.  Better to see how messy, compromising, and ambiguous collecting data is.  Gives a far better appreciation of the ease of making mistakes and the resultant over-confidence.</p>
<p>I teach R; successfully, too. Yes, it is a programming language, but that is its great advantage. I was able to teach R to a man who did not know what a spreadsheet was and could not type. He did not own a computer. This wasn&#8217;t because of my ability, but because learning the rudiments of any logical programming language is something almost anybody can do.  (I do <em>not</em> include SAS in this list; it is an appalling language.)</p>
<p>Following my custom, for the next two weeks posts will reflect, broadly or in detail, what is going on in the class. I won&#8217;t have time to do anything more.  Feel free to ask questions, but understand I might not be able to get to all of them.</p>
<p><a href="https://twitter.com/mattstat" class="twitter-follow-button" data-show-count="false">Follow @mattstat</a><br />
<script>!function(d,s,id){var js,fjs=d.getElementsByTagName(s)[0];if(!d.getElementById(id)){js=d.createElement(s);js.id=id;js.src="//platform.twitter.com/widgets.js";fjs.parentNode.insertBefore(js,fjs);}}(document,"script","twitter-wjs");</script></p>
<p><strong>Update</strong> A good joke.</p>
<blockquote class="twitter-tweet" width="550"><p>Father of the Martin Amis. (Original much superior to its remake.) <a href="https://twitter.com/search?q=%23swapwordinafilmtitlewithMartinAmis&amp;src=hash">#swapwordinafilmtitlewithMartinAmis</a></p>
<p>&mdash; William M. Briggs (@mattstat) <a href="https://twitter.com/mattstat/statuses/345582194380636160">June 14, 2013</a></p></blockquote>
<p><script async src="//platform.twitter.com/widgets.js" charset="utf-8"></script></p>
]]></content:encoded>
			<wfw:commentRss>http://wmbriggs.com/blog/?feed=rss2&#038;p=8343</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Tobacco Ads Could Lead To Cancer Cure</title>
		<link>http://wmbriggs.com/blog/?p=8335</link>
		<comments>http://wmbriggs.com/blog/?p=8335#comments</comments>
		<pubDate>Fri, 14 Jun 2013 12:50:31 +0000</pubDate>
		<dc:creator>Briggs</dc:creator>
				<category><![CDATA[Philosophy]]></category>
		<category><![CDATA[Statistics]]></category>

		<guid isPermaLink="false">http://wmbriggs.com/blog/?p=8335</guid>
		<description><![CDATA[Today&#8217;s headline is true. True means that which is certain, without the possibility of error; that which is not false; that which accords with reality. It means that which is so even if you don&#8217;t want it to be; even, &#8230; <a class="more-link" href="http://wmbriggs.com/blog/?p=8335">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p><div id="attachment_8336" class="wp-caption alignright" style="width: 350px"><a href="http://wmbriggs.com/blog/wp-content/uploads/2013/06/briggs.pipe_.jpg"><img src="http://wmbriggs.com/blog/wp-content/uploads/2013/06/briggs.pipe_.jpg" alt="A sure cure for cancer?" width="340" height="270" class="size-full wp-image-8336" /></a><p class="wp-caption-text">A sure cure for cancer?</p></div>Today&#8217;s headline is true. <em>True</em> means that which is certain, without the possibility of error; that which is not false; that which accords with reality.  It means that which is so even if you don&#8217;t want it to be; even, that is, if you have attended a sacred Raising Awareness ceremony about the evils of tobacco.</p>
<p>Yes: it is <em>true</em> that the next tobacco ad you see could lead to a cure for cancer. How?  </p>
<p>I haven&#8217;t the slightest idea. I don&#8217;t have to know how, either. The headline is still true as long as somebody, sometime, somewhere could describe how, even if the description is only &#8220;in theory.&#8221; </p>
<p>My powers of imagination are weak, so I&#8217;ll rely on you to divine the path from tobacco ad to cancer cure.  What I&#8217;ll do instead is distract you from thinking about this difficulty and talk about the glories of a cancer-free world.</p>
<p>Hey!  No more cancer!  Now that would be a fine thing. Right? No more pain, no suffering, no tears, grief, misery. No more mothers burying their blighted-by-disease daughters. </p>
<p>Like Sally Q. Evalston, 42, a Pinewood, Illinois elementary school teacher, beloved three-time winner of Teacher of the Year, who was carried away before her time with capital-C Cancer (which she &#8220;battled&#8221; with). Just you think about her. Look at her picture, feel for her mother, weep with her students. </p>
<p>This is the sort of tragedy that could be avoided thanks to our truthful headline.  Admit it: you feel good thinking about this, don&#8217;t you?  Isn&#8217;t it nice to be part of the cure for cancer, albeit in small proportion?  Maybe you can email your Congressman (<a href="http://wmbriggs.com/blog/?p=8226&#038;cpage=1#comment-95500">or woman</a>!) and let him know you&#8217;re on his side, that you&#8217;d support him if he voted to increase funding for tobacco advertising. You could at least frown with severe disapproval at the next person you meet who suggests he&#8217;d rather not see more tobacco ads.</p>
<p>Assimilated all that?   Then here is another true headline, &#8220;<a href="http://www.hngn.com/articles/5243/20130613/tobacco-ads-lead-daily-teen-smoking-kids-14-under.htm">Tobacco Ads Could Lead to Daily Teen Smoking for Kids 14 and Under</a>&#8220;.</p>
<p>Wait. Didn&#8217;t we just say that tobacco ads could cure cancer and now a rival claim says these same ads could cause kiddies to smoke?  We did: both headlines are true.  And so is is true that &#8220;Tobacco Ads Could Lead To More Cancer&#8221;.  Just as it is true as true that &#8220;Tobacco Ads Could Lead To Mars Mission&#8221; or &#8220;Tobacco Ads Could Cause <a href="http://www.lifesitenews.com/news/nancy-pelosi-issue-of-late-term-abortions-is-sacred-ground">Nancy Pelosi</a> To Stop Speaking Gibberish.&#8221;</p>
<p>The magic happens in <em>could</em>.  Adding it&#8212;or <em>might</em>, <em>may</em>, <em>possibly</em> or the like&#8212;turns any proposition about the contingent into a truth. (<em>Contingent</em> = not logically necessary.)  Anything contingent <em>could</em> or <em>might</em> be true; that is the nature of contingency.  So adding a word like <em>could</em> in a contingent proposition merely makes the proposition tautological, and all tautologies are true. </p>
<p>Headlines like today&#8217;s are cheap journalist tricks; one of the most common, too.  &#8220;Could Lead To&#8221; headlines and ledes betray the reporter&#8217;s prejudices and desires and make at best weak claims about reality.   And the following articles usually fall prey to the standard human failing of searching only for supportive evidence, assuming that contradictory theories are the first refuge of scoundrels and &#8220;deniers.&#8221;  No idea of the uncertainty in the claim of the headline ever appears.</p>
<p>Just for fun, I did a search on &#8220;Could Lead To&#8221; (surrounded by quotes; try this yourself).  &#8220;Repetitive soccer ball &#8216;heading&#8217; could lead to brain injury&#8221;, &#8220;10 nail deformities that could lead to bigger health problems&#8221;, &#8220;Heavy rain could lead to explosion in mosquito population&#8221;, &#8220;NYCHA Budget Cuts Could Lead To 500 Jobs Lost&#8221;, &#8220;Crowdfunding help could lead to a sandwich named after you&#8221;, &#8220;NHS changes could lead to hospital being sponsored by junk food firms.&#8221;  An endless, ever-increasing stream.</p>
<p>And isn&#8217;t it curious that all of these, tacitly or directly, argue for government intervention?</p>
<p><a href="https://twitter.com/mattstat" class="twitter-follow-button" data-show-count="false">Follow @mattstat</a><br />
<script>!function(d,s,id){var js,fjs=d.getElementsByTagName(s)[0];if(!d.getElementById(id)){js=d.createElement(s);js.id=id;js.src="//platform.twitter.com/widgets.js";fjs.parentNode.insertBefore(js,fjs);}}(document,"script","twitter-wjs");</script></p>
]]></content:encoded>
			<wfw:commentRss>http://wmbriggs.com/blog/?feed=rss2&#038;p=8335</wfw:commentRss>
		<slash:comments>12</slash:comments>
		</item>
		<item>
		<title>Unsignificant Statistics: Or Die P-Value, Die Die Die</title>
		<link>http://wmbriggs.com/blog/?p=8295</link>
		<comments>http://wmbriggs.com/blog/?p=8295#comments</comments>
		<pubDate>Thu, 13 Jun 2013 12:27:18 +0000</pubDate>
		<dc:creator>Briggs</dc:creator>
				<category><![CDATA[Statistics]]></category>

		<guid isPermaLink="false">http://wmbriggs.com/blog/?p=8295</guid>
		<description><![CDATA[Must&#8230;resist&#8230;quoting&#8230; from Stephen Ziliak&#8217;s gorgeous invective &#8220;Unsignificant Statistics&#8221; (where I stole today&#8217;s title) in the Financial Post. Well, just a little (all emphasis mine and joyfully placed): Statistical significance is junk science, and its big piles of nonsense are spoiling &#8230; <a class="more-link" href="http://wmbriggs.com/blog/?p=8295">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p><div class="wp-caption alignright" style="width: 425px"><img src="http://stephentziliak.com/images/StephenZiliak.jpg" width="415" height="250" class /><p class="wp-caption-text">&#8220;My p-value was this big.&#8221;</p></div><em>Must&#8230;resist&#8230;quoting&#8230;</em> from <a href="http://stephentziliak.com/">Stephen Ziliak&#8217;s</a> gorgeous invective &#8220;<a href="http://opinion.financialpost.com/2013/06/10/junk-science-week-unsignificant-statistics/">Unsignificant Statistics</a>&#8221; (where I stole today&#8217;s title) in the <em>Financial Post</em>.  </p>
<p>Well, just a little (all emphasis mine and joyfully placed):</p>
<blockquote><p>
<strong>Statistical significance is junk science</strong>, and its <strong>big piles of nonsense</strong> are spoiling the research of more than particle physicists&#8230;</p>
<p>But here is something you can believe, and will want to: <strong>Statistical significance stinks</strong>&#8230;</p>
<p>The null hypothesis test procedure is not the only test of significance but it is the most commonly used and <strong>abused</strong> of all the tests. From the get go, the test of statistical significance asks <strong>the wrong question</strong>&#8230;</p>
<p>In framing the quantitative question the way they do, the significance-testing scientists have <strong>unknowingly reversed</strong> the fundamental equation of statistics. Believe it or not, they have transposed their hypothesis and data, forcing them to <strong>grossly distort</strong> the magnitudes of probable events&#8230;</p>
<p>They have fallen for a <strong>mistaken logic</strong> called in statistics the &#8220;fallacy of the transposed conditional.&#8221;
</p></blockquote>
<p>And that&#8217;s just the first part. I couldn&#8217;t finish the second because my eyes were overflowing with happy tears. </p>
<p>Ziliak and pal Deirdre McCloskey, incidentally, co-authored the must-read <a href="http://sites.roosevelt.edu/sziliak/home/cult-of-statistical-significance/"><em>The Cult of Statistical Significance</em></a>. </p>
<p><em>Cult</em>, they say.  Cult because there is an initiation at high price. Cult because statistical &#8220;significance&#8221; is invoked by occult incantations, the meaning of which has been lost in the mists of time.  Cult because these things can <em>not</em> be questioned!</p>
<p>The p-value is a mysterious, magical threshold, an entity which lives, breathes, and gazes sternly over spreadsheets; a number gifted to us by the great, mysterious god Stochastikos<sup>1</sup>.  It was he who decreed that great saying, &#8220;Oh-point-oh-five and thrive; Oh-point-oh-six and nix.&#8221;</p>
<p>Adepts know the meaning of this shorthand.  So 0.050000001 is sufficient to cast a result outside the gates where there is weeping and gnashing of teeth.  Yet 0.04999999 produces bliss of the kind had when the IRS decides not to audit.</p>
<p>Members cannot be identified by dress but by their manner of speaking. Clues are evasiveness and glib over-confidence. They say, &#8220;The probability my hypothesis is true is Amen&#8221; when what they mean is &#8220;Given my hypothesis is false, here is the value of an obscure function&#8212;one of many I could have picked&#8212;applied to the data assuming the model which quantifies its uncertainty is certainly true and that one of its parameters is set to zero and assuming I could regather my data in the same manner but randomly different <em>ad infinitum</em>.&#8221;</p>
<p>In the hands of a master, more significant p-values can be squeezed out of a set of data than donations <a href="http://www.thedailybeast.com/articles/2013/06/03/al-sharpton-s-long-bill-of-goods-from-tawana-brawley-to-primetime.html">Al Sharpton</a> can secure by marching into an all-white corporation&#8217;s board room.</p>
<p>&#8220;Statistically significant&#8221; does not imply true nor useful nor even interesting.  &#8220;Significance&#8221; is a fog which emanates from a computerized thurible, thick and pungent.  It obscures and conceals.  It woos and insinuates. It distracts.  It is a mathematical sleight-of-hand, a trick. It takes the eye from the direct evidence at hand and refocuses it on the pyrotechnics of p-values.  So delighted is the audience at seeing wee p-values that all memory of the point of a study vanishes. </p>
<p>Statistical significance is so powerful that it can prove both a hypothesis and its contrary simultaneously.  One day it pronounces broccoli as the awful cause of splentic fever and tomorrow it is asserts unequivocally that broccoli is the only sane cure for the disease. </p>
<p>Both results will be accepted and believed, especially by those manning (<a href="http://wmbriggs.com/blog/?p=8226&#038;cpage=1#comment-95500">and womanning!</a>) bureaucracies and press rooms.  Journalists won&#8217;t tell you about the deadly effect of either until 10 p.m.  Government minions will latch gratefully on to anything &#8220;significant&#8221; as proof their budget (and therefore power) should be increased.</p>
<p>Time for statistical significance to be slain, its bones cremated, and its ashes scattered in secret.  No trace should remain lest the infection re-spread.  The only word of it should appear in Latin in tomes guarded by monks charged with collecting man&#8217;s (and woman&#8217;s!) intellectual follies.  </p>
<p><a href="https://twitter.com/mattstat" class="twitter-follow-button" data-show-count="false">Follow @mattstat</a><br />
<script>!function(d,s,id){var js,fjs=d.getElementsByTagName(s)[0];if(!d.getElementById(id)){js=d.createElement(s);js.id=id;js.src="//platform.twitter.com/widgets.js";fjs.parentNode.insertBefore(js,fjs);}}(document,"script","twitter-wjs");</script></p>
<p><strong>Update</strong> Wuhahaha! </p>
<p>&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;</p>
<p><em>Thanks to Steve E for finding Ziliak&#8217;s piece.</em></p>
<p><sup>1</sup>I didn&#8217;t think of this; I recall the name from the old usenet days.</p>
]]></content:encoded>
			<wfw:commentRss>http://wmbriggs.com/blog/?feed=rss2&#038;p=8295</wfw:commentRss>
		<slash:comments>25</slash:comments>
		</item>
		<item>
		<title>Scientists Discover Way To Increase Publication Count</title>
		<link>http://wmbriggs.com/blog/?p=8226</link>
		<comments>http://wmbriggs.com/blog/?p=8226#comments</comments>
		<pubDate>Wed, 12 Jun 2013 12:46:03 +0000</pubDate>
		<dc:creator>Briggs</dc:creator>
				<category><![CDATA[Philosophy]]></category>
		<category><![CDATA[Statistics]]></category>

		<guid isPermaLink="false">http://wmbriggs.com/blog/?p=8226</guid>
		<description><![CDATA[Anybody who has spent any time in a university library amidst the papers of his specialty knows that the absolute last thing which is needed is more of them. Journals abound and apparently breed&#8212;asexually, by dividing&#8212;when librarians turn their heads. &#8230; <a class="more-link" href="http://wmbriggs.com/blog/?p=8226">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p><a href="http://wmbriggs.com/blog/wp-content/uploads/2013/06/publish.perish.gif"><img src="http://wmbriggs.com/blog/wp-content/uploads/2013/06/publish.perish.gif" alt="publish.perish" width="340" height="449" class="alignright size-full wp-image-8261" /></a>Anybody who has spent any time in a university library amidst the papers of his specialty knows that the absolute last thing which is needed is more of them.  Journals abound and apparently breed&#8212;asexually, by dividing&#8212;when librarians turn their heads. </p>
<p>The reason is obvious: academics must publish whether they want to or not, whether or not they have anything useful to say, and whether or not anybody reads what they write.  </p>
<p>The glut appears across all areas of knowledge, but the effects are different in the humanities and sciences.  In the former, the world would be a far better place had many of its practitioners obeyed the ancient truism that silence is golden.  Over-supply in the sciences is less troublesome because poor and inconsequential works are ignored.  The presence of this chaff only makes it difficult to discover the wheat.</p>
<p>In the humanities (which I take to incorporate the gooier sciences, like education) one can say anything, <a href="http://wmbriggs.com/blog/?p=8258">the more outr&eacute; the better</a>.  Not so in the hard sciences where at least some passing resemblance to the truth is expected. </p>
<p>Too much resemblance, as a matter of fact.  Editors, reviews, and authors follow a rigid positivistic philosophy: only good news shall find its way into print! Papers with &#8220;statistically significant&#8221; effects are vastly likelier to be published than are works which admit there&#8217;s nothing to see.  Failures with billets are as rare as Republicans in English departments.  </p>
<p>Then because traditional statistical methods used are fertile in labeling results positive, even when they are not, there exists a tremendous publication bias.  Many false things are believed true.  </p>
<p>All this is known and of concern to the seventy-plus signatories to the article <a href="http://www.guardian.co.uk/science/blog/2013/jun/05/trust-in-science-study-pre-registration">Trust in science would be improved by study pre-registration</a> in <em>The Guardian.</em>  This open letter proclaims &#8220;We must encourage scientific journals to accept studies before the results are in.&#8221;</p>
<p>The eminences lament publish and perish and say the &#8220;publishing culture is toxic to science.&#8221;  </p>
<blockquote><p>
Recent studies have shown how intense career pressures encourage life scientists to engage in a range of questionable practices to generate publications &#8212; behaviours such as cherry-picking data or analyses that allow clear narratives to be presented, reinventing the aims of a study after it has finished to &#8220;predict&#8221; unexpected findings, and failing to ensure adequate statistical power. These are not the actions of a small minority; they are common, and result from the environment and incentive structures that most scientists work within.
</p></blockquote>
<p>It&#8217;s worse than just that.  &#8220;[J]ournals incentivise bad practice by favouring the publication of results that are considered to be positive, novel, neat and eye-catching.&#8221;  Although there is no conceivable universe where the string of letters which comprise &#8220;incentivise&#8221; should be used when ladies are present, we cannot help but agree that the situation is grim.</p>
<p>The &#8220;file-drawer&#8221; problem adds to the misery.  This is when a study which is not a success or isn&#8217;t sexy or part of the consensus rests in a lonely file in the forgotten reaches of a scientist&#8217;s computer.  Lack of negative results in print gives an over-optimistic picture of scientific progress.</p>
<p>The solution the <em>Guardian</em> writers have is to publish &#8220;pre-registration&#8221; papers, outlines of the studies which are not yet conducted.  Journal which air these outlines must agree to publish the eventual results whatever they may be.   Thus &#8220;questionable practices to increase &#8216;publishability&#8217;&#8221; will be &#8220;greatly reduced.&#8221;</p>
<p>I doubt it.  Authors will still aim for high &#8220;impact factor&#8221; journals for their &#8220;pre-registrations.&#8221;  The &#8220;impact factor&#8221;, incidentally, is an &#8220;arguably meaningless as an indicator of scientific quality&#8221;, though always a matter of bragging rights. </p>
<p>There will be a minor flood of papers pre-registering sketchy theories, and these will be all that is remembered. Some authors will publish their negative results, but many will forget them and move on to more fertile grounds.   The bulk of these maybe-so works will be taken as positive evidence even if positive effects are never found or if negative effects are published.  </p>
<p>Journalists, by nature not very inquisitive, will tout &#8220;If these promised results hold&#8230;&#8221;, and again these reports will be all that is remembered.  Retractions will never appear. What&#8217;s to retract?</p>
<p>And worst of all will be the huge increase in papers that must be navigated to get to the good stuff.  Pre-registration papers will only be &#8220;read&#8221;&#8212;i.e. their abstracts will be glanced at on PubMed&#8212;by other authors looking to pad their bibliographies.</p>
<p>No. The real solution is to judge a fellow by the quality and promise of his work, not by its quantity, and not by even a hint of a numerical rating. </p>
<p><a href="https://twitter.com/mattstat" class="twitter-follow-button" data-show-count="false">Follow @mattstat</a><br />
<script>!function(d,s,id){var js,fjs=d.getElementsByTagName(s)[0];if(!d.getElementById(id)){js=d.createElement(s);js.id=id;js.src="//platform.twitter.com/widgets.js";fjs.parentNode.insertBefore(js,fjs);}}(document,"script","twitter-wjs");</script></p>
<p>&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;&#8211;</p>
<p><em>Thanks to Bryan Davies for pointing us to this.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://wmbriggs.com/blog/?feed=rss2&#038;p=8226</wfw:commentRss>
		<slash:comments>18</slash:comments>
		</item>
		<item>
		<title>Krauthammer&#8217;s Wrong: NSA Spying Not Equivalent To Policing</title>
		<link>http://wmbriggs.com/blog/?p=8286</link>
		<comments>http://wmbriggs.com/blog/?p=8286#comments</comments>
		<pubDate>Tue, 11 Jun 2013 12:34:06 +0000</pubDate>
		<dc:creator>Briggs</dc:creator>
				<category><![CDATA[Statistics]]></category>

		<guid isPermaLink="false">http://wmbriggs.com/blog/?p=8286</guid>
		<description><![CDATA[Charles Krauthammer ought to be taken seriously. He said yesterday from his daily perch that he wasn&#8217;t concerned about NSA spying on Americans because there hasn&#8217;t been any or many complaints or other evidence of abuse, besides the latest. No &#8230; <a class="more-link" href="http://wmbriggs.com/blog/?p=8286">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p><div class="wp-caption alignright" style="width: 250px"><img src="http://upload.wikimedia.org/wikipedia/commons/thumb/0/04/National_Security_Agency.svg/240px-National_Security_Agency.svg.png" width="240" height="240" class /><p class="wp-caption-text">We&#8217;re spying on you for your own good.</p></div>Charles Krauthammer ought to be taken seriously.  He said yesterday from his daily perch that he wasn&#8217;t concerned about NSA spying on Americans because there hasn&#8217;t been any or many complaints or other evidence of abuse, besides the latest.  No complaints, no harm.  </p>
<p>He compared NSA&#8217;s sucking up data to the police carrying guns and using other tools, and that these guns and tools can be used maliciously by those sworn to protect us, and sometimes are, but not to the extent we would disarm cops. </p>
<p>He didn&#8217;t use the word &#8220;spying.&#8221;</p>
<p>The analogy is poor and if considered at length supports the opposite conclusion: NSA ought not to spy on citizens without probable cause.  Which is to say, NSA, and every other government agency, ought to follow the Fourth Amendment in its letter and spirit.</p>
<p>If the police suspect a man of crime they must provide sufficient and lengthy evidence to a judge <em>before</em> receiving a search warrant to implement a wire tap (and equivalents).  <em>After</em> the warrant is issued, the tap is put in place.  Contrariwise, the NSA wire taps everybody <em>first</em>, stores the information, and then swears not to look at it until it receives a warrant from a court with the sole purpose of handing out warrants.</p>
<p>This violates the <a href="http://wmbriggs.com/blog/?p=8278">spirit and letter of the Constitution</a>.  Whether our dear leaders come to this same conclusion and voluntarily renounce the power they have awarded themselves is a separate question.  Smart money says no: smarter money says these programs will intensify.</p>
<p>Add to this knowledge that the chief law enforcement officer of the country was discovered lying to both Congress and a judge for political reasons, accusing one of his enemies falsely of treason and spying, a double irony (but unfortunately no record).</p>
<p>Police carry guns because crooks do.  Except in a very few enlightened locales, such as blood-stained Chicago, any citizen may arm and protect himself in the absence of police. The armament on both sides is in rough parity, with the police holding a slight edge. </p>
<p>There is no equivalent in this case. No citizen can compete with or evade a Dark Star or PRISM, the 007-enemy-like names our beneficent government chose for its programs (this was also Krauthammer&#8217;s observation).  All a citizen can do is avoid anything electronic: no cell phone, no computer, no medical records (which the IRS will soon have), no credit card, no car, and no heat signature&#8212;drones will track these.  Remember drones? Only proles and animals are free.</p>
<p>The best comparison is between Stop &#038; Frisk and NSA surveillance.  Under Stop &#038; Frisk, cops patrol high-crime areas and detain and search individuals who meet minimal suspicion standards.  Minimal is not none.  The visible presence of the police is often a sufficient deterrent. Records of Stop &#038; Frisk cannot be used for personal identification. </p>
<p>The NSA can scan records which are minimally suspicious, too.  But to do that, they must first collect information on everybody, including people who meet no standard of suspicion. A person frisked on the street knows what has happened to him. The citizen whose records are pored over remains ignorant.  The former can complain of abuse, the latter cannot.  This is why there are no complaints of harassment. </p>
<p>Revealingly, the government does not claim electronic snooping does deters, only that it discovers.  Perhaps people would feel better were they to learn the spying programs have uncovered innumerable foreign spies and stopped countless attacks.  The government is understandably reticent to admit these successes, assuming they exist, fearing its enemies will backward engineer its methods and thus countervail them. </p>
<p>However, there is good reason to think the number of these successes is low. For one, our current administration loves to brag, and it has not about uncovering spies. And the <a href="http://wmbriggs.com/blog/?p=8239">failures have been many</a>.  The Boston bombers, the shoe and underwear bombers, Mumbai, Benghazi, England&#8217;s bombings; many others including Snowden himself.  Contradictorily, these will encourage governments to <em>increase</em> its powers rather than admit its programs don&#8217;t work.</p>
<p>I had, in the 1980s, a Top Secret SCI etc. etc. clearance, natural for a USAF Sergeant in cryptography. The clearance came with endless briefings about traitorous spies and the damage they caused. These were mostly useless as training to spot spies, because all we learned was that spies excel at hiding.  But because spies existed and the government feared them, it thought it should screen for them using lie detectors.  </p>
<p>These were not useless: they were able to produce many accusations, all (or nearly all) false. Real spies pass polygraphs. (Incidentally, the attack is psychological: &#8220;There seems to be a problem with question seven, Sergeant Briggs. Can you help me with that?&#8221; &#8220;Nope.&#8221; &#8220;Are you sure?&#8221; &#8220;Yep.&#8221; &#8220;You are dismissed.&#8221; Always the same.)  </p>
<p>It&#8217;s a good guess the same will and is happening with NSA&#8217;s terrorist screening. The failures suggest screening is too blunt a tool to catch bad guys, especially well organized and intelligent adversaries. </p>
<p><a href="https://twitter.com/mattstat" class="twitter-follow-button" data-show-count="false">Follow @mattstat</a><br />
<script>!function(d,s,id){var js,fjs=d.getElementsByTagName(s)[0];if(!d.getElementById(id)){js=d.createElement(s);js.id=id;js.src="//platform.twitter.com/widgets.js";fjs.parentNode.insertBefore(js,fjs);}}(document,"script","twitter-wjs");</script></p>
<p><strong>Update</strong> Recommended reading.  <a href="http://www.nationalreview.com/article/350609/liberty-tentacular-state-charles-c-w-cooke">Liberty in the Tentacular State</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://wmbriggs.com/blog/?feed=rss2&#038;p=8286</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>You Do Too Have Something To Hide</title>
		<link>http://wmbriggs.com/blog/?p=8278</link>
		<comments>http://wmbriggs.com/blog/?p=8278#comments</comments>
		<pubDate>Mon, 10 Jun 2013 12:34:55 +0000</pubDate>
		<dc:creator>Briggs</dc:creator>
				<category><![CDATA[Culture]]></category>
		<category><![CDATA[Politics]]></category>
		<category><![CDATA[Statistics]]></category>

		<guid isPermaLink="false">http://wmbriggs.com/blog/?p=8278</guid>
		<description><![CDATA[&#8220;Let them look at my phone records. I have nothing to hide. They can read all my emails. There&#8217;s nothing there worth reading.&#8221; We hear this primarily from the young, who are said to be used to living openly on-line, &#8230; <a class="more-link" href="http://wmbriggs.com/blog/?p=8278">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p><div class="wp-caption alignright" style="width: 250px"><img src="http://upload.wikimedia.org/wikipedia/commons/thumb/0/04/National_Security_Agency.svg/240px-National_Security_Agency.svg.png" width="240" height="240" class /><p class="wp-caption-text">We&#8217;re spying on you for your own good.</p></div>&#8220;Let them look at my phone records. I have nothing to hide.  They can read all my emails.  There&#8217;s nothing there worth reading.&#8221;</p>
<p>We hear this primarily from the young, who are said to be used to living openly on-line, and from the old who share this in common with the young: ignorance of history and of human nature.   Today, and for the benefit of these mis-, or rather uninformed, people a brief lesson.</p>
<p>No man is innocent in the eyes of his enemy.  A zealous prosecutor can take the most harmless of circumstance and through innuendo, contrivance, and brazen lie turn it into at least dark suspicion if not &#8220;proof&#8221; of heinous crime.</p>
<p>You bought a book on the Middle East because of a noble interest in history?  So do terrorists read those books.   You made a phone call from nearby a mosque?  So do terrorists make these calls.  You made a crude and in poor taste joke after an incident?  So do terrorists make these quips. </p>
<p>You decided to join a local Tea Party-like organization because of your sincere belief in limited government?  Or have you renewed your membership in the NRA? So do &#8220;domestic&#8221;, &#8220;home-grown&#8221;, &#8220;self-radicalized&#8221; terrorists join these groups.</p>
<p>Have you not heard the term railroaded?  How about framed? How about falsely accused, hounded, harassed?  Is it is merely paranoia and lapsing into extremism to suggest that the government, sated on your secrets, could act in these ways as it has and far too often?  </p>
<p>Could the IRS target groups which it perceives as its enemies?   Could sealed divorce records be publicly aired?  Could a zealous prosecutor who cares only of her public image and is a stranger to truth convict the innocent?   Could a government libel and slander a man in order to bamboozle a judge into issuing a warrant against this man?</p>
<p>The powers of my imagination pale, but a story of your culpability can always be weaved by a determined enemy. Anything can be turned against you, and the more information government has on you, the easier it becomes to manufacture &#8220;evidence&#8221; of your misdeed.   </p>
<p>Information is power: it is the lifeblood of politics.  Giving bureaucrats and politicians this much power is to tempt them beyond human ability to resist.  (Giving power to computer and statistical algorithms used to data mine records is no solution.  <a href="http://wmbriggs.com/blog/?p=8239">These cannot be perfect</a>, and it is people who run them.)  </p>
<p>It&#8217;s probably far too late to remind anybody of these words, taken from the document which at one time dictated the law under which even politicians had to live:</p>
<blockquote><p>
The right of the people to be secure in their persons, houses, papers, and effects, against unreasonable searches and seizures, shall not be violated, and no Warrants shall issue, but upon probable cause, supported by Oath or affirmation, and particularly describing the place to be searched, and the persons or things to be seized.
</p></blockquote>
<p>It has now become a &#8220;reasonable&#8221; search to track your every movement, your every call, perhaps even your every email and on-line transaction.  It is now &#8220;probable cause&#8221; that you are guilty of a crime just because you exist.</p>
<p>&#8220;But it&#8217;s only meta-data.&#8221;  Meta-data forsooth!  Have you any idea what this is?  It tells the time of your call and where you were when you made it.  It tells who you made it to, and tells of the people you contacted who they in turn contacted.  Do you text?  Then they have that information too.  </p>
<p>It&#8217;s rumored the government even has your emails. Perhaps not the text of these, but the meta-data.  Again, this tells much. It tells where you were and on what you wrote them. It tells the time and length. It tells who it went to, and it tells this of everybody.  </p>
<p>Even without your exact words spoken or written, this is a dense and nearly complete picture of your behavior.  If a bureaucrat cannot find something in this trove that at least casts you in a bad light, then he isn&#8217;t trying.</p>
<p>&#8220;But what about the children!  We demand safety!&#8221;  I have yet to hear any politician respond to these words of a man of a far superior mind:</p>
<blockquote class="twitter-tweet" width="550"><p>Those who would give up Essential Liberty to purchase a little Temporary Safety, deserve neither Liberty nor Safety. Ben Franklin. <a href="https://twitter.com/search?q=%23NSA&amp;src=hash">#NSA</a></p>
<p>&mdash; William M. Briggs (@mattstat) <a href="https://twitter.com/mattstat/statuses/342764646220050432">June 6, 2013</a></p></blockquote>
<p><script async src="//platform.twitter.com/widgets.js" charset="utf-8"></script></p>
<p>If the government can&#8217;t catch terrorists without spying on its own citizens, then tough luck. Let if find some other way.  The price we have to pay for this program of <a href="http://wmbriggs.com/blog/?p=8239">extremely limited success</a> is just too high.</p>
<blockquote class="twitter-tweet" width="550"><p>If NSA algorithms so good at detecting spies, how did Edward Snowden go undetected?</p>
<p>&mdash; William M. Briggs (@mattstat) <a href="https://twitter.com/mattstat/statuses/343890674057371648">June 10, 2013</a></p></blockquote>
<p><script async src="//platform.twitter.com/widgets.js" charset="utf-8"></script></p>
<p><strong>Update</strong> Until my server&#8217;s DNS problems (not &#8220;issues&#8221;) are resolved, you might not be able to see the tweets linked.  They are, in order:</p>
<p>&#8220;Those who would give up Essential Liberty to purchase a little Temporary Safety, deserve neither Liberty nor Safety. Ben Franklin.&#8221;</p>
<p>&#8220;If NSA algorithms so good at detecting spies, how did Edward Snowden go undetected?&#8221;</p>
<p><strong>Update</strong> See this from brother statistician John Cook: <a href="http://www.johndcook.com/blog/2013/06/10/a-statistical-problem-with-nothing-to-hide/">A statistical problem with &#8220;nothing to hide.&#8221;</a></p>
]]></content:encoded>
			<wfw:commentRss>http://wmbriggs.com/blog/?feed=rss2&#038;p=8278</wfw:commentRss>
		<slash:comments>51</slash:comments>
		</item>
	</channel>
</rss>
