
<?xml version="1.0" encoding="UTF-8"?><!-- generator="wordpress/2.3.3" -->
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	>
<channel>
	<title>Comments on: You cannot measure a mean</title>
	<link>http://wmbriggs.com/blog/2008/03/09/you-cannot-measure-a-mean/</link>
	<description>"All manner of statistical analyses cheerfully undertaken."</description>
	<pubDate>Wed, 20 Aug 2008 15:54:24 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.3.3</generator>
		<item>
		<title>By: Briggs</title>
		<link>http://wmbriggs.com/blog/2008/03/09/you-cannot-measure-a-mean/#comment-1177</link>
		<dc:creator>Briggs</dc:creator>
		<pubDate>Mon, 10 Mar 2008 14:18:14 +0000</pubDate>
		<guid>http://wmbriggs.com/blog/2008/03/09/you-cannot-measure-a-mean/#comment-1177</guid>
		<description>Khebab, it might help to just replace the "t" with an "x" in the models and ignore the "time" aspect.  These are just made up data anyway.

&#928;, thank you for bringing up Gelman.  I take his name in vain in a paper about logic in probability, which you can find on my Resume tab  : look for "Broccoli".

However, this does not answer your point, which I promise to do at a later time in a post about "randomness."

The distinction between climate and weather is true, but it is separate entirely from this posting.  Whether "trend" is useful is relevant, but only insofar as it helps us make predictions.  There, the main point still stands: usual statistical methods lead to overconfidence.</description>
		<content:encoded><![CDATA[<p>Khebab, it might help to just replace the &#8220;t&#8221; with an &#8220;x&#8221; in the models and ignore the &#8220;time&#8221; aspect.  These are just made up data anyway.</p>
<p>&Pi;, thank you for bringing up Gelman.  I take his name in vain in a paper about logic in probability, which you can find on my Resume tab  : look for &#8220;Broccoli&#8221;.</p>
<p>However, this does not answer your point, which I promise to do at a later time in a post about &#8220;randomness.&#8221;</p>
<p>The distinction between climate and weather is true, but it is separate entirely from this posting.  Whether &#8220;trend&#8221; is useful is relevant, but only insofar as it helps us make predictions.  There, the main point still stands: usual statistical methods lead to overconfidence.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Khebab</title>
		<link>http://wmbriggs.com/blog/2008/03/09/you-cannot-measure-a-mean/#comment-1176</link>
		<dc:creator>Khebab</dc:creator>
		<pubDate>Mon, 10 Mar 2008 13:06:38 +0000</pubDate>
		<guid>http://wmbriggs.com/blog/2008/03/09/you-cannot-measure-a-mean/#comment-1176</guid>
		<description>Re #3.

I'm a little bit lost here. I thought that a stationary process (wide-sense stationarity) requires that 1st and 2nd moments do not vary with respect to time:

E[y(t]]= m(t)= m(t+Dt)

which is not the case here.</description>
		<content:encoded><![CDATA[<p>Re #3.</p>
<p>I&#8217;m a little bit lost here. I thought that a stationary process (wide-sense stationarity) requires that 1st and 2nd moments do not vary with respect to time:</p>
<p>E[y(t]]= m(t)= m(t+Dt)</p>
<p>which is not the case here.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: PI</title>
		<link>http://wmbriggs.com/blog/2008/03/09/you-cannot-measure-a-mean/#comment-1175</link>
		<dc:creator>PI</dc:creator>
		<pubDate>Mon, 10 Mar 2008 11:49:05 +0000</pubDate>
		<guid>http://wmbriggs.com/blog/2008/03/09/you-cannot-measure-a-mean/#comment-1175</guid>
		<description>I don't know what "mysticism" you're talking about.  There is nothing wrong with taking random draws from the posterior to see examples of what the predictions look like.  (See, for instance, pretty much anything ever written by Andrew Gelman.)  In general, not all of this information will be captured either by full posteriors of the parameters, or by confidence intervals on the predictions.  What you want is posteriors of the predictions, but when the predictions are time series this is difficult.  Hence, pick N samples and plot them on top of each other.  As you say, when the posterior is of simple form this is not necessary, but in more complicated cases it is useful.

And while the trend doesn't tell you the future values of observables, the posterior predictive distribution also obscures the trend.  We always experience the combination of weather and climate, but it is of physical interest to disentangle weather from climate.  Both kinds of inferences are useful.</description>
		<content:encoded><![CDATA[<p>I don&#8217;t know what &#8220;mysticism&#8221; you&#8217;re talking about.  There is nothing wrong with taking random draws from the posterior to see examples of what the predictions look like.  (See, for instance, pretty much anything ever written by Andrew Gelman.)  In general, not all of this information will be captured either by full posteriors of the parameters, or by confidence intervals on the predictions.  What you want is posteriors of the predictions, but when the predictions are time series this is difficult.  Hence, pick N samples and plot them on top of each other.  As you say, when the posterior is of simple form this is not necessary, but in more complicated cases it is useful.</p>
<p>And while the trend doesn&#8217;t tell you the future values of observables, the posterior predictive distribution also obscures the trend.  We always experience the combination of weather and climate, but it is of physical interest to disentangle weather from climate.  Both kinds of inferences are useful.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Briggs</title>
		<link>http://wmbriggs.com/blog/2008/03/09/you-cannot-measure-a-mean/#comment-1173</link>
		<dc:creator>Briggs</dc:creator>
		<pubDate>Mon, 10 Mar 2008 10:46:24 +0000</pubDate>
		<guid>http://wmbriggs.com/blog/2008/03/09/you-cannot-measure-a-mean/#comment-1173</guid>
		<description>Wolfgang,  I am working on a paper on this subject with my friend Russ Zaretzki (who, everybody should know, has no idea what the word "climate" even means, which I point out so nobody will think he is a crazy as I am; he is an excellent statistician, however).   Data and code will be coming when finished.  However, it's no secret.  I used normals and flat priors for all, which lead to t posteriors.  See Bernardo and Smith, the Appendix to see the exact equations for regression.

Mike D, Amen.  Measurement error just adds to the uncertainty.  I have pointed this out elsewhere, but have given no examples yet.

Bob, right on.  It's not that a model cannot describe reality, it's just that they often do a poor job.

&#928; 1. Yes, the posterior predictive, not the posterior for &#956;  2. Not quite.  See today's post on "meaning on mean means".  Red lines are just posterior predictive.  The blue lines are the parameter posterior (on &#956;).

And I agree with you that it's always better to show the full posterior distribution when possible.  This is because showing an interval can give misleading results, as you say.  However, these posteriors (for observables or &#956;) are all t, and are symmetric, so the intervals are reasonable summaries.

The term "random draws", oh my.  I'll be talking about the mysticism of "random" in future posts.

And no matter if I know the exact value of the parameters, I still do not know the future value of the observables.

Briggs</description>
		<content:encoded><![CDATA[<p>Wolfgang,  I am working on a paper on this subject with my friend Russ Zaretzki (who, everybody should know, has no idea what the word &#8220;climate&#8221; even means, which I point out so nobody will think he is a crazy as I am; he is an excellent statistician, however).   Data and code will be coming when finished.  However, it&#8217;s no secret.  I used normals and flat priors for all, which lead to t posteriors.  See Bernardo and Smith, the Appendix to see the exact equations for regression.</p>
<p>Mike D, Amen.  Measurement error just adds to the uncertainty.  I have pointed this out elsewhere, but have given no examples yet.</p>
<p>Bob, right on.  It&#8217;s not that a model cannot describe reality, it&#8217;s just that they often do a poor job.</p>
<p>&Pi; 1. Yes, the posterior predictive, not the posterior for &mu;  2. Not quite.  See today&#8217;s post on &#8220;meaning on mean means&#8221;.  Red lines are just posterior predictive.  The blue lines are the parameter posterior (on &mu;).</p>
<p>And I agree with you that it&#8217;s always better to show the full posterior distribution when possible.  This is because showing an interval can give misleading results, as you say.  However, these posteriors (for observables or &mu;) are all t, and are symmetric, so the intervals are reasonable summaries.</p>
<p>The term &#8220;random draws&#8221;, oh my.  I&#8217;ll be talking about the mysticism of &#8220;random&#8221; in future posts.</p>
<p>And no matter if I know the exact value of the parameters, I still do not know the future value of the observables.</p>
<p>Briggs</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: PI</title>
		<link>http://wmbriggs.com/blog/2008/03/09/you-cannot-measure-a-mean/#comment-1169</link>
		<dc:creator>PI</dc:creator>
		<pubDate>Mon, 10 Mar 2008 08:39:13 +0000</pubDate>
		<guid>http://wmbriggs.com/blog/2008/03/09/you-cannot-measure-a-mean/#comment-1169</guid>
		<description>Can you elaborate on what you're calculating?  I'll restrict myself to the Bayesian method.

"It must be stressed that the 95% interval is for the mean, which is itself an unobservable parameter.  What we really want to know about is that data values themselves."

1. This (and most of the rest of your post) implies that you're talking about the posterior predictive distribution for the data, instead of the posterior for the model trend.

"What we have to do is to completely account for our uncertainty in the values of a and b, but also in the parameters that make up OS."

2. On the other hand, this implies that you're treating the distributional parameters (e.g., residual variance or autocorrelation) as uncertain with some prior, and marginalizing over them in the posterior.

Which of these represents the red lines?  1, 2, or both?

As for 1, I believe it's useful to give both posterior trend and posterior predictive intervals.  The underlying trend itself is interesting and informative, but if you want to predict what will be measured in a specific year, you need the posterior predictive distribution.

(Actually, instead of intervals, I prefer a collection of random draws from the posterior or posterior predictive distributions.  Means and confidence intervals can hide structure.)

2 should always be done when computationally feasible, unless there is good reason to believe the priors on distributional parameters are very narrow.</description>
		<content:encoded><![CDATA[<p>Can you elaborate on what you&#8217;re calculating?  I&#8217;ll restrict myself to the Bayesian method.</p>
<p>&#8220;It must be stressed that the 95% interval is for the mean, which is itself an unobservable parameter.  What we really want to know about is that data values themselves.&#8221;</p>
<p>1. This (and most of the rest of your post) implies that you&#8217;re talking about the posterior predictive distribution for the data, instead of the posterior for the model trend.</p>
<p>&#8220;What we have to do is to completely account for our uncertainty in the values of a and b, but also in the parameters that make up OS.&#8221;</p>
<p>2. On the other hand, this implies that you&#8217;re treating the distributional parameters (e.g., residual variance or autocorrelation) as uncertain with some prior, and marginalizing over them in the posterior.</p>
<p>Which of these represents the red lines?  1, 2, or both?</p>
<p>As for 1, I believe it&#8217;s useful to give both posterior trend and posterior predictive intervals.  The underlying trend itself is interesting and informative, but if you want to predict what will be measured in a specific year, you need the posterior predictive distribution.</p>
<p>(Actually, instead of intervals, I prefer a collection of random draws from the posterior or posterior predictive distributions.  Means and confidence intervals can hide structure.)</p>
<p>2 should always be done when computationally feasible, unless there is good reason to believe the priors on distributional parameters are very narrow.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Erik</title>
		<link>http://wmbriggs.com/blog/2008/03/09/you-cannot-measure-a-mean/#comment-1168</link>
		<dc:creator>Erik</dc:creator>
		<pubDate>Mon, 10 Mar 2008 06:41:05 +0000</pubDate>
		<guid>http://wmbriggs.com/blog/2008/03/09/you-cannot-measure-a-mean/#comment-1168</guid>
		<description>This point is so obvious, but it somehow eludes lots of smart people.  

A model of a mean is meaningless (?).

Even if you could be certain that the global mean temperature would rise by x, you would have no idea what that would me for people or polar bears or crops.

It's easy to construct a scenario in which the global mean temp goes up by six degrees yet we all freeze to death.</description>
		<content:encoded><![CDATA[<p>This point is so obvious, but it somehow eludes lots of smart people.  </p>
<p>A model of a mean is meaningless (?).</p>
<p>Even if you could be certain that the global mean temperature would rise by x, you would have no idea what that would me for people or polar bears or crops.</p>
<p>It&#8217;s easy to construct a scenario in which the global mean temp goes up by six degrees yet we all freeze to death.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Bob North</title>
		<link>http://wmbriggs.com/blog/2008/03/09/you-cannot-measure-a-mean/#comment-1164</link>
		<dc:creator>Bob North</dc:creator>
		<pubDate>Mon, 10 Mar 2008 04:06:33 +0000</pubDate>
		<guid>http://wmbriggs.com/blog/2008/03/09/you-cannot-measure-a-mean/#comment-1164</guid>
		<description>Thank you for a well-written and easy to understand on the fact that the mean is simply a measure of central tendency and this does not require that the mean (or the results of a predictive formala) actually match any of the observed data.  For me at least, it is always easier to understand a math concept when it is written in normal, expository english rather than what I'll call "mathese" where everything is written as formulas and "proofs".

I think one thing that some lose sight of when they get too caught up in the statistics and formulae is that math doesn't explain the physical world around us, we are only trying to use math to explain the world around us as best we can.  Sometimes it works, sometimes it doesn't,  but ultimately all mathematical descriptions of the physical world are subject to refutation or refinement.  In other words, if there is an observable phenomenom that is proven to be true, but which is in conflict with one of our formulae with which we attempt to describe the physical world, it is the formulae that needs to modified, not the phenomenom.

Regards,
Bob North</description>
		<content:encoded><![CDATA[<p>Thank you for a well-written and easy to understand on the fact that the mean is simply a measure of central tendency and this does not require that the mean (or the results of a predictive formala) actually match any of the observed data.  For me at least, it is always easier to understand a math concept when it is written in normal, expository english rather than what I&#8217;ll call &#8220;mathese&#8221; where everything is written as formulas and &#8220;proofs&#8221;.</p>
<p>I think one thing that some lose sight of when they get too caught up in the statistics and formulae is that math doesn&#8217;t explain the physical world around us, we are only trying to use math to explain the world around us as best we can.  Sometimes it works, sometimes it doesn&#8217;t,  but ultimately all mathematical descriptions of the physical world are subject to refutation or refinement.  In other words, if there is an observable phenomenom that is proven to be true, but which is in conflict with one of our formulae with which we attempt to describe the physical world, it is the formulae that needs to modified, not the phenomenom.</p>
<p>Regards,<br />
Bob North</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mike D.</title>
		<link>http://wmbriggs.com/blog/2008/03/09/you-cannot-measure-a-mean/#comment-1158</link>
		<dc:creator>Mike D.</dc:creator>
		<pubDate>Mon, 10 Mar 2008 00:23:36 +0000</pubDate>
		<guid>http://wmbriggs.com/blog/2008/03/09/you-cannot-measure-a-mean/#comment-1158</guid>
		<description>Whoops. I meant to say data are, not data is.</description>
		<content:encoded><![CDATA[<p>Whoops. I meant to say data are, not data is.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mike D.</title>
		<link>http://wmbriggs.com/blog/2008/03/09/you-cannot-measure-a-mean/#comment-1157</link>
		<dc:creator>Mike D.</dc:creator>
		<pubDate>Mon, 10 Mar 2008 00:21:02 +0000</pubDate>
		<guid>http://wmbriggs.com/blog/2008/03/09/you-cannot-measure-a-mean/#comment-1157</guid>
		<description>Love your blog!!!!!!

Just to add more confusion, your model is fit to "observed" data, but in many cases (global temps being a good example) the "observed" data is squishy, or as statisticians say, loaded with measurement error.

Some phenomena can be measured with great precision but some cannot. When the "yardstick" is rubbery, the data are more like guesses than precise readings that can be replicated by unbiased observers.

Measurement error throws an entirely different uncertainty into models, called statistical bias, that is difficult to account for. When measurement error is folded in, somehow or other, it invariably boosts the model variance (a stat pun) and widens the confidence intervals.

PS Satellite data has measurement error. Don't kid yourself about that. And terrestrial weather station data is thoroughly messed up. And anytime an analysis is based on "proxies," you can sure the "observed" data is as squishy as mud.</description>
		<content:encoded><![CDATA[<p>Love your blog!!!!!!</p>
<p>Just to add more confusion, your model is fit to &#8220;observed&#8221; data, but in many cases (global temps being a good example) the &#8220;observed&#8221; data is squishy, or as statisticians say, loaded with measurement error.</p>
<p>Some phenomena can be measured with great precision but some cannot. When the &#8220;yardstick&#8221; is rubbery, the data are more like guesses than precise readings that can be replicated by unbiased observers.</p>
<p>Measurement error throws an entirely different uncertainty into models, called statistical bias, that is difficult to account for. When measurement error is folded in, somehow or other, it invariably boosts the model variance (a stat pun) and widens the confidence intervals.</p>
<p>PS Satellite data has measurement error. Don&#8217;t kid yourself about that. And terrestrial weather station data is thoroughly messed up. And anytime an analysis is based on &#8220;proxies,&#8221; you can sure the &#8220;observed&#8221; data is as squishy as mud.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Wolfgang Flamme</title>
		<link>http://wmbriggs.com/blog/2008/03/09/you-cannot-measure-a-mean/#comment-1155</link>
		<dc:creator>Wolfgang Flamme</dc:creator>
		<pubDate>Mon, 10 Mar 2008 00:00:20 +0000</pubDate>
		<guid>http://wmbriggs.com/blog/2008/03/09/you-cannot-measure-a-mean/#comment-1155</guid>
		<description>William,
would you mind posting the data / code for the graph above? Just want to be sure my interpretation of your presentation is correct.</description>
		<content:encoded><![CDATA[<p>William,<br />
would you mind posting the data / code for the graph above? Just want to be sure my interpretation of your presentation is correct.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

<!-- Dynamic Page Served (once) in 4.660 seconds -->
