- Discussion - Statistics question - averaging measurements of multiple objects.

Discussion > Statistics question - averaging measurements of multiple objects.

You are tasked with finding the average height of 100 houses. With a ruler at arms length and a tape measure you are able to measure the heights +/- 10cm.

When you've found the average height, what is the error band?

Nial.

Aug 26, 2014 at 9:01 AM |

Nial

When I did this at school, I always found it instructive to find the maximum and minimum of the measurements and work with both those figures: e.g.

Max: if all 100 houses are at the max of the error, then your total will be 1000cm larger than actual.
Min: if all 100 houses are at the -max of the error, then your total will be 1000cm smaller than actual.

Thus the Error bands on the total is -1000cm to +1000cm

Now you divide by 100 to get the average, the 100 has no error bars, so it's a simple arithmetic division of the error bars, so the error bars become -10cm to +10cm again.

Which is intuitive, your average can't be any more precise than the worst of your measurements.

Is this wrong?

Aug 26, 2014 at 9:45 AM |

TheBigYinJames

"tasked"

Yes, these days, any noun can be verbed.

"Which is intuitive, your average can't be any more precise than the worst of your measurements.
Is this wrong?"

Yes, I think it's wrong if you are talking about, for example, the rms error, rather than the maximum possible error. (And assuming things like the errors being independent).

Aug 26, 2014 at 10:24 AM |

Martin A

Also, I didn't read the instructions properly, I think he's measuring the houses by holding a ruler at arm's length, measuring the apparent height on the ruler, and then using trig to work out the actual heights.

Aug 26, 2014 at 10:30 AM |

TheBigYinJames

Thanks for the replies guys (apart from Martin's pedantic grammar point), the method of measurement isn't important, just that you know the accuracy of each measurement.

Elsewhere someone queried the accuracy quoted in global temperature measurements (0.05 Deg C).

An ardent warmist posted this link....

http://web.archive.org/web/20080402030712/tamino.wordpress.com/2007/07/05/the-power-of-large-numbers/

...which discussed improvements in resolution of a single measurement by performing it multiple times.

This however isn't the same as the global temperature that combines multiple measurements of _different_
things.

The house measurement query was the simplest way I could think of posing the question.

Intuitively I thought there's probably _some_ improvement in accuracy, but after TBYJ's post above I'm not sure.

Nial

Aug 26, 2014 at 11:09 AM |

Nial

Nial,

One of the bugbears of the whole AGW thing for me is that the rise in temps (0.7 degrees) is still within the error bars on the individual readings. Which is fine if you assume the errors in the measurements are symmetrical about the real values. But we have found that they have a bias towards high.

Aug 26, 2014 at 11:12 AM |

TheBigYinJames

"One of the bugbears of the whole AGW thing for me is that the rise in temps (0.7 degrees) is still within the error bars on the individual readings. "

Aye, but this article is saying that taking a large number of individual readings, over time, reduces the error.

I need to show it doesn't.

Aug 26, 2014 at 11:27 AM |

Nial

Well you might be in trouble Nial.

Whilst the individual readings errors might be large, if the average and the spread moves over time, then this can be measured with an accuracy far more than the individual readings.

E.g. if in year 2000 a measure is 100 plus or minus 50, and in 2010 it's 101 plus or minus 50, then you can say that it's gone up by 1% even if the error is 50%.

Aug 26, 2014 at 12:01 PM |

TheBigYinJames

Nial - sorry that my 'grammar point' seemed pedantic. I just thought it interesting that your usage is now standard English but it would not have been in, say, my Dad's day. Nowadays, you see stuff (such as the use of the word 'stuff' to mean 'things') that you would not have seen just a few decades ago. The English language changes as we watch it.

Perhaps more relevant to your thread: The “ensemble” of models is completely meaningless, statistically

I too think you may be in trouble. If you can take a large number of measurements, where the error in each measurement is statistically independent of the error in the other measurements, then computing the average of all your measurements will generally give a more accurate estimate of the thing you are trying to measure than any one single measurement.

Aug 26, 2014 at 12:35 PM |

Martin A

Thanks again guys, Martin don't worry I was joking about your pedantry, I should have included a :-) .

"I too think you may be in trouble. If you can take a large number of measurements, where the error in each measurement is statistically independent of the error in the other measurements, then computing the average of all your measurements will generally give a more accurate estimate of the thing you are trying to measure than any one single measurement."

Hmm.

It turns out the builders didn't dig the foundations properly and the houses are _slowly_ subsiding.
If you continually repeated your experiment over the next few years I suppose you would detect the average height reducing, even though each result still had significant error bars.

Obvious when you think about it. I will concede the point elsewhere!

Thanks again,

Nial.

Aug 26, 2014 at 1:16 PM |

Nial

It depends if the error is systematic (your measuring device is such that you get the height too large by ten cm for every house), in which case the overall error on the average is the same as the individual error, or random for each house, some too high, some too low, others just right in which the error on the average decreases as the inverse of the square root of the number of measured values. Here it would be one tenth, just 1cm. For temperature measurements there may be a bit of both, with the added twist that the systematic error can change in time sure to increasing UHI. But the error you'd think of from making a couple of measurements in your house or garden is mostly of the random type.

Aug 26, 2014 at 1:20 PM |

Jeremy Harvey

"others just right in which the error on the average decreases as the inverse of the square root of the number of measured values. Here it would be one tenth, just 1cm."

I'm not sure that's correct Jeremy.

I understand that if you took 100 measurements of the same thing the error would reduce by this amount, but if you are meauring 100 different things I'm not sure you can give the average height +/- 1cm.

What I do accept is that if you continually do the same measurement of the 100 houses you'll be able to spot the long term trends of subsidence.

Possibly.

Aug 26, 2014 at 2:05 PM |

Nial

Nial - I don't think it makes a difference. The error is added to each measurement so the estimate for the average height that you finish up with is

Estimate for average height

= (1/100) × [SUM (all the actual heights) + SUM (all the measurement errors)]

= true average of the 100 houses + final error

So whether the actual heights are all the same or are all over the place, it makes no difference to the final error.

*However* a different problem to the one you stated would be where you had (say) 10 million houses and you were trying to estimate the average height of the 10 million by measuring the height of 100 of them (with a measurement error on each measurement). But I don't think you were talking about this problem.

The average of 100 things is a precise and definite number.

But the average of 100 samples of 10 million things gives you an *estimate* (with its own statistical error) of the true average of the 10 million things. The average of the 100 samples is just an estimate of the average of the 10 million and it has a statistical error due to the 100 being not a true representation of the 10M.

And, as well as the statistical error, you also have your measurement error. Even if the measurement error were zero (each measurement being made precisely) the average of the 100 samples would only be a statistcal approximation to the true average of the 10 million things.

Does that help? Or does it miss the point?

Aug 26, 2014 at 2:27 PM |

Martin A

Martin,

If we are looking at error bars for the result......

> Estimate for average height
> = (1/100) × [SUM (all the actual heights) + SUM (all the measurement errors)]

is that the sum of the actual measurement errors or the potential errors?

> = true average of the 100 houses + final error

If you're summing potential errors you end up with the individual measurement error (or the average of the potential measurement errors).

Aug 26, 2014 at 2:56 PM |

Nial

Nial - if you sum all your actual measurements, you are summing the actual heights and the actual measurement errors.

So what I called 'the final error' will have a specific value but unknown to you. So it has to be regarded as a random variable.

If you make some assumptions on the individual measurement errors (for example, that they are independent of each other and what their distribution is (uniform in the range ± 10cm? Normally distributed with standard deviation 10cm?) then you can calculate things characterising the 'final error' - for example, its standard deviation. Or the range within which there is probability 0.95 that it lies in that range.

Does that answer your question?

Could you say exactly what you mean by 'error bars'?

Aug 26, 2014 at 3:37 PM |

Martin A

Thanks Martin.

> Could you say exactly what you mean by 'error bars'?

I'm not sure any more!

I hardly slept last night and feel my brain is like mush today!

Aug 26, 2014 at 3:54 PM |

Nial

I know the feeling. Take a rest and come back when you wish.

From a very quick look "error bars" seem to be a term often used in graphical displays to show the range within which the true value of a thing probably lies. I don't think the term is commonly used in probability theory or statistics and there does not seem to be a standard mathematical definition.

The definition of "probably" can evidently vary. It might be a confidence interval - eg the true value lies within the given range with 95% (for example) probability. Or it might be the standard deviation of the measurement (which requires less information to compute than confidence intervals do).

Aug 26, 2014 at 5:16 PM |

Martin A

Perhaps you are thinking of Standard Error, the measure of how well the sample mean approximates the population mean. For a relatively simple reference try here .

Aug 28, 2014 at 6:20 PM |

Entropic man

Thanks for trying to help EM.

Aug 28, 2014 at 7:01 PM |

Martin A

I sympathise with nial. :-) There is a continuum of statistical understanding.

To a layman whose maths ended at GCSE statistics is a dark art used by politicians to support their dogma; lies, dammed lies and statistics!

To working stiffs like thee and me it was a toolbox of techniques used to judge the quality and reliability of the data we used.

Professional statisticians work at a higher level altogether!

Aug 28, 2014 at 7:28 PM |

Entropic man