Re: clarify or clogify? re: Why the human race is growing apart



On Fri, 10 Apr 2009 11:44:13 -0700, tgdenning wrote:

On Apr 10, 1:52 pm, Garamond Lethe <cartographi...@xxxxxxxxx> wrote:
On Fri, 10 Apr 2009 10:32:13 -0700, tgdenning wrote:

<snip>

Sigh. We're talking about IQ tests. Are you saying that the size of
the population that has taken IQ tests is 'small', and that half of
that number is also 'small', in a statistical sense?

It's not the number of people who have taken IQ tests, it's the number
of people per group included in this particular study.  I'm smart
enough to know that questions of "is this sample big enough to make
these particular results significant" is best left to people who do
statistics for a living, and barring that, should be done only by
people who are very familiar with the limitations of their data.

The question I was addressing, though, was much simpler:  given G
groups of N numbers each pulled from some distribution, is it likely
that the average for one group will be higher than the rest?  The
answer is "probably yes" for small values of N and "certainly not" for
infinite values of N.  Whether or not this largest value is significant
is a much harder question that I will sensibly leave to others more
skilled in the art.


I'm not completely clear what you mean by "largest value is
significant". But if the sample size is large enough---and that doesn't
have to be very large at all, relative to the population size (look at
political polls for example)---then the mean of the sample will be the
same as the mean of the population as a whole.

So let's say there is a population of 5000 taking a test. The mean and
median score is 65%. If we select at random 5 groups of 1000 each, I
would be willing to wager a few bucks that the mean and median score of
each group would be 65%.

Here's what I get. R uses ">" as the input prompt; apologies if this
screws up the quoting.

# Start with a normal distribution of 5000 individuals where the
# mean should be .65.
population<-rnorm(5000, mean=.65)

# Median and mean of the population: mean is tolerably close,
# median isn't.
median(population)
[1] 0.680727
mean(population)
[1] 0.6596989

# Median and mean of each 1000-individual subgroup.
median(population[1:1000])
[1] 0.6391295
mean(population[1:1000])
[1] 0.6206391
median(population[1001:2000])
[1] 0.7184306
mean(population[1001:2000])
[1] 0.6971344
median(population[2001:3000])
[1] 0.7047903
mean(population[2001:3000])
[1] 0.6709924
median(population[3001:4000])
[1] 0.7241578
mean(population[3001:4000])
[1] 0.6858281
median(population[4001:5000])
[1] 0.6308663
mean(population[4001:5000])
[1] 0.6239006

R is free software and can be grabbed here:

http://www.r-project.org/

Try playing with it and see what you think. For example, cranking up the
population figures gives you something much closer to what you were
expecting:

population2<-rnorm(5000000, mean=.65)
median(population2)
[1] 0.6500446
mean(population2)
[1] 0.6498886

I'll leave the proper explanation of this to an Actual Mathematician.





So I am disinclined to accept your comparison of 'small N' and
'approaching infinity'---you may be bordering on (unconscious) circular
reasoning, I think, where you define "small N" as "that which is small
enough not to be representative". Perhaps one of those real
statisticians will advise us on the real ranges we are talking about (or
take my money).

-tg




BTW, I give you credit for at least having a clue as to what I am
talking about, which the others obviously don't.

Thanks.

<snip>

.



Relevant Pages

  • Re: clarify or clogify? re: Why the human race is growing apart
    ... particular results significant" is best left to people who do statistics ... would be willing to wager a few bucks that the mean and median score ... statisticians will advise us on the real ranges we are talking about ...
    (talk.origins)
  • Re: library (random numberS)
    ... Which average do you want (common ones are mean, mode, and median)? ... Try looking them up on Wikipedia: ... http://en.wikipedia.org/wiki/Mode_ (statistics) ...
    (perl.beginners)
  • Re: clarify or clogify? re: Why the human race is growing apart
    ... statistics for a living, and barring that, should be done only by ... median score is 65%. ... (but not necessarily zero or very low probability) ... not a realistic scenario, but it seems to be the basis of ...
    (talk.origins)
  • Re: Most valuable poster
    ... Ooh, the Poisson distribution! ... If you actually *knew* anything about statistics, ... probabilities I want, just like you do. ...
    (talk.origins)
  • Re: Definition of insanity
    ... Plus I also provided multiple citations in which a sexual component ... a median and an average are not identical nor synonymous... ... colloquial acceptance of the word Average in Statistics classes in ... area of Statistics being rejected as 'improper'. ...
    (comp.sys.mac.advocacy)

Loading