Re: Is this sample representative of the population?



On Mon, 12 Dec 2005 18:00:27 +0100, "Z" <zingerNOSPAM@xxxxxxx> wrote:

>
> Hello,
>
> I've received the following problem (no, it's not homework), and I'm trying
> to figure out what would be the best way to answer it.
>
> A client (in marketing, you can probably guess) sent us this table,
> containing counts:
>
> Profession Population Sample
>
> Executive 10041 1164
> Commercial 1734 162
> Employee/labourer 3591 309
> Farmer 414 29
> CEO or upper executive 5330 797
> Self-employed 3621 410
> Retired 5699 927
> Student 5051 294
> blank 21 21
> Other 8536 403
> Total 44038 4495
>
> "Population" implies the true population from which the sample ("Sample")
> was taken.
>
> The question asked by the client is : "Is this sample representative of the
> population?"
>
> I can't figure out how to answer this. I've tried as follows:

I think that you re-write this as a contingency table:
Look up chi squared contingency table.

The table wants to compare the two parts of a total,
rather than comparing one part to the total.

Thus, the numbers below should be written as
"In sample", "not in sample", "Total" and if you find
a calculator online, it will want the first two (not Total).

Executive 10041 1164
Commercial 1734 162
Employee/labourer 3591 309
Farmer 414 29
...

The contingency table tests whether all the rows share
the same proportions, that is, about 10% and about 90%.

I'm sure that you want to omit the "blank" numbers, 0+21 = 21.

I'd probably give the client the table in two ways, one of
them leaving out the large "other" category, which has a
low proportion sampled.

The standardized cell deviations can be used as a measure
for whether each cell is contributing to the total "test".
The raw proportions might also be interesting.

>
> I wondered if the idea was that the proportions should be compared, so I did
> a line-by-line parametric comparison of two percentages. I got the p-values
> (for each category/row respectively) as:
[snip, rest]

I spent a couple of minutes studying your examples,
and I could not come up with a reasonable guess
about what you were testing, in either one. So I won't
say more about them.

--
Rich Ulrich, wpilib@xxxxxxxx
http://www.pitt.edu/~wpilib/index.html
.



Relevant Pages

  • =?iso-8859-1?q?Re:_Bob_Ling=B4s_ignorance_is_clear?=
    ... can do in order to compare two INDEPENDENT proportions." ... Fisher's Exact Test is most appropriate for comparing two independent ... be used to compare two independent proportions. ... Not once have I suggested using the Normal approximation to a binomial. ...
    (sci.stat.math)
  • Re: Comparing documents confusion...
    ... > someone could expand on the help instructions and explain the odd alert ... I send a report to a client. ... Compare anyway?˛ In fact, ...
    (microsoft.public.mac.office.word)
  • Re: Comparing documents confusion...
    ... That warning message just occurs because you have Tracked Changes that have ... been made in your original document, since you last sent it to the client. ... client, and then in meantime added some text in the middle of Paragraph łA˛, ... When you open the Changed Document and then Compare it to the Original you ...
    (microsoft.public.mac.office.word)
  • Re: hypothesis test question on two proportions
    ... If the statistic is in the critical region, ... estimates for the two sample proportions which appear in the CI ... We now want to test whether H0: pi1 ... have to compare the Type I error rates and power of the two approaches ...
    (sci.stat.edu)
  • Re: Using document compare on a form
    ... I can understand why you've come to this proposed solution; ... Each client is provided ... the form changes slightly, people can run, say a macro to compare to the ... This is our first entry field ...
    (microsoft.public.word.vba.general)