Re: Are graded clinical signs more reliable than dichotomized?
- From: Frank E Harrell Jr <f.harrell@xxxxxxxxxxxxxx>
- Date: Wed, 05 Jul 2006 13:44:41 -0500
John Uebersax wrote:
Frank E Harrell Jr wrote:
Generally speaking, having more categories improves every aspect of
prediction and decision making,
Here's a point I don't think has yet been raised. What if the trait is
a true dichotomy, or nearly so?
Let x denote a latent (not directly observed) trait with two levels,
and let it be measured by two fallible measures, y1 and y2.
The fallible measurement introduces error:
y1(i) = x(i) + e1(i)
y2(i) = x(i) + e2(i)
where e1 and e2 denote measurement error and (i) denotes some case i.
We assume e1 and e2 are uncorrelated.
Example:
Let the two latent trait levels be 3 and 8 on a 10-point scale. Let
measurement error variance for y1 and y2 be equal.
Hypothetical data for several cases might look like this:
 x   y1   y2
------------
 3    1    3
 3    2    2
 3    3    4
 3    4    5
 3    5    1
 8    6    8
 8    7    9
 8    8   10
 8    9    7
 8   10    8
Ideally one would choose numbers above such that y1 and y2 are
completely uncorrelated within each level of x, as that is the
implication of the measurement model. But the point is that r(y1, y2)
is less than 1, and these numbers suffice to show that.
Now suppose we optimally discretize y1 and y2. Then we have:
 x   y1   y2
------------
 3    3    3
 3    3    3
 3    3    3
 3    3    3
 3    3    3
 8    8    8
 8    8    8
 8    8    8
 8    8    8
 8    8    8
and r(y1, y2) = 1. I would consider this "better agreement" obtained
by having fewer levels. The recoded variables y1 and y2 also now
correlate better with x, so that we would consider them more valid as
measures of x.
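As a rough check of the arithmetic in this example, here is a minimal Python sketch using the hypothetical data above. The cut point of 5.5 (the midpoint between the latent levels 3 and 8) and the helper names pearson and discretize are assumptions made for illustration; the post does not say how the "optimal" discretization is chosen.

```python
from math import sqrt

def pearson(a, b):
    """Plain Pearson correlation of two equal-length lists."""
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    sab = sum((u - ma) * (v - mb) for u, v in zip(a, b))
    saa = sum((u - ma) ** 2 for u in a)
    sbb = sum((v - mb) ** 2 for v in b)
    return sab / sqrt(saa * sbb)

# Hypothetical data copied from the first table in the post.
x  = [3, 3, 3, 3, 3, 8, 8, 8, 8, 8]
y1 = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
y2 = [3, 2, 4, 5, 1, 8, 9, 10, 7, 8]

def discretize(y, cut=5.5, low=3, high=8):
    # Assumed rule: values below the cut map to the low latent level,
    # all others to the high one. The cut of 5.5 is an assumption.
    return [low if v < cut else high for v in y]

d1, d2 = discretize(y1), discretize(y2)

r_raw = pearson(y1, y2)    # agreement on the graded scale
r_disc = pearson(d1, d2)   # agreement after discretization

print(f"r(y1, y2) graded:      {r_raw:.3f}")
print(f"r(y1, y2) discretized: {r_disc:.3f}")
print(f"r(x, y1) graded:       {pearson(x, y1):.3f}")
print(f"r(x, y1) discretized:  {pearson(x, d1):.3f}")
```

With these numbers the graded correlation comes out at about 0.76 and the discretized one at 1, which matches the claim in the text. The result depends entirely on the chosen cut separating the two latent groups without error.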
It's simpler if we use 1 and 2 instead of 3 and 8, but I leave it this
way because this seems potentially closer to the point about
considering the magnitude of disagreement.
Note that if one uses the ICC here instead of the Pearson correlation,
the conclusion is the same.
Do I make heroic assumptions? Yes. But only to potentially establish
the principle that discretization is sometimes better. If so, that
places the question in the realm of considering particular data, and
not applying a universal rule.
Unless I miss some obvious point, this example seems to demonstrate
that if one has a trait which is fundamentally discrete or strongly
multi-modal, then a rating system with fewer levels can be more
reliable and more valid.
Without elaborating, let me also suggest an analogy to digital signal
filtering with a low-pass filter. It seems easily verified that
sometimes one prefers a filtered signal with coarser resolution than a
noisier signal with higher resolution. I don't want to pursue this
analogy; I just propose it for consideration.
--
John Uebersax PhD
John,
I think that in the presence of error it's still generally better to use the observed continuous scale. There might be a gain from dichotomizing if you knew the ideal cut point, but in practice we don't know it.
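To make the cut-point dependence concrete, here is a small Python sketch that reuses the hypothetical x and y1 values from John's table. The alternative cut points 2.5 and 8.5 and the helper names are assumptions chosen only for illustration, and ten made-up cases cannot settle the general question.

```python
from math import sqrt

def pearson(a, b):
    """Plain Pearson correlation of two equal-length lists."""
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    sab = sum((u - ma) * (v - mb) for u, v in zip(a, b))
    saa = sum((u - ma) ** 2 for u in a)
    sbb = sum((v - mb) ** 2 for v in b)
    return sab / sqrt(saa * sbb)

# Hypothetical data from the quoted example.
x  = [3, 3, 3, 3, 3, 8, 8, 8, 8, 8]
y1 = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]

def dichotomize(y, cut, low=3, high=8):
    # Values below the cut are recoded to `low`, the rest to `high`.
    return [low if v < cut else high for v in y]

# Correlation of the observed graded scale with the latent trait.
r_cont = pearson(x, y1)

# Correlation with the latent trait after dichotomizing at three cuts:
# 5.5 is the midpoint between the latent levels; 2.5 and 8.5 are
# deliberately misplaced (assumed values, not from the post).
r_by_cut = {cut: pearson(x, dichotomize(y1, cut)) for cut in (2.5, 5.5, 8.5)}

print(f"graded scale: r = {r_cont:.3f}")
for cut, r in r_by_cut.items():
    print(f"cut at {cut}: r = {r:.3f}")
```

On these numbers the graded scale gives about 0.87, the midpoint cut gives 1, and either misplaced cut gives 0.5, so the dichotomized version beats the graded one only when the cut happens to be in the right place.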
Cheers
Frank