Re: describing logistic regression results for clinicians
- From: Jerry Dallal <gdallal@xxxxxxxxxxxxxxxxxxxxxxxx>
- Date: Mon, 10 Apr 2006 23:17:44 -0400
David Winsemius wrote:
Jerry Dallal <gdallal@xxxxxxxxxxxxxxxx> wrote in
news:hct_f.1947$No6.42645@xxxxxxxxxxxxxx:
David Winsemius wrote:Jerry Dallal <gdallal@xxxxxxxxxxxxxxxx> wrote in news:IVQYf.1917When reading these questions, it is important to keep in mind not only
$No6.42339@xxxxxxxxxxxxxx:
Bill Howells wrote:The departure would not have been from "additivity" but from multiplicativity. Additive on the log-odds scale is the same as multiplicative on the odds scale. Lack of power is besides the pointThis was a hallway conversation with a colleague. He is responding
to reviews from a paper to a second tier health research journal
and showed me a table with coefficients and confidence intervals
from a logistic regression model of a binary outcome with two
binary predictors, say x1 and x2, and their interaction. The two
binary predictors were combined into four categories of 0/0, 0/1,
1/0, 1/1 with the 0/0 as the reference category (ie. x1=0, x2=0). He said in another model where the interaction was parameterized as
the product of the two binary variables, the coefficient for the
interaction was not statistically different from zero. The odds
ratios in the model he showed me were something like OR1=7, OR2=9,
OR12=23 (95% CI: 11-77) where OR1 is the odds ratio for the "X1
only" group, OR2 is the odds ratio for the "X2 only" group, and
OR12 is the odds ratio for the "both" group (1/1), relative to the
0/0 group. OR1 and OR2 were statistically different than zero.
Anyway, the question! He originally described the results using
the phrase "X1 and X2 are independent predictors" of outcome
because they are both statistically signficant in the model and the
interaction is statistically non-significant. Reviewer #1 balked
at this description because the paper also contained a test of
association between X1 and X2 which was statistically signficant,
and the point estimate of the "both" group, ie. OR12, is "far from"
what one would expect if the terms were truly independent and
therefore additive (on the log scale). So, really two questions:
1. how to respond to Reviewer #1? 2. how to describe the results
in the paper whose audience is non-statistical clinicians?
Regarding question 2, another colleague had suggested using the
phrase "additive on the log-scale" but the author felt this wasn't
very meaningful for a non-statistical audience. I suggested
pointing out that although the point estimate is far from what one
would expect from additivity, the confidence interval for OR12
includes the null value, eg. 7*9 = 63 is within the confidence
interval 11-77, and commenting there is low power to detect the
interaction due to fewer subjects in the "both" group.
if the test for the interaction came in positive.I read it as having a "full" model with parameterisation:As far as responding to the reviewer, not sure how to address his questions about using the term "independent" to describe the[I found the first paragraph of the post too dense to sort out in a casual reading, so I skipped to the second.]
results. Comments, suggestions appreciated. Bill H.
intercept for 0/0, b1, b2, b(interact)
If his ORs of OR1=7, OR2=9, ORinteract=23 and the addition of the interaction is significant, it make me wonder if the interaction is
"sub- multiplicative".
Welcome to the land of Murk, where things are murky.I would think the reviewer "might", but he would be wrong. The joint association of the two independent variables does not mean that they cannot be independent predictors of an outcome. Consider the
I can understand why a reviewer might draw the line at using the
phrase "independent predictors" to describe variables that are
associated.
association of HDL cholesterol with gender. Does anyone think that
just because HDL levels are lower in men that you would not want both
variables in the model predicting coronary disease events? The
mechanics of the LR machinery should be able to determine whether two
variables have independent predictive capacity. You should also be
asking what the predictors are _in_reality_. What does prior science
say about these as predictors? Is there any causal connections of one
of these Xs with the other?
how we ourselves see them, but how others might reasonably see them.
Which club do you imagine "we" belong to? My graduate degrees lead me to call myself a "physician" and "epidemiologist". You had already passed on interpreting his first paragraph so I thought a non-statistician should take a shot.
The problem here is the word "predictors". For many, "predictors" is synonymous with "predictor variables" with emphasis on "variables". If one reads it that way, then "independence" becomes "independent variables", which has a particular meaning in statistics. If oneYour suggested fix does not satisfy my logical objections in the slightest. It appears you did not read my comments for meaning. I gave two specific counter-examples. What "can be done about" age or gender?
reads "predictor" as "risk factor", then the argument I gave for "risk
factor" applies. Because "predictor" lends itself to this kind of
confusion, I can understand why a random reviewer might not like it. It is easily fixed, however, with a few simple edits.
I seem to be speaking more colloquially than you like. In place of "modify" insert "done all one can about".For better or worse, the medical literature has adopted the phrase "independent risk factor" to describe the sort of thing yourModifiabilty has nothing to do with prediction. Gender and age are
colleague is seeing. That is, it's okay to call something an
independent risk factor if, after one has modified his/her risk by
all other factors, one can further modify it by attending to the
factor under consideration.
not modifiable in the slightest, yet are potent predictors of many
outcomes.
You are correct that I read your comment too quickly. I've got no problem if you want to dot all the i's and cross all the t's. There seems to be NO disagreement over the fact that two things can properly be called ndependent risk factors without being statistically independent rvs, which, in fact was the OP's question.
My point still ... modifiability has nothing to do with "independent" predictors. Granted, they would not be amenable to modification if you were designing a randomized intervention trial, but that was not the question.>
Even then, you would want to consider the risk of an imbalance in important non-modifiable, independent predictors in your allocation. There are a host of modelling issues that may arise in dealing with these non-modifiable variables, but you cannot just define them away as not worthy of consideration as independent predictors.
Take a look at the history of the term "risk factor" in medicine. You will find age, gender, and family history in most of the early discussions of independent risk factors. Here is a current link to a page by the American Heart Association that might be considered "colloquial": http://www.americanheart.org/presenter.jhtml?identifier=4726
<quote> The American Heart Association has identified several risk factors. Some of them can be modified, treated or controlled, and some can't. The more risk factors you have, the greater your chance of developing coronary heart disease. </quote>
It is not necessary that the risk factors beWell, sure! But, again, I seem to be speaking more colloquially than you like and said "model" rather than "logistic regression model". This has been a discussion about logistic regression. There are otherstatistically independent, but that each makes an "independent"There is another way?
contribution to the risk. One way investigators establish that
something is an independent risk factor is by showing that it makes
a statistically significant contribution to a model that contains
all other known or suspected risk factors.
techniques for handling such data.
There are, certainly. But even if you adopt another link function or embed the LR model in a more general family of models, you will still be determining whether adding the interaction term reduces the variance or deviance significantly relative to the smaller model.
Of course. My suggested constraints on the search terms were intended to foster a more dense collection of hits. I got over half a million hits with your phrase. Adding "confounding" narrowed that down by a factor of one hundred. Adding "joint association" probably narrowed it too far. Sometimes it is useful for someone who lacks the proper statistical language to be given terms for a more focussed search.Even without!A search on the phrase "independent risk factor" in any good search engine will give you a wealth of examples.Maybe with the added terms "confounding" and "joint association".
No question that there are always ways to improve a search and ways to tighten a description until it's textbook ready. I think it's fine that you've tightened things up, but the essential fact hasn't changed--namely, that independent risk factors need not be "statistically indpendent" to be called independent.
.
- References:
- describing logistic regression results for clinicians
- From: Bill Howells
- Re: describing logistic regression results for clinicians
- From: Jerry Dallal
- Re: describing logistic regression results for clinicians
- From: David Winsemius
- Re: describing logistic regression results for clinicians
- From: Jerry Dallal
- Re: describing logistic regression results for clinicians
- From: David Winsemius
- describing logistic regression results for clinicians
- Prev by Date: Re: describing logistic regression results for clinicians
- Next by Date: Pairing covariates with outcome variables in a MANOVA
- Previous by thread: Re: describing logistic regression results for clinicians
- Next by thread: Re: describing logistic regression results for clinicians
- Index(es):
Relevant Pages
|