Why are exponentiated logistic regression coefficients considered âodds ratiosâ?
Clash Royale CLAN TAG#URR8PPP
.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty margin-bottom:0;
up vote
9
down vote
favorite
Logistic regression models the log odds of an event as some set of predictors. That is, log(p/(1-p)) where p is the probability of some outcome. Thus, the interpretation of the raw logistic regression coefficients for some variable (x) has to be on the log odds scale. That is, if the coefficient for x = 5 then we know that a 1 unit change in x correspondents to 5 unit change on the log odds scale that an outcome will occur.
However, I often see people interpret exponentiated logistic regression coefficients as odds ratios. However, clearly exp(log(p/(1-p))) = p/(1-p), which is an odds. As far as I understand it, an odds ratio is the odds of one event occurring (e.g., p/(1-p) for event A) over the odds of another event occurring (e.g., p/(1-p) for event B).
What am I missing here? Is seems like this common interpretation of exponentiated logistic regression coefficients is incorrect.
regression logistic logit
add a comment |Â
up vote
9
down vote
favorite
Logistic regression models the log odds of an event as some set of predictors. That is, log(p/(1-p)) where p is the probability of some outcome. Thus, the interpretation of the raw logistic regression coefficients for some variable (x) has to be on the log odds scale. That is, if the coefficient for x = 5 then we know that a 1 unit change in x correspondents to 5 unit change on the log odds scale that an outcome will occur.
However, I often see people interpret exponentiated logistic regression coefficients as odds ratios. However, clearly exp(log(p/(1-p))) = p/(1-p), which is an odds. As far as I understand it, an odds ratio is the odds of one event occurring (e.g., p/(1-p) for event A) over the odds of another event occurring (e.g., p/(1-p) for event B).
What am I missing here? Is seems like this common interpretation of exponentiated logistic regression coefficients is incorrect.
regression logistic logit
add a comment |Â
up vote
9
down vote
favorite
up vote
9
down vote
favorite
Logistic regression models the log odds of an event as some set of predictors. That is, log(p/(1-p)) where p is the probability of some outcome. Thus, the interpretation of the raw logistic regression coefficients for some variable (x) has to be on the log odds scale. That is, if the coefficient for x = 5 then we know that a 1 unit change in x correspondents to 5 unit change on the log odds scale that an outcome will occur.
However, I often see people interpret exponentiated logistic regression coefficients as odds ratios. However, clearly exp(log(p/(1-p))) = p/(1-p), which is an odds. As far as I understand it, an odds ratio is the odds of one event occurring (e.g., p/(1-p) for event A) over the odds of another event occurring (e.g., p/(1-p) for event B).
What am I missing here? Is seems like this common interpretation of exponentiated logistic regression coefficients is incorrect.
regression logistic logit
Logistic regression models the log odds of an event as some set of predictors. That is, log(p/(1-p)) where p is the probability of some outcome. Thus, the interpretation of the raw logistic regression coefficients for some variable (x) has to be on the log odds scale. That is, if the coefficient for x = 5 then we know that a 1 unit change in x correspondents to 5 unit change on the log odds scale that an outcome will occur.
However, I often see people interpret exponentiated logistic regression coefficients as odds ratios. However, clearly exp(log(p/(1-p))) = p/(1-p), which is an odds. As far as I understand it, an odds ratio is the odds of one event occurring (e.g., p/(1-p) for event A) over the odds of another event occurring (e.g., p/(1-p) for event B).
What am I missing here? Is seems like this common interpretation of exponentiated logistic regression coefficients is incorrect.
regression logistic logit
asked Aug 9 at 18:02
jack
482
482
add a comment |Â
add a comment |Â
2 Answers
2
active
oldest
votes
up vote
8
down vote
accepted
@Laconic's answer is great and complete, in my opinion. Something I wanted to add is that the original coefficients describe a difference in the log odds for two units who differ by 1 in the predictor. E.g., for a coefficient on $X$ of 5, we can say that the difference in log odds between two units who differ on $X$ by 1 is 5. Mathematically,
$$beta = log(textodds(p|X=x_0+1))-log(textodds(p|X=x_0)) $$
When you exponentiate $beta$, you get
$$exp(beta) = exp(log(textodds(p|X=x_0+1))-log(textodds(p|X=x_0))) \
= fracX=x_0+1)))X=x_0))) \
= fractextodds(ptextodds(p$$
which is a ratio of odds, an odds ratio.
1
This is extremely clear to me. My question is resolved.
â jack
Aug 9 at 18:51
add a comment |Â
up vote
9
down vote
Consider two set of conditions, the first described by the vector of independent variables $X$, and the second described by the vector $X'$, which differs only in the ith variable $x_i$, and by one unit. Let $beta$ be the vector of model parameters as usual.
According to the logistic regression model, the probability of the event occurring in the first case is $p_1 = frac11 + exp(-X beta)$, so that the odds of the event occurring is $fracp_11-p_1 = exp(X beta)$.
The probability of the event occurring in the second case is $p_2 = frac11 + exp(-X' beta)$, so that the odds of the event occurring is $fracp_21-p_2 = exp(X' beta) = exp(X beta + beta_i)$.
The ratio of the odds in the second case to the odds in the first case is therefore $exp(beta_i)$. Hence the interpretation of the exponential of the parameter as an odds ratio.
add a comment |Â
2 Answers
2
active
oldest
votes
2 Answers
2
active
oldest
votes
active
oldest
votes
active
oldest
votes
up vote
8
down vote
accepted
@Laconic's answer is great and complete, in my opinion. Something I wanted to add is that the original coefficients describe a difference in the log odds for two units who differ by 1 in the predictor. E.g., for a coefficient on $X$ of 5, we can say that the difference in log odds between two units who differ on $X$ by 1 is 5. Mathematically,
$$beta = log(textodds(p|X=x_0+1))-log(textodds(p|X=x_0)) $$
When you exponentiate $beta$, you get
$$exp(beta) = exp(log(textodds(p|X=x_0+1))-log(textodds(p|X=x_0))) \
= fracX=x_0+1)))X=x_0))) \
= fractextodds(ptextodds(p$$
which is a ratio of odds, an odds ratio.
1
This is extremely clear to me. My question is resolved.
â jack
Aug 9 at 18:51
add a comment |Â
up vote
8
down vote
accepted
@Laconic's answer is great and complete, in my opinion. Something I wanted to add is that the original coefficients describe a difference in the log odds for two units who differ by 1 in the predictor. E.g., for a coefficient on $X$ of 5, we can say that the difference in log odds between two units who differ on $X$ by 1 is 5. Mathematically,
$$beta = log(textodds(p|X=x_0+1))-log(textodds(p|X=x_0)) $$
When you exponentiate $beta$, you get
$$exp(beta) = exp(log(textodds(p|X=x_0+1))-log(textodds(p|X=x_0))) \
= fracX=x_0+1)))X=x_0))) \
= fractextodds(ptextodds(p$$
which is a ratio of odds, an odds ratio.
1
This is extremely clear to me. My question is resolved.
â jack
Aug 9 at 18:51
add a comment |Â
up vote
8
down vote
accepted
up vote
8
down vote
accepted
@Laconic's answer is great and complete, in my opinion. Something I wanted to add is that the original coefficients describe a difference in the log odds for two units who differ by 1 in the predictor. E.g., for a coefficient on $X$ of 5, we can say that the difference in log odds between two units who differ on $X$ by 1 is 5. Mathematically,
$$beta = log(textodds(p|X=x_0+1))-log(textodds(p|X=x_0)) $$
When you exponentiate $beta$, you get
$$exp(beta) = exp(log(textodds(p|X=x_0+1))-log(textodds(p|X=x_0))) \
= fracX=x_0+1)))X=x_0))) \
= fractextodds(ptextodds(p$$
which is a ratio of odds, an odds ratio.
@Laconic's answer is great and complete, in my opinion. Something I wanted to add is that the original coefficients describe a difference in the log odds for two units who differ by 1 in the predictor. E.g., for a coefficient on $X$ of 5, we can say that the difference in log odds between two units who differ on $X$ by 1 is 5. Mathematically,
$$beta = log(textodds(p|X=x_0+1))-log(textodds(p|X=x_0)) $$
When you exponentiate $beta$, you get
$$exp(beta) = exp(log(textodds(p|X=x_0+1))-log(textodds(p|X=x_0))) \
= fracX=x_0+1)))X=x_0))) \
= fractextodds(ptextodds(p$$
which is a ratio of odds, an odds ratio.
edited Aug 10 at 14:14
answered Aug 9 at 18:48
Noah
2,3611314
2,3611314
1
This is extremely clear to me. My question is resolved.
â jack
Aug 9 at 18:51
add a comment |Â
1
This is extremely clear to me. My question is resolved.
â jack
Aug 9 at 18:51
1
1
This is extremely clear to me. My question is resolved.
â jack
Aug 9 at 18:51
This is extremely clear to me. My question is resolved.
â jack
Aug 9 at 18:51
add a comment |Â
up vote
9
down vote
Consider two set of conditions, the first described by the vector of independent variables $X$, and the second described by the vector $X'$, which differs only in the ith variable $x_i$, and by one unit. Let $beta$ be the vector of model parameters as usual.
According to the logistic regression model, the probability of the event occurring in the first case is $p_1 = frac11 + exp(-X beta)$, so that the odds of the event occurring is $fracp_11-p_1 = exp(X beta)$.
The probability of the event occurring in the second case is $p_2 = frac11 + exp(-X' beta)$, so that the odds of the event occurring is $fracp_21-p_2 = exp(X' beta) = exp(X beta + beta_i)$.
The ratio of the odds in the second case to the odds in the first case is therefore $exp(beta_i)$. Hence the interpretation of the exponential of the parameter as an odds ratio.
add a comment |Â
up vote
9
down vote
Consider two set of conditions, the first described by the vector of independent variables $X$, and the second described by the vector $X'$, which differs only in the ith variable $x_i$, and by one unit. Let $beta$ be the vector of model parameters as usual.
According to the logistic regression model, the probability of the event occurring in the first case is $p_1 = frac11 + exp(-X beta)$, so that the odds of the event occurring is $fracp_11-p_1 = exp(X beta)$.
The probability of the event occurring in the second case is $p_2 = frac11 + exp(-X' beta)$, so that the odds of the event occurring is $fracp_21-p_2 = exp(X' beta) = exp(X beta + beta_i)$.
The ratio of the odds in the second case to the odds in the first case is therefore $exp(beta_i)$. Hence the interpretation of the exponential of the parameter as an odds ratio.
add a comment |Â
up vote
9
down vote
up vote
9
down vote
Consider two set of conditions, the first described by the vector of independent variables $X$, and the second described by the vector $X'$, which differs only in the ith variable $x_i$, and by one unit. Let $beta$ be the vector of model parameters as usual.
According to the logistic regression model, the probability of the event occurring in the first case is $p_1 = frac11 + exp(-X beta)$, so that the odds of the event occurring is $fracp_11-p_1 = exp(X beta)$.
The probability of the event occurring in the second case is $p_2 = frac11 + exp(-X' beta)$, so that the odds of the event occurring is $fracp_21-p_2 = exp(X' beta) = exp(X beta + beta_i)$.
The ratio of the odds in the second case to the odds in the first case is therefore $exp(beta_i)$. Hence the interpretation of the exponential of the parameter as an odds ratio.
Consider two set of conditions, the first described by the vector of independent variables $X$, and the second described by the vector $X'$, which differs only in the ith variable $x_i$, and by one unit. Let $beta$ be the vector of model parameters as usual.
According to the logistic regression model, the probability of the event occurring in the first case is $p_1 = frac11 + exp(-X beta)$, so that the odds of the event occurring is $fracp_11-p_1 = exp(X beta)$.
The probability of the event occurring in the second case is $p_2 = frac11 + exp(-X' beta)$, so that the odds of the event occurring is $fracp_21-p_2 = exp(X' beta) = exp(X beta + beta_i)$.
The ratio of the odds in the second case to the odds in the first case is therefore $exp(beta_i)$. Hence the interpretation of the exponential of the parameter as an odds ratio.
answered Aug 9 at 18:25
The Laconic
711414
711414
add a comment |Â
add a comment |Â
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstats.stackexchange.com%2fquestions%2f361529%2fwhy-are-exponentiated-logistic-regression-coefficients-considered-odds-ratios%23new-answer', 'question_page');
);
Post as a guest
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password