Odds, log-odds, and odds-ratio interpretation
The interpretive core of logistic-regression: the coefficient is a log odds-ratio, not a slope on probability. The prof flagged the odds ↔ probability conversion as the kind of question he will ask, calculator-friendly, format-flexible, and a classic interaction trap.
Definition (prof’s framing)
“Odds is used in betting or horse races… 5-to-1 chance that something happens. Then there’s a 5-over-6 chance that this thing happens because it happens in 5 out of 6 cases.” - L07-classif-1
Given a probability of an event:
Odds live in ; probability lives in . The two encode the same information; odds are the natural scale for the multiplicative logistic model.
Notation & setup
- : probability of the “success” / class-1 event given covariates.
- : ratio of the class-1 probability to the class-0 probability.
- : the log-odds, the linear predictor in logistic regression.
Formula(s) to know cold
Odds ↔ probability conversions:
The logistic model on the odds scale (multiplicative):
Odds-ratio interpretation of , increase by 1 unit, all other covariates fixed:
Equivalently: for a one-unit increase in .
Insights & mental models
- The unit of matters. For
incomemeasured in dollars, might look tiny but gives a meaningful per-$10k change. The prof walked through this on theDefaultdataset,balanceandincomehad very different per-unit interpretations because of scale. - L07-classif-1 - Reference-class flip flips the sign. If you encode 1 = default vs 1 = non-default, flips sign. Always state your encoding when interpreting output.
- Direction of effect by sign of . → odds (and hence probability) of class 1 increase with ; → decrease. Magnitude tells the multiplicative factor.
- Logit on probability has no straight-line interpretation. A one-unit change in produces a constant change in the log-odds, but a non-constant change in , the -change depends on where on the sigmoid you are.
Interaction trap (prof flagged for the exam)
When the model includes , the odds ratio for a one-unit increase in is not , it’s , which depends on . From L27-summary (Q7 walkthrough on default data with sex × pay_0 interaction):
“How does the feature pay-zero influence the odds to default? … We need to be able to do it for the men and the women.” - L27-summary
For each group (each level of the interacting categorical), compute the multiplicative effect separately. The 2025 exam question multiplied odds by for males and for females. Answer per group, not as a single number.
Exam signals
“What are the odds, the log odds? How do you compute them? What do they mean?” - L27-summary
“This is the kind of question I would ask. It’s simple. You calculate it. It’s why you need a calculator.” - L27-summary
“By increasing the covariate by one unit, we change the odds for y to be in class 1 by a factor of the exponent of beta.” - L07-classif-1
The 2025 exam Q3c was exactly the Exercise 4.3 conversion: given odds 0.37 → , given → odds.
Pitfalls
- Coefficient probability change. Reporting ” means age increases default probability by 0.05” is wrong, it’s the change in log-odds.
- Forgetting the reference class. Default-class encoding flip → sign flip on every coefficient.
- Reporting odds-ratio for an interaction without specifying the level of the interacting variable. See the trap above.
- Computing odds-ratio for a non-unit change. For a 100-unit change in , the odds multiply by , not .
Scope vs ISLP
- In scope: Odds definition, log-odds = logit, odds-ratio interpretation of , the multiplicative form of the logistic model, the interaction-trap caveat.
- Look up in ISLP: §4.3.1 (logistic model + odds), pp. 134–135. Equation (4.4) is the canonical odds-ratio statement.
- Skip in ISLP: Nothing relevant excluded, odds is a small, self-contained piece.
Exercise instances
- Exercise4.3a: odds = 0.37 of defaulting → fraction of people who default = .
- Exercise4.3b: → odds = .
- CE1 problem 3b: interpret in the tennis logistic regression: “one extra ace for player 1 multiplies the odds of player 1 winning by .”
The 2025 exam (Q3c) and 2024 exam both asked the same odds-conversion calculation, this is the most reliably-recurring exam question in module 4.
How it might appear on the exam
- Calculator MCQ: “Given odds = 0.5, what is ?” (answer 1/3) or “Given , what are the odds?” (answer 4).
- Coefficient interpretation: Given a logistic-regression output table, “for a one-unit increase in , the odds of multiply by ___” → .
- Interaction trap T/F: “The odds ratio for when an interaction is in the model is ” → false, depends on .
- Encoding T/F: “Flipping the reference class flips the sign of ” → true.
Related
- logistic-regression: the parent model where the odds-ratio interpretation lives.
- sensitivity-specificity: read off a confusion matrix at a fixed cutoff; the cutoff sits on the probability scale, not the odds scale.