A quant team fits a deep neural network on intraday Bitcoin tick data. The fund uses the model to place trades the following morning; nobody on the team can describe what the network has learned, and they are content with that as long as the next-day forecast is accurate and well-calibrated. How is this task best classified along the prediction/inference axis?
- A Inference, because the team relies on the model to make decisions about the future.
- B Inference, because deep neural networks have many parameters whose meaning could be interpreted post hoc.
- C Inference, because next-day forecasts are downstream decisions, and reasoning under uncertainty about future outcomes is the textbook definition of statistical inference.
- D Prediction, because they care about $\hat Y$ being accurate but not about the form of $f$.
Show answer
The defining tell is "nobody can describe what the network has learned, and they are content with that as long as the forecast is accurate." That is the prof's canonical prediction setup: "you don't really care if the model is right. That's secondary. Being right is not as important as being able to predict well… with knowing your uncertainty."
A confuses "make decisions" with "infer structure" — quants make decisions on a black-box forecast every day. B inverts the rule: parameter count is irrelevant to the prediction-vs-inference axis; what matters is whether the analyst cares about the form of $f$. C abuses the everyday word "inference" — in this course "inference" specifically means caring about the form of $f$ (which knobs matter, by how much), not "reasoning about the future." Forecasting tomorrow's price under uncertainty is the canonical prediction task, regardless of how downstream decisions feel.
Atoms: prediction-vs-inference, supervised-vs-unsupervised. Lecture: L01-intro.