By Chapter 15, statistics realizes something important.
Until now, almost every model assumed:
one response variable.
Examples:
- Sales
- Churn
- Survival Time
- Purchase Probability
But real business problems rarely behave that way.
Usually outcomes move together.
Examples:
- Sales and Margin
- Deployment and Turnover
- Revenue and Conversion
- Customer Growth and Retention
This chapter introduces:
Multivariate GLMs and Joint Modeling
The idea:
Model multiple outcomes together.
Why Separate Models Can Fail
Suppose we model:
Sales:
$$
Y_1
$$
and separately:
Margin:
$$
Y_2
$$
We obtain:
- Model 1 → Sales
- Model 2 → Margin
But what if high sales usually reduce margin?
The outcomes are correlated.
Separate models miss that relationship.
The Big Idea
Instead of modeling:
$$
Y
$$
we model:
$$
(Y_1,Y_2,\ldots,Y_k)
$$
simultaneously.
Now relationships between outcomes become visible.
Example: Customer Performance
Suppose for each customer we observe:
| Customer | Sales | Deployment |
|---|---|---|
| A | 150,000 | 400,000 |
| B | 90,000 | 250,000 |
| C | 220,000 | 600,000 |
Question:
Do deployment and sales evolve together?
Joint models answer this.
Covariance Between Outcomes
Responses now have:
$$
{Cov}(Y_1,Y_2)
$$
Interpretation:
How outcomes move together.
| Covariance Pattern | Interpretation |
|---|---|
| Positive | Outcomes rise together |
| Negative | Trade-off relationship |
| Near Zero | Approximately independent |
Example: Sales and Margin
Suppose:
$$
{Corr}(Y_{\text{Sales}},Y_{\text{Margin}})=0.8
$$
Meaning:
Higher sales are usually associated with higher margins.
Multivariate Regression
Single Outcome
$$
Y=X\beta+\varepsilon
$$
Multiple Outcomes
$$
Y=XB+E
$$
where:
| Symbol | Meaning |
|---|---|
| $$Y$$ | Matrix of outcomes |
| $$X$$ | Predictor matrix |
| $$B$$ | Multiple coefficient sets |
| $$E$$ | Error matrix |
Now several responses are modeled simultaneously.
Joint GLMs
Suppose:
Outcome 1
Sales Count
Poisson distribution
Outcome 2
Revenue
Gamma distribution
We can model:
Sales:
$$
\log(\mu_1)=X\beta
$$
Revenue:
$$
\log(\mu_2)=X\gamma
$$
Both are estimated jointly.
Why This Helps
Suppose customer activity increases.
Usually:
- Sales increase
- Margin changes
- Conversion rates change
Joint models learn these dependencies automatically.
Shared Random Effects
One elegant solution is to introduce a shared latent factor.
Model:
$$
Y_1 \mid u
$$
and
$$
Y_2 \mid u
$$
Both depend on:
$$
u
$$
where:
$$
u
$$
represents a hidden characteristic.
Example: Store Strength
Suppose:
$$
u=\text{Store Strength}
$$
This hidden factor influences:
- Sales
- Conversion
- Inventory Turnover
One random effect explains multiple outcomes simultaneously.
Multivariate Longitudinal Data
Suppose we track:
| Month | Sales | Margin |
|---|---|---|
| Jan | 10,000 | 3,000 |
| Feb | 12,000 | 3,400 |
| Mar | 15,000 | 4,100 |
Now correlations exist across:
- Outcomes
- Time
This creates a much richer modeling problem.
Joint Survival Models
Suppose we measure:
- Customer spending
- Time to churn
Joint modeling allows us to estimate both together.
Example:
Higher spending may predict longer retention.
This is very common in customer analytics.
Inventory Example
Suppose for each SKU we model:
Outcome 1
Sales Count
Outcome 2
Turn
Outcome 3
Aged Inventory
A joint model learns how:
- Fast sellers behave
- Inventory accumulates
- Turnover evolves
simultaneously.
Customer Lifecycle Example
A common joint-modeling application:
Predict:
- Purchase Probability
- Purchase Amount
- Churn Timing
Together.
This provides a complete customer view.
Why Separate Models Miss Things
Separate models often assume:
$$
{Cov}(Y_i,Y_j)=0
$$
Joint models estimate:
$$
{Cov}(Y_i,Y_j)
$$
directly.
That additional information can change decisions dramatically.
Conditional Independence
Many joint models assume:
Given latent variables:
$$
u
$$
the outcomes become independent.
Mathematically:
$$
Y_1 \perp Y_2 \mid u
$$
Example:
Given retailer quality,
sales and margin may become independent.
Estimation Becomes Hard
Joint models often require:
- Maximum Likelihood Estimation (MLE)
- Expectation-Maximization (EM)
- Bayesian MCMC
Computation becomes substantially heavier.
Dimension Explosion
As the number of outcomes increases:
- Parameters increase
- Covariance terms increase
- Computation increases
Solutions often include:
- Shrinkage
- Regularization
- Latent Factors
Practical Applications
| Outcome Set |
|---|
| Sales + Margin |
| Conversion + Revenue |
| Inventory + Turn |
| Demand + Returns |
| Churn + Spend |
| Risk + Profit |
Chapter 15’s Big Lesson
Reality rarely produces one outcome at a time.
Business systems interact.
Good models learn outcomes together rather than separately.
Final Thought
Before Chapter 15, you ask:
“What predicts this outcome?”
After Chapter 15, you ask:
“How do outcomes influence each other?”
That shift moves statistics from:
single-target prediction
to
understanding systems.

Leave a Reply