Skip to content

Hierarchical¶

Types¶

Non-additive: Price of Make, model, trim
Additive
- Cross-sectional
  - Revenue of Country, State, City
- Temporal
  - Revenue of Year, Quarter, Month
- Cross-temporal
  - Mix of both

Approaches¶

Bottom-up
Top-Down
Middle-out

Hierarchical non-additive¶

If there are multiple independent levels of features, then run a model for each levels - simple model for each level is better than one complex model for all group - especially useful for imbalanced hierarchies

Advantages: 1. Better loss definition: Predict accurately on average → Predict each subset accurately on average 2. Simpler and easier to debug 3. Faster (than requiring a single model to produce splits)

Complexity of atomic model for each hierarchy should be based on the amount of data available for that hierarchy - create a meta-estimator to conditionally apply a model

Hierarchical Additive¶

Required for coherent forecasts

		\(\hat y_\text{top}\)	\(\hat y_\text{bot}\)	\(\hat p_l\)
Bottom-up		\(\sum\limits_l \hat y_{\text{bot}, l}\)	\(\hat f\)
Top-down		\(\hat f\)	\(\sum\limits_l \hat p_l \cdot \hat y_\text{top}\)
	Historical proportion			\(\dfrac{y_{\text{bot}, l}}{y_\text{top}}\)
	Predicted proportion			\(\dfrac{\hat y_{\text{bot}, l}}{\hat y_\text{top}}\)
Middle-out
Optimal Min-Trace

2026-03-13

Comments