Skip to main content

Between Estimator

The Between Estimator regresses on the unit means of the variables — exploiting between-unit variation rather than the within (time) variation used by Fixed Effects. It answers "do units with a higher average XX have a higher average YY".

When to use

Use Between when you care about long-run differences across units (e.g. comparing countries/firms by their averages). RE is essentially a weighted average of Between and Within.


Model specification

Yˉi=β0+Xˉiβ+εˉi,Yˉi=1TtYit\bar{Y}_i = \beta_0 + \bar{X}_i \beta + \bar{\varepsilon}_i, \qquad \bar{Y}_i = \frac{1}{T}\sum_t Y_{it}

OLS on the unit-averaged data.


Running in EcoLab

  1. Modeling module → Linear panel data family → Between.
  2. Declare entity/time, YY, XX.
  3. Run; contrast with FE (within) to analyze the source of variation; export the replication code.

Replication code

* ---- Between Estimator ----
use "panel_data.dta", clear
xtset id time

* Between estimator (regression on unit means)
xtreg y x1 x2, be

Limitations

  • Ignores time variation; does not control for individual effects like FE.
  • Loses dynamic information.

Video tutorial

Video Tutorial: Running the Between estimator in EcoLab

See also