Skip to main content

Pooled OLS — Pooled panel regression

Pooled OLS pools all observations of panel data (N units × T periods) into a single sample and runs OLS as if cross-sectional, ignoring the panel structure. It is the baseline for comparison with FE/RE.

Strong assumption

Pooled OLS assumes no individual effects (αi\alpha_i identical across units). If unobserved unit characteristics are correlated with XX, Pooled OLS is biased ⇒ use FE. Errors within a unit are usually correlated ⇒ use clustered standard errors.


Model specification

Yit=β0+Xitβ+εitY_{it} = \beta_0 + X_{it}\beta + \varepsilon_{it}

Like OLS but using all N×TN \times T observations. Use clustered SE by unit.


Running in EcoLab

  1. Modeling module → Linear panel data family → Pooled OLS.
  2. Declare entity/time, YY, XX; choose clustered SE.
  3. Run; compare with FE/RE via tests; export the replication code.

Replication code

* ---- Pooled OLS with clustered SE ----
use "panel_data.dta", clear

* Pooled OLS with clustered standard errors by entity
reg y x1 x2, vce(cluster id)

Video tutorial

Video Tutorial: Running Pooled OLS in EcoLab

See also