Skip to main content

ARDL Model (Autoregressive Distributed Lag)

ARDL is a time-series regression model that combines lags of the dependent variable and lags of the independent variables in a single equation. ARDL is appropriate when variables have a mixed order of integration — some stationary at level I(0), some at first difference I(1) — and it allows testing for cointegration through the bounds test of Pesaran, Shin and Smith (2001).

In EcoLab, ARDL belongs to the Time Series group and generates reproducible code for Stata, R and Python. See also Estimation & Modeling and EcoLab Overview.


When should you use ARDL?

Use ARDL when all of the following hold:

  • The data is time series (a single country/unit over time), typically a small-to-moderate sample.
  • Variables are integrated of order no higher than I(1) (no variable is I(2)).
  • The order of integration is mixed I(0)/I(1) — a case where Engle–Granger or Johansen is hard to apply.
  • You need to estimate both the long-run relationship and the short-run dynamics via an error correction model (ECM).

Do not use ARDL when any variable is I(2), or when the data is multi-unit panel data (consider FEM/REM or panel ARDL/PMG).


Model specification

General form of ARDL(pp, qq) with one independent variable:

Yt=c+i=1pφiYti+j=0qβjXtj+εtY_t = c + \sum_{i=1}^{p} \varphi_i \, Y_{t-i} + \sum_{j=0}^{q} \beta_j \, X_{t-j} + \varepsilon_t

where pp is the lag order of the dependent variable YY and qq is the lag order of the independent variable XX. The error correction form (ARDL-ECM) separates the long-run relationship from short-run adjustment:

ΔYt=c+α(Yt1θXt1)+iγiΔYti+jδjΔXtj+εt\Delta Y_t = c + \alpha \, (Y_{t-1} - \theta \, X_{t-1}) + \sum_i \gamma_i \, \Delta Y_{t-i} + \sum_j \delta_j \, \Delta X_{t-j} + \varepsilon_t

θ\theta is the long-run coefficient; α\alpha (the error correction term) must be negative and statistically significant to confirm the existence of a cointegrating relationship.


Assumptions and testing steps

  1. Order of integration: unit-root tests (ADF, Phillips–Perron, KPSS) to ensure no variable is I(2).
  2. Optimal lag selection: based on information criteria (AIC/BIC).
  3. Bounds test: the F-statistic compared against the upper/lower critical bounds to conclude on cointegration.
  4. Diagnostics: autocorrelation (Breusch–Godfrey), heteroskedasticity (Breusch–Pagan/White), normality of residuals (Jarque–Bera).
  5. Stability: CUSUM and CUSUMSQ tests.

Running ARDL in EcoLab

  1. In the Data Collection module, connect EcoData or public sources (World Bank, IMF, FRED) to obtain the time series of the dependent and independent variables.
  2. In the Modeling module, click Add model → choose the Time Series group → ARDL.
  3. Select the dependent variable YY, the independent variables XX, and let the system pick lags by AIC/BIC (or set them manually).
  4. Click Run model. Review the Estimation tab (short-run/long-run coefficients), the Diagnostics tab (bounds test, CUSUM), and the Replication Code tab for the Stata/R/Python script.

Input / output example

Input (illustrative): annual series 1990–2023 with growth (GDP growth %), fdi (FDI/GDP %), open (trade openness %), inf (inflation %).

Output (format, illustrative figures — not real results):

ComponentCoefficientStd. errorp-value
Long run: fdi0.420.150.012
Long run: open0.080.040.061
ECM (α\alpha)−0.550.130.001
Bounds F-stat6.10(above upper I(1) bound)cointegrated

Interpretation: a negative and significant ECM coefficient confirms cointegration; the speed of adjustment toward long-run equilibrium is about 55% per year.


Replication code

* --- ARDL Model ---
tsset year

* Estimate ARDL with automatic lag selection
* lags(. 2 2 2): auto for Y, max 2 for each X
ardl growth fdi open inf, lags(. 2 2 2)

* Bounds test for cointegration (Pesaran et al. 2001)
estat btest

* Error Correction Model (ECM) form
ardl growth fdi open inf, lags(. 2 2 2) ec

* Post-estimation diagnostics
estat bgodfrey // autocorrelation
estat hettest // heteroskedasticity

Limitations and notes

  • ARDL cannot handle I(2) variables; check beforehand with unit-root tests.
  • Sensitive to lag selection; compare AIC and BIC.
  • Too small a sample makes the bounds test unreliable.
  • ARDL is a single-equation model; if you suspect multi-directional simultaneity, consider VAR/VECM.

Video tutorial

Video Tutorial: Running ARDL in EcoLab

See also