Skip to main content

Probit — Normal binary regression

Probit models the probability of a binary outcome through the standard normal cumulative distribution function Φ()\Phi(\cdot). Empirically, Probit and Logit usually lead to similar conclusions; they differ in the assumed error distribution (normal vs logistic).

Logit or Probit?

Results are usually very close. Logit is convenient for its odds ratios; Probit is preferred when a normal error assumption is more reasonable or within extended models (Heckman, biprobit). Always compare through marginal effects.


Model specification

P(Yi=1Xi)=Φ(β0+β1X1i++βkXki)P(Y_i = 1 \mid X_i) = \Phi(\beta_0 + \beta_1 X_{1i} + \dots + \beta_k X_{ki})

where Φ\Phi is the standard normal CDF. Estimated by MLE.


Interpretation

  • Coefficients βj\beta_j are not read directly; use marginal effects (AME/MEM).
  • Model fit: Pseudo-R2R^2, classification, AUC.

Running in EcoLab

  1. Modeling module → Limited dependent variable family → Probit.
  2. Select the binary YY and the XX variables.
  3. Run; read marginal effects; compare with Logit; export the replication code.

Replication code

* ===== Probit — Normal binary regression =====
* Estimate the probit model
probit y x1 x2 x3

* Average marginal effects (AME)
margins, dydx(*)

* Predicted probabilities
predict phat, pr

* Classification table
estat classification

* Pseudo-R² is shown in the estimation output

Limitations

  • No odds interpretation like Logit.
  • Same underlying assumptions (exogeneity, correct specification) as other binary-choice models.

Video tutorial

Video Tutorial: Guide to running Probit in EcoLab

See also