The specification of the propensity score in multilevel observational studies

Number: 6
Year: 2008
Author(s): Bruno Arpino, Fabrizia Mealli
Propensity Score Matching (PSM) has become a popular approach to estimation of causal effects. It relies on the assumption that selection into a treatment can be explained purely in terms of observable characteristics (the unconfoundedness assumption) and on the property that balancing on the propensity score is equivalent to balancing on the observed covariates. Several applications in social sciences are characterized by a hierarchical structure of data: units at the first level (e.g., individuals) clustered into groups (e.g., provinces). In this paper we explore the use of multilevel models for the estimation of the propensity score for such hierarchical data when one or more relevant cluster-level variables is unobserved. We compare this approach with alternative ones, like a single level model with cluster dummies. By using Monte Carlo evidence we show that multilevel specifications usually achieve reasonably good balancing in cluster level unobserved covariates and consequently reduce the omitted variable bias. This is also the case for the dummy model.

Bruno Arpino

Universita Bocconi, Dondena Centre for Research on Social Dynamics

 

Fabrizia Mealli

University of Florence, Department of Statistics

 

Keywords: propensity score, multilevel studies, unconfoundedness, causal inferences

 

Download: The paper may be downloaded here.

 

A published version of this paper appears on Computational Statistics and Data Analysis

Arpino B. and Mealli F. (2011). The specification of the propensity score in multilevel studies, Computational Statistics and Data Analysis, 55, pp. 1770-1780 (doi:10.1016/j.csda.2010.11.008).

Keywords: propensity score,multilevel studies,unconfoundedness,causal inference