On statistical and causal models associated with acyclic directed mixed graphs

Abstract

Causal models in statistics are often described by acyclic directed mixed graphs (ADMGs), which contain directed and bidirected edges and no directed cycles. This article surveys various interpretations of ADMGs, discusses their relations in different sub-classes of ADMGs, and argues that one of them—nonparametric equation system (the E model below)—should be used as the default interpretation. The E model is closely related to but different from the interpretation of ADMGs as directed acyclic graphs (DAGs) with latent vairables that is commonly found in the literature. Our endorsement of the E model is based on two observations. First, in a subclass of ADMGs called unconfounded graphs (which retain most of the good properties of directed acyclic graphs and bidirected graphs), the E model is equivalent to many other interpretations including the global Markov and nested Markov models. Second, the E model for an arbitrary ADMG is exactly the union of that for all unconfounded expansions of that graph. This property is referred to as completeness, as it shows that the model does not commit to any specific latent variable explanation. In proving that the E model is nested Markov, we also develop an ADMG-based theory for causality that may be of independent interest.