128
79

Causal Inference by Identification of Vector Autoregressive Processes with Hidden Components

Abstract

A widely applied approach to causal inference from a time series XX, often referred to as "(linear) Granger causal analysis", is to simply regress present on past and interpret the regression matrix B^\hat{B} causally. However, if there is an unmeasured time series ZZ that influences XX, then this approach can lead to wrong causal conclusions, i.e., distinct from those one would draw if one had additional information such as ZZ. In this paper we take a different approach: We assume that XX together with some unmeasured ZZ forms a vector autoregressive (VAR) process with transition matrix AA, and argue why it is more valid to interpret AA causally instead of B^\hat{B}. Then we examine under which conditions the most important parts of AA are identifiable or almost identifiable from only XX. Essentially, sufficient conditions are (1) non-Gaussian, independent noise or (2) no influence from XX to ZZ. We present two estimation algorithms that are tailored towards conditions (1) and (2), respectively, and evaluate them on synthetic and real-world data. We discuss how to check certain model assumptions using XX.

View on arXiv
Comments on this paper