Causal Inference by Identification of Vector Autoregressive Processes with Hidden Components
- CMLLLMSV

A widely applied approach to causal inference from a non-experimental time series , often referred to as "(linear) Granger causal analysis", is to regress present on past and interpret the regression matrix causally. However, if there is an unmeasured time series that influences , then this approach can lead to wrong causal conclusions, i.e., distinct from those one would draw if one had additional information such as . In this paper we take a different approach: We assume that together with some hidden forms a first order vector autoregressive (VAR) process with transition matrix , and argue why it is more valid to interpret causally instead of . Then we examine under which conditions the most important parts of are identifiable or almost identifiable from only . Essentially, sufficient conditions are (1) non-Gaussian, independent noise or (2) no influence from to . We present two estimation algorithms that are tailored towards conditions (1) and (2), respectively, and evaluate them on synthetic and real-world data. We discuss how to check the model using .
View on arXiv