Discovering causal relations is fundamental to reasoning and intelligence. In particular, observational causal discovery algorithms estimate the cause-effect relation between two random entities and , given samples from . In this paper, we develop a framework to estimate the cause-effect relation between two static entities and : for instance, an art masterpiece and its fraudulent copy . To this end, we introduce the notion of proxy variables, which allow the construction of a pair of random entities from the pair of static entities . Then, estimating the cause-effect relation between and using an observational causal discovery algorithm leads to an estimation of the cause-effect relation between and . For example, our framework detects the causal relation between unprocessed photographs and their modifications, and orders in time a set of shuffled frames from a video. As our main case study, we introduce a human-elicited dataset of 10,000 pairs of casually-linked pairs of words from natural language. Our methods discover 75% of these causal relations. Finally, we discuss the role of proxy variables in machine learning, as a general tool to incorporate static knowledge into prediction tasks.
View on arXiv