A simple algorithm for efficiently mining observational databases for causal relationships

Cooper GF. A simple algorithm for efficiently mining observational databases for causal relationships. Journal of Data Mining and Knowledge Discovery 1 (1997) 203-224.

This paper presents a simple, efficient computer-based method for discovering causal relationships from databases that contain observational data. Observational data is passively observed, as contrasted with experimental data. Most of the databases available for data mining are observational. There is great potential for mining such databases to discover causal relationships. We illustrate how observational data can constrain the causal relationships among measured variables, sometimes to the point that we can conclude that one variable is causing another variable. The presentation here is based on a constraint-based approach to causal discovery. A primary purpose of this paper is to present the constraint-based causal discovery method in the simplest possible fashion in order to (1) readily convey the basic ideas that underlie more complex constraint-based causal discovery techniques, and (2) permit interested readers to rapidly program and apply the method to their own databases, as a start toward using more elaborate causal discovery algorithms.
Keywords: causal discovery, data mining, observational data.

Publication Year: 
1997
Faculty Author: 
Publication Credits: 
Cooper GF.
AttachmentSize
PDF icon Cooper.pdf119.37 KB
^