Pranay, SY and Nagaraj, Nithin (2021) Causal discovery using compression-complexity measures. Journal of Biomedical Informatics, 117 (103724).
Text
2021-JMBI-NNagaraj.pdf - Published Version Restricted to Repository staff only Download (5MB) |
Contribution | Name |
---|
Abstract: | Causal inference is one of the most fundamental problems across all domains of science. We address the problem of inferring a causal direction from two observed discrete symbolic sequences X and Y. We present a framework which relies on lossless compressors for inferring context-free grammars (CFGs) from sequence pairs and quantifies the extent to which the grammar inferred from one sequence compresses the other sequence. We infer X causes Y if the grammar inferred from X better compresses Y than in the other direction. To put this notion to practice, we propose three models that use the Compression-Complexity Measures (CCMs) – Lempel–Ziv (LZ) complexity and Effort-To-Compress (ETC) to infer CFGs and discover causal directions without demanding temporal structures. We evaluate these models on synthetic and real-world benchmarks and empirically observe performances competitive with current state-of-the-art methods. Lastly, we present two unique applications of the proposed models for causal inference directly from pairs of genome sequences belonging to the SARS-CoV-2 virus. Using numerous sequences, we show that our models capture causal information exchanged between genome sequence pairs, presenting novel opportunities for addressing key issues in sequence analysis to investigate the evolution of virulence and pathogenicity in future applications. |
---|---|
Item Type: | Journal Paper |
Additional Information: | Copyright belongs to the Publisher |
Keywords: | Compression, Causality, SARS-CoV-2, Genome, Information, Effort- to-Compress MSC: 68P30, 68Q30, 62D20, 94A17 |
Subjects: | School of Humanities > Consciousness Studies Programmes > Consciousness Studies Programme > Measures of Causality |
Date Deposited: | 31 Mar 2021 09:48 |
Last Modified: | 31 Mar 2021 09:48 |
Official URL: | https://www.sciencedirect.com/science/article/abs/... |
Related URLs: | |
Funders: | CSRI-DST, SATYAM-DST, Tata Trusts |
Projects: | ‘Cognitive Science Research Initiative’ (CSRI-DST) Grant No. DST/CSRI/ 2017/54(G), ‘Science and Technology of Yoga and Meditation’ (SATYAM-DST) Grant No. DST/SATYAM/2017/45(G) and Tata Trusts |
DOI: | https://doi.org/10.1016/j.jbi.2021.103724 |
URI: | http://eprints.nias.res.in/id/eprint/2077 |
Actions (login required)
View Item |