Causal Effect Identification from Multiple Incomplete Data Sources : A General Search-Based Approach
dc.contributor.author | Tikka, Santtu | |
dc.contributor.author | Hyttinen, Antti | |
dc.contributor.author | Karvanen, Juha | |
dc.date.accessioned | 2021-10-05T09:18:45Z | |
dc.date.available | 2021-10-05T09:18:45Z | |
dc.date.issued | 2021 | |
dc.identifier.citation | Tikka, S., Hyttinen, A., & Karvanen, J. (2021). Causal Effect Identification from Multiple Incomplete Data Sources : A General Search-Based Approach. <i>Journal of Statistical Software</i>, <i>99</i>, Article 5. <a href="https://doi.org/10.18637/jss.v099.i05" target="_blank">https://doi.org/10.18637/jss.v099.i05</a> | |
dc.identifier.other | CONVID_101358555 | |
dc.identifier.uri | https://jyx.jyu.fi/handle/123456789/78024 | |
dc.description.abstract | Causal effect identification considers whether an interventional probability distribution can be uniquely determined without parametric assumptions from measured source distributions and structural knowledge on the generating system. While complete graphical criteria and procedures exist for many identification problems, there are still challenging but important extensions that have not been considered in the literature such as combined transportability and selection bias, or multiple sources of selection bias. To tackle these new settings, we present a search algorithm directly over the rules of do-calculus. Due to the generality of do-calculus, the search is capable of taking more advanced datagenerating mechanisms into account along with an arbitrary type of both observational and experimental source distributions. The search is enhanced via a heuristic and search space reduction techniques. The approach, called do-search, is provably sound, and it is complete with respect to identifiability problems that have been shown to be completely characterized by do-calculus. When extended with additional rules, the search is capable of handling missing data problems as well. With the versatile search, we are able to approach new problems for which no other algorithmic solutions exist. We perform a systematic analysis of bivariate missing data problems and study causal inference under case-control design. We also present the R package dosearch that provides an interface for a C++ implementation of the search. | en |
dc.format.mimetype | application/pdf | |
dc.language.iso | eng | |
dc.publisher | Foundation for Open Access Statistic | |
dc.relation.ispartofseries | Journal of Statistical Software | |
dc.rights | CC BY 4.0 | |
dc.subject.other | causality | |
dc.subject.other | do-calculus | |
dc.subject.other | selection bias | |
dc.subject.other | transportability | |
dc.subject.other | missing data | |
dc.subject.other | case-control design | |
dc.subject.other | meta-analysis | |
dc.title | Causal Effect Identification from Multiple Incomplete Data Sources : A General Search-Based Approach | |
dc.type | research article | |
dc.identifier.urn | URN:NBN:fi:jyu-202110055076 | |
dc.contributor.laitos | Matematiikan ja tilastotieteen laitos | fi |
dc.contributor.laitos | Department of Mathematics and Statistics | en |
dc.contributor.oppiaine | Tilastotiede | fi |
dc.contributor.oppiaine | Statistics | en |
dc.type.uri | http://purl.org/eprint/type/JournalArticle | |
dc.type.coar | http://purl.org/coar/resource_type/c_2df8fbb1 | |
dc.description.reviewstatus | peerReviewed | |
dc.relation.issn | 1548-7660 | |
dc.relation.volume | 99 | |
dc.type.version | publishedVersion | |
dc.rights.copyright | © Authors, 2021 | |
dc.rights.accesslevel | openAccess | fi |
dc.type.publication | article | |
dc.relation.grantnumber | 311877 | |
dc.subject.yso | kausaliteetti | |
dc.subject.yso | R-kieli | |
dc.subject.yso | meta-analyysi | |
dc.subject.yso | hakualgoritmit | |
dc.subject.yso | päättely | |
dc.format.content | fulltext | |
jyx.subject.uri | http://www.yso.fi/onto/yso/p333 | |
jyx.subject.uri | http://www.yso.fi/onto/yso/p24355 | |
jyx.subject.uri | http://www.yso.fi/onto/yso/p27697 | |
jyx.subject.uri | http://www.yso.fi/onto/yso/p37865 | |
jyx.subject.uri | http://www.yso.fi/onto/yso/p5902 | |
dc.rights.url | https://creativecommons.org/licenses/by/4.0/ | |
dc.relation.doi | 10.18637/jss.v099.i05 | |
dc.relation.funder | Research Council of Finland | en |
dc.relation.funder | Suomen Akatemia | fi |
jyx.fundingprogram | Research profiles, AoF | en |
jyx.fundingprogram | Profilointi, SA | fi |
jyx.fundinginformation | This work belongs to the thematic research area “Decision analytics utilizing causal models and multiobjective optimization” (DEMO) supported by Academy of Finland (grant number 311877). AH was supported by Academy of Finland through grant 295673. | |
dc.type.okm | A1 |