Evaluation Methods and Measures for Causal Learning Algorithms
- URL: http://arxiv.org/abs/2202.02896v1
- Date: Mon, 7 Feb 2022 00:24:34 GMT
- Title: Evaluation Methods and Measures for Causal Learning Algorithms
- Authors: Lu Cheng, Ruocheng Guo, Raha Moraffah, Paras Sheth, K. Selcuk Candan,
Huan Liu
- Abstract summary: We focus on the two fundamental causal-inference tasks and causality-aware machine learning tasks.
The survey seeks to bring to the forefront the urgency of developing publicly available benchmarks and consensus-building standards for causal learning evaluation with observational data.
- Score: 33.07234268724662
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The convenient access to copious multi-faceted data has encouraged machine
learning researchers to reconsider correlation-based learning and embrace the
opportunity of causality-based learning, i.e., causal machine learning (causal
learning). Recent years have therefore witnessed great effort in developing
causal learning algorithms aiming to help AI achieve human-level intelligence.
Due to the lack-of ground-truth data, one of the biggest challenges in current
causal learning research is algorithm evaluations. This largely impedes the
cross-pollination of AI and causal inference, and hinders the two fields to
benefit from the advances of the other. To bridge from conventional causal
inference (i.e., based on statistical methods) to causal learning with big data
(i.e., the intersection of causal inference and machine learning), in this
survey, we review commonly-used datasets, evaluation methods, and measures for
causal learning using an evaluation pipeline similar to conventional machine
learning. We focus on the two fundamental causal-inference tasks and
causality-aware machine learning tasks. Limitations of current evaluation
procedures are also discussed. We then examine popular causal inference
tools/packages and conclude with primary challenges and opportunities for
benchmarking causal learning algorithms in the era of big data. The survey
seeks to bring to the forefront the urgency of developing publicly available
benchmarks and consensus-building standards for causal learning evaluation with
observational data. In doing so, we hope to broaden the discussions and
facilitate collaboration to advance the innovation and application of causal
learning.
Related papers
- Introducing CausalBench: A Flexible Benchmark Framework for Causal Analysis and Machine Learning [10.686245134005047]
Causal learning aims to go far beyond conventional machine learning, yet several major challenges remain.
We introduce em CausalBench, a transparent, fair, and easy-to-use evaluation platform.
arXiv Detail & Related papers (2024-09-12T22:45:10Z) - Multi-modal Causal Structure Learning and Root Cause Analysis [67.67578590390907]
We propose Mulan, a unified multi-modal causal structure learning method for root cause localization.
We leverage a log-tailored language model to facilitate log representation learning, converting log sequences into time-series data.
We also introduce a novel key performance indicator-aware attention mechanism for assessing modality reliability and co-learning a final causal graph.
arXiv Detail & Related papers (2024-02-04T05:50:38Z) - A Survey on Causal Discovery Methods for I.I.D. and Time Series Data [4.57769506869942]
Causal Discovery (CD) algorithms can identify the cause-effect relationships among the variables of a system from related observational data.
We present an extensive discussion on the methods designed to perform causal discovery from both independent and identically distributed (I.I.D.) data and time series data.
arXiv Detail & Related papers (2023-03-27T09:21:41Z) - A Human-Centered Review of Algorithms in Decision-Making in Higher
Education [16.578096382702597]
We reviewed an extensive corpus of papers proposing algorithms for decision-making in higher education.
We found that the models are trending towards deep learning, and increased use of student personal data and protected attributes.
Despite the associated decrease in interpretability and explainability, current development predominantly fails to incorporate human-centered lenses.
arXiv Detail & Related papers (2023-02-12T02:30:50Z) - Statistical Foundation Behind Machine Learning and Its Impact on
Computer Vision [8.974457198386414]
This paper revisits the principle of uniform convergence in statistical learning, discusses how it acts as the foundation behind machine learning, and attempts to gain a better understanding of the essential problem that current deep learning algorithms are solving.
Using computer vision as an example domain in machine learning, the discussion shows that recent research trends in leveraging increasingly large-scale data to perform pre-training for representation learning are largely to reduce the discrepancy between a practically tractable empirical loss and its ultimately desired but intractable expected loss.
arXiv Detail & Related papers (2022-09-06T17:59:04Z) - Research Trends and Applications of Data Augmentation Algorithms [77.34726150561087]
We identify the main areas of application of data augmentation algorithms, the types of algorithms used, significant research trends, their progression over time and research gaps in data augmentation literature.
We expect readers to understand the potential of data augmentation, as well as identify future research directions and open questions within data augmentation research.
arXiv Detail & Related papers (2022-07-18T11:38:32Z) - What Makes Good Contrastive Learning on Small-Scale Wearable-based
Tasks? [59.51457877578138]
We study contrastive learning on the wearable-based activity recognition task.
This paper presents an open-source PyTorch library textttCL-HAR, which can serve as a practical tool for researchers.
arXiv Detail & Related papers (2022-02-12T06:10:15Z) - Can Active Learning Preemptively Mitigate Fairness Issues? [66.84854430781097]
dataset bias is one of the prevailing causes of unfairness in machine learning.
We study whether models trained with uncertainty-based ALs are fairer in their decisions with respect to a protected class.
We also explore the interaction of algorithmic fairness methods such as gradient reversal (GRAD) and BALD.
arXiv Detail & Related papers (2021-04-14T14:20:22Z) - Nonparametric Estimation of Heterogeneous Treatment Effects: From Theory
to Learning Algorithms [91.3755431537592]
We analyze four broad meta-learning strategies which rely on plug-in estimation and pseudo-outcome regression.
We highlight how this theoretical reasoning can be used to guide principled algorithm design and translate our analyses into practice.
arXiv Detail & Related papers (2021-01-26T17:11:40Z) - A Survey on Extraction of Causal Relations from Natural Language Text [9.317718453037667]
Cause-effect relations appear frequently in text, and curating cause-effect relations from text helps in building causal networks for predictive tasks.
Existing causality extraction techniques include knowledge-based, statistical machine learning(ML)-based, and deep learning-based approaches.
arXiv Detail & Related papers (2021-01-16T10:49:39Z) - A Survey on Causal Inference [64.45536158710014]
Causal inference is a critical research topic across many domains, such as statistics, computer science, education, public policy and economics.
Various causal effect estimation methods for observational data have sprung up.
arXiv Detail & Related papers (2020-02-05T21:35:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.