Salesforce CausalAI Library: A Fast and Scalable Framework for Causal
  Analysis of Time Series and Tabular Data
- URL: http://arxiv.org/abs/2301.10859v2
- Date: Sat, 23 Sep 2023 00:30:22 GMT
- Title: Salesforce CausalAI Library: A Fast and Scalable Framework for Causal
  Analysis of Time Series and Tabular Data
- Authors: Devansh Arpit, Matthew Fernandez, Itai Feigenbaum, Weiran Yao,
  Chenghao Liu, Wenzhuo Yang, Paul Josel, Shelby Heinecke, Eric Hu, Huan Wang,
  Stephen Hoi, Caiming Xiong, Kun Zhang, Juan Carlos Niebles
- Abstract summary: We introduce the Salesforce CausalAI Library, an open-source library for causal analysis using observational data.
The goal of this library is to provide a fast and flexible solution for a variety of problems in the domain of causality.
- Score: 76.85310770921876
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract:   We introduce the Salesforce CausalAI Library, an open-source library for
causal analysis using observational data. It supports causal discovery and
causal inference for tabular and time series data, of discrete, continuous and
heterogeneous types. This library includes algorithms that handle linear and
non-linear causal relationships between variables, and uses multi-processing
for speed-up. We also include a data generator capable of generating synthetic
data with a specified structural equation model for the aforementioned data
formats and types, which helps users control the ground-truth causal process
while investigating various algorithms. Finally, we provide a user interface
(UI) that allows users to perform causal analysis on data without coding. The
goal of this library is to provide a fast and flexible solution for a variety
of problems in the domain of causality. This technical report describes the
Salesforce CausalAI API along with its capabilities, the implementations of the
supported algorithms, and experiments demonstrating their performance and
speed. Our library is available at
https://github.com/salesforce/causalai.
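
To make the data-generator idea in the abstract concrete, here is a minimal sketch of sampling tabular data from a user-specified linear structural equation model and then checking that a known coefficient can be recovered. It does not use the CausalAI API: the function names `generate_linear_sem` and `ols_slope`, the dictionary format for the model, and the example graph are all assumptions made for illustration, and only the Python standard library is used.

```python
import random


def generate_linear_sem(sem, n_samples, noise_std=1.0, seed=0):
    """Sample tabular data from a linear SEM with Gaussian noise.

    `sem` maps each variable name to a list of (parent, coefficient) pairs.
    Variables must be listed in topological order (parents before children).
    """
    # Reject specifications where a parent is not defined before its child.
    seen = set()
    for var, parents in sem.items():
        for parent, _ in parents:
            if parent not in seen:
                raise ValueError(f"{parent!r} must be defined before {var!r}")
        seen.add(var)

    rng = random.Random(seed)
    data = {var: [] for var in sem}
    for _ in range(n_samples):
        for var, parents in sem.items():
            # Each value is independent noise plus a weighted sum of parents.
            value = rng.gauss(0.0, noise_std)
            for parent, coef in parents:
                value += coef * data[parent][-1]
            data[var].append(value)
    return data


def ols_slope(x, y):
    """Least-squares slope of y regressed on x (single regressor)."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var = sum((a - mean_x) ** 2 for a in x)
    return cov / var


# Hypothetical ground truth: A -> B with coefficient 2.0, and A, B -> C.
sem = {"A": [], "B": [("A", 2.0)], "C": [("A", -1.0), ("B", 0.5)]}
data = generate_linear_sem(sem, n_samples=2000)
print(round(ols_slope(data["A"], data["B"]), 2))
```

Because the ground-truth coefficients are chosen by the user, the printed slope can be compared directly against the specified value of 2.0 for the edge from A to B, which is the kind of controlled check a synthetic generator is meant to enable.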
 
      
Related papers
- Efficient Conformance Checking of Rich Data-Aware Declare Specifications (Extended) [49.46686813437884]
 We show that it is possible to compute data-aware optimal alignments in a rich setting with general data types and data conditions.
This is achieved by carefully combining the two best-known approaches to deal with control flow and data dependencies.
 arXiv  Detail & Related papers  (2025-06-30T10:16:21Z)
- RV-Syn: Rational and Verifiable Mathematical Reasoning Data Synthesis based on Structured Function Library [58.404895570822184]
 RV-Syn is a novel mathematical synthesis approach.
It generates graphs as solutions by combining Python-formatted functions from a structured function library.
Based on the constructed graph, we achieve solution-guided logic-aware problem generation.
 arXiv  Detail & Related papers  (2025-04-29T04:42:02Z)
- Counterfactual Causal Inference in Natural Language with Large Language Models [9.153187514369849]
 We propose an end-to-end causal structure discovery and causal inference method from natural language.
We first use an LLM to extract the instantiated causal variables from text data and build a causal graph.
We then conduct counterfactual inference on the estimated graph.
 arXiv  Detail & Related papers  (2024-10-08T21:53:07Z)
- Testing Causal Models with Hidden Variables in Polynomial Delay via Conditional Independencies [49.99600569996907]
 Testing a hypothesized causal model against observational data is a key prerequisite for many causal inference tasks.
While a model can assume exponentially many conditional independence relations (CIs), testing all of them is both impractical and unnecessary.
We introduce c-LMP for causal graphs with hidden variables and develop a polynomial-delay algorithm to list these CIs.
 arXiv  Detail & Related papers  (2024-09-22T21:05:56Z)
- Optimizing VarLiNGAM for Scalable and Efficient Time Series Causal Discovery [5.430532390358285]
 Causal discovery is designed to identify causal relationships in data.
Time series causal discovery is particularly challenging due to the need to account for temporal dependencies and potential time lag effects.
This study significantly improves the feasibility of processing large datasets.
 arXiv  Detail & Related papers  (2024-09-09T10:52:58Z)
- Large Language Models for Constrained-Based Causal Discovery [4.858756226945995]
 Causality is essential for understanding complex systems, such as the economy, the brain, and the climate.
This work explores the capabilities of Large Language Models (LLMs) as an alternative to domain experts for causal graph generation.
 arXiv  Detail & Related papers  (2024-06-11T15:45:24Z)
- AcceleratedLiNGAM: Learning Causal DAGs at the speed of GPUs [57.12929098407975]
 We show that by efficiently parallelizing existing causal discovery methods, we can scale them to thousands of dimensions.
Specifically, we focus on the causal ordering subprocedure in DirectLiNGAM and implement GPU kernels to accelerate it.
This allows us to apply DirectLiNGAM to causal inference on large-scale gene expression data with genetic interventions yielding competitive results.
 arXiv  Detail & Related papers  (2024-03-06T15:06:11Z)
- Federated Causal Discovery from Heterogeneous Data [70.31070224690399]
 We propose a novel FCD method attempting to accommodate arbitrary causal models and heterogeneous data.
These approaches involve constructing summary statistics as a proxy of the raw data to protect data privacy.
We conduct extensive experiments on synthetic and real datasets to show the efficacy of our method.
 arXiv  Detail & Related papers  (2024-02-20T18:53:53Z)
- PyRCA: A Library for Metric-based Root Cause Analysis [66.72542200701807]
 PyRCA is an open-source machine learning library of Root Cause Analysis (RCA) for Artificial Intelligence for IT Operations (AIOps).
It provides a holistic framework to uncover the complicated metric causal dependencies and automatically locate root causes of incidents.
 arXiv  Detail & Related papers  (2023-06-20T09:55:10Z)
- $\texttt{causalAssembly}$: Generating Realistic Production Data for
  Benchmarking Causal Discovery [1.3048920509133808]
 We build a system for generation of semisynthetic manufacturing data that supports benchmarking of causal discovery methods.
We employ distributional random forests to flexibly estimate and represent conditional distributions.
Using the library, we showcase how to benchmark several well-known causal discovery algorithms.
 arXiv  Detail & Related papers  (2023-06-19T10:05:54Z)
- Amortized Causal Discovery: Learning to Infer Causal Graphs from
  Time-Series Data [63.15776078733762]
 We propose Amortized Causal Discovery, a novel framework to learn to infer causal relations from time-series data.
We demonstrate experimentally that this approach, implemented as a variational model, leads to significant improvements in causal discovery performance.
 arXiv  Detail & Related papers  (2020-06-18T19:59:12Z)
This list is automatically generated from the titles and abstracts of the papers on this site.
       
     