A Review and Roadmap of Deep Causal Model from Different Causal
Structures and Representations
- URL: http://arxiv.org/abs/2311.00923v1
- Date: Thu, 2 Nov 2023 01:31:42 GMT
- Title: A Review and Roadmap of Deep Causal Model from Different Causal
Structures and Representations
- Authors: Hang Chen and Keqing Du and Chenguang Li and Xinyu Yang
- Abstract summary: We redefinition causal data into three categories: definite data, semi-definite data, and indefinite data.
Definite data pertains to statistical data used in conventional causal scenarios, while semi-definite data refers to a spectrum of data formats germane to deep learning.
Indefinite data is an emergent research sphere inferred from the progression of data forms by us.
- Score: 23.87336875544181
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The fusion of causal models with deep learning introducing increasingly
intricate data sets, such as the causal associations within images or between
textual components, has surfaced as a focal research area. Nonetheless, the
broadening of original causal concepts and theories to such complex,
non-statistical data has been met with serious challenges. In response, our
study proposes redefinitions of causal data into three distinct categories from
the standpoint of causal structure and representation: definite data,
semi-definite data, and indefinite data. Definite data chiefly pertains to
statistical data used in conventional causal scenarios, while semi-definite
data refers to a spectrum of data formats germane to deep learning, including
time-series, images, text, and others. Indefinite data is an emergent research
sphere inferred from the progression of data forms by us. To comprehensively
present these three data paradigms, we elaborate on their formal definitions,
differences manifested in datasets, resolution pathways, and development of
research. We summarize key tasks and achievements pertaining to definite and
semi-definite data from myriad research undertakings, present a roadmap for
indefinite data, beginning with its current research conundrums. Lastly, we
classify and scrutinize the key datasets presently utilized within these three
paradigms.
Related papers
- A Survey on Diffusion Models for Time Series and Spatio-Temporal Data [92.1255811066468]
We review the use of diffusion models in time series and S-temporal data, categorizing them by model, task type, data modality, and practical application domain.
We categorize diffusion models into unconditioned and conditioned types discuss time series and S-temporal data separately.
Our survey covers their application extensively in various fields including healthcare, recommendation, climate, energy, audio, and transportation.
arXiv Detail & Related papers (2024-04-29T17:19:40Z) - Towards Causal Relationship in Indefinite Data: Baseline Model and New
Datasets [23.035761299444953]
"Indefinite Data" is characterized by multi-structure data and multi-value representations.
We release two high-quality datasets - Causalogue and Causaction.
We propose a probabilistic framework as a baseline, incorporating three designed highlights for this gap.
arXiv Detail & Related papers (2024-01-16T09:15:43Z) - Large Models for Time Series and Spatio-Temporal Data: A Survey and
Outlook [95.32949323258251]
Temporal data, notably time series andtemporal-temporal data, are prevalent in real-world applications.
Recent advances in large language and other foundational models have spurred increased use in time series andtemporal data mining.
arXiv Detail & Related papers (2023-10-16T09:06:00Z) - A Comprehensive Survey on Generative Diffusion Models for Structured
Data [0.0]
generative diffusion models have achieved a rapid paradigm shift in deep generative models.
Structured data has been received comparatively limited attention from the deep learning research community.
This review serves as a catalyst for the research community, promoting developments in generative diffusion models for structured data.
arXiv Detail & Related papers (2023-06-07T04:26:41Z) - Towards Causal Representation Learning and Deconfounding from Indefinite
Data [17.793702165499298]
Non-statistical data (e.g., images, text, etc.) encounters significant conflicts in terms of properties and methods with traditional causal data.
We redefine causal data from two novel perspectives and then propose three data paradigms.
We implement the above designs as a dynamic variational inference model, tailored to learn causal representation from indefinite data.
arXiv Detail & Related papers (2023-05-04T08:20:37Z) - Causal Discovery from Temporal Data: An Overview and New Perspectives [6.251443497694126]
Analyzing temporal data is extremely valuable for various applications.
causal discovery, learning the causal relations from temporal data, is considered an interesting yet critical task.
In this paper, we specify the correlation between the two categories and provide a systematical overview of existing solutions.
arXiv Detail & Related papers (2023-03-17T16:45:01Z) - Data-SUITE: Data-centric identification of in-distribution incongruous
examples [81.21462458089142]
Data-SUITE is a data-centric framework to identify incongruous regions of in-distribution (ID) data.
We empirically validate Data-SUITE's performance and coverage guarantees.
arXiv Detail & Related papers (2022-02-17T18:58:31Z) - Learning Neural Causal Models with Active Interventions [83.44636110899742]
We introduce an active intervention-targeting mechanism which enables a quick identification of the underlying causal structure of the data-generating process.
Our method significantly reduces the required number of interactions compared with random intervention targeting.
We demonstrate superior performance on multiple benchmarks from simulated to real-world data.
arXiv Detail & Related papers (2021-09-06T13:10:37Z) - On the Efficacy of Adversarial Data Collection for Question Answering:
Results from a Large-Scale Randomized Study [65.17429512679695]
In adversarial data collection (ADC), a human workforce interacts with a model in real time, attempting to produce examples that elicit incorrect predictions.
Despite ADC's intuitive appeal, it remains unclear when training on adversarial datasets produces more robust models.
arXiv Detail & Related papers (2021-06-02T00:48:33Z) - OR-Net: Pointwise Relational Inference for Data Completion under Partial
Observation [51.083573770706636]
This work uses relational inference to fill in the incomplete data.
We propose Omni-Relational Network (OR-Net) to model the pointwise relativity in two aspects.
arXiv Detail & Related papers (2021-05-02T06:05:54Z) - Are We Hungry for 3D LiDAR Data for Semantic Segmentation? A Survey and
Experimental Study [5.6780397318769245]
3D semantic segmentation is a fundamental task for robotic and autonomous driving applications.
Recent works have been focused on using deep learning techniques, whereas developing fine-annotated 3D LiDAR datasets is extremely labor intensive.
The performance limitation caused by insufficient datasets is called data hunger problem.
arXiv Detail & Related papers (2020-06-08T01:20:59Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.