AutoCAD: Automatically Generating Counterfactuals for Mitigating
Shortcut Learning
- URL: http://arxiv.org/abs/2211.16202v1
- Date: Tue, 29 Nov 2022 13:39:53 GMT
- Title: AutoCAD: Automatically Generating Counterfactuals for Mitigating
Shortcut Learning
- Authors: Jiaxin Wen, Yeshuang Zhu, Jinchao Zhang, Jie Zhou and Minlie Huang
- Abstract summary: We present AutoCAD, a fully automatic and task-agnostic CAD generation framework.
In this paper, we present AutoCAD, a fully automatic and task-agnostic CAD generation framework.
- Score: 70.70393006697383
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Recent studies have shown the impressive efficacy of counterfactually
augmented data (CAD) for reducing NLU models' reliance on spurious features and
improving their generalizability. However, current methods still heavily rely
on human efforts or task-specific designs to generate counterfactuals, thereby
impeding CAD's applicability to a broad range of NLU tasks. In this paper, we
present AutoCAD, a fully automatic and task-agnostic CAD generation framework.
AutoCAD first leverages a classifier to unsupervisedly identify rationales as
spans to be intervened, which disentangles spurious and causal features. Then,
AutoCAD performs controllable generation enhanced by unlikelihood training to
produce diverse counterfactuals. Extensive evaluations on multiple
out-of-domain and challenge benchmarks demonstrate that AutoCAD consistently
and significantly boosts the out-of-distribution performance of powerful
pre-trained models across different NLU tasks, which is comparable or even
better than previous state-of-the-art human-in-the-loop or task-specific CAD
methods. The code is publicly available at https://github.com/thu-coai/AutoCAD.
Related papers
- CAD-MLLM: Unifying Multimodality-Conditioned CAD Generation With MLLM [39.113795259823476]
We introduce the CAD-MLLM, the first system capable of generating parametric CAD models conditioned on the multimodal input.
We use advanced large language models (LLMs) to align the feature space across diverse multi-modalities data and CAD models' vectorized representations.
Our resulting dataset, named Omni-CAD, is the first multimodal CAD dataset that contains textual description, multi-view images, points, and command sequence for each CAD model.
arXiv Detail & Related papers (2024-11-07T18:31:08Z) - Self-supervised Graph Neural Network for Mechanical CAD Retrieval [29.321027284348272]
GC-CAD is a self-supervised contrastive graph neural network-based method for mechanical CAD retrieval.
The proposed method achieves significant accuracy improvements and up to 100 times efficiency improvement over the baseline methods.
arXiv Detail & Related papers (2024-06-13T06:56:49Z) - PairCFR: Enhancing Model Training on Paired Counterfactually Augmented Data through Contrastive Learning [49.60634126342945]
Counterfactually Augmented Data (CAD) involves creating new data samples by applying minimal yet sufficient modifications to flip the label of existing data samples to other classes.
Recent research reveals that training with CAD may lead models to overly focus on modified features while ignoring other important contextual information.
We employ contrastive learning to promote global feature alignment in addition to learning counterfactual clues.
arXiv Detail & Related papers (2024-06-09T07:29:55Z) - PS-CAD: Local Geometry Guidance via Prompting and Selection for CAD Reconstruction [86.726941702182]
We introduce geometric guidance into the reconstruction network PS-CAD.
We provide the geometry of surfaces where the current reconstruction differs from the complete model as a point cloud.
Second, we use geometric analysis to extract a set of planar prompts, that correspond to candidate surfaces.
arXiv Detail & Related papers (2024-05-24T03:43:55Z) - ContrastCAD: Contrastive Learning-based Representation Learning for Computer-Aided Design Models [0.7373617024876725]
We propose a contrastive learning-based approach to learning CAD models, named ContrastCAD.
ContrastCAD effectively captures semantic information within the construction sequences of the CAD model.
We also propose a new CAD data augmentation method, called a Random Replace and Extrude (RRE) method, to enhance the learning performance of the model.
arXiv Detail & Related papers (2024-04-02T05:30:39Z) - Geometric Deep Learning for Computer-Aided Design: A Survey [85.79012726689511]
This survey offers a comprehensive overview of learning-based methods in computer-aided design.
It includes similarity analysis and retrieval, 2D and 3D CAD model synthesis, and CAD generation from point clouds.
It provides a complete list of benchmark datasets and their characteristics, along with open-source codes that have propelled research in this domain.
arXiv Detail & Related papers (2024-02-27T17:11:35Z) - People Make Better Edits: Measuring the Efficacy of LLM-Generated
Counterfactually Augmented Data for Harmful Language Detection [35.89913036572029]
It is imperative that NLP models are robust to spurious features.
Past work has attempted to tackle such spurious features using training data augmentation.
We assess if this task can be automated using generative NLP models.
arXiv Detail & Related papers (2023-11-02T14:31:25Z) - Design Automation for Fast, Lightweight, and Effective Deep Learning
Models: A Survey [53.258091735278875]
This survey covers studies of design automation techniques for deep learning models targeting edge computing.
It offers an overview and comparison of key metrics that are used commonly to quantify the proficiency of models in terms of effectiveness, lightness, and computational costs.
The survey proceeds to cover three categories of the state-of-the-art of deep model design automation techniques.
arXiv Detail & Related papers (2022-08-22T12:12:43Z) - Continual Object Detection via Prototypical Task Correlation Guided
Gating Mechanism [120.1998866178014]
We present a flexible framework for continual object detection via pRotOtypical taSk corrElaTion guided gaTingAnism (ROSETTA)
Concretely, a unified framework is shared by all tasks while task-aware gates are introduced to automatically select sub-models for specific tasks.
Experiments on COCO-VOC, KITTI-Kitchen, class-incremental detection on VOC and sequential learning of four tasks show that ROSETTA yields state-of-the-art performance.
arXiv Detail & Related papers (2022-05-06T07:31:28Z) - How Does Counterfactually Augmented Data Impact Models for Social
Computing Constructs? [35.29235215101502]
We investigate the benefits of counterfactually augmented data (CAD) for social NLP models by focusing on three social computing constructs -- sentiment, sexism, and hate speech.
We find that while models trained on CAD show lower in-domain performance, they generalize better out-of-domain.
arXiv Detail & Related papers (2021-09-14T23:46:39Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.