Causal Effects of Linguistic Properties
- URL: http://arxiv.org/abs/2010.12919v5
- Date: Mon, 14 Jun 2021 14:10:05 GMT
- Title: Causal Effects of Linguistic Properties
- Authors: Reid Pryzant, Dallas Card, Dan Jurafsky, Victor Veitch, Dhanya Sridhar
- Abstract summary: We consider the problem of using observational data to estimate the causal effects of linguistic properties.
We introduce TextCause, an algorithm for estimating causal effects of linguistic properties.
We show that the proposed method outperforms related approaches when estimating the effect of Amazon review sentiment on semi-simulated sales figures.
- Score: 41.65859219291606
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We consider the problem of using observational data to estimate the causal
effects of linguistic properties. For example, does writing a complaint
politely lead to a faster response time? How much will a positive product
review increase sales? This paper addresses two technical challenges related to
the problem before developing a practical method. First, we formalize the
causal quantity of interest as the effect of a writer's intent, and establish
the assumptions necessary to identify this from observational data. Second, in
practice, we only have access to noisy proxies for the linguistic properties of
interest -- e.g., predictions from classifiers and lexicons. We propose an
estimator for this setting and prove that its bias is bounded when we perform
an adjustment for the text. Based on these results, we introduce TextCause, an
algorithm for estimating causal effects of linguistic properties. The method
leverages (1) distant supervision to improve the quality of noisy proxies, and
(2) a pre-trained language model (BERT) to adjust for the text. We show that
the proposed method outperforms related approaches when estimating the effect
of Amazon review sentiment on semi-simulated sales figures. Finally, we present
an applied case study investigating the effects of complaint politeness on
bureaucratic response times.
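The two-stage structure described in the abstract (improve the noisy proxy, then adjust for the text) can be illustrated with a minimal sketch. The following assumes a Python/scikit-learn setting; the function names (improve_proxy, adjusted_effect), the confidence threshold, and the use of TF-IDF features in place of a pre-trained BERT encoder are illustrative assumptions, not the paper's actual TextCause implementation.

```python
# Hypothetical sketch (not the authors' reference implementation) of a
# two-stage TextCause-style estimator. TF-IDF + logistic regression stand in
# for the paper's distant supervision and BERT-based text adjustment.
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression, LinearRegression


def improve_proxy(texts, noisy_proxy, threshold=0.9):
    """Stage 1: relabel proxy-negative documents that a classifier trained on
    the noisy proxy scores as confidently positive (a crude stand-in for the
    paper's distant supervision step). `noisy_proxy` is a 0/1 numpy array."""
    features = TfidfVectorizer().fit_transform(texts)
    clf = LogisticRegression(max_iter=1000).fit(features, noisy_proxy)
    scores = clf.predict_proba(features)[:, 1]
    improved = noisy_proxy.copy()
    improved[(noisy_proxy == 0) & (scores > threshold)] = 1  # threshold is an assumption
    return improved


def adjusted_effect(texts, proxy_treatment, outcome):
    """Stage 2: regress the outcome on text features plus the (improved) proxy
    treatment, then average the difference between the two counterfactual
    predictions. The paper adjusts for the text with a BERT representation;
    TF-IDF features are used here only to keep the sketch self-contained."""
    text_features = TfidfVectorizer().fit_transform(texts).toarray()
    design = np.hstack([text_features, proxy_treatment.reshape(-1, 1)])
    model = LinearRegression().fit(design, outcome)
    treated = np.hstack([text_features, np.ones((len(texts), 1))])
    control = np.hstack([text_features, np.zeros((len(texts), 1))])
    return float(np.mean(model.predict(treated) - model.predict(control)))


# Illustrative usage on toy data (all names synthetic):
# proxy = improve_proxy(texts, lexicon_labels)           # lexicon_labels: noisy 0/1 proxy
# effect = adjusted_effect(texts, proxy, sales_figures)  # estimated effect on the outcome
```

In the paper's method, the proxy labels are improved with distant supervision and the text adjustment uses a pre-trained BERT model; the bounded-bias result concerns the estimator that adjusts for the text when only a noisy proxy of the linguistic property is observed. This sketch only mirrors that overall structure.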
Related papers
- Likelihood as a Performance Gauge for Retrieval-Augmented Generation [78.28197013467157]
We show that likelihoods serve as an effective gauge for language model performance.
We propose two methods that use question likelihood as a gauge for selecting and constructing prompts that lead to better performance.
arXiv Detail & Related papers (2024-11-12T13:14:09Z)
- Improving Sampling Methods for Fine-tuning SentenceBERT in Text Streams [49.3179290313959]
This study explores the efficacy of seven text sampling methods designed to selectively fine-tune language models.
We precisely assess the impact of these methods on fine-tuning the SBERT model using four different loss functions.
Our findings indicate that Softmax loss and Batch All Triplets loss are particularly effective for text stream classification.
arXiv Detail & Related papers (2024-03-18T23:41:52Z)
- Text-Transport: Toward Learning Causal Effects of Natural Language [46.75318356800048]
We introduce Text-Transport, a method for estimating causal effects from natural language under any text distribution.
We use Text-Transport to study a realistic setting--hate speech on social media--in which causal effects do shift significantly between text domains.
arXiv Detail & Related papers (2023-10-31T17:56:51Z)
- Explaining Hate Speech Classification with Model Agnostic Methods [0.9990687944474738]
The research goal of this paper is to bridge the gap between hate speech prediction and the explanations generated by the system to support its decision.
This is achieved by first predicting the classification of a text and then applying a post-hoc, model-agnostic, surrogate interpretability approach to explain that decision.
arXiv Detail & Related papers (2023-05-30T19:52:56Z)
- Fairness-guided Few-shot Prompting for Large Language Models [93.05624064699965]
In-context learning can suffer from high instability due to variations in training examples, example order, and prompt formats.
We introduce a metric to evaluate the predictive bias of a fixed prompt with respect to labels or given attributes.
We propose a novel greedy search strategy to identify a near-optimal prompt that improves the performance of in-context learning.
arXiv Detail & Related papers (2023-03-23T12:28:25Z)
- Causal Estimation for Text Data with (Apparent) Overlap Violations [16.94058221134916]
We show how to handle causal identification and obtain robust causal estimation in the presence of apparent overlap violations.
The idea is to use supervised representation learning to produce a data representation that preserves confounding information.
arXiv Detail & Related papers (2022-09-30T20:33:17Z)
- Naturalistic Causal Probing for Morpho-Syntax [76.83735391276547]
We suggest a naturalistic strategy for input-level intervention on real-world data in Spanish.
Using our approach, we isolate morpho-syntactic features from confounders in sentences.
We apply this methodology to analyze causal effects of gender and number on contextualized representations extracted from pre-trained models.
arXiv Detail & Related papers (2022-05-14T11:47:58Z)
- Probing as Quantifying the Inductive Bias of Pre-trained Representations [99.93552997506438]
We present a novel framework for probing where the goal is to evaluate the inductive bias of representations for a particular task.
We apply our framework to a series of token-, arc-, and sentence-level tasks.
arXiv Detail & Related papers (2021-10-15T22:01:16Z)