Interpretation modeling: Social grounding of sentences by reasoning over
their implicit moral judgments
- URL: http://arxiv.org/abs/2312.03726v1
- Date: Mon, 27 Nov 2023 07:50:55 GMT
- Title: Interpretation modeling: Social grounding of sentences by reasoning over
their implicit moral judgments
- Authors: Liesbeth Allein, Maria Mihaela Tru\c{s}c\v{a}, Marie-Francine Moens
- Abstract summary: Single gold-standard interpretations rarely exist, challenging conventional assumptions in natural language processing.
This work introduces the interpretation modeling (IM) task which involves modeling several interpretations of a sentence's underlying semantics.
A first-of-its-kind IM dataset is curated to support experiments and analyses.
- Score: 24.133419857271505
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The social and implicit nature of human communication ramifies readers'
understandings of written sentences. Single gold-standard interpretations
rarely exist, challenging conventional assumptions in natural language
processing. This work introduces the interpretation modeling (IM) task which
involves modeling several interpretations of a sentence's underlying semantics
to unearth layers of implicit meaning. To obtain these, IM is guided by
multiple annotations of social relation and common ground - in this work
approximated by reader attitudes towards the author and their understanding of
moral judgments subtly embedded in the sentence. We propose a number of
modeling strategies that rely on one-to-one and one-to-many generation methods
that take inspiration from the philosophical study of interpretation. A
first-of-its-kind IM dataset is curated to support experiments and analyses.
The modeling results, coupled with scrutiny of the dataset, underline the
challenges of IM as conflicting and complex interpretations are socially
plausible. This interplay of diverse readings is affirmed by automated and
human evaluations on the generated interpretations. Finally, toxicity analyses
in the generated interpretations demonstrate the importance of IM for refining
filters of content and assisting content moderators in safeguarding the safety
in online discourse.
Related papers
- Interpreting Pretrained Language Models via Concept Bottlenecks [55.47515772358389]
Pretrained language models (PLMs) have made significant strides in various natural language processing tasks.
The lack of interpretability due to their black-box'' nature poses challenges for responsible implementation.
We propose a novel approach to interpreting PLMs by employing high-level, meaningful concepts that are easily understandable for humans.
arXiv Detail & Related papers (2023-11-08T20:41:18Z) - DiPlomat: A Dialogue Dataset for Situated Pragmatic Reasoning [89.92601337474954]
Pragmatic reasoning plays a pivotal role in deciphering implicit meanings that frequently arise in real-life conversations.
We introduce a novel challenge, DiPlomat, aiming at benchmarking machines' capabilities on pragmatic reasoning and situated conversational understanding.
arXiv Detail & Related papers (2023-06-15T10:41:23Z) - Natural Language Decompositions of Implicit Content Enable Better Text
Representations [56.85319224208865]
We introduce a method for the analysis of text that takes implicitly communicated content explicitly into account.
We use a large language model to produce sets of propositions that are inferentially related to the text that has been observed.
Our results suggest that modeling the meanings behind observed language, rather than the literal text alone, is a valuable direction for NLP.
arXiv Detail & Related papers (2023-05-23T23:45:20Z) - On the Faithfulness Measurements for Model Interpretations [100.2730234575114]
Post-hoc interpretations aim to uncover how natural language processing (NLP) models make predictions.
To tackle these issues, we start with three criteria: the removal-based criterion, the sensitivity of interpretations, and the stability of interpretations.
Motivated by the desideratum of these faithfulness notions, we introduce a new class of interpretation methods that adopt techniques from the adversarial domain.
arXiv Detail & Related papers (2021-04-18T09:19:44Z) - Did they answer? Subjective acts and intents in conversational discourse [48.63528550837949]
We present the first discourse dataset with multiple and subjective interpretations of English conversation.
We show disagreements are nuanced and require a deeper understanding of the different contextual factors.
arXiv Detail & Related papers (2021-04-09T16:34:19Z) - Interpretable Deep Learning: Interpretations, Interpretability,
Trustworthiness, and Beyond [49.93153180169685]
We introduce and clarify two basic concepts-interpretations and interpretability-that people usually get confused.
We elaborate the design of several recent interpretation algorithms, from different perspectives, through proposing a new taxonomy.
We summarize the existing work in evaluating models' interpretability using "trustworthy" interpretation algorithms.
arXiv Detail & Related papers (2021-03-19T08:40:30Z) - An Investigation of Language Model Interpretability via Sentence Editing [5.492504126672887]
We re-purpose a sentence editing dataset as a testbed for interpretability of pre-trained language models (PLMs)
This enables us to conduct a systematic investigation on an array of questions regarding PLMs' interpretability.
The investigation generates new insights, for example, contrary to the common understanding, we find that attention weights correlate well with human rationales.
arXiv Detail & Related papers (2020-11-28T00:46:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.