Parole de présidents (1958-2022)
- URL: http://arxiv.org/abs/2411.18468v1
- Date: Wed, 27 Nov 2024 16:01:51 GMT
- Title: Parole de présidents (1958-2022)
- Authors: Dominique Labbé, Jacques Savoy
- Abstract summary: Over more than sixty years, eight presidents have succeeded one another at the head of the French Fifth Republic (de Gaulle, Pompidou, Giscard d'Estaing, Mitterrand, Chirac, Sarkozy, Hollande, Macron).
- Score: 1.519321208145928
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Over the past sixty-six years, eight presidents have successively headed the French Fifth Republic (de Gaulle, Pompidou, Giscard d'Estaing, Mitterrand, Chirac, Sarkozy, Hollande, Macron). After presenting the corpus of their speeches (9,202 texts and more than 20 million labelled words), the style of each president is characterized through his vocabulary (lemmas and parts of speech). A deeper analysis reveals the typical sequences of each tenant of the Élysée. Based on an intertextual distance between all presidential speeches, a figure illustrates the similarities and differences between the presidents.
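The abstract refers to an intertextual distance between speeches without defining it. As a rough illustration only, the sketch below computes a simplified distance between two tokenized texts as half the sum of absolute differences in relative word frequencies. This is an assumption made for illustration, not the authors' exact formula; the function name `intertextual_distance` and the token lists are hypothetical.

```python
from collections import Counter


def intertextual_distance(tokens_a, tokens_b):
    """Simplified distance in [0, 1] between two token lists.

    Illustrative stand-in, not the authors' measure:
    0 means identical relative frequencies, 1 means no shared vocabulary.
    """
    if not tokens_a or not tokens_b:
        raise ValueError("both texts must be non-empty")
    freq_a, freq_b = Counter(tokens_a), Counter(tokens_b)
    n_a, n_b = len(tokens_a), len(tokens_b)
    vocabulary = set(freq_a) | set(freq_b)
    # Counter returns 0 for missing words, so words absent from one text
    # contribute their full relative frequency in the other.
    total = sum(abs(freq_a[w] / n_a - freq_b[w] / n_b) for w in vocabulary)
    return 0.5 * total


if __name__ == "__main__":
    speech_1 = "la france et la république".split()
    speech_2 = "la france et les français".split()
    print(intertextual_distance(speech_1, speech_2))
```

Computing this value for every pair of speeches yields a distance matrix, which is the kind of input a clustering or tree-drawing method needs to produce a summary figure.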
Related papers
- Multilingual and Explainable Text Detoxification with Parallel Corpora [58.83211571400692]
We extend parallel text detoxification corpus to new languages.
We conduct a first-of-its-kind automated, explainable analysis of the descriptive features of both toxic and non-toxic sentences.
We then experiment with a novel text detoxification method inspired by the Chain-of-Thoughts reasoning approach.
arXiv Detail & Related papers (2024-12-16T12:08:59Z)
- Quantifying the Uniqueness of Donald Trump in Presidential Discourse [51.76056700705539]
This paper introduces a novel metric of uniqueness based on large language models.
We find considerable evidence that Trump's speech patterns diverge from those of all major party nominees for the presidency in recent history.
arXiv Detail & Related papers (2024-01-02T19:00:17Z)
- Revisiting Conversation Discourse for Dialogue Disentanglement [88.3386821205896]
We propose enhancing dialogue disentanglement by taking full advantage of the dialogue discourse characteristics.
We develop a structure-aware framework to integrate the rich structural features for better modeling the conversational semantic context.
Our work has great potential to facilitate broader multi-party multi-thread dialogue applications.
arXiv Detail & Related papers (2023-06-06T19:17:47Z)
- PropSegmEnt: A Large-Scale Corpus for Proposition-Level Segmentation and Entailment Recognition [63.51569687229681]
We argue for the need to recognize the textual entailment relation of each proposition in a sentence individually.
We propose PropSegmEnt, a corpus of over 45K propositions annotated by expert human raters.
Our dataset structure resembles the tasks of (1) segmenting sentences within a document to the set of propositions, and (2) classifying the entailment relation of each proposition with respect to a different yet topically-aligned document.
arXiv Detail & Related papers (2022-12-21T04:03:33Z)
- Cross-Lingual Speaker Identification Using Distant Supervision [84.51121411280134]
We propose a speaker identification framework that addresses issues such as lack of contextual reasoning and poor cross-lingual generalization.
We show that the resulting model outperforms previous state-of-the-art methods on two English speaker identification benchmarks by up to 9% in accuracy and 5% with only distant supervision.
arXiv Detail & Related papers (2022-10-11T20:49:44Z)
- Simple, Interpretable and Stable Method for Detecting Words with Usage Change across Corpora [54.757845511368814]
The problem of comparing two bodies of text and searching for words that differ in their usage arises often in digital humanities and computational social science.
This is commonly approached by training word embeddings on each corpus, aligning the vector spaces, and looking for words whose cosine distance in the aligned space is large.
We propose an alternative approach that does not use vector space alignment, and instead considers the neighbors of each word.
arXiv Detail & Related papers (2021-12-28T23:46:00Z)
- Persian Rhetorical Structure Theory [2.610470075814367]
We present a discourse-annotated corpus for the Persian language built in the framework of Rhetorical Structure Theory.
Our corpus consists of 150 journalistic texts, each text having an average of around 400 words.
Our text-level discourse parser is trained using gold segmentation and is built upon the DPLP discoursebank.
arXiv Detail & Related papers (2021-06-25T18:15:47Z)
- Stylistic Analysis of the French Presidential Speeches: Is Macron really different? [4.5687771576879594]
This study shows that de Gaulle's rhetoric is not mainly dedicated to his own person, or that the two terms of J. Chirac are not fully similar.
According to several overall stylistic indicators, Macron's style does not appear as complex compared to his predecessors.
Compared to recent US presidents, the French ones show some similarities (e.g., similar mean sentence length) and some dissimilarities (more I-words, fewer we-words).
arXiv Detail & Related papers (2021-05-06T17:35:31Z)
- The interconnectedness of the economic content in the speeches of the US Presidents [1.160208922584163]
We examine the economic content of 951 speeches delivered by 45 US Presidents, from George Washington (April 1789) to Donald Trump (February 2017).
The goal of our study is to examine the structure of significant interconnections within a network obtained from the economic content of presidential speeches.
arXiv Detail & Related papers (2020-02-18T21:10:55Z)
- Heaps' law and Heaps functions in tagged texts: Evidences of their linguistic relevance [0.0]
We study the relationship between vocabulary size and text length in a corpus of 75 literary works in English.
We analyze the progressive appearance of new words of each tag along each individual text.
arXiv Detail & Related papers (2020-01-07T17:05:16Z)