Fugu-MT 論文翻訳(概要): The Overlooked Repetitive Lengthening Form in Sentiment Analysis

論文の概要: The Overlooked Repetitive Lengthening Form in Sentiment Analysis

arxiv url: http://arxiv.org/abs/2604.01268v1
Date: Wed, 01 Apr 2026 16:55:17 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-03 14:21:09.661813
Title: The Overlooked Repetitive Lengthening Form in Sentiment Analysis
Title（参考訳）: 感性分析における見過ごされた反復伸長形態
Authors: Lei Wang, Eduard Dragut,
Abstract要約: RLF (Repetitive Lengthening Form) は長年見過ごされてきた。 RLF文は表現表現であり,文書レベルの感情のシグネチャとして機能することを示す。 RLFはオンラインコンテンツ分析の潜在的な価値を持っている。
参考スコア（独自算出の注目度）: 2.6091312475521136
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Individuals engaging in online communication frequently express personal opinions with informal styles (e.g., memes and emojis). While Language Models (LMs) with informal communications have been widely discussed, a unique and emphatic style, the Repetitive Lengthening Form (RLF), has been overlooked for years. In this paper, we explore answers to two research questions: 1) Is RLF important for sentiment analysis (SA)? 2) Can LMs understand RLF? Inspired by previous linguistic research, we curate \textbf{Lengthening}, the first multi-domain dataset with 850k samples focused on RLF for SA. Moreover, we introduce \textbf{Exp}lainable \textbf{Instruct}ion Tuning (\textbf{ExpInstruct}), a two-stage instruction tuning framework aimed to improve both performance and explainability of LLMs for RLF. We further propose a novel unified approach to quantify LMs' understanding of informal expressions. We show that RLF sentences are expressive expressions and can serve as signatures of document-level sentiment. Additionally, RLF has potential value for online content analysis. Our results show that fine-tuned Pre-trained Language Models (PLMs) can surpass zero-shot GPT-4 in performance but not in explanation for RLF. Finally, we show ExpInstruct can improve the open-sourced LLMs to match zero-shot GPT-4 in performance and explainability for RLF with limited samples. Code and sample data are available at https://github.com/Tom-Owl/OverlookedRLF
Abstract（参考訳）: オンラインコミュニケーションに従事する個人は、非公式なスタイル(例えば、ミームや絵文字)で個人的な意見を表現することが多い。非公式なコミュニケーションを持つ言語モデル (LM) は広く議論されているが、ユニークな、強調的なスタイルであるRLF(Repetitive Lengthening Form)は長年見過ごされてきた。本稿では,2つの研究課題に対する回答について考察する。 1)RLFは感情分析(SA)に重要であるか? 2) LM は RLF を理解できるか? これまでの言語研究に触発されて,SA のための RLF に着目した 850k サンプルを用いた最初のマルチドメインデータセットである \textbf{Lengthening} をキュレートした。さらに, RLF における LLM の性能と説明性の向上を目的とした2段階の命令チューニングフレームワークである \textbf{Exp}lainable \textbf{Instruct}ion Tuning (\textbf{ExpInstruct}) を導入する。さらに,従来の表現に対するLMの理解を定量化する統一的手法を提案する。 RLF文は表現表現であり,文書レベルの感情のシグネチャとして機能することを示す。さらに、RLFはオンラインコンテンツ分析の潜在的な価値を持っている。以上の結果から,微調整事前学習言語モデル (PLM) はゼロショット GPT-4 を上回る性能を示すが, RLF を説明できないことが示唆された。最後に,ExpInstruct はオープンソース LLM を改良し,0ショット GPT-4 の性能と,限られたサンプルを用いた RLF の説明可能性を示す。コードとサンプルデータはhttps://github.com/Tom-Owl/OverlookedRLFで公開されている。

論文の概要: The Overlooked Repetitive Lengthening Form in Sentiment Analysis

関連論文リスト