Fugu-MT 論文翻訳(概要): Are You Sure You're Positive? Consolidating Chain-of-Thought Agents with Uncertainty Quantification for Aspect-Category Sentiment Analysis

論文の概要: Are You Sure You're Positive? Consolidating Chain-of-Thought Agents with Uncertainty Quantification for Aspect-Category Sentiment Analysis

arxiv url: http://arxiv.org/abs/2508.17258v1
Date: Sun, 24 Aug 2025 08:51:16 GMT
ステータス: 翻訳完了
システム内更新日: 2025-08-26 18:43:45.430176
Title: Are You Sure You're Positive? Consolidating Chain-of-Thought Agents with Uncertainty Quantification for Aspect-Category Sentiment Analysis
Title（参考訳）: 正当性は確かか? アスペクト・カテゴリ・センシティメント分析のための不確かさ定量化剤の統合
Authors: Filippos Ventirozos, Peter Appleby, Matthew Shardlow,
Abstract要約: データセットのアノテーションに必要な時間とリソースが限られている場合、ゼロショット設定で大きな言語モデルを活用することは有益である、と我々は主張する。本稿では,大規模言語モデルのトークンレベルの不確実性スコアを活用することで,複数のチェーンオブ思考エージェントを組み合わせる新しい手法を提案する。
参考スコア（独自算出の注目度）: 4.14197005718384
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Aspect-category sentiment analysis provides granular insights by identifying specific themes within product reviews that are associated with particular opinions. Supervised learning approaches dominate the field. However, data is scarce and expensive to annotate for new domains. We argue that leveraging large language models in a zero-shot setting is beneficial where the time and resources required for dataset annotation are limited. Furthermore, annotation bias may lead to strong results using supervised methods but transfer poorly to new domains in contexts that lack annotations and demand reproducibility. In our work, we propose novel techniques that combine multiple chain-of-thought agents by leveraging large language models' token-level uncertainty scores. We experiment with the 3B and 70B+ parameter size variants of Llama and Qwen models, demonstrating how these approaches can fulfil practical needs and opening a discussion on how to gauge accuracy in label-scarce conditions.
Abstract（参考訳）: アスペクトカテゴリの感情分析は、特定の意見に関連する製品レビュー内の特定のテーマを特定することで、詳細な洞察を提供する。教師付き学習アプローチがこの分野を支配している。しかし、新しいドメインにアノテートするデータは少なく、高価である。データセットのアノテーションに必要な時間とリソースが限られている場合、ゼロショット設定で大きな言語モデルを活用することは有益である、と我々は主張する。さらに、アノテーションバイアスは、教師付きメソッドを使用して強い結果をもたらすかもしれないが、アノテーションや要求再現性に欠けるコンテキストにおいて、新しいドメインに貧弱に転送する。本研究では,大規模言語モデルのトークンレベルの不確実性スコアを活用することで,複数のチェーンオブ思考エージェントを組み合わせる新しい手法を提案する。 Llama と Qwen モデルの 3B および 70B+ のパラメータサイズ変種を実験し、これらの手法が実際的なニーズを満たす方法を示し、ラベルスカース条件の精度を評価する方法について議論する。

論文の概要: Are You Sure You're Positive? Consolidating Chain-of-Thought Agents with Uncertainty Quantification for Aspect-Category Sentiment Analysis

関連論文リスト