Fugu-MT 論文翻訳(概要): AssoCiAm: A Benchmark for Evaluating Association Thinking while Circumventing Ambiguity

論文の概要: AssoCiAm: A Benchmark for Evaluating Association Thinking while Circumventing Ambiguity

arxiv url: http://arxiv.org/abs/2509.14171v1
Date: Wed, 17 Sep 2025 16:56:27 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-18 18:41:50.930529
Title: AssoCiAm: A Benchmark for Evaluating Association Thinking while Circumventing Ambiguity
Title（参考訳）: AssoCiAm: 曖昧さを回避しつつ, 関連思考を評価するためのベンチマーク
Authors: Yifan Liu, Wenkuan Zhao, Shanshan Zhong, Jinghui Qin, Mingfu Liang, Zhongzhan Huang, Wushao Wen,
Abstract要約: マルチモーダル大言語モデル(MLLM)は、人工知能(AGI)への有望な経路を提供するなど、大きな注目を集めている。 AGIに必要な重要な能力のうち、創造性はMLLMにとって重要な特性として現れ、その基盤として協会が機能している。 AssoCiAmは、ハイブリッド計算手法により曖昧さを回避しつつ、連想能力を評価するために設計されたベンチマークである。
参考スコア（独自算出の注目度）: 40.69669704668314
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Recent advancements in multimodal large language models (MLLMs) have garnered significant attention, offering a promising pathway toward artificial general intelligence (AGI). Among the essential capabilities required for AGI, creativity has emerged as a critical trait for MLLMs, with association serving as its foundation. Association reflects a model' s ability to think creatively, making it vital to evaluate and understand. While several frameworks have been proposed to assess associative ability, they often overlook the inherent ambiguity in association tasks, which arises from the divergent nature of associations and undermines the reliability of evaluations. To address this issue, we decompose ambiguity into two types-internal ambiguity and external ambiguity-and introduce AssoCiAm, a benchmark designed to evaluate associative ability while circumventing the ambiguity through a hybrid computational method. We then conduct extensive experiments on MLLMs, revealing a strong positive correlation between cognition and association. Additionally, we observe that the presence of ambiguity in the evaluation process causes MLLMs' behavior to become more random-like. Finally, we validate the effectiveness of our method in ensuring more accurate and reliable evaluations. See Project Page for the data and codes.
Abstract（参考訳）: MLLM(Multimodal large language model)の最近の進歩は、人工知能(AGI)への道筋として大きな注目を集めている。 AGIに必要な重要な能力のうち、創造性はMLLMにとって重要な特性として現れ、その基盤として協会が機能している。アソシエーションはモデルが創造的に考える能力を反映し、評価と理解が不可欠である。連想能力を評価するためにいくつかのフレームワークが提案されているが、それらはしばしば、関連性の異なる性質から生じ、評価の信頼性を損なう、関連性タスクの固有の曖昧さを見落としている。この問題に対処するため, あいまいさを2種類の内部曖昧性と外部曖昧性に分解し, ハイブリッド計算手法を用いてあいまいさを回避しつつ, 連想能力を評価するためのベンチマークAssoCiAmを導入する。次に,MLLMに関する広範な実験を行い,認知と相関の強い正の相関を明らかにする。さらに,評価プロセスにおけるあいまいさの存在は,MLLMの行動がよりランダムなものになることを観察する。最後に,提案手法の有効性を検証し,より正確で信頼性の高い評価を行う。データとコードについてはProject Pageを参照してください。

論文の概要: AssoCiAm: A Benchmark for Evaluating Association Thinking while Circumventing Ambiguity

関連論文リスト