Fugu-MT 論文翻訳(概要): Transparent and Controllable Recommendation Filtering via Multimodal Multi-Agent Collaboration

論文の概要: Transparent and Controllable Recommendation Filtering via Multimodal Multi-Agent Collaboration

arxiv url: http://arxiv.org/abs/2604.17459v1
Date: Sun, 19 Apr 2026 14:19:28 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-21 21:52:52.537797
Title: Transparent and Controllable Recommendation Filtering via Multimodal Multi-Agent Collaboration
Title（参考訳）: マルチモーダル多エージェント協調による透明かつ制御可能なレコメンデーションフィルタ
Authors: Chi Zhang, Zhipeng Xu, Jiahao Liu, Dongsheng Li, Hansu Gu, Peng Zhang, Ning Gu, Tun Lu,
Abstract要約: 我々は、エンドツーエンドのコラボレーション、マルチモーダル認識、マルチエージェントオーケストレーションを統合する新しいフレームワークを紹介します。本システムでは,帰納的幻覚を除去するために,ファクトグラウンドの偏見パイプラインを用いている。ダイナミックな2階層の嗜好グラフを構築し、明示的なループ内の人為的な修正を可能にする。
参考スコア（独自算出の注目度）: 48.028080782524064
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: While personalized recommender systems excel at content discovery, they frequently expose users to undesirable or discomforting information, highlighting the critical need for user-centric filtering tools. Current methods leveraging Large Language Models (LLMs) struggle with two major bottlenecks: they lack multimodal awareness to identify visually inappropriate content, and they are highly prone to "over-association" -- incorrectly generalizing a user's specific dislike (e.g., anxiety-inducing marketing) to block benign, educational materials. These unconstrained hallucinations lead to a high volume of false positives, ultimately undermining user agency. To overcome these challenges, we introduce a novel framework that integrates end-to-cloud collaboration, multimodal perception, and multi-agent orchestration. Our system employs a fact-grounded adjudication pipeline to eliminate inferential hallucinations. Furthermore, it constructs a dynamic, two-tier preference graph that allows for explicit, human-in-the-loop modifications (via Delta-adjustments), explicitly preventing the algorithm from catastrophically forgetting fine-grained user intents. Evaluated on an adversarial dataset comprising 473 highly confusing samples, the proposed architecture effectively curbed over-association, decreasing the false positive rate by 74.3% and achieving nearly twice the F1-Score of traditional text-only baselines. Additionally, a 7-day longitudinal field study with 19 participants demonstrated robust intent alignment and enhanced governance efficiency. User feedback confirmed that the framework drastically improves algorithmic transparency, rebuilds user control, and alleviates the fear of missing out (FOMO), paving the way for transparent human-AI co-governance in personalized feeds.
Abstract（参考訳）: パーソナライズされたレコメンデータシステムはコンテンツ発見に長けているが、ユーザーを好ましくない、あるいは不快な情報にさらし、ユーザー中心のフィルタリングツールに対する重要なニーズを強調している。視覚的に不適切なコンテンツを特定するためのマルチモーダルな認識が欠如しており、ユーザの特定の嫌悪(不安を引き起こすマーケティングなど)を誤って一般化して、良質な教育材料をブロックする「過剰な連想(over-asociation)」の傾向が高い。これらの制約のない幻覚は、大量の偽陽性を引き起こし、最終的にはユーザーエージェンシーを損なう。これらの課題を克服するために、エンドツーエンドのコラボレーション、マルチモーダル認識、マルチエージェントオーケストレーションを統合する新しいフレームワークを導入します。本システムでは,帰納的幻覚を除去するために,ファクトグラウンドの偏見パイプラインを用いている。さらに、デルタ調整(Delta-adjustments)を通じて)明示的なループ修正を可能にするダイナミックな2階層の選好グラフを構築し、そのアルゴリズムが破滅的にきめ細かなユーザ意図を忘れることを防ぐ。 473個の非常に混乱したサンプルからなる敵対的データセットで評価し、提案アーキテクチャは、過剰結合を効果的に抑制し、偽陽性率を74.3%減少させ、従来のテキストのみのベースラインのF1スコアの2倍近くを達成する。さらに、19人の参加者による7日間の縦断調査では、強い意図の整合性を示し、ガバナンス効率を向上した。ユーザからのフィードバックは、このフレームワークがアルゴリズムの透明性を大幅に改善し、ユーザコントロールを再構築し、欠席(FOMO)の恐れを軽減し、パーソナライズされたフィードにおける透明な人間とAIの共同統治の道を開くことを確認した。

論文の概要: Transparent and Controllable Recommendation Filtering via Multimodal Multi-Agent Collaboration

関連論文リスト