Fugu-MT 論文翻訳(概要): MOSAIC: Modular Opinion Summarization using Aspect Identification and Clustering

論文の概要: MOSAIC: Modular Opinion Summarization using Aspect Identification and Clustering

arxiv url: http://arxiv.org/abs/2603.19277v1
Date: Sun, 01 Mar 2026 15:32:34 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-06 02:36:12.825507
Title: MOSAIC: Modular Opinion Summarization using Aspect Identification and Clustering
Title（参考訳）: MOSAIC:アスペクト識別とクラスタリングを用いたモジュールオピニオン要約
Authors: Piyush Kumar Singh, Jayesh Choudhari,
Abstract要約: 本稿では,要約を解釈可能なコンポーネントに分解する,産業展開用に設計されたスケーラブルでモジュール化されたフレームワークを提案する。オンラインA/Bテストによるアプローチの実践的影響を実運用の製品ページで検証し,中間出力を上向きにすることで,顧客エクスペリエンスが向上することを示す。システムレベルのコンポーネントとして意見クラスタリングを導入し,特にユーザレビューに典型的なノイズや冗長な条件下での信頼度を著しく向上することを示す。
参考スコア（独自算出の注目度）: 0.7519872646378835
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Reviews are central to how travelers evaluate products on online marketplaces, yet existing summarization research often emphasizes end-to-end quality while overlooking benchmark reliability and the practical utility of granular insights. To address this, we propose MOSAIC, a scalable, modular framework designed for industrial deployment that decomposes summarization into interpretable components, including theme discovery, structured opinion extraction, and grounded summary generation. We validate the practical impact of our approach through online A/B tests on live product pages, showing that surfacing intermediate outputs improves customer experience and delivers measurable value even prior to full summarization deployment. We further conduct extensive offline experiments to demonstrate that MOSAIC achieves superior aspect coverage and faithfulness compared to strong baselines for summarization. Crucially, we introduce opinion clustering as a system-level component and show that it significantly enhances faithfulness, particularly under the noisy and redundant conditions typical of user reviews. Finally, we identify reliability limitations in the standard SPACE dataset and release a new open-source tour experience dataset (TRECS) to enable more robust evaluation.
Abstract（参考訳）: レビューは、旅行者がオンラインマーケットプレースで製品を評価する方法の中心であるが、既存の要約研究は、ベンチマークの信頼性と詳細な洞察の実用性を見越しながら、エンドツーエンドの品質を強調していることが多い。そこで我々はMOSAICを提案する。MOSAICは産業展開用に設計されたスケーラブルでモジュール化されたフレームワークで,要約をテーマ発見,構造化意見抽出,グラウンドド・サマリ生成などの解釈可能なコンポーネントに分解する。我々は,オンラインのA/Bテストを通じて実稼働の製品ページ上でのアプローチの実践的影響を検証し,中間的なアウトプットによって顧客エクスペリエンスが向上し,総括デプロイに先立って測定可能な価値がもたらされることを示す。我々はさらに大規模なオフライン実験を行い、MOSAICが要約の強いベースラインよりも優れたアスペクトカバレッジと忠実性を達成することを示す。重要な点として,システムレベルのコンポーネントとして意見クラスタリングを導入し,特にユーザレビューに典型的なノイズや冗長な条件下での信頼感を著しく向上させることを示す。最後に、標準SPACEデータセットの信頼性制限を特定し、より堅牢な評価を可能にするために、新しいオープンソースツアー体験データセット(TRECS)をリリースする。

論文の概要: MOSAIC: Modular Opinion Summarization using Aspect Identification and Clustering

関連論文リスト