Fugu-MT 論文翻訳(概要): Reasoning-enhanced Query Understanding through Decomposition and Interpretation

論文の概要: Reasoning-enhanced Query Understanding through Decomposition and Interpretation

arxiv url: http://arxiv.org/abs/2509.06544v1
Date: Mon, 08 Sep 2025 10:58:42 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-09 14:07:04.076302
Title: Reasoning-enhanced Query Understanding through Decomposition and Interpretation
Title（参考訳）: 分解と解釈による推論強化クエリ理解
Authors: Yunfei Zhong, Jun Yang, Yixing Fan, Jiafeng Guo, Lixin Su, Maarten de Rijke, Ruqing Zhang, Dawei Yin, Xueqi Cheng,
Abstract要約: ReDIは、分解と解釈によるクエリ理解のための推論強化アプローチである。我々は,大規模検索エンジンから実世界の複雑なクエリの大規模データセットをコンパイルした。 BRIGHT と BEIR の実験により、ReDI はスパースと密度の高い検索パラダイムの両方において、強いベースラインを一貫して超えることを示した。
参考スコア（独自算出の注目度）: 130.19204432111277
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Accurate inference of user intent is crucial for enhancing document retrieval in modern search engines. While large language models (LLMs) have made significant strides in this area, their effectiveness has predominantly been assessed with short, keyword-based queries. As AI-driven search evolves, long-form queries with intricate intents are becoming more prevalent, yet they remain underexplored in the context of LLM-based query understanding (QU). To bridge this gap, we introduce ReDI: a Reasoning-enhanced approach for query understanding through Decomposition and Interpretation. ReDI leverages the reasoning and comprehension capabilities of LLMs in a three-stage pipeline: (i) it breaks down complex queries into targeted sub-queries to accurately capture user intent; (ii) it enriches each sub-query with detailed semantic interpretations to improve the query-document matching; and (iii) it independently retrieves documents for each sub-query and employs a fusion strategy to aggregate the results for the final ranking. We compiled a large-scale dataset of real-world complex queries from a major search engine and distilled the query understanding capabilities of teacher models into smaller models for practical application. Experiments on BRIGHT and BEIR demonstrate that ReDI consistently surpasses strong baselines in both sparse and dense retrieval paradigms, affirming its effectiveness.
Abstract（参考訳）: ユーザ意図の正確な推測は、現代の検索エンジンにおける文書検索の強化に不可欠である。大規模言語モデル(LLM)はこの分野で大きな進歩を遂げてきたが、その効果は主に短いキーワードベースのクエリで評価されてきた。 AI駆動の検索が進化するにつれて、複雑な意図を持つ長文クエリがより普及しているが、LLMベースのクエリ理解(QU)の文脈では未探索のままである。このギャップを埋めるために、ReDI:分解と解釈によるクエリ理解のための推論強化アプローチを紹介します。 ReDIは、3段階パイプラインにおけるLLMの推論と理解能力を活用する。 i) 複雑なクエリをターゲットとするサブクエリに分解して、ユーザの意図を正確にキャプチャする。 (ii)クエリ文書マッチングを改善するために,各サブクエリに詳細なセマンティック解釈を付加する。三サブクエリごとに独立して文書を検索し、最終ランク付け結果の集計に融合戦略を用いる。我々は,大規模検索エンジンから実世界の複雑なクエリの大規模データセットをコンパイルし,教師モデルの問合せ理解能力をより小さなモデルに抽出し,実用的に適用した。 BRIGHTとBEIRの実験では、ReDIはスパースと密度の高い検索パラダイムの両方において強いベースラインを一貫して上回り、その有効性を確認している。

論文の概要: Reasoning-enhanced Query Understanding through Decomposition and Interpretation

関連論文リスト