Fugu-MT 論文翻訳(概要): Enhancing Healthcare Search Intent Recognition with Query Representation Learning and Session Context

論文の概要: Enhancing Healthcare Search Intent Recognition with Query Representation Learning and Session Context

arxiv url: http://arxiv.org/abs/2605.10021v1
Date: Mon, 11 May 2026 05:44:54 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-12 23:28:50.55486
Title: Enhancing Healthcare Search Intent Recognition with Query Representation Learning and Session Context
Title（参考訳）: 質問表現学習とセッションコンテキストを用いた医療検索インテント認識の強化
Authors: Harshita Jagdish Sahijwani, Madhav Sigdel, Song Aslan, Priya Gopi Achuthan, Monica D. Skidmore, Eugene Agichtein, Chen Lin,
Abstract要約: 本稿では,健康検索クエリの多面的特徴を捉えるために,新たな損失関数を提案する。我々は,健康クエリのあいまいさと,グローバル検索意図と個別セッションから識別された内容との相違を定量化する。
参考スコア（独自算出の注目度）: 7.918995314783813
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Classifying the intent behind healthcare search queries is crucial for improving the delivery of online healthcare information. The intricate nature of medical search queries, coupled with the limited availability of high-quality labeled data, presents substantial challenges for developing efficient classification models. Previous studies have exploited user interaction data, such as user clicks from search logs and employed pairwise loss functions to model co-click behavior for query representation learning. However, many health queries could have multiple intents, resulting in ambiguous or divergent click behavior. Furthermore, learning the single most popular intent of queries as inferred from global statistics based on the aggregate behavior of different users could potentially lead to disparity and performance drop when classifying the query intent within specific search sessions. To address these limitations, our work improves the query representation learning by aggregating similar queries via clustering, and introducing a novel loss function designed to capture the multifaceted nature of health search queries, resulting in a more scalable and accurate learning procedure. Furthermore, we quantify the ambiguity of health queries and the misalignment between global search intents and those discerned from individual sessions, by introducing the concordance rate (CR) score, and demonstrate a simple and effective method for incorporating our learned query representation into contextual, session-based search intent classification. Our extensive experimental results and analysis on two real-world search log datasets, i.e., a Health Search (HS) dataset and the publicly available TripClick dataset, demonstrate that our approach not only improves the intrinsic clustering metrics for query representation learning but also enhances accuracy for subsequent search intent classification tasks.
Abstract（参考訳）: 医療検索クエリの背後にある意図を分類することは、オンライン医療情報の配信を改善するために不可欠である。医用検索クエリの複雑な性質は、高品質なラベル付きデータの可用性の制限と相まって、効率的な分類モデルを開発する上で大きな課題となっている。従来の研究では、検索ログからのユーザクリックやペアワイズ損失関数といったユーザインタラクションデータを利用して、クエリ表現学習のためのコクリック動作をモデル化していた。しかし、多くのヘルスクエリには複数の意図があり、あいまいなクリック動作やバラバラなクリック動作をもたらす可能性がある。さらに、グローバル統計から推測されるクエリの最も一般的なインテントを学習することで、特定の検索セッション内でクエリインテントを分類する際に、異なるユーザの集合的振る舞いに基づいてクエリの最も一般的なインテントを学習することで、相違やパフォーマンス低下につながる可能性がある。これらの制約に対処するため、クラスタリングにより類似したクエリを集約することでクエリ表現学習を改善し、よりスケーラブルで正確な学習手順を実現するために、ヘルス検索クエリの多面的な性質を捉えるように設計された新しい損失関数を導入する。さらに,本研究は,学習した問合せ表現を文脈的,セッションベースの検索意図分類に組み込むための簡易かつ効果的な手法を実証し,健康クエリのあいまいさと,グローバル検索意図と個々のセッションから識別されたものとの相違を定量化する。実世界の2つの検索ログデータセット、すなわちHealth Search(HS)データセットとTripClickデータセットに関する広範な実験結果と分析により、我々のアプローチは、クエリ表現学習における固有のクラスタリングメトリクスの改善だけでなく、その後の検索意図分類タスクの精度の向上も示している。

論文の概要: Enhancing Healthcare Search Intent Recognition with Query Representation Learning and Session Context

関連論文リスト