Fugu-MT 論文翻訳(概要): Unified Multi-Task Relevance Modeling for E-Commerce: Comparing Task Routing Architectures Across LLMs and Cross-Encoders

論文の概要: Unified Multi-Task Relevance Modeling for E-Commerce: Comparing Task Routing Architectures Across LLMs and Cross-Encoders

arxiv url: http://arxiv.org/abs/2606.23919v1
Date: Mon, 22 Jun 2026 20:32:47 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-24 22:16:48.684323
Title: Unified Multi-Task Relevance Modeling for E-Commerce: Comparing Task Routing Architectures Across LLMs and Cross-Encoders
Title（参考訳）: 電子商取引のための統一マルチタスク関連モデリング:LCMとクロスエンコーダ間のタスクルーティングアーキテクチャの比較
Authors: Md Omar Faruk Rokon, Jhalak Nilesh Acharya, Shasvat Desai, Hong Yao, Kuang-chih Lee,
Abstract要約: 現在の業界プラクティスは、各タスクごとに別々のモデルに依存しており、知識の伝達を防ぎ、一貫性のない関連信号を生成する。エンコーダベースとデコーダのみモデルで、さまざまなメカニズムを通じてタスクのアイデンティティをエンコードします。共有3点関係尺度の下で6つのエンティティペアタスクを共同で訓練する統合マルチタスクフレームワークを提案する。
参考スコア（独自算出の注目度）: 5.423726451729119
License: http://creativecommons.org/licenses/by/4.0/
Abstract: How can we build a single relevance model that handles six different entity pair relationship types in e commerce from query product matching to product type similarity when each task has different data volumes, different semantic requirements, and potentially conflicting learning signals? This question is important because current industry practice relies on separate models for each task, preventing knowledge transfer and producing inconsistent relevance signals. Our work is driven by the following insight: encoder based and decoder only models encode task identity through different mechanisms, so the choice of task routing architecture how task identity is communicated to the shared model affects these two families in asymmetric ways. As our key novelty, we combine three ideas: (a) a unified multi task framework that jointly trains on six entity pair tasks under a shared three point relevance scale, (b) a systematic comparison of three task routing architectures (text prefix routing, multi head classification, and multihead with private transformer layers) across both LoRA adapted LLMs and fully finetuned cross encoders, and (c) a majority vote ensemble that exploits the diversity induced by private layer routing. First, we show that the MHP Ensemble (multi head with private layers) achieves 89.96% accuracy on 453K test examples the highest across all configurations . Second, we show that removing text prefixes without private layers causes severe degradation for decoder only LLMs while cross encoders remain robust , suggesting an encoder decoder asymmetry in task identity encoding. Third, we show that multi task training yields up to 14% improvement on low resource tasks over single task baselines.
Abstract（参考訳）: クエリ製品マッチングから、各タスクが異なるデータボリューム、異なるセマンティック要件、潜在的に矛盾する学習信号を持つ場合の製品タイプの類似性まで、eコマースにおいて6つの異なるエンティティペア関係タイプを処理する単一の関連モデルを構築するにはどうすればよいのか? この問題は、現在の業界慣行が各タスクごとに別々のモデルに依存し、知識伝達を防止し、一貫性のない関連信号を生成するため重要である。エンコーダベースとデコーダのみモデルが異なるメカニズムを通してタスクアイデンティティをエンコードするので、タスクルーティングアーキテクチャの選択は、タスクアイデンティティを共有モデルと通信する方法が、これらの2つのファミリーに非対称な方法で影響を及ぼす。重要なノベルティとして、私たちは3つのアイデアを組み合わせています。 (a)共有三点関係尺度の下で、6つのエンティティペアタスクを共同で訓練する統合マルチタスクフレームワーク。 b) LoRA 適応 LLM と完全微調整クロスエンコーダの両層にまたがる3つのタスクルーティングアーキテクチャ(テキストプレフィックスルーティング,マルチヘッド分類,マルチヘッド層とプライベートトランスフォーマー層)の体系的比較 (c)プライベートレイヤルーティングによって誘導される多様性を活用する多数決アンサンブル。まず、MHP Ensemble(プライベートレイヤを持つマルチヘッド)は、453Kテスト例で89.96%の精度を達成していることを示す。第2に, テキストプレフィックスをプライベートレイヤなしで削除すると, LLMのみのデコーダが大幅に劣化し, クロスエンコーダはロバストであり, タスクID符号化におけるエンコーダの非対称性が示唆される。第3に、マルチタスクトレーニングは、単一のタスクベースラインよりも低リソースタスクを最大14%改善することを示す。

論文の概要: Unified Multi-Task Relevance Modeling for E-Commerce: Comparing Task Routing Architectures Across LLMs and Cross-Encoders

関連論文リスト