Fugu-MT 論文翻訳(概要): Temporal Preference Optimization for Unsupervised Retrieval

論文の概要: Temporal Preference Optimization for Unsupervised Retrieval

arxiv url: http://arxiv.org/abs/2606.17664v1
Date: Tue, 16 Jun 2026 08:27:22 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-17 17:15:32.352001
Title: Temporal Preference Optimization for Unsupervised Retrieval
Title（参考訳）: 教師なし検索のための時間優先最適化
Authors: HyunJin Kim, Jaejun Shim, Young Jin Kim, JinYeong Bak,
Abstract要約: 新たなトレーニング手法であるTRPO(Temporal Retrieval Preference Optimization)を用いたTPOUR(Temporal Preference Optimization for Unsupervised Retriever)を提案する。 TPOURは学習時間埋め込みを通じて目に見えない時間に一般化し、連続的な時間的アライメントを可能にする。 Qwen-Embedding-8Bと比較すると、約72.7倍小さいにもかかわらず、TPOUR Contrieverは暗黙的なクエリで平均nDCG@5を+4.04 (+12.15%)、暗黙的なクエリで+4.98 (+15.21%)改善している。
参考スコア（独自算出の注目度）: 11.583736130556611
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Unsupervised dense retrievers offer scalability by learning semantic similarity from unlabeled documents via contrastive learning, but they struggle to capture the temporal relevance, retrieving semantically related but temporally misaligned documents-an important aspect when a document collection spans multiple time periods (e.g., retrieving documents from 2018-2025 for "Who is the president in 2019?" introduces temporal ambiguity). Existing methods rely on supervised training with explicit timestamps, which are not always feasible. We propose TPOUR (Temporal Preference Optimization for Unsupervised Retriever), which uses our novel training method Temporal Retrieval Preference Optimization (TRPO). TRPO reinterprets preference learning in the temporal dimension, guiding the retriever to favor temporally aligned documents. TPOUR further generalizes to unseen time periods via interpolation in a learned time embedding, enabling continuous temporal alignment. Experiments on temporal information retrieval (T-IR), TPOUR outperforms both unsupervised and supervised baselines. Compared to Qwen-Embedding-8B, despite being about 72.7x smaller, TPOUR Contriever improves average nDCG@5 by +4.04 (+12.15%) on explicit and +4.98 (+15.21%) on implicit queries. We provide our code at https://github.com/agwaBom/TPOUR.
Abstract（参考訳）: 教師なしの厳密なレトリバーは、対照的な学習を通じてラベルのないドキュメントからセマンティックな類似性を学ぶことでスケーラビリティを提供するが、それらは時間的関連性を捉え、セマンティックな関連性を取り出すのに苦労している。既存の方法は明示的なタイムスタンプによる教師付きトレーニングに依存しており、必ずしも実現可能であるとは限らない。本稿では,新しいトレーニング手法であるTRPOを用いたTPOUR(Temporal Preference Optimization for Unsupervised Retriever)を提案する。 TRPOは、時間次元における嗜好学習を再解釈し、検索者に時間的に整列された文書を優先させる。 TPOURはさらに、学習時間埋め込みにおける補間を通して、目に見えない時間に一般化し、連続的な時間的アライメントを可能にする。時間情報検索(T-IR)の実験では、TPOURは教師なしベースラインと教師なしベースラインの両方を上回っている。 Qwen-Embedding-8Bと比較すると、約72.7倍小さいにもかかわらず、TPOUR Contrieverは暗黙的なクエリで平均nDCG@5を+4.04 (+12.15%)、暗黙的なクエリで+4.98 (+15.21%)改善している。私たちはhttps://github.com/agwaBom/TPOUR.comでコードを公開しています。

論文の概要: Temporal Preference Optimization for Unsupervised Retrieval

関連論文リスト