Fugu-MT Paper Translation (Abstract): An Embarrassingly Simple Graph Heuristic Reveals Shortcut-Solvable Benchmarks for Sequential Recommendation

Paper overview: An Embarrassingly Simple Graph Heuristic Reveals Shortcut-Solvable Benchmarks for Sequential Recommendation

arxiv url: http://arxiv.org/abs/2605.07125v1
Date: Fri, 08 May 2026 02:00:11 GMT
Status: Translation complete
System update date: 2026-05-11 19:43:38.735929
Title: An Embarrassingly Simple Graph Heuristic Reveals Shortcut-Solvable Benchmarks for Sequential Recommendation
Title (reference translation): A Very Simple Graph Heuristic Reveals Shortcut-Solvable Benchmarks for Sequential Recommendation
Authors: Haoyu Han, Li Ma, Hanbing Wang, Bingheng Li, Daochen Zha, Chun How Tan, Huiji Gao, Xin Liu, Stephanie Moyerman, Sanjeev Katariya, Hui Liu, Jiliang Tang
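Abstract summary: Sequential recommendation is shifting toward generative recommenders that combine sequential patterns with semantic item information. These methods are often evaluated on a small set of widely used benchmarks, which raises a key question: do these benchmarks actually require the advanced modeling capabilities that modern generative recommenders aim to provide? The authors audit the benchmarks with an intentionally simple graph heuristic that starts from the last one or two items, retrieves candidates from a few-hop item-transition graph, and ranks them by item-feature similarity.
Reference score (independently computed attention level): 50.09718257952108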
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Sequential recommendation has increasingly shifted toward generative recommenders that combine sequential patterns with semantic item information. Yet these methods are often evaluated on a small set of widely used benchmarks, raising a key question: do these benchmarks actually require the advanced modeling capabilities that modern generative recommenders claim to provide? We conduct a benchmark audit with an intentionally simple graph heuristic. Starting from only the last one or two interacted items, it retrieves candidates from a few-hop item-transition graph and ranks them by item-feature similarity. Despite using no sequence encoder, generative objective, or training, this heuristic matches or outperforms many modern baselines, with relative NDCG@10 improvements of 38.10% and 44.18% over the best competing baseline on Amazon Review Sports and CDs. We show that this behavior reflects shortcut solvability rather than an artifact of one heuristic. We identify three shortcut structures that can make next-item prediction easier than expected: low-branching local transitions, feature-smooth transitions, and limited dependence on long user histories. These shortcuts need not appear together; even one or two strong signals can make simple local retrieval highly competitive, while weakening them makes the benefits of more sophisticated models clearer. Across 14 datasets, model rankings vary substantially with dataset properties, yet the heuristic remains competitive on 10 of them. Our findings suggest that strong performance on standard benchmarks does not always demonstrate advanced sequential, semantic, or generative modeling ability. We call for more careful dataset selection and dataset-level diagnostic analysis when using benchmarks to support claims about new recommendation models.
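The heuristic described in the abstract (seed with the last one or two interacted items, retrieve candidates from a few-hop item-transition graph, rank by item-feature similarity) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the use of cosine similarity, the max-over-seeds scoring rule, the exclusion of seed items from the candidates, and the toy data are all assumptions made for illustration.

```python
from collections import defaultdict
import math


def build_transition_graph(sequences):
    """Directed graph with an edge a -> b whenever b directly follows a."""
    graph = defaultdict(set)
    for seq in sequences:
        for a, b in zip(seq, seq[1:]):
            if a != b:
                graph[a].add(b)
    return graph


def few_hop_candidates(graph, seeds, max_hops=2):
    """Items reachable from any seed within max_hops transitions."""
    frontier = set(seeds)
    reached = set()
    for _ in range(max_hops):
        step = {nxt for item in frontier for nxt in graph.get(item, ())}
        frontier = step - reached
        reached |= frontier
    # Assumption: items the user just interacted with are not recommended again.
    return reached - set(seeds)


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(y * y for y in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def recommend(history, graph, features, k=10, n_last=2, max_hops=2):
    """Rank few-hop candidates by their best similarity to the seed items."""
    seeds = history[-n_last:]
    candidates = few_hop_candidates(graph, seeds, max_hops)

    def score(item):
        # Assumption: score is the maximum similarity to any seed item.
        return max(cosine(features[item], features[s]) for s in seeds)

    # Sort by descending score; break ties by item id for determinism.
    return sorted(candidates, key=lambda item: (-score(item), item))[:k]


# Toy data (hypothetical): three training sequences and 2-d item features.
train_sequences = [["a", "b", "c"], ["a", "b", "d"], ["b", "c", "e"]]
item_features = {
    "a": [1.0, 0.0],
    "b": [0.9, 0.1],
    "c": [0.8, 0.2],
    "d": [0.0, 1.0],
    "e": [0.5, 0.5],
}
graph = build_transition_graph(train_sequences)
print(recommend(["a", "b"], graph, item_features, k=3))  # ['c', 'e', 'd']
```

Nothing here is trained: the graph is built by counting observed transitions once, and ranking is a similarity lookup, which mirrors the abstract's point that the heuristic uses no sequence encoder, generative objective, or training.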
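Abstract (reference translation): Sequential recommendation is moving toward generative recommenders that combine sequential patterns with semantic item information. However, these methods are often evaluated on a small number of widely used benchmarks, which raises an important question: do these benchmarks really require the advanced modeling capabilities that modern generative recommenders claim to offer? The authors carry out a benchmark audit using an intentionally simple graph heuristic. Starting from only the last one or two interacted items, it retrieves candidates from a few-hop item-transition graph and ranks them by item-feature similarity. Although it uses no sequence encoder, generative objective, or training, the heuristic matches or outperforms many modern baselines, with relative NDCG@10 improvements of 38.10% and 44.18% over the best competing baseline on Amazon Review Sports and CDs. The authors show that this behavior reflects shortcut solvability rather than an artifact of a single heuristic. They identify three shortcut structures that can make next-item prediction easier than expected: low-branching local transitions, feature-smooth transitions, and limited dependence on long user histories. These shortcuts need not appear together; even one or two strong signals can make simple local retrieval highly competitive, while weakening them makes the benefits of more sophisticated models clearer. Across 14 datasets, model rankings vary substantially with dataset properties, yet the heuristic remains competitive on 10 of them. The results suggest that strong performance on standard benchmarks does not always demonstrate advanced sequential, semantic, or generative modeling ability. The authors call for more careful dataset selection and dataset-level diagnostic analysis when benchmarks are used to support claims about new recommendation models.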
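To make the reported metric concrete, the sketch below shows how NDCG@10 and a relative improvement are commonly computed for next-item prediction. It assumes a single held-out target item per user, in which case the ideal DCG is 1 and NDCG@k reduces to 1 / log2(rank + 1) when the target appears in the top k. The function names and the numbers in the usage lines are hypothetical and are not taken from the paper.

```python
import math


def ndcg_at_k(ranked_items, target, k=10):
    """NDCG@k with exactly one relevant item, so the ideal DCG equals 1."""
    top_k = ranked_items[:k]
    if target not in top_k:
        return 0.0
    rank = top_k.index(target) + 1  # 1-based position of the target
    return 1.0 / math.log2(rank + 1)


def relative_improvement(new_score, baseline_score):
    """Relative improvement of new_score over baseline_score, in percent."""
    return 100.0 * (new_score - baseline_score) / baseline_score


# Hypothetical usage: the target sits at rank 3 of a ranked list.
print(ndcg_at_k(["c", "e", "d"], "d"))   # 0.5
# Hypothetical scores, chosen only to illustrate the arithmetic.
print(relative_improvement(0.05, 0.04))  # approximately 25.0
```

A relative improvement of 38.10%, for example, means the heuristic's NDCG@10 is 1.3810 times the best baseline's score, not that it is 38.10 points higher in absolute terms.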
