Fugu-MT 論文翻訳(概要): Learning to Rotate: Temporal and Semantic Rotary Encoding for Sequential Modeling

論文の概要: Learning to Rotate: Temporal and Semantic Rotary Encoding for Sequential Modeling

arxiv url: http://arxiv.org/abs/2604.24717v1
Date: Mon, 27 Apr 2026 17:26:15 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-28 17:12:08.269164
Title: Learning to Rotate: Temporal and Semantic Rotary Encoding for Sequential Modeling
Title（参考訳）: 回転の学習:時系列モデリングのための時間的・意味的な回転符号化
Authors: Hailing Cheng, Daqi Sun, Xinyu Lu,
Abstract要約: SIREN-RoPEを導入し、不均一な信号で回転次元をポップアップさせる。評価モデルとしてジェネレーティブ・レコメンデータを用いた生産規模のニュースフィードデータセットの評価を行った。
参考スコア（独自算出の注目度）: 6.277197086409669
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Every Transformer architecture dedicates enormous capacity to learning rich representations in semantic embedding space -- yet the rotation manifold acted upon by Rotary Positional Embeddings (RoPE) has been treated as a fixed, hand-crafted structure, populated only by discrete ordinal indices. We argue that this rotation space is a largely overlooked second dimension of expressivity in the attention mechanism, one whose systematic exploration may open a new door for attention-based architectures. The analogy to complex numbers is instructive: just as introducing the imaginary axis -- orthogonal to and independent of the real line -- unlocked new algebraic structure once believed impossible, treating the rotation manifold as a learnable, signal-conditioned space opens an orthogonal degree of freedom in attention. In this framing, the token embedding encodes the semantic (real) component of a representation -- what a token means -- while the rotation encodes its dynamic (imaginary) component -- how it relates to every other token across time, position, and context. We introduce SIREN-RoPE, a concrete instantiation of this idea, which populates the rotation dimension with heterogeneous signals -- continuous timestamps, cyclical temporal patterns, and categorical metadata -- via a dual-branch Sinusoidal Representation Network (SIREN). As a proof of concept, we evaluate on a production-scale news feed dataset from a major social network using a generative recommender as the ranking model, demonstrating that activating this hidden dimension yields consistent improvements across calibration and ranking objectives with negligible computational overhead. We invite the community to view the rotation space not as a solved positional-encoding detail, but as an untapped axis whose rich structure may prove as consequential for attention as the imaginary unit proved for algebra.
Abstract（参考訳）: 全てのトランスフォーマーアーキテクチャは、セマンティックな埋め込み空間においてリッチな表現を学習するために膨大な能力を持つが、ロータリー位置埋め込み(Rotary Positional Embeddings, RoPE)によって作用する回転多様体は固定された手作り構造として扱われており、これは個別の順序指標によってのみ占有されている。我々は、この回転空間は注意機構における2次元の表現性のほとんど見落とされ、体系的な探索が注意に基づくアーキテクチャの新しい扉を開くかもしれないと主張する。虚数軸(実数直線の直交と独立)を導入するのと同様に、回転多様体を学習可能で信号条件付き空間として扱うことは、注意の直交自由度を開放する。このフレーミングでは、トークン埋め込みは表現の意味(実際の)コンポーネント -- トークンの意味 -- をエンコードし、ローテーションは動的(想像的な)コンポーネント -- をエンコードし、時間、位置、コンテキストを通じて他のすべてのトークンに関連付ける。本稿では,このアイデアの具体的インスタンス化であるSIREN-RoPEを紹介する。SIREN-RoPEは,2分岐の正弦波表現ネットワーク(SIREN)を介して,不均一信号(連続タイムスタンプ,巡回時間パターン,カテゴリーメタデータ)で回転次元をポップアップさせる。概念実証として,生成レコメンデータをランキングモデルとして用いて,大手ソーシャルネットワークからの大規模ニュースフィードデータセットを評価し,この隠れ次元を活性化することにより,キャリブレーションやランキングの目的に対して,計算オーバーヘッドを無視して一貫した改善が達成できることを実証した。我々はコミュニティに対して、回転空間を解決された位置エンコードの詳細ではなく、リッチな構造が代数の虚数単位として証明されたような注意を払わせるような、未解決の軸として見るように求めている。

論文の概要: Learning to Rotate: Temporal and Semantic Rotary Encoding for Sequential Modeling

関連論文リスト