Fugu-MT 論文翻訳(概要): Ruyi2.5 Technical Report

論文の概要: Ruyi2.5 Technical Report

arxiv url: http://arxiv.org/abs/2603.17311v1
Date: Wed, 18 Mar 2026 03:13:06 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-19 18:32:57.493307
Title: Ruyi2.5 Technical Report
Title（参考訳）: Ruyi2.5技術報告
Authors: Huan Song, Shuyu Tian, Qingfei Zhao, Wenhao Hong, Jiang Liu, Ting Long, Jiawei Shao, Xuelong Li,
Abstract要約: Ruyi2.5はAI Flowフレームワーク上に構築されたマルチモーダル家族モデルである。 Ruyi2.5-Cameraモデルは、プライバシ保護カメラサービスシステムとして開発されている。 BPPOはバイナリ応答の選択によってサンプルの冗長性を低減し、応答プレフィックスの勾配更新に集中する。
参考スコア（独自算出の注目度）: 46.52711895674739
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: We present Ruyi2.5, a multimodal familial model built on the AI Flow framework. Extending Ruyi2's "Train Once, Deploy Many" paradigm to the multimodal domain, Ruyi2.5 constructs a shared-backbone architecture that co-trains models of varying scales within a single unified pipeline, ensuring semantic consistency across all deployment tiers. Built upon Ruyi2.5, Ruyi2.5-Camera model is developed as a privacy-preserving camera service system, which instantiates Ruyi2.5-Camera into a two-stage recognition pipeline: an edge model applies information-bottleneck-guided irreversible feature mapping to de-identify raw frames at the source, while a cloud model performs deep behavior reasoning. To accelerate reinforcement learning fine-tuning, we further propose Binary Prefix Policy Optimization (BPPO), which reduces sample redundancy via binary response selection and focuses gradient updates on response prefixes, achieving a 2 to 3 times training speedup over GRPO. Experiments show Ruyi2.5 matches Qwen3-VL on the general multimodal benchmarks, while Ruyi2.5-Camera substantially outperforms Qwen3-VL on privacy-constrained surveillance tasks.
Abstract（参考訳）: 本稿では,AI Flowフレームワーク上に構築されたマルチモーダル家族モデルであるRuyi2.5を紹介する。 Ruyi2.5は、Ruyi2の"Train Once, Deploy Many"パラダイムをマルチモーダルドメインに拡張し、単一の統一パイプライン内でさまざまなスケールのモデルをコトレーニングする共有バックボーンアーキテクチャを構築し、すべてのデプロイメント層にわたってセマンティック一貫性を確保する。 Ruyi2.5をベースとしたRuyi2.5-Cameraモデルは、プライバシ保護カメラサービスシステムとして開発されており、Ruyi2.5-Cameraを2段階の認識パイプラインにインスタンス化する。さらに,強化学習の微調整を高速化するため,バイナリ応答選択によるサンプルの冗長性を低減し,応答プレフィックスの勾配更新に焦点を合わせ,GRPOよりも2～3倍のトレーニング高速化を実現する2次修正ポリシー最適化(BPPO)を提案する。実験によると、Ruyi2.5は一般的なマルチモーダルベンチマークでQwen3-VLと一致し、Ruyi2.5-Cameraはプライバシーに制約された監視タスクでQwen3-VLを大幅に上回っている。

論文の概要: Ruyi2.5 Technical Report

関連論文リスト