Fugu-MT 論文翻訳(概要): PointRFT: Explicit Reinforcement Fine-tuning for Point Cloud Few-shot Learning

論文の概要: PointRFT: Explicit Reinforcement Fine-tuning for Point Cloud Few-shot Learning

arxiv url: http://arxiv.org/abs/2603.23957v1
Date: Wed, 25 Mar 2026 05:36:32 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-26 21:06:11.146952
Title: PointRFT: Explicit Reinforcement Fine-tuning for Point Cloud Few-shot Learning
Title（参考訳）: PointRFT:Point Cloud Few-shot Learningのための明示的な強化微調整
Authors: Yankai Wang, Yiding Sun, Qirui Wang, Pengbo Li, Chaoyi Lu, Dongxu Zhang,
Abstract要約: 我々は,ポイントクラウド表現学習に特化した,最初の強化微調整パラダイムであるPointRFTを提案する。 PointRFTは様々なベンチマークでバニラ教師付き微調整(SFT)を一貫して上回っている。ハイブリッドなPretraining-SFT-RFTパラダイムに統合されると、ポイントクラウド基盤モデルの表現能力は実質的に解放される。
参考スコア（独自算出の注目度）: 12.771216755928117
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Understanding spatial dynamics and semantics in point cloud is fundamental for comprehensive 3D comprehension. While reinforcement learning algorithms such as Group Relative Policy Optimization (GRPO) have recently achieved remarkable breakthroughs in large language models by incentivizing reasoning capabilities through strategic reward design, their potential remains largely unexplored in the 3D perception domain. This naturally raises a pivotal question: Can RL-based methods effectively empower 3D point cloud fine-tuning? In this paper, we propose PointRFT, the first reinforcement fine-tuning paradigm tailored specifically for point cloud representation learning. We select three prevalent 3D foundation models and devise specialized accuracy reward and dispersion reward functions to stabilize training and mitigate distribution shifts. Through comprehensive few-shot classification experiments comparing distinct training paradigms, we demonstrate that PointRFT consistently outperforms vanilla supervised fine-tuning (SFT) across diverse benchmarks. Furthermore, when organically integrated into a hybrid Pretraining-SFT-RFT paradigm, the representational capacity of point cloud foundation models is substantially unleashed, achieving state-of-the-art performance particularly under data-scarce scenarios.
Abstract（参考訳）: 点クラウドにおける空間力学と意味論の理解は、包括的3次元理解に不可欠である。 Group Relative Policy Optimization (GRPO) のような強化学習アルゴリズムは、近年、戦略的な報酬設計を通じて推論能力を高めることで、大きな言語モデルにおいて顕著なブレークスルーを達成したが、そのポテンシャルは3D知覚領域においてほとんど解明されていない。 RLベースの手法は、3Dポイントクラウドの微調整を効果的に強化できますか? 本稿では,ポイントクラウド表現学習に特化した,最初の強化微調整パラダイムであるPointRFTを提案する。 3つの一般的な3次元基礎モデルを選択し、トレーニングを安定させ、分散シフトを軽減するために、特殊精度報酬関数と分散報酬関数を考案する。異なるトレーニングパラダイムを比較した総合的な数ショット分類実験を通じて、PointRFTは様々なベンチマークでバニラ教師付き微調整(SFT)を一貫して上回っていることを実証する。さらに,ハイブリッドなPretraining-SFT-RFTパラダイムに有機的に統合されると,ポイントクラウド基盤モデルの表現能力は著しく低下し,特にデータスカースシナリオ下での最先端のパフォーマンスが達成される。

論文の概要: PointRFT: Explicit Reinforcement Fine-tuning for Point Cloud Few-shot Learning

関連論文リスト