Fugu-MT 論文翻訳(概要): SurgicalMamba: Dual-Path SSD with State Regramming for Online Surgical Phase Recognition

論文の概要: SurgicalMamba: Dual-Path SSD with State Regramming for Online Surgical Phase Recognition

arxiv url: http://arxiv.org/abs/2605.14889v2
Date: Mon, 18 May 2026 02:24:40 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-19 17:57:46.007404
Title: SurgicalMamba: Dual-Path SSD with State Regramming for Online Surgical Phase Recognition
Title（参考訳）: 手術用Mamba:オンライン手術相認識のための状態リグラム付きデュアルパスSSD
Authors: Sukju Oh, Sukkyu Sun,
Abstract要約: オンライン外科的位相認識(SPR)は、コンテキスト対応の手術室システムを支える。我々は,Mamba2の構造的状態空間双対性(SSD)に基づいて構築された因果SPRモデルであるO(d)について述べる。 7つの公開SPRベンチマークで、OssageMambaは、厳格なオンライン評価の下で最先端の精度とフェーズレベルのJaccardに達した。
参考スコア（独自算出の注目度）: 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Online surgical phase recognition (SPR) underpins context-aware operating-room systems and requires committing to a prediction at every frame from past context alone. Surgical video poses three demands that natural-video recognizers do not jointly address: procedures span tens of thousands of frames, time flows non-uniformly as long routine stretches are punctuated by brief phase-defining transitions, and the visual domain is narrow so backbone features are strongly correlated across channels. Existing recognizers either let per-frame cost grow with elapsed length, or hold cost bounded but advance state at a uniform rate with channel-independent dynamics, leaving the latter two demands unaddressed. We present SurgicalMamba, a causal SPR model built on Mamba2's structured state-space duality (SSD) that holds per-frame cost at O(d). It introduces three SSD-compatible components, each targeting one demand: a dual-path SSD block that separates long- and short-term regimes at the level of recurrent state; intensity-modulated stepping, a continuous-time time-warp that adapts the slow path's effective rate to phase-relevant information; and state regramming, a per-chunk Cayley rotation that opens cross-channel mixing in the otherwise axis-aligned SSM recurrence. The learned rotation planes inherit a phase-aligned structure without any direct supervision, offering an interpretable internal signature of surgical workflow. Across seven public SPR benchmarks, SurgicalMamba reaches state-of-the-art accuracy and phase-level Jaccard under strict online evaluation: 94.6%/82.7% on Cholec80 (+0.7 pp/+2.2 pp over the strongest prior) and 89.5%/68.9% on AutoLaparo (+1.7 pp/+2.0 pp), at 238.74 fps on a single GPU. Ablations isolate the contribution of each component. The code is publicly available at https://github.com/sukjuoh/Surgical-Mamba.
Abstract（参考訳）: オンライン外科的位相認識(SPR)は、コンテキスト対応の手術室システムを支えるものであり、過去のコンテキストのみからの全てのフレームでの予測にコミットする必要がある。プロシージャは数万のフレームにまたがり、長いルーチンストレッチが短い位相定義遷移によって句読されるため、時間の流れは不均一であり、視覚領域は狭く、バックボーンの特徴はチャネル間で強く相関している。既存の認識器は、フレーム単位のコストを経過した長さで増加させるか、あるいは、チャンネル非依存のダイナミックスと均一な速度でバウンダリを保ち、後者の2つの要求は未適応のままである。我々は,Mamba2の構造的状態空間双対性(SSD)に基づいて構築された因果SPRモデルであるO(d)について述べる。 SSD互換コンポーネントは3つあり、それぞれ1つの要求をターゲットにしている: 長期状態と短期状態の分離を行うデュアルパスSSDブロック、強度変調されたステップ、スローパスの有効レートを位相関連情報に適応する連続時間のタイムワープ、チャンク毎のケイリー回転で軸方向のSSMリカレンスを開封するステートリグラム。学習された回転面は、直接の監督なしに位相整列構造を継承し、外科的ワークフローの解釈可能な内部シグネチャを提供する。 7つの公開SPRベンチマークで、オペレーショナルマンバは厳格なオンライン評価のもと、94.6%/82.7%のCholec80 (+0.7 pp/+2.2 pp)、89.5%/68.9%のAutoLaparo (+1.7 pp/+2.0 pp)、そして238.74 fpsの238.74 fpsに達した。アブレーションは各コンポーネントのコントリビューションを分離する。コードはhttps://github.com/sukjuoh/Surgical-Mamba.comで公開されている。

論文の概要: SurgicalMamba: Dual-Path SSD with State Regramming for Online Surgical Phase Recognition

関連論文リスト