Fugu-MT 論文翻訳(概要): Match-Any-Events: Zero-Shot Motion-Robust Feature Matching Across Wide Baselines for Event Cameras

論文の概要: Match-Any-Events: Zero-Shot Motion-Robust Feature Matching Across Wide Baselines for Event Cameras

arxiv url: http://arxiv.org/abs/2604.18744v1
Date: Mon, 20 Apr 2026 18:48:53 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-22 22:41:49.432052
Title: Match-Any-Events: Zero-Shot Motion-Robust Feature Matching Across Wide Baselines for Event Cameras
Title（参考訳）: Match-Any-Events:イベントカメラの広いベースラインにマッチするゼロショットモーション・ロバスト機能
Authors: Ruijun Zhang, Hang Su, Kostas Daniilidis, Ziyun Wang,
Abstract要約: ゼロショット方式でクロスデータセットワイドベースライン対応を実現する最初のイベントマッチングモデルを提案する。本稿では,イベントストリームからマルチタイムな特徴を学習する,モーションロバストかつ計算効率のよいアテンションバックボーンを提案する。我々のフレームワークは、以前の最高のイベント特徴マッチングメソッドよりも37.7%改善されている。
参考スコア（独自算出の注目度）: 40.06828305096689
License: http://creativecommons.org/licenses/by-sa/4.0/
Abstract: Event cameras have recently shown promising capabilities in instantaneous motion estimation due to their robustness to low light and fast motions. However, computing wide-baseline correspondence between two arbitrary views remains a significant challenge, since event appearance changes substantially with motion, and learning-based approaches are constrained by both scalability and limited wide-baseline supervision. We therefore introduce the first event matching model that achieves cross-dataset wide-baseline correspondence in a zero-shot manner: a single model trained once is deployed on unseen datasets without any target-domain fine-tuning or adaptation. To enable this capability, we introduce a motion-robust and computationally efficient attention backbone that learns multi-timescale features from event streams, augmented with sparsity-aware event token selection, making large-scale training on diverse wide-baseline supervision computationally feasible. To provide the supervision needed for wide-baseline generalization, we develop a robust event motion synthesis framework to generate large-scale event-matching datasets with augmented viewpoints, modalities, and motions. Extensive experiments across multiple benchmarks show that our framework achieves a 37.7% improvement over the previous best event feature matching methods. Code and data are available at: https://github.com/spikelab-jhu/Match-Any-Events.
Abstract（参考訳）: イベントカメラは近年,低照度かつ高速な動きに対する頑健さから,瞬時動作推定の有望な能力を示した。しかし、イベントの出現は動きによって大きく変化し、学習ベースのアプローチはスケーラビリティと制限された広義の監視の両方によって制約されるため、任意の2つのビュー間のワイドベースライン対応の計算は依然として大きな課題である。したがって、ゼロショット方式でクロスデータセットワイドベースライン対応を実現する最初のイベントマッチングモデルを導入する: 一度訓練された1つのモデルは、ターゲットドメインの微調整や適応なしに、未確認データセットにデプロイされる。この機能を実現するために,イベントストリームからマルチタイムな特徴を学習し,空間性を考慮したイベントトークン選択を付加し,多種多様な広義の監視を行う大規模トレーニングを実現する,モーションロバストかつ計算効率のよいアテンションバックボーンを導入する。広義の一般化に必要な監視を行うため,大規模イベントマッチングデータセットを拡張的な視点,モダリティ,動きで生成する,堅牢なイベントモーション合成フレームワークを開発した。複数のベンチマークにわたる大規模な実験により、我々のフレームワークは以前の最高のイベント特徴マッチング方法よりも37.7%改善していることがわかった。コードとデータは、https://github.com/spikelab-jhu/Match-Any-Events.comで入手できる。

論文の概要: Match-Any-Events: Zero-Shot Motion-Robust Feature Matching Across Wide Baselines for Event Cameras

関連論文リスト