Fugu-MT 論文翻訳(概要): BUSSARD: Normalizing Flows for Bijective Universal Scene-Specific Anomalous Relationship Detection

論文の概要: BUSSARD: Normalizing Flows for Bijective Universal Scene-Specific Anomalous Relationship Detection

arxiv url: http://arxiv.org/abs/2603.16645v1
Date: Tue, 17 Mar 2026 15:16:40 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-18 17:42:07.362437
Title: BUSSARD: Normalizing Flows for Bijective Universal Scene-Specific Anomalous Relationship Detection
Title（参考訳）: BUSSARD:Bijective Universal Scene-Specific Anomalous Relationship Detectionのための正規化フロー
Authors: Melissa Schween, Mathis Kruse, Bodo Rosenhahn,
Abstract要約: 本稿では,画像から生成されたシーングラフの異常関係を検出するための正規化フローベースモデルを提案する。我々の研究は、実世界の意味的知識を活用するために、シーングラフからオブジェクトと関係トークンを言語モデルに埋め込んだマルチモーダルなアプローチに従っている。提案手法は,現在の最先端モデルと比較して約10%のAUROC結果が得られると同時に,同時に5倍の高速化を実現している。
参考スコア（独自算出の注目度）: 27.222384424781975
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: We propose Bijective Universal Scene-Specific Anomalous Relationship Detection (BUSSARD), a normalizing flow-based model for detecting anomalous relations in scene graphs, generated from images. Our work follows a multimodal approach, embedding object and relationship tokens from scene graphs with a language model to leverage semantic knowledge from the real world. A normalizing flow model is used to learn bijective transformations that map object-relation-object triplets from scene graphs to a simple base distribution (typically Gaussian), allowing anomaly detection through likelihood estimation. We evaluate our approach on the SARD dataset containing office and dining room scenes. Our method achieves around 10% better AUROC results compared to the current state-of-the-art model, while simultaneously being five times faster. Through ablation studies, we demonstrate superior robustness and universality, particularly regarding the use of synonyms, with our model maintaining stable performance while the baseline shows 17.5% deviation. This work demonstrates the strong potential of learning-based methods for relationship anomaly detection in scene graphs. Our code is available at https://github.com/mschween/BUSSARD .
Abstract（参考訳）: 画像から生成したシーングラフの異常関係を検出するための正規化フローベースモデルであるBUSSARD(Bijective Universal Scene-Specific Anomalous Relationship Detection)を提案する。我々の研究は、実世界の意味的知識を活用するために、シーングラフからオブジェクトと関係トークンを言語モデルに埋め込んだマルチモーダルなアプローチに従っている。正規化フローモデルは、シーングラフから単純なベース分布(典型的にはガウス)にオブジェクト-リレーション-オブジェクト三重項をマッピングするビジェクティブ変換を学習するために使用され、推定による異常検出を可能にする。我々は、オフィスとダイニングルームのシーンを含むSARDデータセットに対するアプローチを評価した。提案手法は,現在の最先端モデルと比較して約10%のAUROC結果が得られると同時に,同時に5倍の高速化を実現している。アブレーション研究を通じて、特に同義語の使用に関して優れた堅牢性と普遍性を示し、ベースラインの偏差が17.5%であるのに対して、我々のモデルは安定した性能を維持している。本研究は,シーングラフにおける関係異常検出のための学習ベース手法の強い可能性を示す。私たちのコードはhttps://github.com/mschween/BUSSARDで利用可能です。

論文の概要: BUSSARD: Normalizing Flows for Bijective Universal Scene-Specific Anomalous Relationship Detection

関連論文リスト