Fugu-MT 論文翻訳(概要): Addressing Image Authenticity When Cameras Use Generative AI

論文の概要: Addressing Image Authenticity When Cameras Use Generative AI

arxiv url: http://arxiv.org/abs/2604.21879v1
Date: Thu, 23 Apr 2026 17:22:51 GMT
ステータス: 翻訳完了
システム内更新日: 2026-04-24 14:40:06.785658
Title: Addressing Image Authenticity When Cameras Use Generative AI
Title（参考訳）: カメラが生成AIを使用する際の画像の正当性に対処する
Authors: Umar Masud, Abhijith Punnappurath, Luxi Zhao, David B. Lindell, Michael S. Brown,
Abstract要約: 強調されたキャプチャタイムの画像内容は、通常、エッジやテクスチャの強化など、良質である。ユーザーは、自分のカメラ画像のコンテンツが本物でないことに気づかないかもしれない。本稿では,カメラ画像の「アンハロシン化」バージョンをユーザが復元できるようにすることで,この問題に対処する。
参考スコア（独自算出の注目度）: 29.76740674812466
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The ability of generative AI (GenAI) methods to photorealistically alter camera images has raised awareness about the authenticity of images shared online. Interestingly, images captured directly by our cameras are considered authentic and faithful. However, with the increasing integration of deep-learning modules into cameras' capture-time hardware -- namely, the image signal processor (ISP) -- there is now a potential for hallucinated content in images directly output by our cameras. Hallucinated capture-time image content is typically benign, such as enhanced edges or texture, but in certain operations, such as AI-based digital zoom or low-light image enhancement, hallucinations can potentially alter the semantics and interpretation of the image content. As a result, users may not realize that the content in their camera images is not authentic. This paper addresses this issue by enabling users to recover the 'unhallucinated' version of the camera image to avoid misinterpretation of the image content. Our approach works by optimizing an image-specific multi-layer perceptron (MLP) decoder together with a modality-specific encoder so that, given the camera image, we can recover the image before hallucinated content was added. The encoder and MLP are self-contained and can be applied post-capture to the image without requiring access to the camera ISP. Moreover, the encoder and MLP decoder require only 180 KB of storage and can be readily saved as metadata within standard image formats such as JPEG and HEIC.
Abstract（参考訳）: 生成AI(GenAI)手法がカメライメージをフォトリアリスティックに修正する能力は、オンラインで共有される画像の真正性に対する認識を高めた。興味深いことに、我々のカメラから直接撮影した画像は本物で忠実だと考えられている。しかし、ディープラーニングモジュールをカメラのキャプチャタイムハードウェア(つまり、画像信号プロセッサ(ISP))に統合することで、カメラから直接出力される画像のコンテンツを幻覚させる可能性がある。しかしAIベースのデジタルズームや低照度画像強調のような特定の操作では、幻覚は画像の内容の意味や解釈を変える可能性がある。その結果、ユーザーは自分のカメラ画像のコンテンツが本物でないことに気づかないかもしれない。本稿では,画像内容の誤解釈を避けるために,ユーザーがカメラ画像の「アンハロシン化」バージョンを復元できるようにすることにより,この問題に対処する。本手法は,画像固有の多層パーセプトロン(MLP)デコーダをモダリティ固有のエンコーダとともに最適化することにより,カメラ画像から幻覚コンテンツを追加する前に画像の復元を行う。エンコーダとMLPは自己完結しており、カメラISPへのアクセスを必要とせず、画像に後処理を施すことができる。さらに、エンコーダとMPPデコーダは180KBのストレージしか必要とせず、JPEGやHEICなどの標準画像フォーマットでメタデータとして簡単に保存できる。

論文の概要: Addressing Image Authenticity When Cameras Use Generative AI

関連論文リスト