Fugu-MT Paper Translation (Abstract): Progressive Photorealistic Simplification

Paper summary: Progressive Photorealistic Simplification

arxiv url: http://arxiv.org/abs/2605.10409v1
Date: Mon, 11 May 2026 11:47:44 GMT
Status: Translation complete
Last updated in system: 2026-05-12 23:28:50.782288
Title: Progressive Photorealistic Simplification
Authors: Adi Rosenthal, Dana Berman, Yedid Hoshen, Ariel Shamir
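Abstract summary: This paper introduces progressive semantic image simplification, which iteratively reduces scene complexity by removing and inpainting elements. The method combines semantic understanding with generative editing, using Vision-Language Models (VLMs) to identify and prioritize elements for removal. To improve efficiency, the process is distilled into an image-to-video generation model that directly predicts coherent simplification sequences from a single input image.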
Reference score (independently computed attention measure): 35.59806534693362
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Existing image simplification techniques often rely on Non-Photorealistic Rendering (NPR), transforming photographs into stylized sketches, cartoons, or paintings. While effective at reducing visual complexity, such approaches typically sacrifice photographic realism. In this work, we explore a complementary direction: simplifying images while preserving their photorealistic appearance. We introduce progressive semantic image simplification, a framework that iteratively reduces scene complexity by removing and inpainting elements in a controlled manner. At each step, the resulting image remains a plausible natural photograph. Our method combines semantic understanding with generative editing, leveraging Vision-Language Models (VLMs) to identify and prioritize elements for removal, and a learned verifier to ensure photorealism and coherence throughout the process. This is implemented via an iterative Select-Remove-Verify pipeline that produces high-quality simplification trajectories. To improve efficiency, we further distill this process into an image-to-video generation model that directly predicts coherent simplification sequences from a single input image. Beyond generating cleaner and more focused compositions, our approach enables applications such as content-aware decluttering, semantic layer decomposition, and interactive editing. More broadly, our work suggests that simplification through structured content removal can serve as a practical mechanism for guiding visual interpretation within the photorealistic domain, complementing traditional abstraction methods.
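The iterative Select-Remove-Verify loop named in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function name `simplify_progressively`, the three callables `select`, `remove`, and `verify`, their signatures, and the stopping rule are all assumptions made here for clarity. In the paper these roles are filled by a VLM, a generative inpainting editor, and a learned verifier; the toy usage below replaces them with set operations so that the control flow can be run directly.

```python
from typing import Callable, List, Sequence, TypeVar

Image = TypeVar("Image")  # stands in for whatever image type is used


def simplify_progressively(
    image: Image,
    select: Callable[[Image], Sequence[str]],   # hypothetical: VLM ranking step
    remove: Callable[[Image, str], Image],      # hypothetical: remove + inpaint
    verify: Callable[[Image, Image], bool],     # hypothetical: learned verifier
    max_steps: int = 10,
) -> List[Image]:
    """Return the trajectory [input, step 1, step 2, ...]."""
    trajectory = [image]
    for _ in range(max_steps):
        current = trajectory[-1]
        accepted = None
        # Select: candidates ordered from highest to lowest removal priority.
        for element in select(current):
            candidate = remove(current, element)  # Remove (and inpaint)
            if verify(current, candidate):        # Verify realism/coherence
                accepted = candidate
                break
        if accepted is None:
            break  # no candidate passed verification, so stop here
        trajectory.append(accepted)
    return trajectory


# Toy usage: an "image" is just a set of element names (an assumption
# made only so the loop can be exercised without any models).
scene = frozenset({"person", "bench", "litter", "sign"})
priority = ["litter", "sign", "bench"]  # hypothetical removal order


def toy_select(img):
    return [e for e in priority if e in img]


def toy_remove(img, element):
    return img - {element}


def toy_verify(before, after):
    return "bench" in after  # toy rule: removing the bench is rejected


steps = simplify_progressively(scene, toy_select, toy_remove, toy_verify)
for step in steps:
    print(sorted(step))
```

Each entry in the returned list is one stage of the simplification trajectory. A sequence of this kind is what the abstract describes distilling into an image-to-video model, so that the whole trajectory is predicted in one pass from a single input image.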


Related papers