Low-Resolution Editing is All You Need for High-Resolution Editing
- URL: http://arxiv.org/abs/2511.19945v1
- Date: Tue, 25 Nov 2025 05:35:32 GMT
- Title: Low-Resolution Editing is All You Need for High-Resolution Editing
- Authors: Junsung Lee, Hyunsoo Lee, Yong Jae Lee, Bohyung Han,
- Abstract summary: We introduce the task of high-resolution image editing and propose a test-time optimization framework to address it.<n>Our method performs patch-wise optimization on high-resolution source images, followed by a fine-grained detail transfer module and a novel synchronization strategy.
- Score: 67.6663530128766
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: High-resolution content creation is rapidly emerging as a central challenge in both the vision and graphics communities. While images serve as the most fundamental modality for visual expression, content generation that aligns with the user intent requires effective, controllable high-resolution image manipulation mechanisms. However, existing approaches remain limited to low-resolution settings, typically supporting only up to 1K resolution. In this work, we introduce the task of high-resolution image editing and propose a test-time optimization framework to address it. Our method performs patch-wise optimization on high-resolution source images, followed by a fine-grained detail transfer module and a novel synchronization strategy to maintain consistency across patches. Extensive experiments show that our method produces high-quality edits, facilitating the way toward high-resolution content creation.
Related papers
- Ultra High-Resolution Image Inpainting with Patch-Based Content Consistency Adapter [47.512192547392026]
We present Patch-Adapter, an effective framework for high-resolution text-guided image inpainting.<n>Our approach achieves 4K+ resolution while maintaining precise content consistency and prompt alignment.<n> Experiments demonstrate that Patch-Adapter not only resolves artifacts common in large-scale inpainting but also achieves state-of-the-art performance.
arXiv Detail & Related papers (2025-10-15T11:18:24Z) - HRSeg: High-Resolution Visual Perception and Enhancement for Reasoning Segmentation [74.1872891313184]
HRSeg is an efficient model with high-resolution fine-grained perception.<n>It features two key innovations: High-Resolution Perception (HRP) and High-Resolution Enhancement (HRE)
arXiv Detail & Related papers (2025-07-17T08:09:31Z) - High-Resolution Be Aware! Improving the Self-Supervised Real-World Super-Resolution [37.546746047196486]
Self-supervised learning is crucial for super-resolution because ground-truth images are usually unavailable for real-world settings.
Existing methods derive self-supervision from low-resolution images by creating pseudo-pairs or by enforcing a low-resolution reconstruction objective.
This paper strengthens awareness of the high-resolution image to improve the self-supervised real-world super-resolution.
arXiv Detail & Related papers (2024-11-25T08:13:32Z) - Task-Oriented Diffusion Inversion for High-Fidelity Text-based Editing [60.730661748555214]
We introduce textbfTask-textbfOriented textbfDiffusion textbfInversion (textbfTODInv), a novel framework that inverts and edits real images tailored to specific editing tasks.
ToDInv seamlessly integrates inversion and editing through reciprocal optimization, ensuring both high fidelity and precise editability.
arXiv Detail & Related papers (2024-08-23T22:16:34Z) - ResMaster: Mastering High-Resolution Image Generation via Structural and Fine-Grained Guidance [46.64836025290448]
ResMaster is a training-free method that empowers resolution-limited diffusion models to generate high-quality images beyond resolution restrictions.
It provides structural and fine-grained guidance for crafting high-resolution images on a patch-by-patch basis.
Experiments validate that ResMaster sets a new benchmark for high-resolution image generation and demonstrates promising efficiency.
arXiv Detail & Related papers (2024-06-24T09:28:21Z) - CoSeR: Bridging Image and Language for Cognitive Super-Resolution [74.24752388179992]
We introduce the Cognitive Super-Resolution (CoSeR) framework, empowering SR models with the capacity to comprehend low-resolution images.
We achieve this by marrying image appearance and language understanding to generate a cognitive embedding.
To further improve image fidelity, we propose a novel condition injection scheme called "All-in-Attention"
arXiv Detail & Related papers (2023-11-27T16:33:29Z) - Multi-Modality Image Super-Resolution using Generative Adversarial
Networks [0.0]
We propose a solution to the joint problem of image super-resolution and multi-modality image-to-image translation.
The problem can be stated as the recovery of a high-resolution image in a modality, given a low-resolution observation of the same image in an alternative modality.
arXiv Detail & Related papers (2022-06-18T12:19:31Z) - Semi-Supervised Super-Resolution [30.872605139672086]
Super-Resolution is the technique to improve the quality of a low-resolution photo by boosting its plausible resolution.
Previous Super-Resolution methods require vast amounts of data for training which becomes problematic in domains where very few low-resolution, high-resolution pairs might be available.
This paper introduces a semi-supervised way to perform super-resolution that can generate sharp, high-resolution images with as few as 500 paired examples.
arXiv Detail & Related papers (2022-04-18T07:10:29Z) - A Generative Model for Hallucinating Diverse Versions of Super
Resolution Images [0.3222802562733786]
We are tackling in this work the problem of obtaining different high-resolution versions from the same low-resolution image using Generative Adversarial Models.
Our learning approach makes use of high frequencies available in the training high-resolution images for preserving and exploring in an unsupervised manner.
arXiv Detail & Related papers (2021-02-12T17:11:42Z) - Semantic Layout Manipulation with High-Resolution Sparse Attention [106.59650698907953]
We tackle the problem of semantic image layout manipulation, which aims to manipulate an input image by editing its semantic label map.
A core problem of this task is how to transfer visual details from the input images to the new semantic layout while making the resulting image visually realistic.
We propose a high-resolution sparse attention module that effectively transfers visual details to new layouts at a resolution up to 512x512.
arXiv Detail & Related papers (2020-12-14T06:50:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.