Clarity ChatGPT: An Interactive and Adaptive Processing System for Image
Restoration and Enhancement
- URL: http://arxiv.org/abs/2311.11695v1
- Date: Mon, 20 Nov 2023 11:51:13 GMT
- Title: Clarity ChatGPT: An Interactive and Adaptive Processing System for Image
Restoration and Enhancement
- Authors: Yanyan Wei, Zhao Zhang, Jiahuan Ren, Xiaogang Xu, Richang Hong, Yi
Yang, Shuicheng Yan, Meng Wang
- Abstract summary: We propose a transformative system that combines the conversational intelligence of ChatGPT with multiple IRE methods.
Our case studies demonstrate that Clarity ChatGPT effectively improves the generalization and interaction capabilities in the IRE.
- Score: 97.41630939425731
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The generalization capability of existing image restoration and enhancement
(IRE) methods is constrained by the limited pre-trained datasets, making it
difficult to handle agnostic inputs such as different degradation levels and
scenarios beyond their design scopes. Moreover, they are not equipped with
interactive mechanisms to consider user preferences or feedback, and their
end-to-end settings cannot provide users with more choices. Faced with the
above-mentioned IRE method's limited performance and insufficient
interactivity, we try to solve it from the engineering and system framework
levels. Specifically, we propose Clarity ChatGPT-a transformative system that
combines the conversational intelligence of ChatGPT with multiple IRE methods.
Clarity ChatGPT can automatically detect image degradation types and select
appropriate IRE methods to restore images, or iteratively generate satisfactory
results based on user feedback. Its innovative features include a CLIP-powered
detector for accurate degradation classification, no-reference image quality
evaluation for performance evaluation, region-specific processing for precise
enhancements, and advanced fusion techniques for optimal restoration results.
Clarity ChatGPT marks a significant advancement in integrating language and
vision, enhancing image-text interactions, and providing a robust,
high-performance IRE solution. Our case studies demonstrate that Clarity
ChatGPT effectively improves the generalization and interaction capabilities in
the IRE, and also fills the gap in the low-level domain of the existing
vision-language model.
Related papers
- Beyond Degradation Redundancy: Contrastive Prompt Learning for All-in-One Image Restoration [109.38288333994407]
Contrastive Prompt Learning (CPL) is a novel framework that fundamentally enhances prompt-task alignment.
Our framework establishes new state-of-the-art performance while maintaining parameter efficiency, offering a principled solution for unified image restoration.
arXiv Detail & Related papers (2025-04-14T08:24:57Z) - AVadCLIP: Audio-Visual Collaboration for Robust Video Anomaly Detection [57.649223695021114]
We present a novel weakly supervised framework that leverages audio-visual collaboration for robust video anomaly detection.
Our framework demonstrates superior performance across multiple benchmarks, with audio integration significantly boosting anomaly detection accuracy.
arXiv Detail & Related papers (2025-04-06T13:59:16Z) - ControlFusion: A Controllable Image Fusion Framework with Language-Vision Degradation Prompts [58.99648692413168]
Current image fusion methods struggle to address the composite degradations encountered in real-world imaging scenarios.
We propose ControlFusion, which adaptively neutralizes composite degradations.
In experiments, ControlFusion outperforms SOTA fusion methods in fusion quality and degradation handling.
arXiv Detail & Related papers (2025-03-30T08:18:53Z) - Semantic-Guided Global-Local Collaborative Networks for Lightweight Image Super-Resolution [9.666827340439669]
Single-Image Super-Resolution (SISR) plays a pivotal role in enhancing the accuracy and reliability of measurement systems.
We propose a Semantic-Guided Global-Local Collaborative Network (SGGLC-Net) for lightweight SISR.
arXiv Detail & Related papers (2025-03-20T11:43:55Z) - Adaptive Prompt Tuning: Vision Guided Prompt Tuning with Cross-Attention for Fine-Grained Few-Shot Learning [5.242869847419834]
Few-shot, fine-grained classification in computer vision poses significant challenges due to the need to differentiate subtle class distinctions with limited data.
This paper presents a novel method that enhances the Contrastive Language-Image Pre-Training model through adaptive prompt tuning.
arXiv Detail & Related papers (2024-12-19T08:51:01Z) - Integrating Object Detection Modality into Visual Language Model for Enhanced Autonomous Driving Agent [8.212818176634116]
We extend the Llama-Adapter architecture by incorporating a YOLOS-based detection network alongside the CLIP perception network.
Our approach introduces camera ID-separators to improve multi-view processing, crucial for comprehensive environmental awareness.
arXiv Detail & Related papers (2024-11-08T15:50:30Z) - DemosaicFormer: Coarse-to-Fine Demosaicing Network for HybridEVS Camera [70.28702677370879]
Hybrid Event-Based Vision Sensor (HybridEVS) is a novel sensor integrating traditional frame-based and event-based sensors.
Despite its potential, the lack of Image signal processing (ISP) pipeline specifically designed for HybridEVS poses a significant challenge.
We propose a coarse-to-fine framework named DemosaicFormer which comprises coarse demosaicing and pixel correction.
arXiv Detail & Related papers (2024-06-12T07:20:46Z) - SPIRE: Semantic Prompt-Driven Image Restoration [66.26165625929747]
We develop SPIRE, a Semantic and restoration Prompt-driven Image Restoration framework.
Our approach is the first framework that supports fine-level instruction through language-based quantitative specification of the restoration strength.
Our experiments demonstrate the superior restoration performance of SPIRE compared to the state of the arts.
arXiv Detail & Related papers (2023-12-18T17:02:30Z) - Prompt-based Ingredient-Oriented All-in-One Image Restoration [0.0]
We propose a novel data ingredient-oriented approach to tackle multiple image degradation tasks.
Specifically, we utilize a encoder to capture features and introduce prompts with degradation-specific information to guide the decoder.
Our method performs competitively to the state-of-the-art.
arXiv Detail & Related papers (2023-09-06T15:05:04Z) - Explainable bilevel optimization: an application to the Helsinki deblur
challenge [1.1470070927586016]
We present a bilevel optimization scheme for the solution of a general image deblurring problem.
A parametric variational-like approach is encapsulated within a machine learning scheme to provide a high quality reconstructed image.
arXiv Detail & Related papers (2022-10-18T11:36:37Z) - Enhancement by Your Aesthetic: An Intelligible Unsupervised Personalized
Enhancer for Low-Light Images [67.14410374622699]
We propose an intelligible unsupervised personalized enhancer (iUPEnhancer) for low-light images.
The proposed iUP-Enhancer is trained with the guidance of these correlations and the corresponding unsupervised loss functions.
Experiments demonstrate that the proposed algorithm produces competitive qualitative and quantitative results.
arXiv Detail & Related papers (2022-07-15T07:16:10Z) - Controllable Image Enhancement [66.18525728881711]
We present a semiautomatic image enhancement algorithm that can generate high-quality images with multiple styles by controlling a few parameters.
An encoder-decoder framework encodes the retouching skills into latent codes and decodes them into the parameters of image signal processing functions.
arXiv Detail & Related papers (2022-06-16T23:54:53Z) - Towards Unsupervised Deep Image Enhancement with Generative Adversarial
Network [92.01145655155374]
We present an unsupervised image enhancement generative network (UEGAN)
It learns the corresponding image-to-image mapping from a set of images with desired characteristics in an unsupervised manner.
Results show that the proposed model effectively improves the aesthetic quality of images.
arXiv Detail & Related papers (2020-12-30T03:22:46Z) - Image-Based Benchmarking and Visualization for Large-Scale Global
Optimization [6.5447678518952115]
An image-based visualization framework is proposed that visualizes the solutions to large-scale global optimization problems as images are proposed.
In the proposed framework, the pixels visualize decision variables while the entire image represents the overall solution quality.
The proposed framework is then demonstrated on arbitrary benchmark problems with known optima.
arXiv Detail & Related papers (2020-07-24T03:39:23Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.