An Ensemble Model for Distorted Images in Real Scenarios
- URL: http://arxiv.org/abs/2309.14998v1
- Date: Tue, 26 Sep 2023 15:12:55 GMT
- Title: An Ensemble Model for Distorted Images in Real Scenarios
- Authors: Boyuan Ji, Jianchang Huang, Wenzhuo Huang, Shuke He
- Abstract summary: In this paper, we apply the object detector YOLOv7 to detect distorted images from the CDCOCO dataset.
Through carefully designed optimizations, our model achieves excellent performance on the CDCOCO test set.
Our denoising detection model can denoise and repair distorted images, making the model useful in a variety of real-world scenarios and environments.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Image acquisition conditions and environments can significantly affect
high-level tasks in computer vision, and the performance of most computer
vision algorithms will be limited when trained on distortion-free datasets.
Even with updates in hardware such as sensors and deep learning methods, it
will still not work in the face of variable conditions in real-world
applications. In this paper, we apply the object detector YOLOv7 to detect
distorted images from the dataset CDCOCO. Through carefully designed
optimizations including data enhancement, detection box ensemble, denoiser
ensemble, super-resolution models, and transfer learning, our model achieves
excellent performance on the CDCOCO test set. Our denoising detection model can
denoise and repair distorted images, making the model useful in a variety of
real-world scenarios and environments.
Related papers
- BEHAVIOR Vision Suite: Customizable Dataset Generation via Simulation [57.40024206484446]
We introduce the BEHAVIOR Vision Suite (BVS), a set of tools and assets to generate fully customized synthetic data for systematic evaluation of computer vision models.
BVS supports a large number of adjustable parameters at the scene level.
We showcase three example application scenarios.
arXiv Detail & Related papers (2024-05-15T17:57:56Z) - Deep Domain Adaptation: A Sim2Real Neural Approach for Improving Eye-Tracking Systems [80.62854148838359]
Eye image segmentation is a critical step in eye tracking that has great influence over the final gaze estimate.
We use dimensionality-reduction techniques to measure the overlap between the target eye images and synthetic training data.
Our methods result in robust, improved performance when tackling the discrepancy between simulation and real-world data samples.
arXiv Detail & Related papers (2024-03-23T22:32:06Z) - Real-Time Object Detection in Occluded Environment with Background
Cluttering Effects Using Deep Learning [0.8192907805418583]
We concentrate on deep learning models for real-time detection of cars and tanks in an occluded environment with a cluttered background.
The developed method makes the custom dataset and employs a preprocessing technique to clean the noisy dataset.
The accuracy and frame per second of the SSD-Mobilenet v2 model are higher than YOLO V3 and YOLO V4.
arXiv Detail & Related papers (2024-01-02T01:30:03Z) - CD-COCO: A Versatile Complex Distorted COCO Database for
Scene-Context-Aware Computer Vision [6.48583124646155]
Image acquisition conditions have a major impact on the performance of high-level image processing.
We build a versatile database derived from MS-COCO database.
New local distortions are generated by considering the scene context.
arXiv Detail & Related papers (2023-11-12T22:28:19Z) - Let Segment Anything Help Image Dehaze [12.163299570927302]
We propose a framework to integrate large-model prior into low-level computer vision tasks.
We demonstrate the effectiveness and applicability of large models in guiding low-level visual tasks.
arXiv Detail & Related papers (2023-06-28T02:02:19Z) - Bilevel Fast Scene Adaptation for Low-Light Image Enhancement [50.639332885989255]
Enhancing images in low-light scenes is a challenging but widely concerned task in the computer vision.
Main obstacle lies in the modeling conundrum from distribution discrepancy across different scenes.
We introduce the bilevel paradigm to model the above latent correspondence.
A bilevel learning framework is constructed to endow the scene-irrelevant generality of the encoder towards diverse scenes.
arXiv Detail & Related papers (2023-06-02T08:16:21Z) - Benchmarking performance of object detection under image distortions in
an uncontrolled environment [0.483420384410068]
robustness of object detection algorithms plays a prominent role in real-world applications.
It has been proven that the performance of object detection methods suffers from in-capture distortions.
We present a performance evaluation framework for the state-of-the-art object detection methods.
arXiv Detail & Related papers (2022-10-28T09:06:52Z) - Exploring Resolution and Degradation Clues as Self-supervised Signal for
Low Quality Object Detection [77.3530907443279]
We propose a novel self-supervised framework to detect objects in degraded low resolution images.
Our methods has achieved superior performance compared with existing methods when facing variant degradation situations.
arXiv Detail & Related papers (2022-08-05T09:36:13Z) - Salient Objects in Clutter [130.63976772770368]
This paper identifies and addresses a serious design bias of existing salient object detection (SOD) datasets.
This design bias has led to a saturation in performance for state-of-the-art SOD models when evaluated on existing datasets.
We propose a new high-quality dataset and update the previous saliency benchmark.
arXiv Detail & Related papers (2021-05-07T03:49:26Z) - Object Detection and Recognition of Swap-Bodies using Camera mounted on
a Vehicle [13.702911401489427]
This project aims to jointly perform object detection of a swap-body and to find the type of swap-body by reading an ILU code.
Recent research activities have drastically improved deep learning techniques which proves to enhance the field of computer vision.
arXiv Detail & Related papers (2020-04-17T08:49:54Z) - Two-shot Spatially-varying BRDF and Shape Estimation [89.29020624201708]
We propose a novel deep learning architecture with a stage-wise estimation of shape and SVBRDF.
We create a large-scale synthetic training dataset with domain-randomized geometry and realistic materials.
Experiments on both synthetic and real-world datasets show that our network trained on a synthetic dataset can generalize well to real-world images.
arXiv Detail & Related papers (2020-04-01T12:56:13Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.