CD-COCO: A Versatile Complex Distorted COCO Database for
Scene-Context-Aware Computer Vision
- URL: http://arxiv.org/abs/2311.06976v1
- Date: Sun, 12 Nov 2023 22:28:19 GMT
- Title: CD-COCO: A Versatile Complex Distorted COCO Database for
Scene-Context-Aware Computer Vision
- Authors: Ayman Beghdadi, Azeddine Beghdadi, Malik Mallem, Lotfi Beji, Faouzi
Alaya Cheikh
- Abstract summary: Image acquisition conditions have a major impact on the performance of high-level image processing.
We build a versatile database derived from MS-COCO database.
New local distortions are generated by considering the scene context.
- Score: 6.48583124646155
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: The recent development of deep learning methods applied to vision has enabled
their increasing integration into real-world applications to perform complex
Computer Vision (CV) tasks. However, image acquisition conditions have a major
impact on the performance of high-level image processing. A possible solution
to overcome these limitations is to artificially augment the training databases
or to design deep learning models that are robust to signal distortions. We opt
here for the first solution by enriching the database with complex and
realistic distortions which were ignored until now in the existing databases.
To this end, we built a new versatile database derived from the well-known
MS-COCO database to which we applied local and global photo-realistic
distortions. These new local distortions are generated by considering the scene
context of the images that guarantees a high level of photo-realism.
Distortions are generated by exploiting the depth information of the objects in
the scene as well as their semantics. This guarantees a high level of
photo-realism and allows to explore real scenarios ignored in conventional
databases dedicated to various CV applications. Our versatile database offers
an efficient solution to improve the robustness of various CV tasks such as
Object Detection (OD), scene segmentation, and distortion-type classification
methods. The image database, scene classification index, and distortion
generation codes are publicly available
\footnote{\url{https://github.com/Aymanbegh/CD-COCO}}
Related papers
- Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities [88.398085358514]
Contrastive Deepfake Embeddings (CoDE) is a novel embedding space specifically designed for deepfake detection.
CoDE is trained via contrastive learning by additionally enforcing global-local similarities.
arXiv Detail & Related papers (2024-07-29T18:00:10Z) - An Ensemble Model for Distorted Images in Real Scenarios [0.0]
In this paper, we apply the object detector YOLOv7 to detect distorted images from the CDCOCO dataset.
Through carefully designed optimizations, our model achieves excellent performance on the CDCOCO test set.
Our denoising detection model can denoise and repair distorted images, making the model useful in a variety of real-world scenarios and environments.
arXiv Detail & Related papers (2023-09-26T15:12:55Z) - Disruptive Autoencoders: Leveraging Low-level features for 3D Medical
Image Pre-training [51.16994853817024]
This work focuses on designing an effective pre-training framework for 3D radiology images.
We introduce Disruptive Autoencoders, a pre-training framework that attempts to reconstruct the original image from disruptions created by a combination of local masking and low-level perturbations.
The proposed pre-training framework is tested across multiple downstream tasks and achieves state-of-the-art performance.
arXiv Detail & Related papers (2023-07-31T17:59:42Z) - SIDAR: Synthetic Image Dataset for Alignment & Restoration [2.9649783577150837]
There is a lack of datasets that provide enough data to train and evaluate end-to-end deep learning models.
Our proposed data augmentation helps to overcome the issue of data scarcity by using 3D rendering.
The resulting dataset can serve as a training and evaluation set for a multitude of tasks involving image alignment and artifact removal.
arXiv Detail & Related papers (2023-05-19T23:32:06Z) - Benchmarking performance of object detection under image distortions in
an uncontrolled environment [0.483420384410068]
robustness of object detection algorithms plays a prominent role in real-world applications.
It has been proven that the performance of object detection methods suffers from in-capture distortions.
We present a performance evaluation framework for the state-of-the-art object detection methods.
arXiv Detail & Related papers (2022-10-28T09:06:52Z) - Single Image Internal Distribution Measurement Using Non-Local
Variational Autoencoder [11.985083962982909]
This paper proposes a novel image-specific solution, namely non-local variational autoencoder (textttNLVAE)
textttNLVAE is introduced as a self-supervised strategy that reconstructs high-resolution images using disentangled information from the non-local neighbourhood.
Experimental results from seven benchmark datasets demonstrate the effectiveness of the textttNLVAE model.
arXiv Detail & Related papers (2022-04-02T18:43:55Z) - Enhancing Low-Light Images in Real World via Cross-Image Disentanglement [58.754943762945864]
We propose a new low-light image enhancement dataset consisting of misaligned training images with real-world corruptions.
Our model achieves state-of-the-art performances on both the newly proposed dataset and other popular low-light datasets.
arXiv Detail & Related papers (2022-01-10T03:12:52Z) - Region-level Active Learning for Cluttered Scenes [60.93811392293329]
We introduce a new strategy that subsumes previous Image-level and Object-level approaches into a generalized, Region-level approach.
We show that this approach significantly decreases labeling effort and improves rare object search on realistic data with inherent class-imbalance and cluttered scenes.
arXiv Detail & Related papers (2021-08-20T14:02:38Z) - Salient Objects in Clutter [130.63976772770368]
This paper identifies and addresses a serious design bias of existing salient object detection (SOD) datasets.
This design bias has led to a saturation in performance for state-of-the-art SOD models when evaluated on existing datasets.
We propose a new high-quality dataset and update the previous saliency benchmark.
arXiv Detail & Related papers (2021-05-07T03:49:26Z) - Deep Traffic Sign Detection and Recognition Without Target Domain Real
Images [52.079665469286496]
We propose a novel database generation method that requires no real image from the target-domain, and (ii) templates of the traffic signs.
The method does not aim at overcoming the training with real data, but to be a compatible alternative when the real data is not available.
On large data sets, training with a fully synthetic data set almost matches the performance of training with a real one.
arXiv Detail & Related papers (2020-07-30T21:06:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.