Image Data Augmentation for Deep Learning: A Survey
- URL: http://arxiv.org/abs/2204.08610v2
- Date: Sun, 5 Nov 2023 11:00:23 GMT
- Title: Image Data Augmentation for Deep Learning: A Survey
- Authors: Suorong Yang, Weikang Xiao, Mengchen Zhang, Suhan Guo, Jian Zhao and
Furao Shen
- Abstract summary: We systematically review different image data augmentation methods.
We propose a taxonomy of reviewed methods and present the strengths and limitations of these methods.
We also conduct extensive experiments with various data augmentation methods on three typical computer vision tasks.
- Score: 8.817690876855728
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Deep learning has achieved remarkable results in many computer vision tasks.
Deep neural networks typically rely on large amounts of training data to avoid
overfitting. However, labeled data for real-world applications may be limited.
By improving the quantity and diversity of training data, data augmentation has
become an inevitable part of deep learning model training with image data.
As an effective way to improve the sufficiency and diversity of training
data, data augmentation has become a necessary part of successful application
of deep learning models on image data. In this paper, we systematically review
different image data augmentation methods. We propose a taxonomy of reviewed
methods and present the strengths and limitations of these methods. We also
conduct extensive experiments with various data augmentation methods on three
typical computer vision tasks, including semantic segmentation, image
classification and object detection. Finally, we discuss current challenges
faced by data augmentation and future research directions to put forward some
useful research guidance.
Related papers
- A Review of Image Retrieval Techniques: Data Augmentation and Adversarial Learning Approaches [0.0]
This review focuses on the roles of data augmentation and adversarial learning techniques in enhancing retrieval performance.
Data augmentation enhances the model's generalization ability and robustness by generating more diverse training samples, simulating real-world variations, and reducing overfitting.
adversarial attacks and defenses introduce perturbations during training to improve the model's robustness against potential attacks.
arXiv Detail & Related papers (2024-09-02T12:55:17Z) - Colorful Cutout: Enhancing Image Data Augmentation with Curriculum Learning [8.406910685074136]
In this study, we adopt curriculum data augmentation for image data augmentation and propose colorful cutout.
Our experimental results highlight the possibility of curriculum data augmentation for image data.
arXiv Detail & Related papers (2024-03-29T06:53:52Z) - Deep Domain Adaptation: A Sim2Real Neural Approach for Improving Eye-Tracking Systems [80.62854148838359]
Eye image segmentation is a critical step in eye tracking that has great influence over the final gaze estimate.
We use dimensionality-reduction techniques to measure the overlap between the target eye images and synthetic training data.
Our methods result in robust, improved performance when tackling the discrepancy between simulation and real-world data samples.
arXiv Detail & Related papers (2024-03-23T22:32:06Z) - Domain Generalization for Mammographic Image Analysis with Contrastive
Learning [62.25104935889111]
The training of an efficacious deep learning model requires large data with diverse styles and qualities.
A novel contrastive learning is developed to equip the deep learning models with better style generalization capability.
The proposed method has been evaluated extensively and rigorously with mammograms from various vendor style domains and several public datasets.
arXiv Detail & Related papers (2023-04-20T11:40:21Z) - Harnessing the Power of Text-image Contrastive Models for Automatic
Detection of Online Misinformation [50.46219766161111]
We develop a self-learning model to explore the constrastive learning in the domain of misinformation identification.
Our model shows the superior performance of non-matched image-text pair detection when the training data is insufficient.
arXiv Detail & Related papers (2023-04-19T02:53:59Z) - Diffusion-based Data Augmentation for Skin Disease Classification:
Impact Across Original Medical Datasets to Fully Synthetic Images [2.5075774184834803]
Deep neural networks still rely on large amounts of training data to avoid overfitting.
Labeled training data for real-world applications such as healthcare is limited and difficult to access.
We build upon the emerging success of text-to-image diffusion probabilistic models in augmenting the training samples of our macroscopic skin disease dataset.
arXiv Detail & Related papers (2023-01-12T04:22:23Z) - Advanced Data Augmentation Approaches: A Comprehensive Survey and Future
directions [57.30984060215482]
We provide a background of data augmentation, a novel and comprehensive taxonomy of reviewed data augmentation techniques, and the strengths and weaknesses (wherever possible) of each technique.
We also provide comprehensive results of the data augmentation effect on three popular computer vision tasks, such as image classification, object detection and semantic segmentation.
arXiv Detail & Related papers (2023-01-07T11:37:32Z) - Image Augmentation for Multitask Few-Shot Learning: Agricultural Domain
Use-Case [0.0]
This paper challenges small and imbalanced datasets based on the example of a plant phenomics domain.
We introduce an image augmentation framework, which enables us to extremely enlarge the number of training samples.
We prove that our augmentation method increases model performance when only a few training samples are available.
arXiv Detail & Related papers (2021-02-24T14:08:34Z) - Data Augmentation for Meta-Learning [58.47185740820304]
meta-learning algorithms sample data, query data, and tasks on each training step.
Data augmentation can be used not only to expand the number of images available per class, but also to generate entirely new classes/tasks.
Our proposed meta-specific data augmentation significantly improves the performance of meta-learners on few-shot classification benchmarks.
arXiv Detail & Related papers (2020-10-14T13:48:22Z) - Deep Traffic Sign Detection and Recognition Without Target Domain Real
Images [52.079665469286496]
We propose a novel database generation method that requires no real image from the target-domain, and (ii) templates of the traffic signs.
The method does not aim at overcoming the training with real data, but to be a compatible alternative when the real data is not available.
On large data sets, training with a fully synthetic data set almost matches the performance of training with a real one.
arXiv Detail & Related papers (2020-07-30T21:06:47Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.