Advanced Data Augmentation Approaches: A Comprehensive Survey and Future
directions
- URL: http://arxiv.org/abs/2301.02830v1
- Date: Sat, 7 Jan 2023 11:37:32 GMT
- Title: Advanced Data Augmentation Approaches: A Comprehensive Survey and Future
directions
- Authors: Teerath Kumar, Muhammad Turab, Kislay Raj, Alessandra Mileo, Rob
Brennan and Malika Bendechache
- Abstract summary: We provide a background of data augmentation, a novel and comprehensive taxonomy of reviewed data augmentation techniques, and the strengths and weaknesses (wherever possible) of each technique.
We also provide comprehensive results of the data augmentation effect on three popular computer vision tasks, such as image classification, object detection and semantic segmentation.
- Score: 57.30984060215482
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Deep learning (DL) algorithms have shown significant performance in various
computer vision tasks. However, having limited labelled data lead to a network
overfitting problem, where network performance is bad on unseen data as
compared to training data. Consequently, it limits performance improvement. To
cope with this problem, various techniques have been proposed such as dropout,
normalization and advanced data augmentation. Among these, data augmentation,
which aims to enlarge the dataset size by including sample diversity, has been
a hot topic in recent times. In this article, we focus on advanced data
augmentation techniques. we provide a background of data augmentation, a novel
and comprehensive taxonomy of reviewed data augmentation techniques, and the
strengths and weaknesses (wherever possible) of each technique. We also provide
comprehensive results of the data augmentation effect on three popular computer
vision tasks, such as image classification, object detection and semantic
segmentation. For results reproducibility, we compiled available codes of all
data augmentation techniques. Finally, we discuss the challenges and
difficulties, and possible future direction for the research community. We
believe, this survey provides several benefits i) readers will understand the
data augmentation working mechanism to fix overfitting problems ii) results
will save the searching time of the researcher for comparison purposes. iii)
Codes of the mentioned data augmentation techniques are available at
https://github.com/kmr2017/Advanced-Data-augmentation-codes iv) Future work
will spark interest in research community.
Related papers
- A Comprehensive Survey on Data Augmentation [55.355273602421384]
Data augmentation is a technique that generates high-quality artificial data by manipulating existing data samples.
Existing literature surveys only focus on a certain type of specific modality data.
We propose a more enlightening taxonomy that encompasses data augmentation techniques for different common data modalities.
arXiv Detail & Related papers (2024-05-15T11:58:08Z) - A survey of synthetic data augmentation methods in computer vision [0.0]
This paper presents an extensive review of synthetic data augmentation techniques.
We focus on the important data generation and augmentation techniques, general scope of application and specific use-cases.
We provide a summary of common synthetic datasets for training computer vision models.
arXiv Detail & Related papers (2024-03-15T07:34:08Z) - Data Augmentation in Human-Centric Vision [54.97327269866757]
This survey presents a comprehensive analysis of data augmentation techniques in human-centric vision tasks.
It delves into a wide range of research areas including person ReID, human parsing, human pose estimation, and pedestrian detection.
Our work categorizes data augmentation methods into two main types: data generation and data perturbation.
arXiv Detail & Related papers (2024-03-13T16:05:18Z) - A Survey on Data Augmentation in Large Model Era [16.05117556207015]
Large models, encompassing large language and diffusion models, have shown exceptional promise in approximating human-level intelligence.
With continuous updates to these models, the existing reservoir of high-quality data may soon be depleted.
This paper offers an exhaustive review of large model-driven data augmentation methods.
arXiv Detail & Related papers (2024-01-27T14:19:33Z) - A Comprehensive Survey of Dataset Distillation [73.15482472726555]
It has become challenging to handle the unlimited growth of data with limited computing power.
Deep learning technology has developed unprecedentedly in the last decade.
This paper provides a holistic understanding of dataset distillation from multiple aspects.
arXiv Detail & Related papers (2023-01-13T15:11:38Z) - Research Trends and Applications of Data Augmentation Algorithms [77.34726150561087]
We identify the main areas of application of data augmentation algorithms, the types of algorithms used, significant research trends, their progression over time and research gaps in data augmentation literature.
We expect readers to understand the potential of data augmentation, as well as identify future research directions and open questions within data augmentation research.
arXiv Detail & Related papers (2022-07-18T11:38:32Z) - Image Data Augmentation for Deep Learning: A Survey [8.817690876855728]
We systematically review different image data augmentation methods.
We propose a taxonomy of reviewed methods and present the strengths and limitations of these methods.
We also conduct extensive experiments with various data augmentation methods on three typical computer vision tasks.
arXiv Detail & Related papers (2022-04-19T02:05:56Z) - Data Augmentation for Deep Graph Learning: A Survey [66.04015540536027]
We first propose a taxonomy for graph data augmentation and then provide a structured review by categorizing the related work based on the augmented information modalities.
Focusing on the two challenging problems in DGL (i.e., optimal graph learning and low-resource graph learning), we also discuss and review the existing learning paradigms which are based on graph data augmentation.
arXiv Detail & Related papers (2022-02-16T18:30:33Z) - Complex Wavelet SSIM based Image Data Augmentation [0.0]
We look at the MNIST handwritten dataset an image dataset used for digit recognition.
We take a detailed look into one of the most popular augmentation techniques used for this data set elastic deformation.
We propose to use a similarity measure called Complex Wavelet Structural Similarity Index Measure (CWSSIM) to selectively filter out the irrelevant data.
arXiv Detail & Related papers (2020-07-11T21:11:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.