Data Augmentation: a Combined Inductive-Deductive Approach featuring
Answer Set Programming
- URL: http://arxiv.org/abs/2310.14413v1
- Date: Sun, 22 Oct 2023 21:02:26 GMT
- Title: Data Augmentation: a Combined Inductive-Deductive Approach featuring
Answer Set Programming
- Authors: Pierangela Bruno, Francesco Calimeri, Cinzia Marte and Simona Perri
- Abstract summary: We propose a framework for synthesizing photo-realistic images via advanced Data Augmentation techniques.
The resulting labeled images undergo a dedicated process based on Deep Learning in charge of creating photo-realistic images that comply with the generated label.
- Score: 1.1060425537315088
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Although the availability of a large amount of data is usually given for
granted, there are relevant scenarios where this is not the case; for instance,
in the biomedical/healthcare domain, some applications require to build huge
datasets of proper images, but the acquisition of such images is often hard for
different reasons (e.g., accessibility, costs, pathology-related variability),
thus causing limited and usually imbalanced datasets. Hence, the need for
synthesizing photo-realistic images via advanced Data Augmentation techniques
is crucial. In this paper we propose a hybrid inductive-deductive approach to
the problem; in particular, starting from a limited set of real labeled images,
the proposed framework makes use of logic programs for declaratively specifying
the structure of new images, that is guaranteed to comply with both a set of
constraints coming from the domain knowledge and some specific desiderata. The
resulting labeled images undergo a dedicated process based on Deep Learning in
charge of creating photo-realistic images that comply with the generated label.
Related papers
- Flow Matching for Medical Image Synthesis: Bridging the Gap Between Speed and Quality [1.0922645515457987]
We propose the use of an optimal transport flow matching approach to accelerate image generation.
By introducing a straighter mapping between the source and target distribution, our method significantly reduces inference time.
Our results demonstrate the efficiency and versatility of this framework, making it a promising advancement for medical imaging applications.
arXiv Detail & Related papers (2025-03-01T00:49:47Z) - MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training [62.843316348659165]
Deep learning-based image matching algorithms have dramatically outperformed humans in rapidly and accurately finding large amounts of correspondences.
We propose a large-scale pre-training framework that utilizes synthetic cross-modal training signals to train models to recognize and match fundamental structures across images.
Our key finding is that the matching model trained with our framework achieves remarkable generalizability across more than eight unseen cross-modality registration tasks.
arXiv Detail & Related papers (2025-01-13T18:37:36Z) - Efficient MedSAMs: Segment Anything in Medical Images on Laptop [69.28565867103542]
We organized the first international competition dedicated to promptable medical image segmentation.
The top teams developed lightweight segmentation foundation models and implemented an efficient inference pipeline.
The best-performing algorithms have been incorporated into the open-source software with a user-friendly interface to facilitate clinical adoption.
arXiv Detail & Related papers (2024-12-20T17:33:35Z) - Generative AI Enables Medical Image Segmentation in Ultra Low-Data Regimes [35.151834585823224]
We introduce a generative deep learning framework, which uniquely generates high-quality paired segmentation masks and medical images.
Unlike traditional generative models that treat data generation and segmentation model training as separate processes, our method employs multi-level optimization for end-to-end data generation.
Our method demonstrated strong generalization performance across 9 diverse medical image segmentation tasks and on 16 datasets, in ultra-low data regimes.
arXiv Detail & Related papers (2024-08-30T17:11:36Z) - Image-GS: Content-Adaptive Image Representation via 2D Gaussians [55.15950594752051]
We propose Image-GS, a content-adaptive image representation.
Using anisotropic 2D Gaussians as the basis, Image-GS shows high memory efficiency, supports fast random access, and offers a natural level of detail stack.
General efficiency and fidelity of Image-GS are validated against several recent neural image representations and industry-standard texture compressors.
We hope this research offers insights for developing new applications that require adaptive quality and resource control, such as machine perception, asset streaming, and content generation.
arXiv Detail & Related papers (2024-07-02T00:45:21Z) - Discriminative Hamiltonian Variational Autoencoder for Accurate Tumor Segmentation in Data-Scarce Regimes [2.8498944632323755]
We propose an end-to-end hybrid architecture for medical image segmentation.
We use Hamiltonian Variational Autoencoders (HVAE) and a discriminative regularization to improve the quality of generated images.
Our architecture operates on a slice-by-slice basis to segment 3D volumes, capitilizing on the richly augmented dataset.
arXiv Detail & Related papers (2024-06-17T15:42:08Z) - An expert-driven data generation pipeline for histological images [7.219732640188684]
We propose a novel pipeline for generating synthetic datasets for cell segmentation.
Given only a handful of annotated images, our method generates a large dataset which can be used to train DL instance segmentation models.
arXiv Detail & Related papers (2024-06-03T15:05:08Z) - Exposure Bracketing Is All You Need For A High-Quality Image [50.822601495422916]
Multi-exposure images are complementary in denoising, deblurring, high dynamic range imaging, and super-resolution.
We propose to utilize exposure bracketing photography to get a high-quality image by combining these tasks in this work.
In particular, a temporally modulated recurrent network (TMRNet) and self-supervised adaptation method are proposed.
arXiv Detail & Related papers (2024-01-01T14:14:35Z) - Learned representation-guided diffusion models for large-image generation [58.192263311786824]
We introduce a novel approach that trains diffusion models conditioned on embeddings from self-supervised learning (SSL)
Our diffusion models successfully project these features back to high-quality histopathology and remote sensing images.
Augmenting real data by generating variations of real images improves downstream accuracy for patch-level and larger, image-scale classification tasks.
arXiv Detail & Related papers (2023-12-12T14:45:45Z) - Evaluating Data Attribution for Text-to-Image Models [62.844382063780365]
We evaluate attribution through "customization" methods, which tune an existing large-scale model toward a given exemplar object or style.
Our key insight is that this allows us to efficiently create synthetic images that are computationally influenced by the exemplar by construction.
By taking into account the inherent uncertainty of the problem, we can assign soft attribution scores over a set of training images.
arXiv Detail & Related papers (2023-06-15T17:59:51Z) - Cones 2: Customizable Image Synthesis with Multiple Subjects [50.54010141032032]
We study how to efficiently represent a particular subject as well as how to appropriately compose different subjects.
By rectifying the activations in the cross-attention map, the layout appoints and separates the location of different subjects in the image.
arXiv Detail & Related papers (2023-05-30T18:00:06Z) - This Intestine Does Not Exist: Multiscale Residual Variational
Autoencoder for Realistic Wireless Capsule Endoscopy Image Generation [7.430724826764835]
A novel Variational Autoencoder architecture is proposed, namely "This Intestine Does not Exist" (TIDE)
The proposed architecture comprises multiscale feature extraction convolutional blocks and residual connections, which enable the generation of high-quality and diverse datasets.
Contrary to the current approaches, which are oriented towards the augmentation of the available datasets, this study demonstrates that using TIDE, real WCE datasets can be fully substituted.
arXiv Detail & Related papers (2023-02-04T11:49:38Z) - Retrieval-based Spatially Adaptive Normalization for Semantic Image
Synthesis [68.1281982092765]
We propose a novel normalization module, termed as REtrieval-based Spatially AdaptIve normaLization (RESAIL)
RESAIL provides pixel level fine-grained guidance to the normalization architecture.
Experiments on several challenging datasets show that our RESAIL performs favorably against state-of-the-arts in terms of quantitative metrics, visual quality, and subjective evaluation.
arXiv Detail & Related papers (2022-04-06T14:21:39Z) - Label Assistant: A Workflow for Assisted Data Annotation in Image
Segmentation Tasks [0.8135412538980286]
We propose a generic workflow to assist the annotation process and discuss methods on an abstract level.
Thereby, we review the possibilities of focusing on promising samples, image pre-processing, pre-labeling, label inspection, or post-processing of annotations.
In addition, we present an implementation of the proposal by means of a developed flexible and extendable software prototype nested in hybrid touchscreen/laptop device.
arXiv Detail & Related papers (2021-11-27T19:08:25Z) - Learning Deformable Image Registration from Optimization: Perspective,
Modules, Bilevel Training and Beyond [62.730497582218284]
We develop a new deep learning based framework to optimize a diffeomorphic model via multi-scale propagation.
We conduct two groups of image registration experiments on 3D volume datasets including image-to-atlas registration on brain MRI data and image-to-image registration on liver CT data.
arXiv Detail & Related papers (2020-04-30T03:23:45Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.