DTGen: Generative Diffusion-Based Few-Shot Data Augmentation for Fine-Grained Dirty Tableware Recognition
- URL: http://arxiv.org/abs/2509.11661v1
- Date: Mon, 15 Sep 2025 07:59:34 GMT
- Title: DTGen: Generative Diffusion-Based Few-Shot Data Augmentation for Fine-Grained Dirty Tableware Recognition
- Authors: Lifei Hao, Yue Cheng, Baoqi Huang, Bing Jia, Xuandong Zhao,
- Abstract summary: We propose DTGen, a few-shot data augmentation scheme for fine-grained dirty tableware recognition.<n>Under extremely limited real few-shot conditions, DTGen can synthesize virtually unlimited high-quality samples.<n> DTGen provides a feasible deployment path for automated tableware cleaning and food safety monitoring.
- Score: 37.947945188644624
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Intelligent tableware cleaning is a critical application in food safety and smart homes, but existing methods are limited by coarse-grained classification and scarcity of few-shot data, making it difficult to meet industrialization requirements. We propose DTGen, a few-shot data augmentation scheme based on generative diffusion models, specifically designed for fine-grained dirty tableware recognition. DTGen achieves efficient domain specialization through LoRA, generates diverse dirty images via structured prompts, and ensures data quality through CLIP-based cross-modal filtering. Under extremely limited real few-shot conditions, DTGen can synthesize virtually unlimited high-quality samples, significantly improving classifier performance and supporting fine-grained dirty tableware recognition. We further elaborate on lightweight deployment strategies, promising to transfer DTGen's benefits to embedded dishwashers and integrate with cleaning programs to intelligently regulate energy consumption and detergent usage. Research results demonstrate that DTGen not only validates the value of generative AI in few-shot industrial vision but also provides a feasible deployment path for automated tableware cleaning and food safety monitoring.
Related papers
- WaterVIB: Learning Minimal Sufficient Watermark Representations via Variational Information Bottleneck [54.83612291453286]
Existing methods fail because they entangle the watermark with high-frequency cover texture, which is susceptible to being rewritten during generative purification.<n>We propose WaterVIB, a framework that reformulates the encoder as an information sieve via the Variational Information Bottleneck.<n>This effectively filters out redundant cover nuances prone to generative shifts, retaining only the essential signal invariant to regeneration.
arXiv Detail & Related papers (2026-02-25T02:38:17Z) - Granular-ball Guided Masking: Structure-aware Data Augmentation [97.18560547134587]
Granular-ball Guided Masking (GBGM) is a structure-aware augmentation strategy guided by Granular-ball Computing (GBC)<n>GBGM adaptively preserves semantically rich, structurally important regions while suppressing redundant areas through a coarse-to-fine hierarchical masking process.<n>Experiments on multiple benchmarks demonstrate consistent improvements in classification accuracy and masked image reconstruction.
arXiv Detail & Related papers (2025-12-24T07:15:33Z) - Explainable AI for Diabetic Retinopathy Detection Using Deep Learning with Attention Mechanisms and Fuzzy Logic-Based Interpretability [0.0]
This paper proposes a hybrid deep learning framework recipe for weed detection.<n>A Generative Adversarial Network (GAN)-based augmentation method was imposed to balance class robustness and better generalize the model.<n> Experimental results yield superior results with 99.33% accuracy, precision, recall, and F1-score on multi-benchmark datasets.
arXiv Detail & Related papers (2025-11-20T12:17:00Z) - A Hybrid CNN-ViT-GNN Framework with GAN-Based Augmentation for Intelligent Weed Detection in Precision Agriculture [0.0]
This paper proposes a hybrid deep learning framework recipe for weed detection.<n>A Generative Adversarial Network (GAN)-based augmentation method was imposed to balance class robustness and better generalize the model.<n> Experimental results yield superior results with 99.33% accuracy, precision, recall, and F1-score on multi-benchmark datasets.
arXiv Detail & Related papers (2025-11-19T15:32:08Z) - Artificial Intelligence in the Food Industry: Food Waste Estimation based on Computer Vision, a Brief Case Study in a University Dining Hall [1.864621482724548]
This study presents a cost-effective computer vision framework that estimates plate-level food waste.<n>Four fully supervised models were trained using a capped dynamic inverse-frequency loss and AdamW metrics.<n>All models achieved satisfying performance, and for each food type, at least one model approached or surpassed 90% DPA.
arXiv Detail & Related papers (2025-07-19T15:21:29Z) - Restoration Score Distillation: From Corrupted Diffusion Pretraining to One-Step High-Quality Generation [82.39763984380625]
We propose textitRestoration Score Distillation (RSD), a principled generalization of Denoising Score Distillation (DSD)<n>RSD accommodates a broader range of corruption types, such as blurred, incomplete, or low-resolution images.<n>It consistently surpasses its teacher model across diverse restoration tasks on both natural and scientific datasets.
arXiv Detail & Related papers (2025-05-19T17:21:03Z) - Bounding Box-Guided Diffusion for Synthesizing Industrial Images and Segmentation Map [50.21082069320818]
We propose a novel diffusion-based pipeline for generating high-fidelity industrial datasets with minimal supervision.<n>Our approach conditions the diffusion model on enriched bounding box representations to produce precise segmentation masks.<n>Results demonstrate that diffusion-based synthesis can bridge the gap between artificial and real-world industrial data.
arXiv Detail & Related papers (2025-05-06T15:21:36Z) - Denoising Score Distillation: From Noisy Diffusion Pretraining to One-Step High-Quality Generation [82.39763984380625]
We introduce denoising score distillation (DSD), a surprisingly effective and novel approach for training high-quality generative models from low-quality data.<n>DSD pretrains a diffusion model exclusively on noisy, corrupted samples and then distills it into a one-step generator capable of producing refined, clean outputs.
arXiv Detail & Related papers (2025-03-10T17:44:46Z) - WasteGAN: Data Augmentation for Robotic Waste Sorting through Generative Adversarial Networks [7.775894876221921]
We introduce a data augmentation method based on a novel GAN architecture called wasteGAN.
The proposed method allows to increase the performance of semantic segmentation models, starting from a very limited bunch of labeled examples.
We then leverage the higher-quality segmentation masks predicted from models trained on the wasteGAN synthetic data to compute semantic-aware grasp poses.
arXiv Detail & Related papers (2024-09-25T15:04:21Z) - Enhanced Droplet Analysis Using Generative Adversarial Networks [0.0]
This work develops an image generator named DropletGAN to generate images of droplets.
It is also used to develop a light droplet detector using the synthetic dataset.
To the best of our knowledge, this work stands as the first to employ a generative model for augmenting droplet detection.
arXiv Detail & Related papers (2024-02-24T21:20:53Z) - Anomaly Detection Based on Selection and Weighting in Latent Space [73.01328671569759]
We propose a novel selection-and-weighting-based anomaly detection framework called SWAD.
Experiments on both benchmark and real-world datasets have shown the effectiveness and superiority of SWAD.
arXiv Detail & Related papers (2021-03-08T10:56:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.