New Benchmark for Household Garbage Image Recognition
- URL: http://arxiv.org/abs/2202.11878v1
- Date: Thu, 24 Feb 2022 03:07:59 GMT
- Title: New Benchmark for Household Garbage Image Recognition
- Authors: Zhize Wu, Huanyi Li, Xiaofeng Wang, Zijun Wu, Le Zou, Lixiang Xu, and
Ming Tan
- Abstract summary: We build a new benchmark dataset for household garbage image classification by simulating different lightings, backgrounds, angles, and shapes.
This dataset is named 30 Classes of Household Garbage Images (HGI-30), which contains 18,000 images of 30 household garbage classes.
- Score: 6.304975225537251
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Household garbage images are usually faced with complex backgrounds, variable
illuminations, diverse angles, and changeable shapes, which bring a great
difficulty in garbage image classification. Due to the ability to discover
problem-specific features, deep learning and especially convolutional neural
networks (CNNs) have been successfully and widely used for image representation
learning. However, available and stable household garbage datasets are
insufficient, which seriously limits the development of research and
application. Besides, the state of the art in the field of garbage image
classification is not entirely clear. To solve this problem, in this study, we
built a new open benchmark dataset for household garbage image classification
by simulating different lightings, backgrounds, angles, and shapes. This
dataset is named 30 Classes of Household Garbage Images (HGI-30), which
contains 18,000 images of 30 household garbage classes. The publicly available
HGI-30 dataset allows researchers to develop accurate and robust methods for
household garbage recognition. We also conducted experiments and performance
analysis of the state-of-the-art deep CNN methods on HGI-30, which serves as
baseline results on this benchmark.
Related papers
- A Large-scale AI-generated Image Inpainting Benchmark [11.216906046169683]
We propose a methodology for creating high-quality inpainting datasets and apply it to create DiQuID.
DiQuID comprises over 95,000 inpainted images generated from 78,000 original images sourced from MS-COCO, RAISE, and OpenImages.
We provide comprehensive benchmarking results using state-of-the-art forgery detection methods, demonstrating the dataset's effectiveness in evaluating and improving detection algorithms.
arXiv Detail & Related papers (2025-02-10T15:56:28Z) - GRIN: Zero-Shot Metric Depth with Pixel-Level Diffusion [27.35300492569507]
We present GRIN, an efficient diffusion model designed to ingest sparse unstructured training data.
We show that GRIN establishes a new state of the art in zero-shot metric monocular depth estimation even when trained from scratch.
arXiv Detail & Related papers (2024-09-15T23:32:04Z) - The identification of garbage dumps in the rural areas of Cyprus through
the application of deep learning to satellite imagery [0.0]
The aim of this study was to investigate the degree to which artificial intelligence techniques can be used to identify illegal garbage dumps in the rural areas of Cyprus.
It involved collecting a novel dataset of images that could be categorised as either containing, or not containing, garbage.
An artificial neural network was trained to recognise the presence or absence of garbage in new images.
arXiv Detail & Related papers (2023-07-23T05:24:20Z) - Exploring Incompatible Knowledge Transfer in Few-shot Image Generation [107.81232567861117]
Few-shot image generation learns to generate diverse and high-fidelity images from a target domain using a few reference samples.
Existing F SIG methods select, preserve and transfer prior knowledge from a source generator to learn the target generator.
We propose knowledge truncation, which is a complementary operation to knowledge preservation and is implemented by a lightweight pruning-based method.
arXiv Detail & Related papers (2023-04-15T14:57:15Z) - Deep Image Deblurring: A Survey [165.32391279761006]
Deblurring is a classic problem in low-level computer vision, which aims to recover a sharp image from a blurred input image.
Recent advances in deep learning have led to significant progress in solving this problem.
arXiv Detail & Related papers (2022-01-26T01:31:30Z) - Unsupervised Part Discovery from Contrastive Reconstruction [90.88501867321573]
The goal of self-supervised visual representation learning is to learn strong, transferable image representations.
We propose an unsupervised approach to object part discovery and segmentation.
Our method yields semantic parts consistent across fine-grained but visually distinct categories.
arXiv Detail & Related papers (2021-11-11T17:59:42Z) - Low-Rank Subspaces in GANs [101.48350547067628]
This work introduces low-rank subspaces that enable more precise control of GAN generation.
LowRankGAN is able to find the low-dimensional representation of attribute manifold.
Experiments on state-of-the-art GAN models (including StyleGAN2 and BigGAN) trained on various datasets demonstrate the effectiveness of our LowRankGAN.
arXiv Detail & Related papers (2021-06-08T16:16:32Z) - Salient Objects in Clutter [130.63976772770368]
This paper identifies and addresses a serious design bias of existing salient object detection (SOD) datasets.
This design bias has led to a saturation in performance for state-of-the-art SOD models when evaluated on existing datasets.
We propose a new high-quality dataset and update the previous saliency benchmark.
arXiv Detail & Related papers (2021-05-07T03:49:26Z) - An Efficient Framework for Zero-Shot Sketch-Based Image Retrieval [36.254157442709264]
Zero-shot Sketch-based Image Retrieval (ZS-SBIR) has attracted the attention of the computer vision community due to it's real-world applications.
ZS-SBIR inherits the main challenges of multiple computer vision problems including content-based Image Retrieval (CBIR), zero-shot learning and domain adaptation.
arXiv Detail & Related papers (2021-02-08T06:10:37Z) - An Evolution of CNN Object Classifiers on Low-Resolution Images [0.4129225533930965]
Object classification from low-quality images is difficult for the variance of object colors, aspect ratios, and cluttered backgrounds.
Deep convolutional neural networks (DCNNs) have been demonstrated as very powerful systems for facing the challenge of object classification from high-resolution images.
In this paper, we investigate an optimal architecture that accurately classifies low-quality images using DCNNs architectures.
arXiv Detail & Related papers (2021-01-03T18:44:23Z) - GraspNet: A Large-Scale Clustered and Densely Annotated Dataset for
Object Grasping [49.777649953381676]
We contribute a large-scale grasp pose detection dataset with a unified evaluation system.
Our dataset contains 87,040 RGBD images with over 370 million grasp poses.
arXiv Detail & Related papers (2019-12-31T18:15:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.