Related papers: A novel method for object detection using deep learning and CAD models

A novel method for object detection using deep learning and CAD models

URL: http://arxiv.org/abs/2102.06729v1
Date: Fri, 12 Feb 2021 19:19:45 GMT
Title: A novel method for object detection using deep learning and CAD models
Authors: Igor Garcia Ballhausen Sampaio and Luigy Machaca and Jos\'e Viterbo and Joris Gu\'erin
Abstract summary: Object Detection (OD) is an important computer vision problem for industry, which can be used for quality control in the production lines. Recently, Deep Learning (DL) methods have enabled practitioners to train OD models performing well on complex real world images. In this paper, we introduce a fully automated method that uses a CAD model of an object and returns a fully trained OD model for detecting this object.
Score: 0.4588028371034407
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Object Detection (OD) is an important computer vision problem for industry, which can be used for quality control in the production lines, among other applications. Recently, Deep Learning (DL) methods have enabled practitioners to train OD models performing well on complex real world images. However, the adoption of these models in industry is still limited by the difficulty and the significant cost of collecting high quality training datasets. On the other hand, when applying OD to the context of production lines, CAD models of the objects to be detected are often available. In this paper, we introduce a fully automated method that uses a CAD model of an object and returns a fully trained OD model for detecting this object. To do this, we created a Blender script that generates realistic labeled datasets of images containing the object, which are then used for training the OD model. The method is validated experimentally on two practical examples, showing that this approach can generate OD models performing well on real images, while being trained only on synthetic images. The proposed method has potential to facilitate the adoption of object detection models in industry as it is easy to adapt for new objects and highly flexible. Hence, it can result in significant costs reduction, gains in productivity and improved products quality.

Related papers

Where's the liability in the Generative Era? Recovery-based Black-Box Detection of AI-Generated Content [42.68683643671603]
We introduce a novel black box detection framework that requires only API access.<n>We measure the likelihood that the image was generated by the model itself.<n>For black-box models that do not support masked image inputs, we incorporate a cost efficient surrogate model trained to align with the target model distribution.
arXiv Detail & Related papers (2025-05-02T05:11:35Z)
Fully-Synthetic Training for Visual Quality Inspection in Automotive Production [0.4915744683251149]
We propose a pipeline for generating synthetic images using domain randomization. We evaluate our approach in three real inspection scenarios and demonstrate that an object detection model trained solely on synthetic data can outperform models trained on real images.
arXiv Detail & Related papers (2025-03-12T12:58:30Z)
Zero-Shot Object-Centric Representation Learning [72.43369950684057]
We study current object-centric methods through the lens of zero-shot generalization. We introduce a benchmark comprising eight different synthetic and real-world datasets. We find that training on diverse real-world images improves transferability to unseen scenarios.
arXiv Detail & Related papers (2024-08-17T10:37:07Z)
NeuSurfEmb: A Complete Pipeline for Dense Correspondence-based 6D Object Pose Estimation without CAD Models [34.898217885820614]
We present a pipeline that does not require CAD models and allows training a state-of-the-art pose estimator requiring only a small set of real images as input. Our method is based on a NeuS2 object representation, that we learn through a semi-automated procedure based on Structure-from-Motion (SfM) and object-agnostic segmentation. We evaluate our method on the LINEMOD-Occlusion dataset, extensively studying the impact of its individual components and showing competitive performance with respect to approaches based on CAD models and PBR data.
arXiv Detail & Related papers (2024-07-16T22:48:22Z)
Photogrammetry for Digital Twinning Industry 4.0 (I4) Systems [0.43127334486935653]
Digital Twins (DT) are transformational technology that leverage software systems to replicate physical process behavior. This paper aims to explore the use of photogrammetry and 3D scanning techniques to create accurate visual representation of the 'Physical Process' The results indicate that photogrammetry using consumer-grade devices can be an efficient and cost-efficient approach to creating DTs for smart manufacturing.
arXiv Detail & Related papers (2024-07-12T04:51:19Z)
SOAR: Advancements in Small Body Object Detection for Aerial Imagery Using State Space Models and Programmable Gradients [0.8873228457453465]
Small object detection in aerial imagery presents significant challenges in computer vision. Traditional methods using transformer-based models often face limitations stemming from the lack of specialized databases. This paper introduces two innovative approaches that significantly enhance detection and segmentation capabilities for small aerial objects.
arXiv Detail & Related papers (2024-05-02T19:47:08Z)
DIO: Dataset of 3D Mesh Models of Indoor Objects for Robotics and Computer Vision Applications [17.637438333501628]
The creation of accurate virtual models of real-world objects is imperative to robotic simulations and applications such as computer vision. This paper documents the different methods employed for generating a database of mesh models of real-world objects.
arXiv Detail & Related papers (2024-02-19T04:58:40Z)
MegaPose: 6D Pose Estimation of Novel Objects via Render & Compare [84.80956484848505]
MegaPose is a method to estimate the 6D pose of novel objects, that is, objects unseen during training. We present a 6D pose refiner based on a render&compare strategy which can be applied to novel objects. Second, we introduce a novel approach for coarse pose estimation which leverages a network trained to classify whether the pose error between a synthetic rendering and an observed image of the same object can be corrected by the refiner.
arXiv Detail & Related papers (2022-12-13T19:30:03Z)
Bridging the Gap to Real-World Object-Centric Learning [66.55867830853803]
We show that reconstructing features from models trained in a self-supervised manner is a sufficient training signal for object-centric representations to arise in a fully unsupervised way. Our approach, DINOSAUR, significantly out-performs existing object-centric learning models on simulated data.
arXiv Detail & Related papers (2022-09-29T15:24:47Z)
Few-Cost Salient Object Detection with Adversarial-Paced Learning [95.0220555274653]
This paper proposes to learn the effective salient object detection model based on the manual annotation on a few training images only. We name this task as the few-cost salient object detection and propose an adversarial-paced learning (APL)-based framework to facilitate the few-cost learning scenario.
arXiv Detail & Related papers (2021-04-05T14:15:49Z)
Supervised Training of Dense Object Nets using Optimal Descriptors for Industrial Robotic Applications [57.87136703404356]
Dense Object Nets (DONs) by Florence, Manuelli and Tedrake introduced dense object descriptors as a novel visual object representation for the robotics community. In this paper we show that given a 3D model of an object, we can generate its descriptor space image, which allows for supervised training of DONs. We compare the training methods on generating 6D grasps for industrial objects and show that our novel supervised training approach improves the pick-and-place performance in industry-relevant tasks.
arXiv Detail & Related papers (2021-02-16T11:40:12Z)
Object Detection and Recognition of Swap-Bodies using Camera mounted on a Vehicle [13.702911401489427]
This project aims to jointly perform object detection of a swap-body and to find the type of swap-body by reading an ILU code. Recent research activities have drastically improved deep learning techniques which proves to enhance the field of computer vision.
arXiv Detail & Related papers (2020-04-17T08:49:54Z)
CPS++: Improving Class-level 6D Pose and Shape Estimation From Monocular Images With Self-Supervised Learning [74.53664270194643]
Modern monocular 6D pose estimation methods can only cope with a handful of object instances. We propose a novel method for class-level monocular 6D pose estimation, coupled with metric shape retrieval. We experimentally demonstrate that we can retrieve precise 6D poses and metric shapes from a single RGB image.
arXiv Detail & Related papers (2020-03-12T15:28:13Z)

This list is automatically generated from the titles and abstracts of the papers in this site.