A novel method for object detection using deep learning and CAD models
- URL: http://arxiv.org/abs/2102.06729v1
- Date: Fri, 12 Feb 2021 19:19:45 GMT
- Title: A novel method for object detection using deep learning and CAD models
- Authors: Igor Garcia Ballhausen Sampaio and Luigy Machaca and Jos\'e Viterbo
and Joris Gu\'erin
- Abstract summary: Object Detection (OD) is an important computer vision problem for industry, which can be used for quality control in the production lines.
Recently, Deep Learning (DL) methods have enabled practitioners to train OD models performing well on complex real world images.
In this paper, we introduce a fully automated method that uses a CAD model of an object and returns a fully trained OD model for detecting this object.
- Score: 0.4588028371034407
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Object Detection (OD) is an important computer vision problem for industry,
which can be used for quality control in the production lines, among other
applications. Recently, Deep Learning (DL) methods have enabled practitioners
to train OD models performing well on complex real world images. However, the
adoption of these models in industry is still limited by the difficulty and the
significant cost of collecting high quality training datasets. On the other
hand, when applying OD to the context of production lines, CAD models of the
objects to be detected are often available. In this paper, we introduce a fully
automated method that uses a CAD model of an object and returns a fully trained
OD model for detecting this object. To do this, we created a Blender script
that generates realistic labeled datasets of images containing the object,
which are then used for training the OD model. The method is validated
experimentally on two practical examples, showing that this approach can
generate OD models performing well on real images, while being trained only on
synthetic images. The proposed method has potential to facilitate the adoption
of object detection models in industry as it is easy to adapt for new objects
and highly flexible. Hence, it can result in significant costs reduction, gains
in productivity and improved products quality.
Related papers
- Zero-Shot Object-Centric Representation Learning [72.43369950684057]
We study current object-centric methods through the lens of zero-shot generalization.
We introduce a benchmark comprising eight different synthetic and real-world datasets.
We find that training on diverse real-world images improves transferability to unseen scenarios.
arXiv Detail & Related papers (2024-08-17T10:37:07Z) - NeuSurfEmb: A Complete Pipeline for Dense Correspondence-based 6D Object Pose Estimation without CAD Models [34.898217885820614]
We present a pipeline that does not require CAD models and allows training a state-of-the-art pose estimator requiring only a small set of real images as input.
Our method is based on a NeuS2 object representation, that we learn through a semi-automated procedure based on Structure-from-Motion (SfM) and object-agnostic segmentation.
We evaluate our method on the LINEMOD-Occlusion dataset, extensively studying the impact of its individual components and showing competitive performance with respect to approaches based on CAD models and PBR data.
arXiv Detail & Related papers (2024-07-16T22:48:22Z) - Photogrammetry for Digital Twinning Industry 4.0 (I4) Systems [0.43127334486935653]
Digital Twins (DT) are transformational technology that leverage software systems to replicate physical process behavior.
This paper aims to explore the use of photogrammetry and 3D scanning techniques to create accurate visual representation of the 'Physical Process'
The results indicate that photogrammetry using consumer-grade devices can be an efficient and cost-efficient approach to creating DTs for smart manufacturing.
arXiv Detail & Related papers (2024-07-12T04:51:19Z) - SOAR: Advancements in Small Body Object Detection for Aerial Imagery Using State Space Models and Programmable Gradients [0.8873228457453465]
Small object detection in aerial imagery presents significant challenges in computer vision.
Traditional methods using transformer-based models often face limitations stemming from the lack of specialized databases.
This paper introduces two innovative approaches that significantly enhance detection and segmentation capabilities for small aerial objects.
arXiv Detail & Related papers (2024-05-02T19:47:08Z) - DIO: Dataset of 3D Mesh Models of Indoor Objects for Robotics and
Computer Vision Applications [17.637438333501628]
The creation of accurate virtual models of real-world objects is imperative to robotic simulations and applications such as computer vision.
This paper documents the different methods employed for generating a database of mesh models of real-world objects.
arXiv Detail & Related papers (2024-02-19T04:58:40Z) - MegaPose: 6D Pose Estimation of Novel Objects via Render & Compare [84.80956484848505]
MegaPose is a method to estimate the 6D pose of novel objects, that is, objects unseen during training.
We present a 6D pose refiner based on a render&compare strategy which can be applied to novel objects.
Second, we introduce a novel approach for coarse pose estimation which leverages a network trained to classify whether the pose error between a synthetic rendering and an observed image of the same object can be corrected by the refiner.
arXiv Detail & Related papers (2022-12-13T19:30:03Z) - Bridging the Gap to Real-World Object-Centric Learning [66.55867830853803]
We show that reconstructing features from models trained in a self-supervised manner is a sufficient training signal for object-centric representations to arise in a fully unsupervised way.
Our approach, DINOSAUR, significantly out-performs existing object-centric learning models on simulated data.
arXiv Detail & Related papers (2022-09-29T15:24:47Z) - Few-Cost Salient Object Detection with Adversarial-Paced Learning [95.0220555274653]
This paper proposes to learn the effective salient object detection model based on the manual annotation on a few training images only.
We name this task as the few-cost salient object detection and propose an adversarial-paced learning (APL)-based framework to facilitate the few-cost learning scenario.
arXiv Detail & Related papers (2021-04-05T14:15:49Z) - Supervised Training of Dense Object Nets using Optimal Descriptors for
Industrial Robotic Applications [57.87136703404356]
Dense Object Nets (DONs) by Florence, Manuelli and Tedrake introduced dense object descriptors as a novel visual object representation for the robotics community.
In this paper we show that given a 3D model of an object, we can generate its descriptor space image, which allows for supervised training of DONs.
We compare the training methods on generating 6D grasps for industrial objects and show that our novel supervised training approach improves the pick-and-place performance in industry-relevant tasks.
arXiv Detail & Related papers (2021-02-16T11:40:12Z) - Object Detection and Recognition of Swap-Bodies using Camera mounted on
a Vehicle [13.702911401489427]
This project aims to jointly perform object detection of a swap-body and to find the type of swap-body by reading an ILU code.
Recent research activities have drastically improved deep learning techniques which proves to enhance the field of computer vision.
arXiv Detail & Related papers (2020-04-17T08:49:54Z) - CPS++: Improving Class-level 6D Pose and Shape Estimation From Monocular
Images With Self-Supervised Learning [74.53664270194643]
Modern monocular 6D pose estimation methods can only cope with a handful of object instances.
We propose a novel method for class-level monocular 6D pose estimation, coupled with metric shape retrieval.
We experimentally demonstrate that we can retrieve precise 6D poses and metric shapes from a single RGB image.
arXiv Detail & Related papers (2020-03-12T15:28:13Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.