Fully-Automated Packaging Structure Recognition in Logistics
Environments
- URL: http://arxiv.org/abs/2008.04620v1
- Date: Tue, 11 Aug 2020 10:57:23 GMT
- Title: Fully-Automated Packaging Structure Recognition in Logistics
Environments
- Authors: Laura D\"orr, Felix Brandt, Martin Pouls, Alexander Naumann
- Abstract summary: We propose a method for complete automation of packaging structure recognition.
Our algorithm is based on deep learning models, more precisely convolutional neural networks for instance segmentation in images.
We show that the solution is capable of correctly recognizing the packaging structure in approximately 85% of our test cases, and even more (91%) when focusing on most common package types.
- Score: 60.56493342808093
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Within a logistics supply chain, a large variety of transported goods need to
be handled, recognized and checked at many different network points. Often,
huge manual effort is involved in recognizing or verifying packet identity or
packaging structure, for instance to check the delivery for completeness. We
propose a method for complete automation of packaging structure recognition:
Based on a single image, one or multiple transport units are localized and, for
each of these transport units, the characteristics, the total number and the
arrangement of its packaging units is recognized. Our algorithm is based on
deep learning models, more precisely convolutional neural networks for instance
segmentation in images, as well as computer vision methods and heuristic
components. We use a custom data set of realistic logistics images for training
and evaluation of our method. We show that the solution is capable of correctly
recognizing the packaging structure in approximately 85% of our test cases, and
even more (91%) when focusing on most common package types.
Related papers
- Towards Packaging Unit Detection for Automated Palletizing Tasks [5.235268087662475]
We propose an approach to this challenging problem that is fully trained on synthetically generated data.
The proposed approach is able to handle sparse and low quality sensor data.
We conduct an extensive evaluation on real-world data with a wide range of different retail products.
arXiv Detail & Related papers (2023-08-11T15:37:38Z) - VISTA: Vision Transformer enhanced by U-Net and Image Colorfulness Frame
Filtration for Automatic Retail Checkout [0.7250756081498245]
We propose to segment and classify individual frames from a video sequence.
The segmentation method consists of a unified single product item- and hand-segmentation followed by entropy masking.
Our best system achieves 3rd place in the AI City Challenge 2022 Track 4 with an F1 score of 0.4545.
arXiv Detail & Related papers (2022-04-23T08:54:28Z) - An Improved Deep Learning Approach For Product Recognition on Racks in
Retail Stores [2.470815298095903]
Automated product recognition in retail stores is an important real-world application in the domain of Computer Vision and Pattern Recognition.
We develop a two-stage object detection and recognition pipeline comprising of a Faster-RCNN-based object localizer and a ResNet-18-based image encoder.
Each of the models is fine-tuned using appropriate data sets for better prediction and data augmentation is performed on each query image to prepare an extensive gallery set for fine-tuning the ResNet-18-based product recognition model.
arXiv Detail & Related papers (2022-02-26T06:51:36Z) - Joint Learning of Neural Transfer and Architecture Adaptation for Image
Recognition [77.95361323613147]
Current state-of-the-art visual recognition systems rely on pretraining a neural network on a large-scale dataset and finetuning the network weights on a smaller dataset.
In this work, we prove that dynamically adapting network architectures tailored for each domain task along with weight finetuning benefits in both efficiency and effectiveness.
Our method can be easily generalized to an unsupervised paradigm by replacing supernet training with self-supervised learning in the source domain tasks and performing linear evaluation in the downstream tasks.
arXiv Detail & Related papers (2021-03-31T08:15:17Z) - Simple multi-dataset detection [83.9604523643406]
We present a simple method for training a unified detector on multiple large-scale datasets.
We show how to automatically integrate dataset-specific outputs into a common semantic taxonomy.
Our approach does not require manual taxonomy reconciliation.
arXiv Detail & Related papers (2021-02-25T18:55:58Z) - Instance Localization for Self-supervised Detection Pretraining [68.24102560821623]
We propose a new self-supervised pretext task, called instance localization.
We show that integration of bounding boxes into pretraining promotes better task alignment and architecture alignment for transfer learning.
Experimental results demonstrate that our approach yields state-of-the-art transfer learning results for object detection.
arXiv Detail & Related papers (2021-02-16T17:58:57Z) - An Image Processing Pipeline for Automated Packaging Structure
Recognition [60.56493342808093]
We propose a cognitive system for the fully automated recognition of packaging structures for standardized logistics shipments based on single RGB images.
Our contribution contains descriptions of a suitable system design and its evaluation on relevant real-world data.
arXiv Detail & Related papers (2020-09-29T07:26:08Z) - Fusion of Global-Local Features for Image Quality Inspection of Shipping
Label [6.458496335718508]
We propose an input image quality verification method combining global and local features.
The experimental results regarding real captured and generated images show that the proposed method achieves better performance than other methods.
arXiv Detail & Related papers (2020-08-26T08:25:34Z) - Instance-Aware Graph Convolutional Network for Multi-Label
Classification [55.131166957803345]
Graph convolutional neural network (GCN) has effectively boosted the multi-label image recognition task.
We propose an instance-aware graph convolutional neural network (IA-GCN) framework for multi-label classification.
arXiv Detail & Related papers (2020-08-19T12:49:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.