Productive Reproducible Workflows for DNNs: A Case Study for Industrial
Defect Detection
- URL: http://arxiv.org/abs/2206.09359v1
- Date: Sun, 19 Jun 2022 09:10:13 GMT
- Title: Productive Reproducible Workflows for DNNs: A Case Study for Industrial
Defect Detection
- Authors: Perry Gibson, Jos\'e Cano
- Abstract summary: This paper presents a case study where we discuss our recent experience producing an end-to-end artificial intelligence application for industrial defect detection.
We detail the high level deep learning libraries, containerized, continuous integration/deployment pipelines, and open source code templates we leveraged to produce a competitive result.
We highlight the value that exploiting such systems can bring, even for research, and present our best results in terms of accuracy and inference time.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: As Deep Neural Networks (DNNs) have become an increasingly ubiquitous
workload, the range of libraries and tooling available to aid in their
development and deployment has grown significantly. Scalable, production
quality tools are freely available under permissive licenses, and are
accessible enough to enable even small teams to be very productive. However
within the research community, awareness and usage of said tools is not
necessarily widespread, and researchers may be missing out on potential
productivity gains from exploiting the latest tools and workflows. This paper
presents a case study where we discuss our recent experience producing an
end-to-end artificial intelligence application for industrial defect detection.
We detail the high level deep learning libraries, containerized workflows,
continuous integration/deployment pipelines, and open source code templates we
leveraged to produce a competitive result, matching the performance of other
ranked solutions to our three target datasets. We highlight the value that
exploiting such systems can bring, even for research, and detail our solution
and present our best results in terms of accuracy and inference time on a
server class GPU, as well as inference times on a server class CPU, and a
Raspberry Pi 4.
Related papers
- DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM Workflows [72.40917624485822]
We introduce DataDreamer, an open source Python library that allows researchers to implement powerful large language models.
DataDreamer also helps researchers adhere to best practices that we propose to encourage open science.
arXiv Detail & Related papers (2024-02-16T00:10:26Z) - SERL: A Software Suite for Sample-Efficient Robotic Reinforcement
Learning [85.21378553454672]
We develop a library containing a sample efficient off-policy deep RL method, together with methods for computing rewards and resetting the environment.
We find that our implementation can achieve very efficient learning, acquiring policies for PCB board assembly, cable routing, and object relocation.
These policies achieve perfect or near-perfect success rates, extreme robustness even under perturbations, and exhibit emergent robustness recovery and correction behaviors.
arXiv Detail & Related papers (2024-01-29T10:01:10Z) - A Survey of Serverless Machine Learning Model Inference [0.0]
Generative AI, Computer Vision, and Natural Language Processing have led to an increased integration of AI models into various products.
This survey aims to summarize and categorize the emerging challenges and optimization opportunities for large-scale deep learning serving systems.
arXiv Detail & Related papers (2023-11-22T18:46:05Z) - Multi-Fidelity Active Learning with GFlowNets [65.91555804996203]
We propose a multi-fidelity active learning algorithm with GFlowNets as a sampler, to efficiently discover diverse, high-scoring candidates.
Our evaluation on molecular discovery tasks shows that multi-fidelity active learning with GFlowNets can discover high-scoring candidates at a fraction of the budget of its single-fidelity counterpart.
arXiv Detail & Related papers (2023-06-20T17:43:42Z) - Katakomba: Tools and Benchmarks for Data-Driven NetHack [52.0035089982277]
NetHack is known as the frontier of reinforcement learning research.
We argue that there are three major obstacles for adoption: resource-wise, implementation-wise, and benchmark-wise.
We develop an open-source library that provides workflow fundamentals familiar to the offline reinforcement learning community.
arXiv Detail & Related papers (2023-06-14T22:50:25Z) - The MIT Supercloud Workload Classification Challenge [10.458111248130944]
In this paper, we present a workload classification challenge based on the MIT Supercloud dataset.
The goal of this challenge is to foster algorithmic innovations in the analysis of compute workloads.
arXiv Detail & Related papers (2022-04-12T14:28:04Z) - Toward Fault Detection in Industrial Welding Processes with Deep
Learning and Data Augmentation [0.0]
This paper addresses the challenges on the industrial realization of the AI tools.
We use object detection algorithms from the object detection API and adapt them to our use case using transfer learning.
We find that moderate scaling of the dataset via image augmentation leads to improvements in intersection over union (IoU) and recall.
arXiv Detail & Related papers (2021-06-18T14:52:49Z) - Automated Machine Learning Techniques for Data Streams [91.3755431537592]
This paper surveys the state-of-the-art open-source AutoML tools, applies them to data collected from streams, and measures how their performance changes over time.
The results show that off-the-shelf AutoML tools can provide satisfactory results but in the presence of concept drift, detection or adaptation techniques have to be applied to maintain the predictive accuracy over time.
arXiv Detail & Related papers (2021-06-14T11:42:46Z) - Building Robust Industrial Applicable Object Detection Models Using
Transfer Learning and Single Pass Deep Learning Architectures [1.1816942730023883]
We explore how deep convolutional neural networks dedicated to the task of object detection can improve our industrial-oriented object detection pipelines.
By using a deep learning architecture that integrates region proposals, classification and probability estimation in a single run, we aim at obtaining real-time performance.
We apply these algorithms to two industrially relevant applications, one being the detection of promotion boards in eye tracking data and the other detecting and recognizing packages of warehouse products for augmented advertisements.
arXiv Detail & Related papers (2020-07-09T09:50:45Z) - Deep Learning for Ultra-Reliable and Low-Latency Communications in 6G
Networks [84.2155885234293]
We first summarize how to apply data-driven supervised deep learning and deep reinforcement learning in URLLC.
To address these open problems, we develop a multi-level architecture that enables device intelligence, edge intelligence, and cloud intelligence for URLLC.
arXiv Detail & Related papers (2020-02-22T14:38:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.