Analog and Multi-modal Manufacturing Datasets Acquired on the Future
Factories Platform
- URL: http://arxiv.org/abs/2401.15544v1
- Date: Sun, 28 Jan 2024 02:26:58 GMT
- Title: Analog and Multi-modal Manufacturing Datasets Acquired on the Future
Factories Platform
- Authors: Ramy Harik, Fadi El Kalach, Jad Samaha, Devon Clark, Drew Sander,
Philip Samaha, Liam Burns, Ibrahim Yousif, Victor Gadow, Theodros Tarekegne,
Nitol Saha
- Abstract summary: Two industry-grade datasets are presented in this paper.
They were collected at the Future Factories Lab at the University of South Carolina on December 11th and 12th of 2023.
- Score: 0.0
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: Two industry-grade datasets are presented in this paper that were collected
at the Future Factories Lab at the University of South Carolina on December
11th and 12th of 2023. These datasets are generated by a manufacturing assembly
line that utilizes industrial standards with respect to actuators, control
mechanisms, and transducers. The two datasets were both generated
simultaneously by operating the assembly line for 30 consecutive hours (with
minor filtering) and collecting data from sensors equipped throughout the
system. During operation, defects were also introduced into the assembly
operation by manually removing parts needed for the final assembly. The
datasets generated include a time series analog dataset and the other is a time
series multi-modal dataset which includes images of the system alongside the
analog data. These datasets were generated with the objective of providing
tools to further the research towards enhancing intelligence in manufacturing.
Real manufacturing datasets can be scarce let alone datasets with anomalies or
defects. As such these datasets hope to address this gap and provide
researchers with a foundation to build and train Artificial Intelligence models
applicable for the manufacturing industry. Finally, these datasets are the
first iteration of published data from the future Factories lab and can be
further adjusted to fit more researchers needs moving forward.
Related papers
- Tackling Data Heterogeneity in Federated Time Series Forecasting [61.021413959988216]
Time series forecasting plays a critical role in various real-world applications, including energy consumption prediction, disease transmission monitoring, and weather forecasting.
Most existing methods rely on a centralized training paradigm, where large amounts of data are collected from distributed devices to a central cloud server.
We propose a novel framework, Fed-TREND, to address data heterogeneity by generating informative synthetic data as auxiliary knowledge carriers.
arXiv Detail & Related papers (2024-11-24T04:56:45Z) - A Benchmark Time Series Dataset for Semiconductor Fabrication Manufacturing Constructed using Component-based Discrete-Event Simulation Models [0.0]
This research is based on a benchmark model of an Intel semiconductor fabrication factory.
The time series dataset is constructed using discrete-event time trajectories.
The dataset can also be utilized in the machine learning community for behavioral analysis.
arXiv Detail & Related papers (2024-08-17T23:05:47Z) - ConRebSeg: A Segmentation Dataset for Reinforced Concrete Construction [4.32009010195029]
We provide a dataset of 14,805 RGB images with segmentation labels for reinforced concrete construction.
We investigate the influence of data availability and label inconsistencies on the performance of models.
We advocate for a crowd-sourced dataset to boost the development of autonomous robotic systems in the construction industry.
arXiv Detail & Related papers (2024-07-12T15:53:15Z) - IPAD: Industrial Process Anomaly Detection Dataset [71.39058003212614]
Video anomaly detection (VAD) is a challenging task aiming to recognize anomalies in video frames.
We propose a new dataset, IPAD, specifically designed for VAD in industrial scenarios.
This dataset covers 16 different industrial devices and contains over 6 hours of both synthetic and real-world video footage.
arXiv Detail & Related papers (2024-04-23T13:38:01Z) - A Systematic Review of Available Datasets in Additive Manufacturing [56.684125592242445]
In-situ monitoring incorporating visual and other sensor technologies allows the collection of extensive datasets during the Additive Manufacturing process.
These datasets have potential for determining the quality of the manufactured output and the detection of defects through the use of Machine Learning.
This systematic review investigates the availability of open image-based datasets originating from AM processes that align with a number of pre-defined selection criteria.
arXiv Detail & Related papers (2024-01-27T16:13:32Z) - Open-sourced Data Ecosystem in Autonomous Driving: the Present and Future [130.87142103774752]
This review systematically assesses over seventy open-source autonomous driving datasets.
It offers insights into various aspects, such as the principles underlying the creation of high-quality datasets.
It also delves into the scientific and technical challenges that warrant resolution.
arXiv Detail & Related papers (2023-12-06T10:46:53Z) - Dataset Factory: A Toolchain For Generative Computer Vision Datasets [0.9013233848500058]
We propose a "dataset factory" that separates the storage and processing of samples from metadata.
This enables data-centric operations at scale for machine learning teams and individual researchers.
arXiv Detail & Related papers (2023-09-20T19:43:37Z) - Linking the Dynamic PicoProbe Analytical Electron-Optical Beam Line /
Microscope to Supercomputers [39.52789559084336]
Dynamic PicoProbe at Argonne National Laboratory is undergoing upgrades that will enable it to produce up to 100s of GB of data per day.
While this data is highly important for both fundamental science and industrial applications, there is currently limited on-site infrastructure to handle these high-volume data streams.
We address this problem by providing a software architecture capable of supporting neighboring large-scale data transfers to neighboring supercomputers at the Argonne Leadership Computing Facility.
This infrastructure supports expected workloads and also provides domain scientists the ability to reinterrogate data from past experiments to yield additional scientific value and derive new insights.
arXiv Detail & Related papers (2023-08-25T23:07:58Z) - TSGM: A Flexible Framework for Generative Modeling of Synthetic Time Series [61.436361263605114]
Time series data are often scarce or highly sensitive, which precludes the sharing of data between researchers and industrial organizations.
We introduce Time Series Generative Modeling (TSGM), an open-source framework for the generative modeling of synthetic time series.
arXiv Detail & Related papers (2023-05-19T10:11:21Z) - Challenges of the Creation of a Dataset for Vision Based Human Hand
Action Recognition in Industrial Assembly [0.0]
This dataset consists of 12 classes with 459,180 images in the basic version and 2,295,900 images after spatial augmentation.
It has an above-average duration and meets the technical and legal requirements for industrial assembly lines.
The recorded ground truth assembly classes were selected after extensive observation of real-world use cases.
arXiv Detail & Related papers (2023-03-07T07:57:12Z) - TRoVE: Transforming Road Scene Datasets into Photorealistic Virtual
Environments [84.6017003787244]
This work proposes a synthetic data generation pipeline to address the difficulties and domain-gaps present in simulated datasets.
We show that using annotations and visual cues from existing datasets, we can facilitate automated multi-modal data generation.
arXiv Detail & Related papers (2022-08-16T20:46:08Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.