PyScrew: A Comprehensive Dataset Collection from Industrial Screw Driving Experiments
- URL: http://arxiv.org/abs/2505.11925v1
- Date: Sat, 17 May 2025 09:20:20 GMT
- Title: PyScrew: A Comprehensive Dataset Collection from Industrial Screw Driving Experiments
- Authors: Nikolai West, Jochen Deuse,
- Abstract summary: The collection comprises six distinct datasets with over 34,000 individual screw driving operations conducted under dual conditions.<n>We detail the standardized experimental setup used across all datasets, including hardware specifications, process phases, and data acquisition methods.<n>The data model preserves the temporal and operational structure of screw driving processes, facilitating both exploratory analysis and the development of machine learning models.
- Score: 0.8287206589886881
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This paper presents a comprehensive collection of industrial screw driving datasets designed to advance research in manufacturing process monitoring and quality control. The collection comprises six distinct datasets with over 34,000 individual screw driving operations conducted under controlled experimental conditions, capturing the multifaceted nature of screw driving processes in plastic components. Each dataset systematically investigates specific aspects: natural thread degradation patterns through repeated use (s01), variations in surface friction conditions including contamination and surface treatments (s02), diverse assembly faults with up to 27 error types (s03-s04), and fabrication parameter variations in both upper and lower workpieces through modified injection molding settings (s05-s06). We detail the standardized experimental setup used across all datasets, including hardware specifications, process phases, and data acquisition methods. The hierarchical data model preserves the temporal and operational structure of screw driving processes, facilitating both exploratory analysis and the development of machine learning models. To maximize accessibility, we provide dual access pathways: raw data through Zenodo with a persistent DOI, and a purpose-built Python library (PyScrew) that offers consistent interfaces for data loading, preprocessing, and integration with common analysis workflows. These datasets serve diverse research applications including anomaly detection, predictive maintenance, quality control system development, feature extraction methodology evaluation, and classification of specific error conditions. By addressing the scarcity of standardized, comprehensive datasets in industrial manufacturing, this collection enables reproducible research and fair comparison of analytical approaches in an area of growing importance for industrial automation.
Related papers
- Analog and Multi-modal Manufacturing Datasets Acquired on the Future Factories Platform V2 [0.0]
This paper presents two industry-grade datasets captured during an 8-hour continuous operation of a manufacturing assembly line.<n>The datasets adhere to industry standards, covering communication protocols, actuators, control mechanisms, transducers, sensors, and cameras.
arXiv Detail & Related papers (2025-02-07T15:47:27Z) - Unsupervised Multimodal Fusion of In-process Sensor Data for Advanced Manufacturing Process Monitoring [0.0]
This paper presents a novel approach to multimodal sensor data fusion in manufacturing processes.
We leverage contrastive learning techniques to correlate different data modalities without the need for labeled data.
Our approach facilitates downstream tasks such as process control, anomaly detection, and quality assurance.
arXiv Detail & Related papers (2024-10-29T21:52:04Z) - Zero-Shot Learning of Causal Models [17.427722515310606]
We learn a model capable of inferring Structural Causal Models (SCMs) in a zero-shot manner.<n>As a by-product, our approach can perform zero-shot generation of new dataset samples and intervened samples.<n>To the best of our knowledge, this is the first time that SCMs are inferred in a zero-shot manner from observations.
arXiv Detail & Related papers (2024-10-08T15:31:33Z) - BEHAVIOR Vision Suite: Customizable Dataset Generation via Simulation [57.40024206484446]
We introduce the BEHAVIOR Vision Suite (BVS), a set of tools and assets to generate fully customized synthetic data for systematic evaluation of computer vision models.
BVS supports a large number of adjustable parameters at the scene level.
We showcase three example application scenarios.
arXiv Detail & Related papers (2024-05-15T17:57:56Z) - IPAD: Industrial Process Anomaly Detection Dataset [71.39058003212614]
Video anomaly detection (VAD) is a challenging task aiming to recognize anomalies in video frames.
We propose a new dataset, IPAD, specifically designed for VAD in industrial scenarios.
This dataset covers 16 different industrial devices and contains over 6 hours of both synthetic and real-world video footage.
arXiv Detail & Related papers (2024-04-23T13:38:01Z) - Analog and Multi-modal Manufacturing Datasets Acquired on the Future
Factories Platform [0.0]
Two industry-grade datasets are presented in this paper.
They were collected at the Future Factories Lab at the University of South Carolina on December 11th and 12th of 2023.
arXiv Detail & Related papers (2024-01-28T02:26:58Z) - Integration of Domain Expert-Centric Ontology Design into the CRISP-DM for Cyber-Physical Production Systems [45.05372822216111]
Methods from Machine Learning (ML) and Data Mining (DM) have proven to be promising in extracting complex and hidden patterns from the data collected.
However, such data-driven projects, usually performed with the Cross-Industry Standard Process for Data Mining (CRISPDM), often fail due to the disproportionate amount of time needed for understanding and preparing the data.
This contribution intends present an integrated approach so that data scientists are able to more quickly and reliably gain insights into the CPPS challenges.
arXiv Detail & Related papers (2023-07-21T15:04:00Z) - Deep Learning based pipeline for anomaly detection and quality
enhancement in industrial binder jetting processes [68.8204255655161]
Anomaly detection describes methods of finding abnormal states, instances or data points that differ from a normal value space.
This paper contributes to a data-centric way of approaching artificial intelligence in industrial production.
arXiv Detail & Related papers (2022-09-21T08:14:34Z) - TRoVE: Transforming Road Scene Datasets into Photorealistic Virtual
Environments [84.6017003787244]
This work proposes a synthetic data generation pipeline to address the difficulties and domain-gaps present in simulated datasets.
We show that using annotations and visual cues from existing datasets, we can facilitate automated multi-modal data generation.
arXiv Detail & Related papers (2022-08-16T20:46:08Z) - Synthesizing Rolling Bearing Fault Samples in New Conditions: A
framework based on a modified CGAN [1.0569625612398386]
Bearing fault diagnosis and condition monitoring is essential for reducing operational costs and downtime in numerous industries.
In this paper, a novel algorithm based on Conditional Generative Adversarial Networks (CGANs) is trained on the normal and fault data on any actual fault conditions.
The proposed method is validated on a real-world bearing dataset, and fault data are generated for different conditions.
arXiv Detail & Related papers (2022-06-24T04:47:08Z) - Detecting Faults during Automatic Screwdriving: A Dataset and Use Case
of Anomaly Detection for Automatic Screwdriving [80.6725125503521]
Data-driven approaches, using Machine Learning (ML) for detecting faults have recently gained increasing interest.
We present a use case of using ML models for detecting faults during automated screwdriving operations.
arXiv Detail & Related papers (2021-07-05T11:46:00Z) - AURSAD: Universal Robot Screwdriving Anomaly Detection Dataset [80.6725125503521]
This report describes a dataset created using a UR3e series robot and OnRobot Screwdriver.
The resulting data contains 2042 samples of normal and anomalous robot operation.
Brief ML benchmarks using this data are also provided, showcasing the data's suitability and potential for further analysis and experimentation.
arXiv Detail & Related papers (2021-02-02T09:59:23Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.