Estimating the electrical power output of industrial devices with
end-to-end time-series classification in the presence of label noise
- URL: http://arxiv.org/abs/2105.00349v1
- Date: Sat, 1 May 2021 21:45:42 GMT
- Title: Estimating the electrical power output of industrial devices with
end-to-end time-series classification in the presence of label noise
- Authors: Andrea Castellani, Sebastian Schmitt, and Barbara Hammer
- Abstract summary: In this work, we focus on estimating the power output of a Combined Heat and Power machine of a medium-sized company facility.
As the facility is fully instrumented and sensor measurements from the CHP are available, we generate the training labels in an automated fashion.
We propose a novel multi-task deep learning approach that jointly trains a classifier and an autoencoder with a shared embedding representation.
- Score: 6.857190736208506
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: In complex industrial settings, it is common practice to monitor the
operation of machines in order to detect undesired states, adjust maintenance
schedules, optimize system performance or collect usage statistics of
individual machines. In this work, we focus on estimating the power output of a
Combined Heat and Power (CHP) machine of a medium-sized company facility by
analyzing the total facility power consumption. We formulate the problem as a
time-series classification problem where the class label represents the CHP
power output. As the facility is fully instrumented and sensor measurements
from the CHP are available, we generate the training labels in an automated
fashion from the CHP sensor readings. However, sensor failures result in
mislabeled training data samples which are hard to detect and remove from the
dataset. Therefore, we propose a novel multi-task deep learning approach that
jointly trains a classifier and an autoencoder with a shared embedding
representation. The proposed approach targets to gradually correct the
mislabelled data samples during training in a self-supervised fashion, without
any prior assumption on the amount of label noise. We benchmark our approach on
several time-series classification datasets and find it to be comparable and
sometimes better than state-of-the-art methods. On the real-world use-case of
predicting the CHP power output, we thoroughly evaluate the architectural
design choices and show that the final architecture considerably increases the
robustness of the learning process and consistently beats other recent
state-of-the-art algorithms in the presence of unstructured as well as
structured label noise.
Related papers
- Improving a Named Entity Recognizer Trained on Noisy Data with a Few
Clean Instances [55.37242480995541]
We propose to denoise noisy NER data with guidance from a small set of clean instances.
Along with the main NER model we train a discriminator model and use its outputs to recalibrate the sample weights.
Results on public crowdsourcing and distant supervision datasets show that the proposed method can consistently improve performance with a small guidance set.
arXiv Detail & Related papers (2023-10-25T17:23:37Z) - Predictive Maintenance Model Based on Anomaly Detection in Induction
Motors: A Machine Learning Approach Using Real-Time IoT Data [0.0]
In this work, we demonstrate a novel anomaly detection system on induction motors used in pumps, compressors, fans, and other industrial machines.
We use a combination of pre-processing techniques and machine learning (ML) models with a low computational cost.
arXiv Detail & Related papers (2023-10-15T18:43:45Z) - Unsupervised clustering of disturbances in power systems via deep
convolutional autoencoders [2.0736732081151366]
Power quality (PQ) events are recorded by PQ meters whenever anomalous events are detected on the power grid.
Many of the waveforms captured during a disturbance in the power system need to be labeled for supervised learning.
This paper presents an autoencoder and K-means clustering-based unsupervised technique that can be used to cluster PQ events.
arXiv Detail & Related papers (2023-06-08T04:41:34Z) - Task-Oriented Over-the-Air Computation for Multi-Device Edge AI [57.50247872182593]
6G networks for supporting edge AI features task-oriented techniques that focus on effective and efficient execution of AI task.
Task-oriented over-the-air computation (AirComp) scheme is proposed in this paper for multi-device split-inference system.
arXiv Detail & Related papers (2022-11-02T16:35:14Z) - Attentive Prototypes for Source-free Unsupervised Domain Adaptive 3D
Object Detection [85.11649974840758]
3D object detection networks tend to be biased towards the data they are trained on.
We propose a single-frame approach for source-free, unsupervised domain adaptation of lidar-based 3D object detectors.
arXiv Detail & Related papers (2021-11-30T18:42:42Z) - S3: Supervised Self-supervised Learning under Label Noise [53.02249460567745]
In this paper we address the problem of classification in the presence of label noise.
In the heart of our method is a sample selection mechanism that relies on the consistency between the annotated label of a sample and the distribution of the labels in its neighborhood in the feature space.
Our method significantly surpasses previous methods on both CIFARCIFAR100 with artificial noise and real-world noisy datasets such as WebVision and ANIMAL-10N.
arXiv Detail & Related papers (2021-11-22T15:49:20Z) - Deep Recurrent Semi-Supervised EEG Representation Learning for Emotion
Recognition [14.67085109524245]
EEG-based emotion recognition often requires sufficient labeled training samples to build an effective computational model.
We propose a semi-supervised pipeline to jointly exploit both unlabeled and labeled data for learning EEG representations.
We test our framework on the large-scale SEED EEG dataset and compare our results with several other popular semi-supervised methods.
arXiv Detail & Related papers (2021-07-28T17:21:30Z) - Automated Label Generation for Time Series Classification with
Representation Learning: Reduction of Label Cost for Training [16.287885535569067]
We propose a method to auto-generate labels of un-labelled time-series.
Our method is based on representation learning using Auto Encoded Compact Sequence.
It performs self-labelled in iterations, by learning latent structure, as well as synthetically boosting representative time-series.
arXiv Detail & Related papers (2021-07-12T14:28:40Z) - Autoencoder-based Representation Learning from Heterogeneous
Multivariate Time Series Data of Mechatronic Systems [0.0]
We present a method for unsupervised feature extraction using autoencoder networks that specifically addresses the heterogeneous nature of the database.
Three public datasets of mechatronic systems from different application domains are used to validate the results.
arXiv Detail & Related papers (2021-04-06T21:04:27Z) - Unsupervised Domain Adaptation for Acoustic Scene Classification Using
Band-Wise Statistics Matching [69.24460241328521]
Machine learning algorithms can be negatively affected by mismatches between training (source) and test (target) data distributions.
We propose an unsupervised domain adaptation method that consists of aligning the first- and second-order sample statistics of each frequency band of target-domain acoustic scenes to the ones of the source-domain training dataset.
We show that the proposed method outperforms the state-of-the-art unsupervised methods found in the literature in terms of both source- and target-domain classification accuracy.
arXiv Detail & Related papers (2020-04-30T23:56:05Z) - EHSOD: CAM-Guided End-to-end Hybrid-Supervised Object Detection with
Cascade Refinement [53.69674636044927]
We present EHSOD, an end-to-end hybrid-supervised object detection system.
It can be trained in one shot on both fully and weakly-annotated data.
It achieves comparable results on multiple object detection benchmarks with only 30% fully-annotated data.
arXiv Detail & Related papers (2020-02-18T08:04:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.