Online Active Learning for Soft Sensor Development using Semi-Supervised
Autoencoders
- URL: http://arxiv.org/abs/2212.13067v3
- Date: Sun, 9 Apr 2023 21:09:43 GMT
- Title: Online Active Learning for Soft Sensor Development using Semi-Supervised
Autoencoders
- Authors: Davide Cacciarelli, Murat Kulahci, John Tyssedal
- Abstract summary: Data-driven soft sensors are extensively used in industrial and chemical processes to predict hard-to-measure process variables.
Active learning methods can be highly beneficial as they can suggest the most informative labels to query.
In this work, we adapt some of these approaches to the stream-based scenario and show how they can be used to select the most informative data points.
- Score: 0.7734726150561089
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Data-driven soft sensors are extensively used in industrial and chemical
processes to predict hard-to-measure process variables whose real value is
difficult to track during routine operations. The regression models used by
these sensors often require a large number of labeled examples, yet obtaining
the label information can be very expensive given the high time and cost
required by quality inspections. In this context, active learning methods can
be highly beneficial as they can suggest the most informative labels to query.
However, most of the active learning strategies proposed for regression focus
on the offline setting. In this work, we adapt some of these approaches to the
stream-based scenario and show how they can be used to select the most
informative data points. We also demonstrate how to use a semi-supervised
architecture based on orthogonal autoencoders to learn salient features in a
lower dimensional space. The Tennessee Eastman Process is used to compare the
predictive performance of the proposed approaches.
Related papers
- Learning with Less: Knowledge Distillation from Large Language Models via Unlabeled Data [54.934578742209716]
In real-world NLP applications, Large Language Models (LLMs) offer promising solutions due to their extensive training on vast datasets.
LLKD is an adaptive sample selection method that incorporates signals from both the teacher and student.
Our comprehensive experiments show that LLKD achieves superior performance across various datasets with higher data efficiency.
arXiv Detail & Related papers (2024-11-12T18:57:59Z) - A deep latent variable model for semi-supervised multi-unit soft sensing in industrial processes [0.0]
We introduce a deep latent variable model for semi-supervised multi-unit soft sensing.
This hierarchical, generative model is able to jointly model different units, as well as learning from both labeled and unlabeled data.
We show that by combining semi-supervised and multi-task learning, the proposed model achieves superior results.
arXiv Detail & Related papers (2024-07-18T09:13:22Z) - An Experimental Design Framework for Label-Efficient Supervised Finetuning of Large Language Models [55.01592097059969]
Supervised finetuning on instruction datasets has played a crucial role in achieving the remarkable zero-shot generalization capabilities.
Active learning is effective in identifying useful subsets of samples to annotate from an unlabeled pool.
We propose using experimental design to circumvent the computational bottlenecks of active learning.
arXiv Detail & Related papers (2024-01-12T16:56:54Z) - Multi-unit soft sensing permits few-shot learning [0.0]
A performance gain is generally attained when knowledge is transferred among strongly related soft sensor learning tasks.
A particularly relevant case for transferability is when developing soft sensors of the same type for similar, but physically different processes or units.
Applying methods that exploit transferability in this setting leads to what we call multi-unit soft sensing.
arXiv Detail & Related papers (2023-09-27T17:50:05Z) - A Survey on Deep Industrial Transfer Learning in Fault Prognostics [0.0]
This paper aims at establishing best practices for future research in this field.
It is shown that the field is lacking common benchmarks to robustly compare results and facilitate scientific progress.
The data sets utilized in these publications are surveyed as well in order to identify suitable candidates for such benchmark scenarios.
arXiv Detail & Related papers (2023-01-04T17:01:27Z) - Stream-based active learning with linear models [0.7734726150561089]
In production, instead of performing random inspections to obtain product information, labels are collected by evaluating the information content of the unlabeled data.
We propose a new strategy for the stream-based scenario, where instances are sequentially offered to the learner.
The iterative aspect of the decision-making process is tackled by setting a threshold on the informativeness of the unlabeled data points.
arXiv Detail & Related papers (2022-07-20T13:15:23Z) - ALLSH: Active Learning Guided by Local Sensitivity and Hardness [98.61023158378407]
We propose to retrieve unlabeled samples with a local sensitivity and hardness-aware acquisition function.
Our method achieves consistent gains over the commonly used active learning strategies in various classification tasks.
arXiv Detail & Related papers (2022-05-10T15:39:11Z) - Automated Machine Learning Techniques for Data Streams [91.3755431537592]
This paper surveys the state-of-the-art open-source AutoML tools, applies them to data collected from streams, and measures how their performance changes over time.
The results show that off-the-shelf AutoML tools can provide satisfactory results but in the presence of concept drift, detection or adaptation techniques have to be applied to maintain the predictive accuracy over time.
arXiv Detail & Related papers (2021-06-14T11:42:46Z) - Just Label What You Need: Fine-Grained Active Selection for Perception
and Prediction through Partially Labeled Scenes [78.23907801786827]
We introduce generalizations that ensure that our approach is both cost-aware and allows for fine-grained selection of examples through partially labeled scenes.
Our experiments on a real-world, large-scale self-driving dataset suggest that fine-grained selection can improve the performance across perception, prediction, and downstream planning tasks.
arXiv Detail & Related papers (2021-04-08T17:57:41Z) - Active Learning: Problem Settings and Recent Developments [2.1574781022415364]
This paper explains the basic problem settings of active learning and recent research trends.
In particular, research on learning acquisition functions to select samples from the data for labeling, theoretical work on active learning algorithms, and stopping criteria for sequential data acquisition are highlighted.
arXiv Detail & Related papers (2020-12-08T05:24:06Z) - Deep Soft Procrustes for Markerless Volumetric Sensor Alignment [81.13055566952221]
In this work, we improve markerless data-driven correspondence estimation to achieve more robust multi-sensor spatial alignment.
We incorporate geometric constraints in an end-to-end manner into a typical segmentation based model and bridge the intermediate dense classification task with the targeted pose estimation one.
Our model is experimentally shown to achieve similar results with marker-based methods and outperform the markerless ones, while also being robust to the pose variations of the calibration structure.
arXiv Detail & Related papers (2020-03-23T10:51:32Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.