A Hybrid Approach of Transfer Learning and Physics-Informed Modeling:
Improving Dissolved Oxygen Concentration Prediction in an Industrial
Wastewater Treatment Plant
- URL: http://arxiv.org/abs/2401.11217v1
- Date: Sat, 20 Jan 2024 11:53:08 GMT
- Title: A Hybrid Approach of Transfer Learning and Physics-Informed Modeling:
Improving Dissolved Oxygen Concentration Prediction in an Industrial
Wastewater Treatment Plant
- Authors: Ece S. Koksal and Erdal Aydin
- Abstract summary: The objective is to increase the prediction performance of an industrial wastewater treatment plant by transferring the knowledge of (i) an open-source simulation model that captures the underlying physics of the process, albeit with dissimilarities to the target plant, and (ii) another industrial plant characterized by noisy and limited data but located in the same refinery, and (iii) the model in (ii)
The results have shown that test and validation performance are improved up to 27% and 59%, respectively.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Constructing first principles models is a challenging task for nonlinear and
complex systems such as a wastewater treatment unit. In recent years,
data-driven models are widely used to overcome the complexity. However, they
often suffer from issues such as missing, low quality or noisy data. Transfer
learning is a solution for this issue where knowledge from another task is
transferred to target one to increase the prediction performance. In this work,
the objective is increasing the prediction performance of an industrial
wastewater treatment plant by transferring the knowledge of (i) an open-source
simulation model that captures the underlying physics of the process, albeit
with dissimilarities to the target plant, (ii) another industrial plant
characterized by noisy and limited data but located in the same refinery, and
(iii) the model in (ii) and making the objective function of the training
problem physics informed where the physics information derived from the
open-source model in (ii). The results have shown that test and validation
performance are improved up to 27% and 59%, respectively.
Related papers
- Rare Class Prediction Model for Smart Industry in Semiconductor Manufacturing [1.3955252961896323]
This study develops a rare class prediction approach for in situ data collected from a smart semiconductor manufacturing process.
The primary objective is to build a model that addresses issues of noise and class imbalance, enhancing class separation.
The model was evaluated using various performance metrics, with ROC curves showing an AUC of 0.95, a precision of 0.66, and a recall of 0.96
arXiv Detail & Related papers (2024-06-06T22:09:43Z) - Low-rank finetuning for LLMs: A fairness perspective [54.13240282850982]
Low-rank approximation techniques have become the de facto standard for fine-tuning Large Language Models.
This paper investigates the effectiveness of these methods in capturing the shift of fine-tuning datasets from the initial pre-trained data distribution.
We show that low-rank fine-tuning inadvertently preserves undesirable biases and toxic behaviors.
arXiv Detail & Related papers (2024-05-28T20:43:53Z) - PiRD: Physics-informed Residual Diffusion for Flow Field Reconstruction [5.06136344261226]
CNN-based methods for data fidelity enhancement rely on low-fidelity data patterns and distributions during the training phase.
Our proposed model - Physics-informed Residual Diffusion - demonstrates the capability to elevate the quality of data from both standard low-fidelity inputs.
Experimental results have shown that our approach can effectively reconstruct high-quality outcomes for two-dimensional turbulent flows without requiring retraining.
arXiv Detail & Related papers (2024-04-12T11:45:51Z) - Domain adaption and physical constrains transfer learning for shale gas
production [0.26440512250125126]
We propose a novel transfer learning methodology that utilizes domain adaptation and physical constraints.
This methodology effectively employs historical data from the source domain to reduce negative transfer from the data distribution perspective.
By incorporating drilling, completion, and geological data as physical constraints, we develop a hybrid model.
arXiv Detail & Related papers (2023-12-18T04:13:27Z) - Transfer learning for predicting source terms of principal component
transport in chemically reactive flow [0.40964539027092917]
The aim of this study is to evaluate whether the number of requisite training samples can be reduced with the use of various transfer learning models.
Three transfer learning strategies are then applied to the training of the ANN model with a sparse dataset.
The performance of the reduced-order model with a sparse dataset is found to be remarkably enhanced if the training of the ANN model is restricted by a regularization term.
arXiv Detail & Related papers (2023-12-01T05:18:35Z) - Learning Objective-Specific Active Learning Strategies with Attentive
Neural Processes [72.75421975804132]
Learning Active Learning (LAL) suggests to learn the active learning strategy itself, allowing it to adapt to the given setting.
We propose a novel LAL method for classification that exploits symmetry and independence properties of the active learning problem.
Our approach is based on learning from a myopic oracle, which gives our model the ability to adapt to non-standard objectives.
arXiv Detail & Related papers (2023-09-11T14:16:37Z) - Towards Efficient Task-Driven Model Reprogramming with Foundation Models [52.411508216448716]
Vision foundation models exhibit impressive power, benefiting from the extremely large model capacity and broad training data.
However, in practice, downstream scenarios may only support a small model due to the limited computational resources or efficiency considerations.
This brings a critical challenge for the real-world application of foundation models: one has to transfer the knowledge of a foundation model to the downstream task.
arXiv Detail & Related papers (2023-04-05T07:28:33Z) - An Adversarial Active Sampling-based Data Augmentation Framework for
Manufacturable Chip Design [55.62660894625669]
Lithography modeling is a crucial problem in chip design to ensure a chip design mask is manufacturable.
Recent developments in machine learning have provided alternative solutions in replacing the time-consuming lithography simulations with deep neural networks.
We propose a litho-aware data augmentation framework to resolve the dilemma of limited data and improve the machine learning model performance.
arXiv Detail & Related papers (2022-10-27T20:53:39Z) - Physics-informed machine learning with differentiable programming for
heterogeneous underground reservoir pressure management [64.17887333976593]
Avoiding over-pressurization in subsurface reservoirs is critical for applications like CO2 sequestration and wastewater injection.
Managing the pressures by controlling injection/extraction are challenging because of complex heterogeneity in the subsurface.
We use differentiable programming with a full-physics model and machine learning to determine the fluid extraction rates that prevent over-pressurization.
arXiv Detail & Related papers (2022-06-21T20:38:13Z) - Using Data Assimilation to Train a Hybrid Forecast System that Combines
Machine-Learning and Knowledge-Based Components [52.77024349608834]
We consider the problem of data-assisted forecasting of chaotic dynamical systems when the available data is noisy partial measurements.
We show that by using partial measurements of the state of the dynamical system, we can train a machine learning model to improve predictions made by an imperfect knowledge-based model.
arXiv Detail & Related papers (2021-02-15T19:56:48Z) - Complete CVDL Methodology for Investigating Hydrodynamic Instabilities [0.49873153106566565]
In fluid dynamics, one of the most important research fields is hydrodynamic instabilities and their evolution in different flow regimes.
Currently, three main methods are used for understanding such phenomenon - namely analytical models, experiments and simulations.
We claim and demonstrate that a major portion of this research effort could and should be analysed using recent breakthrough advancements in the field of Computer Vision with Deep Learning (CVDL, or Deep Computer-Vision)
Specifically, we focus in this research on one of the most representative instabilities, the Rayleigh-Taylor one, simulate its behaviour and create an open-sourced state-of-the
arXiv Detail & Related papers (2020-04-03T13:52:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.