Developing the Reliable Shallow Supervised Learning for Thermal Comfort
using ASHRAE RP-884 and ASHRAE Global Thermal Comfort Database II
- URL: http://arxiv.org/abs/2303.03873v1
- Date: Fri, 3 Mar 2023 13:47:38 GMT
- Title: Developing the Reliable Shallow Supervised Learning for Thermal Comfort
using ASHRAE RP-884 and ASHRAE Global Thermal Comfort Database II
- Authors: Kanisius Karyono, Badr M. Abdullah, Alison J. Cotgrave, Ana Bras, and
Jeff Cullen
- Abstract summary: This work introduces the reliable data set for training the AI subsystem for thermal comfort.
No training data for thermal comfort is available as reliable as this dataset, but the direct use of this data can lead to overfitting.
This work offers the algorithm for data filtering and semantic data augmentation for the ASHRAE database for the supervised learning process.
- Score: 1.0863226323853896
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: The artificial intelligence (AI) system designer for thermal comfort faces
insufficient data recorded from the current user or overfitting due to
unreliable training data. This work introduces the reliable data set for
training the AI subsystem for thermal comfort. This paper presents the control
algorithm based on shallow supervised learning, which is simple enough to be
implemented in the Internet of Things (IoT) system for residential usage using
ASHRAE RP-884 and ASHRAE Global Thermal Comfort Database II. No training data
for thermal comfort is available as reliable as this dataset, but the direct
use of this data can lead to overfitting. This work offers the algorithm for
data filtering and semantic data augmentation for the ASHRAE database for the
supervised learning process. Overfitting always becomes a problem due to the
psychological aspect involved in the thermal comfort decision. The method to
check the AI system based on the psychrometric chart against overfitting is
presented. This paper also assesses the most important parameters needed to
achieve human thermal comfort. This method can support the development of
reinforced learning for thermal comfort.
Related papers
- Iterative Data Smoothing: Mitigating Reward Overfitting and
Overoptimization in RLHF [79.98542868281471]
Reinforcement Learning from Human Feedback (RLHF) is a technique that aligns language models closely with human-centric values.
It is observed that the performance of the reward model degrades after one epoch of training, and optimizing too much against the learned reward model eventually hinders the true objective.
This paper delves into these issues, leveraging the theoretical insights to design improved reward learning algorithm termed 'Iterative Data Smoothing' (IDS)
arXiv Detail & Related papers (2024-01-29T17:43:42Z) - On Responsible Machine Learning Datasets with Fairness, Privacy, and Regulatory Norms [56.119374302685934]
There have been severe concerns over the trustworthiness of AI technologies.
Machine and deep learning algorithms depend heavily on the data used during their development.
We propose a framework to evaluate the datasets through a responsible rubric.
arXiv Detail & Related papers (2023-10-24T14:01:53Z) - Machine Learning-Based Automated Thermal Comfort Prediction: Integration
of Low-Cost Thermal and Visual Cameras for Higher Accuracy [3.2872586139884623]
Real-time feedback system is needed to provide data about occupants' comfort conditions.
New solutions are required to bring a more holistic view toward non-intrusive thermal scanning.
arXiv Detail & Related papers (2022-04-14T15:30:16Z) - Conditional Synthetic Data Generation for Personal Thermal Comfort
Models [7.505485586268498]
Personal thermal comfort models aim to predict an individual's thermal comfort response, instead of the average response of a large group.
Recently, machine learning algorithms have proven to be having enormous potential as a candidate for personal thermal comfort models.
But, often within the normal settings of a building, personal thermal comfort data obtained via experiments are heavily class-imbalanced.
We propose to implement a state-of-the-art conditional synthetic data generator to generate synthetic data corresponding to the low-frequency classes.
arXiv Detail & Related papers (2022-03-10T08:57:25Z) - Deep Reinforcement Learning Assisted Federated Learning Algorithm for
Data Management of IIoT [82.33080550378068]
The continuous expanded scale of the industrial Internet of Things (IIoT) leads to IIoT equipments generating massive amounts of user data every moment.
How to manage these time series data in an efficient and safe way in the field of IIoT is still an open issue.
This paper studies the FL technology applications to manage IIoT equipment data in wireless network environments.
arXiv Detail & Related papers (2022-02-03T07:12:36Z) - Meta-UDA: Unsupervised Domain Adaptive Thermal Object Detection using
Meta-Learning [64.92447072894055]
Infrared (IR) cameras are robust under adverse illumination and lighting conditions.
We propose an algorithm meta-learning framework to improve existing UDA methods.
We produce a state-of-the-art thermal detector for the KAIST and DSIAC datasets.
arXiv Detail & Related papers (2021-10-07T02:28:18Z) - A Sequential Modelling Approach for Indoor Temperature Prediction and
Heating Control in Smart Buildings [4.759925918369102]
This paper proposes a learning-based framework for sequentially applying the data-driven statistical methods to predict indoor temperature.
Experiments demonstrate the effectiveness of the modelling approach and control algorithm, and reveal the promising potential of the mixed data-driven approach in smart building applications.
arXiv Detail & Related papers (2020-09-21T13:20:27Z) - Provably Efficient Causal Reinforcement Learning with Confounded
Observational Data [135.64775986546505]
We study how to incorporate the dataset (observational data) collected offline, which is often abundantly available in practice, to improve the sample efficiency in the online setting.
We propose the deconfounded optimistic value iteration (DOVI) algorithm, which incorporates the confounded observational data in a provably efficient manner.
arXiv Detail & Related papers (2020-06-22T14:49:33Z) - AWAC: Accelerating Online Reinforcement Learning with Offline Datasets [84.94748183816547]
We show that our method, advantage weighted actor critic (AWAC), enables rapid learning of skills with a combination of prior demonstration data and online experience.
Our results show that incorporating prior data can reduce the time required to learn a range of robotic skills to practical time-scales.
arXiv Detail & Related papers (2020-06-16T17:54:41Z) - Transfer Learning for Thermal Comfort Prediction in Multiple Cities [8.759740337781526]
This research aims to tackle the data-shortage problem and boost the performance of thermal comfort prediction.
We utilise sensor data from multiple cities in the same climate zone to learn thermal comfort patterns.
We present a transfer learning based multilayer perceptron model from the same climate zone for accurate thermal comfort prediction.
arXiv Detail & Related papers (2020-04-29T11:42:02Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.