Related papers: A Meta-Learning Approach for Multi-Objective Reinforcement Learning in Sustainable Home Environments

A Meta-Learning Approach for Multi-Objective Reinforcement Learning in Sustainable Home Environments

URL: http://arxiv.org/abs/2407.11489v1
Date: Tue, 16 Jul 2024 08:23:20 GMT
Title: A Meta-Learning Approach for Multi-Objective Reinforcement Learning in Sustainable Home Environments
Authors: Junlin Lu, Patrick Mannion, Karl Mason,
Abstract summary: We extend state-of-the-art MORL algorithms with the meta-learning paradigm. We employ an auto-encoder (AE)-based unsupervised method to detect environment context changes. This study assesses the application of MORL in residential appliance scheduling and underscores the effectiveness of meta-learning in energy management.
Score: 2.9845592719739127
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Effective residential appliance scheduling is crucial for sustainable living. While multi-objective reinforcement learning (MORL) has proven effective in balancing user preferences in appliance scheduling, traditional MORL struggles with limited data in non-stationary residential settings characterized by renewable generation variations. Significant context shifts that can invalidate previously learned policies. To address these challenges, we extend state-of-the-art MORL algorithms with the meta-learning paradigm, enabling rapid, few-shot adaptation to shifting contexts. Additionally, we employ an auto-encoder (AE)-based unsupervised method to detect environment context changes. We have also developed a residential energy environment to evaluate our method using real-world data from London residential settings. This study not only assesses the application of MORL in residential appliance scheduling but also underscores the effectiveness of meta-learning in energy management. Our top-performing method significantly surpasses the best baseline, while the trained model saves 3.28% on electricity bills, a 2.74% increase in user comfort, and a 5.9% improvement in expected utility. Additionally, it reduces the sparsity of solutions by 62.44%. Remarkably, these gains were accomplished using 96.71% less training data and 61.1% fewer training steps.

Related papers

Explainable AI for building energy retrofitting under data scarcity [40.14307808809578]
This study presents an Artificial Intelligence (AI) and Machine Learning (ML)-based framework to recommend energy efficiency measures for residential buildings. Using Latvia as a case study, the methodology addresses challenges associated with limited datasets, class imbalance and data scarcity. The evaluation of the approach shows that it notably overcomes data limitations, achieving improvements up to 54% in precision, recall and F1 score.
arXiv Detail & Related papers (2025-04-08T14:00:08Z)
Improving the Efficiency of a Deep Reinforcement Learning-Based Power Management System for HPC Clusters Using Curriculum Learning [1.1380162891529537]
Machine learning has shown promise in determining optimal times to switch nodes on or off. In this study, we enhance the performance of a deep reinforcement learning (DRL) agent for HPC power management by integrating curriculum learning (CL) Experimental results confirm that an easy-to-hard curriculum outperforms other training orders in terms of reducing wasted energy usage.
arXiv Detail & Related papers (2025-02-27T18:19:22Z)
SPEQ: Offline Stabilization Phases for Efficient Q-Learning in High Update-To-Data Ratio Reinforcement Learning [51.10866035483686]
High update-to-data (UTD) ratio algorithms in reinforcement learning (RL) improve sample efficiency but incur high computational costs, limiting real-world scalability. We propose Offline Stabilization Phases for Efficient Q-Learning (SPEQ), an RL algorithm that combines low-UTD online training with periodic offline stabilization phases. During these phases, Q-functions are fine-tuned with high UTD ratios on a fixed replay buffer, reducing redundant updates on suboptimal data.
arXiv Detail & Related papers (2025-01-15T09:04:19Z)
Load Forecasting for Households and Energy Communities: Are Deep Learning Models Worth the Effort? [0.0]
Energy communities (ECs) play a key role in enabling local demand shifting and enhancing self-sufficiency.<n>Data-driven forecasting has gained significant attention, but it remains insufficiently explored in many practical contexts.<n>This study evaluates the effectiveness of state-of-the-art deep learning models across various community size, historical data availability, and model complexity.
arXiv Detail & Related papers (2025-01-09T06:29:50Z)
Unlearning with Control: Assessing Real-world Utility for Large Language Model Unlearning [97.2995389188179]
Recent research has begun to approach large language models (LLMs) unlearning via gradient ascent (GA) Despite their simplicity and efficiency, we suggest that GA-based methods face the propensity towards excessive unlearning. We propose several controlling methods that can regulate the extent of excessive unlearning.
arXiv Detail & Related papers (2024-06-13T14:41:00Z)
Green AI in Action: Strategic Model Selection for Ensembles in Production [2.464194460689648]
Ensemble learning, combining predictions from multiple models to form a single prediction, intensifies this problem due to cumulative energy consumption. This paper presents a novel approach to model selection that addresses the challenge of balancing the accuracy of AI models with their energy consumption in a live AI ensemble system.
arXiv Detail & Related papers (2024-05-21T18:57:43Z)
EdgeOL: Efficient in-situ Online Learning on Edge Devices [51.86178757050963]
We propose EdgeOL, an edge online learning framework that optimize inference accuracy, fine-tuning execution time, and energy efficiency.<n> Experimental results show that, on average, EdgeOL reduces overall fine-tuning execution time by 64%, energy consumption by 52%, and improves average inference accuracy by 1.75% over the immediate online learning strategy.
arXiv Detail & Related papers (2024-01-30T02:41:05Z)
Sparse Low-rank Adaptation of Pre-trained Language Models [79.74094517030035]
We introduce sparse low-rank adaptation (SoRA) that enables dynamic adjustments to the intrinsic rank during the adaptation process. Our approach strengthens the representation power of LoRA by initializing it with a higher rank, while efficiently taming a temporarily increased number of parameters. Our experimental results demonstrate that SoRA can outperform other baselines even with 70% retained parameters and 70% training time.
arXiv Detail & Related papers (2023-11-20T11:56:25Z)
Adaptive Resource Allocation for Virtualized Base Stations in O-RAN with Online Learning [60.17407932691429]
Open Radio Access Network systems, with their base stations (vBSs), offer operators the benefits of increased flexibility, reduced costs, vendor diversity, and interoperability. We propose an online learning algorithm that balances the effective throughput and vBS energy consumption, even under unforeseeable and "challenging'' environments. We prove the proposed solutions achieve sub-linear regret, providing zero average optimality gap even in challenging environments.
arXiv Detail & Related papers (2023-09-04T17:30:21Z)
Energy Efficient Deep Multi-Label ON/OFF Classification of Low Frequency Metered Home Appliances [0.16777183511743468]
Non-intrusive load monitoring (NILM) is the process of obtaining appliance-level data from a single metering point. We introduce a novel DL model aimed at enhanced multi-label classification of NILM with improved computation and energy efficiency. Compared to the state-of-the-art, the proposed model has its energy consumption reduced by more than 23%.
arXiv Detail & Related papers (2023-07-18T13:23:23Z)
Continually learning out-of-distribution spatiotemporal data for robust energy forecasting [10.47725405370935]
Building energy usage is essential for promoting sustainability and reducing waste. Forecasting energy usage during anomalous periods is difficult due to changes in occupancy patterns and energy usage behavior. Online learning has emerged as a promising solution to this challenge. We have conducted experiments using data from six buildings to test the efficacy of these approaches.
arXiv Detail & Related papers (2023-06-10T09:12:10Z)
Towards Sustainable Deep Learning for Wireless Fingerprinting Localization [0.541530201129053]
Location based services are becoming part of new wireless infrastructures and emerging business processes. Deep Learning (DL) artificial intelligence methods perform very well in wireless fingerprinting localization based on extensive indoor radio measurement data. With the increasing complexity these methods become computationally very intensive and energy hungry. We present a new DL-based architecture for indoor localization that is more energy efficient compared to related state-of-the-art approaches.
arXiv Detail & Related papers (2022-01-22T15:13:44Z)
Learning to Continuously Optimize Wireless Resource in a Dynamic Environment: A Bilevel Optimization Perspective [52.497514255040514]
This work develops a new approach that enables data-driven methods to continuously learn and optimize resource allocation strategies in a dynamic environment. We propose to build the notion of continual learning into wireless system design, so that the learning model can incrementally adapt to the new episodes. Our design is based on a novel bilevel optimization formulation which ensures certain fairness" across different data samples.
arXiv Detail & Related papers (2021-05-03T07:23:39Z)
Learning to Continuously Optimize Wireless Resource In Episodically Dynamic Environment [55.91291559442884]
This work develops a methodology that enables data-driven methods to continuously learn and optimize in a dynamic environment. We propose to build the notion of continual learning into the modeling process of learning wireless systems. Our design is based on a novel min-max formulation which ensures certain fairness" across different data samples.
arXiv Detail & Related papers (2020-11-16T08:24:34Z)
A Relearning Approach to Reinforcement Learning for Control of Smart Buildings [1.8799681615947088]
This paper demonstrates that continual relearning of control policies using incremental deep reinforcement learning (RL) can improve policy learning for non-stationary processes. We develop an incremental RL technique that simultaneously reduces building energy consumption without sacrificing overall comfort.
arXiv Detail & Related papers (2020-08-04T23:31:05Z)
How to Train Your Energy-Based Model for Regression [107.54411649704194]
Energy-based models (EBMs) have become increasingly popular within computer vision in recent years. Recent work has applied EBMs also for regression tasks, achieving state-of-the-art performance on object detection and visual tracking. How EBMs should be trained for best possible regression performance is not a well-studied problem.
arXiv Detail & Related papers (2020-05-04T17:55:01Z)

This list is automatically generated from the titles and abstracts of the papers in this site.