Kernel-Based Distributed Q-Learning: A Scalable Reinforcement Learning
Approach for Dynamic Treatment Regimes
- URL: http://arxiv.org/abs/2302.10434v1
- Date: Tue, 21 Feb 2023 04:15:34 GMT
- Title: Kernel-Based Distributed Q-Learning: A Scalable Reinforcement Learning
Approach for Dynamic Treatment Regimes
- Authors: Di Wang, Yao Wang, Shaojie Tang, Shao-Bo Lin
- Abstract summary: We present a novel approach, a distributed Q-learning algorithm, for generating dynamic treatment regimes.
We show that our algorithm outperforms both traditional linear Q-learning and commonly used deep Q-learning in terms of both prediction accuracy and computation cost.
- Score: 37.06048335758881
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In recent years, large amounts of electronic health records (EHRs) concerning
chronic diseases, such as cancer, diabetes, and mental disease, have been
collected to facilitate medical diagnosis. Modeling the dynamic properties of
EHRs related to chronic diseases can be efficiently done using dynamic
treatment regimes (DTRs), which are a set of sequential decision rules. While
Reinforcement learning (RL) is a widely used method for creating DTRs, there is
ongoing research in developing RL algorithms that can effectively handle large
amounts of data. In this paper, we present a novel approach, a distributed
Q-learning algorithm, for generating DTRs. The novelties of our research are as
follows: 1) From a methodological perspective, we present a novel and scalable
approach for generating DTRs by combining distributed learning with Q-learning.
The proposed approach is specifically designed to handle large amounts of data
and effectively generate DTRs. 2) From a theoretical standpoint, we provide
generalization error bounds for the proposed distributed Q-learning algorithm,
which are derived within the framework of statistical learning theory. These
bounds quantify the relationships between sample size, prediction accuracy, and
computational burden, providing insights into the performance of the algorithm.
3) From an applied perspective, we demonstrate the effectiveness of our
proposed distributed Q-learning algorithm for DTRs by applying it to clinical
cancer treatments. The results show that our algorithm outperforms both
traditional linear Q-learning and commonly used deep Q-learning in terms of
both prediction accuracy and computation cost.
Related papers
- DTR-Bench: An in silico Environment and Benchmark Platform for Reinforcement Learning Based Dynamic Treatment Regime [18.443316087890324]
Reinforcement learning (RL) has garnered increasing recognition for its potential to optimise dynamic treatment regimes (DTRs) in personalised medicine.
We introduce textitDTR-Bench, a benchmarking platform for simulating diverse healthcare scenarios.
We evaluate various state-of-the-art RL algorithms across these settings, particularly highlighting their performance amidst real-world challenges.
arXiv Detail & Related papers (2024-05-28T21:40:00Z) - Evolutionary Optimization of 1D-CNN for Non-contact Respiration Pattern Classification [0.19999259391104385]
We present a deep learning-based approach for time-series respiration data classification.
We employed a 1D convolutional neural network (1D-CNN) for classification purposes.
Genetic algorithm was employed to optimize the 1D-CNN architecture to maximize classification accuracy.
arXiv Detail & Related papers (2023-12-20T13:59:43Z) - Learning Optimal Dynamic Treatment Regimes Using Causal Tree Methods in
Medicine [20.401805132360654]
We develop two novel methods for learning optimal dynamic treatment regimes (DTRs)
Our methods are based on a data-driven estimation of heterogeneous treatment effects using causal tree methods.
We evaluate our proposed methods using synthetic data and then apply them to real-world data from intensive care units.
arXiv Detail & Related papers (2022-04-14T17:27:08Z) - Lung Cancer Lesion Detection in Histopathology Images Using Graph-Based
Sparse PCA Network [93.22587316229954]
We propose a graph-based sparse principal component analysis (GS-PCA) network, for automated detection of cancerous lesions on histological lung slides stained by hematoxylin and eosin (H&E)
We evaluate the performance of the proposed algorithm on H&E slides obtained from an SVM K-rasG12D lung cancer mouse model using precision/recall rates, F-score, Tanimoto coefficient, and area under the curve (AUC) of the receiver operator characteristic (ROC)
arXiv Detail & Related papers (2021-10-27T19:28:36Z) - Efficient training of lightweight neural networks using Online
Self-Acquired Knowledge Distillation [51.66271681532262]
Online Self-Acquired Knowledge Distillation (OSAKD) is proposed, aiming to improve the performance of any deep neural model in an online manner.
We utilize k-nn non-parametric density estimation technique for estimating the unknown probability distributions of the data samples in the output feature space.
arXiv Detail & Related papers (2021-08-26T14:01:04Z) - A Meta-embedding-based Ensemble Approach for ICD Coding Prediction [64.42386426730695]
International Classification of Diseases (ICD) are the de facto codes used globally for clinical coding.
These codes enable healthcare providers to claim reimbursement and facilitate efficient storage and retrieval of diagnostic information.
Our proposed approach enhances the performance of neural models by effectively training word vectors using routine medical data as well as external knowledge from scientific articles.
arXiv Detail & Related papers (2021-02-26T17:49:58Z) - Sample-Efficient Reinforcement Learning via Counterfactual-Based Data
Augmentation [15.451690870640295]
In some scenarios such as healthcare, usually only few records are available for each patient, impeding the application of currentReinforcement learning algorithms.
We propose a data-efficient RL algorithm that exploits structural causal models (SCMs) to model the state dynamics.
We show that counterfactual outcomes are identifiable under mild conditions and that Q- learning on the counterfactual-based augmented data set converges to the optimal value function.
arXiv Detail & Related papers (2020-12-16T17:21:13Z) - A Systematic Approach to Featurization for Cancer Drug Sensitivity
Predictions with Deep Learning [49.86828302591469]
We train >35,000 neural network models, sweeping over common featurization techniques.
We found the RNA-seq to be highly redundant and informative even with subsets larger than 128 features.
arXiv Detail & Related papers (2020-04-30T20:42:17Z) - Deep Learning Algorithms for Rotating Machinery Intelligent Diagnosis:
An Open Source Benchmark Study [0.8497188292342053]
This paper provides a benchmark study of deep learning algorithms for rotating machinery intelligent diagnosis.
We integrate the whole evaluation codes into a code library and release this code library to the public for better development of this field.
By these works, we release a unified code framework for comparing and testing models fairly and quickly, emphasize the importance of open source codes, provide the baseline accuracy (a lower bound) to avoid useless improvement, and discuss potential future directions in this field.
arXiv Detail & Related papers (2020-03-06T17:24:43Z) - Opportunities and Challenges of Deep Learning Methods for
Electrocardiogram Data: A Systematic Review [62.490310870300746]
The electrocardiogram (ECG) is one of the most commonly used diagnostic tools in medicine and healthcare.
Deep learning methods have achieved promising results on predictive healthcare tasks using ECG signals.
This paper presents a systematic review of deep learning methods for ECG data from both modeling and application perspectives.
arXiv Detail & Related papers (2019-12-28T02:44:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.