Related papers: NurseSchedRL: Attention-Guided Reinforcement Learning for Nurse-Patient Assignment

NurseSchedRL: Attention-Guided Reinforcement Learning for Nurse-Patient Assignment

URL: http://arxiv.org/abs/2509.18125v1
Date: Wed, 10 Sep 2025 21:41:42 GMT
Title: NurseSchedRL: Attention-Guided Reinforcement Learning for Nurse-Patient Assignment
Authors: Harsha Koduri,
Abstract summary: NurseSchedRL is a reinforcement learning framework for nurse-patient assignment.<n>It integrates structured state encoding, constrained action masking, and attention-based representations of skills, fatigue, and geographical context.
Score: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Healthcare systems face increasing pressure to allocate limited nursing resources efficiently while accounting for skill heterogeneity, patient acuity, staff fatigue, and continuity of care. Traditional optimization and heuristic scheduling methods struggle to capture these dynamic, multi-constraint environments. I propose NurseSchedRL, a reinforcement learning framework for nurse-patient assignment that integrates structured state encoding, constrained action masking, and attention-based representations of skills, fatigue, and geographical context. NurseSchedRL uses Proximal Policy Optimization (PPO) with feasibility masks to ensure assignments respect real-world constraints, while dynamically adapting to patient arrivals and varying nurse availability. In simulation with realistic nurse and patient data, NurseSchedRL achieves improved scheduling efficiency, better alignment of skills to patient needs, and reduced fatigue compared to baseline heuristic and unconstrained RL approaches. These results highlight the potential of reinforcement learning for decision support in complex, high-stakes healthcare workforce management.

Related papers

Automated Procedural Analysis via Video-Language Models for AI-assisted Nursing Skills Assessment [5.851959409921155]
We introduce a video-language model (VLM) based framework to develop the AI capability of automated procedural assessment and feedback for nursing skills training.<n>The framework follows a curriculum-inspired progression, advancing from high-level action recognition to procedural reasoning.<n>This work advances AI applications in nursing education, contributing to stronger workforce development and ultimately safer patient care.
arXiv Detail & Related papers (2025-09-20T21:11:33Z)
From Coordination to Personalization: A Trust-Aware Simulation Framework for Emergency Department Decision Support [0.0]
This study proposes a simulation-based framework for modeling doctors and nurses as intelligent agents guided by computational trust mechanisms.<n>The proposed framework demonstrates the potential of computational trust for evidence-based decision support in emergency medicine.
arXiv Detail & Related papers (2025-09-09T18:00:44Z)
A Multi-Objective Genetic Algorithm for Healthcare Workforce Scheduling [0.764671395172401]
We propose a Multi-objective Genetic Algorithm (MOO-GA) that models the hospital unit workforce scheduling problem as a multi-objective optimization task.<n>By defining objective functions for cost, patient care coverage, and staff satisfaction, the GA navigates the vast search space to identify a set of high-quality, non-dominated solutions.
arXiv Detail & Related papers (2025-08-28T16:16:10Z)
MORE-CLEAR: Multimodal Offline Reinforcement learning for Clinical notes Leveraged Enhanced State Representation [2.3363060352988283]
We propose a Multimodal Offline REinforcement learning for Clinical notes Leveraged Enhanced stAte Representation framework for sepsis control in intensive care units.<n>More-CLEAR employs pre-trained large-scale language models (LLMs) to facilitate the extraction of rich semantic representations from clinical notes.<n>To our knowledge, this is the first to leverage LLM capabilities within a multimodal offline RL for better state representation in medical applications.
arXiv Detail & Related papers (2025-08-11T06:58:33Z)
A Hybrid Data-Driven Approach For Analyzing And Predicting Inpatient Length Of Stay In Health Centre [0.0]
The study proposes an all-encompassing framework for the optimization of patient flow.<n>Using a comprehensive dataset of 2.3 million de-identified patient records, we analyzed demographics, diagnoses, treatments, services, costs, and charges.<n>Our model predicts patient length of stay (LoS) upon admission using supervised learning algorithms.
arXiv Detail & Related papers (2025-01-30T18:01:48Z)
Asymmetrical Reciprocity-based Federated Learning for Resolving Disparities in Medical Diagnosis [68.06621490069428]
Geographic health disparities pose a pressing global challenge, particularly in underserved regions of low- and middle-income nations.<n>We propose a novel cross-silo federated learning framework, named FedHelp, aimed at alleviating geographic health disparities and fortifying the diagnostic capabilities of underserved regions.
arXiv Detail & Related papers (2024-12-27T13:59:58Z)
Enhancing Nursing and Elderly Care with Large Language Models: An AI-Driven Framework [9.201523682061753]
This paper explores the application of large language models (LLMs) in nursing and elderly care, focusing on AI-driven patient monitoring and interaction.<n>We introduce a novel Chinese nursing dataset and implement incremental pre-training (IPT) and supervised fine-tuning (SFT) techniques to enhance LLM performance in specialized tasks.
arXiv Detail & Related papers (2024-12-13T08:10:56Z)
Evaluating the Fairness of the MIMIC-IV Dataset and a Baseline Algorithm: Application to the ICU Length of Stay Prediction [65.268245109828]
This paper uses the MIMIC-IV dataset to examine the fairness and bias in an XGBoost binary classification model predicting the ICU length of stay. The research reveals class imbalances in the dataset across demographic attributes and employs data preprocessing and feature extraction. The paper concludes with recommendations for fairness-aware machine learning techniques for mitigating biases and the need for collaborative efforts among healthcare professionals and data scientists.
arXiv Detail & Related papers (2023-12-31T16:01:48Z)
Safe and Interpretable Estimation of Optimal Treatment Regimes [54.257304443780434]
We operationalize a safe and interpretable framework to identify optimal treatment regimes. Our findings support personalized treatment strategies based on a patient's medical history and pharmacological features.
arXiv Detail & Related papers (2023-10-23T19:59:10Z)
Deep Reinforcement Learning for Efficient and Fair Allocation of Health Care Resources [47.57108369791273]
Scarcity of health care resources could result in the unavoidable consequence of rationing. There is no universally accepted standard for health care resource allocation protocols. We propose a transformer-based deep Q-network to integrate the disease progression of individual patients and the interaction effects among patients.
arXiv Detail & Related papers (2023-09-15T17:28:06Z)
Pruning the Way to Reliable Policies: A Multi-Objective Deep Q-Learning Approach to Critical Care [46.2482873419289]
We introduce a deep Q-learning approach to obtain more reliable critical care policies. We evaluate our method in off-policy and offline settings using simulated environments and real health records from intensive care units.
arXiv Detail & Related papers (2023-06-13T18:02:57Z)
U-PASS: an Uncertainty-guided deep learning Pipeline for Automated Sleep Staging [61.6346401960268]
We propose a machine learning pipeline called U-PASS tailored for clinical applications that incorporates uncertainty estimation at every stage of the process. We apply our uncertainty-guided deep learning pipeline to the challenging problem of sleep staging and demonstrate that it systematically improves performance at every stage.
arXiv Detail & Related papers (2023-06-07T08:27:36Z)
Optimal discharge of patients from intensive care via a data-driven policy learning framework [58.720142291102135]
It is important that the patient discharge task addresses the nuanced trade-off between decreasing a patient's length of stay and the risk of readmission or even death following the discharge decision. This work introduces an end-to-end general framework for capturing this trade-off to recommend optimal discharge timing decisions. A data-driven approach is used to derive a parsimonious, discrete state space representation that captures a patient's physiological condition.
arXiv Detail & Related papers (2021-12-17T04:39:33Z)

This list is automatically generated from the titles and abstracts of the papers in this site.