Fugu-MT 論文翻訳(概要): PovNet+: A Deep Learning Architecture for Socially Assistive Robots to Learn and Assist with Multiple Activities of Daily Living

論文の概要: PovNet+: A Deep Learning Architecture for Socially Assistive Robots to Learn and Assist with Multiple Activities of Daily Living

arxiv url: http://arxiv.org/abs/2602.00131v1
Date: Wed, 28 Jan 2026 04:39:56 GMT
ステータス: 翻訳完了
システム内更新日: 2026-02-03 19:28:32.966052
Title: PovNet+: A Deep Learning Architecture for Socially Assistive Robots to Learn and Assist with Multiple Activities of Daily Living
Title（参考訳）: PovNet+: 社会支援型ロボットによる多人数生活支援のための深層学習アーキテクチャ
Authors: Fraser Robinson, Souren Pashangpour, Matthew Lisondra, Goldie Nejat,
Abstract要約: 自律型社会支援ロボットの長期展開における大きな障壁は、日常生活における複数の活動の知覚と支援の両方ができないことである。社会支援ロボットのためのマルチアクティブ認識のための最初のマルチモーダルディープラーニングアーキテクチャPOVNet+を提案する。
参考スコア（独自算出の注目度）: 4.96981595868944
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: A significant barrier to the long-term deployment of autonomous socially assistive robots is their inability to both perceive and assist with multiple activities of daily living (ADLs). In this paper, we present the first multimodal deep learning architecture, POVNet+, for multi-activity recognition for socially assistive robots to proactively initiate assistive behaviors. Our novel architecture introduces the use of both ADL and motion embedding spaces to uniquely distinguish between a known ADL being performed, a new unseen ADL, or a known ADL being performed atypically in order to assist people in real scenarios. Furthermore, we apply a novel user state estimation method to the motion embedding space to recognize new ADLs while monitoring user performance. This ADL perception information is used to proactively initiate robot assistive interactions. Comparison experiments with state-of-the-art human activity recognition methods show our POVNet+ method has higher ADL classification accuracy. Human-robot interaction experiments in a cluttered living environment with multiple users and the socially assistive robot Leia using POVNet+ demonstrate the ability of our multi-modal ADL architecture in successfully identifying different seen and unseen ADLs, and ADLs being performed atypically, while initiating appropriate assistive human-robot interactions.
Abstract（参考訳）: 自律型社会支援ロボットの長期展開における大きな障壁は、日常生活における複数の活動(ADL)を知覚し、支援することができないことである。本稿では,社会支援ロボットのためのマルチモーダル深層学習アーキテクチャPOVNet+を提案する。我々の新しいアーキテクチャでは、ADLが実行されている既知のADL、新しい未知のADL、または既知のADLを非典型的に区別するために、ADLとモーション埋め込み空間の両方を導入している。さらに、ユーザパフォーマンスを監視しながら、新しいADLを認識するために、新しいユーザ状態推定手法をモーション埋め込み空間に適用する。このADL知覚情報は、ロボットのアシストインタラクションを積極的に開始するために使用される。最先端の人間行動認識法との比較実験により,POVNet+法はADL分類精度が高いことが示された。 POVNet+ を用いたマルチモーダル型 ADL アーキテクチャを用いて,複数ユーザによる混在した生活環境における人間とロボットのインタラクション実験を行った。

論文の概要: PovNet+: A Deep Learning Architecture for Socially Assistive Robots to Learn and Assist with Multiple Activities of Daily Living

関連論文リスト