Fugu-MT 論文翻訳(概要): From 'What' to 'How' and 'Why': Sharing LLM-Generated Retrospective Summaries of Older Adults' Passive Tracking Data with Remote Family Members

論文の概要: From 'What' to 'How' and 'Why': Sharing LLM-Generated Retrospective Summaries of Older Adults' Passive Tracking Data with Remote Family Members

arxiv url: http://arxiv.org/abs/2606.03876v1
Date: Tue, 02 Jun 2026 16:46:00 GMT
ステータス: 翻訳完了
システム内更新日: 2026-06-03 22:00:05.172482
Title: From 'What' to 'How' and 'Why': Sharing LLM-Generated Retrospective Summaries of Older Adults' Passive Tracking Data with Remote Family Members
Title（参考訳）: 何」から「どのように」・「なぜ」へ--遠隔家族の受動追跡データを用いたLCMによる高齢者の振り返りサプリメントの共有
Authors: Jiachen Li, Reina Szeyi Chan, Akshat Choube, Xiang Zhi Tan, Elizabeth Mynatt, Varun Mishra,
Abstract要約: 高齢者のRAMに対するマルチモーダル追跡データから振り返りサマリーを生成する方法について述べる。既存のシステムであるVital Insightを活用して、初期要約を生成しました。次に、システムをマルチレイヤ、マルチエージェント、インサイト駆動の要約アプローチに再設計しました。
参考スコア（独自算出の注目度）: 6.520797431253829
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: With the growing prevalence of modern ubiquitous computing technologies, multi-modal tracking systems hold promise for providing timely awareness and reassurance to stakeholders such as remote family members (RFMs) of older adults, who play a central role in care coordination. However, combining heterogeneous data streams into high-level, meaningful content - such as retrospective summaries - remains challenging. While recent work has demonstrated the promise of large language models (LLMs) for interpreting multi-modal tracking data, less attention has been given to generating narrative accounts for stakeholders like RFMs, who possess rich personal knowledge of older adults and strong emotional responsibility, yet have limited visibility into their daily lives and limited capacity for caregiving. In this work, we explore how LLMs can be used to generate retrospective summaries from multi-modal tracking data for RFMs of older adults. We leveraged and customized an existing system, Vital Insight, to generate initial summaries on different dates and data availability scenarios as technology probes, and conducted interviews with 11 RFMs to gather feedback. Based on these insights, we redesigned the system into a multi-layer, multi-agent, insight-driven summary approach that builds from objective statistics and descriptions to enriched, context-aware narratives. We then compared the redesigned summaries with the initial versions through a survey with the same 11 RFMs and found significant improvements in satisfaction, perceived helpfulness, trust, and willingness to receive the summaries. We conclude by presenting design implications for AI-generated summaries for RFMs and broader contexts, emphasizing the need to support RFMs' sensemaking shift from simply presenting ''What'' data were collected, to explaining ''How'' is my loved one doing and ''Why''.
Abstract（参考訳）: 現代のユビキタスコンピューティング技術の普及に伴い、マルチモーダルトラッキングシステムは、介護調整において中心的な役割を果たす高齢者の遠隔家族メンバー(RFM)のような利害関係者にタイムリーな認識と安心を提供するという約束を持っている。しかし、ヘテロジニアスなデータストリームを、レトロスペクティブの要約のような、高レベルで意味のあるコンテンツに組み合わせることはまだ難しいです。近年の研究では、マルチモーダル追跡データを解釈するための大規模言語モデル(LLMs)の可能性を実証しているが、高齢者の豊富な個人的知識と強い感情的責任を持つRAMsのような利害関係者の物語アカウントを作成することにはあまり注意が向けられていない。本研究では,高齢者のRAMのマルチモーダル追跡データから,LLMを用いて振り返りサマリーを生成する方法について検討する。既存のシステムであるVital Insightを活用して、さまざまな日付とデータ可用性シナリオを技術調査として生成し、11のRAMとのインタビューを行い、フィードバックを収集しました。これらの知見に基づいて、我々はシステムを多層・多エージェント・インサイト駆動の要約アプローチに再設計し、客観的な統計と記述からリッチでコンテキスト対応な物語へと構築した。次に、再設計した要約と、同じ11のRAMによる調査を通じての初期バージョンを比較し、満足度、有用性、信頼、そして要約を受け取る意志の大幅な改善を見出した。我々は、RAMとより広い文脈におけるAI生成サマリーの設計意味を提示し、単に'What'のデータが収集されたことを提示することから、'How'は私の愛するものであり、'Why'は'Why'なのかを説明するまで、RAMのセンスメイキングをサポートする必要性を強調した。

関連論文リスト

LIMSSR: LLM-Driven Sequence-to-Score Reasoning under Training-Time Incomplete Multimodal Observations [53.20772659095155]
本稿では、トレーニング時不完全観察において、より困難なIMLの設定に取り組む。本稿では,この課題を条件付きシーケンス推論タスクとして再構成したLIMSSR(LLM-Driven Incomplete Multimodal Sequence-to-Score Reasoning)を提案する。
論文参考訳（メタデータ） (2026-05-01T06:11:42Z)
Towards Multimodal Lifelong Understanding: A Dataset and Agentic Baseline [58.585692088008905]
MM-Lifelongはマルチモーダルライフロング理解のために設計されたデータセットである。撮影時間は181.1時間で、日、週、月の各スケールにまたがって構成され、様々な時間密度を捉えている。
論文参考訳（メタデータ） (2026-03-05T18:52:12Z)
CARPAS: Towards Content-Aware Refinement of Provided Aspects for Summarization in Large Language Models [16.41705871316774]
本稿では,CARPAS(Content-Aware Refinement of Productd aspects for Summarization)を紹介する。本稿では,関連する側面の数を予測するための予備的なサブタスクを提案し,その予測数が効果的なガイダンスとして機能できることを実証する。実験の結果,提案手法は全データセットのパフォーマンスを著しく向上させることがわかった。
論文参考訳（メタデータ） (2025-10-08T16:16:46Z)
MemInsight: Autonomous Memory Augmentation for LLM Agents [12.620141762922168]
セマンティックなデータ表現と検索機構を強化するために,自動メモリ拡張手法であるMemInsightを提案する。提案手法の有効性を,会話推薦,質問応答,イベント要約の3つのシナリオで実証的に検証した。
論文参考訳（メタデータ） (2025-03-27T17:57:28Z)
A Survey of Multimodal Retrieval-Augmented Generation [3.9616308910160445]
MRAG(Multimodal Retrieval-Augmented Generation)は、マルチモーダルデータ(テキスト、画像、ビデオ)を検索および生成プロセスに統合することにより、大規模言語モデル(LLM)を強化する。近年の研究では、MRAGは視覚とテキストの両方の理解を必要とするシナリオにおいて、従来の検索・拡張生成(RAG)よりも優れていることが示されている。
論文参考訳（メタデータ） (2025-03-26T02:43:09Z)
Composed Multi-modal Retrieval: A Survey of Approaches and Applications [81.54640206021757]
複合マルチモーダル検索(CMR)は次世代技術として誕生する。 CMRは、参照視覚入力とテキスト修正を統合することで、画像やビデオをクエリすることを可能にする。本稿では,CMRの基礎的課題,技術的進歩,応用について概説する。
論文参考訳（メタデータ） (2025-03-03T09:18:43Z)
RA-BLIP: Multimodal Adaptive Retrieval-Augmented Bootstrapping Language-Image Pre-training [55.54020926284334]
近年,MLLM (Multimodal Large Language Models) が注目されている。検索拡張技術はLLMとMLLMの両方に有効なプラグインであることが証明されている。本研究では,MLLMの新しい検索支援フレームワークであるRA-BLIP(Retrieval-Augmented Bootstrapping Language-Image Pre-training)を提案する。
論文参考訳（メタデータ） (2024-10-18T03:45:19Z)
From Linguistic Giants to Sensory Maestros: A Survey on Cross-Modal Reasoning with Large Language Models [56.9134620424985]
クロスモーダル推論(CMR)は、より高度な人工知能システムへの進化における重要な能力として、ますます認識されている。 CMRタスクに取り組むためにLLM(Large Language Models)をデプロイする最近のトレンドは、その有効性を高めるためのアプローチの新たな主流となっている。本調査では,LLMを用いてCMRで適用された現在の方法論を,詳細な3階層分類に分類する。
論文参考訳（メタデータ） (2024-09-19T02:51:54Z)
IM-RAG: Multi-Round Retrieval-Augmented Generation Through Learning Inner Monologues [10.280113107290067]
IM-RAGアプローチは、多ラウンドRAGをサポートするために、情報検索システムとLarge Language Models (LLM)を統合する。 IMプロセス全体が強化学習(Reinforcement Learning, RL)によって最適化され、プログレストラッカーが組み込まれ、中間段階の報酬が提供される。提案手法は, 赤外線モジュールの統合において高い柔軟性を提供しながら, 最先端(SOTA)性能を実現する。
論文参考訳（メタデータ） (2024-05-15T12:41:20Z)
Recent Advances in Hate Speech Moderation: Multimodality and the Role of Large Models [52.24001776263608]
この包括的調査は、HSモデレーションの最近の歩みを掘り下げている。大型言語モデル(LLM)と大規模マルチモーダルモデル(LMM)の急成長する役割を強調した。研究における既存のギャップを、特に表現不足言語や文化の文脈で特定する。
論文参考訳（メタデータ） (2024-01-30T03:51:44Z)

関連論文リストは本サイト内にある論文のタイトル・アブストラクトから自動的に作成しています。

指定された論文の情報です。
本サイトの運営者は本サイト（すべての情報・翻訳含む）の品質を保証せず、本サイト（すべての情報・翻訳含む）を使用して発生したあらゆる結果について一切の責任を負いません。