Fugu-MT 論文翻訳(概要): VERA-MH: Reliability and Validity of an Open-Source AI Safety Evaluation in Mental Health

論文の概要: VERA-MH: Reliability and Validity of an Open-Source AI Safety Evaluation in Mental Health

arxiv url: http://arxiv.org/abs/2602.05088v1
Date: Wed, 04 Feb 2026 22:17:04 GMT
ステータス: 翻訳完了
システム内更新日: 2026-02-06 18:49:08.644339
Title: VERA-MH: Reliability and Validity of an Open-Source AI Safety Evaluation in Mental Health
Title（参考訳）: VERA-MH:メンタルヘルスにおけるオープンソースのAI安全性評価の信頼性と妥当性
Authors: Kate H. Bentley, Luca Belli, Adam M. Chekroud, Emily J. Ward, Emily R. Dworkin, Emily Van Ark, Kelly M. Johnston, Will Alexander, Millard Brown, Matt Hawrilenko,
Abstract要約: メンタルヘルス(VERA-MH)評価における倫理的かつ責任のあるAIの検証は、証拠に基づく自動安全ベンチマークの緊急の必要性を満たすために最近提案された。本研究は,自殺リスク検出および応答におけるAI安全性に対するVERA-MH評価の臨床的妥当性と信頼性を検討することを目的とした。 Findingsは、メンタルヘルスのためのオープンソースで完全に自動化されたAI安全評価であるVERA-MHの臨床的妥当性と信頼性をサポートする。
参考スコア（独自算出の注目度）: 0.0
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Millions now use leading generative AI chatbots for psychological support. Despite the promise related to availability and scale, the single most pressing question in AI for mental health is whether these tools are safe. The Validation of Ethical and Responsible AI in Mental Health (VERA-MH) evaluation was recently proposed to meet the urgent need for an evidence-based automated safety benchmark. This study aimed to examine the clinical validity and reliability of the VERA-MH evaluation for AI safety in suicide risk detection and response. We first simulated a large set of conversations between large language model (LLM)-based users (user-agents) and general-purpose AI chatbots. Licensed mental health clinicians used a rubric (scoring guide) to independently rate the simulated conversations for safe and unsafe chatbot behaviors, as well as user-agent realism. An LLM-based judge used the same scoring rubric to evaluate the same set of simulated conversations. We then compared rating alignment across (a) individual clinicians and (b) clinician consensus and the LLM judge, and (c) examined clinicians' ratings of user-agent realism. Individual clinicians were generally consistent with one another in their safety ratings (chance-corrected inter-rater reliability [IRR]: 0.77), thus establishing a gold-standard clinical reference. The LLM judge was strongly aligned with this clinical consensus (IRR: 0.81) overall and within key conditions. Clinician raters generally perceived the user-agents to be realistic. For the potential mental health benefits of AI chatbots to be realized, attention to safety is paramount. Findings from this human evaluation study support the clinical validity and reliability of VERA-MH: an open-source, fully automated AI safety evaluation for mental health. Further research will address VERA-MH generalizability and robustness.
Abstract（参考訳）: 何百万人もの人たちが、心理的サポートのために主要なAIチャットボットを使っている。可用性とスケールに関する約束にもかかわらず、AIにおけるメンタルヘルスに関する最も強い疑問は、これらのツールが安全かどうかである。メンタルヘルス(VERA-MH)評価における倫理的かつ責任のあるAIの検証は、証拠に基づく自動安全ベンチマークの緊急の必要性を満たすために最近提案された。本研究は,自殺リスク検出および応答におけるAI安全性に対するVERA-MH評価の臨床的妥当性と信頼性を検討することを目的とした。まず,大規模言語モデル(LLM)ベースのユーザ(ユーザエージェント)と汎用AIチャットボットとの対話をシミュレーションした。免許を受けた精神保健医は、シミュレーションされた会話を安全で安全でないチャットボットの行動とユーザエージェントリアリズムと独立して評価するために、ルーリック(スコーリングガイド)を使用した。 LLMベースの裁判官は、同じスコアリングルーブリックを使用して、シミュレーションされた会話のセットを評価した。その後、レーティングアライメントを比較した。 (a)個別臨床医 b)臨床医のコンセンサスとLCM審査員 (c) 臨床医のユーザ・エージェント・リアリズムの評価について検討した。個々の臨床医は、安全評価において概して一致し(整合性整合性インターレータ信頼性(IRR: 0.77))、金標準臨床基準を確立した。 LLM審査員は、この臨床コンセンサス(IRR: 0.81)と鍵条件の範囲内で強く一致した。臨床検査官は一般的にユーザエージェントが現実的であると認識した。 AIチャットボットの潜在的なメンタルヘルス上のメリットを実現するためには、安全への注意が最重要である。この人的評価研究から得られた知見は、メンタルヘルスのためのオープンソースで完全に自動化されたAI安全評価であるVERA-MHの臨床的妥当性と信頼性を支持する。さらなる研究は、VERA-MHの一般化性と堅牢性に対処する。

論文の概要: VERA-MH: Reliability and Validity of an Open-Source AI Safety Evaluation in Mental Health

関連論文リスト