Fugu-MT 論文翻訳(概要): Do Large Language Models Show Decision Heuristics Similar to Humans? A Case Study Using GPT-3.5

論文の概要: Do Large Language Models Show Decision Heuristics Similar to Humans? A Case Study Using GPT-3.5

arxiv url: http://arxiv.org/abs/2305.04400v1
Date: Mon, 8 May 2023 01:02:52 GMT
ステータス: 翻訳完了
システム内更新日: 2023-05-09 16:03:47.786653
Title: Do Large Language Models Show Decision Heuristics Similar to Humans? A Case Study Using GPT-3.5
Title（参考訳）: 大きな言語モデルは、人間に似た決定ヒューリスティックを示すか? GPT-3.5を用いた一症例
Authors: Gaurav Suri, Lily R. Slater, Ali Ziaee, Morgan Nguyen
Abstract要約: GPT-3.5は、ChatGPTと呼ばれる会話エージェントをサポートするLLMの例である。本研究では,ChatGPTがバイアスを示すか,その他の決定効果を示すかを決定するために,一連の新しいプロンプトを用いた。また、同じプロンプトをヒトでもテストしました。
参考スコア（独自算出の注目度）: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: A Large Language Model (LLM) is an artificial intelligence system that has been trained on vast amounts of natural language data, enabling it to generate human-like responses to written or spoken language input. GPT-3.5 is an example of an LLM that supports a conversational agent called ChatGPT. In this work, we used a series of novel prompts to determine whether ChatGPT shows heuristics, biases, and other decision effects. We also tested the same prompts on human participants. Across four studies, we found that ChatGPT was influenced by random anchors in making estimates (Anchoring Heuristic, Study 1); it judged the likelihood of two events occurring together to be higher than the likelihood of either event occurring alone, and it was erroneously influenced by salient anecdotal information (Representativeness and Availability Heuristic, Study 2); it found an item to be more efficacious when its features were presented positively rather than negatively - even though both presentations contained identical information (Framing Effect, Study 3); and it valued an owned item more than a newly found item even though the two items were identical (Endowment Effect, Study 4). In each study, human participants showed similar effects. Heuristics and related decision effects in humans are thought to be driven by cognitive and affective processes such as loss aversion and effort reduction. The fact that an LLM - which lacks these processes - also shows such effects invites consideration of the possibility that language may play a role in generating these effects in humans.
Abstract（参考訳）: 大規模言語モデル(英: large language model、llm)は、膨大な量の自然言語データに基づいて訓練された人工知能システムである。 GPT-3.5は、ChatGPTと呼ばれる会話エージェントをサポートするLLMの例である。本研究では,ChatGPTがヒューリスティックス,バイアス,その他の決定効果を示すかどうかを判定するために,一連の新しいプロンプトを用いた。また、同じプロンプトをヒトでもテストしました。 Across four studies, we found that ChatGPT was influenced by random anchors in making estimates (Anchoring Heuristic, Study 1); it judged the likelihood of two events occurring together to be higher than the likelihood of either event occurring alone, and it was erroneously influenced by salient anecdotal information (Representativeness and Availability Heuristic, Study 2); it found an item to be more efficacious when its features were presented positively rather than negatively - even though both presentations contained identical information (Framing Effect, Study 3); and it valued an owned item more than a newly found item even though the two items were identical (Endowment Effect, Study 4). それぞれの研究で、人間の被験者も同様の効果を示した。ヒトのヒューリスティックと関連する意思決定効果は、損失回避や労力削減といった認知的および感情的なプロセスによって引き起こされると考えられている。これらのプロセスが欠如しているLLMは、そのような効果も示しているという事実は、言語がこれらの効果を人体で生成する役割を担っている可能性を考慮させる。

関連論文リスト

Humanity in AI: Detecting the Personality of Large Language Models [0.0]
アンケートは大規模言語モデル(LLM)の個性を検出する一般的な方法である本稿では,テキストマイニングとアンケート手法の組み合わせを提案する。 LLMのパーソナリティは、事前訓練されたデータから導かれる。
論文参考訳（メタデータ） (2024-10-11T05:53:11Z)
Cross-lingual Speech Emotion Recognition: Humans vs. Self-Supervised Models [16.0617753653454]
本研究では,人間のパフォーマンスとSSLモデルの比較分析を行った。また、モデルと人間のSER能力を発話レベルとセグメントレベルの両方で比較する。その結果,適切な知識伝達を行うモデルでは,対象言語に適応し,ネイティブ話者に匹敵する性能が得られることがわかった。
論文参考訳（メタデータ） (2024-09-25T13:27:17Z)
Rel-A.I.: An Interaction-Centered Approach To Measuring Human-LM Reliance [73.19687314438133]
インタラクションの文脈的特徴が依存に与える影響について検討する。文脈特性が人間の信頼行動に大きく影響していることが判明した。これらの結果から,キャリブレーションと言語品質だけでは人間とLMの相互作用のリスクを評価するには不十分であることが示唆された。
論文参考訳（メタデータ） (2024-07-10T18:00:05Z)
Modulating Language Model Experiences through Frictions [56.17593192325438]
言語モデルの過度な消費は、短期において未確認エラーを伝播し、長期的な批判的思考のために人間の能力を損なうリスクを出力する。行動科学の介入にインスパイアされた言語モデル体験のための選択的摩擦を提案し,誤用を抑える。
論文参考訳（メタデータ） (2024-06-24T16:31:11Z)
Towards a Psychology of Machines: Large Language Models Predict Human Memory [0.0]
大規模言語モデル(LLM)は自然言語処理において顕著な能力を示している。本研究では,LLMが庭道文や文脈情報を含むタスクにおいて,人間の記憶性能を予測できるかどうかを検討する。
論文参考訳（メタデータ） (2024-03-08T08:41:14Z)
Divergences between Language Models and Human Brains [59.100552839650774]
我々は,人間と機械語処理の相違点を体系的に探求する。我々は、LMがうまく捉えられない2つの領域、社会的/感情的知性と身体的常識を識別する。以上の結果から,これらの領域における微調整LMは,ヒト脳反応との整合性を向上させることが示唆された。
論文参考訳（メタデータ） (2023-11-15T19:02:40Z)
Sensitivity, Performance, Robustness: Deconstructing the Effect of Sociodemographic Prompting [64.80538055623842]
社会デマトグラフィープロンプトは、特定の社会デマトグラフィープロファイルを持つ人間が与える答えに向けて、プロンプトベースのモデルの出力を操縦する技術である。ソシオデマトグラフィー情報はモデル予測に影響を及ぼし、主観的NLPタスクにおけるゼロショット学習を改善するのに有用であることを示す。
論文参考訳（メタデータ） (2023-09-13T15:42:06Z)
Cognitive Effects in Large Language Models [14.808777775761753]
大規模言語モデル(LLM)は、過去1年で大きな注目を集め、現在、数億人の人々が毎日利用しています。我々はこれらのモデルのうちの1つ(GPT-3)を、人間の認知タスクで通常見られる系統的なパターンである認知効果についてテストした。具体的には, プライミング, 距離, SNARC, サイズ共役効果をGPT-3で示し, アンカー効果は欠如していた。
論文参考訳（メタデータ） (2023-08-28T06:30:33Z)
Instructed to Bias: Instruction-Tuned Language Models Exhibit Emergent Cognitive Bias [57.42417061979399]
近年の研究では、インストラクションチューニング(IT)と人間フィードバック(RLHF)による強化学習によって、大規模言語モデル(LM)の能力が劇的に向上していることが示されている。本研究では,ITとRLHFがLMの意思決定と推論に与える影響について検討する。以上の結果から,GPT-3,Mistral,T5ファミリーの各種モデルにおけるこれらのバイアスの存在が示唆された。
論文参考訳（メタデータ） (2023-08-01T01:39:25Z)
Susceptibility to Influence of Large Language Models [5.931099001882958]
2つの研究は、大きな言語モデル(LLM)が、影響力のある入力への暴露後の心理的変化をモデル化できるという仮説を検証した。最初の研究では、Illusory Truth Effect(ITE)という一般的な影響のモードがテストされた。第2の研究では、その説得力と政治的動員力を高めるために、ニュースの大衆的なフレーミングという、特定の影響の態勢について論じている。
論文参考訳（メタデータ） (2023-03-10T16:53:30Z)
Is the Language Familiarity Effect gradual? A computational modelling approach [14.83230292969134]
本研究では、言語親和性効果のモデルを用いて、その効果の段階的な測定値が得られることを示す。この効果は幅広い言語にまたがって再現され、その普遍性のさらなる証拠となる。また,LFEの段階的尺度に基づいて,同じ家系に属する言語がLFEに与える影響を裏付ける結果を得た。
論文参考訳（メタデータ） (2022-06-27T16:08:42Z)
Naturalistic Causal Probing for Morpho-Syntax [76.83735391276547]
スペインにおける実世界のデータに対する入力レベルの介入に対する自然主義的戦略を提案する。提案手法を用いて,共同設立者から文章中の形態・症状の特徴を抽出する。本研究では,事前学習したモデルから抽出した文脈化表現に対する性別と数字の因果効果を解析するために,本手法を適用した。
論文参考訳（メタデータ） (2022-05-14T11:47:58Z)
Model-based analysis of brain activity reveals the hierarchy of language in 305 subjects [82.81964713263483]
言語の神経基盤を分解する一般的なアプローチは、個人間で異なる刺激に対する脳の反応を関連付けている。そこで本研究では,自然刺激に曝露された被験者に対して,モデルに基づくアプローチが等価な結果が得られることを示す。
論文参考訳（メタデータ） (2021-10-12T15:30:21Z)

関連論文リストは本サイト内にある論文のタイトル・アブストラクトから自動的に作成しています。

指定された論文の情報です。
本サイトの運営者は本サイト（すべての情報・翻訳含む）の品質を保証せず、本サイト（すべての情報・翻訳含む）を使用して発生したあらゆる結果について一切の責任を負いません。