Fugu-MT Paper Translation (Abstract): Jobs' AI Exposure Should Be Measured from Evidence, Not Model Priors

Paper summary: Jobs' AI Exposure Should Be Measured from Evidence, Not Model Priors

arxiv url: http://arxiv.org/abs/2605.15474v1
Date: Thu, 14 May 2026 23:29:42 GMT
Status: Translation complete
Last updated in system: 2026-05-18 17:44:16.284183
Title: Jobs' AI Exposure Should Be Measured from Evidence, Not Model Priors
Title (reference translation): Jobs' AI exposure should be measured from evidence, not from model priors
Authors: Luca Mouchel, Pierre Bouquet, Yossi Sheffi
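Abstract summary: Job exposure to AI should be measured with grounded, evidence-based methods. Current theoretical exposure measures use zero-shot prompting to classify task-level AI exposure. We propose a retrieval-augmented framework that assigns AI exposure labels to the 18,796 occupation-task pairs in O*NET 30.2.
Reference score (independently computed attention measure): 2.4724375110574304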
License: http://creativecommons.org/licenses/by/4.0/
Abstract: This position paper argues that job exposure to AI should be measured with grounded, evidence-based methods, not inferred from LLM priors alone. Current theoretical exposure measures use zero-shot prompting to classify task-level AI exposure, generating labels with no explicit evidence, no transparent chain of reasoning, and no external validation. The stakes of these measurements are too high to rely on such methods, as they influence policy making, where public and private funds are directed, and how workers understand their future prospects. We therefore argue that AI capability claims should meet three standards: reproducibility, external grounding, and inspectability. We propose a retrieval-augmented framework that assigns AI exposure labels to all 18,796 occupation-task pairs in O*NET 30.2, using open-weight reasoning and instruct models with retrieved news articles and academic paper abstracts as evidence of current AI capabilities. Relative to a zero-shot baseline, the grounded condition is preferred in over 72% of disagreement cases under both automatic and human evaluation, and yields scores that align more closely with observed real-world AI usage. Taken together, these findings suggest that evidence-grounded measurement better captures what current AI systems can plausibly do in practice, rather than what a model asserts without external evidence. Because AI capabilities continue to change, the measurements used to inform policy must evolve with them: theoretical AI exposure scores should be periodically reassessed, not inherited as immutable ground truth.
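Abstract (reference translation): This position paper argues that job exposure to AI should be measured with grounded, evidence-based methods rather than inferred from LLM priors alone. Current theoretical exposure measures use zero-shot prompting to classify task-level AI exposure, producing labels with no explicit evidence, no transparent chain of reasoning, and no external validation. The stakes of these measurements are too high to rely on such methods, since they influence policy making, where public and private funds are directed, and how workers understand their future prospects. We therefore argue that AI capability claims should meet three standards: reproducibility, external grounding, and inspectability. We propose a retrieval-augmented framework that assigns AI exposure labels to the 18,796 occupation-task pairs in O*NET 30.2. Relative to a zero-shot baseline, the grounded condition is preferred in over 72% of disagreement cases under both automatic and human evaluation, and yields scores that align more closely with real-world AI usage. These results suggest that evidence-grounded measurement better captures what current AI systems can actually do, rather than what a model asserts without external evidence. Theoretical AI exposure scores should be periodically reassessed, not inherited as immutable ground truth.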

Related papers