Fugu-MT 論文翻訳(概要): Do LLMs Favor Their Providers? Measuring Vertical Integration Bias in Code Generation

論文の概要: Do LLMs Favor Their Providers? Measuring Vertical Integration Bias in Code Generation

arxiv url: http://arxiv.org/abs/2605.28515v1
Date: Wed, 27 May 2026 14:17:06 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-28 17:38:56.10289
Title: Do LLMs Favor Their Providers? Measuring Vertical Integration Bias in Code Generation
Title（参考訳）: LLMはプロバイダを好んでいるか? コード生成における垂直統合バイアスの測定
Authors: Melih Catal, Alex Wolf, Tiago Ferreiro Matos, Pooja Rani, Harald Gall,
Abstract要約: 多くの大きな言語モデル(LLM)は特定のプロバイダに関連付けられている。これにより、生成されたコードが、同等の選択肢よりもプロバイダ自身のエコシステムを好むかどうかという疑問が持ち上がる。コード生成におけるVertical Integration Bias (VIB) の測定ベンチマークである textscVIBench を導入する。
参考スコア（独自算出の注目度）: 2.5322673002308362
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large Language Models (LLMs) have become an integral part of software development, especially with the advent of agentic capabilities. Yet, many frontier LLMs are affiliated with specific providers. This raises the question of whether generated code favors the provider's own ecosystem over comparable alternatives, potentially constraining developers' choices and increasing dependence on a single provider. We define this behavior as Vertical Integration Bias (VIB) and introduce \textsc{VIBench}, a benchmark for measuring VIB in direct and agentic code generation across $20$ provider-selectable software-integration scenarios. Evaluating $10$ frontier provider-affiliated models against $3$ non-affiliated controls, we find positive VIB in direct generation, with six of ten affiliated models showing statistically significant effects up to $+18.8$ percentage points (pp). Agentic workflows further amplify VIB, reaching $+39.2$ pp. Moreover, early affiliated-ecosystem choices in agentic workflows can persist into conceptually decoupled downstream files, with persistence as high as $90.3\%$. These findings underscore the need to measure and account for VIB in code generation, especially as agentic capabilities become more prevalent.
Abstract（参考訳）: 大規模言語モデル(LLM)は、特にエージェント能力の出現によって、ソフトウェア開発において不可欠な部分となっている。しかし、多くのフロンティアLSMは特定のプロバイダと提携している。これにより、生成されたコードは、同等の選択肢よりもプロバイダ自身のエコシステムを好んでおり、開発者の選択を制限し、単一のプロバイダへの依存を増大させる可能性がある、という疑問が持ち上がる。我々は、この振る舞いをVertical Integration Bias (VIB) と定義し、20ドルのプロバイダ選択可能なソフトウェア統合シナリオにまたがる直接的およびエージェント的コード生成におけるVIBを測定するベンチマークである \textsc{VIBench} を導入する。 10ドル(約10万円)のフロンティア・プロバイダ関連モデルと3ドル(約3,800円)の非関連コントロールを比べると、直接的にVIBが陽性となり、その内6つが統計的に有意な効果を+18.8ドルのパーセンテージポイント(pp)まで示す。エージェントワークフローはさらにVIBを増幅し、$+39.2$ pp。さらに、エージェントワークフローにおける初期の関連するエコシステムの選択は、概念的に分離された下流ファイルに持続し、永続性は90.3\%$である。これらの発見は、特にエージェント機能がより普及するにつれて、コード生成におけるVIBの測定と説明の必要性を浮き彫りにしている。

論文の概要: Do LLMs Favor Their Providers? Measuring Vertical Integration Bias in Code Generation

関連論文リスト