Fugu-MT 論文翻訳(概要): Tokenomics: Quantifying Where Tokens Are Used in Agentic Software Engineering

論文の概要: Tokenomics: Quantifying Where Tokens Are Used in Agentic Software Engineering

arxiv url: http://arxiv.org/abs/2601.14470v1
Date: Tue, 20 Jan 2026 20:52:14 GMT
ステータス: 翻訳完了
システム内更新日: 2026-01-22 21:27:50.146894
Title: Tokenomics: Quantifying Where Tokens Are Used in Agentic Software Engineering
Title（参考訳）: Tokenomics: エージェントソフトウェアエンジニアリングにおけるトークンの使い方の定量化
Authors: Mohamad Salim, Jasmine Latendresse, SayedHassan Khatoonabadi, Emad Shihab,
Abstract要約: SDLC(Software Development Life Cycle)におけるLCM-MAシステムにおけるトークン消費パターンの分析を行う。 GPT-5推論モデルを用いて、ChatDevフレームワークによって実行される30のソフトウェア開発タスクの実行トレースを分析する。予備的な結果は、反復コードレビューの段階が平均59.4%のトークン消費の大多数を占めていることを示している。
参考スコア（独自算出の注目度）: 4.812321790984494
License: http://creativecommons.org/licenses/by/4.0/
Abstract: LLM-based Multi-Agent (LLM-MA) systems are increasingly applied to automate complex software engineering tasks such as requirements engineering, code generation, and testing. However, their operational efficiency and resource consumption remain poorly understood, hindering practical adoption due to unpredictable costs and environmental impact. To address this, we conduct an analysis of token consumption patterns in an LLM-MA system within the Software Development Life Cycle (SDLC), aiming to understand where tokens are consumed across distinct software engineering activities. We analyze execution traces from 30 software development tasks performed by the ChatDev framework using a GPT-5 reasoning model, mapping its internal phases to distinct development stages (Design, Coding, Code Completion, Code Review, Testing, and Documentation) to create a standardized evaluation framework. We then quantify and compare token distribution (input, output, reasoning) across these stages. Our preliminary findings show that the iterative Code Review stage accounts for the majority of token consumption for an average of 59.4% of tokens. Furthermore, we observe that input tokens consistently constitute the largest share of consumption for an average of 53.9%, providing empirical evidence for potentially significant inefficiencies in agentic collaboration. Our results suggest that the primary cost of agentic software engineering lies not in initial code generation but in automated refinement and verification. Our novel methodology can help practitioners predict expenses and optimize workflows, and it directs future research toward developing more token-efficient agent collaboration protocols.
Abstract（参考訳）: LLMベースのMulti-Agent (LLM-MA) システムは、要求工学、コード生成、テストといった複雑なソフトウェアエンジニアリングタスクの自動化にますます応用されている。しかし、その運用効率と資源消費はよく理解されていないため、予測不可能なコストと環境への影響により実践的な採用を妨げている。そこで我々は,SDLC(Software Development Life Cycle)におけるLCM-MAシステムにおけるトークン消費パターンの分析を行い,異なるソフトウェアエンジニアリング活動においてトークンがどこに消費されているかを理解することを目的とした。 GPT-5推論モデルを用いて、ChatDevフレームワークによって実行される30のソフトウェア開発タスクの実行トレースを分析し、内部フェーズを異なる開発ステージ(設計、コーディング、コード補完、コードレビュー、テスト、ドキュメント)にマッピングし、標準化された評価フレームワークを作成します。次に、これらのステージ間でトークンの分布(入力、出力、推論)を定量化し比較します。予備的な結果は、反復コードレビューの段階が平均59.4%のトークン消費の大多数を占めていることを示している。さらに,入力トークンは平均53.9%の消費率で一貫して最大のシェアを占めており,エージェント協調において潜在的に有意な非効率性を示す実証的な証拠を提供する。この結果から,エージェントソフトウェア工学の主なコストは,初期コード生成ではなく,自動修正と検証にあることが示唆された。提案手法は,費用予測やワークフローの最適化に有効であり,トークン効率の高いエージェント協調プロトコルの開発に向けた今後の研究を導くものである。

論文の概要: Tokenomics: Quantifying Where Tokens Are Used in Agentic Software Engineering

関連論文リスト