Fugu-MT paper translation (abstract): MEC$^3$O: Multi-Expert Consensus for Code Time Complexity Prediction

Paper abstract: MEC$^3$O: Multi-Expert Consensus for Code Time Complexity Prediction

arxiv url: http://arxiv.org/abs/2510.09049v1
Date: Fri, 10 Oct 2025 06:34:49 GMT
Status: translation complete
Last updated in system: 2025-10-14 00:38:48.29005
Title: MEC$^3$O: Multi-Expert Consensus for Code Time Complexity Prediction
Title (reference translation): MEC$^3$O: Multi-Expert Consensus for Code Time Complexity Prediction
Authors: Joonghyuk Hahn, Soohan Lim, Yo-Sub Han
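Abstract summary: We propose MEC$^3$O, a multi-expert consensus system for predicting the time complexity of code. In experiments on CodeComplex, MEC$^3$O achieves at least 10% higher accuracy and macro-F1 scores than the open-source baselines. This demonstrates the effectiveness of multi-expert debate and a weighted consensus strategy for generating the final prediction.
Reference score (independently computed attention score): 6.644994424048165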
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: Predicting the complexity of source code is essential for software development and algorithm analysis. Recently, Baik et al. (2025) introduced CodeComplex for code time complexity prediction. The paper shows that LLMs without fine-tuning struggle with certain complexity classes. This suggests that no single LLM excels at every class, but rather each model shows advantages in certain classes. We propose MEC$^3$O, a multi-expert consensus system, which extends multi-agent debate frameworks. MEC$^3$O assigns LLMs to complexity classes based on their performance and provides them with class-specialized instructions, turning them into experts. These experts engage in structured debates, and their predictions are integrated through a weighted consensus mechanism. Assigning expertise to LLMs in this way mitigates Degeneration-of-Thought, reduces reliance on a separate judge model, and prevents convergence to incorrect majority opinions. Experiments on CodeComplex show that MEC$^3$O outperforms the open-source baselines, achieving at least 10% higher accuracy and macro-F1 scores. It also surpasses GPT-4o-mini in macro-F1 scores on average and achieves F1 scores on par with GPT-4o and GPT-o4-mini on average. This demonstrates the effectiveness of multi-expert debates and the weighted consensus strategy for generating the final predictions. Our code and data are available at https://github.com/suhanmen/MECO.
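Abstract (reference translation): Predicting the complexity of source code is essential for software development and algorithm analysis. Baik et al. (2025) recently introduced CodeComplex for predicting code time complexity. That paper shows that LLMs without fine-tuning struggle with certain complexity classes. This suggests that no single LLM excels in every class; rather, each model has an advantage in particular classes. We propose MEC$^3$O, a multi-expert consensus system that extends multi-agent debate frameworks. MEC$^3$O assigns LLMs to complexity classes based on their performance and gives them class-specialized instructions, making them experts. These experts engage in structured debates, and their predictions are integrated through a weighted consensus mechanism. Assigning expertise to the LLMs handles Degeneration-of-Thought effectively, reduces reliance on a separate judge model, and prevents convergence to incorrect majority opinions. Experiments on CodeComplex show that MEC$^3$O outperforms the open-source baselines, achieving at least 10% higher accuracy and macro-F1 scores. It also surpasses GPT-4o-mini in macro-F1 on average and is on average competitive with GPT-4o and GPT-o4-mini. This demonstrates the effectiveness of multi-expert debate and the weighted consensus strategy for generating the final prediction. The code and data are available at https://github.com/suhanmen/MECO.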
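The abstract describes assigning each LLM to the complexity classes where it performs best and then combining predictions through a weighted consensus. The sketch below illustrates that general idea only. The model names, the per-class scores, and the rule that a model's vote counts more when it predicts its own expert class are all assumptions made for illustration; the abstract does not specify the paper's actual weighting scheme or debate procedure.

```python
from collections import defaultdict

# Hypothetical per-class validation scores (illustrative numbers, not from the paper).
CLASS_SCORES = {
    "model_a": {"O(1)": 0.9, "O(n)": 0.6, "O(n^2)": 0.5},
    "model_b": {"O(1)": 0.5, "O(n)": 0.8, "O(n^2)": 0.6},
    "model_c": {"O(1)": 0.4, "O(n)": 0.5, "O(n^2)": 0.9},
}


def assign_experts(class_scores):
    """Map each complexity class to the model with the best score on it."""
    classes = next(iter(class_scores.values())).keys()
    return {c: max(class_scores, key=lambda m: class_scores[m][c]) for c in classes}


def weighted_consensus(predictions, experts, expert_weight=2.0, base_weight=1.0):
    """Return the class with the largest total vote weight.

    Assumed rule: a vote gets expert_weight when the model predicts the class
    it is the assigned expert for, and base_weight otherwise.
    """
    totals = defaultdict(float)
    for model, predicted in predictions.items():
        is_expert_vote = experts.get(predicted) == model
        totals[predicted] += expert_weight if is_expert_vote else base_weight
    return max(totals, key=totals.get)


experts = assign_experts(CLASS_SCORES)
final = weighted_consensus(
    {"model_a": "O(n)", "model_b": "O(n)", "model_c": "O(n^2)"},
    experts,
)
print(experts)
print(final)
```

With these made-up scores, model_b is the assigned expert for O(n), so its vote for O(n) carries the higher weight and O(n) wins over the single expert vote for O(n^2).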

Related papers