Fugu-MT 論文翻訳(概要): Beyond the Surface: A Solution-Aware Retrieval Model for Competition-level Code Generation

論文の概要: Beyond the Surface: A Solution-Aware Retrieval Model for Competition-level Code Generation

arxiv url: http://arxiv.org/abs/2509.01129v1
Date: Mon, 01 Sep 2025 04:47:51 GMT
ステータス: 翻訳完了
システム内更新日: 2025-09-04 15:17:03.547609
Title: Beyond the Surface: A Solution-Aware Retrieval Model for Competition-level Code Generation
Title（参考訳）: サーフェスを超えて:競争レベルのコード生成のためのソリューション対応検索モデル
Authors: Shiwen Zhang, Lingxiang Wang, Hainan Zhang, Ziwei Wang, Sijia Wen, Zhiming Zheng,
Abstract要約: 競合プログラミングタスクのための合成データによって強化された解認識ランキングモデルであるSolveRankを提案する。具体的には、DeepSeek-R1モデルを用いて、解の整合性についてGPT-4oで検証した、等価だが異なる言い換えられた新しい問題を生成する。推論中、SolveRankは、ダウンストリームコードジェネレータを支援するために、コーパスから関連する問題と対応するコードを取得する。
参考スコア（独自算出の注目度）: 12.805684772054684
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: In competitive programming task, problem statements are often embedded within elaborate narrative backgrounds, requiring deep understanding of the underlying solutions to successfully complete the tasks. Current code generation models primarily focus on token-level semantic modeling, highly susceptible to distractions from irrelevant narrative statements. Inspired by RAG, retrieving reference code with similar solutions may help enhance model performance on difficult problems. However, existing retrieval models also emphasize surface-level semantic similarity, neglecting the deeper solution-level logical similarities that are critical in competitive programming. Therefore, designing ranking models capable of accurately identifying and retrieving problems and corresponding codes remains an urgent research problem in competitive code generation. In this paper, we propose SolveRank, a solution-aware ranking model empowered by synthetic data for competitive programming tasks. Specifically, we leverage the DeepSeek-R1 model to generate logically equivalent but differently phrased new problems, verified by GPT-4o for solution consistency. Then, we train SolveRank with these as positive samples and BM25/random-retrieved problems as negatives. During inference, SolveRank retrieves relevant problems and corresponding code from the corpus to assist a downstream code generator. Experiments on the xCodeEval dataset demonstrate that SolveRank outperforms SOTA ranking methods in precision and recall metrics, and boosts code generation performance for difficult problems.
Abstract（参考訳）: 競合プログラミングタスクでは、問題ステートメントは複雑な物語の背景に埋め込まれることが多く、タスクを成功させるために基礎となるソリューションを深く理解する必要がある。現在のコード生成モデルは、主にトークンレベルのセマンティックモデリングに焦点を当てている。 RAGにインスパイアされた、同様のソリューションで参照コードを取得することは、難しい問題におけるモデルパフォーマンスを向上させるのに役立つかもしれない。しかし、既存の検索モデルは、競合プログラミングにおいて重要な深い解レベルの論理的類似性を無視し、表面レベルのセマンティックな類似性も強調している。したがって、問題の特定と検索を正確に行うことができるランク付けモデルを設計することは、競争コード生成において急激な研究課題である。本稿では、競合プログラミングタスクのための合成データによって強化された解認識ランキングモデルであるSolveRankを提案する。具体的には,DeepSeek-R1モデルを用いて論理的に等価だが,解の整合性について GPT-4o で検証した新しい問題を生成する。次に、SolveRankを正のサンプルとして、BM25/ランダム検索問題を負としてトレーニングする。推論中、SolveRankは、ダウンストリームコードジェネレータを支援するために、コーパスから関連する問題と対応するコードを取得する。 xCodeEvalデータセットの実験では、SolveRankはSOTAランキングメソッドよりも精度とリコールの指標が優れており、難しい問題に対してコード生成のパフォーマンスが向上している。

論文の概要: Beyond the Surface: A Solution-Aware Retrieval Model for Competition-level Code Generation

関連論文リスト