Fugu-MT 論文翻訳(概要): CN-Buzz2Portfolio: A Chinese-Market Dataset and Benchmark for LLM-Based Macro and Sector Asset Allocation from Daily Trending Financial News

論文の概要: CN-Buzz2Portfolio: A Chinese-Market Dataset and Benchmark for LLM-Based Macro and Sector Asset Allocation from Daily Trending Financial News

arxiv url: http://arxiv.org/abs/2603.22305v1
Date: Wed, 18 Mar 2026 02:31:28 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-25 19:53:37.041142
Title: CN-Buzz2Portfolio: A Chinese-Market Dataset and Benchmark for LLM-Based Macro and Sector Asset Allocation from Daily Trending Financial News
Title（参考訳）: CN-Buzz2Portfolio:日次トレンドファイナンシャルニュースからLLMベースのマクロおよびセクターアセットの中国市場データセットとベンチマーク
Authors: Liyuan Chen, Shilong Li, Jiangpeng Yan, Shuoling Liu, Qiang Yang, Xiu Li,
Abstract要約: 大規模言語モデル(LLM)は、静的自然言語処理(NLP)タスクから動的意思決定エージェントへと急速に移行している。直接のライブトレーディングは非生産的であり、スキルに幸運を抱くことによって結果バイアスを起こす傾向にあるが、既存の静的ベンチマークはエンティティレベルの株式選択に限定され、より広範な市場注目を無視することが多い。この研究は、一般的な推論と金融決定の整合性に関する新たな洞察を与え、持続可能な金融エージェントの研究を促進するために、すべてのデータ、コード、実験がリリースされる。
参考スコア（独自算出の注目度）: 21.498084305999992
License: http://creativecommons.org/licenses/by-nc-sa/4.0/
Abstract: Large Language Models (LLMs) are rapidly transitioning from static Natural Language Processing (NLP) tasks including sentiment analysis and event extraction to acting as dynamic decision-making agents in complex financial environments. However, the evolution of LLMs into autonomous financial agents faces a significant dilemma in evaluation paradigms. Direct live trading is irreproducible and prone to outcome bias by confounding luck with skill, whereas existing static benchmarks are often confined to entity-level stock picking and ignore broader market attention. To facilitate the rigorous analysis of these challenges, we introduce CN-Buzz2Portfolio, a reproducible benchmark grounded in the Chinese market that maps daily trending news to macro and sector asset allocation. Spanning a rolling horizon from 2024 to mid-2025, our dataset simulates a realistic public attention stream, requiring agents to distill investment logic from high-exposure narratives instead of pre-filtered entity news. We propose a Tri-Stage CPA Agent Workflow involving Compression, Perception, and Allocation to evaluate LLMs on broad asset classes such as Exchange Traded Funds (ETFs) rather than individual stocks, thereby reducing idiosyncratic volatility. Extensive experiments on nine LLMs reveal significant disparities in how models translate macro-level narratives into portfolio weights. This work provides new insights into the alignment between general reasoning and financial decision-making, and all data, codes, and experiments are released to promote sustainable financial agent research.
Abstract（参考訳）: 大規模言語モデル(LLM)は、感情分析やイベント抽出を含む静的自然言語処理(NLP)タスクから、複雑な金融環境における動的意思決定エージェントとしての機能へと急速に移行している。しかし、LLMの自律的金融エージェントへの進化は、評価パラダイムにおいて大きなジレンマに直面している。直接のライブトレーディングは非生産的であり、スキルに幸運を抱くことによって結果バイアスを起こす傾向にあるが、既存の静的ベンチマークはエンティティレベルの株式選択に限定され、より広範な市場注目を無視することが多い。これらの課題の厳密な分析を容易にするため,中国市場において,毎日のトレンドニュースをマクロやセクターのアセットアロケーションにマッピングする再現可能なベンチマークであるCN-Buzz2Portfolioを導入する。 2024年から2025年半ばにかけて、我々のデータセットは現実的な大衆の注目の流れをシミュレートし、エージェントは事前にフィルターされたエンティティニュースではなく、高露出の物語から投資ロジックを抽出する必要がある。本稿では,圧縮・知覚・割当を含む3段階のCPAエージェントワークフローを提案し,個々の株式よりもETF(Exchange Traded Funds)などの幅広い資産クラスでLCMを評価することにより,慣用的ボラティリティの低減を図る。 9つのLLMに関する大規模な実験は、モデルがマクロレベルの物語をポートフォリオの重みにどのように変換するかにおいて、大きな違いを示している。この研究は、一般的な推論と金融決定の整合性に関する新たな洞察を与え、持続可能な金融エージェントの研究を促進するために、すべてのデータ、コード、実験がリリースされる。

論文の概要: CN-Buzz2Portfolio: A Chinese-Market Dataset and Benchmark for LLM-Based Macro and Sector Asset Allocation from Daily Trending Financial News

関連論文リスト