Fugu-MT 論文翻訳(概要): LLM Sparsity Prior for Robust Feature Selection

論文の概要: LLM Sparsity Prior for Robust Feature Selection

arxiv url: http://arxiv.org/abs/2605.23102v1
Date: Thu, 21 May 2026 23:34:04 GMT
ステータス: 翻訳完了
システム内更新日: 2026-05-25 17:29:20.134995
Title: LLM Sparsity Prior for Robust Feature Selection
Title（参考訳）: ロバストな特徴選択に先立つLLMスパシティ
Authors: Caleb Skinner, Yihan Guo, Meng Li,
Abstract要約: LLM生成重量の定量化のためのフレームワークを提案する。次に,LLM生成重みを先行包含確率に統合するLLMスペーシャリティ・プライオリティ(LSP)を提案する。 LSPは予測精度を改善し、ベースラインに欠落した臨床的に関連性のある特徴を特定する。
参考スコア（独自算出の注目度）: 3.6910249175649983
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Large language models (LLMs) offer a scalable mechanism to elicit domain-informed prior information for high-dimensional variable selection. However, existing methods such as LLM-Lasso are sensitive to weight quality, with performance degrading substantially when LLM-generated weights are inaccurate. To address this challenge, we first introduce a framework for quantifying the quality of LLM-generated weights, enabling rigorous evaluation of LLM-informed methods across varying weight regimes. We then propose the LLM Sparsity Prior (LSP), which integrates LLM-generated weights into the prior inclusion probabilities of Spike-and-Slab and Spike-and-Slab Lasso models via two interpretable hyperparameters governing global sparsity and weight concentration. Hierarchical hyperpriors on these parameters allow the model to dynamically discount uninformative or misleading weights, improving robustness without sacrificing gains when weights are accurate. Finally, we develop principled prompt engineering strategies and validate the method on a private medical dataset studying Acute Kidney Injury. LSP improves prediction accuracy and identifies clinically relevant features missed by the baselines, with robustness to prompt variation and particular effectiveness in low-data regimes.
Abstract（参考訳）: 大規模言語モデル(LLM)は、高次元変数選択のためにドメインインフォームドされた事前情報を抽出するためのスケーラブルなメカニズムを提供する。しかし、LLM-Lassoのような既存の手法は重量に敏感であり、LCM生成重量が不正確な場合に性能が著しく低下する。この課題に対処するために、まず、LLM生成量の品質を定量化するためのフレームワークを導入し、LLMインフォームド手法の厳密な評価を可能にした。次に,LLMの重みをSpyke-and-SlabモデルとSpike-and-Slab Lassoモデルの先行包含確率に統合するLLMスペーシリティ優先法(LSP)を提案する。これらのパラメータの階層的ハイパープライアーにより、モデルは非形式的あるいは誤解を招く重みを動的に割引することができ、重みが正確であるときに利得を犠牲にすることなく堅牢性を向上させることができる。最後に, 急性腎臓損傷を研究対象とする個人医療データセット上で, プロンプト・エンジニアリング・ストラテジーを原則として開発し, 本手法の有効性を検証した。 LSPは予測精度を改善し、ベースラインに欠落した臨床的に関連性のある特徴を識別し、低データ体制における変動と特定の効果の促進に頑健である。

論文の概要: LLM Sparsity Prior for Robust Feature Selection

関連論文リスト