Fugu-MT 論文翻訳(概要): LAUD: Integrating Large Language Models with Active Learning for Unlabeled Data

論文の概要: LAUD: Integrating Large Language Models with Active Learning for Unlabeled Data

arxiv url: http://arxiv.org/abs/2511.14738v1
Date: Tue, 18 Nov 2025 18:31:00 GMT
ステータス: 翻訳完了
システム内更新日: 2025-11-19 16:23:53.262532
Title: LAUD: Integrating Large Language Models with Active Learning for Unlabeled Data
Title（参考訳）: LAUD: ラベルなしデータのためのアクティブラーニングによる大規模言語モデルの統合
Authors: Tzu-Hsuan Chou, Chun-Nan Chou,
Abstract要約: 実世界のシナリオでは、ラベル付きデータがないため、実践者が良好なパフォーマンスのモデルを得ることができないことが多い。ラベルなしデータセット(LAUD)のための大規模言語モデルとアクティブラーニングを統合した学習フレームワークを提案する。実験により、LAUDから派生したLLMは、商品名分類タスクにおいてゼロショットまたは少数ショットの学習でLLMよりも優れていた。
参考スコア（独自算出の注目度）: 2.0052993723676895
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Large language models (LLMs) have shown a remarkable ability to generalize beyond their pre-training data, and fine-tuning LLMs can elevate performance to human-level and beyond. However, in real-world scenarios, lacking labeled data often prevents practitioners from obtaining well-performing models, thereby forcing practitioners to highly rely on prompt-based approaches that are often tedious, inefficient, and driven by trial and error. To alleviate this issue of lacking labeled data, we present a learning framework integrating LLMs with active learning for unlabeled dataset (LAUD). LAUD mitigates the cold-start problem by constructing an initial label set with zero-shot learning. Experimental results show that LLMs derived from LAUD outperform LLMs with zero-shot or few-shot learning on commodity name classification tasks, demonstrating the effectiveness of LAUD.
Abstract（参考訳）: 大規模言語モデル(LLM)は、事前学習データを超えて一般化する驚くべき能力を示しており、微調整 LLM は、人間レベル以上のパフォーマンスを向上することができる。しかし、実世界のシナリオでは、ラベル付きデータが欠如しているため、実践者が良いパフォーマンスのモデルを得ることができないため、実践者は、しばしば退屈で非効率で、試行錯誤によって駆動されるプロンプトベースのアプローチに強く依存せざるを得ない。ラベル付きデータの欠如を緩和するため,ラベルなしデータセット(LAUD)のアクティブラーニングにLLMを統合する学習フレームワークを提案する。 LAUDはゼロショット学習を用いた初期ラベルセットを構築することにより、コールドスタート問題を緩和する。実験の結果,LAUD から派生した LLM は,商品名分類タスクにおいてゼロショットや少数ショットの学習で LLM よりも優れており,LAUD の有効性が示された。

論文の概要: LAUD: Integrating Large Language Models with Active Learning for Unlabeled Data

関連論文リスト