Fugu-MT 論文翻訳(概要): Poodle: Seamlessly Scaling Down Large Language Models with Just-in-Time Model Replacement

論文の概要: Poodle: Seamlessly Scaling Down Large Language Models with Just-in-Time Model Replacement

arxiv url: http://arxiv.org/abs/2512.05525v1
Date: Fri, 05 Dec 2025 08:36:39 GMT
ステータス: 翻訳完了
システム内更新日: 2025-12-13 22:40:56.959868
Title: Poodle: Seamlessly Scaling Down Large Language Models with Just-in-Time Model Replacement
Title（参考訳）: Poodle: ジャスト・イン・タイムのモデル置き換えによる大規模言語モデルのスケールアップ
Authors: Nils Strassenburg, Boris Glavic, Tilmann Rabl,
Abstract要約: ビジネスは、単純な反復的なタスクを自動化するために、大規模言語モデル(LLM)にますます依存している。 LLMはトレーニング例をほとんど必要とせず、モデル開発に関する専門知識のないユーザによって利用することができる。我々は、JITR(Just-in-time Model replacement)のビジョンを示し、LLM呼び出しの繰り返しタスクを特定すると、そのモデルが透過的により安価な代替品に置き換えられる。
参考スコア（独自算出の注目度）: 6.193292107042754
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Businesses increasingly rely on large language models (LLMs) to automate simple repetitive tasks instead of developing custom machine learning models. LLMs require few, if any, training examples and can be utilized by users without expertise in model development. However, this comes at the cost of substantially higher resource and energy consumption compared to smaller models, which often achieve similar predictive performance for simple tasks. In this paper, we present our vision for just-in-time model replacement (JITR), where, upon identifying a recurring task in calls to an LLM, the model is replaced transparently with a cheaper alternative that performs well for this specific task. JITR retains the ease of use and low development effort of LLMs, while saving significant cost and energy. We discuss the main challenges in realizing our vision regarding the identification of recurring tasks and the creation of a custom model. Specifically, we argue that model search and transfer learning will play a crucial role in JITR to efficiently identify and fine-tune models for a recurring task. Using our JITR prototype Poodle, we achieve significant savings for exemplary tasks.
Abstract（参考訳）: 企業は、カスタム機械学習モデルを開発する代わりに、単純な反復タスクを自動化するために、大規模言語モデル(LLM)をますます頼りにしている。 LLMはトレーニング例をほとんど必要とせず、モデル開発に関する専門知識のないユーザによって利用することができる。しかし、これは、単純なタスクで同様の予測性能が得られる小さなモデルに比べて、資源とエネルギー消費が著しく高いコストが伴う。本稿では, JITR(Just-in-time Model replacement)のビジョンを述べるとともに, LLM呼び出しの繰り返しタスクを識別する際, より安価な代替案を透過的に置き換える。 JITRはLLMの使いやすさと開発労力の低さを保ちながら、大幅なコストとエネルギーを節約している。本稿では、繰り返しタスクの識別とカスタムモデルの作成に関するビジョンを実現する上での課題について論じる。具体的には, モデル探索と伝達学習がJITRにおいて重要な役割を担い, 繰り返しタスクに対するモデル同定と微調整を効率的に行うことを論じる。 JITRのプロトタイプであるPoodleを使って、模範的なタスクの大幅な削減を実現しています。

論文の概要: Poodle: Seamlessly Scaling Down Large Language Models with Just-in-Time Model Replacement

関連論文リスト