Fugu-MT 論文翻訳(概要): CoPE: A Small Language Model for Steerable and Scalable Content Labeling

論文の概要: CoPE: A Small Language Model for Steerable and Scalable Content Labeling

arxiv url: http://arxiv.org/abs/2512.18027v1
Date: Fri, 19 Dec 2025 19:47:33 GMT
ステータス: 翻訳完了
システム内更新日: 2025-12-23 18:54:32.155114
Title: CoPE: A Small Language Model for Steerable and Scalable Content Labeling
Title（参考訳）: CoPE: 安定的でスケーラブルなコンテンツラベリングのための小さな言語モデル
Authors: Samidh Chakrabarti, David Willner, Kevin Klyman, Tiffany Saade, Emily Capstick, Sabina Nong,
Abstract要約: CoPEはポリシーステアブルな小型言語モデルであり、高速かつ正確なコンテンツラベリングを可能にする。本稿では,モデルが単なるポリシー記憶ではなく,ポリシー解釈を学習することのできる,新たなトレーニングカリキュラムを提案する。我々は、このモデルの90億のパラメータバージョンをオープンにリリースし、単一のコンシューマグレードのGPUで実行できる。
参考スコア（独自算出の注目度）: 1.299304266312528
License: http://creativecommons.org/licenses/by-sa/4.0/
Abstract: This paper details the methodology behind CoPE, a policy-steerable small language model capable of fast and accurate content labeling. We present a novel training curricula called Contradictory Example Training that enables the model to learn policy interpretation rather than mere policy memorization. We also present a novel method for generating content policies, called Binocular Labeling, which enables rapid construction of unambiguous training datasets. When evaluated across seven different harm areas, CoPE exhibits equal or superior accuracy to frontier models at only 1% of their size. We openly release a 9 billion parameter version of the model that can be run on a single consumer-grade GPU. Models like CoPE represent a paradigm shift for classifier systems. By turning an ML task into a policy writing task, CoPE opens up new design possibilities for the governance of online platforms.
Abstract（参考訳）: 本稿では,ポリシーステアブルな小言語モデルであるCoPEの背景となる方法論を詳述する。本稿では,モデルが単なる政策記憶ではなく,政策解釈を学習できるようにするContradictory Example Trainingという新しい学習カリキュラムを提案する。また、不明瞭なトレーニングデータセットの迅速な構築を可能にする、両眼ラベリングと呼ばれるコンテンツポリシーを生成する新しい手法を提案する。 7つの異なる調和領域で評価すると、CoPEはフロンティアモデルと同等または優れた精度を示し、そのサイズはわずか1%である。我々は、このモデルの90億のパラメータバージョンをオープンにリリースし、単一のコンシューマグレードのGPUで実行できる。 CoPEのようなモデルは、分類器システムのパラダイムシフトを表す。 MLタスクをポリシー記述タスクにすることで、CoPEはオンラインプラットフォームのガバナンスのための新しい設計可能性を開く。

論文の概要: CoPE: A Small Language Model for Steerable and Scalable Content Labeling

関連論文リスト