Fugu-MT Paper Translation (Abstract): Redact or Keep? A Fully Local AI Cascade for Educational Dialogue De-Identification

Paper overview: Redact or Keep? A Fully Local AI Cascade for Educational Dialogue De-Identification

arxiv url: http://arxiv.org/abs/2606.18372v1
Date: Tue, 16 Jun 2026 18:18:58 GMT
Status: translation complete
Last updated in system: 2026-06-18 17:16:50.837775
Title: Redact or Keep? A Fully Local AI Cascade for Educational Dialogue De-Identification
Authors: Haocheng Zhang, Zhuqian Zhou, Kirk Vanacore, Bakhtawar Ahtisham, René F. Kizilcec
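Abstract summary: Existing approaches force a tradeoff between governance and accuracy. We propose a fully local cascade framework that reframes de-identification from open-ended entity recognition to constrained privacy triage.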
Reference score (independently computed attention score): 3.5643353590707867
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Educational dialogue is a valuable but sensitive resource for research: the same transcripts that capture authentic learning often capture personally identifiable information (PII) entangled with curricular content, where "Riemann" may refer to a real student or to a mathematical concept. Existing approaches force a tradeoff between governance and accuracy. Commercial Large Language Models (LLMs) can handle this ambiguity but require sending student data to third parties, while local named entity recognition (NER) systems preserve governance but over-redact curricular terms. We propose a fully local cascade framework that reframes de-identification from open-ended entity recognition to constrained privacy triage. A recall-first union proposer combines two lightweight encoders with deterministic rules to over-generate candidate spans; a context-aware reviewer then makes a binary Redact/Keep decision for each candidate using surrounding dialogue and speaker role. We evaluate three reviewer configurations against same-family LLM-only baselines and a commercial API on math tutoring transcripts from two large platforms. The strongest local configuration reaches 0.958 macro F1, compared with 0.767 for a same-family LLM-only baseline and 0.706 for the commercial API, while running entirely on a single laptop. On a targeted challenge set of curricular-personal name ambiguity, the same configuration degrades by only 0.03 F1 versus 0.19 to 0.25 for smaller reviewers. These results suggest that for educational de-identification, problem formulation matters more than model scale.
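
The two-stage cascade described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the three proposers are simple regex stand-ins for the two lightweight encoders and the deterministic rules, `toy_reviewer` is a hand-written stand-in for the context-aware reviewer model, and every name here (`Turn`, `union_propose`, `deidentify`, the cue lists, the `[REDACTED]` placeholder) is hypothetical.

```python
import re
import string
from dataclasses import dataclass
from typing import Callable, List, Sequence, Set, Tuple

Span = Tuple[int, int]  # (start, end) character offsets within one utterance


@dataclass
class Turn:
    speaker_role: str  # carried for a real reviewer; the toy rules below ignore it
    text: str


# Stage 1: recall-first union proposer. All three proposers are stand-ins.
EMAIL = re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+")
CAPITALIZED = re.compile(r"\b[A-Z][a-z]+\b")
AFTER_GREETING = re.compile(r"\b(?:hi|hello|thanks)\s+(\w+)", re.IGNORECASE)


def rule_proposer(text: str) -> Set[Span]:
    """Deterministic rule: e-mail-like strings."""
    return {m.span() for m in EMAIL.finditer(text)}


def capitalized_proposer(text: str) -> Set[Span]:
    """Stand-in for encoder 1: every capitalized word (deliberately over-generates)."""
    return {m.span() for m in CAPITALIZED.finditer(text)}


def greeting_proposer(text: str) -> Set[Span]:
    """Stand-in for encoder 2: the word right after a greeting, any casing."""
    return {m.span(1) for m in AFTER_GREETING.finditer(text)}


PROPOSERS: Sequence[Callable[[str], Set[Span]]] = (
    rule_proposer, capitalized_proposer, greeting_proposer)


def union_propose(text: str, proposers: Sequence[Callable[[str], Set[Span]]]) -> List[Span]:
    """Take the union of all candidate spans and merge any that overlap."""
    spans = sorted(set().union(*(p(text) for p in proposers)))
    merged: List[Span] = []
    for start, end in spans:
        if merged and start < merged[-1][1]:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# Stage 2: binary Redact/Keep review of each candidate, using nearby words.
CURRICULAR_CUES = {"sum", "hypothesis", "integral", "theorem"}
GREETING_CUES = {"hi", "hello", "thanks"}


def toy_reviewer(turn: Turn, span: Span) -> str:
    """Return "Redact" or "Keep" for one candidate span (toy heuristic)."""
    start, end = span
    if "@" in turn.text[start:end]:
        return "Redact"
    before = turn.text[:start].split()
    after = turn.text[end:].split()
    prev_word = before[-1].strip(string.punctuation).lower() if before else ""
    next_word = after[0].strip(string.punctuation).lower() if after else ""
    if next_word in CURRICULAR_CUES:  # e.g. "Riemann sum" reads as a concept
        return "Keep"
    if prev_word in GREETING_CUES:    # e.g. "thanks Riemann" reads as a person
        return "Redact"
    return "Keep"


def deidentify(turn: Turn, reviewer: Callable[[Turn, Span], str] = toy_reviewer) -> str:
    """Run proposer then reviewer; replace Redact spans from right to left."""
    text = turn.text
    for start, end in reversed(union_propose(turn.text, PROPOSERS)):
        if reviewer(turn, (start, end)) == "Redact":
            text = text[:start] + "[REDACTED]" + text[end:]
    return text


if __name__ == "__main__":
    print(deidentify(Turn("student", "thanks Riemann, my email is r.lee@example.org")))
    print(deidentify(Turn("tutor", "The Riemann sum approximates the area")))
```

The point of the split is that the proposer may over-generate freely (it flags "The" and both uses of "Riemann"), because the reviewer only ever answers a constrained yes/no question per candidate. In a real system the reviewer would presumably be a local model that also receives the surrounding dialogue and the speaker role, as the abstract describes.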

Related papers

Towards Human-Like Interactive Speech Recognition With Agentic Correction and Semantic Evaluation [53.844308305341166]
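We propose Agentic ASR, a closed-loop framework that combines a single-pass ASR front end with semantic correction, intent routing, and reasoning-based editing. Experiments on multilingual, named-entity-intensive, and code-switching benchmarks show that iterative interaction consistently reduces semantic errors.
Paper reference translation (metadata) (2026-05-28T06:23:31Z)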
BD at BEA 2025 Shared Task: MPNet Ensembles for Pedagogical Mistake Identification and Localization in AI Tutor Responses [0.7475784495279183]
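This paper presents our work on the BEA 2025 shared task on evaluating the pedagogical ability of AI-powered tutors. Our system is built on MPNet, a Transformer-based language model that combines the pre-training advantages of BERT and XLNet. On the official test sets, the approach achieves exact-match macro F1 scores of approximately 0.7110 and 0.5543 across the two tracks, mistake identification and mistake localization.
Paper reference translation (metadata) (2025-06-02T15:57:49Z)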
Text-Video Retrieval with Global-Local Semantic Consistent Learning [122.15339128463715]
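We propose a simple yet effective method, Global-Local Semantic Consistent Learning (GLSCL). GLSCL leverages latent semantics shared across modalities for text-video retrieval. The method achieves performance comparable to the state of the art while being about 220 times faster in terms of computational cost.
Paper reference translation (metadata) (2024-05-21T11:59:36Z)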
Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment [53.2701026843921]
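Large-scale pre-trained vision-language models (VLMs) have proven effective for zero-shot classification. This paper proposes a more challenging setting, Realistic Zero-Shot Classification, which assumes a broader vocabulary rather than annotations. We propose the Self Structural Semantic Alignment (S3A) framework, which extracts structural semantic information from unlabeled data while performing self-learning at the same time.
Paper reference translation (metadata) (2023-08-24T17:56:46Z)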
Navigating Prompt Complexity for Zero-Shot Classification: A Study of Large Language Models in Computational Social Science [27.727207443432278]
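This paper evaluates the zero-shot performance of two publicly accessible language models, ChatGPT and OpenAssistant. The results show that different prompting strategies significantly affect classification accuracy, with differences of more than 10% in F1 score.
Paper reference translation (metadata) (2023-05-23T17:48:21Z)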
Seeking the Shape of Sound: An Adaptive Framework for Learning Voice-Face Association [94.7030305679589]
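We propose a novel framework to jointly address the above issues. We introduce a global loss into the modality alignment process. The proposed method outperforms previous methods in multiple settings.
Paper reference translation (metadata) (2021-03-12T14:10:48Z)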
A Probabilistic End-To-End Task-Oriented Dialog Model with Latent Belief States towards Semi-Supervised Learning [22.757971831442426]
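Training belief trackers often requires expensive turn-level annotations for every user utterance. This paper proposes a probabilistic dialog model, the LAtent BElief State (LABES) model. LABES-S2S is a copy-augmented Seq2Seq model instantiation of LABES.
Paper reference translation (metadata) (2020-09-17T07:26:37Z)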

The related papers list is generated automatically from the titles and abstracts of papers on this site.

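This is the information for the specified paper.
The operator of this site does not guarantee the quality of this site (including all information and translations) and accepts no responsibility for any results arising from the use of this site (including all information and translations).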