Fugu-MT 論文翻訳(概要): When AI Meets Early Childhood Education: Large Language Models as Assessment Teammates in Chinese Preschools

論文の概要: When AI Meets Early Childhood Education: Large Language Models as Assessment Teammates in Chinese Preschools

arxiv url: http://arxiv.org/abs/2603.24389v1
Date: Wed, 25 Mar 2026 15:05:34 GMT
ステータス: 翻訳完了
システム内更新日: 2026-03-26 21:06:11.350758
Title: When AI Meets Early Childhood Education: Large Language Models as Assessment Teammates in Chinese Preschools
Title（参考訳）: AIが幼児教育と出会う時--中国の幼児期におけるチームメイトの評価としての大規模言語モデル
Authors: Xingming Li, Runke Huang, Yanan Bao, Yuye Jin, Yuru Jiao, Qingyong Hu,
Abstract要約: 高品質な教師子交流(TCI)は、幼児期の発達に欠かせないものであるが、従来の専門家による評価は、重要なスケーラビリティの課題に直面している。中国のような大規模システムでは、25万人以上の幼稚園で3600万人の子どもが利用されており、手作業による観察のコストと時間要件は、継続的な品質監視を不可能にしている。本稿では,AIが構造化された品質指標を抽出し,人間の専門家による判断との整合性を検証することによって,スケーラブルな評価チームメイトとして機能するかどうかを検討する。
参考スコア（独自算出の注目度）: 13.924636663725776
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
Abstract: High-quality teacher-child interaction (TCI) is fundamental to early childhood development, yet traditional expert-based assessment faces a critical scalability challenge. In large systems like China's-serving 36 million children across 250,000+ kindergartens-the cost and time requirements of manual observation make continuous quality monitoring infeasible, relegating assessment to infrequent episodic audits that limit timely intervention and improvement tracking. In this paper, we investigate whether AI can serve as a scalable assessment teammate by extracting structured quality indicators and validating their alignment with human expert judgments. Our contributions include: (1) TEPE-TCI-370h (Tracing Effective Preschool Education), the first large-scale dataset of naturalistic teacher-child interactions in Chinese preschools (370 hours, 105 classrooms) with standardized ECQRS-EC and SSTEW annotations; (2) We develop Interaction2Eval, a specialized LLM-based framework addressing domain-specific challenges-child speech recognition, Mandarin homophone disambiguation, and rubric-based reasoning-achieving up to 88% agreement; (3) Deployment validation across 43 classrooms demonstrating an 18x efficiency gain in the assessment workflow, highlighting its potential for shifting from annual expert audits to monthly AI-assisted monitoring with targeted human oversight. This work not only demonstrates the technical feasibility of scalable, AI-augmented quality assessment but also lays the foundation for a new paradigm in early childhood education-one where continuous, inclusive, AI-assisted evaluation becomes the engine of systemic improvement and equitable growth.
Abstract（参考訳）: 高品質な教師子交流(TCI)は、幼児期の発達に欠かせないものであるが、従来の専門家による評価は、重要なスケーラビリティの課題に直面している。中国では、25万人以上の幼稚園で3600万人の子どもたちが、手作業による観察のコストと時間要件によって、継続的な品質監視が不可能になり、時間的介入と改善の追跡を制限した頻繁な監査に対する評価が低下している。本稿では,AIが構造化された品質指標を抽出し,人間の専門家による判断との整合性を検証することによって,スケーラブルな評価チームメイトとして機能するかどうかを検討する。 1) TEPE-TCI-370h (Tracing Effective Preschool Education) は,中国初等教育における自然主義的な教師と児童の交流に関する最初の大規模データセット(370時間105教室)で,標準化されたECQRS-ECとSSTEWアノテーションを併用する。この研究は、スケーラブルでAIが強化された品質評価の技術的実現可能性を示すだけでなく、継続的かつ包括的でAIが支援する評価が、体系的な改善と平等な成長のエンジンとなる、幼児教育における新しいパラダイムの基盤となる。

論文の概要: When AI Meets Early Childhood Education: Large Language Models as Assessment Teammates in Chinese Preschools

関連論文リスト