PedaCo-Gen: Scaffolding Pedagogical Agency in Human-AI Collaborative Video Authoring
- URL: http://arxiv.org/abs/2602.19623v1
- Date: Mon, 23 Feb 2026 09:12:13 GMT
- Title: PedaCo-Gen: Scaffolding Pedagogical Agency in Human-AI Collaborative Video Authoring
- Authors: Injun Baek, Yearim Kim, Nojun Kwak,
- Abstract summary: This study introduces PedaCo-Gen, a collaborative video generating system for authoring instructional videos based on Mayer's Cognitive Theory of Multimedia Learning (CTML)<n>Moving away from traditional "one-shot" generation, PedaCo-Gen introduces an Intermediate Representation phase, enabling educators to interactively review and refine video blueprints-comprising scripts and visual descriptions-with an AI reviewer.<n>Our study with 23 education experts demonstrates that PedaCo-Gen significantly enhances video quality across various topics and CTML principles compared to baselines.
- Score: 28.634225905526677
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: While advancements in Text-to-Video (T2V) generative AI offer a promising path toward democratizing content creation, current models are often optimized for visual fidelity rather than instructional efficacy. This study introduces PedaCo-Gen, a pedagogically-informed human-AI collaborative video generating system for authoring instructional videos based on Mayer's Cognitive Theory of Multimedia Learning (CTML). Moving away from traditional "one-shot" generation, PedaCo-Gen introduces an Intermediate Representation (IR) phase, enabling educators to interactively review and refine video blueprints-comprising scripts and visual descriptions-with an AI reviewer. Our study with 23 education experts demonstrates that PedaCo-Gen significantly enhances video quality across various topics and CTML principles compared to baselines. Participants perceived the AI-driven guidance not merely as a set of instructions but as a metacognitive scaffold that augmented their instructional design expertise, reporting high production efficiency (M=4.26) and guide validity (M=4.04). These findings highlight the importance of reclaiming pedagogical agency through principled co-creation, providing a foundation for future AI authoring tools that harmonize generative power with human professional expertise.
Related papers
- Synergizing Understanding and Generation with Interleaved Analyzing-Drafting Thinking [154.2388970262703]
Unified Vision-Language Models (UVLMs) aim to advance multimodal learning by supporting both understanding and generation within a single framework.<n>We introduce the interleaved Analyzing-Drafting problem-solving loop (AD-Loop), a new think paradigm that alternates between analytic and drafting operations.<n>By interleaving textual thoughts with visual thoughts, AD-Loop enables models to iteratively refine both comprehension and outputs, fostering genuine synergy.
arXiv Detail & Related papers (2026-02-24T23:26:09Z) - Skyra: AI-Generated Video Detection via Grounded Artifact Reasoning [66.51617619673587]
We present Skyra, a specialized large language model (MLLM) that identifies human-perceivable visual artifacts in AI-generated videos.<n>To support this objective, we construct ViF-CoT-4K for Supervised Fine-Tuning (SFT), which represents the first large-scale AI-generated video dataset with fine-grained human annotations.<n>We then develop a two-stage training strategy that systematically enhances our model's artifact's-temporal perception, explanation capability, and detection accuracy.
arXiv Detail & Related papers (2025-12-17T18:48:26Z) - Generative Teaching via Code [43.45801837806965]
TeachMaster orchestrates a collaborative team of agents--spanning planning, design, and rendering--to automate the production of interpretable, editable, and curriculum-ready educational videos.<n>Experiments validate that TeachMaster significantly boosts production efficiency without compromising structural coherence or visual fidelity.
arXiv Detail & Related papers (2025-12-07T12:52:24Z) - Transforming Higher Education with AI-Powered Video Lectures [0.2538209532048866]
The integration of artificial intelligence (AI) into video lecture production has the potential to transform higher education.<n>This paper investigates a semi automated workflow that combines Google Gemini for script generation, Amazon Polly for voice synthesis, and Microsoft PowerPoint for video assembly.
arXiv Detail & Related papers (2025-10-30T23:33:10Z) - Beyond Automation: Socratic AI, Epistemic Agency, and the Implications of the Emergence of Orchestrated Multi-Agent Learning Architectures [0.0]
Generative AI is no longer a peripheral tool in higher education.<n>This paper presents findings from a controlled experiment evaluating a Socratic AI Tutor.<n>Students using the Tutor reported significantly greater support for critical, independent, and reflective thinking.
arXiv Detail & Related papers (2025-08-07T07:49:03Z) - CogGen: A Learner-Centered Generative AI Architecture for Intelligent Tutoring with Programming Video [1.6961276655027102]
CogGen is a learner-centered AI architecture that transforms programming videos into interactive, adaptive learning experiences.<n>This work advances AI-powered tutoring by bridging structured student modeling with interactive AI conversations.
arXiv Detail & Related papers (2025-06-25T16:39:05Z) - How Can Video Generative AI Transform K-12 Education? Examining Teachers' Perspectives through TPACK and TAM [0.7785405821914395]
Video generative AI (Video GenAI) has opened new possibilities for K-12 education by enabling the creation of dynamic, customized, and high-quality visual content.<n>This study explores the perspectives of leading K-12 teachers on the educational applications of Video GenAI.
arXiv Detail & Related papers (2025-03-11T03:08:07Z) - Reflection on Data Storytelling Tools in the Generative AI Era from the Human-AI Collaboration Perspective [54.152639172274]
Large-scale generative AI techniques have the potential to enhance data storytelling with their power in visual and narration generation.<n>We compare the collaboration patterns of the latest tools with those of earlier ones using a dedicated framework for understanding human-AI collaboration in data storytelling.<n>The benefits of these AI techniques and implications to human-AI collaboration are also revealed.
arXiv Detail & Related papers (2025-03-04T13:56:18Z) - GRADEO: Towards Human-Like Evaluation for Text-to-Video Generation via Multi-Step Reasoning [62.775721264492994]
GRADEO is one of the first specifically designed video evaluation models.<n>It grades AI-generated videos for explainable scores and assessments through multi-step reasoning.<n> Experiments show that our method aligns better with human evaluations than existing methods.
arXiv Detail & Related papers (2025-03-04T07:04:55Z) - Exploration with Principles for Diverse AI Supervision [88.61687950039662]
Training large transformers using next-token prediction has given rise to groundbreaking advancements in AI.
While this generative AI approach has produced impressive results, it heavily leans on human supervision.
This strong reliance on human oversight poses a significant hurdle to the advancement of AI innovation.
We propose a novel paradigm termed Exploratory AI (EAI) aimed at autonomously generating high-quality training data.
arXiv Detail & Related papers (2023-10-13T07:03:39Z) - Explainable Active Learning (XAL): An Empirical Study of How Local
Explanations Impact Annotator Experience [76.9910678786031]
We propose a novel paradigm of explainable active learning (XAL), by introducing techniques from the recently surging field of explainable AI (XAI) into an Active Learning setting.
Our study shows benefits of AI explanation as interfaces for machine teaching--supporting trust calibration and enabling rich forms of teaching feedback, and potential drawbacks--anchoring effect with the model judgment and cognitive workload.
arXiv Detail & Related papers (2020-01-24T22:52:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.