The TalkMoves Dataset: K-12 Mathematics Lesson Transcripts Annotated for
Teacher and Student Discursive Moves
- URL: http://arxiv.org/abs/2204.09652v1
- Date: Wed, 6 Apr 2022 18:12:30 GMT
- Title: The TalkMoves Dataset: K-12 Mathematics Lesson Transcripts Annotated for
Teacher and Student Discursive Moves
- Authors: Abhijit Suresh, Jennifer Jacobs, Charis Harty, Margaret Perkoff, James
H. Martin, Tamara Sumner
- Abstract summary: This paper describes the TalkMoves dataset, composed of 567 human-annotated K-12 mathematics lesson transcripts.
The dataset can be used by educators, policymakers, and researchers to understand the nature of teacher and student discourse in K-12 math classrooms.
- Score: 8.090330715662962
- License: http://creativecommons.org/licenses/by-nc-sa/4.0/
- Abstract: Transcripts of teaching episodes can be effective tools to understand
discourse patterns in classroom instruction. According to most educational
experts, sustained classroom discourse is a critical component of equitable,
engaging, and rich learning environments for students. This paper describes the
TalkMoves dataset, composed of 567 human-annotated K-12 mathematics lesson
transcripts (including entire lessons or portions of lessons) derived from
video recordings. The set of transcripts primarily includes in-person lessons
with whole-class discussions and/or small group work, as well as some online
lessons. All of the transcripts are human-transcribed, segmented by the speaker
(teacher or student), and annotated at the sentence level for ten discursive
moves based on accountable talk theory. In addition, the transcripts include
utterance-level information in the form of dialogue act labels based on the
Switchboard Dialog Act Corpus. The dataset can be used by educators,
policymakers, and researchers to understand the nature of teacher and student
discourse in K-12 math classrooms. Portions of this dataset have been used to
develop the TalkMoves application, which provides teachers with automated,
immediate, and actionable feedback about their mathematics instruction.
Related papers
- Multimodal Classification of Teaching Activities from University Lecture
Recordings [0.9790236766474201]
We present a multimodal classification algorithm that identifies the type of activity that is being carried out at any time of the lesson.
Some academic activities are more easily identifiable with the audio signal while resorting to the text transcription is needed to identify others.
arXiv Detail & Related papers (2023-12-24T08:33:30Z) - Measuring Five Accountable Talk Moves to Improve Instruction at Scale [1.4549461207028445]
We fine-tune models to identify five instructional talk moves inspired by accountable talk theory.
We correlate the instructors' use of each talk move with indicators of student engagement and satisfaction.
These results corroborate previous research on the effectiveness of accountable talk moves.
arXiv Detail & Related papers (2023-11-02T03:04:50Z) - MathDial: A Dialogue Tutoring Dataset with Rich Pedagogical Properties
Grounded in Math Reasoning Problems [74.73881579517055]
We propose a framework to generate such dialogues by pairing human teachers with a Large Language Model prompted to represent common student errors.
We describe how we use this framework to collect MathDial, a dataset of 3k one-to-one teacher-student tutoring dialogues.
arXiv Detail & Related papers (2023-05-23T21:44:56Z) - User Adaptive Language Learning Chatbots with a Curriculum [55.63893493019025]
We adapt lexically constrained decoding to a dialog system, which urges the dialog system to include curriculum-aligned words and phrases in its generated utterances.
The evaluation result demonstrates that the dialog system with curriculum infusion improves students' understanding of target words and increases their interest in practicing English.
arXiv Detail & Related papers (2023-04-11T20:41:41Z) - Learning Analytics from Spoken Discussion Dialogs in Flipped Classroom [36.53657088550011]
This study aims to collect and analyze the discussion dialogs in flipped classroom in order to get to know group learning processes and outcomes.
We have recently transformed a course using the flipped classroom strategy, where students watched video-recorded lectures at home prior to group-based problem-solving discussions in class.
Then, machine learning algorithms are applied to the indicators in order to predict the group learning outcome as High, Mid or Low.
arXiv Detail & Related papers (2023-01-29T09:36:41Z) - The NCTE Transcripts: A Dataset of Elementary Math Classroom Transcripts [4.931378519409227]
We introduce the largest dataset of mathematics classroom transcripts available to researchers.
The dataset consists of 1,660 minute long 4th and 5th grade elementary mathematics observations.
The anonymized transcripts represent data from 317 teachers across 4 school districts that serve largely marginalized students.
arXiv Detail & Related papers (2022-11-21T19:00:01Z) - Multimodal Lecture Presentations Dataset: Understanding Multimodality in
Educational Slides [57.86931911522967]
We test the capabilities of machine learning models in multimodal understanding of educational content.
Our dataset contains aligned slides and spoken language, for 180+ hours of video and 9000+ slides, with 10 lecturers from various subjects.
We introduce PolyViLT, a multimodal transformer trained with a multi-instance learning loss that is more effective than current approaches.
arXiv Detail & Related papers (2022-08-17T05:30:18Z) - Teacher Perception of Automatically Extracted Grammar Concepts for L2
Language Learning [91.49622922938681]
We present an automatic framework that automatically discovers and visualizing descriptions of different aspects of grammar.
Specifically, we extract descriptions from a natural text corpus that answer questions about morphosyntax and semantics.
We apply this method for teaching the Indian languages, Kannada and Marathi, which, unlike English, do not have well-developed pedagogical resources.
arXiv Detail & Related papers (2022-06-10T14:52:22Z) - Using Transformers to Provide Teachers with Personalized Feedback on
their Classroom Discourse: The TalkMoves Application [14.851607363136978]
We describe the TalkMoves application's cloud-based infrastructure for managing and processing classroom recordings.
We discuss several technical challenges that need to be addressed when working with real-world speech and language data from noisy K-12 classrooms.
arXiv Detail & Related papers (2021-04-29T20:45:02Z) - Watch and Learn: Mapping Language and Noisy Real-world Videos with
Self-supervision [54.73758942064708]
We teach machines to understand visuals and natural language by learning the mapping between sentences and noisy video snippets without explicit annotations.
For training and evaluation, we contribute a new dataset ApartmenTour' that contains a large number of online videos and subtitles.
arXiv Detail & Related papers (2020-11-19T03:43:56Z) - Neural Multi-Task Learning for Teacher Question Detection in Online
Classrooms [50.19997675066203]
We build an end-to-end neural framework that automatically detects questions from teachers' audio recordings.
By incorporating multi-task learning techniques, we are able to strengthen the understanding of semantic relations among different types of questions.
arXiv Detail & Related papers (2020-05-16T02:17:04Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.