Know Thy Student: Interactive Learning with Gaussian Processes
- URL: http://arxiv.org/abs/2204.12072v1
- Date: Tue, 26 Apr 2022 04:43:57 GMT
- Title: Know Thy Student: Interactive Learning with Gaussian Processes
- Authors: Rose E. Wang, Mike Wu, Noah Goodman
- Abstract summary: Our work proposes a simple diagnosis algorithm which uses Gaussian processes for inferring student-related information, before constructing a teaching dataset.
We study this in the offline reinforcement learning setting where the teacher must provide demonstrations to the student and avoid sending redundant trajectories.
Our experiments highlight the importance of diagosing before teaching and demonstrate how students can learn more efficiently with the help of an interactive teacher.
- Score: 11.641731210416102
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Learning often involves interaction between multiple agents. Human
teacher-student settings best illustrate how interactions result in efficient
knowledge passing where the teacher constructs a curriculum based on their
students' abilities. Prior work in machine teaching studies how the teacher
should construct optimal teaching datasets assuming the teacher knows
everything about the student. However, in the real world, the teacher doesn't
have complete information about the student. The teacher must interact and
diagnose the student, before teaching. Our work proposes a simple diagnosis
algorithm which uses Gaussian processes for inferring student-related
information, before constructing a teaching dataset. We apply this to two
settings. One is where the student learns from scratch and the teacher must
figure out the student's learning algorithm parameters, eg. the regularization
parameters in ridge regression or support vector machines. Two is where the
student has partially explored the environment and the teacher must figure out
the important areas the student has not explored; we study this in the offline
reinforcement learning setting where the teacher must provide demonstrations to
the student and avoid sending redundant trajectories. Our experiments highlight
the importance of diagosing before teaching and demonstrate how students can
learn more efficiently with the help of an interactive teacher. We conclude by
outlining where diagnosing combined with teaching would be more desirable than
passive learning.
Related papers
- Representational Alignment Supports Effective Machine Teaching [81.19197059407121]
We integrate insights from machine teaching and pragmatic communication with the literature on representational alignment.
We design a supervised learning environment that disentangles representational alignment from teacher accuracy.
arXiv Detail & Related papers (2024-06-06T17:48:24Z) - Learn to Teach: Improve Sample Efficiency in Teacher-student Learning
for Sim-to-Real Transfer [5.731477362725785]
We propose a sample efficient learning framework termed Learn to Teach (L2T) that recycles experience collected by the teacher agent.
We show that a single-loop algorithm can train both the teacher and student agents under both Reinforcement Learning and Inverse Reinforcement Learning contexts.
arXiv Detail & Related papers (2024-02-09T21:16:43Z) - YODA: Teacher-Student Progressive Learning for Language Models [82.0172215948963]
This paper introduces YODA, a teacher-student progressive learning framework.
It emulates the teacher-student education process to improve the efficacy of model fine-tuning.
Experiments show that training LLaMA2 with data from YODA improves SFT with significant performance gain.
arXiv Detail & Related papers (2024-01-28T14:32:15Z) - Can Language Models Teach Weaker Agents? Teacher Explanations Improve
Students via Personalization [84.86241161706911]
We show that teacher LLMs can indeed intervene on student reasoning to improve their performance.
We also demonstrate that in multi-turn interactions, teacher explanations generalize and learn from explained data.
We verify that misaligned teachers can lower student performance to random chance by intentionally misleading them.
arXiv Detail & Related papers (2023-06-15T17:27:20Z) - Iterative Teacher-Aware Learning [136.05341445369265]
In human pedagogy, teachers and students can interact adaptively to maximize communication efficiency.
We propose a gradient optimization based teacher-aware learner who can incorporate teacher's cooperative intention into the likelihood function.
arXiv Detail & Related papers (2021-10-01T00:27:47Z) - Interactive Knowledge Distillation [79.12866404907506]
We propose an InterActive Knowledge Distillation scheme to leverage the interactive teaching strategy for efficient knowledge distillation.
In the distillation process, the interaction between teacher and student networks is implemented by a swapping-in operation.
Experiments with typical settings of teacher-student networks demonstrate that the student networks trained by our IAKD achieve better performance than those trained by conventional knowledge distillation methods.
arXiv Detail & Related papers (2020-07-03T03:22:04Z) - Neural Multi-Task Learning for Teacher Question Detection in Online
Classrooms [50.19997675066203]
We build an end-to-end neural framework that automatically detects questions from teachers' audio recordings.
By incorporating multi-task learning techniques, we are able to strengthen the understanding of semantic relations among different types of questions.
arXiv Detail & Related papers (2020-05-16T02:17:04Z) - Understanding the Power and Limitations of Teaching with Imperfect
Knowledge [30.588367257209388]
We study the interaction between a teacher and a student/learner where the teacher selects training examples for the learner to learn a specific task.
Inspired by real-world applications of machine teaching in education, we consider the setting where teacher's knowledge is limited and noisy.
We show connections to how imperfect knowledge affects the teacher's solution of the corresponding machine teaching problem when constructing optimal teaching sets.
arXiv Detail & Related papers (2020-03-21T17:53:26Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.