Using Interpretation Methods for Model Enhancement
- URL: http://arxiv.org/abs/2404.02068v1
- Date: Tue, 2 Apr 2024 16:10:29 GMT
- Title: Using Interpretation Methods for Model Enhancement
- Authors: Zhuo Chen, Chengyue Jiang, Kewei Tu,
- Abstract summary: We propose a framework of utilizing interpretation methods and gold rationales to enhance models.
Our framework is very general in the sense that it can incorporate various interpretation methods.
Experimental results show that our framework is effective especially in low-resource settings.
- Score: 44.29399911722625
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In the age of neural natural language processing, there are plenty of works trying to derive interpretations of neural models. Intuitively, when gold rationales exist during training, one can additionally train the model to match its interpretation with the rationales. However, this intuitive idea has not been fully explored. In this paper, we propose a framework of utilizing interpretation methods and gold rationales to enhance models. Our framework is very general in the sense that it can incorporate various interpretation methods. Previously proposed gradient-based methods can be shown as an instance of our framework. We also propose two novel instances utilizing two other types of interpretation methods, erasure/replace-based and extractor-based methods, for model enhancement. We conduct comprehensive experiments on a variety of tasks. Experimental results show that our framework is effective especially in low-resource settings in enhancing models with various interpretation methods, and our two newly-proposed methods outperform gradient-based methods in most settings. Code is available at https://github.com/Chord-Chen-30/UIMER.
Related papers
Err
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.