LMPriors: Pre-Trained Language Models as Task-Specific Priors
- URL: http://arxiv.org/abs/2210.12530v1
- Date: Sat, 22 Oct 2022 19:09:18 GMT
- Title: LMPriors: Pre-Trained Language Models as Task-Specific Priors
- Authors: Kristy Choi, Chris Cundy, Sanjari Srivastava, Stefano Ermon
- Abstract summary: We develop principled techniques for augmenting our models with suitable priors that encourage them to learn in ways compatible with our understanding of the world.
Drawing inspiration from the recent successes of large-scale language models (LMs), we construct task-specific priors distilled from the rich knowledge of LMs.
- Score: 78.97143833642971
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Particularly in low-data regimes, an outstanding challenge in machine learning is developing principled techniques for augmenting our models with suitable priors that encourage them to learn in ways compatible with our understanding of the world. In contrast to generic priors such as shrinkage or sparsity, we draw inspiration from the recent successes of large-scale language models (LMs) to construct task-specific priors distilled from the rich knowledge of LMs. Our method, Language Model Priors (LMPriors), incorporates auxiliary natural language metadata about the task, such as variable names and descriptions, to encourage downstream model outputs to be consistent with the LM's common-sense reasoning based on the metadata. Empirically, we demonstrate that LMPriors improve model performance in settings where such natural language descriptions are available, and perform well on several tasks that benefit from such prior knowledge, including feature selection, causal inference, and safe reinforcement learning.
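To make the mechanism concrete, here is a minimal sketch of an LMPriors-style prior for the feature-selection setting. It is not the paper's code: it assumes the Hugging Face transformers package, substitutes GPT-2 for the much larger LM used in the paper, and uses a hypothetical prompt template. Each candidate feature is scored by how strongly the LM prefers "Y" over "N" when asked, given the feature's name and description, whether it is useful for predicting the target; that score can then serve as a prior for selecting features before fitting a downstream model on a small dataset.

```python
# A minimal sketch of an LMPriors-style prior for feature selection (not the
# paper's code): query a causal LM about each feature's usefulness for the
# target and score it by log p("Y") - log p("N") for the next token.
# Assumes the Hugging Face `transformers` package; GPT-2 stands in for the
# larger LM used in the paper, and the prompt template is hypothetical.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")
model.eval()

def yes_minus_no(prompt: str) -> float:
    """Log-odds of the LM answering ' Y' versus ' N' as the next token."""
    inputs = tokenizer(prompt, return_tensors="pt")
    with torch.no_grad():
        next_token_logits = model(**inputs).logits[0, -1]
    log_probs = torch.log_softmax(next_token_logits, dim=-1)
    yes_id = tokenizer.encode(" Y")[0]
    no_id = tokenizer.encode(" N")[0]
    return (log_probs[yes_id] - log_probs[no_id]).item()

def feature_prior(name: str, description: str, target: str) -> float:
    # Hypothetical prompt built from the task's natural language metadata.
    prompt = (
        f"We want to predict {target}.\n"
        f"Candidate feature: {name} ({description}).\n"
        f"Is this feature useful for the prediction? Answer Y or N:"
    )
    return yes_minus_no(prompt)

# Rank candidate features by the LM prior before fitting a downstream model
# on the (possibly small) labeled dataset. Feature names are invented here.
features = {
    "age": "age of the patient in years",
    "record_id": "arbitrary database identifier",
}
scores = {n: feature_prior(n, d, "heart disease risk") for n, d in features.items()}
print(sorted(scores.items(), key=lambda kv: -kv[1]))
```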
Related papers
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning [70.21358720599821]
Large language models (LLMs) hold the promise of solving diverse tasks when provided with appropriate natural language prompts.
We propose SELF-GUIDE, a multi-stage mechanism in which we synthesize task-specific input-output pairs from the student LLM.
We report an absolute improvement of approximately 15% for classification tasks and 18% for generation tasks in the benchmark's metrics.
arXiv Detail & Related papers (2024-07-16T04:41:58Z)
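As a rough illustration of the self-synthetic recipe summarized above (a hedged sketch, not SELF-GUIDE's actual multi-stage pipeline, prompts, or filters), the snippet below has a small causal LM draft new task inputs from a few seed examples, answer them itself, apply a crude quality filter, and write the surviving input-output pairs to a JSONL file that an ordinary finetuning script could consume. GPT-2 stands in for the student LLM, and the sentiment task and seed sentences are invented.

```python
# A hedged sketch of self-synthetic finetuning data (not SELF-GUIDE's actual
# pipeline): the student LM drafts new task inputs from seed examples, answers
# them itself, and the filtered pairs are written out for later finetuning.
# Assumes the Hugging Face `transformers` package; GPT-2 is a stand-in student.
import json
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")
model.eval()

def complete(prompt: str, max_new_tokens: int = 40) -> str:
    """Sample a continuation of the prompt and return only the new text."""
    inputs = tokenizer(prompt, return_tensors="pt")
    with torch.no_grad():
        out = model.generate(
            **inputs,
            max_new_tokens=max_new_tokens,
            do_sample=True,
            top_p=0.9,
            pad_token_id=tokenizer.eos_token_id,
        )
    return tokenizer.decode(out[0, inputs["input_ids"].shape[1]:],
                            skip_special_tokens=True).strip()

instruction = "Classify the sentiment of the sentence as positive or negative."
seeds = ["The movie was a delight.", "The service was painfully slow."]

pairs = []
for _ in range(5):
    # Stage 1: synthesize a new task input in the style of the seed examples.
    draft = complete(instruction + "\nExamples:\n- " + "\n- ".join(seeds) + "\n- ")
    new_input = draft.splitlines()[0].strip() if draft else ""
    # Stage 2: have the same (student) model produce an output for it.
    answer = complete(f"{instruction}\nSentence: {new_input}\nAnswer:", max_new_tokens=5)
    # Stage 3: a crude quality filter; the real pipeline uses stronger ones.
    if 0 < len(new_input) < 200 and answer:
        pairs.append({"input": new_input, "output": answer})

with open("self_synthetic.jsonl", "w") as f:
    for p in pairs:
        f.write(json.dumps(p) + "\n")
print(f"Wrote {len(pairs)} synthetic pairs for finetuning the student LM.")
```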
- Aligning Large Language Models for Controllable Recommendations [31.255594408462322]
We introduce a collection of supervised learning tasks, augmented with labels derived from a conventional recommender model.
We then develop a reinforcement learning-based alignment procedure to strengthen LLMs' aptitude in responding to users' intentions.
Our method markedly advances the capability of LLMs to comply with instructions within recommender systems, while sustaining a high level of accuracy.
arXiv Detail & Related papers (2024-03-08T05:23:27Z)
- Supervised Knowledge Makes Large Language Models Better In-context Learners [94.89301696512776]
Large Language Models (LLMs) exhibit emerging in-context learning abilities through prompt engineering.
The challenge of improving the generalizability and factuality of LLMs in natural language understanding and question answering remains under-explored.
We propose a framework that enhances the reliability of LLMs as it: 1) generalizes to out-of-distribution data, 2) elucidates how LLMs benefit from discriminative models, and 3) minimizes hallucinations in generative tasks.
arXiv Detail & Related papers (2023-12-26T07:24:46Z)
- Unsupervised Improvement of Factual Knowledge in Language Models [4.5788796239850225]
Masked language modeling plays a key role in pretraining large language models.
We propose an approach for influencing pretraining in a way that can improve language model performance on a variety of knowledge-intensive tasks.
arXiv Detail & Related papers (2023-04-04T07:37:06Z)
- Augmented Language Models: a Survey [55.965967655575454]
This survey reviews works in which language models (LMs) are augmented with reasoning skills and the ability to use tools.
We refer to them as Augmented Language Models (ALMs).
The missing token objective allows ALMs to learn to reason, use tools, and even act, while still performing standard natural language tasks.
arXiv Detail & Related papers (2023-02-15T18:25:52Z)
- Pre-Trained Language Models for Interactive Decision-Making [72.77825666035203]
We describe a framework for imitation learning in which goals and observations are represented as a sequence of embeddings.
We demonstrate that this framework enables effective generalization across different environments.
For test tasks involving novel goals or novel scenes, initializing policies with language models improves task completion rates by 43.6%.
arXiv Detail & Related papers (2022-02-03T18:55:52Z)
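A hedged sketch of the general recipe described above, not the paper's architecture: the paper represents goals and observations as sequences of embeddings, whereas here, for simplicity, they are rendered as text, encoded by a pretrained GPT-2 backbone (the pretrained weights are what the policy is initialized from), and mapped to action logits by a small linear head that would be trained by imitation on demonstrations. The goal and observation strings and the four-action space are invented for illustration.

```python
# A hedged sketch of initializing a policy with a pretrained LM (not the
# paper's architecture): the goal and observation are rendered as text,
# encoded by GPT-2, and a small linear head maps the final hidden state to
# action logits. The goal/observation strings and 4-action space are invented.
import torch
import torch.nn as nn
from transformers import AutoModel, AutoTokenizer

class LMPolicy(nn.Module):
    def __init__(self, num_actions: int, backbone: str = "gpt2"):
        super().__init__()
        self.tokenizer = AutoTokenizer.from_pretrained(backbone)
        self.lm = AutoModel.from_pretrained(backbone)  # pretrained weights act as the prior
        self.head = nn.Linear(self.lm.config.hidden_size, num_actions)

    def forward(self, goal: str, observation: str) -> torch.Tensor:
        text = f"Goal: {goal}\nObservation: {observation}\nAction:"
        inputs = self.tokenizer(text, return_tensors="pt")
        hidden = self.lm(**inputs).last_hidden_state[:, -1]  # final token state
        return self.head(hidden)  # action logits

policy = LMPolicy(num_actions=4)
logits = policy("put the red block on the green block",
                "red block on table; green block on table; gripper empty")
# Imitation learning would apply a cross-entropy loss between these logits
# and the demonstrated actions; here we just inspect the action distribution.
print(logits.softmax(-1))
```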
- MergeDistill: Merging Pre-trained Language Models using Distillation [5.396915402673246]
We propose MergeDistill, a framework to merge pre-trained LMs in a way that can best leverage their assets with minimal dependencies.
We demonstrate the applicability of our framework in a practical setting by leveraging pre-existing teacher LMs and training student LMs that perform competitively with or even outperform teacher LMs trained on several orders of magnitude more data and with a fixed model capacity.
arXiv Detail & Related papers (2021-06-05T08:22:05Z)
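The core training signal can be illustrated with a generic teacher-to-student distillation step (a hedged sketch, not MergeDistill's actual merging procedure): minimize the KL divergence between the teacher LM's and the student LM's next-token distributions on a transfer corpus. GPT-2 (teacher) and DistilGPT-2 (student) are used below only because they share a tokenizer, and the two-sentence corpus is a placeholder.

```python
# A hedged sketch of distilling teacher LM knowledge into a student LM (not
# MergeDistill's exact procedure): minimize the KL divergence between teacher
# and student next-token distributions on unlabeled text. GPT-2 (teacher) and
# DistilGPT-2 (student) are used here only because they share a tokenizer.
import torch
import torch.nn.functional as F
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
teacher = AutoModelForCausalLM.from_pretrained("gpt2").eval()
student = AutoModelForCausalLM.from_pretrained("distilgpt2")
optimizer = torch.optim.AdamW(student.parameters(), lr=1e-5)

# Placeholder transfer corpus; real training would use a large unlabeled set.
corpus = ["Distillation transfers knowledge from teacher to student models.",
          "Merged students can match teachers trained on far more data."]

for text in corpus:  # one tiny pass over the transfer corpus
    inputs = tokenizer(text, return_tensors="pt")
    with torch.no_grad():
        teacher_logits = teacher(**inputs).logits
    student_logits = student(**inputs).logits
    # KL(teacher || student) over the vocabulary at every position.
    loss = F.kl_div(
        F.log_softmax(student_logits, dim=-1),
        F.softmax(teacher_logits, dim=-1),
        reduction="batchmean",
    )
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    print(f"distillation loss: {loss.item():.4f}")
```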
- Masked Language Modeling and the Distributional Hypothesis: Order Word Matters Pre-training for Little [74.49773960145681]
A possible explanation for the impressive performance of masked language model (MLM) pre-training is that such models have learned to represent the syntactic structures prevalent in NLP pipelines.
In this paper, we propose a different explanation: pre-trained MLMs succeed on downstream tasks almost entirely due to their ability to model higher-order word co-occurrence statistics.
Our results show that purely distributional information largely explains the success of pre-training, and underscore the importance of curating challenging evaluation datasets that require deeper linguistic knowledge.
arXiv Detail & Related papers (2021-04-14T06:30:36Z)