Learning to Generate Explainable Stock Predictions using Self-Reflective Large Language Models
- URL: http://arxiv.org/abs/2402.03659v3
- Date: Thu, 29 Feb 2024 12:10:37 GMT
- Title: Learning to Generate Explainable Stock Predictions using Self-Reflective Large Language Models
- Authors: Kelvin J.L. Koa, Yunshan Ma, Ritchie Ng, Tat-Seng Chua
- Abstract summary: We propose a framework to teach Large Language Models (LLMs) to generate explainable stock predictions.
A reflective agent learns how to explain past stock movements through self-reasoning, while the PPO trainer trains the model to generate the most likely explanations.
Our framework can outperform both traditional deep-learning and LLM methods in prediction accuracy and Matthews correlation coefficient.
- Score: 54.21695754082441
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Explaining stock predictions is generally a difficult task for traditional
non-generative deep learning models, where explanations are limited to
visualizing the attention weights on important texts. Today, Large Language
Models (LLMs) present a solution to this problem, given their known
capabilities to generate human-readable explanations for their decision-making
process. However, the task of stock prediction remains challenging for LLMs, as
it requires the ability to weigh the varying impacts of chaotic social texts on
stock prices. The problem gets progressively harder with the introduction of
the explanation component, which requires LLMs to explain verbally why certain
factors are more important than the others. On the other hand, to fine-tune
LLMs for such a task, one would need expert-annotated samples of explanation
for every stock movement in the training set, which is expensive and
impractical to scale. To tackle these issues, we propose our
Summarize-Explain-Predict (SEP) framework, which utilizes a self-reflective
agent and Proximal Policy Optimization (PPO) to let an LLM teach itself how to
generate explainable stock predictions in a fully autonomous manner. The
reflective agent learns how to explain past stock movements through
self-reasoning, while the PPO trainer trains the model to generate the most
likely explanations from input texts. The training samples for the PPO trainer
are also the responses generated during the reflective process, which
eliminates the need for human annotators. Using our SEP framework, we fine-tune
an LLM that can outperform both traditional deep-learning and LLM methods in
prediction accuracy and Matthews correlation coefficient for the stock
classification task. To validate the generalization capability of our framework,
we further test it on the portfolio construction task, and demonstrate its
effectiveness through various portfolio metrics.
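As a rough illustration of the training pipeline described in the abstract, the following minimal sketch (hypothetical, not the authors' released code) shows how self-reflective rollouts could be collected without human annotators: the LLM predicts and explains a past movement, the attempt is rewarded by checking the prediction against the realized movement, wrong attempts trigger a self-reflection prompt, and the resulting (prompt, response, reward) triples would then be handed to a PPO trainer. The names `query_llm` and `collect_ppo_samples`, the prompt wording, and the binary reward are assumptions.

```python
from dataclasses import dataclass
from typing import List

@dataclass
class Sample:
    prompt: str
    response: str   # prediction plus verbal explanation
    reward: float   # +1 if the prediction matched the realized movement

def query_llm(prompt: str) -> str:
    """Placeholder for an LLM call; swap in a real chat-completion client."""
    return ("Prediction: Positive. Explanation: upbeat earnings coverage "
            "dominates the summarized tweets.")

def parse_prediction(response: str) -> str:
    """Crude label extraction from the model's free-form answer."""
    return "Positive" if "Positive" in response else "Negative"

def collect_ppo_samples(summaries: List[str], labels: List[str],
                        max_reflections: int = 2) -> List[Sample]:
    """Gather (prompt, response, reward) triples without human annotation."""
    samples: List[Sample] = []
    for summary, label in zip(summaries, labels):
        prompt = (f"Summarized tweets: {summary}\n"
                  "Predict the next-day stock movement and explain your reasoning.")
        for _ in range(max_reflections + 1):
            response = query_llm(prompt)
            reward = 1.0 if parse_prediction(response) == label else -1.0
            samples.append(Sample(prompt, response, reward))
            if reward > 0:
                break  # explanation matched the realized movement; stop reflecting
            # Self-reflection: feed the failed attempt back so the model can revise it.
            prompt += (f"\nPrevious attempt: {response}\n"
                       "The prediction was wrong. Reflect on the mistake and try again.")
    return samples

if __name__ == "__main__":
    triples = collect_ppo_samples(["Apple beats quarterly earnings expectations."],
                                  ["Positive"])
    print(len(triples), triples[0].reward)
```

In this reading, the PPO stage would then fine-tune the LLM so that high-reward responses become more likely, which is what removes the need for expert-annotated explanations.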
Related papers
- From Words to Actions: Unveiling the Theoretical Underpinnings of LLM-Driven Autonomous Systems [59.40480894948944]
Large language model (LLM) empowered agents are able to solve decision-making problems in the physical world.
Under this model, the LLM Planner navigates a partially observable Markov decision process (POMDP) by iteratively generating language-based subgoals via prompting.
We prove that the pretrained LLM Planner effectively performs Bayesian aggregated imitation learning (BAIL) through in-context learning.
arXiv Detail & Related papers (2024-05-30T09:42:54Z)
- LLM Processes: Numerical Predictive Distributions Conditioned on Natural Language [35.84181171987974]
Our goal is to build a regression model that can process numerical data and make probabilistic predictions at arbitrary locations.
We start by exploring strategies for eliciting explicit, coherent numerical predictive distributions from Large Language Models.
We demonstrate the ability to usefully incorporate text into numerical predictions, improving predictive performance and giving quantitative structure that reflects qualitative descriptions (a minimal elicitation sketch appears after this list).
arXiv Detail & Related papers (2024-05-21T15:13:12Z)
- From Understanding to Utilization: A Survey on Explainability for Large Language Models [27.295767173801426]
This survey underscores the imperative for increased explainability in Large Language Models (LLMs).
Our focus is primarily on pre-trained Transformer-based LLMs, which pose distinctive interpretability challenges due to their scale and complexity.
When considering the utilization of explainability, we explore several compelling methods that concentrate on model editing, control generation, and model enhancement.
arXiv Detail & Related papers (2024-01-23T16:09:53Z)
- Explanation-aware Soft Ensemble Empowers Large Language Model In-context Learning [50.00090601424348]
Large language models (LLMs) have shown remarkable capabilities in various natural language understanding tasks.
We propose EASE, an Explanation-Aware Soft Ensemble framework to empower in-context learning with LLMs.
arXiv Detail & Related papers (2023-11-13T06:13:38Z)
- In-Context Explainers: Harnessing LLMs for Explaining Black Box Models [28.396104334980492]
Large Language Models (LLMs) have demonstrated exceptional capabilities in complex tasks like machine translation, commonsense reasoning, and language understanding.
One of the primary reasons for the adaptability of LLMs in such diverse tasks is their in-context learning (ICL) capability, which allows them to perform well on new tasks by simply using a few task samples in the prompt.
We propose a novel framework, In-Context Explainers, comprising three approaches that exploit the ICL capabilities of LLMs to explain the predictions made by other predictive models.
arXiv Detail & Related papers (2023-10-09T15:31:03Z)
- Are Large Language Models Really Robust to Word-Level Perturbations? [68.60618778027694]
We propose a novel rational evaluation approach that leverages pre-trained reward models as diagnostic tools.
Longer conversations demonstrate a language model's comprehensive grasp of language, reflected in its proficiency at understanding questions.
Our results demonstrate that LLMs frequently exhibit vulnerability to word-level perturbations that are commonplace in daily language usage.
arXiv Detail & Related papers (2023-09-20T09:23:46Z)
- Evaluating and Explaining Large Language Models for Code Using Syntactic Structures [74.93762031957883]
This paper introduces ASTxplainer, an explainability method specific to Large Language Models for code.
At its core, ASTxplainer provides an automated method for aligning token predictions with AST nodes.
We perform an empirical evaluation on 12 popular LLMs for code using a curated dataset of the most popular GitHub projects.
arXiv Detail & Related papers (2023-08-07T18:50:57Z)
- Masked Language Modeling and the Distributional Hypothesis: Order Word Matters Pre-training for Little [74.49773960145681]
A possible explanation for the impressive performance of masked language model (MLM) pre-training is that such models have learned to represent the syntactic structures prevalent in NLP pipelines.
In this paper, we propose a different explanation: pre-trained MLMs succeed on downstream tasks almost entirely due to their ability to model higher-order word co-occurrence statistics.
Our results show that purely distributional information largely explains the success of pre-training, and underscore the importance of curating challenging evaluation datasets that require deeper linguistic knowledge.
arXiv Detail & Related papers (2021-04-14T06:30:36Z)
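The "LLM Processes" entry above mentions eliciting explicit numerical predictive distributions from LLMs. The sketch below is a hypothetical illustration of one simple way to do that (not that paper's actual method): prompt for probabilities over coarse return bins, parse the reply, and renormalize. `query_llm`, the bin choices, and the reply format are assumptions.

```python
from typing import Dict

# Hypothetical coarse bins for next-day returns.
BINS = ["<-2%", "-2% to 0%", "0% to +2%", ">+2%"]

def query_llm(prompt: str) -> str:
    """Placeholder LLM call; replace with a real client."""
    return "<-2%: 0.1, -2% to 0%: 0.2, 0% to +2%: 0.4, >+2%: 0.3"

def elicit_distribution(context: str) -> Dict[str, float]:
    """Ask for per-bin probabilities, parse the reply, and renormalize."""
    prompt = (f"{context}\n"
              f"Give a probability for each next-day return bin {BINS}, "
              "formatted as 'bin: probability' pairs separated by commas.")
    reply = query_llm(prompt)
    probs: Dict[str, float] = {}
    for part in reply.split(","):
        name, value = part.rsplit(":", 1)
        probs[name.strip()] = float(value)
    total = sum(probs.values()) or 1.0
    return {k: v / total for k, v in probs.items()}  # ensure the bins sum to 1

if __name__ == "__main__":
    print(elicit_distribution("Recent headlines: chip demand stays strong."))
```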