Stock Embeddings: Learning Distributed Representations for Financial
Assets
- URL: http://arxiv.org/abs/2202.08968v1
- Date: Mon, 14 Feb 2022 15:39:06 GMT
- Title: Stock Embeddings: Learning Distributed Representations for Financial
Assets
- Authors: Rian Dolphin, Barry Smyth, Ruihai Dong
- Abstract summary: We propose a neural model for training stock embeddings, which harnesses the dynamics of historical returns data.
We describe our approach in detail and discuss a number of ways that it can be used in the financial domain.
- Score: 11.67728795230542
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Identifying meaningful relationships between the price movements of financial
assets is a challenging but important problem in a variety of financial
applications. However with recent research, particularly those using machine
learning and deep learning techniques, focused mostly on price forecasting, the
literature investigating the modelling of asset correlations has lagged
somewhat. To address this, inspired by recent successes in natural language
processing, we propose a neural model for training stock embeddings, which
harnesses the dynamics of historical returns data in order to learn the nuanced
relationships that exist between financial assets. We describe our approach in
detail and discuss a number of ways that it can be used in the financial
domain. Furthermore, we present the evaluation results to demonstrate the
utility of this approach, compared to several important benchmarks, in two
real-world financial analytics tasks.
Related papers
- Large Language Models for Financial Aid in Financial Time-series Forecasting [0.4218593777811082]
Time series forecasting in financial aid is difficult due to limited historical datasets and high dimensional financial information.
We use state-of-the-art time series models including pre-trained LLMs (GPT-2 as the backbone), transformers, and linear models to demonstrate their ability to outperform traditional approaches.
arXiv Detail & Related papers (2024-10-24T12:41:47Z) - Contrastive Learning of Asset Embeddings from Financial Time Series [8.595725772518332]
We propose a novel contrastive learning framework to generate asset embeddings from financial time series data.
Our approach leverages the similarity of asset returns over many subwindows to generate informative positive and negative samples.
Experiments on real-world datasets demonstrate the effectiveness of the learned asset embeddings on benchmark industry classification and portfolio optimization tasks.
arXiv Detail & Related papers (2024-07-26T10:26:44Z) - A Survey of Large Language Models for Financial Applications: Progress, Prospects and Challenges [60.546677053091685]
Large language models (LLMs) have unlocked novel opportunities for machine learning applications in the financial domain.
We explore the application of LLMs on various financial tasks, focusing on their potential to transform traditional practices and drive innovation.
We highlight this survey for categorizing the existing literature into key application areas, including linguistic tasks, sentiment analysis, financial time series, financial reasoning, agent-based modeling, and other applications.
arXiv Detail & Related papers (2024-06-15T16:11:35Z) - AlphaFin: Benchmarking Financial Analysis with Retrieval-Augmented Stock-Chain Framework [48.3060010653088]
We release AlphaFin datasets, combining traditional research datasets, real-time financial data, and handwritten chain-of-thought (CoT) data.
We then use AlphaFin datasets to benchmark a state-of-the-art method, called Stock-Chain, for effectively tackling the financial analysis task.
arXiv Detail & Related papers (2024-03-19T09:45:33Z) - PIXIU: A Large Language Model, Instruction Data and Evaluation Benchmark
for Finance [63.51545277822702]
PIXIU is a comprehensive framework including the first financial large language model (LLMs) based on fine-tuning LLaMA with instruction data.
We propose FinMA by fine-tuning LLaMA with the constructed dataset to be able to follow instructions for various financial tasks.
We conduct a detailed analysis of FinMA and several existing LLMs, uncovering their strengths and weaknesses in handling critical financial tasks.
arXiv Detail & Related papers (2023-06-08T14:20:29Z) - Bayesian Bilinear Neural Network for Predicting the Mid-price Dynamics
in Limit-Order Book Markets [84.90242084523565]
Traditional time-series econometric methods often appear incapable of capturing the true complexity of the multi-level interactions driving the price dynamics.
By adopting a state-of-the-art second-order optimization algorithm, we train a Bayesian bilinear neural network with temporal attention.
By addressing the use of predictive distributions to analyze errors and uncertainties associated with the estimated parameters and model forecasts, we thoroughly compare our Bayesian model with traditional ML alternatives.
arXiv Detail & Related papers (2022-03-07T18:59:54Z) - FinQA: A Dataset of Numerical Reasoning over Financial Data [52.7249610894623]
We focus on answering deep questions over financial data, aiming to automate the analysis of a large corpus of financial documents.
We propose a new large-scale dataset, FinQA, with Question-Answering pairs over Financial reports, written by financial experts.
The results demonstrate that popular, large, pre-trained models fall far short of expert humans in acquiring finance knowledge.
arXiv Detail & Related papers (2021-09-01T00:08:14Z) - Algorithms for Learning Graphs in Financial Markets [5.735035463793008]
We investigate the fundamental problem of learning undirected graphical models under Laplacian structural constraints.
We present natural justifications, supported by empirical evidence, for the usage of the Laplacian matrix as a model for the precision matrix of financial assets.
We design numerical algorithms based on the alternating direction method of multipliers to learn undirected, weighted graphs.
arXiv Detail & Related papers (2020-12-31T02:48:35Z) - Navigating the Dynamics of Financial Embeddings over Time [0.0]
We propose the application of Graph Representation Learning in a scalable dynamic setting.
We perform a rigorous qualitative analysis of the latent trajectories to extract real world insights.
arXiv Detail & Related papers (2020-07-01T16:27:31Z) - Gaussian process imputation of multiple financial series [71.08576457371433]
Multiple time series such as financial indicators, stock prices and exchange rates are strongly coupled due to their dependence on the latent state of the market.
We focus on learning the relationships among financial time series by modelling them through a multi-output Gaussian process.
arXiv Detail & Related papers (2020-02-11T19:18:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.