Signature-Informed Transformer for Asset Allocation
- URL: http://arxiv.org/abs/2510.03129v1
- Date: Fri, 03 Oct 2025 15:58:21 GMT
- Title: Signature-Informed Transformer for Asset Allocation
- Authors: Yoontae Hwang, Stefan Zohren,
- Abstract summary: Signature-Informed Transformer is a framework that learns end-to-end allocation policies by directly optimizing a risk-aware financial objective.<n> evaluated on daily S&P 100 equity data, SIT decisively outperforms traditional and deep-learning baselines.<n>Results indicate that portfolio-aware objectives and geometry-aware inductive biases are essential for risk-aware capital allocation in machine-learning systems.
- Score: 9.290367832033063
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Robust asset allocation is a key challenge in quantitative finance, where deep-learning forecasters often fail due to objective mismatch and error amplification. We introduce the Signature-Informed Transformer (SIT), a novel framework that learns end-to-end allocation policies by directly optimizing a risk-aware financial objective. SIT's core innovations include path signatures for a rich geometric representation of asset dynamics and a signature-augmented attention mechanism embedding financial inductive biases, like lead-lag effects, into the model. Evaluated on daily S\&P 100 equity data, SIT decisively outperforms traditional and deep-learning baselines, especially when compared to predict-then-optimize models. These results indicate that portfolio-aware objectives and geometry-aware inductive biases are essential for risk-aware capital allocation in machine-learning systems. The code is available at: https://github.com/Yoontae6719/Signature-Informed-Transformer-For-Asset-Allocation
Related papers
- MAESTRO: Meta-learning Adaptive Estimation of Scalarization Trade-offs for Reward Optimization [56.074760766965085]
Group-Relative Policy Optimization has emerged as an efficient paradigm for aligning Large Language Models (LLMs)<n>We propose MAESTRO, which treats reward scalarization as a dynamic latent policy, leveraging the model's terminal hidden states as a semantic bottleneck.<n>We formulate this as a contextual bandit problem within a bi-level optimization framework, where a lightweight Conductor network co-evolves with the policy by utilizing group-relative advantages as a meta-reward signal.
arXiv Detail & Related papers (2026-01-12T05:02:48Z) - Adaptive Financial Sentiment Analysis for NIFTY 50 via Instruction-Tuned LLMs , RAG and Reinforcement Learning Approaches [1.9116784879310027]
Existing works in financial sentiment analysis have not considered the impact of stock prices or market feedback on sentiment analysis.<n>We propose an adaptive framework that integrates large language models (LLMs) with real-world stock market feedback to improve sentiment classification.
arXiv Detail & Related papers (2025-12-23T06:27:12Z) - ASTIF: Adaptive Semantic-Temporal Integration for Cryptocurrency Price Forecasting [6.12055122337183]
ASTIF is a hybrid intelligent system that adapts its forecasting strategy in real time through confidence-based meta-learning.<n>A confidence-aware meta-learner functions as an adaptive inference layer, modulating each predictor's contribution based on its real-time uncertainty.<n>The research contributes a scalable, knowledge-based solution for fusing quantitative and qualitative data in non-stationary environments.
arXiv Detail & Related papers (2025-12-21T09:17:36Z) - Trade in Minutes! Rationality-Driven Agentic System for Quantitative Financial Trading [57.28635022507172]
TiMi is a rationality-driven multi-agent system that architecturally decouples strategy development from minute-level deployment.<n>We propose a two-tier analytical paradigm from macro patterns to micro customization, layered programming design for trading bot implementation, and closed-loop optimization driven by mathematical reflection.
arXiv Detail & Related papers (2025-10-06T13:08:55Z) - Enhancing Credit Risk Prediction: A Meta-Learning Framework Integrating Baseline Models, LASSO, and ECOC for Superior Accuracy [7.254744067646655]
This research proposes a comprehensive meta-learning framework that synthesizes multiple complementary models.<n>We implement Permutation Feature Importance analysis for each prediction class across all constituent models.<n>Results demonstrate that our framework significantly enhances the accuracy of financial entity classification.
arXiv Detail & Related papers (2025-09-26T14:09:04Z) - ResT: Reshaping Token-Level Policy Gradients for Tool-Use Large Language Models [62.82372407840088]
Large language models (LLMs) transcend passive generation and act as goal-directed agents by invoking external tools.<n>textbfReshaped textbfToken-level policy gradients (textbfResT) for tool-use tasks.<n>textbfResT achieves state-of-the-art results, outperforming prior methods by up to $8.76%$.
arXiv Detail & Related papers (2025-09-26T03:38:27Z) - Quantum Reinforcement Learning Trading Agent for Sector Rotation in the Taiwan Stock Market [7.360168388085351]
We propose a hybrid quantum-classical reinforcement learning framework for sector rotation in the Taiwan stock market.<n>Although quantum-enhanced models consistently achieve higher training rewards, they underperform classical models in real-world investment metrics.<n>This discrepancy highlights a core challenge in applying reinforcement learning to financial domains.
arXiv Detail & Related papers (2025-06-26T01:29:19Z) - UniErase: Towards Balanced and Precise Unlearning in Language Models [69.04923022755547]
Large language models (LLMs) require iterative updates to address the outdated information problem.<n>UniErase is a novel unlearning framework that demonstrates precision and balanced performances between knowledge unlearning and ability retaining.
arXiv Detail & Related papers (2025-05-21T15:53:28Z) - The Evolution of Alpha in Finance Harnessing Human Insight and LLM Agents [0.0]
Pursuit of alpha returns that exceed market benchmarks has undergone a profound transformation.<n>This paper introduces a comprehensive five stage taxonomy that traces this progression.<n>The proposed taxonomy offers a unified framework for evaluating maturity, aligning infrastructure, and guiding the responsible development of next generation alpha systems.
arXiv Detail & Related papers (2025-05-20T00:51:43Z) - An Advanced Ensemble Deep Learning Framework for Stock Price Prediction Using VAE, Transformer, and LSTM Model [4.097563258332958]
This research proposes a cutting-edge ensemble deep learning framework for stock price prediction by combining three advanced neural network architectures.<n>The framework uses rich set of technical indicators and it scales its predictors based on the current market situation.<n>It has a very important application in algorithmic trading, risk analysis, and control and decision-making for finance professions and scholars.
arXiv Detail & Related papers (2025-03-28T07:20:40Z) - Gradient Reduction Convolutional Neural Network Policy for Financial Deep Reinforcement Learning [0.0]
This paper introduces two significant enhancements to refine our CNN model's predictive performance and robustness for financial data.
Firstly, we integrate a normalization layer at the input stage to ensure consistent feature scaling.
Secondly, we employ a Gradient Reduction Architecture, where earlier layers are wider and subsequent layers are progressively narrower.
arXiv Detail & Related papers (2024-08-16T11:39:03Z) - Advancing Anomaly Detection: Non-Semantic Financial Data Encoding with LLMs [49.57641083688934]
We introduce a novel approach to anomaly detection in financial data using Large Language Models (LLMs) embeddings.
Our experiments demonstrate that LLMs contribute valuable information to anomaly detection as our models outperform the baselines.
arXiv Detail & Related papers (2024-06-05T20:19:09Z) - Diffusion Variational Autoencoder for Tackling Stochasticity in
Multi-Step Regression Stock Price Prediction [54.21695754082441]
Multi-step stock price prediction over a long-term horizon is crucial for forecasting its volatility.
Current solutions to multi-step stock price prediction are mostly designed for single-step, classification-based predictions.
We combine a deep hierarchical variational-autoencoder (VAE) and diffusion probabilistic techniques to do seq2seq stock prediction.
Our model is shown to outperform state-of-the-art solutions in terms of its prediction accuracy and variance.
arXiv Detail & Related papers (2023-08-18T16:21:15Z) - Supercharging Imbalanced Data Learning With Energy-based Contrastive
Representation Transfer [72.5190560787569]
In computer vision, learning from long tailed datasets is a recurring theme, especially for natural image datasets.
Our proposal posits a meta-distributional scenario, where the data generating mechanism is invariant across the label-conditional feature distributions.
This allows us to leverage a causal data inflation procedure to enlarge the representation of minority classes.
arXiv Detail & Related papers (2020-11-25T00:13:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.