Related papers: Variable selection for minimum-variance portfolios

Variable selection for minimum-variance portfolios

URL: http://arxiv.org/abs/2508.14986v1
Date: Wed, 20 Aug 2025 18:14:39 GMT
Title: Variable selection for minimum-variance portfolios
Authors: Guilherme V. Moura, André P. Santos, Hudson S. Torrent,
Abstract summary: We parameterize minimum-variance portfolio weights as a function of a large pool of firm-level characteristics.<n>We find that the gains from employing ML to select relevant predictors are substantial.<n>Some of the selected predictors that help decreasing portfolio risk also increase returns.
Score: 0.0
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Machine learning (ML) methods have been successfully employed in identifying variables that can predict the equity premium of individual stocks. In this paper, we investigate if ML can also be helpful in selecting variables relevant for optimal portfolio choice. To address this question, we parameterize minimum-variance portfolio weights as a function of a large pool of firm-level characteristics as well as their second-order and cross-product transformations, yielding a total of 4,610 predictors. We find that the gains from employing ML to select relevant predictors are substantial: minimum-variance portfolios achieve lower risk relative to sparse specifications commonly considered in the literature, especially when non-linear terms are added to the predictor space. Moreover, some of the selected predictors that help decreasing portfolio risk also increase returns, leading to minimum-variance portfolios with good performance in terms of Shape ratios in some situations. Our evidence suggests that ad-hoc sparsity can be detrimental to the performance of minimum-variance characteristics-based portfolios.

Related papers

Covariance-Aware Simplex Projection for Cardinality-Constrained Portfolio Optimization [0.0]
Covariance-Aware Simplex Projection (CASP) is a two-stage repair operator that selects a target number of assets using volatility-normalized scores.<n>On S&P 500 data (2020-2024), CASP-Basic delivers materially lower portfolio variance than standard Euclidean repair.
arXiv Detail & Related papers (2025-12-23T02:22:53Z)
Integrated Prediction and Multi-period Portfolio Optimization [29.582959310549594]
Multi-period portfolio optimization accounts for transaction costs, path-dependent risks, and the intertemporal structure of trading decisions.<n>This paper introduces IPMO, a model for multi-period mean-variance portfolio optimization with turnover penalties.<n>For scalability, we introduce a mirror-descent fixed-point (MDFP) differentiation scheme that avoids factorizing the Karush-Kuhn-Tucker (KKT) systems.
arXiv Detail & Related papers (2025-12-12T04:31:22Z)
Bayesian Portfolio Optimization by Predictive Synthesis [5.319802998033766]
Most existing portfolio optimization methods require information on the distribution of returns of the assets that make up the portfolio.<n>Various methods have been proposed to estimate distribution information, but their accuracy greatly depends on the uncertainty of the financial markets.
arXiv Detail & Related papers (2025-10-08T16:18:11Z)
Revisiting Essential and Nonessential Settings of Evidential Deep Learning [70.82728812001807]
Evidential Deep Learning (EDL) is an emerging method for uncertainty estimation. We propose Re-EDL, a simplified yet more effective variant of EDL.
arXiv Detail & Related papers (2024-10-01T04:27:07Z)
Return Prediction for Mean-Variance Portfolio Selection: How Decision-Focused Learning Shapes Forecasting Models [25.72157859795055]
Decision-Focused Learning can integrate prediction and optimization to improve decision-making outcomes.<n>This study investigates how DFL adjusts stock return prediction models to optimize decisions in mean-variance optimization (MVO)<n>Our findings reveal why DFL achieves superior portfolio performance despite higher prediction errors.
arXiv Detail & Related papers (2024-09-15T10:37:11Z)
Deep Reinforcement Learning and Mean-Variance Strategies for Responsible Portfolio Optimization [49.396692286192206]
We study the use of deep reinforcement learning for responsible portfolio optimization by incorporating ESG states and objectives. Our results show that deep reinforcement learning policies can provide competitive performance against mean-variance approaches for responsible portfolio allocation.
arXiv Detail & Related papers (2024-03-25T12:04:03Z)
Model-Based Epistemic Variance of Values for Risk-Aware Policy Optimization [59.758009422067]
We consider the problem of quantifying uncertainty over expected cumulative rewards in model-based reinforcement learning. We propose a new uncertainty Bellman equation (UBE) whose solution converges to the true posterior variance over values. We introduce a general-purpose policy optimization algorithm, Q-Uncertainty Soft Actor-Critic (QU-SAC) that can be applied for either risk-seeking or risk-averse policy optimization.
arXiv Detail & Related papers (2023-12-07T15:55:58Z)
Diffusion Variational Autoencoder for Tackling Stochasticity in Multi-Step Regression Stock Price Prediction [54.21695754082441]
Multi-step stock price prediction over a long-term horizon is crucial for forecasting its volatility. Current solutions to multi-step stock price prediction are mostly designed for single-step, classification-based predictions. We combine a deep hierarchical variational-autoencoder (VAE) and diffusion probabilistic techniques to do seq2seq stock prediction. Our model is shown to outperform state-of-the-art solutions in terms of its prediction accuracy and variance.
arXiv Detail & Related papers (2023-08-18T16:21:15Z)
Prediction-Oriented Bayesian Active Learning [51.426960808684655]
Expected predictive information gain (EPIG) is an acquisition function that measures information gain in the space of predictions rather than parameters. EPIG leads to stronger predictive performance compared with BALD across a range of datasets and models.
arXiv Detail & Related papers (2023-04-17T10:59:57Z)
Forecasting Large Realized Covariance Matrices: The Benefits of Factor Models and Shrinkage [1.0323063834827415]
We decompose the return covariance matrix using standard firm-level factors and use sectoral restrictions in the residual covariance matrix. Our methodology improves forecasting precision relative to standard benchmarks and leads to better estimates of minimum variance portfolios.
arXiv Detail & Related papers (2023-03-22T16:38:22Z)
Project and Probe: Sample-Efficient Domain Adaptation by Interpolating Orthogonal Features [119.22672589020394]
We propose a lightweight, sample-efficient approach that learns a diverse set of features and adapts to a target distribution by interpolating these features. Our experiments on four datasets, with multiple distribution shift settings for each, show that Pro$2$ improves performance by 5-15% when given limited target data.
arXiv Detail & Related papers (2023-02-10T18:58:03Z)
Empirical Asset Pricing via Ensemble Gaussian Process Regression [4.281723404774889]
Our ensemble learning approach significantly reduces the computational complexity inherent in GPR inference.<n>We find that our method dominates existing machine learning models statistically and economically.<n>It appeals to an uncertainty averse investor and significantly dominates the equal- and value-weighted prediction-sorted portfolios, which outperform the S&P 500.
arXiv Detail & Related papers (2022-12-02T09:37:29Z)
LoCoV: low dimension covariance voting algorithm for portfolio optimization [0.0]
We analyze the random matrix aspects of portfolio optimization and identify the order of errors in sample optimal portfolio weight. We also provide LoCoV (low dimension covariance voting) algorithm to reduce error inherited from random samples.
arXiv Detail & Related papers (2022-04-01T04:42:56Z)
SUMO: Unbiased Estimation of Log Marginal Probability for Latent Variable Models [80.22609163316459]
We introduce an unbiased estimator of the log marginal likelihood and its gradients for latent variable models based on randomized truncation of infinite series. We show that models trained using our estimator give better test-set likelihoods than a standard importance-sampling based approach for the same average computational cost.
arXiv Detail & Related papers (2020-04-01T11:49:30Z)
TPLVM: Portfolio Construction by Student's $t$-process Latent Variable Model [3.5408022972081694]
We propose the Student's $t$-process latent variable model (TPLVM) to describe non-Gaussian fluctuations of financial timeseries by lower dimensional latent variables. By comparing these portfolios, we confirm the proposed portfolio outperforms that of the existing Gaussian process latent variable model.
arXiv Detail & Related papers (2020-01-29T02:02:02Z)

This list is automatically generated from the titles and abstracts of the papers in this site.