Building an efficient investment portfolio begins with a fundamental challenge: understanding how various assets move in relation to one another. In expansive financial markets, this analysis is far from straightforward. The intricacies arise because observed data captures a mix of genuine collective asset behaviors and random noise inherent in sampling. This noise can obscure the true underlying relationships, leading to misinformed risk assessments. Even minor inaccuracies in estimating these co-movements, often expressed through covariance matrices, can yield unstable or suboptimal asset allocations, which can jeopardize an investor’s goals.
In a groundbreaking study featured in The Journal of Finance and Data Science, researchers from CentraleSupélec in France, alongside mathematicians from the University of Catania and University of Palermo in Italy, introduce a novel neural-network-based method to refine the estimation of these asset co-movements. Rather than allowing artificial intelligence to supplant traditional financial theories outright, their technique integrates AI within a transparent, theoretically sound optimization framework. This approach enables the neural network to “clean” the covariance matrix, which represents how assets move together, before it is utilized in portfolio construction.
Central to their methodology is the focus on global minimum-variance portfolios, which aim to minimize risk by reducing volatility. The neural network is trained not just on historical price patterns but specifically on the realized risk outcomes once the portfolio is allocated. This end-to-end training ensures that the covariance matrix cleaning is directly optimized to produce risk-efficient portfolios rather than simply fitting statistical correlations in isolation. Crucially, the process retains transparency by making its individual steps explicit, bridging the gap between black-box AI models and conventional financial engineering.
The novelty lies in their conceptualization of the covariance matrix as more than a mere tabulation of pairwise asset correlations. It encapsulates layered, collective market structures: broad systemic trends, sector-specific behaviors, and idiosyncratic noise elements. Effective cleaning involves discerning which of these collective patterns genuinely reflect market dynamics and which are artifacts of noise, then weighting them appropriately before the allocation step. This nuanced trust assignment to patterns stands in contrast to typical shrinkage or regularization methods used for covariance estimation.
Their framework views covariance transformation as a decision problem extending beyond finance. Any context where a noisy covariance matrix informs subsequent decisions can benefit from such learned corrections. By embedding the problem’s mathematical structure into the neural network architecture, the researchers ensure that the model respects essential symmetries inherent to covariance matrices. For instance, the network’s output remains invariant to the permutation of stock orderings or different mathematical representations of the same risk structure. This design choice prevents overfitting to a specific asset universe and promotes the discovery of generalized covariance cleaning principles.
Empirical evaluation on U.S. equity markets from 2000 to 2024 highlights the effectiveness of the model. Remarkably, a neural network trained on a relatively small set of around a few hundred stocks demonstrated exceptional generalizability when applied to an expanded universe of close to one thousand stocks without any additional retraining. The optimized portfolios consistently outperformed those constructed using conventional covariance estimators, including cutting-edge nonlinear shrinkage techniques. They exhibited materially lower realized volatility, reduced drawdowns during periods of market stress, and enhanced risk-adjusted returns as quantified by Sharpe ratios.
Significantly, the superiority of the neural-network-based portfolios did not diminish when subjected to realistic trading simulations. Accounting for practical market frictions such as transaction costs, bid-ask spreads, exchange fees, and financing expenses, the strategy maintained its performance edge. This robustness underscores the practical applicability of embedding AI-driven covariance cleaning directly within portfolio optimization workflows, rather than treating them as separate, heuristic preprocessing steps.
The study signals a paradigm shift for artificial intelligence in finance. Rather than relegating AI techniques to inscrutable black-box models divorced from domain knowledge, their approach champions intertwining neural networks with financial principles and mathematical constraints. By respecting the structure and symmetries of covariance matrices, the models become interpretable and generalizable tools that enhance decision-making in complex, noisy environments.
Furthermore, the integration of deep learning with covariance matrix estimation offers new avenues for managing uncertainty and noise in myriad complex systems. The neural network’s ability to learn optimal corrections tailored to portfolio risk objectives enables more reliable risk management in large-scale, high-dimensional financial settings. This is critical as portfolios expand in size and complexity, where traditional estimation methods often falter.
The implications extend beyond equities to any domain requiring covariance estimation under noise, such as macroeconomic risk analysis, insurance modeling, or energy system optimization. This research foreshadows a future in quantitative finance where machine learning methods are not standalone black boxes but core components of rigorously designed, mathematically grounded frameworks.
As AI continues to permeate financial decision-making, this study exemplifies how deeper collaboration between domain experts and machine learning researchers can yield transformative advances. The neural-network approach to covariance cleaning sets a new standard for integrating data-driven insights with theoretical finance, enabling more robust, efficient portfolios that withstand the complexities and uncertainties of real-world markets.
Contact:
Christian Bongiorno
Université Paris-Saclay, CentraleSupélec, MICS, Gif-sur-Yvette, France
christian.bongiorno@centralesupelec.fr
Subject of Research: Neural-network-based covariance matrix cleaning for portfolio optimization
Article Title: End-to-end large portfolio optimization for variance minimization with neural networks through covariance cleaning
Web References:
- The Journal of Finance and Data Science
- Article DOI: 10.1016/j.jfds.2026.100179
- Study on ScienceDirect
Image Credits: Christian Bongiorno, Efstratios Manolakis, and Rosario N. Mantegna
Keywords: Machine learning, Artificial intelligence, Finance, Risk management, Neural networks, Covariance, Portfolio optimization, Variance minimization

