Machine learning from a “Universe” of signals: The role of feature engineering
Bin Li, Alberto G. Rossi, Xuemin (Sterling) Yan, Lingling Zheng,
Machine learning from a “Universe” of signals: The role of feature engineering,
Journal of Financial Economics,
Volume 172,
2025,
104138,
ISSN 0304-405X,
https://doi.org/10.1016/j.jfineco.2025.104138.
(https://www.sciencedirect.com/science/article/pii/S0304405X25001461)
Abstract: We construct real-time machine learning strategies based on a “universe” of fundamental signals. The out-of-sample performance of these strategies is economically meaningful and statistically significant, but considerably weaker than those documented by prior studies that use curated sets of signals as predictors. Strategies based on a simple recursive ranking of each signal’s past performance also yield substantially better out-of-sample performance. We find qualitatively similar results when examining past-return-based signals. Our results underscore the key role of feature engineering and, more broadly, inductive biases in enhancing the economic benefits of machine learning investment strategies.
Keywords: Machine learning; Feature engineering; Return predictability; Cross-section of stock returns