Optimizing Crypto-Trading Performance: A Comparative Analysis of Innovative Reward Functions in Reinforcement Learning Models
Аннотация
Cryptocurrency trading presents significant challenges due to extreme market volatility, rapid regime transitions, and non-stationary dynamics that render traditional trading strategies ineffective. Existing reinforcement learning approaches for cryptocurrency trading typically employ simplistic profit-based reward functions that fail to adequately capture risk management considerations, market microstructure costs, temporal dependencies, and regime-specific optimal behaviors. This limitation often results in strategies that perform well during favorable market conditions but suffer catastrophic losses during downturns. This paper introduces five novel reward functions grounded in economic utility theory, market microstructure, behavioral finance, adaptive risk management, and regime-conditional optimization. We systematically evaluate these reward functions across three reinforcement learning algorithms (Deep Q-Network, Proximal Policy Optimization, and Advantage Actor–Critic) and four distinct market regimes (bull, bear, high volatility, and recovery), using Bitcoin hourly data from 2018–2022. Our comprehensive experimental evaluation demonstrates that the Adaptive Risk Control reward function achieves exceptional performance, with a Sharpe ratio of 2.47, cumulative return of 26.4%, and maximum drawdown of only 16.8% during the predominantly bearish 2022 test period. Critically, regime-specific analysis reveals substantial performance heterogeneity: Adaptive Risk Control excels during high volatility (Sharpe ratio 3.21), while Temporal Coherence and Asymmetric Market-Conditional rewards dominate in trending and bear markets, respectively. These findings establish that sophisticated, theory-grounded reward engineering—rather than algorithmic innovations alone—constitutes the primary lever for improving RL trading systems, enabling positive risk-adjusted returns even during severe market downturns.
Перевод пока недоступен