Scale-In Logic: Enables reaching 100% allocation through consecutive same-direction signals rather than single high-risk trades.
Expanded Observation Matrix: Adds capital availability, account value, and position size features for deeper model memory.
Small Account Simulation: Configurable $1,000+ initial account values with 30% safety buffers.
7-Action RL Space: Multi-tier sizing (25%, 50%, 75%) for both Long and Short positions.
Position-Scaled Rewards: Dynamically scales reward signals based on the actual percentage of capital at risk.
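
The scale-in, action-space, and reward features above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the project's actual code: the action numbering (0 for hold, 1-3 for long tiers, 4-6 for short tiers), the names `ACTION_TO_SIGNAL`, `apply_action`, and `scaled_reward`, and the behavior on an opposite-direction signal are all hypothetical. The 30% safety buffer is not modeled here because the list does not say how it is applied.

```python
# Hypothetical action encoding (an assumption, not confirmed by the source):
# 0 = hold, 1-3 = long 25/50/75%, 4-6 = short 25/50/75%.
ACTION_TO_SIGNAL = {
    0: 0.0,
    1: 0.25, 2: 0.50, 3: 0.75,
    4: -0.25, 5: -0.50, 6: -0.75,
}


def apply_action(position: float, action: int) -> float:
    """Return the new signed allocation as a fraction of capital in [-1.0, 1.0].

    Positive values are long exposure, negative values are short exposure.
    """
    signal = ACTION_TO_SIGNAL[action]
    if signal == 0.0:
        return position  # hold: leave the current allocation unchanged

    if position * signal > 0:
        # Same direction as the open position: scale in by adding the tier.
        new_position = position + signal
    else:
        # Flat or opposite direction: assumed to start a fresh position
        # at the requested tier (the source does not specify this case).
        new_position = signal

    # Cap total exposure at 100% of capital in either direction.
    return max(-1.0, min(1.0, new_position))


def scaled_reward(position: float, price_return: float) -> float:
    """Scale the raw price return by the signed fraction of capital at risk."""
    return position * price_return


if __name__ == "__main__":
    pos = apply_action(0.0, 3)   # open long at 75%
    pos = apply_action(pos, 1)   # second long signal adds 25%
    print(pos, scaled_reward(pos, 0.02))
```

Under these assumptions no single action can reach full allocation, since the largest tier is 75%; it takes at least two consecutive same-direction signals, which matches the scale-in description. Multiplying the return by the signed allocation means a 50% position earns half the reward of a full position for the same price move.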