About
PufferLib is a specialized skill for Claude Code that streamlines the development and training of reinforcement learning (RL) models. It helps developers reach high training throughput, up to millions of steps per second, through optimized parallel simulation and the PuffeRL trainer. Whether you are building custom environments with the PufferEnv API, integrating standard Gymnasium or PettingZoo tasks, or scaling policies with CNNs and LSTMs, the skill provides domain-specific guidance and implementation patterns for RL experimentation and optimization.