013,983 GitHub stars
02Interactive configuration via a simple command-line interface
03Seamless integration with HuggingFace Transformers, PEFT, and TRL
04Unified API for DDP, DeepSpeed, FSDP, and Megatron-LM
05Automatic device placement and mixed precision support (FP16/BF16/FP8)
06Built-in support for gradient accumulation and sharding strategies