01 Speculative RL workflows using EAGLE for significant rollout speedups
02 0 GitHub stars
03 Low-precision optimization with unified FP8 and INT4 Quantization-Aware Training
04 Advanced train-inference alignment using TIS/MIS and kernel-level optimizations
05 Large-scale MoE training support for models like DeepSeek V3 and Qwen3-MoE
06 Bit-wise expert alignment via Rollout Routing Replay (R3) technology