01 One-shot pruning using Wanda and SparseGPT algorithms
02 Calibration workflows for activation-aware weight removal
03 Support for NVIDIA-optimized N:M (2:4) structured sparsity
04 Performance evaluation pipelines for pruned vs. baseline models
05 3,983 GitHub stars
06 Layer-wise and iterative pruning strategies for accuracy recovery
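
To make the activation-aware and 2:4 items above concrete, here is a minimal sketch in dependency-free Python. It assumes the commonly described Wanda score (weight magnitude multiplied by the L2 norm of the matching input feature over a calibration batch) and applies it within groups of four consecutive weights. The function names `wanda_scores` and `prune_n_of_m` are hypothetical and chosen for illustration; this is not the project's API, and a real workflow would run on framework tensors rather than nested lists.

```python
import math


def wanda_scores(weight, calib_inputs):
    """Score each weight as |W[i][j]| * ||X[:, j]||_2.

    weight:       list of rows, shape (out_features, in_features)
    calib_inputs: list of calibration samples, shape (n_samples, in_features)
    """
    n_features = len(weight[0])
    # L2 norm of every input feature across the calibration samples.
    norms = [
        math.sqrt(sum(sample[j] ** 2 for sample in calib_inputs))
        for j in range(n_features)
    ]
    return [[abs(w) * norms[j] for j, w in enumerate(row)] for row in weight]


def prune_n_of_m(weight, scores, n=2, m=4):
    """Keep the n highest-scoring weights in each group of m consecutive inputs."""
    pruned = []
    for w_row, s_row in zip(weight, scores):
        if len(w_row) % m != 0:
            raise ValueError("in_features must be a multiple of m")
        new_row = []
        for start in range(0, len(w_row), m):
            group = range(start, start + m)
            # Indices of the n largest scores inside this group.
            keep = set(sorted(group, key=lambda j: s_row[j], reverse=True)[:n])
            new_row.extend(w_row[j] if j in keep else 0.0 for j in group)
        pruned.append(new_row)
    return pruned


# Toy data (illustrative only): one output row, eight input features.
weight = [[0.5, -0.2, 0.3, 0.9, -0.2, 0.4, 0.05, -0.6]]
calib_inputs = [
    [1.0, 4.0, 0.0, 1.0, 2.0, 0.5, 3.0, 0.1],
    [1.0, 3.0, 0.0, 1.0, 2.0, 0.5, 4.0, 0.1],
]

scores = wanda_scores(weight, calib_inputs)
pruned = prune_n_of_m(weight, scores, n=2, m=4)
print(pruned)
```

In this toy case the first group keeps -0.2 rather than the larger-magnitude 0.5, because the second input feature is much more active in the calibration data. That is the difference between activation-aware scoring and plain magnitude pruning. Every group of four ends up with exactly two zeros, which is the pattern that 2:4 sparse hardware kernels expect.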