01 Automated generation of Python splitting scripts using standard ML libraries
02 712 GitHub stars
03 Implementation of stratified sampling to maintain class distributions
04 Automatic execution and creation of subset data files
05 Randomized data shuffling to eliminate selection bias
06 Support for custom training, validation, and testing ratios
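
The splitting behavior listed above (stratified sampling, shuffling, and custom train/validation/test ratios) can be illustrated with a minimal sketch. This is not the tool's generated script: the function name `stratified_split`, its parameters, and the sample labels are hypothetical, and the sketch uses only the Python standard library to keep the logic visible. In library form, scikit-learn's `train_test_split` with its `stratify` argument covers the same ground.

```python
import random
from collections import defaultdict


def stratified_split(labels, ratios=(0.7, 0.15, 0.15), seed=42):
    """Return (train, val, test) index lists that roughly preserve class proportions.

    Hypothetical helper for illustration; names and defaults are assumptions.
    """
    if abs(sum(ratios) - 1.0) > 1e-9:
        raise ValueError("ratios must sum to 1")
    rng = random.Random(seed)  # fixed seed makes the split reproducible

    # Group row indices by class label.
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)

    train, val, test = [], [], []
    for indices in by_class.values():
        rng.shuffle(indices)  # shuffle within each class to avoid ordering bias
        n = len(indices)
        n_train = round(n * ratios[0])
        n_val = round(n * ratios[1])
        train += indices[:n_train]
        val += indices[n_train:n_train + n_val]
        test += indices[n_train + n_val:]  # whatever remains goes to test

    # Shuffle each subset so classes are interleaved rather than grouped.
    for subset in (train, val, test):
        rng.shuffle(subset)
    return train, val, test


# Example: an imbalanced dataset with 80 "cat" rows and 20 "dog" rows.
labels = ["cat"] * 80 + ["dog"] * 20
train, val, test = stratified_split(labels, ratios=(0.7, 0.15, 0.15))
```

The returned index lists can then be used to select rows from the original dataset and write each subset to its own file. For very small classes the rounding makes the per-class counts approximate.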