01 Automatic lineage tracking for data, code, and execution environments
02 Schema validation and standardization for complex biological data types like AnnData and MuData
03 Advanced querying and filtering of datasets by metadata, features, and provenance
04 Standardized data curation using biological ontologies like genes, cell types, and diseases
05 2,066 GitHub stars
06 Integration with workflow managers like Nextflow and Snakemake for pipeline tracking