Discover Agent Skills for data science & ml. Browse 61 skills for Claude, ChatGPT & Codex.
Automates the retrieval of training and test data dependencies from S3 for local machine learning model development and backtesting.
Streamlines the configuration and validation of Hotvect algorithm training runs for Vowpal Wabbit.
Captures and stores failed Hotvect command invocations as executable shell scripts for seamless debugging and reproduction.
Develops production-grade Python scripts for scalable ETL pipelines and high-performance data processing systems.
Automates production-grade ETL pipelines and data orchestration using industry-standard tools like Airflow, dbt, and Prefect.
Builds, optimizes, and deploys production-grade neural networks using PyTorch, TensorFlow, and modern transformer architectures.
Architects and optimizes high-performance distributed data pipelines for petabyte-scale workloads using Apache Spark and modern table formats.
Provides foundational mathematical tools and statistical methods for data analysis, hypothesis testing, and machine learning architecture.
Enables local semantic search and document indexing for PDF, DOCX, and XLSX files directly within Claude Code.
Evaluates research rigor and scientific claims by assessing methodology, statistical validity, and potential biases using standardized frameworks.
Provides standardized C++20 implementations for high-performance numerical computing, matrix operations, and parallel I/O.
Streamlines the creation, management, and serving of scalable feature stores within the Databricks MLOps ecosystem.
Performs specialized biological validation for ChIP-seq data by calculating cross-correlation metrics and fraction of reads in peaks.
Optimizes machine learning workflows on Databricks by implementing structured MLflow experiment tracking and model governance patterns.
Optimizes trade execution using advanced algorithms like TWAP, VWAP, and Iceberg orders to minimize market impact and slippage.
Implements automated model monitoring, drift detection, and performance tracking for production machine learning systems on Databricks.
Build and manage declarative, self-healing data pipelines with built-in quality enforcement and automated lineage tracking.
Deploys and manages production-grade machine learning models on Databricks with support for A/B testing and auto-scaling.
Optimizes interaction sequences using information theory and active inference to maximize learning efficiency and information gain.
Generates structured plans to decompose complex AI include chains into modular, reusable components.
Evaluates research rigor, methodology, and statistical validity to perform critical analysis of scientific claims.
Optimizes document retrieval and semantic search workflows using RAG best practices, vector databases, and advanced chunking strategies.
Manipulates, analyzes, and visualizes phylogenetic trees and genomic data with the Environment for Tree Exploration (ETE) framework.
Deploys and manages cloud-based AI agent swarms using event-driven workflow automation and intelligent coordination.
Trains and deploys complex neural networks across distributed E2B sandbox environments directly within Claude.
Implements high-performance adaptive learning and experience replay for AI agents using AgentDB's ultra-fast vector storage.
Enables Claude to perform complex scientific research by providing access to over 600 bioinformatics, genomics, and cheminformatics tools.
Implements high-performance persistent memory and learning patterns for AI agents using AgentDB.
Processes and analyzes complex physiological signals including ECG, EEG, and EDA for psychophysiology research and medical data science.
Performs Gene Ontology and KEGG pathway enrichment analysis from genomic regions or gene lists with automated R-based visualizations.
Scroll for more results...