The ENA Database skill provides a specialized interface for interacting with the European Nucleotide Archive (ENA), a massive public repository for nucleotide sequence data. It allows users to programmatically query and download DNA/RNA sequences, raw reads (FASTQ), genome assemblies, and associated metadata through various REST APIs and FTP protocols. Ideal for bioinformatics and genomics pipelines, this skill streamlines data discovery across studies, samples, and experiments while managing technical details like rate limiting, complex API parameters, and hierarchical data structures.
Key Features
01Taxonomic data and lineage querying for specific organisms
020 GitHub stars
03Programmatic retrieval of raw sequencing reads and FASTQ files
04Advanced metadata search via the ENA Portal and Browser APIs
05Genome assembly discovery and bulk download guidance
06Cross-reference searching with external biological databases