01Configurable TTL settings for ephemeral (5m) and extended (1h) cache persistence.
02Standardized cost-tracking templates for calculating real-world savings.
03Automatic OpenAI prefix caching optimization and monitoring.
04Best-practice prompt ordering to ensure high cache hit rates.
05Native Claude cache breakpoint implementation with support for up to 4 breakpoints.
0669 GitHub stars