eval-harness
Official evaluation framework for AI agent sessions, implementing Evaluation-Driven Development (EDD) principles to ensure reliability.
Control Claude Code via MCP protocol for autonomous development. Features persistent sessions, agent teams, precise execution planning, and advanced tool management for complex coding tasks.
Creates detailed, step-by-step TDD implementation plans for software development tasks.
Technical SEO audit skill for crawlability, indexability, and Core Web Vitals analysis. Use it to audit webpages, validate schema, and fix technical performance issues.
Implement passwordless authentication in Go applications using MojoAuth OIDC Hosted Login Page.
A prototype skill for automating YouTube live chat moderation using pattern-based detection for spam, toxic content, and rate limiting, optimized for testing agent reliability before deployment.
Accelerate software delivery by shifting testing to the earliest development phases, using AI-driven requirements validation, TDD, and automated CI pipelines to reduce defect costs.
Implement a full Model Context Protocol (MCP) stack in Rails. Connect to external servers, expose your Rails app as an MCP server, or manage subprocess MCP containers via Docker with OAuth 2.1 PKCE support.
Create robust, scalable, and maintainable technical implementation plans for complex software projects.
Systematic Kubernetes troubleshooting, pod diagnostics, cluster health monitoring, and incident response playbooks.
Orchestrates complex multi-agent software development using a structured Royal Navy squadron metaphor, featuring mission planning, parallel task coordination, and rigorous audit logs.
SPARC methodology for multi-agent development: systematic Specification, Pseudocode, Architecture, Refinement, and Completion workflows via Claude Flow orchestration.