eval-harness
Official evaluation framework for AI agent sessions, implementing Evaluation-Driven Development (EDD) principles to ensure reliability.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
468 skills found
Official evaluation framework for AI agent sessions, implementing Evaluation-Driven Development (EDD) principles to ensure reliability.
Manage Go CLI i18n rules, locale file structures, env-based language detection, and string key naming conventions for the Skills-X ecosystem.
Implement robust backend error handling with custom classes, middleware, structured logging, and recovery patterns.
Manage isolated LlamaFarm development environments using git worktrees for parallel agent sessions and service testing.
Systematic debugging workflow for MCP servers and Microsoft Copilot Studio integrations, featuring common fix patterns and validation scripts.
Terminal-based Spotify playback and search controller for OpenClaw.
An AI-powered skill that automatically retrieves relevant project context from your RAG knowledge base for complex coding tasks.
Plan, implement, and execute user acceptance tests (UAT) and end-to-end scenarios to validate requirements against user-visible behavior.
Manage dlt data pipelines and Temporal workflows for the SignalRoom marketing platform. Sync sources like Everflow, Redtrack, and S3 to Postgres, check status, and debug ingestion.
Build and manage MCP servers using the FastMCP framework. Guide for creating tools, resources, prompts, Claude Desktop integration, and deployment with Python and TypeScript.
A structured PRD generator for vibe-coding MVPs. It guides you through defining product requirements, target audiences, and success metrics, ensuring a clear foundation for your development workflow.
Search and retrieve AI-generated documentation, architecture guides, and API references for 300+ popular GitHub repositories using DeepWiki and MCP.