eval-harness
Official evaluation framework for AI agent sessions, implementing Evaluation-Driven Development (EDD) principles to ensure reliability.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
550 skills found
Official evaluation framework for AI agent sessions, implementing Evaluation-Driven Development (EDD) principles to ensure reliability.
AI-powered video editing agent for talking head videos, featuring speech-to-text, disfluency detection, and browser-based review workflows.
Guide for creating and managing Sindri declarative YAML extensions, including capabilities for project-init, auth, lifecycle hooks, and MCP integration.
Interactive CLI-based issue management system for tracking, planning, and executing development tasks with full CRUD capabilities.
Pre-implementation confidence assessment tool for developers. Ensures 90%+ readiness via duplicate checks, architecture compliance, official docs verification, and root cause analysis.
Build and manage MCP servers using the FastMCP framework. Guide for creating tools, resources, prompts, Claude Desktop integration, and deployment with Python and TypeScript.
Master the EARS format to transform ambiguous feature ideas into precise, testable requirements, acceptance criteria, and edge case documentation.
Review, fix, and resolve GitHub PR review comments automatically.
A microworld operating system for LLM-based agent living memory, transforming filesystems into navigable rooms and code into habitable worlds.
Discord integration for automated messaging, channel management, and rich UI interactions using the OpenClaw agent.
A comprehensive moderation toolkit for Civitai, providing automated user management, strike systems, image review, content regulation, and CSAM reporting via tRPC API.
Enterprise-grade multi-agent swarm orchestration, event-driven workflow automation, and intelligent agent coordination for Claude Code.