eval-harness
Official evaluation framework for AI agent sessions, implementing Evaluation-Driven Development (EDD) principles to ensure reliability.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
136 skills found
Official evaluation framework for AI agent sessions, implementing Evaluation-Driven Development (EDD) principles to ensure reliability.
Automate GitLab repository management with this API-based tool. Perform file operations, branch management, and project tracking directly through your AI agent.
Generate AGENTS.md and AI configuration files (Cursor, Claude, Gemini, Copilot) for your project to streamline your vibe-coding workflow and maintain context across sessions.
Automated CI/CD incident response, failure analysis, and remediation for GitHub Actions pipelines. Resolves build and test failures with safety guardrails.
Autonomous agent for ticket-driven development, managing the full software lifecycle from polling providers to automated PR creation with ERNE standard validation.
Project bootstrap for Claude Code with safety guardrails, git workflow automation, project auditing, and structured multi-phase planning.
Perform comprehensive code reviews and generate QA test plans for Storyblok projects, ensuring quality, security, and adherence to best practices.
Enforces Sentry-style conventional commits, branch safety checks, and structured issue referencing for AI coding agents.
Interactive tool for generating Business, Model, Architecture, and Design (BMAD) planning documentation for feature development.
Automated PR lifecycle management: monitors conflicts, resolves CI failures, handles review feedback, and executes squash-merges for safe code integration.
Execute git commits with conventional commit message analysis, intelligent file staging, and automated semantic message generation based on code diffs.
Automated PR review agent for Schmock projects ensuring BDD coverage, code quality, TypeScript standards, and conventional commit adherence.