self-reviewer
Implements an autonomous, critical self-verification layer for AI agents to validate code quality, security, and requirement alignment before task completion.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
178 skills found
Implements an autonomous, critical self-verification layer for AI agents to validate code quality, security, and requirement alignment before task completion.
An autonomous AI agent loop that executes Claude Code repeatedly to build features from structured PRDs until completion.
Unit and integration test your Encore.ts backend applications using Vitest, including support for isolated test databases and service mocking.
Evaluate code generation models using BigCode Evaluation Harness. Benchmarks include HumanEval, MBPP, and MultiPL-E with pass@k metrics for multi-language coding models.
Framework for automated n8n integration testing including API contract validation, authentication flows, rate limit handling, and error scenario coverage.
Architects enterprise AI agents from structured specs, generating production-ready code, data flow diagrams, and platform-specific logic for ServiceNow, Salesforce, and Snowflake.
Apply context-driven testing principles to adapt testing strategies based on project goals, risks, and constraints rather than relying on universal best practices.
Validate test suite effectiveness and uncover weak assertions by introducing code mutations and measuring kill rates. Essential for proving tests genuinely catch bugs rather than just satisfying coverage metrics.
A framework for applying Test-Driven Development to process documentation, ensuring agent reliability by using pressure scenarios to identify and patch rationalization loopholes.
Interactive debugging workflow for Ruby test suites using the debug gem, featuring step execution, system state inspection, and root cause analysis.
A structured workflow for co-authoring documentation, technical specs, and proposals, guiding users through context gathering, collaborative refinement, and reader verification.
Maintain test suite health by automatically detecting orphaned tests, missing coverage, and implementation-coupled anti-patterns.