Verification & Quality Assurance
A robust verification and QA system for software agents featuring real-time truth scoring, automated code validation, and instant rollback capabilities to maintain high reliability.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
258 skills found
A robust verification and QA system for software agents featuring real-time truth scoring, automated code validation, and instant rollback capabilities to maintain high reliability.
AWS CloudFormation skill for infrastructure as code, automated stack management, template authoring, drift detection, and troubleshooting across AWS environments.
Operate the btca CLI for source-first code research. Manage git, local, and npm resources to ground AI answers in actual codebase context rather than outdated documentation.
Defense-in-depth protection for Claude Code. Manage security hooks to block dangerous commands, enforce file access controls, and protect sensitive paths across global or project-specific scopes.
Deploy isolated development containers with web-accessible VSCode, VNC, and automated app routing via Traefik or Cloudflare Tunnels.
A comprehensive guide and reference for building, orchestrating, and deploying AI agents using the Google Agent Development Kit (ADK).
Safe, protocol-driven Git operations for committing, pushing, and PR management using the GitHub CLI (gh).
Automates GitHub release creation by generating formatted changelogs from conventional commits and managing version bumps.
Perform rigorous code reviews for FastMCP projects, focusing on API design, dependency management, and codebase consistency.
Initiates automated reverse engineering by discovering codebase architecture, layers, and technology stacks to facilitate system modernization or documentation.
Generates structured Handoff Pack prompts for delegating scoped coding tasks to Gemini with clear instructions, acceptance criteria, and output requirements.
Official evaluation framework for AI agent sessions, implementing Evaluation-Driven Development (EDD) principles to ensure reliability.