Verification & Quality Assurance
A verification and QA system for software agents, featuring real-time truth scoring, automated code validation, and instant rollback to maintain high reliability.
Apply cognitive science frameworks for creative thinking to generate genuinely novel research directions in computer science and AI.
Automate B2C mobile app marketing with short-form video strategies for TikTok, Instagram, and YouTube. Includes content creation, scheduling via Post Bridge API, and performance analysis.
Apply effective software quality consultancy practices. Use when consulting on QA strategy, advising development teams, or establishing sustainable quality workflows.
Migrate your codebase, prompts, and API calls from Claude Sonnet 4.0/4.5 or Opus 4.1 to the advanced Opus 4.5 model with automated configuration adjustments.
Build systematic evaluation frameworks for AI agents using multi-dimensional rubrics, LLM-as-a-judge, and regression testing to measure performance, quality, and context engineering effectiveness.
AI-powered lead generation pipeline: intelligent lead scoring (0-100) and context-aware follow-up generation for sales, cold outreach, and CRM integration.
Break down complex development requests into sequenced, actionable tasks for multi-agent delegation in Claude Code environments.
Generate and edit images using Google's Nano Banana 2 via WaveSpeed AI. Supports text-to-image, natural language editing, multi-image composition, 4K resolution, and various aspect ratios.
Convert Figma designs to project-consistent UI code using TemPad Dev MCP for precise markup, styling, and token integration.
Manage screenpipe pipes (AI-driven automations) and integrations via CLI. Create, run, schedule, and debug local agents to automate tasks based on your computer activity.
Advanced context engineering system for orchestrating AI agents, memory management, and token optimization to improve long-term persistence and project intelligence.