eval
Evaluate Deca agent prompts and behavioral consistency through automated test runners, manual LLM judgment, and structured reporting.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
204 skills found
Evaluate Deca agent prompts and behavioral consistency through automated test runners, manual LLM judgment, and structured reporting.
Framework for automated n8n integration testing including API contract validation, authentication flows, rate limit handling, and error scenario coverage.
Collaborative PR review using a swarm of three specialized AI agents (Correctness, Health, UX) that discuss findings and reach consensus before posting a structured summary with inline comments.
Orchestrate cross-browser, cross-device, and responsive design testing using cloud providers like BrowserStack and Playwright to ensure consistent user experiences.
Optimize Apache Spark jobs with partitioning strategies, memory management, shuffle tuning, and data skew mitigation for high-performance data processing pipelines.
Connect to the Notion API to create, manage, and query pages, databases, and blocks for your AI-powered knowledge management.
Analyze Substrate/Polkadot runtimes and FRAME pallets for 7 critical vulnerabilities including arithmetic overflow, DoS, and improper origin checks.
Maintain and update the MassGen model registry, including backend capabilities, model metadata, pricing structures, and context window configurations for new and existing AI models.
Upstash Vector DB setup, semantic search, namespaces, and embedding models. Ideal for building high-performance vector search features in Next.js 16/Vercel projects.
Pragmatic AI-assisted coding standards focused on clean code, simplicity, and maintainability. Enforces best practices like SRP, DRY, and KISS to prevent over-engineering.
Create and test AI-ready MCP tools for any web application. Inject code, automate browser interactions, and turn websites into intelligent agents.
Focus testing effort on highest-risk areas using risk assessment and prioritization. Use when planning test strategy, allocating resources, or making coverage decisions.