eval
Evaluate Deca agent prompts and behavioral consistency through automated test runners, manual LLM judgment, and structured reporting.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
160 skills found
Evaluate Deca agent prompts and behavioral consistency through automated test runners, manual LLM judgment, and structured reporting.
Prevents AI hallucination and ensures evidence-based, verifiable outputs when analyzing code, reviewing technical documents, or providing recommendations.
Comprehensive guide and implementation framework for building, configuring, and deploying NexAU agents from scratch, including tools, prompts, and skills.
Create high-performance AI skills by reverse-engineering successful GitHub projects and proven open-source methodologies.
Structured parallel brainstorming agent for ideation and conceptual expansion. Uses multi-agent perspectives to evolve vague ideas into practical, actionable visions. Ideation only, not for task planning.
Anthropic Claude integration patterns: streaming, RAG with pgvector, tool use, model selection (Haiku/Sonnet/Opus), prompt caching, and cost management for AI-powered engineering.
A microworld operating system for LLM-based agent living memory, transforming filesystems into navigable rooms and code into habitable worlds.
Enterprise-grade multi-agent swarm orchestration, event-driven workflow automation, and intelligent agent coordination for Claude Code.
Intelligent GitHub release orchestration using AI swarms for automated versioning, multi-platform deployment, testing, and rollback management.
A collection of design patterns for the Langroid multi-agent framework, covering agent configuration, tool handling, task orchestration, and external integrations.
Build stateful AI agents on Cloudflare Workers using the Agents SDK. Features real-time WebSockets, persistent state management, scheduled background tasks, and native tool integration for production-ready deployments.
AI agent skill for Moltbot Arena, a real-time strategy programming game. Manage units, automate resource harvesting, coordinate structures, and execute tactical decisions via REST API.