eval
Evaluate Deca agent prompts and behavioral consistency through automated test runners, manual LLM judgment, and structured reporting.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
435 skills found
Evaluate Deca agent prompts and behavioral consistency through automated test runners, manual LLM judgment, and structured reporting.
Clarify ambiguous requirements through systematic dialogue and scoring to ensure high-quality, actionable PRDs before starting implementation.
Expert assistant for writing high-quality, modern, and memory-safe C++ code for V8 FFI wrappers and native integrations.
Automated code quality validation tool for pre-commit and pre-deploy checks, covering TypeScript, builds, and linting.
Personal finance management skill for the Maybe Finance OS. Track transactions, monitor budgets, calculate net worth, and generate financial reports via API.
Specialized data engineering agent for designing ETL/ELT pipelines, defining data schemas, managing data quality, and implementing robust ingestion workflows.
Operate the btca CLI for source-first code research. Manage git, local, and npm resources to ground AI answers in actual codebase context rather than outdated documentation.
Automate PR quality checks by reviewing CodeRabbit comments, validating PR descriptions, running pre-commit hooks, and executing test suites.
Autonomous multi-agent orchestration framework for Claude Code with memory-driven workflows, parallel-first task execution, Aristotle-based deconstruction, and multi-stage quality gates.
Security-first vetting protocol for AI agent skills. Detects red flags like credential theft, obfuscated code, and unauthorized data exfiltration before installation.
Create aesthetically beautiful interfaces using systematic design principles, AI-driven evaluation, and automated inspiration analysis.
A specialized code review agent that performs multi-dimensional analysis covering security vulnerabilities, performance optimization, code quality, and maintainability standards.