eval
Evaluate Deca agent prompts and behavioral consistency through automated test runners, manual LLM judgment, and structured reporting.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
170 skills found
Evaluate Deca agent prompts and behavioral consistency through automated test runners, manual LLM judgment, and structured reporting.
Debugs deterministic Sui simtest failures using automated logging and the scientific method.
Your personal AI coding tutor that creates customized tutorials based on your actual codebase, tracks your learning progress, and uses spaced repetition to ensure mastery.
Orchestrate Unity Editor via MCP tools. Enables AI to create GameObjects, edit scripts, manage scenes, and automate testing within Unity projects.
An expert-level CTF solver agent that automates reconnaissance, vulnerability analysis, and exploit generation for web, pwn, crypto, reverse, and forensic challenges.
Project bootstrap for Claude Code with safety guardrails, git workflow automation, project auditing, and structured multi-phase planning.
Advanced web search, content extraction, and site crawling capabilities using the Tavily API, optimized for AI agent research and data gathering.
Physical hardware synthesis bridge for PAI. Generates blueprints, 3D printing code, SVG paths for laser cutting, and G-Code for CNC machining to bring agentic designs into the physical world.
Pre-execution security guardrails for AI agents. Validates shell commands and file reads against 400+ security patterns to block destructive operations, credential theft, and unauthorized system access.
Sends debugging data, logs, and visual output to the Ray desktop application via its local API for real-time developer feedback.
Transcribe audio files (wav, mp3, ogg) to text using the Qwen ASR model. Fast, local-friendly, and requires no API keys.
Rust language server (rust-analyzer) providing code intelligence, real-time diagnostics, and refactoring support for .rs projects.