eval
Evaluate Deca agent prompts and behavioral consistency through automated test runners, manual LLM judgment, and structured reporting.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
216 skills found
Evaluate Deca agent prompts and behavioral consistency through automated test runners, manual LLM judgment, and structured reporting.
Manage Vibesafe units to scan, generate, test, and verify AI-written code with cryptographically-secure hash-locked checkpoints.
An end-to-end video processing pipeline that transforms raw recordings into transcripts, key insights, short clips, and polished articles.
Execute implementation plans in small, verifiable batches with pause-for-feedback checkpoints to prevent drift and ensure code quality.
Systematically extract insights, decisions, and constraints from research documents, technical papers, and architectural design files.
Validates Skill, Agent, and Command syntax using validate_skills.py, logs errors, and manages the automated QC workflow for agent development.
Draft competitive research proposals for NSF, NIH, DOE, DARPA, and NSTC. Master agency-specific criteria, budget preparation, visual schematics, and submission compliance.
Method-driven planning workflow that intelligently decomposes tasks into structured plan.md files using zen-mcp tools, adapting to user clarity and automation needs.
Drafts LaTeX research papers section-by-section using paper plans and research narratives with multi-model reviewer validation.
Generate or edit images using AI models like FLUX and Gemini. Ideal for photos, illustrations, concept art, and visual assets, excluding technical diagrams and schematics.
🛡️ GDPR & LGPD Privacy Guardian: Automated compliance scanner that detects PII exposure, insecure logging, and tracking violations in your codebase to prevent regulatory fines.
Epistemic safety analysis for JSON data in prompts to prevent LLM hallucinations and reasoning errors when handling incomplete or large-scale datasets.