eval
Evaluate Deca agent prompts and behavioral consistency through automated test runners, manual LLM judgment, and structured reporting.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
547 skills found
Evaluate Deca agent prompts and behavioral consistency through automated test runners, manual LLM judgment, and structured reporting.
Manually triggers a Hipocampus memory flush to persist current session context to raw logs and initiate the compaction tree process for long-term agent memory maintenance.
Analyze and audit Excel spreadsheets to understand logic, identify formula errors, detect risks, and generate documentation for legacy or unknown files.
Orchestrates complex multi-agent software development using a structured Royal Navy squadron metaphor, featuring mission planning, parallel task coordination, and rigorous audit logs.
Audit Packmind documentation by cross-referencing MDX files against the codebase to detect broken links, outdated CLI references, and missing coverage.
Standardized structure and templates for project documentation, including READMEs, API references, CLI guides, and directory layouts.
Manage database orchestration sessions, state snapshots, and system-level operations for the BAZINGA-DB core engine.
Git-aware logical undo at track, phase, or task level with confirmation gates.
Automate Convex static site hosting integration, managing upload APIs, HTTP routing, and deployment scripts for React, Vite, and Next.js applications.
Monitor US-Iran strike probability via real-time open-source signals including market odds, flight traffic, energy prices, and geopolitical alerts.
Safe, protocol-driven Git operations for committing, pushing, and PR management using the GitHub CLI (gh).
Remotely control tmux sessions for interactive CLIs by sending automated keystrokes and scraping pane output.