eval
Evaluate Deca agent prompts and behavioral consistency through automated test runners, manual LLM judgment, and structured reporting.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
329 skills found
Evaluate Deca agent prompts and behavioral consistency through automated test runners, manual LLM judgment, and structured reporting.
Framework for orchestrating long-running agentic tasks, evidence-based delivery, and automated QA gates following Simon Willison's iterative loop.
Equip autonomous agents with a funded wallet, identity, and paid API tools for search, generative AI media creation, messaging, and remote communication.
Generate structured, machine-readable notes for papers in a core research set to enable reliable synthesis and evidence-backed writing.
Analyze project codebases to generate architecture documentation, coding standards, and development practices for AI onboarding.
Capture and formalize software development ideas into structured design documents within the Hashbrown repository, including research and conceptual sketches.
Generate a promotion content pack from PRDs or READMEs, including LinkedIn posts, Reddit drafts, and Twitter threads.
A comprehensive framework for deep analysis of articles, papers, and long-form content using 10+ thinking models like SCQA, First Principles, and Systems Thinking.
Retrieve current, source-backed technical information using MCP tools to resolve queries about libraries, APIs, SDKs, and evolving tech ecosystems.
Systematic security assessment using STRIDE threat modeling, OWASP top 10 review, and secure coding practices for code, architecture, and infrastructure.
Transform raw ideas into structured conference talk scripts using narrative frameworks. Features slide-by-slide content planning, speaker notes, and timing guidance in a tool-agnostic format.
Extract tacit engineering knowledge through guided interviews and generate structured steerings for consistent project standards and conventions.