reflect-appworld-failure
Analyze AppWorld task failures to extract specific API patterns and generate actionable playbook bullets with concrete code examples.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
445 skills found
Analyze AppWorld task failures to extract specific API patterns and generate actionable playbook bullets with concrete code examples.
Generate structured, machine-readable notes for papers in a core research set to enable reliable synthesis and evidence-backed writing.
Development guide for self-improving MassGen via programmatic automation testing and visual UI/UX evaluation.
Evaluate code generation models using BigCode Evaluation Harness. Benchmarks include HumanEval, MBPP, and MultiPL-E with pass@k metrics for multi-language coding models.
Advanced Gemini-powered web search plugin with smart caching, subagent context isolation, and automated query optimization.
Expert assistant for building modern Next.js 13+ applications using the App Router, including Server Components, nested layouts, streaming with Suspense, and advanced data fetching patterns.
Edit and modify PDF documents using natural-language instructions via the nano-pdf command-line interface.
Capture and formalize software development ideas into structured design documents within the Hashbrown repository, including research and conceptual sketches.
Package entire code repositories into single, AI-optimized files. Ideal for providing codebase context to LLMs like Claude, ChatGPT, and Gemini for analysis, security audits, and bug investigations.
Automated global intelligence aggregator for market, geopolitical, and AI news. Features RSS feed integration, real-time alert systems for critical events, and structured report generation with intelligence inference.
Toolkit for testing local web applications using Playwright, featuring server lifecycle management, automated DOM inspection, and browser automation workflows.
Structured parallel brainstorming agent for ideation and conceptual expansion. Uses multi-agent perspectives to evolve vague ideas into practical, actionable visions. Ideation only, not for task planning.