reflect-appworld-failure
Analyze AppWorld task failures to extract specific API patterns and generate actionable playbook bullets with concrete code examples.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
482 skills found
Analyze AppWorld task failures to extract specific API patterns and generate actionable playbook bullets with concrete code examples.
Persistent task memory and workflow synchronization for Claude Code using Beads, enabling multi-session project management and context preservation.
Intelligent RAG-based gateway that routes coding tasks to specialized Swift/iOS expertise without context window bloat. Uses MCP to retrieve precise patterns from 100+ indexed skills.
Automated pipeline to download, split, and deeply analyze academic PDFs in structured batches to avoid context window limits and ensure high-quality comprehension.
Prevents AI hallucination and ensures evidence-based, verifiable outputs when analyzing code, reviewing technical documents, or providing recommendations.
Semantic Go code navigation and analysis tool using the Language Server Protocol (LSP) for accurate, high-performance project intelligence.
Query Microsoft 365 Copilot for workplace intelligence—emails, meetings, documents, and team communication—to ground your AI agent in organizational context.
Profiles application performance using k6, Artillery, or JMeter to measure latency, throughput, and error rates. Ideal for planning load, stress, and soak tests to identify bottlenecks.
Semantic code analysis guide for Serena MCP. Automatically prioritizes Serena tools for symbols, references, and code memory to optimize context and efficiency.
Access Y Combinator’s library of 443+ startup resources for expert advice on fundraising, co-founders, product development, growth, and scaling your business.
Optimize Node.js performance via Redis caching, clustering, profiling, and monitoring to build fast, scalable, and efficient backend services.
Multi-perspective AI consultation for technical architecture, complex refactoring, and structured debugging.