reflect-appworld-failure
Analyze AppWorld task failures to extract specific API patterns and generate actionable playbook bullets with concrete code examples.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
443 skills found
Analyze AppWorld task failures to extract specific API patterns and generate actionable playbook bullets with concrete code examples.
A framework for creating, iterating, and managing reusable evolving workflows that combine structured documentation with custom Python automation tools.
A framework for managing the end-to-end LLM project lifecycle, from evaluating task-model fit and pipeline architecture design to implementing structured output parsing and agent-assisted development.
Generate structured development plans, checklists, and file contexts compatible with the IntelliJ coding-aider plugin.
A framework for applying Test-Driven Development to process documentation, ensuring agent reliability by using pressure scenarios to identify and patch rationalization loopholes.
Validates Skill, Agent, and Command syntax using validate_skills.py, logs errors, and manages the automated QC workflow for agent development.
Download videos from YouTube, Twitter, Bilibili, and thousands of sites using yt-dlp. Supports audio extraction, subtitle downloading, and format selection.
Build $50k-grade frontend interfaces with production-ready code, professional typography, and high-fidelity image integration.
Manage your Anki flashcards effortlessly via the AnkiConnect REST API. Create, update, search, and organize decks, notes, and cards directly through your AI agent.
Intelligent unit and integration test generation powered by Minion framework, featuring business logic validation, boundary testing, and Vitest integration.
Automate Python scripting and Gemini-powered image generation using uv. Ideal for creating art, editing images, and running ad-hoc scripts.
Generate comprehensive instructions for AI agents to operate the Taskery local Kanban board, including CLI, API, and concurrency management.