eval-harness
Official evaluation framework for AI agent sessions, implementing Evaluation-Driven Development (EDD) principles to ensure reliability.
Run Semgrep static analysis scans on codebases using parallel subagents, multi-language detection, and Pro-enabled cross-file taint tracking.
A toolkit for developing and bundling complex, multi-component React/TypeScript web artifacts using Vite, Tailwind CSS, and shadcn/ui.
Enforces structured self-assessment checkpoints to validate approach, mitigate risks, and ensure quality before, during, and after task execution.
Build $50k-grade frontend interfaces with production-ready code, professional typography, and high-fidelity image integration.
Automate your entire Git lifecycle from commit and PR creation to CI monitoring and branch merging, enforcing conventional commits throughout.
Manage OpenClaw's built-in Chrome browser and chrome-devtools-mcp integration for robust browser automation using the Model Context Protocol.
Visual web workspace for roadmap management, providing interactive kanban boards and graph-based dependency views for task planning and project progress tracking.
Development guide for creating custom nodes in FlowGram.ai workflows, supporting both auto-generated simple forms and complex custom UI components.
Automates invoice and receipt organization for tax preparation by parsing files, extracting financial data, renaming documents, and filing them into a structured directory system.
Generate optimized SQL queries from natural language. Supports BigQuery, PostgreSQL, MySQL, and Snowflake. Analyze database schemas, interpret business requirements, and output ready-to-run queries with explanations.
A unified interface for integrating and managing LLM chat providers like OpenAI, Anthropic, Google, Azure, and Bedrock within LangChain applications.