evaluation
Build systematic evaluation frameworks for AI agents using multi-dimensional rubrics, LLM-as-a-judge, and regression testing to measure performance, quality, and context engineering effectiveness.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
460 skills found
Build systematic evaluation frameworks for AI agents using multi-dimensional rubrics, LLM-as-a-judge, and regression testing to measure performance, quality, and context engineering effectiveness.
Frontend coding conventions for Preact and Tailwind. Use for web UI components in cluster applications.
AI-powered lead generation pipeline: intelligent lead scoring (0-100) and context-aware follow-up generation for sales, cold outreach, and CRM integration.
Easily configure and add Model Context Protocol (MCP) servers to various AI coding clients like Cursor, Claude, VS Code, and more using an interactive or automated command-line interface.
Generates a random lucky number between 0 and 9999 for games, decision-making, or entertainment.
Find, connect, and use over 100,000 MCP tools and skills via the Smithery CLI to integrate external services, manage agent workspaces, and automate workflows.
Interactive CLI-based issue management system for tracking, planning, and executing development tasks with full CRUD capabilities.
Build AI agents with the OpenAI Agents SDK for Python. Supports multi-agent handoffs, function tools, stateful sessions, streaming, and Azure OpenAI integration via LiteLLM.
Convert Figma designs to project-consistent UI code using TemPad Dev MCP for precise markup, styling, and token integration.
Orchestrates complex multi-agent software development using a structured Royal Navy squadron metaphor, featuring mission planning, parallel task coordination, and rigorous audit logs.
Build distinctive, high-end React Native Expo interfaces using liquid glass design and iOS Human Interface Guidelines for production-grade mobile apps.
Epsimo AI platform SDK and CLI for building agents with persistent state, Virtual Database, streaming conversations, and a React UI kit.