eval-harness
Official evaluation framework for AI agent sessions, implementing Evaluation-Driven Development (EDD) principles to ensure reliability.
Equip autonomous agents with a funded wallet, identity, and paid API tools for search, generative AI media creation, messaging, and remote communication.
High-performance document intelligence library for extracting text, tables, code, and metadata from 91+ file formats, with OCR and LLM-ready output.
A comprehensive library of 305+ modular instruction packages, Python CLI tools, and agent workflows designed to extend the capabilities of AI coding assistants like Claude Code, Cursor, Aider, and Gemini CLI.
Security advisory monitoring for NanoClaw WhatsApp bots, providing vulnerability scanning, skill safety checks, and integrity protection through MCP tools.
Guidance and operational tips for identifying, reviewing, and managing pull requests created by the GitHub Copilot coding agent within your repository.
Verify the novelty of a research idea against recent literature. Use when the user says '查新' (Chinese for 'novelty check'), 'novelty check', or needs to confirm whether a method is original.
A structured repository of Agent Skills for context engineering, multi-agent architectures, and production-grade agent system optimization.
Automated inbound and outbound AI email workflow for 0 Finance, enabling agents to manage invoices, bank transfers, and financial conversations.
Search and download 3D models from Printables with automated manifest generation for 3D printing and prototyping workflows.
Enforces structured self-assessment checkpoints to validate approach, mitigate risks, and ensure quality before, during, and after task execution.
Self-modify your Milady agent by managing plugins. Edit code, rebuild, and restart the runtime to develop new capabilities or improve agent workflows locally.