trulens-evaluation-workflow
A systematic workflow to instrument, evaluate, and monitor LLM applications using TruLens, supporting frameworks like LangChain, LangGraph, and LlamaIndex.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
394 skills found
A systematic workflow to instrument, evaluate, and monitor LLM applications using TruLens, supporting frameworks like LangChain, LangGraph, and LlamaIndex.
Generate consistent, Conventional Commits-compliant messages directly from your staged git diffs.
Automated Vitest management skill: handles test execution, coverage reporting, failure diagnosis, and configuration management for TypeScript/JavaScript projects.
Expert consultant for designing and building high-quality, consistent AI agent skills. Guides you through discovery, architecture, and creation phases to ensure reliable, composable, and efficient skill delivery.
Adapt existing skills to your unique workflow, or create new ones for repetitive, time-consuming tasks.
Normalizes testing defect logs by correcting typos, abbreviations, and ambiguous descriptions based on product-specific codebooks and station validation.
Epistemic safety analysis for JSON data in prompts to prevent LLM hallucinations and reasoning errors when handling incomplete or large-scale datasets.
Persistent state management and workflow analytics using DuckDB for task dependency tracking, historical metrics, and context checkpointing.
Search the web for real-time data and research using the Turing Tavily proxy. Use for up-to-date information, current events, and web-based research tasks.
Self-healing Rust verification loop that automates test execution, clippy linting, and formatting checks.
Automates Moonwell protocol governance proposal lifecycle, from creation and verification to deployment and testing.
Create and manage Claude Code skills using Anthropic best practices: triggers, hooks, and progressive disclosure.