trulens-evaluation-workflow
A systematic workflow to instrument, evaluate, and monitor LLM applications using TruLens, supporting frameworks like LangChain, LangGraph, and LlamaIndex.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
491 skills found
A systematic workflow to instrument, evaluate, and monitor LLM applications using TruLens, supporting frameworks like LangChain, LangGraph, and LlamaIndex.
A versatile data analysis assistant for loading datasets, performing statistical calculations, visualizing trends, and generating professional summary reports.
An MCP server enabling agents to edit, manage, and compile Arduino IDE 2.0 sketches, including source code manipulation and automated build capabilities via arduino-cli.
Unified Python CLI for Tavily AI operations including web search, URL extraction, site crawling, link mapping, and automated deep research reports.
Advanced web search and reasoning tool for OpenClaw agents. Features citation-heavy synthesis, multi-step reasoning, and live internet access via OpenRouter.
Expert skill for Next.js Server Actions, covering form handling, data mutations, revalidation, and optimistic UI updates in the App Router.
AI-powered browser automation server for web interaction, data extraction, and research using the Model Context Protocol.
An end-to-end video processing pipeline that transforms raw recordings into transcripts, key insights, short clips, and polished articles.
Generate finite-difference stencils, select optimal numerical schemes for PDEs/ODEs, and perform truncation error analysis to improve simulation accuracy.
Standardized skill for Claude Code agents to dynamically query OpenRouter model recommendations and metadata via the Claudish CLI.
Production-ready Scrum Master assistant for sprint management, capacity planning, and real-time team analytics.
Autonomous improvement loop for codebase optimization. Automatically modifies, measures, and iterates on code based on a specific goal and mechanical metric.