trulens-evaluation-workflow
A systematic workflow to instrument, evaluate, and monitor LLM applications using TruLens, supporting frameworks like LangChain, LangGraph, and LlamaIndex.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
383 skills found
A systematic workflow to instrument, evaluate, and monitor LLM applications using TruLens, supporting frameworks like LangChain, LangGraph, and LlamaIndex.
Unified API for LLM function calling and tool use across OpenAI, Anthropic, Google, and Ollama with standardized schema definitions and execution patterns.
Secure, isolated cloud sandbox environments for executing AI-generated code, running multi-language scripts, managing file systems, and integrating tools via the E2B MCP gateway.
Persistent state management and workflow analytics using DuckDB for task dependency tracking, historical metrics, and context checkpointing.
Unified AI gateway for 100+ LLMs with OpenAI-compatible API, model fallbacks, load balancing, and enterprise-grade tools.
A command-line tool for managing, building, and deploying Agent Skills as OCI artifacts within the Agent Skills ecosystem.
Fetch and aggregate latest Rust community news, including official blog updates, ecosystem developments, and Rust Foundation reports.
Free AI-powered web search via Exa MCP. Includes deep research, company/people lookup, and code context without API keys.
Dynamic meta-router for managing and orchestrating multi-domain AI coding agent skills across plugins and projects.
Automated PR lifecycle management: monitors conflicts, resolves CI failures, handles review feedback, and executes squash-merges for safe code integration.
Production-grade testing strategy implementing feature flags, canary releases, synthetic monitoring, and chaos engineering for continuous reliability in live environments.
Seamlessly toggle between live and mocked external dependencies using the Model Context Protocol (MCP) for autonomous development environments.