trulens-evaluation-workflow
A systematic workflow to instrument, evaluate, and monitor LLM applications using TruLens, supporting frameworks like LangChain, LangGraph, and LlamaIndex.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
244 skills found
A systematic workflow to instrument, evaluate, and monitor LLM applications using TruLens, supporting frameworks like LangChain, LangGraph, and LlamaIndex.
Expert guidance for Claude Messages API: structured outputs, prompt caching, tool use, and migration from deprecated Claude 3.x models to 4.5. Prevents common API errors.
Real-time observability dashboard for PAI multi-agent activity, featuring live WebSocket streaming, session tracing, and agent workflow debugging.
Connect your AI agent to the Hugging Face Hub via MCP. Search models, datasets, and papers, manage repos, run cloud compute jobs, and invoke Gradio Spaces as functional AI tools.
Automate Android device operations using AI AutoGLM Phone Agent. Enables natural language control for app testing, data collection, and UI interactions like tapping, scrolling, and inputting text.
Build production-grade RAG systems using vector databases, semantic search, and LangGraph to ground LLMs in external knowledge.
Create, refine, and optimize high-quality YAML prompts for AI assistants using structure guidelines, template patterns, and quality standards.
An AI-powered TestOps platform and MCP server providing automated failure analysis, RCA matching, and intelligent test orchestration for CI/CD pipelines.
Expert CLI guides for AI agents, featuring senior engineer workflows, safety guardrails, and operational patterns for cloud, IaC, containers, databases, and dev tools.
An automated memory middleware for AI agents, implementing a Retrieve-Respond-Save loop to maintain long-term persistent context across conversations.
Cross-agent interaction skill via ANP protocol. Use decentralized identity (DID) to discover and invoke remote agents like maps, booking, and logistics services across the ANP network.
Orchestrates complex programming tasks by analyzing available skills, generating structured execution plans, and managing manual or delegated multi-step workflows.