trulens-evaluation-workflow
A systematic workflow to instrument, evaluate, and monitor LLM applications using TruLens, supporting frameworks like LangChain, LangGraph, and LlamaIndex.
Automated WeChat article writing workflow including web research, viral title generation, drafting, and professional layout optimization.
Evaluate Deca agent prompts and behavioral consistency through automated test runners, manual LLM judgment, and structured reporting.
Automates Moonwell protocol governance proposal lifecycle, from creation and verification to deployment and testing.
Evaluate scientific claims and research methodology for rigor, bias, and validity. Use evidence-based frameworks like GRADE and Cochrane to analyze experiments, protocols, and study conclusions.
Completes development branches by verifying tests, managing merge or PR workflows, and cleaning up worktrees to ensure a consistent repository state.
An autonomous AI agent loop that executes Claude Code repeatedly to build features from structured PRDs until completion.
Meta Marketing CLI for Graph API automation, managing ad campaigns, insights reporting, and Instagram publishing with fail-closed security.
Analyze search engine results pages (SERPs) to classify user intent, identify feature opportunities, and conduct competitive intelligence for content strategy.
Automate Adobe After Effects tasks using the Model Context Protocol. Manage compositions, layers, keyframes, effects, and expressions for motion graphics, title cards, and logo reveals.
Generate professional Product Requirements Documents (PRD) and structure features for autonomous development cycles.
Validates cross-artifact consistency (spec, plan, tasks) and detects breaking changes (API, DB, UI) during software feature development.