trulens-evaluation-workflow
A systematic workflow to instrument, evaluate, and monitor LLM applications using TruLens, supporting frameworks like LangChain, LangGraph, and LlamaIndex.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
260 skills found
A systematic workflow to instrument, evaluate, and monitor LLM applications using TruLens, supporting frameworks like LangChain, LangGraph, and LlamaIndex.
Automated quality gate using 5 parallel AI agents to review code changes for correctness, style, and consistency.
Epsimo AI platform SDK and CLI for building agents with persistent state, Virtual Database, streaming conversations, and a React UI kit.
Implementation patterns for MERIDIAN autonomous AI agents using Claude API, including BaseAgent lifecycle, structured tool use, token budget enforcement, and cron scheduling.
A rigorous TDD workflow agent that enforces test-first development, ensuring 80%+ code coverage across unit, integration, and E2E tests for features, bug fixes, and refactoring.
An AI-driven framework for crafting bespoke, authentic portfolio websites from scratch. Guides agents through research, design, and code implementation to build unique developer and professional sites.
Focus testing effort on highest-risk areas using risk assessment and prioritization. Use when planning test strategy, allocating resources, or making coverage decisions.
Generate comprehensive instructions for AI agents to operate the Taskery local Kanban board, including CLI, API, and concurrency management.
Official evaluation framework for AI agent sessions, implementing Evaluation-Driven Development (EDD) principles to ensure reliability.
Comprehensive social media campaign analyzer providing performance tracking, ROI calculations, audience insights, and actionable marketing optimization recommendations.
Master professional TDD with the London (mockist) and Chicago (classicist) schools. Automate test-first workflows, style selection, and refactoring with AI agents.
Generate professional multi-platform ad campaigns from a URL. Get ad copy, audience targeting, creative specs, and budget strategies ready for media buying.