trulens-evaluation-workflow
A systematic workflow to instrument, evaluate, and monitor LLM applications using TruLens, supporting frameworks like LangChain, LangGraph, and LlamaIndex.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
556 skills found
A systematic workflow to instrument, evaluate, and monitor LLM applications using TruLens, supporting frameworks like LangChain, LangGraph, and LlamaIndex.
A suite of professional tools for auditing, evaluating, chunking, and scaffolding production-ready RAG pipelines within Claude Code.
Design comprehensive product metric dashboards, define KPIs, and establish monitoring plans with data-driven visualization, alert thresholds, and framework integration.
Expert-level guidance for ffuf web fuzzing, enabling automated discovery of hidden directories, files, parameters, and vulnerabilities during penetration testing.
Automated OpenClaw repository maintainer: triage, label, and validate PRs/issues using gitcrawl and GitHub CLI.
Systematic debugging skill to trace errors backward through call stacks, identify original triggers, and implement layered defenses instead of patching symptoms.
Standardize, validate, and manage Netresearch AI agent skill repositories with automated structure enforcement, distribution workflows, and licensing compliance tools.
Full-stack application orchestrator that analyzes natural language requests to determine tech stacks, scaffold projects, and coordinate specialized development agents.
Automated quality gate using 5 parallel AI agents to review code changes for correctness, style, and consistency.
Advanced visual regression testing with pixel-perfect and AI-powered diff analysis, cross-browser validation, and responsive design checks to prevent UI regressions in CI/CD pipelines.
Advanced TypeScript and React development assistant for modern web applications. Expert in component architecture, state management, Vitest unit testing, Playwright E2E automation, and efficient TypeScript configuration.
Generate triage reports and analyze feature area health for the Windows App SDK repository. Identify high-priority issues, triage backlogs, and team focus areas.