trulens-evaluation-workflow
A systematic workflow to instrument, evaluate, and monitor LLM applications using TruLens, supporting frameworks like LangChain, LangGraph, and LlamaIndex.
Find, connect, and use over 100,000 MCP tools and skills via the Smithery CLI to integrate external services, manage agent workspaces, and automate workflows.
Production-grade observability stack featuring Prometheus metrics, Grafana dashboards, PromQL queries, alerting rules, and AI-powered anomaly detection for cloud-native applications.
Manage Jenkins CI/CD pipelines via REST API. Trigger builds, monitor job status, view console logs, and manage nodes and queues directly from your terminal or AI agent.
A comprehensive guide for using Bun as a high-performance alternative to Node.js. Supports project initialization, dependency management, script execution, and migration checklists.
A specification-driven workflow management system for structured development lifecycle management, covering proposal, planning, implementation, and archival phases.
A microworld operating system that provides living memory for LLM-based agents, transforming filesystems into navigable rooms and code into habitable worlds.
Interactive terminal UI toolkit for Claude Code. Spawn and control calendar, document, and flight booking interfaces directly within tmux panes.
An autonomous AI-powered task management system with Kanban boards, git worktree isolation, and pluggable executors like Claude Code, Gemini, and OpenAI Codex.
Gate 2 development cycle skill that validates observability implementation, including structured logging, OpenTelemetry tracing, and instrumentation coverage, without modifying code.
Guide for creating and managing Sindri declarative YAML extensions, including capabilities for project-init, auth, lifecycle hooks, and MCP integration.
Search, analyze, and audit GeminiClaw session logs and memory. Use it to investigate past interactions, track token usage, debug tool calls, and monitor agent performance.