trulens-evaluation-workflow
A systematic workflow to instrument, evaluate, and monitor LLM applications using TruLens, supporting frameworks like LangChain, LangGraph, and LlamaIndex.
Run, debug, and manage DBHub tests, including unit tests, integration tests with Testcontainers, and database-specific suites. Useful for verifying code changes and troubleshooting database connector issues.
Repository implementation guide for local-skills-mcp. Provides technical documentation on MCP tool handlers, skill loading, aggregation logic, and project structure for developers.
Command-line interface for controlling Bluesound and NAD audio players, enabling multi-room playback, grouping, and volume management.
Manages the complete plugin lifecycle for JUCE development: install, uninstall, reset, and destroy. Handles system folder deployment, cache management, and safe, version-controlled removal for audio developers.
Automates the creation of isolated git worktree environments for parallel feature development and environment setup.
A professional framework for conducting network penetration testing, including automated reconnaissance, vulnerability scanning, and exploitation workflows.
Plan mode on steroids. Pushes engineers to think with a product mindset before building, using structured intake and concrete technical options.
Debugging guide for AReaL distributed training issues, including hangs, NCCL errors, OOM, and numerical consistency in FSDP2/TP/CP/EP.
Manage project dependencies with mise: add, configure, and troubleshoot tool versions, PATH activation, and config files.
Automates research resource preparation by loading instances, searching GitHub for codebases, building dataset descriptions, and downloading arXiv papers.
Foundry development guide for CMTAT RuleEngine contracts, including testing, deployment scripts, and project-specific Solidity patterns.