trulens-evaluation-workflow
A systematic workflow to instrument, evaluate, and monitor LLM applications using TruLens, supporting frameworks like LangChain, LangGraph, and LlamaIndex.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
534 skills found
A systematic workflow to instrument, evaluate, and monitor LLM applications using TruLens, supporting frameworks like LangChain, LangGraph, and LlamaIndex.
Cascading goal tracking system connecting 3-year vision to daily tasks. Automates progress calculation, stalled goal detection, and project-to-goal alignment for Obsidian vaults.
Implement secure session-based authentication in FastAPI with Argon2 hashing, database-backed sessions, and OAuth2 provider integration.
Expert technical support for the Litestream disaster recovery tool, covering WAL monitoring, LTX replication, cloud storage backends, and SQLite page management.
Token-efficient codebase navigation through intelligent symbol indexing, domain chunking, and architectural layer filtering. Reduce token usage by 60-95% when exploring or developing complex systems.
Stream-JSON chaining for multi-agent pipelines, data transformation, and sequential workflows within the Ruflo/Claude Flow ecosystem.
Persistent state management and workflow analytics using DuckDB for task dependency tracking, historical metrics, and context checkpointing.
Run Semgrep static analysis scans on codebases using parallel subagents, multi-language detection, and Pro-enabled cross-file taint tracking.
Expert development guide for the Jean Claude orchestration framework. Use for source code changes, architecture, testing, and debugging.
Efficiently extract, filter, and transform specific fields from JSON files using jq, saving up to 95% of context window usage compared to reading full files.
Search the live web using Baidu AI Search Engine (BDSE) for real-time information, documentation, and research topics.
Executes a rigorous, multi-phase Fagan Inspection to systematically resolve persistent, stubborn bugs and complex code interactions.