trulens-evaluation-workflow
A systematic workflow to instrument, evaluate, and monitor LLM applications using TruLens, supporting frameworks like LangChain, LangGraph, and LlamaIndex.
Multi-source research tool for customer inquiries, bug investigations, and account history synthesis with source attribution and confidence scoring.
A Pomodoro focus timer that tracks work sessions in a local SQLite database to provide productivity analytics and personalized performance insights over time.
Convert diverse file formats like PDFs, Office docs, images, audio, and web content into clean Markdown, specifically optimized for LLM ingestion, RAG pipelines, and automated text analysis workflows.
Conduct strategic competitive analysis to map market landscapes, identify direct competitors, synthesize strengths and weaknesses, and uncover differentiation opportunities.
A robust verification and QA system for software agents featuring real-time truth scoring, automated code validation, and instant rollback capabilities to maintain high reliability.
Identify, categorize, and troubleshoot flaky tests by analyzing CI history, execution patterns, and code structure to improve test suite reliability.
Intelligent pattern selection for Fabric CLI, automatically choosing from 242+ specialized prompts for threat modeling, data analysis, summarization, and content creation.
Classify and group meteorological and environmental variables into specific driver categories for consistent attribution analysis and environmental modeling.
Guidance and operational tips for identifying, reviewing, and managing pull requests created by the GitHub Copilot coding agent within your repository.
Unified content extraction and action planning engine. Automatically processes URLs (YouTube, articles, PDFs) into actionable plans.
Systematic performance engineering: baseline measurement, profiling, bottleneck diagnosis, and evidence-based optimization guidance for high-performance applications.