trulens-evaluation-workflow
A systematic workflow to instrument, evaluate, and monitor LLM applications using TruLens, supporting frameworks like LangChain, LangGraph, and LlamaIndex.
A comprehensive configuration suite for Claude Code, featuring production-grade agents, skills, hooks, and automated workflows optimized for high-intensity development.
Analyze Claude Code session history to identify inefficiencies, optimize token usage, and suggest workflow improvements.
AI-powered browser automation server for web interaction, data extraction, and research using the Model Context Protocol.
Analyze AppWorld task failures to extract specific API patterns and generate actionable playbook bullets with concrete code examples.
Xcode 26 expert for Liquid Glass, Foundation Models, and Apple Intelligence framework updates across SwiftUI, UIKit, AppKit, and more.
Standardizes the process of creating and maintaining reusable Claude Code skills for packaging developer workflows and domain expertise.
Implement, review, or improve SwiftUI features using the iOS 26+ Liquid Glass API for modern, performance-aware UI design.
Execute implementation plans in small, verifiable batches with pause-for-feedback checkpoints to prevent drift and ensure code quality.
Preserve successful Python code executions as reusable tools within the gentools package structure, utilizing Pydantic models for structured output and type-safe interfaces.
A framework for managing the end-to-end LLM project lifecycle, from evaluating task-model fit and pipeline architecture design to implementing structured output parsing and agent-assisted development.
Build production-grade RAG systems using vector databases, semantic search, and LangGraph to ground LLMs in external knowledge.