trulens-evaluation-workflow
A systematic workflow to instrument, evaluate, and monitor LLM applications using TruLens, supporting frameworks like LangChain, LangGraph, and LlamaIndex.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
566 skills found
A systematic workflow to instrument, evaluate, and monitor LLM applications using TruLens, supporting frameworks like LangChain, LangGraph, and LlamaIndex.
A template skill for creating project-specific AI agent guidelines, defining architecture, file structures, and code patterns for deterministic development.
Analyze UI/UX quality against 4 authoritative standards (NNg, Laws of UX, Apple HIG, WCAG) to receive actionable design and accessibility improvements for mobile and web components.
Capture and formalize software development ideas into structured design documents within the Hashbrown repository, including research and conceptual sketches.
Build targeted prospect lists by analyzing public LinkedIn profiles and business data to identify decision-makers, track career moves, and enrich leads for outreach.
The final execution agent for the vibe-coding workflow. Builds your MVP incrementally by following the AGENTS.md master plan, managing session continuity, and verifying each feature via testing.
Automated session cleanup and documentation tool. Proactively updates CLAUDE.md, detects automation patterns, extracts insights, and organizes pending tasks.
GitHub workflow assistant with integrated git and gh CLI support for managing repositories, branches, pull requests, and issues.
Guidance and operational tips for identifying, reviewing, and managing pull requests created by the GitHub Copilot coding agent within your repository.
Manage isolated LlamaFarm development environments using git worktrees for parallel agent sessions and service testing.
Comprehensive health assessment tool for Continuous Claude components including skills, agents, hooks, and memory systems.
An automated meta-learning skill that improves agent workflows by capturing patterns, failures, and shortcuts after each task execution.