trulens-evaluation-workflow
A systematic workflow to instrument, evaluate, and monitor LLM applications using TruLens, supporting frameworks like LangChain, LangGraph, and LlamaIndex.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
309 skills found
A systematic workflow to instrument, evaluate, and monitor LLM applications using TruLens, supporting frameworks like LangChain, LangGraph, and LlamaIndex.
Pre-execution security guardrails for AI agents. Validates shell commands and file reads against 400+ security patterns to block destructive operations, credential theft, and unauthorized system access.
Extracts mathematical content like definitions, theorems, and proofs from documents (PDF, MD, TEX, TXT) using AI-based cleaning and conversion.
An end-to-end video processing pipeline that transforms raw recordings into transcripts, key insights, short clips, and polished articles.
Fetch, index, and search developer documentation from GitHub and websites to provide AI agents with accurate, grounded, and version-specific code context.
Conduct systematic literature reviews across PubMed, arXiv, and Semantic Scholar with AI-driven synthesis, verified citations, and mandatory schematic visualization.
Evidence-first literature collector for automated research pipelines. Scales paper pools to 1200+ with metadata normalization, provenance tracking, and multi-source ingestion.
Transforms content to match specific voice profiles, tones, or styles using configurable YAML templates for consistent brand and narrative output.
Perform internet searches using the Zhipu AI web search API to retrieve real-time information, news, and current data.
Provides targeted, concise English language editing and stylistic improvements for text without performing full rewrites.
Morph WarpGrep and Fast Apply tools for high-speed agentic code search, deep logic analysis, and efficient AI-driven code editing.
Create, alter, and validate Snowflake semantic views using the Snowflake CLI.