claude-rag-skills
A suite of professional tools for auditing, evaluating, chunking, and scaffolding production-ready RAG pipelines within Claude Code.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
150 skills found
Guides agent memory system implementation, compares frameworks (Mem0, Zep, Letta, LangMem, Cognee), and designs persistence architectures for cross-session knowledge retention.
Guidelines for curating high-quality datasets for LLM post-training (SFT/DPO/RLHF), covering data formats, quality filtering, and collection strategies.
Prevents AI hallucination and ensures evidence-based, verifiable outputs when analyzing code, reviewing technical documents, or providing recommendations.
Evaluate code generation models using the BigCode Evaluation Harness. Benchmarks include HumanEval, MBPP, and MultiPL-E, with pass@k metrics for multi-language coding models.
A testing utility designed to simulate prompt injection attacks and validate security scanners for AI agent skills.
Build systematic evaluation frameworks for AI agents using multi-dimensional rubrics, LLM-as-a-judge, and regression testing to measure performance, quality, and context engineering effectiveness.
Unified AI gateway for 100+ LLMs with OpenAI-compatible API, model fallbacks, load balancing, and enterprise-grade tools.
Universal CLI tool to convert and synchronize AI agent skills between Claude Code and Gemini CLI extensions.
A systematic workflow to instrument, evaluate, and monitor LLM applications using TruLens, supporting frameworks like LangChain, LangGraph, and LlamaIndex.
Intelligent orchestration for dispatching tasks to specialized background agents with performance-based routing and execution tracking.
Expert guidance for building production-ready applications with Anthropic's Claude API. Covers SDKs, prompt caching, batch processing, streaming, tool use, and cost optimization strategies.