rag-engineer
Architect and optimize production-grade RAG systems. Master embedding models, vector databases, chunking strategies, and retrieval pipelines for high-accuracy LLM applications.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
126 skills found
Architect and optimize production-grade RAG systems. Master embedding models, vector databases, chunking strategies, and retrieval pipelines for high-accuracy LLM applications.
Build RAG systems to ground LLMs in proprietary data. Includes vector database integration, embedding strategies, hybrid search, and advanced retrieval patterns for FastAPI backends.
Build production-grade RAG systems using vector databases, semantic search, and LangGraph to ground LLMs in external knowledge.
Intelligent RAG-based gateway that routes coding tasks to specialized Swift/iOS expertise without context window bloat. Uses MCP to retrieve precise patterns from 100+ indexed skills.
A suite of professional tools for auditing, evaluating, chunking, and scaffolding production-ready RAG pipelines within Claude Code.
A local RAG semantic memory system using Qdrant and Ollama. Ideal for recalling workspace files, notes, project decisions, and user preferences with high-relevance vector search.
A toolkit for building robust LLM integrations: API patterns, streaming, function calling, RAG pipelines, and cost-effective model routing.
Essential guide to llmemory for document storage and search: installation, database setup with pgvector, document ingestion, hybrid/semantic retrieval, and building RAG systems with multi-tenant support.
Retrieve current, source-backed technical information using MCP tools to resolve queries about libraries, APIs, SDKs, and evolving tech ecosystems.
Guides agent memory system implementation, compares frameworks (Mem0, Zep, Letta, LangMem, Cognee), and designs persistence architectures for cross-session knowledge retention.
Search the web using Tavily's LLM-optimized search API for relevant, source-cited content without writing code.
A powerful CLI for converting web content and search results into LLM-friendly formats like Markdown, text, or HTML using the Jina AI Reader API.