qras
A local RAG semantic memory system using Qdrant and Ollama. Ideal for recalling workspace files, notes, project decisions, and user preferences with high-relevance vector search.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
118 skills found
A local RAG semantic memory system using Qdrant and Ollama. Ideal for recalling workspace files, notes, project decisions, and user preferences with high-relevance vector search.
Build RAG systems to ground LLMs in proprietary data. Includes vector database integration, embedding strategies, hybrid search, and advanced retrieval patterns for FastAPI backends.
Build production-grade RAG systems using vector databases, semantic search, and LangGraph to ground LLMs in external knowledge.
Essential guide to llmemory for document storage and search: installation, database setup with pgvector, document ingestion, hybrid/semantic retrieval, and building RAG systems with multi-tenant support.
Upstash Vector DB setup, semantic search, namespaces, and embedding models. Ideal for building high-performance vector search features in Next.js 16/Vercel projects.
Local hybrid search engine for markdown notes, documentation, and codebase knowledge bases to reduce token consumption and improve retrieval efficiency.
Efficiently search your Zotero library using Python code execution. Enables comprehensive multi-strategy queries, automated deduplication, and relevance ranking without context overflow or system crashes.
Intelligent RAG-based gateway that routes coding tasks to specialized Swift/iOS expertise without context window bloat. Uses MCP to retrieve precise patterns from 100+ indexed skills.
Supermemory is a long-term memory infrastructure for AI agents, enabling persistent context, user profiles, and semantic RAG across multi-modal knowledge bases.
A suite of professional tools for auditing, evaluating, chunking, and scaffolding production-ready RAG pipelines within Claude Code.
Architect and optimize production-grade RAG systems. Master embedding models, vector databases, chunking strategies, and retrieval pipelines for high-accuracy LLM applications.
Anthropic Claude integration patterns: streaming, RAG with pgvector, tool use, model selection (Haiku/Sonnet/Opus), prompt caching, and cost management for AI-powered engineering.