backend-rag-implementation
Build RAG systems to ground LLMs in proprietary data. Includes vector database integration, embedding strategies, hybrid search, and advanced retrieval patterns for FastAPI backends.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
128 skills found
Build RAG systems to ground LLMs in proprietary data. Includes vector database integration, embedding strategies, hybrid search, and advanced retrieval patterns for FastAPI backends.
Build production-grade RAG systems using vector databases, semantic search, and LangGraph to ground LLMs in external knowledge.
Intelligent RAG-based gateway that routes coding tasks to specialized Swift/iOS expertise without context window bloat. Uses MCP to retrieve precise patterns from 100+ indexed skills.
Find, review, and remove duplicate or near-duplicate images in FiftyOne datasets using computer vision similarity embeddings.
Upstash Vector DB setup, semantic search, namespaces, and embedding models. Ideal for building high-performance vector search features in Next.js 16/Vercel projects.
Evaluate code generation models using BigCode Evaluation Harness. Benchmarks include HumanEval, MBPP, and MultiPL-E with pass@k metrics for multi-language coding models.
Architect and optimize production-grade RAG systems. Master embedding models, vector databases, chunking strategies, and retrieval pipelines for high-accuracy LLM applications.
Local hybrid search engine for markdown notes, documentation, and codebase knowledge bases to reduce token consumption and improve retrieval efficiency.
A local RAG semantic memory system using Qdrant and Ollama. Ideal for recalling workspace files, notes, project decisions, and user preferences with high-relevance vector search.
Search and reference Chromium documentation, including design docs, APIs, and development guides. Use to locate, browse, or learn about architecture, GPU, network, security, and testing concepts within the Chromium codebase.
Essential guide to llmemory for document storage and search: installation, database setup with pgvector, document ingestion, hybrid/semantic retrieval, and building RAG systems with multi-tenant support.
Supermemory is a long-term memory infrastructure for AI agents, enabling persistent context, user profiles, and semantic RAG across multi-modal knowledge bases.