backend-rag-implementation
Build RAG systems to ground LLMs in proprietary data. Includes vector database integration, embedding strategies, hybrid search, and advanced retrieval patterns for FastAPI backends.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
138 skills found
Build RAG systems to ground LLMs in proprietary data. Includes vector database integration, embedding strategies, hybrid search, and advanced retrieval patterns for FastAPI backends.
Proactive context window management for AI agents via intelligent token monitoring, snapshot creation, and selective state rehydration to maintain continuity during long sessions.
Perform internet searches using the Zhipu AI web search API to retrieve real-time information, news, and current data.
Build AI agents with the OpenAI Agents SDK for Python. Supports multi-agent handoffs, function tools, stateful sessions, streaming, and Azure OpenAI integration via LiteLLM.
Maintain and update the MassGen model registry, including backend capabilities, model metadata, pricing structures, and context window configurations for new and existing AI models.
Package entire code repositories into single, AI-optimized files. Ideal for providing codebase context to LLMs like Claude, ChatGPT, and Gemini for analysis, security audits, and bug investigations.
Framework for multi-agent collaboration using the Google A2A protocol. Enables messaging, task delegation, and cross-agent coordination for CLI-based AI tools.
A powerful CLI for converting web content and search results into LLM-friendly formats like Markdown, text, or HTML using the Jina AI Reader API.
Analyze local system hardware (RAM, CPU, GPU/VRAM) to receive expert recommendations for optimized local LLM models, quantization settings, and performance estimates.
Unified AI gateway for 100+ LLMs with OpenAI-compatible API, model fallbacks, load balancing, and enterprise-grade tools.
Upstash Vector DB setup, semantic search, namespaces, and embedding models. Ideal for building high-performance vector search features in Next.js 16/Vercel projects.
GPT Researcher is an autonomous AI agent for comprehensive web and local research, generating detailed, cited reports using a planner-executor-publisher architecture.