gemini-vision
Implement Google Gemini API vision capabilities for image/document analysis including captioning, object detection, segmentation, and multi-image comparison.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
340 skills found
Implement Google Gemini API vision capabilities for image/document analysis including captioning, object detection, segmentation, and multi-image comparison.
Transforms vague or poorly structured prompts into optimized, high-performance instructions using proven prompt engineering principles for better AI model execution.
Build AI agents, multi-agent systems, and workflows using the OpenAI Agents SDK for TypeScript/JavaScript. Supports tools, handoffs, guardrails, MCP, and realtime voice.
Best practices for building integrations with NetBox REST and GraphQL APIs. Optimize performance, authentication, and architectural patterns for NetBox automations.
A guide for building high-quality MCP (Model Context Protocol) servers in Python or TypeScript to integrate external APIs and services into LLM workflows.
Generate, validate, and refine Mermaid diagrams including flowcharts, sequence diagrams, ERDs, and architecture maps to visualize complex software systems and workflows.
Expert guidance for configuring FeatBit observability via OpenTelemetry. Use for setting up metrics, logs, traces, and connecting OTEL backends like Seq, Jaeger, or Prometheus for FeatBit backend monitoring.
A RAG-based AI solver for high school Chinese GSAT exams, featuring structured knowledge retrieval, reasoning templates, and explainable AI outputs.
Manage automatic model routing for Higress AI Gateway via CLI. Configure triggers for intelligent model selection based on request content.
Create, debug, and optimize Cloudflare Durable Objects. Supports stateful coordination, RPC, SQLite storage, WebSocket handlers, and Vitest testing.
Discover, analyze, and summarize trending GitHub repositories, project health, and technical stacks to stay updated on open-source ecosystems.
Expert skill for building and maintaining AI agents using the Claude Agent SDK, covering architecture, tool integration, MCP servers, and agentic workflows.