gemini-vision
Implement Google Gemini API vision capabilities for image/document analysis including captioning, object detection, segmentation, and multi-image comparison.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
290 skills found
Implement Google Gemini API vision capabilities for image/document analysis including captioning, object detection, segmentation, and multi-image comparison.
Generate a structured academic paper outline from research narrative, experiment data, and review conclusions.
Manage version control with Jujutsu (jj): perform rebasing, conflict resolution, bookmark management, and commit manipulation in your Git-compatible workflow.
Autonomous iteration loop for AI software development. Executes tasks, validates code, and manages state until completion. Ideal for implementing complex PRP plans.
Cross-agent interaction skill via ANP protocol. Use decentralized identity (DID) to discover and invoke remote agents like maps, booking, and logistics services across the ANP network.
Manage automatic model routing for Higress AI Gateway via CLI. Configure triggers for intelligent model selection based on request content.
Fetches expert perspectives from OpenAI Codex and Google Gemini for architecture, code reviews, and debugging, with transparent LLM synthesis.
Advanced context engineering system for orchestrating AI agents, memory management, and token optimization to improve long-term persistence and project intelligence.
Implements an autonomous, critical self-verification layer for AI agents to validate code quality, security, and requirement alignment before task completion.
Autonomous multi-team codebase improvement agent with specialized modes: narrow (goal-directed), broad (hypothesis-divergent), and sweep (quality-focused).
An intelligent gateway that analyzes, scores, and routes user requests across 27 agents, 27 skills, and 14 MCPs to optimize Claude Code execution.
Find, review, and remove duplicate or near-duplicate images in FiftyOne datasets using computer vision similarity embeddings.