gemini-vision
Implement Google Gemini API vision capabilities for image/document analysis including captioning, object detection, segmentation, and multi-image comparison.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
508 skills found
Implement Google Gemini API vision capabilities for image/document analysis including captioning, object detection, segmentation, and multi-image comparison.
Gemini-powered UI design review, accessibility auditing, and design system validation tool for software agents.
Persistent state management and workflow analytics using DuckDB for task dependency tracking, historical metrics, and context checkpointing.
Standardized Java development guidelines including naming conventions, exception handling, Spring Boot best practices, and concurrency patterns.
Comprehensive AI-generated text detection framework. Features multi-layer analysis of vocabulary, structural patterns, model-specific fingerprints, and technical metadata artifacts to identify AI authorship.
Audit, prune, and maintain vector memory for Clawdbot. Prevents token waste, clears junk data, and automates memory hygiene via LanceDB maintenance.
Manage, run, and update JS framework benchmarks for the Gea framework, including reporting, HTML result generation, and performance comparisons.
Generate hierarchical, token-efficient AGENTS.md files for AI coding agents to provide repository-wide context and project-specific guidelines.
Build complete UI screens by composing multiple uxscii components. Use when you need to create, scaffold, or build .uxm screens like login, dashboard, profile, settings, or checkout pages.
Nonlinear optimization toolkit using CasADi and IPOPT. Ideal for building complex NLP models, defining symbolic variables, constraints, and solvers, with specialized support for power systems optimization patterns.
Expert guide for OpenCode AI: TUI commands, CLI operations, AGENTS.md configuration, custom agent workflows, and project setup.
A framework for applying Test-Driven Development to process documentation, ensuring agent reliability by using pressure scenarios to identify and patch rationalization loopholes.