gemini-vision
Implement Google Gemini API vision capabilities for image/document analysis including captioning, object detection, segmentation, and multi-image comparison.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
449 skills found
Implement Google Gemini API vision capabilities for image/document analysis including captioning, object detection, segmentation, and multi-image comparison.
Verify code style and formatting using Prettier and Stylelint without applying changes. Ensures consistent codebases by identifying issues in JS/TS/CSS/SCSS files.
Build a professional LinkedIn content system to establish authority, attract inbound leads, and maintain a consistent personal brand through strategic positioning, content pillars, and optimized posting rhythms.
A testing utility designed to simulate prompt injection attacks and validate security scanners for AI agent skills.
Implementation patterns for MERIDIAN autonomous AI agents using Claude API, including BaseAgent lifecycle, structured tool use, token budget enforcement, and cron scheduling.
Automate WordPress content publishing with draft workflows, media library integration, and native Hebrew/RTL support.
A professional tool for reading, creating, and editing .docx documents with precise layout control, using python-docx and automated visual rendering checks.
Autonomous multi-agent LinkedIn system using LangGraph and Claude Opus 4.5 for trend research, content creation, voice profiling, and analytics-driven optimization.
Detects timing side-channel vulnerabilities in cryptographic code through static and dynamic analysis across multiple programming languages.
iMessage and SMS CLI for macOS: list chats, view history, and send messages directly via Messages.app.
Maintain Mintlify documentation sites: configure navigation, manage MDX content, add components, and handle API references.
Search and analyze X (Twitter) trends, hashtags, and tweet data by location using custom CLI tools.