ai-multimodal
Process and generate multimedia with Google Gemini. Analyze audio, images, videos, and PDFs using long context windows. Supports transcription, visual QA, OCR, and AI-driven image creation.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
An AI-powered sales assistant that transforms business scenarios into optimized prompts, automatically generating high-quality emails, proposals, and analysis reports without requiring prompt engineering skills.
Generate and edit images, diagrams, and infographics using Google's Gemini 3 Pro model. Supports text-to-image, style transformation, and data-accurate visual creation.
Generate clinical trial protocols for medical devices and drugs. Supports modular, waypoint-based design, research integration, and regulatory documentation alignment.
A structured guide for novelists to navigate the seven-step writing process, from constitution and specification to planning, tasking, drafting, and quality analysis.
Build professional, accessible, and responsive user interfaces using React, Next.js, and modern design systems like shadcn/ui. Focuses on developer tools, chat interfaces, and real-time streaming components.
Extract tacit engineering knowledge through guided interviews and generate structured steerings for consistent project standards and conventions.
Orchestrate complex workflows by coordinating multiple specialized AI agents for multi-perspective code analysis, feature implementation, and system-wide reviews.
Autonomous multi-agent orchestration framework for Claude Code with memory-driven workflows, parallel-first task execution, Aristotle-based deconstruction, and multi-stage quality gates.
Orchestrator for audio plugin WebView UI design, handling iterative mockup generation and production-ready scaffolding for JUCE-based instruments and effects.
Standardizes project context by managing artifacts (product, tech-stack, workflow, tracks) in a conductor/ directory. Supports project scaffolding, artifact synchronization, and AI alignment for greenfield and brownfield projects.
Orchestrate Codex CLI for efficient parallel coding, task automation, and session-managed workflows to optimize token usage and development speed.