ai-multimodal
Process and generate multimedia with Google Gemini. Analyze audio, images, videos, and PDFs with high-context windows. Supports transcription, visual QA, OCR, and AI-driven image creation.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
503 skills found
Process and generate multimedia with Google Gemini. Analyze audio, images, videos, and PDFs with high-context windows. Supports transcription, visual QA, OCR, and AI-driven image creation.
Integrates browser-native Proofreader API into web applications for AI-powered text correction, grammar checking, and language support with managed model lifecycle.
Master Godot 4 GDScript patterns, including signal-based communication, state machines, scene architecture, and performance optimization for professional game development.
Track exercise reps like pushups and squats for the Peon Trainer. Log progress directly through your AI agent to trigger voice lines and stay motivated while coding.
Advanced prompt rewriting and optimization service. Analyzes prompts for clarity, specificity, and structure, providing actionable improvements, variations for testing, and prompt engineering best practices.
A specification-driven workflow management system for structured development lifecycle management, covering proposal, planning, implementation, and archival phases.
Self-maintaining skill for OpenCode agents to update documentation, capture learnings, and extend tool/agent capabilities dynamically.
Queen-led multi-agent orchestration for Claude Code, featuring Byzantine consensus, persistent collective memory, and adaptive task distribution for complex software projects.
Enforces low Cognitive and Cyclomatic complexity in all code. Automatically maintains readability, modularity, and maintainability by preventing complex functions during development.
Mandatory execution-based validation for all software implementation tasks. Ensures code works through empirical verification before confirmation.
Execute git commits with conventional commit message analysis, intelligent file staging, and automated semantic message generation based on code diffs.
Maintains a detailed, step-by-step implementation diary for coding sessions with docmgr integration to track changes, rationale, commands, and failures.