ai-multimodal
Process and generate multimedia with Google Gemini. Analyze audio, images, videos, and PDFs using large context windows. Supports transcription, visual question answering, OCR, and AI-driven image generation.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
352 skills found
AI-optimized artifact tracking system for token-efficient project orchestration, phase management, and automated task delegation using YAML-Markdown hybrid formats.
Audit and optimize your AI prompts with Token Surgeon. Detect 10 common waste patterns, calculate efficiency, and reduce token usage for better prompt performance.
Perform rigorous code reviews for FastMCP projects, focusing on API design, dependency management, and codebase consistency.
Automated GitHub issue analysis, triage, and resolution planning tool integrated with Specification-Driven Development (SDD) workflows.
Analyzes codebases to generate hierarchical documentation, onboarding guides, and architecture maps, helping teams understand and document their projects efficiently.
Expert UI/UX design assistant for React & Next.js. Provides visual critiques, design system architecture, and Tailwind CSS/shadcn/ui implementation guidance for production-grade web applications.
Orchestrate Codex CLI for efficient parallel coding, task automation, and session-managed workflows to optimize token usage and development speed.
A framework for applying Test-Driven Development to process documentation, ensuring agent reliability by using pressure scenarios to identify and patch rationalization loopholes.
Build and manage MCP servers using the FastMCP framework. Guide for creating tools, resources, prompts, Claude Desktop integration, and deployment with Python and TypeScript.
Perform comprehensive code reviews with a focus on security vulnerabilities, performance optimization, maintainability, and code correctness.
Intelligent GitHub release orchestration using AI swarms for automated versioning, multi-platform deployment, testing, and rollback management.