ai-multimodal
Process and generate multimedia with Google Gemini. Analyze audio, images, videos, and PDFs with high-context windows. Supports transcription, visual QA, OCR, and AI-driven image creation.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
261 skills found
Process and generate multimedia with Google Gemini. Analyze audio, images, videos, and PDFs with high-context windows. Supports transcription, visual QA, OCR, and AI-driven image creation.
Generate AGENTS.md and AI configuration files (Cursor, Claude, Gemini, Copilot) for your project to streamline your vibe-coding workflow and maintain context across sessions.
Unified CLI tool to read, query, discover, and write AI agent conversations using the agents:// URI scheme across multiple coding agents and providers.
Create interactive, custom data visualizations using d3.js — including charts, graphs, and network diagrams. Ideal for when you need fine-grained control over visual elements, transitions, and interactions.
A unified document processing gateway for PDF parsing, text extraction, conversion, and document manipulation across multiple local and cloud providers.
6-phase read-only Python analysis workflow that identifies design principle violations, code smells, and modernization opportunities based on specific project types (POC to Open Source).
Pre-execution security guardrails for AI agents. Validates shell commands and file reads against 400+ security patterns to block destructive operations, credential theft, and unauthorized system access.
Specialized Pest 4 agent for Laravel testing: writing, refactoring, TDD, browser/smoke tests, and architecture enforcement.
Transforms content to match specific voice profiles, tones, or styles using configurable YAML templates for consistent brand and narrative output.
Orchestrate visual communication by drawing diagrams, flowcharts, and annotations on a TLDraw canvas via CLI. Ideal for architectural planning, PR reviews, and logging agent output.
Implement adaptive learning with ReasoningBank for pattern recognition, strategy optimization, and continuous improvement in AI agents.
Standardized Rust documentation practices for the HASH codebase, ensuring consistency in doc comments, intra-doc links, and error handling.