ai-multimodal
Process and generate multimedia with Google Gemini. Analyze audio, images, videos, and PDFs with high-context windows. Supports transcription, visual QA, OCR, and AI-driven image creation.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
231 skills found
Process and generate multimedia with Google Gemini. Analyze audio, images, videos, and PDFs with high-context windows. Supports transcription, visual QA, OCR, and AI-driven image creation.
Execute implementation plans in separate sessions with review checkpoints, ensuring task-by-task verification and robust code quality.
Unified Python CLI for Tavily AI operations including web search, URL extraction, site crawling, link mapping, and automated deep research reports.
Anthropic Claude AI models for high-performance coding, large-context analysis, and GUI interaction.
Linear issue management and synchronization for LobeHub, featuring automated PR referencing, sub-issue tree decomposition, and status tracking.
Bridge assets from EVM chains to Starknet, deploy agent accounts, and register identities with the HuginnRegistry for autonomous AI agent onboarding.
A command-line interface for X/Twitter that allows for reading, searching, posting, and social engagement using cookie-based authentication, integrated into the OpenWhale AI agent ecosystem.
A local RAG semantic memory system using Qdrant and Ollama. Ideal for recalling workspace files, notes, project decisions, and user preferences with high-relevance vector search.
Complete browser automation with Playwright. Features local dev server detection, script generation, screenshot capture, form filling, responsive testing, and UX validation.
Structured batch manipulation, validation, and reporting for PlantUML sequence diagrams across multiple files.
Expert consultant for designing and building high-quality, consistent AI agent skills. Guides you through discovery, architecture, and creation phases to ensure reliable, composable, and efficient skill delivery.
Master DP patterns with complete implementations for memoization, tabulation, and state design for production-ready solutions.