ai-multimodal
Process and generate multimedia with Google Gemini. Analyze audio, images, videos, and PDFs with high-context windows. Supports transcription, visual QA, OCR, and AI-driven image creation.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
296 skills found
Process and generate multimedia with Google Gemini. Analyze audio, images, videos, and PDFs with high-context windows. Supports transcription, visual QA, OCR, and AI-driven image creation.
Retrieves Apple platform documentation, Human Interface Guidelines, and WWDC transcripts as Markdown using the Sosumi service.
Supermemory is a long-term memory infrastructure for AI agents, enabling persistent context, user profiles, and semantic RAG across multi-modal knowledge bases.
AI-powered tax advisor providing expert guidance on 2025 Japanese tax regulations, deductions, and financial planning for freelancers and employees.
Execute z.AI CLI for multimodal analysis, web search, reader, and GitHub repo exploration via CLI and MCP.
A comprehensive Next.js 15 development and project management skill for Claude Code, featuring Supabase integration, RBAC, and automated quality validation.
CLI-only iOS development agent for Swift, SwiftUI, and UIKit. Handles the full lifecycle: build, debug, test, and release without Xcode.
Neural web search and code context retrieval via Exa AI. Ideal for documentation, technical research, code examples, and company intelligence.
Retrieve current weather conditions, forecasts, and temperature data for global locations to assist with daily planning and travel.
Pre-execution security guardrails for AI agents. Validates shell commands and file reads against 400+ security patterns to block destructive operations, credential theft, and unauthorized system access.
Create high-converting email sequences for sales, launches, and lead nurturing. Expertly crafted drip campaigns tailored to your business voice, audience, and offer goals.
Extract, deobfuscate, and port WebGL/Canvas/Shader visual effects from websites into standalone, native JavaScript projects.