ai-multimodal
Process and generate multimedia with Google Gemini. Analyze audio, images, videos, and PDFs using large context windows. Supports transcription, visual question answering, OCR, and AI-driven image generation.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
Build and manage MCP servers using the FastMCP framework. Guide for creating tools, resources, prompts, Claude Desktop integration, and deployment with Python and TypeScript.
Efficiently search your Zotero library using Python code execution. Enables comprehensive multi-strategy queries, automated deduplication, and relevance ranking without context overflow or system crashes.
Create high-performance AI skills by reverse-engineering successful GitHub projects and proven open-source methodologies.
🛡️ GDPR & LGPD Privacy Guardian: Automated compliance scanner that detects PII exposure, insecure logging, and tracking violations in your codebase to prevent regulatory fines.
AI-powered browser automation server for web interaction, data extraction, and research using the Model Context Protocol.
Crawl websites to extract content as clean markdown files. Ideal for documentation, research, and offline knowledge management.
Implement ReasoningBank adaptive learning with AgentDB's ultra-fast vector backend. Features trajectory tracking, verdict judgment, memory distillation, and pattern recognition for self-learning autonomous agents.
Expert CLI guides for AI agents, featuring senior engineer workflows, safety guardrails, and operational patterns for cloud, IaC, containers, databases, and dev tools.
Audit and optimize your AI prompts with Token Surgeon. Detect 10 common waste patterns, calculate efficiency, and reduce token usage for better prompt performance.
Manage automatic model routing for Higress AI Gateway via CLI. Configure triggers for intelligent model selection based on request content.
A systematic, multi-angle web research agent. Use it for deep investigation and complex queries, and as a mandatory research step before content generation to ensure evidence-backed results.