ai-multimodal
Process and generate multimedia with Google Gemini. Analyze audio, images, videos, and PDFs with high-context windows. Supports transcription, visual QA, OCR, and AI-driven image creation.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
395 skills found
Process and generate multimedia with Google Gemini. Analyze audio, images, videos, and PDFs with high-context windows. Supports transcription, visual QA, OCR, and AI-driven image creation.
Deploy and manage Vercel projects, including linking repositories, environment variables, and domain configurations.
Structured AI-guided research and market validation for new app ideas. Automates competitor analysis, technical feasibility, and MVP scoping.
Automated quality gate using 5 parallel AI agents to review code changes for correctness, style, and consistency.
Orchestrate multi-agent AI swarms using the ClawTeam CLI to automate parallel task execution, dependency management, and team collaboration with git worktree isolation and tmux support.
Integrate with REST APIs to manage authentication, execute HTTP requests, and process JSON responses seamlessly within your development workflow.
A scaffolding tool for generating production-ready Model Context Protocol (MCP) servers, including boilerplate, typed handlers, schema definitions, and test stubs for AI agent integrations.
Comprehensive Test Driven Development (TDD) assistant for engineering teams, featuring intelligent test generation, coverage analysis, and multi-framework support.
Generate a structured academic paper outline from research narrative, experiment data, and review conclusions.
A comprehensive configuration suite for Claude Code, featuring production-grade agents, skills, hooks, and automated workflows optimized for high-intensity development.
Research agent for Nia: index/search remote codebases, docs, and packages. Optimizes AI context by prioritizing full source indexing over web fetches to reduce hallucinations.
Build $50k-grade frontend interfaces with production-ready code, professional typography, and high-fidelity image integration.