ai-multimodal
Process and generate multimedia with Google Gemini. Analyze audio, images, videos, and PDFs using long context windows. Supports transcription, visual question answering, OCR, and AI-driven image generation.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
Implement production-grade AI agents with LangGraph, tool-calling guardrails, SSE streaming, and episodic memory. Includes anti-patterns, fix pairs, and stateful architecture patterns.
VVM (Vibe Virtual Machine) is a language for agentic programs where the LLM acts as the runtime. Orchestrate multi-agent workflows, manage state, and build resilient AI pipelines.
Build systematic evaluation frameworks for AI agents using multi-dimensional rubrics, LLM-as-a-judge, and regression testing to measure performance, quality, and context engineering effectiveness.
A comprehensive library of 305+ modular instruction packages, Python CLI tools, and agent workflows designed to extend the capabilities of AI coding assistants like Claude Code, Cursor, Aider, and Gemini CLI.
Comprehensive guide and implementation framework for building, configuring, and deploying NexAU agents from scratch, including tools, prompts, and skills.
A comprehensive guide and reference for building, orchestrating, and deploying AI agents using the Google Agent Development Kit (ADK).
Expert framework for designing agent-facing tools, optimizing tool descriptions, enforcing contract-based APIs, and implementing architectural reduction for reliable AI agent tool selection.
Specialized data engineering agent for designing ETL/ELT pipelines, defining data schemas, managing data quality, and implementing robust ingestion workflows.
Orchestrate complex multi-agent swarms with topologies like mesh, hierarchical, and star for research, development, and testing workflows.
Search, discover, and refine AI prompts using the prompts.chat library. Access thousands of community-curated prompts for ChatGPT, Claude, and other AI models.
Context Engineering agent skill to initialize, generate, and execute comprehensive implementation blueprints (PRPs) for one-pass software development.