gemini-audio
Implement Google Gemini API audio capabilities: process, transcribe, and summarize audio files, analyze environmental sounds, and generate natural speech with controllable TTS.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
437 skills found
Implement Google Gemini API audio capabilities: process, transcribe, and summarize audio files, analyze environmental sounds, and generate natural speech with controllable TTS.
Search the live web using Baidu AI Search Engine (BDSE) for real-time information, documentation, and research topics.
Interactive Archon integration for knowledge base and project management. Features RAG-powered semantic search, website crawling, document versioning, and hierarchical task management via REST API.
Initiates automated reverse engineering by discovering codebase architecture, layers, and technology stacks to facilitate system modernization or documentation.
Universal CLI tool to convert and synchronize AI agent skills between Claude Code and Gemini CLI extensions.
Debug the AWF (Agentic Workflow Firewall) by inspecting containers, analyzing Squid logs, checking iptables, and troubleshooting network or domain access issues in isolated sandboxes.
Expert consultant for designing and building high-quality, consistent AI agent skills. Guides you through discovery, architecture, and creation phases to ensure reliable, composable, and efficient skill delivery.
Automate B2C mobile app marketing with short-form video strategies for TikTok, Instagram, and YouTube. Includes content creation, scheduling via Post Bridge API, and performance analysis.
Framework for orchestrating long-running agentic tasks, evidence-based delivery, and automated QA gates following Simon Willison's iterative loop.
Comprehensive Test Driven Development (TDD) assistant for engineering teams, featuring intelligent test generation, coverage analysis, and multi-framework support.
Generate professional visual assets including app icons, logos, banners, and illustrations using the Nano Banana Pro (Gemini 3 Pro) AI model.
Create and manage Claude Code skills using Anthropic best practices: triggers, hooks, and progressive disclosure.