gemini-audio
Implement Google Gemini API audio capabilities: process, transcribe, and summarize audio files, analyze environmental sounds, and generate natural speech with controllable TTS.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
518 skills found
Implement Google Gemini API audio capabilities: process, transcribe, and summarize audio files, analyze environmental sounds, and generate natural speech with controllable TTS.
Orchestrate parallel Claude Code worker swarms with protocol-based behavioral governance for complex features, multi-step refactors, and long-running autonomous coding sessions.
Break down complex development requests into sequenced, actionable tasks for multi-agent delegation in Claude Code environments.
Architectural guidance and pattern implementation for Java Spring Boot backends, covering REST API design, JPA, caching, async processing, and logging.
GitHub operations via gh CLI. Use for repository inspection, issues, PRs, releases, and deep codebase analysis including cloning for architectural insights.
Update context-mode to the latest version, rebuild assets, reinstall global NPM dependencies, and refresh hook configurations.
P9 Tech Lead mode: Manages P8 agent teams via Task Prompts (six-element) without direct coding. Orchestrates 3+ parallel agents for project management, task decomposition, and architecture.
A unified document processing gateway for PDF parsing, text extraction, conversion, and document manipulation across multiple local and cloud providers.
Structured AI-guided research and market validation for new app ideas. Automates competitor analysis, technical feasibility, and MVP scoping.
Advanced prompt rewriting and optimization service. Analyzes prompts for clarity, specificity, and structure, providing actionable improvements, variations for testing, and prompt engineering best practices.
Extract YouTube video subtitles or transcripts directly into local text files using yt-dlp or browser automation.
Persistent state management and workflow analytics using DuckDB for task dependency tracking, historical metrics, and context checkpointing.