gemini-audio
Implement Google Gemini API audio capabilities: process, transcribe, and summarize audio files, analyze environmental sounds, and generate natural speech with controllable TTS.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
Validates and coordinates batch study guide operations, preventing errors by enforcing template compatibility, file availability, and source-only policies before agent execution.
Reliably read and extract content from publicly shared Google Docs using curl for full document retrieval.
Your AI chief of staff for startups. Includes 28 commands for idea validation, financial modeling, pitch decks, market research, and CEO operational frameworks based on industry benchmarks.
Analyze GA4 and GSC performance data with automated benchmarks, status indicators, and actionable content optimization insights.
Build RAG systems to ground LLMs in proprietary data. Includes vector database integration, embedding strategies, hybrid search, and advanced retrieval patterns for FastAPI backends.
Expert SQL agent for modern database systems, query optimization, HTAP environments, and data architecture patterns. Optimize performance, schema design, and analytical workloads effectively.
Find, review, and remove duplicate or near-duplicate images in FiftyOne datasets using computer vision similarity embeddings.
An all-in-one Chinese daily utility toolkit: weather, currency exchange, news, and package tracking. Zero configuration, no API keys required.
Generates a random lucky number between 0 and 9999 for games, decision-making, or entertainment.
Development guide for lemline-core, the stateless Serverless Workflow engine. Manage workflow execution, node navigation, state transitions, JQ expression evaluation, error handling, and parallel fork logic.
Cross-agent interaction skill using the ANP protocol. Use decentralized identity (DID) to discover and invoke remote agents, such as maps, booking, and logistics services, across the ANP network.