gemini-audio
Implement Google Gemini API audio capabilities: process, transcribe, and summarize audio files, analyze environmental sounds, and generate natural speech with controllable TTS.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
134 skills found
Implement Google Gemini API audio capabilities: process, transcribe, and summarize audio files, analyze environmental sounds, and generate natural speech with controllable TTS.
Framework for building professional competitive landscape decks, including market positioning, peer benchmarking, and strategic synthesis for finance professionals.
Create aesthetically beautiful interfaces using systematic design principles, AI-driven evaluation, and automated inspiration analysis.
Convert clinical text to natural, empathetic speech using ElevenLabs for patient instructions, medication reminders, and accessible health content.
Create and optimize professional LinkedIn content, including posts, articles, and engagement strategies with character-counted hooks and platform-specific formatting.
Access Y Combinator’s library of 443+ startup resources for expert advice on fundraising, co-founders, product development, growth, and scaling your business.
Physical hardware synthesis bridge for PAI. Generates blueprints, 3D printing code, SVG paths for laser cutting, and G-Code for CNC machining to bring agentic designs into the physical world.
Drafts engaging social media posts, writes hooks, creates thread structures, and generates companion images for LinkedIn and X/Twitter.
Design professional-grade brand identities using geometric primitives, negative space, and flat vector-style aesthetics via AI-driven branding logic.
Analyze markdown documentation files to ensure compliance with predefined AI token budgets and optimize content for efficient AI ingestion.
A design-focused coding agent that brings world-class interface craft, motion, and systematic front-end engineering to your development workflow.
AI-powered coach for Xiaohongshu (XHS) note writing. Generate viral, platform-optimized content with storytelling templates, engagement hacks, and automated compliance tagging.