gemini-audio
Implement Google Gemini API audio capabilities: process, transcribe, and summarize audio files, analyze environmental sounds, and generate natural speech with controllable TTS.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
500 skills found
Implement Google Gemini API audio capabilities: process, transcribe, and summarize audio files, analyze environmental sounds, and generate natural speech with controllable TTS.
Complete project architecture and structure guide for LobeHub. Use for codebase exploration, project organization, file location, and architectural context.
Expert framework for designing agent-facing tools, optimizing tool descriptions, enforcing contract-based APIs, and implementing architectural reduction for reliable AI agent tool selection.
Intelligent strategic planning and requirements gathering with multi-perspective consensus loops and structured deliberation.
Interactive guide for workspace discovery, providing access to specialist agents, automated workflows, CLI tools, and active lifecycle hooks.
Development guide for creating custom nodes in FlowGram.ai workflows, supporting both auto-generated simple forms and complex custom UI components.
Efficiently manage git worktrees with automated file synchronization, background task execution, and CLI-based workspace orchestration.
Execute implementation plans in separate sessions with review checkpoints, ensuring task-by-task verification and robust code quality.
Implement LlamaExtract for robust structured data extraction from PDF, DOCX, and PPTX files using Pydantic schemas.
Generate or edit images using AI models like FLUX and Gemini. Ideal for photos, illustrations, concept art, and visual assets, excluding technical diagrams and schematics.
AI-powered generator for viral XiaoHongShu posts, including titles, captions, hashtags, cover image prompts, and posting strategies.
Gemini-powered UI design review, accessibility auditing, and design system validation tool for software agents.