gemini-audio
Implement Google Gemini API audio capabilities: process, transcribe, and summarize audio files, analyze environmental sounds, and generate natural speech with controllable TTS.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
171 skills found
Implement Google Gemini API audio capabilities: process, transcribe, and summarize audio files, analyze environmental sounds, and generate natural speech with controllable TTS.
Submit completed tasks on OpenAnt via CLI. Handles text reports, file uploads (images, docs, code), and external proof links to ensure verified deliverables.
Read and analyze any data file (CSV, JSON, Parquet, Avro, Excel, etc.) or remote URL (S3, HTTPS) using DuckDB. Automatically detect file formats and preview/profile datasets.
Implements frontend forms and actions in Kirby CMS, including contact forms, file uploads, email handling, and page creation from the frontend.
Automated screenshot-to-knowledge workflow for Enzo. Captures, categorizes, extracts content, and logs patterns from screenshots to build a structured reference library.
Downloads YouTube videos directly to your ~/Downloads folder using yt-dlp. Supports high-quality audio and video extraction.
Create new Figma design or FigJam files directly via the MCP server. Automatically resolves plans and initializes new canvases for your design workflows.
Implements Manus-style persistent markdown planning for complex workflows, project tracking, and research management to optimize agent attention and memory.
Manage Feishu cloud storage files, folders, and documents directly through your assistant.
Comprehensive email management and automation tool. Send, receive, and organize emails with attachment support across multiple providers.
Automate Convex static site hosting integration, managing upload APIs, HTTP routing, and deployment scripts for React, Vite, and Next.js applications.
Search codebases efficiently using ripgrep for lightning-fast text patterns and ast-grep for precise, syntax-aware structural code analysis.