gemini-audio
Implement Google Gemini API audio capabilities: process, transcribe, and summarize audio files, analyze environmental sounds, and generate natural speech with controllable TTS.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
283 skills found
Implement Google Gemini API audio capabilities: process, transcribe, and summarize audio files, analyze environmental sounds, and generate natural speech with controllable TTS.
A local RAG semantic memory system using Qdrant and Ollama. Ideal for recalling workspace files, notes, project decisions, and user preferences with high-relevance vector search.
Deploy serverless browser automation as cloud functions using Browserbase. Perfect for cron jobs, webhook endpoints, and scalable cloud-based automation tasks.
Create professional data visualizations with Python using matplotlib, seaborn, and plotly. Includes chart selection guidance, design principles, accessibility standards, and code patterns for publication-quality figures.
Generate professional pull request descriptions using Grey Haven Studio standards, ensuring clear summaries, motivation, technical implementation details, and testing strategies.
Research technical documentation and automatically generate ready-to-use software agent skills in markdown format.
Best practices and code patterns for ManimGL (3Blue1Brown's OpenGL animation engine). Provides templates, rules for 3D/interactive scenes, camera control, and LaTeX math visualization for technical creators.
Development guide for self-improving MassGen via programmatic automation testing and visual UI/UX evaluation.
A flexible template for developing and integrating custom AI agent skills within the Mini-Agent framework.
NestJS 11+ expert assistant for enterprise Node.js development, including dependency injection, DTO validation, authentication, ORMs, testing, microservices, and architectural best practices.
Framework for building multi-agent systems, AgentOS runtimes, and MCP-integrated AI agents.
Base ecosystem skill for Refly. Creates, discovers, and runs domain-specific skills, routes user intent to workflows via symlinks, and automates multi-step pipelines via the Refly CLI.