gemini-audio
Implement Google Gemini API audio capabilities: process, transcribe, and summarize audio files, analyze environmental sounds, and generate natural speech with controllable TTS.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
Executes Gradle-based Java tests, filters results for failures and key statistics, and provides concise reports to streamline backend development and debugging.
Aggressively prune grammatical scaffolding and filler text from inputs to optimize LLM token usage while retaining core semantic content.
Implement production-grade AI agents with LangGraph, tool-calling guardrails, SSE streaming, and episodic memory. Includes anti-patterns, fix pairs, and stateful architecture patterns.
React and Vite performance optimization guidelines. Use when writing, reviewing, or optimizing React components built with Vite.
A highly customized personal digital garden based on Quartz v4, featuring enhanced Markdown parsing, telescopic text, TikZ/pseudocode rendering, and Obsidian integration.
Persistent state management and workflow analytics using DuckDB for task dependency tracking, historical metrics, and context checkpointing.
MCP Gateway design patterns for managing Agent Gateway, Subprocess, and Daemon isolation strategies to optimize context token usage and system performance.
Implement LlamaExtract for robust structured data extraction from PDF, DOCX, and PPTX files using Pydantic schemas.
Guidance on frontend state management, including global stores like Zustand/Pinia, server state via TanStack Query, and URL state handling.
Advanced multi-language debugging support with stack trace analysis, runtime error triage, and automated diagnostic tools for containerized and distributed systems.
Automated high-quality VS Code screenshot capture using Playwright and serve-web for documentation, slide decks, and visual technical content.