qwen-asr
Transcribe audio files (wav, mp3, ogg) to text using the Qwen ASR model. Fast, local-friendly, and requires no API keys.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
173 skills found
Transcribe audio files (wav, mp3, ogg) to text using the Qwen ASR model. Fast, local-friendly, and requires no API keys.
Gate 2 development cycle skill that validates observability implementation, including structured logging, OpenTelemetry tracing, and instrumentation coverage, without modifying code.
Standardized React UI patterns for loading states, error handling, and data fetching to ensure consistent UX and robust component architecture.
AI agent skill for Moltbot Arena, a real-time strategy programming game. Manage units, automate resource harvesting, coordinate structures, and execute tactical decisions via REST API.
Implement Google Gemini API audio capabilities: process, transcribe, and summarize audio files, analyze environmental sounds, and generate natural speech with controllable TTS.
Develop, test, sign, and publish governance plugins for Memoria using Rhai or gRPC runtimes. Manage the full plugin lifecycle from scaffolding to activation.
Unified CLI tool to read, query, discover, and write AI agent conversations using the agents:// URI scheme across multiple coding agents and providers.
Guide for integrating and managing custom Model Context Protocol (MCP) servers within the Cursor IDE environment.
Retrieve real-time library documentation, code examples, and technical guidance using the Context7 API for frameworks like React, FastAPI, and Next.js.
Expert for XRK-AGT runtime core, bot main class, event bus, server startup (HTTP/WS), and global object management.
Framework for building Vertesia plugins with a dual tool-server and UI architecture, featuring Hot Module Replacement, build-tools, and asset management.
Guidance for Model Context Protocol (MCP) server development, including tool design, resource handling, and AI/ML integration patterns.