qwen-asr
Transcribe audio files (wav, mp3, ogg) to text using the Qwen ASR model. Fast and local-friendly, with no API keys required.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
452 skills found
An automated visual note and flowchart generator. Converts text or keywords into styled diagrams, mind maps, and handwritten notes exported as images without requiring file-reading permissions.
Implement PCI DSS compliance for secure payment processing, cardholder data protection, and audit preparation using standardized security patterns.
Analyze and audit Excel spreadsheets to understand logic, identify formula errors, detect risks, and generate documentation for legacy or unknown files.
Generate or edit images using AI models like FLUX and Gemini. Ideal for photos, illustrations, concept art, and visual assets, excluding technical diagrams and schematics.
Generate real-time AI podcast-style audio narratives using Azure OpenAI's GPT Realtime Mini model with WebSocket streaming, complete with PCM to WAV conversion and frontend playback integration.
An intelligent gateway that analyzes, scores, and routes user requests across 27 agents, 27 skills, and 14 MCPs to optimize Claude Code execution.
Build production-grade RAG systems using vector databases, semantic search, and LangGraph to ground LLMs in external knowledge.
Master workflow controller for Lovable-style, AI-driven development. Instantly generates premium, multi-page, animated applications by routing to specialized sub-agents. No prompts needed: just build.
Research agent for Nia: index/search remote codebases, docs, and packages. Optimizes AI context by prioritizing full source indexing over web fetches to reduce hallucinations.
Perform network protocol reverse engineering, including packet capture, traffic analysis, protocol dissection, and custom format documentation.
Fetch, download, and batch process web images in various formats (JPG, PNG, WebP, SVG, etc.) for embedding, archiving, or chat integration.