qwen-asr
Transcribe audio files (wav, mp3, ogg) to text using the Qwen ASR model. Fast, local-friendly, and requires no API keys.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
151 skills found
Transcribe audio files (wav, mp3, ogg) to text using the Qwen ASR model. Fast, local-friendly, and requires no API keys.
Analyze markdown documentation files to ensure compliance with predefined AI token budgets and optimize content for efficient AI ingestion.
Deep document structure analysis and intelligent content extraction for knowledge bases.
Official documentation skill for Shipany, an AI-powered SaaS boilerplate. Provides expert guidance on Next.js 15, Drizzle ORM, NextAuth, and payment integrations.
Find, connect, and use over 100,000 MCP tools and skills via the Smithery CLI to integrate external services, manage agent workspaces, and automate workflows.
Expert Kokoro TTS implementation skill for real-time, secure, and offline voice synthesis in JARVIS-style assistants. Features streaming output, prosody control, and performance-optimized audio generation.
Intelligent strategic planning and requirements gathering with multi-perspective consensus loops and structured deliberation.
High-performance document intelligence library for extracting text, tables, code, and metadata from 91+ file formats, with OCR and LLM-ready output.
Normalizes testing defect logs by correcting typos, abbreviations, and ambiguous descriptions based on product-specific codebooks and station validation.
Generate professional multi-platform ad campaigns from a URL. Get ad copy, audience targeting, creative specs, and budget strategies ready for media buying.
Analyze Stitch projects and synthesize a semantic design system into DESIGN.md files to serve as a source of truth for AI-driven UI generation.
Generate diverse landing page narrative angles, define target audiences, and specify required evidence for conversion-focused marketing workflows.