qwen-asr
Transcribe audio files (wav, mp3, ogg) to text using the Qwen ASR model. Fast, local-friendly, and requires no API keys.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
153 skills found
Transcribe audio files (wav, mp3, ogg) to text using the Qwen ASR model. Fast, local-friendly, and requires no API keys.
Orchestrate Codex CLI for efficient parallel coding, task automation, and session-managed workflows to optimize token usage and development speed.
Build production-grade RAG systems using vector databases, semantic search, and LangGraph to ground LLMs in external knowledge.
Autonomous multi-agent LinkedIn system using LangGraph and Claude Opus 4.5 for trend research, content creation, voice profiling, and analytics-driven optimization.
Automate i18n setup, string extraction, and locale parity audits for React/TS codebases. Features framework-aware config, automated audit scripts, and safe string replacement to ensure seamless localization.
Integrates browser-native Proofreader API into web applications for AI-powered text correction, grammar checking, and language support with managed model lifecycle.
CLI-only iOS development agent for Swift, SwiftUI, and UIKit. Handles the full lifecycle: build, debug, test, and release without Xcode.
Analyze AppWorld task failures to extract specific API patterns and generate actionable playbook bullets with concrete code examples.
Automated text-to-image rendering engine for social media posts, article covers, and long-form threads. Supports X-style, WeChat, and poster templates with high-precision text formatting and highlights.
Persistent, semantic long-term memory for AI agents. Save, query, and retrieve cross-session dialogues, decisions, and multimodal context using semantic compression.
Search, discover, and refine AI prompts using the prompts.chat library. Access thousands of community-curated prompts for ChatGPT, Claude, and other AI models.
Automate Android device operations using AI AutoGLM Phone Agent. Enables natural language control for app testing, data collection, and UI interactions like tapping, scrolling, and inputting text.