openai-whisper
Local speech-to-text transcription using the OpenAI Whisper CLI, providing private, high-accuracy audio processing without external API keys.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
307 skills found
Local speech-to-text transcription using the OpenAI Whisper CLI, providing private, high-accuracy audio processing without external API keys.
An end-to-end video processing pipeline that transforms raw recordings into transcripts, key insights, short clips, and polished articles.
Generate Bilibili-compatible video chapter lists from SRT subtitle files with strict format validation.
Process massive files and large codebases (10M+ tokens) by recursively chunking, sub-querying, and aggregating results to overcome LLM context limits.
Update text within fillable PDF forms programmatically. Efficiently modify names, dates, addresses, and reference numbers in form fields while preserving document structure.
A prototype skill for automating YouTube live chat moderation using pattern-based detection for spam, toxic content, and rate limiting, optimized for testing agent reliability before deployment.
Generate professional tarot and astrology content for social media, including 12-sign weekly horoscopes, tarot spread scripts, event-driven video scripts, and custom cover art.
Optimize non-signup forms to increase conversion rates. Includes lead capture, contact, demo, application, and survey forms.
Extract plain text from EPUB, MOBI, and PDF files for analysis or processing. Includes local support for all common ebook formats.
Synchronize English README.md with Chinese README_ZH.md, maintaining content parity and structural consistency for bilingual documentation projects.
A comprehensive framework for creating, structuring, and managing reusable AI Agent Skills to standardize instruction-driven workflows.
Creates and edits Excel spreadsheets with professional formatting, formulas, and financial modeling standards using openpyxl and pandas.