openai-whisper
Local speech-to-text transcription using the OpenAI Whisper CLI, providing private, high-accuracy audio processing without external API keys.
Implement Google Gemini API vision capabilities for image and document analysis, including captioning, object detection, segmentation, and multi-image comparison.
Automate GitLab repository management with this API-based tool. Perform file operations, branch management, and project tracking directly through your AI agent.
A comprehensive PDF toolkit for extracting text/tables, merging, splitting, rotating, and programmatically generating or filling PDF documents using Python and CLI tools.
Publish Markdown to Feishu Docs, with automatic table conversion, permission management, and batched document writes.
Comprehensive office productivity toolkit for AI agents, featuring PDF, Word, Excel, PowerPoint, and internal communication automation capabilities.
Controls a local or remote headless browser for automated web navigation, data extraction, form interaction, and testing from sandboxed environments.
A comprehensive tool for managing PowerPoint presentations, supporting creation, editing, text extraction, template application, and visual analysis of .pptx files.
Fetch, download, and batch process web images in various formats (JPG, PNG, WebP, SVG, etc.) for embedding, archiving, or chat integration.
Manage, sync, and transfer files between local storage and cloud providers like S3, Cloudflare R2, Backblaze B2, Google Drive, and Dropbox using rclone.
Automate Instagram posts via Telegram or CLI. Features residential proxy bypass, session caching, and WaveSpeed image integration.
A modular data processing tool for cleaning, validating, and analyzing CSV files with support for custom transformations and automated dependency management.