videocut:剪口播
AI-powered video editing agent for talking head videos, featuring speech-to-text, disfluency detection, and browser-based review workflows.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
342 skills found
AI-powered video editing agent for talking head videos, featuring speech-to-text, disfluency detection, and browser-based review workflows.
Fetch and parse transcripts from YouTube and Bilibili videos for summarization, QA, and content extraction using yt-dlp.
A comprehensive framework for deep analysis of articles, papers, and long-form content using 10+ thinking models like SCQA, First Principles, and Systems Thinking.
Query Google NotebookLM notebooks directly from Claude Code for source-grounded, citation-backed answers from Gemini. Features persistent authentication, library management, and automated browser-based document retrieval.
Framework for multi-agent collaboration using the Google A2A protocol. Enables messaging, task delegation, and cross-agent coordination for CLI-based AI tools.
Symbol-level code understanding and navigation agent toolkit using LSP for precise code analysis, reference tracking, and surgical refactoring across 30+ programming languages.
Research agent for Nia: index/search remote codebases, docs, and packages. Optimizes AI context by prioritizing full source indexing over web fetches to reduce hallucinations.
Architects enterprise AI agents from structured specs, generating production-ready code, data flow diagrams, and platform-specific logic for ServiceNow, Salesforce, and Snowflake.
Token-efficient codebase analysis skill for call graphs, semantic search, impact analysis, and data flow. Saves ~95% tokens vs. raw reads.
Comprehensive AI-generated text detection framework. Features multi-layer analysis of vocabulary, structural patterns, model-specific fingerprints, and technical metadata artifacts to identify AI authorship.
Reliably read and extract content from publicly shared Google Docs using curl for full document retrieval.
Production-ready audio/video transcription using OpenAI Whisper. Features model selection, timing synchronization, speaker diarization, and batch processing for media workflows.