Content
video-pipeline avatar

video-pipeline

An end-to-end video processing pipeline that transforms raw recordings into transcripts, key insights, short clips, and polished articles.

Introduction

Video Pipeline is a comprehensive automation agent skill designed to streamline the lifecycle of video content processing. It acts as an orchestrator for a suite of specialized skills, moving from raw video files (such as live streams, podcast recordings, or conference footage) to a professional-grade multi-format output. This solution is specifically designed for creators, marketing teams, and knowledge workers who need to repurpose long-form audiovisual media into diverse content assets without manual intervention.

  • Orchestrates a four-stage pipeline: transcript generation, intelligent insight extraction, automated clip generation, and article composition.

  • Leverages WhisperX for high-accuracy speech-to-text with speaker diarization and word-level timestamp alignment.

  • Features intelligent clipping logic that automatically removes silence, filler words, and detects high-impact 'golden quotes' and debate segments.

  • Generates context-aware articles for platforms like blogs, Zhihu, or WeChat by synthesizing transcripts and structured insights.

  • Employs a parallel execution strategy using sub-agent spawning for the clipping and extraction phases to maximize throughput and minimize processing time.

  • Provides a complete workspace environment, organizing all intermediate and final assets into a structured project directory.

  • Input: Requires a video file (MP4/etc.), with optional parameters for project name, output scope, target platform, and desired clip count.

  • Output: A comprehensive directory containing transcript.txt, transcript.json, insights.md, clips/ (video assets), and articles/ (Markdown docs).

  • Prerequisites: Requires ffmpeg, whisperX, and a HuggingFace token for speaker diarization. Supports nohup-based background processing to handle long-duration tasks reliably.

  • Usage Tip: The insight-extractor phase is mandatory for generating high-quality clips and articles; skipping it may reduce the output quality of subsequent stages.

Repository Stats

Stars
1
Forks
1
Open Issues
0
Language
Python
Default Branch
main
Sync Status
Idle
Last Synced
May 3, 2026, 10:01 PM
View on GitHub