gemini-video-understanding
Perform advanced video analysis using Google's Gemini API: summarize content, transcribe audio, extract timestamps, clip segments, and analyze YouTube URLs or local files with support for multiple models and long contexts.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
Dialectical reasoning and adversarial coding agent for MCP-enabled editors, forcing LLMs to resolve internal contradictions for higher quality outputs.
Gemini-powered UI design review, accessibility auditing, and design system validation tool for software agents.
Expert guidance for Claude Messages API: structured outputs, prompt caching, tool use, and migration from deprecated Claude 3.x models to 4.5. Prevents common API errors.
Streamline technical documentation by generating, updating, and refining README files. Tailors content for specific audiences including OSS contributors, internal teams, and personal projects.
Provides real-time weather forecasts and personalized clothing recommendations for any city using wttr.in.
An automated memory middleware for AI agents, implementing a Retrieve-Respond-Save loop to maintain long-term persistent context across conversations.
Persistent, semantic long-term memory for AI agents. Save, query, and retrieve cross-session dialogues, decisions, and multimodal context using semantic compression.
ElevenLabs text-to-speech engine for OpenClaw with macOS-style CLI and voice synthesis control.
Translates Excel (.xlsx) files from English to Chinese while preserving all formatting, images, and charts.
Standardized skill for Claude Code agents to dynamically query OpenRouter model recommendations and metadata via the Claudish CLI.
Generate optimized YouTube metadata, titles, and descriptions for bilingual audiobook videos based on source and target language pairs.