gemini-video-understanding
Perform advanced video analysis using Google's Gemini API: summarize content, transcribe audio, extract timestamps, clip segments, and analyze YouTube URLs or local files with support for multiple models and long contexts.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
328 skills found
Perform advanced video analysis using Google's Gemini API: summarize content, transcribe audio, extract timestamps, clip segments, and analyze YouTube URLs or local files with support for multiple models and long contexts.
Fetches expert perspectives from OpenAI Codex and Google Gemini for architecture, code reviews, and debugging, with transparent LLM synthesis.
Automate high-quality screenshot generation for MicroSim visualizations using Chrome headless mode. Ideal for documentation, social media previews, and quality assessment.
Creative research ideation partner for exploring interdisciplinary connections, challenging assumptions, and generating testable scientific hypotheses.
Find, review, and remove duplicate or near-duplicate images in FiftyOne datasets using computer vision similarity embeddings.
Analyze and identify codebase patterns (naming, architecture, testing) to maintain consistency and enforce standards during development.
Transforms content to match specific voice profiles, tones, or styles using configurable YAML templates for consistent brand and narrative output.
Evaluate code generation models using BigCode Evaluation Harness. Benchmarks include HumanEval, MBPP, and MultiPL-E with pass@k metrics for multi-language coding models.
Search the web using Tavily's LLM-optimized search API for relevant, source-cited content without writing code.
Research technical documentation and automatically generate ready-to-use software agent skills in markdown format.
Generates structured Handoff Pack prompts for delegating scoped coding tasks to Gemini with clear instructions, acceptance criteria, and output requirements.
A runtime skill discovery engine for AI agents. Search and retrieve specialized agent skills (SKILL.md) on-demand via REST API or MCP to inject procedural knowledge into your agent's context.