gemini-video-understanding
Perform advanced video analysis using Google's Gemini API: summarize content, transcribe audio, extract timestamps, clip segments, and analyze YouTube URLs or local files with support for multiple models and long contexts.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
121 skills found
Perform advanced video analysis using Google's Gemini API: summarize content, transcribe audio, extract timestamps, clip segments, and analyze YouTube URLs or local files with support for multiple models and long contexts.
Gemini-powered UI design review, accessibility auditing, and design system validation tool for software agents.
A powerful CLI tool to automate and manage Google Workspace services, including Gmail, Calendar, Drive, Sheets, and Docs.
Automate frontend API integration using Apidog and MCP servers. Generate TypeScript types, TanStack Query hooks, and axios-based clients from OpenAPI specifications for consistent, type-safe API consumption.
Search the web for real-time information, technical documentation, or research topics using WebSearch and WebFetch tools.
Query Google NotebookLM notebooks directly from Claude Code for source-grounded, citation-backed answers from Gemini. Features persistent authentication, library management, and automated browser-based document retrieval.
Fetches expert perspectives from OpenAI Codex and Google Gemini for architecture, code reviews, and debugging, with transparent LLM synthesis.
Advanced Gemini-powered web search plugin with smart caching, subagent context isolation, and automated query optimization.
A comprehensive guide and reference for building, orchestrating, and deploying AI agents using the Google Agent Development Kit (ADK).
Google Ads integration for managing campaigns, accounts, budgets, and reporting via Membrane CLI. Streamline your advertising workflow with automated authentication and cross-resource management.
Operate Google Tag Manager via MCP. Handles OAuth, resource discovery, and CRUD operations for tags, triggers, and variables directly from your LLM agent.
Generate professional PowerPoint presentations using AI. Create full-bleed, high-resolution slide decks from topic prompts with Gemini-powered narrative planning and image generation.