wavespeed-nano-banana-2
Generate and edit images using Google's Nano Banana 2 via WaveSpeed AI. Supports text-to-image, natural language editing, multi-image composition, 4K resolution, and various aspect ratios.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
136 skills found
Generate and edit images using Google's Nano Banana 2 via WaveSpeed AI. Supports text-to-image, natural language editing, multi-image composition, 4K resolution, and various aspect ratios.
Generate and edit images, diagrams, and infographics using Google's Gemini 3 Pro model. Supports text-to-image, style transformation, and data-accurate visual creation.
Generate high-quality images via a local ComfyUI instance. Perfect for private workflows and professional-grade AI image synthesis.
Perform advanced video analysis using Google's Gemini API: summarize content, transcribe audio, extract timestamps, clip segments, and analyze YouTube URLs or local files with support for multiple models and long contexts.
Normalizes testing defect logs by correcting typos, abbreviations, and ambiguous descriptions based on product-specific codebooks and station validation.
Generate and edit images using the Gemini API via the nanaban CLI. Create illustrations, logos, and icons, or perform photo edits like background removal and style transfer.
Generate artistic 3D city-themed food diorama images using Google Gemini API. Creates Pop Mart style four-quadrant layouts featuring iconic dishes, cultural symbols, and city-specific heritage elements.
Generate professional PowerPoint presentations using AI. Create full-bleed, high-resolution slide decks from topic prompts with Gemini-powered narrative planning and image generation.
Create professional logos using AI. Generate, refine, crop, remove backgrounds, and export as SVG. Perfect for brands, mascots, icons, and emblems.
Preprocessing and cleaning astronomical light curves using Lightkurve. Tools for outlier removal, flattening, trend detrending, and quality flag handling for time-series analysis.
Analyze AppWorld task failures to extract specific API patterns and generate actionable playbook bullets with concrete code examples.
Local text-to-speech conversion using Kokoro TTS. Generate audio, read text aloud, and handle multilingual speech synthesis directly in your terminal.