wavespeed-nano-banana-2
Generate and edit images using Google's Nano Banana 2 via WaveSpeed AI. Supports text-to-image, natural language editing, multi-image composition, 4K resolution, and various aspect ratios.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
98 skills found
Generate and edit images using Google's Nano Banana 2 via WaveSpeed AI. Supports text-to-image, natural language editing, multi-image composition, 4K resolution, and various aspect ratios.
AI-powered creative visual prompt generator for posters, banners, product shots, and social media content.
Process and generate multimedia with Google Gemini. Analyze audio, images, videos, and PDFs with high-context windows. Supports transcription, visual QA, OCR, and AI-driven image creation.
Generate high-quality visual content, characters, and scenes using structured JSON prompts and automated Python execution for guided image synthesis.
Generate professional PowerPoint presentations using AI. Create full-bleed, high-resolution slide decks from topic prompts with Gemini-powered narrative planning and image generation.
Generate images using the Cloudflare Workers AI flux-1-schnell model. Enables text-to-image capabilities directly within your workflow.
Google Gemini Image Generation API interface for text-to-image, editing, style templates, and automated retry workflows.
Generate high-quality images via a local ComfyUI instance. Perfect for private workflows and professional-grade AI image synthesis.
Creates professional, editable PowerPoint (.pptx) presentations with AI-generated full-slide images, brand consistency, and style references.
Generate or edit images using AI models like FLUX and Gemini. Ideal for photos, illustrations, concept art, and visual assets, excluding technical diagrams and schematics.
Generate and edit images using the Gemini API via the nanaban CLI. Create illustrations, logos, and icons, or perform photo edits like background removal and style transfer.
Generate artistic 3D city-themed food diorama images using Google Gemini API. Creates Pop Mart style four-quadrant layouts featuring iconic dishes, cultural symbols, and city-specific heritage elements.