Agent Skills Hub

Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.


136 skills found

Content, Productivity, Automation

image-generation

Generate high-quality visual content, characters, and scenes using structured JSON prompts and automated Python execution for guided image synthesis.

Views: 1064,356
Productivity, Engineering, Automation

image-manipulation-image-magick

Process and manipulate images using ImageMagick. Supports resizing, format conversion, batch processing, and retrieving image metadata for developers and automated workflows.

Views: 631,724
Productivity, Engineering, Data Analysis, Content, Research

ai-multimodal

Process and generate multimedia with Google Gemini. Analyze audio, images, videos, and PDFs using large context windows. Supports transcription, visual QA, OCR, and AI-driven image creation.

Views: 149
Engineering, Data Analysis, Automation

gemini-vision

Implement Google Gemini API vision capabilities for image/document analysis including captioning, object detection, segmentation, and multi-image comparison.

Views: 251
Engineering, Automation

robot-perception

Robot perception system design, configuration, and optimization for cameras, LiDAR, and sensor fusion pipelines. Includes camera calibration, 3D reconstruction, and production deployment best practices.

Views: 14190
Engineering, Data Analysis, Automation

kreuzberg

High-performance document intelligence library for extracting text, tables, code, and metadata from 91+ file formats, with OCR and LLM-ready output.

Views: 118,205
Content, Automation, Productivity

video-pipeline

An end-to-end video processing pipeline that transforms raw recordings into transcripts, key insights, short clips, and polished articles.

Views: 121
Engineering, Automation

pix

An autonomous UI implementation agent that converts Figma designs into pixel-perfect code using Figma MCP and browser-based refinement.

Views: 1490
Engineering, Data Analysis, Automation, Research

parxy

A unified document processing gateway for PDF parsing, text extraction, conversion, and document manipulation across multiple local and cloud providers.

Views: 59
Content, Research, Productivity

generate-image

Generate or edit images using AI models like FLUX and Gemini. Ideal for photos, illustrations, concept art, and visual assets, excluding technical diagrams and schematics.

Views: 411,655
Content, Marketing, Productivity

nano-image-generator

Generate professional visual assets including app icons, logos, banners, and illustrations using the Nano Banana Pro (Gemini 3 Pro) AI model.

Views: 83,943
Productivity, Automation

sag

ElevenLabs text-to-speech engine for OpenClaw with macOS-style CLI and voice synthesis control.

Views: 10366,063