Agent Skills Hub

Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.

Clear

94 skills found

ProductivityData AnalysisAutomation
ocr avatar

ocr

Extract text from images using the Tesseract OCR engine, supporting multiple languages, image preprocessing, and various formats.

Views: 181,130
ProductivityEngineeringData AnalysisContentResearch
ai-multimodal avatar

ai-multimodal

Process and generate multimedia with Google Gemini. Analyze audio, images, videos, and PDFs with high-context windows. Supports transcription, visual QA, OCR, and AI-driven image creation.

Views: 149
ContentProductivityAutomation
image-generation avatar

image-generation

Generate high-quality visual content, characters, and scenes using structured JSON prompts and automated Python execution for guided image synthesis.

Views: 1064,356
EngineeringData AnalysisAutomation
gemini-vision avatar

gemini-vision

Implement Google Gemini API vision capabilities for image/document analysis including captioning, object detection, segmentation, and multi-image comparison.

Views: 251
EngineeringAutomation
pattern-detection avatar

pattern-detection

Analyze and identify codebase patterns (naming, architecture, testing) to maintain consistency and enforce standards during development.

Views: 9265
ProductivityAutomationEngineering
image-fetcher avatar

image-fetcher

Fetch, download, and batch process web images in various formats (JPG, PNG, WebP, SVG, etc.) for embedding, archiving, or chat integration.

Views: 216
Data AnalysisAutomationEngineering
fiftyone-find-duplicates avatar

fiftyone-find-duplicates

Find, review, and remove duplicate or near-duplicate images in FiftyOne datasets using computer vision similarity embeddings.

Views: 726
ContentResearchProductivity
generate-image avatar

generate-image

Generate or edit images using AI models like FLUX and Gemini. Ideal for photos, illustrations, concept art, and visual assets, excluding technical diagrams and schematics.

Views: 411,655
EngineeringAutomation
robot-perception avatar

robot-perception

Robot perception system design, configuration, and optimization for cameras, LiDAR, and sensor fusion pipelines. Includes camera calibration, 3D reconstruction, and production deployment best practices.

Views: 14190
ContentAutomationProductivity
nanaban avatar

nanaban

Generate and edit images using the Gemini API via the nanaban CLI. Create illustrations, logos, and icons, or perform photo edits like background removal and style transfer.

Views: 16
ProductivityAutomationResearch
screenshot-capture avatar

screenshot-capture

Automated screenshot-to-knowledge workflow for Enzo. Captures, categorizes, extracts content, and logs patterns from screenshots to build a structured reference library.

Views: 124,456
EngineeringAutomation
pix avatar

pix

An autonomous UI implementation agent that converts Figma designs into pixel-perfect code using Figma MCP and browser-based refinement.

Views: 1490