Agent Skills Hub

Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.

109 skills found

ProductivityEngineeringData AnalysisContentResearch

ai-multimodal

Process and generate multimedia with Google Gemini. Analyze audio, images, videos, and PDFs with high-context windows. Supports transcription, visual QA, OCR, and AI-driven image creation.

Views: 14★ 9

EngineeringAutomation

robot-perception

Robot perception system design, configuration, and optimization for cameras, LiDAR, and sensor fusion pipelines. Includes camera calibration, 3D reconstruction, and production deployment best practices.

Views: 14★ 190

EngineeringData AnalysisAutomation

gemini-vision

Implement Google Gemini API vision capabilities for image/document analysis including captioning, object detection, segmentation, and multi-image comparison.

Views: 25★ 1

ProductivityData AnalysisAutomation

ocr

Extract text from images using the Tesseract OCR engine, supporting multiple languages, image preprocessing, and various formats.

Views: 18★ 1,130

ContentAutomationProductivity

nanaban

Generate and edit images using the Gemini API via the nanaban CLI. Create illustrations, logos, and icons, or perform photo edits like background removal and style transfer.

Data AnalysisAutomationEngineering

fiftyone-find-duplicates

Find, review, and remove duplicate or near-duplicate images in FiftyOne datasets using computer vision similarity embeddings.

Views: 7★ 26

ContentProductivityAutomation

image-generation

Generate high-quality visual content, characters, and scenes using structured JSON prompts and automated Python execution for guided image synthesis.

Views: 10★ 64,356

EngineeringData AnalysisAutomation

kreuzberg

High-performance document intelligence library for extracting text, tables, code, and metadata from 91+ file formats, with OCR and LLM-ready output.

Views: 11★ 8,205

AutomationEngineeringProductivity

seer

macOS visual automation tool for precise window capture, video recording, UI mockup annotation, Excalidraw wireframing, and automated visual regression testing.

Views: 14★ 62

AutomationEngineering