Agent Skills Hub

Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.

Clear

109 skills found

ProductivityEngineeringData AnalysisContentResearch
ai-multimodal avatar

ai-multimodal

Process and generate multimedia with Google Gemini. Analyze audio, images, videos, and PDFs with high-context windows. Supports transcription, visual QA, OCR, and AI-driven image creation.

Views: 149
EngineeringAutomation
robot-perception avatar

robot-perception

Robot perception system design, configuration, and optimization for cameras, LiDAR, and sensor fusion pipelines. Includes camera calibration, 3D reconstruction, and production deployment best practices.

Views: 14190
ProductivityData AnalysisAutomation
ocr avatar

ocr

Extract text from images using the Tesseract OCR engine, supporting multiple languages, image preprocessing, and various formats.

Views: 181,130
EngineeringData AnalysisAutomation
gemini-vision avatar

gemini-vision

Implement Google Gemini API vision capabilities for image/document analysis including captioning, object detection, segmentation, and multi-image comparison.

Views: 251
ContentAutomationProductivity
nanaban avatar

nanaban

Generate and edit images using the Gemini API via the nanaban CLI. Create illustrations, logos, and icons, or perform photo edits like background removal and style transfer.

Views: 16
ContentProductivityAutomation
image-generation avatar

image-generation

Generate high-quality visual content, characters, and scenes using structured JSON prompts and automated Python execution for guided image synthesis.

Views: 1064,356
ContentMarketing
visual-creative avatar

visual-creative

AI-powered creative visual prompt generator for posters, banners, product shots, and social media content.

Views: 64,430
EngineeringData AnalysisAutomation
kreuzberg avatar

kreuzberg

High-performance document intelligence library for extracting text, tables, code, and metadata from 91+ file formats, with OCR and LLM-ready output.

Views: 118,205
Data AnalysisAutomationEngineering
fiftyone-find-duplicates avatar

fiftyone-find-duplicates

Find, review, and remove duplicate or near-duplicate images in FiftyOne datasets using computer vision similarity embeddings.

Views: 726
ProductivityContentAutomation
digital-brain avatar

digital-brain

A structured personal operating system for managing digital presence, knowledge, relationships, and goals with AI assistance for founders, creators, and professionals.

Views: 4015,339
EngineeringData AnalysisEducationAutomation
gemini-video-understanding avatar

gemini-video-understanding

Perform advanced video analysis using Google's Gemini API: summarize content, transcribe audio, extract timestamps, clip segments, and analyze YouTube URLs or local files with support for multiple models and long contexts.

Views: 1091
ProductivityEngineeringContent
image-enhancer avatar

image-enhancer

Enhance image quality, resolution, and sharpness for screenshots and digital media. Perfect for professional documentation, blogs, and presentations.

Views: 132,839