robot-perception
Robot perception system design, configuration, and optimization for cameras, LiDAR, and sensor fusion pipelines. Includes camera calibration, 3D reconstruction, and production deployment best practices.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
149 skills found
Robot perception system design, configuration, and optimization for cameras, LiDAR, and sensor fusion pipelines. Includes camera calibration, 3D reconstruction, and production deployment best practices.
Process and generate multimedia with Google Gemini. Analyze audio, images, videos, and PDFs with high-context windows. Supports transcription, visual QA, OCR, and AI-driven image creation.
Systematic project technology stack detection, framework-specific skill auto-loading, and multi-stack analysis for fullstack projects like React + Go.
Analyze and identify codebase patterns (naming, architecture, testing) to maintain consistency and enforce standards during development.
Implement Google Gemini API vision capabilities for image/document analysis including captioning, object detection, segmentation, and multi-image comparison.
High-performance document intelligence library for extracting text, tables, code, and metadata from 91+ file formats, with OCR and LLM-ready output.
Extract text from images using the Tesseract OCR engine, supporting multiple languages, image preprocessing, and various formats.
Evaluate code generation models using BigCode Evaluation Harness. Benchmarks include HumanEval, MBPP, and MultiPL-E with pass@k metrics for multi-language coding models.
Capture snapshots, video clips, and monitor motion events from RTSP and ONVIF compatible security cameras.
Analyze AppWorld task failures to extract specific API patterns and generate actionable playbook bullets with concrete code examples.
AI-powered food calorie and nutrient calculator. Uses vision recognition to identify meals, calculate macronutrients, and provide health suggestions based on a built-in nutrition database.
Automatically detect code changes and suggest documentation updates. Keeps READMEs, API specs, and configuration guides in sync with your implementation.