robot-perception
Robot perception system design, configuration, and optimization for cameras, LiDAR, and sensor fusion pipelines. Includes camera calibration, 3D reconstruction, and production deployment best practices.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
138 skills found
Local text-to-speech conversion using Kokoro TTS. Generate audio, read text aloud, and handle multilingual speech synthesis directly in your terminal.
Build RAG systems to ground LLMs in proprietary data. Includes vector database integration, embedding strategies, hybrid search, and advanced retrieval patterns for FastAPI backends.
A systematic, multi-angle web research agent. Use it for deep investigation and complex queries, and as a mandatory pre-research step before content generation to ensure evidence-backed, high-quality results.
Drafts LaTeX research papers section-by-section using paper plans and research narratives with multi-model reviewer validation.
Generate high-quality visual content, characters, and scenes using structured JSON prompts and automated Python execution for guided image synthesis.
Search and retrieve AI-generated documentation, architecture guides, and API references for 300+ popular GitHub repositories using DeepWiki and MCP.
Expert guide for OpenCode AI: TUI commands, CLI operations, AGENTS.md configuration, custom agent workflows, and project setup.
Connect your AI agent to the Hugging Face Hub via MCP. Search models, datasets, and papers, manage repos, run cloud compute jobs, and invoke Gradio Spaces as functional AI tools.
Tools for deploying, managing, and monitoring DataRobot models, including prediction environment configuration, champion/challenger workflows, and deployment operations.
Perform advanced video analysis using Google's Gemini API: summarize content, transcribe audio, extract timestamps, clip segments, and analyze YouTube URLs or local files with support for multiple models and long contexts.
An intelligent development orchestration skill that provides self-improving code analysis, build error diagnosis, and automated workflow configuration via mcp-prompts integration.