guard
Epistemic safety analysis for JSON data in prompts to prevent LLM hallucinations and reasoning errors when handling incomplete or large-scale datasets.
Monitor Runwall security posture, enabled guardrails, and recent audit logs for Claude Code, Codex, and MCP-based development environments.
Evaluate code generation models using the BigCode Evaluation Harness. Benchmarks include HumanEval, MBPP, and MultiPL-E, with pass@k metrics for multi-language coding models.
An advanced development guide for Claude Code, covering REPL environments, MCP integration, development workflows, and best practices for AI-assisted coding.
Standardize, validate, and manage Netresearch AI agent skill repositories with automated structure enforcement, distribution workflows, and licensing compliance tools.
Automated repository synchronization for multi-repo ecosystems, featuring intelligent failure diagnosis, auto-repair for Git state issues, and integrated ecosystem health checks.
Autonomous improvement loop for codebase optimization. Automatically modifies, measures, and iterates on code based on a specific goal and mechanical metric.
Download YouTube videos with customizable quality and format options. Supports video resolutions (360p-1080p), multiple containers (mp4, webm, mkv), and audio-only MP3 extraction.
Guide for integrating and managing custom Model Context Protocol (MCP) servers within the Cursor IDE environment.
Submit completed tasks on OpenAnt via CLI. Handles text reports, file uploads (images, docs, code), and external proof links to ensure verified deliverables.
Comprehensive Google Docs and Drive management tool. Supports document creation via Markdown, text formatting, structure analysis, and full file operations including upload, download, and sharing.
Preserve successful Python code executions as reusable tools within the gentools package structure, utilizing Pydantic models for structured output and type-safe interfaces.