nvidia-resiliency-ext
Provides resiliency, health monitoring, and fault tolerance utilities for NVIDIA GPU-accelerated distributed applications, including process management and API key handling.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
252 skills found
Provides resiliency, health monitoring, and fault tolerance utilities for NVIDIA GPU-accelerated distributed applications, including process management and API key handling.
Automate the migration of Netflix Conductor workflows to Temporal Python, including server orchestration, worker management, and workflow troubleshooting.
Integrate Snowflake with MCP clients. Manage Snowflake endpoints, validate connectivity, and leverage Cortex AI (Search, Analyst, Agent) services directly within your AI workflow.
Guide for implementing a new AI coding agent analyzer in Splitrail to track token usage, costs, and performance metrics.
AI-driven GitHub Actions automation featuring swarm-based workflow orchestration, intelligent CI/CD pipeline management, and autonomous repository maintenance.
Create, register, and manage custom agent tools and MCP servers to extend AI agent capabilities with external APIs and custom logic.
Perform a structured 8-factor conversion rate optimization (CRO) audit of any landing page to identify friction points and opportunities for growth.
A configuration and usage guide for the XRequest tool within the Ant Design X SDK, streamlining network integration for streaming AI interfaces.
Expert AWS solution architecture for startups focusing on serverless, scalable, and cost-effective cloud infrastructure with modern DevOps practices and IaC.
Interact with GitHub via the gh CLI to manage issues, pull requests, workflow runs, and execute advanced API queries programmatically.
Generates standardized metadata, including git/jj version info and timestamps, for research docs, handoffs, and implementation plans.
Epistemic safety analysis for JSON data in prompts to prevent LLM hallucinations and reasoning errors when handling incomplete or large-scale datasets.