pytorch-lightning
PyTorch Lightning skill for scalable deep learning: automates model training, multi-GPU orchestration, data pipelines, and distributed training strategies like DDP, FSDP, and DeepSpeed.
Generate and update PyTorch-compliant function and method docstrings using reStructuredText/Sphinx conventions.
Debugging guide for AReaL distributed training issues, including hangs, NCCL errors, OOM, and numerical consistency in FSDP2/TP/CP/EP.
Production-ready reinforcement learning using Stable Baselines3. Train agents, design custom environments, implement training callbacks, and optimize workflows with a scikit-learn-style API.
Tutorial for identifying and resolving CUDA runtime crashes using FlashInfer's API logging framework.
Train and manage neural networks in distributed E2B sandboxes using the Flow Nexus platform, supporting custom architectures like Transformers, LSTMs, and GANs.
Expert LangGraph architect skill for designing stateful, multi-actor AI agent workflows with robust persistence, conditional branching, and ReAct patterns.
Provides resiliency, health monitoring, and fault tolerance utilities for NVIDIA GPU-accelerated distributed applications, including process management and API key handling.
Three.js geometry generation: built-in shapes, BufferGeometry, vertex manipulation, custom meshes, and performance-optimized instanced rendering.
Python coding assistant providing best practices, PEP 8 enforcement, automated testing with pytest, and dependency management using uv.
Best practices and code patterns for ManimGL (3Blue1Brown's OpenGL animation engine). Provides templates, rules for 3D/interactive scenes, camera control, and LaTeX math visualization for technical creators.
Comprehensive Python healthcare AI toolkit for clinical data processing, medical coding translation, and developing deep learning models like RETAIN and Transformers for EHR, physiological signals, and clinical prediction tasks.