pytorch-lightning
PyTorch Lightning skill for scalable deep learning: automates model training, multi-GPU orchestration, data pipelines, and distributed training strategies like DDP, FSDP, and DeepSpeed.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
131 skills found
PyTorch Lightning skill for scalable deep learning: automates model training, multi-GPU orchestration, data pipelines, and distributed training strategies like DDP, FSDP, and DeepSpeed.
Train and manage neural networks in distributed E2B sandboxes using the Flow Nexus platform, supporting custom architectures like Transformers, LSTMs, and GANs.
Local text-to-speech conversion using Kokoro TTS. Generate audio, read text aloud, and handle multilingual speech synthesis directly in your terminal.
Production-ready reinforcement learning using Stable Baselines3. Train agents, design custom environments, implement training callbacks, and optimize workflows with a scikit-learn-style API.
Debugging guide for AReaL distributed training issues, including hangs, NCCL errors, OOM, and numerical consistency in FSDP2/TP/CP/EP.
Analyze local system hardware (RAM, CPU, GPU/VRAM) to receive expert recommendations for optimized local LLM models, quantization settings, and performance estimates.
Comprehensive Python healthcare AI toolkit for clinical data processing, medical coding translation, and developing deep learning models like RETAIN and Transformers for EHR, physiological signals, and clinical prediction tasks.
Unified local ML inference server for ASR, TTS, Translation, Image Generation, and Vision on Apple Silicon, powered by MLX.
Generate or edit images using AI models like FLUX and Gemini. Ideal for photos, illustrations, concept art, and visual assets, excluding technical diagrams and schematics.
Tutorial for identifying and resolving CUDA runtime crashes using FlashInfer's API logging framework.
Production-ready audio/video transcription using OpenAI Whisper. Features model selection, timing synchronization, speaker diarization, and batch processing for media workflows.
Perform advanced video analysis using Google's Gemini API: summarize content, transcribe audio, extract timestamps, clip segments, and analyze YouTube URLs or local files with support for multiple models and long contexts.