training-data-curation
Guidelines for curating high-quality datasets for LLM post-training (SFT/DPO/RLHF), covering data formats, quality filtering, and collection strategies.
Foundational Python library for static, animated, and interactive data visualization. Provides fine-grained control over plot elements for scientific, publication-ready figures.
A unified interface for integrating and managing LLM chat providers like OpenAI, Anthropic, Google, Azure, and Bedrock within LangChain applications.
Development guide for lemline-core, the stateless Serverless Workflow engine. Manage workflow execution, node navigation, state transitions, JQ expression evaluation, error handling, and parallel fork logic.
A structured guide for novelists to navigate the seven-step writing process, from constitution and specification to planning, tasking, drafting, and quality analysis.
Expert Kokoro TTS implementation skill for real-time, secure, and offline voice synthesis in JARVIS-style assistants. Features streaming output, prosody control, and performance-optimized audio generation.
Synchronize English README.md with Chinese README_ZH.md, maintaining content parity and structural consistency for bilingual documentation projects.
Build performant Three.js web scenes using modern ES modules. Includes scene graph setup, lighting, geometries, GLTF/GLB loading, animation loops, and performance optimization best practices.
FHIR API development guide for building compliant healthcare endpoints. Includes resource validation, coding systems, and standard error handling.
Control and monitor Xiaomi Mijia smart home devices including status switching, device discovery, automation scenes, and environmental statistics.
High-performance in-memory DataFrame library for Python and Rust. Features lazy evaluation, parallel execution, and an Apache Arrow backend for efficient ETL and data processing, positioned as a faster alternative to pandas.
Standardize git remote configuration and issue tracking for contributors working with forked repositories in the libuipc project.