training-data-curation
Guidelines for curating high-quality datasets for LLM post-training (SFT/DPO/RLHF), covering data formats, quality filtering, and collection strategies.
Automated academic literature retrieval, structured summarization, and multi-channel scheduling workflow for research topics.
AI-driven GitHub Actions automation featuring swarm-based workflow orchestration, intelligent CI/CD pipeline management, and autonomous repository maintenance.
Systematic Kubernetes troubleshooting, pod diagnostics, cluster health monitoring, and incident response playbooks.
Systematic debugging skill to trace errors backward through call stacks, identify original triggers, and implement layered defenses instead of patching symptoms.
Profiles application performance using k6, Artillery, or JMeter to measure latency, throughput, and error rates. Ideal for planning load, stress, and soak tests to identify bottlenecks.
Open-source infrastructure for reliable, multi-destination event delivery. Route webhooks to HTTP, SQS, RabbitMQ, Pub/Sub, EventBridge, or Kafka with built-in retries and observability.
A multi-paradigm ETL pipeline agent supporting batch and streaming data processing, schema inference, and configurable DAG-based transformations for heterogeneous data sources.
Expert SwiftUI development assistant: refactor code, improve performance, and diagnose app hitches or CPU issues using Xcode Instruments trace analysis.
Implement robust backend error handling with custom classes, middleware, structured logging, and recovery patterns.
DevOps and platform engineering patterns: Kubernetes, Terraform, GitOps, CI/CD, observability, incident response, and cloud-native ops.
Guide for implementing a new AI coding agent analyzer in Splitrail to track token usage, costs, and performance metrics.