training-data-curation
Guidelines for curating high-quality datasets for LLM post-training (SFT/DPO/RLHF), covering data formats, quality filtering, and collection strategies.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
120 skills found
Guidelines for curating high-quality datasets for LLM post-training (SFT/DPO/RLHF), covering data formats, quality filtering, and collection strategies.
Expert advisor for implementing Anthropic's structured outputs. Choose between JSON mode and strict tool use for guaranteed schema compliance and validated agentic workflows.
Create and test AI-ready MCP tools for any web application. Inject code, automate browser interactions, and turn websites into intelligent agents.
Toolkit for testing local web applications using Playwright, featuring server lifecycle management, automated DOM inspection, and browser automation workflows.
Free AI-powered web search via Exa MCP. Includes deep research, company/people lookup, and code context without API keys.
Create and manage TikTok image carousels via the ViralBaby API. Automate image search, text overlays, and draft uploads for social media content creation.
Fetch YouTube transcripts and subtitles. Ideal for video summarization, language learning, accessibility, and content analysis. Supports timestamped data and raw text extraction.
Process and generate multimedia with Google Gemini. Analyze audio, images, videos, and PDFs with high-context windows. Supports transcription, visual QA, OCR, and AI-driven image creation.
An all-in-one Chinese daily utility toolkit: weather, currency exchange, news, and package tracking. Zero configuration, no API keys required.
Extract YouTube video subtitles or transcripts directly into local text files using yt-dlp or browser automation.
Monitor US-Iran strike probability via real-time open-source signals including market odds, flight traffic, energy prices, and geopolitical alerts.
Automate high-quality screenshot generation for MicroSim visualizations using Chrome headless mode. Ideal for documentation, social media previews, and quality assessment.