Agent Skills Hub

Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.

132 skills found

debug-distributed

Debugging guide for AReaL distributed training issues, including hangs, NCCL errors, OOM, and numerical consistency in FSDP2/TP/CP/EP.

Views: 4 | ★ 5,126

Research, Automation, Engineering

research-pipeline

End-to-end autonomous research agent: from idea generation and literature review to experiment execution, adversarial review loops, and paper writing.

Views: 15 | ★ 7,817

Engineering, Automation

shift-right-testing

Production-grade testing strategy implementing feature flags, canary releases, synthetic monitoring, and chaos engineering for continuous reliability in live environments.

Views: 20 | ★ 329 | #shift-right #production-testing #canary #feature-flags

Marketing, Research, Education

marketing-psychology

Apply behavioral science, mental models, and psychological principles to marketing strategy, copywriting, and decision-making.

Views: 7 | ★ 25,470

Education, Research, General

yc-advisor

Access Y Combinator’s library of 443+ startup resources for expert advice on fundraising, co-founders, product development, growth, and scaling your business.

Views: 13 | ★ 167

Engineering, Automation

milady-development

Self-modify your Milady agent by managing plugins. Edit code, rebuild, and restart the runtime to develop new capabilities or improve agent workflows locally.

Views: 11 | ★ 398

Engineering, Automation

MLOps Industrialization

A framework to transform experimental ML prototypes into robust, production-ready Python packages using src layout, hybrid architecture, and strict configuration management.

Views: 1 | ★ 1,408

Data Analysis, Automation, Research

margin-management

Monitor and manage a margin-living strategy by tracking balances, interest costs, and coverage ratios. Provides automated scaling recommendations and safety alerts based on portfolio-to-margin thresholds.

Views: 10 | ★ 302

Engineering, Research

evaluating-code-models

Evaluate code generation models using the BigCode Evaluation Harness. Benchmarks include HumanEval, MBPP, and MultiPL-E, with pass@k metrics for multi-language coding models.

Views: 19 | ★ 7,624 | #Evaluation #Code Generation #HumanEval #MBPP

Data Analysis, Research

meteorology-driver-classification

Classify and group meteorological and environmental variables into specific driver categories for consistent attribution analysis and environmental modeling.

Views: 9 | ★ 1,084

Research, Data Analysis, Engineering

literature-engineer

Evidence-first literature collector for automated research pipelines. Scales paper pools to 1,200+ with metadata normalization, provenance tracking, and multi-source ingestion.

Views: 9 | ★ 422

Engineering, Data Analysis, Automation

guard

Epistemic safety analysis for JSON data in prompts to prevent LLM hallucinations and reasoning errors when handling incomplete or large-scale datasets.

Views: 7 | ★ 13
