scholar-evaluation
Systematically evaluate scholarly work using the ScholarEval framework, providing structured, quantitative, and qualitative assessment across research quality dimensions with actionable feedback.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
160 skills found
Systematically evaluate scholarly work using the ScholarEval framework, providing structured, quantitative, and qualitative assessment across research quality dimensions with actionable feedback.
Monitor and manage margin-living strategy by tracking balances, interest costs, and coverage ratios. Provides automated scaling recommendations and safety alerts based on portfolio-to-margin thresholds.
Fast-reference guide and utility skill for Helm chart development, template syntax, and Kubernetes application deployment.
Build systematic evaluation frameworks for AI agents using multi-dimensional rubrics, LLM-as-a-judge, and regression testing to measure performance, quality, and context engineering effectiveness.
Universal CLI tool to convert and synchronize AI agent skills between Claude Code and Gemini CLI extensions.
Your AI chief of staff for startups. Includes 28 commands for idea validation, financial modeling, pitch decks, market research, and CEO operational frameworks from industry benchmarks.
Automated quality assurance system that validates markdown deliverables against defined checklists for PB-000 market research workflows.
Advanced QE reporting, quality dashboards, and predictive analytics for test metrics, code coverage, and deployment readiness to drive data-informed quality decisions.
AI-driven web testability assessment using 10 core principles. Evaluates observability, controllability, and stability via Playwright and Vibium to identify testing bottlenecks and improve quality readiness.
Stress-test existing product feature ideas by identifying risky assumptions across Value, Usability, Viability, and Feasibility using a multi-perspective devil's advocate framework.
Analyze Claude Code session history to identify inefficiencies, optimize token usage, and suggest workflow improvements.
Advanced prompt rewriting and optimization service. Analyzes prompts for clarity, specificity, and structure, providing actionable improvements, variations for testing, and prompt engineering best practices.