Verification & Quality Assurance
A robust verification and QA system for software agents featuring real-time truth scoring, automated code validation, and instant rollback capabilities to maintain high reliability.
Structured manuscript and grant review assistant utilizing checklist-based evaluation for methodology, statistical validity, and compliance with reporting standards like CONSORT and STROBE.
Fetch and analyze currently trending programming models from OpenRouter. Ideal for selecting models for reviews, optimizing AI costs, and staying updated on AI coding trends with real-time pricing and context window data.
A framework for managing the end-to-end LLM project lifecycle, from evaluating task-model fit and pipeline architecture design to implementing structured output parsing and agent-assisted development.
Convert natural language queries to safe, optimized SQL. Automates database interactions with schema awareness and parameterized query generation.
Decompose financial variances into drivers with narrative explanations and waterfall analysis. Optimize budget vs. actual reporting, P&L commentary, and forecast reconciliation.
Validates Excel exports for Customer Feedback Analyzer with 7 specific view sheets, 36 columns, and precise color-coded formatting. Ensures zero errors in customer-facing deliverables.
Comprehensive UI testing, visual fidelity analysis, and browser debugging using Chrome DevTools MCP and AI-driven vision models.
Build comprehensive 3-5 year startup financial models, including revenue projections, cost structures, cash flow analysis, and scenario planning for fundraising and operations.
Apply behavioral science, mental models, and psychological principles to marketing strategy, copywriting, and decision-making.
Pre-implementation confidence assessment tool for developers. Ensures 90%+ readiness via duplicate checks, architecture compliance, official docs verification, and root cause analysis.
A unified interface for integrating and managing LLM chat providers like OpenAI, Anthropic, Google, Azure, and Bedrock within LangChain applications.