always-works-testing
Mandatory execution-based validation for all software implementation tasks. Ensures code works through empirical verification before confirmation.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
154 skills found
Mandatory execution-based validation for all software implementation tasks. Ensures code works through empirical verification before confirmation.
Create, manage, and debug dlt (data load tool) pipelines for ingesting data from APIs, databases, and custom sources into destinations like DuckDB, BigQuery, and Snowflake.
Enforces a strict evidence-based debugging workflow using structured observation, hypothesis testing, and causality validation to eliminate speculation in technical investigations.
Add evlog framework integration: automate wide-event logging across your stack with standardized middleware, build configurations, testing, and documentation.
Reference for all MCP tools exposed by the CCOS server, enabling capability discovery, session management, and governed RTFS execution for autonomous agent workflows.
A systematic workflow to instrument, evaluate, and monitor LLM applications using TruLens, supporting frameworks like LangChain, LangGraph, and LlamaIndex.
Build systematic evaluation frameworks for AI agents using multi-dimensional rubrics, LLM-as-a-judge, and regression testing to measure performance, quality, and context engineering effectiveness.
Expert guidance for configuring FeatBit observability via OpenTelemetry. Use for setting up metrics, logs, traces, and connecting OTEL backends like Seq, Jaeger, or Prometheus for FeatBit backend monitoring.
Refactors SwiftUI views for clean architecture, consistent property ordering, efficient dependency injection, and correct @Observable state management.
Validate WebSocket and HTTP stream health for WaveCap-SDR. Measure latency, throughput, packet loss, and signal quality for audio, spectrum, and IQ streams.
Systematic technical conversation logging for developers and engineers. Captures decisions, implementation details, and session outcomes with factual precision.
A meta-skill for building robust AI agent skills using a TDD approach: define failure (RED), implement the skill (GREEN), and plug rationalization loopholes (REFACTOR).