evidence-first-debugging

Introduction

The evidence-first-debugging skill is a professional-grade diagnostic framework designed for software engineers, site reliability engineers (SRE), and QA analysts. It shifts the debugging paradigm from intuition-based guessing to a rigorous, scientific-method approach. By mandating the use of a 15-section Unified Investigation Template, this skill ensures that every claim is anchored to verifiable signals, preventing common traps like correlation-causation fallacies, incomplete verification, and ungrounded speculation during complex incident responses.

Structured Observation Recording: Mandates that all FACTS, OBSERVATIONS, and RESULTS are tagged with unique evidence IDs [En] to maintain an audit trail for every investigation claim.
Hypothesis-Driven Testing: Requires explicit documentation of hypotheses, including clear prediction statements and falsifiable tests, ensuring that every debugging branch can be logically disproven or confirmed.
Causality Gate Validation: Implements strict classification rules for action-result links to ensure that code changes or configuration tweaks are backed by evidence rather than correlative guesswork.
Domain-Specific Extensions: Dynamically loads specialized debugging or performance modules (e.g., call stack analysis, dependency graphs, baseline metrics, and resource utilization) based on the investigation type (bug vs. performance regression).
Verification Gates: Enforces a requirement that investigations cannot be marked as resolved-verified without an explicit, successful verification command or test case that addresses the original issue.

Usage and Constraints:

Ideal for debugging software bugs, crashes, flaky tests, memory leaks, latency regressions, and complex performance throughput issues.
The skill requires input in the form of system signals, logs, or metrics. Outputs are strictly formatted; any abbreviated output must include a mandatory truncation disclosure block (total lines, method, fingerprint, and command).
Users should expect to interact with the system by providing raw signal data and following the agent's prompts to move through the Investigation Template sections (0-14).
Key keywords for search and integration include: root cause analysis, causality check, debugging extension, performance monitoring, investigation template, flaky tests, evidence-based development, verification gate, and scientific method for software.

Startup Courses

Online Courses

Physical Courses

evidence-first-debugging

Introduction

Repository Stats