Engineering
solve-challenge avatar

solve-challenge

A CTF solver agent that performs triage on challenges, identifies the vulnerability category, and routes tasks to specialized skills for web, pwn, crypto, forensic, and reverse engineering analysis.

Introduction

The solve-challenge skill acts as the primary orchestrator and entrypoint for CTF (Capture The Flag) competitions within an agentic environment. Designed for security researchers and CTF participants, it automates the initial triage process when provided with a challenge bundle, remote service, suspicious file, or a vague technical description. By analyzing file signatures, header metadata, and service behavior, it identifies the dominant attack surface and dynamically dispatches sub-tasks to specialized skills such as ctf-web, ctf-pwn, ctf-crypto, ctf-reverse, ctf-forensics, ctf-osint, or ctf-malware.

  • Performs automated reconnaissance: includes file analysis using strings, xxd, binwalk, and checksec for binary triage.

  • Handles CTFd platform detection: supports API-driven interaction with CTFd instances for challenge retrieval and flag submission.

  • Environment lifecycle management: provides modular installation scripts for toolsets across Python, Go, Ruby (gems), and system-level apt/brew packages.

  • Cross-disciplinary routing: intelligently pivots between categories (e.g., transitioning from forensics to crypto when encrypted data is discovered).

  • Context-aware investigation: executes network probing, interactive service testing via netcat, and protocol analysis.

  • Users should invoke this skill when the initial vector is unknown; once the category is identified, it chains into the appropriate specialized skill for deeper exploitation.

  • Requires a filesystem-enabled agent with bash, Python 3, and internet access to facilitate remote fetching and tool installation.

  • Users must handle API tokens for remote CTF platforms securely as instructed in the workflow guidelines.

  • Useful for automated parsing of challenge descriptions to determine if the task requires web exploitation (SQLi, XSS, SSRF), memory corruption (buffer overflow, ROP, heap), or classical/modern cryptographic analysis (RSA, AES, ECC).

Repository Stats

Stars
1,801
Forks
238
Open Issues
0
Language
Python
Default Branch
main
Sync Status
Idle
Last Synced
May 1, 2026, 09:31 AM
View on GitHub