spark-optimization
Optimize Apache Spark jobs with partitioning strategies, memory management, shuffle tuning, and data skew mitigation for high-performance data processing pipelines.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
227 skills found
Optimize Apache Spark jobs with partitioning strategies, memory management, shuffle tuning, and data skew mitigation for high-performance data processing pipelines.
Definition of Done (DoD) verification workflow that triggers automatically upon implementation completion to ensure quality, document evidence, and standardize reporting.
Expert guidance for Neo4j Cypher queries and MCP server tools, focusing on schema introspection, graph operations, and efficient database development workflows.
Orchestrate cross-browser, cross-device, and responsive design testing using cloud providers like BrowserStack and Playwright to ensure consistent user experiences.
Autonomous multi-team codebase improvement agent with specialized modes: narrow (goal-directed), broad (hypothesis-divergent), and sweep (quality-focused).
Implement Linkerd service mesh patterns for security, traffic policy management, and zero-trust networking in Kubernetes environments.
Prevents AI hallucination and ensures evidence-based, verifiable outputs when analyzing code, reviewing technical documents, or providing recommendations.
AI-powered documentation engine that automatically generates C4 architecture diagrams, technical specs, and codebase analysis from any source code directory.
RPI Plan Phase: Create chunk-based, dependency-aware implementation plans from research documents for structured, atomic development.
Scaffold custom React Flow node components with TypeScript, Zustand integration, and standard handles for visual workflow editors.
Build no-code MCP servers that orchestrate tools as directed graphs using YAML for data transformation, conditional routing, and automated workflows.
A Notion-based tracking system for tweet performance to enable data-driven content experimentation using reinforcement learning principles.