senior-data-engineer
World-class senior data engineering skill for building scalable data pipelines, ETL/ELT systems, and modern data infrastructure using Python, Spark, dbt, and Kafka.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
158 skills found
World-class senior data engineering skill for building scalable data pipelines, ETL/ELT systems, and modern data infrastructure using Python, Spark, dbt, and Kafka.
Guided, systematic feature development agent that orchestrates codebase exploration, architectural design, implementation, and automated testing.
A multi-paradigm ETL pipeline agent supporting batch and streaming data processing, schema inference, and configurable DAG-based transformations for heterogeneous data sources.
A comprehensive guide for designing high-performance, maintainable PostgreSQL database schemas, covering best practices, data types, indexing, and advanced features.
Build and orchestrate end-to-end MLOps pipelines covering data preparation, training, validation, and automated deployment.
Build production-grade RAG systems using vector databases, semantic search, and LangGraph to ground LLMs in external knowledge.
Open-source infrastructure for reliable, multi-destination event delivery. Route webhooks to HTTP, SQS, RabbitMQ, Pub/Sub, EventBridge, or Kafka with built-in retries and observability.
Persistent, semantic long-term memory for AI agents. Save, query, and retrieve cross-session dialogues, decisions, and multimodal context using semantic compression.
Structured, template-driven workflow for end-to-end feature development including coding, automated testing, verification, and session-based improvement.
Build RAG systems to ground LLMs in proprietary data. Includes vector database integration, embedding strategies, hybrid search, and advanced retrieval patterns for FastAPI backends.
Specialized IDF (Information Display Frame) sub-agent for generating and reviewing CQRS Query Side implementations across Java, TypeScript, and Go.
Train and manage neural networks in distributed E2B sandboxes using the Flow Nexus platform, supporting custom architectures like Transformers, LSTMs, and GANs.