data-engineer
Specialized data engineering agent for designing ETL/ELT pipelines, defining data schemas, managing data quality, and implementing robust ingestion workflows.
107 skills found
Data Analysis Specialist for EDA, statistical modeling, SQL queries, and Python-based visualization. Turn raw datasets into actionable insights through rigorous quantitative methods.
Senior-level data engineering skill for building scalable data pipelines, ETL/ELT systems, and modern data infrastructure using Python, Spark, dbt, and Kafka.
Create, manage, and debug dlt (data load tool) pipelines for ingesting data from APIs, databases, and custom sources into destinations like DuckDB, BigQuery, and Snowflake.
Expert SQL agent for modern database systems, query optimization, HTAP environments, and data architecture patterns. Optimize performance, schema design, and analytical workloads effectively.
A multi-paradigm ETL pipeline agent supporting batch and streaming data processing, schema inference, and configurable DAG-based transformations for heterogeneous data sources.
Proven patterns for extracting, caching, and processing analytics data from GA4 and GSC using MCP servers.
Optimize Apache Spark jobs with partitioning strategies, memory management, shuffle tuning, and data skew mitigation for high-performance data processing pipelines.
Command-line toolkit for SQL database management: schema design, query optimization, migrations, and performance debugging for SQLite, PostgreSQL, and MySQL.
Implement production-grade data quality validation using Great Expectations, dbt tests, and data contracts to ensure reliable pipelines.
AWS DynamoDB engineering assistant for schema design, query optimization, single-table patterns, and infrastructure management using Boto3 and AWS CLI.
Manage dlt data pipelines and Temporal workflows for the SignalRoom marketing platform. Sync sources like Everflow, Redtrack, and S3 to Postgres, check status, and debug ingestion.