data-engineer
Specialized data engineering agent for designing ETL/ELT pipelines, defining data schemas, managing data quality, and implementing robust ingestion workflows.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
119 skills found
Specialized data engineering agent for designing ETL/ELT pipelines, defining data schemas, managing data quality, and implementing robust ingestion workflows.
World-class senior data engineering skill for building scalable data pipelines, ETL/ELT systems, and modern data infrastructure using Python, Spark, dbt, and Kafka.
Implement production-grade data quality validation using Great Expectations, dbt tests, and data contracts to ensure reliable pipelines.
Create, alter, and validate Snowflake semantic views using the Snowflake CLI.
Generate optimized SQL queries from natural language. Supports BigQuery, PostgreSQL, MySQL, and Snowflake. Analyze database schemas, interpret business requirements, and output ready-to-run queries with explanations.
Create, alter, and validate Snowflake semantic views via the CLI. Automate the generation, documentation, and testing of semantic layer definitions to ensure model accuracy and star schema compliance.
Create, manage, and debug dlt (data load tool) pipelines for ingesting data from APIs, databases, and custom sources into destinations like DuckDB, BigQuery, and Snowflake.
Database schema validation, data integrity testing, migration validation, transaction isolation, and query performance testing. Ensure ACID compliance and referential integrity for data-driven applications.
AI-optimized artifact tracking system for token-efficient project orchestration, phase management, and automated task delegation using YAML-Markdown hybrid formats.
PostgreSQL schema and migration expert for Diddit. Manages idempotent SQL files, tables, indexes, and constraints following strict camelCase conventions and transactional safety.
Run, debug, and manage DBHub tests including unit, integration with Testcontainers, and database-specific suites. Perfect for verifying code changes and troubleshooting database connector issues.
A modular data processing tool for cleaning, validating, and analyzing CSV files with support for custom transformations and automated dependency management.