kreuzberg
High-performance document intelligence library for extracting text, tables, code, and metadata from 91+ file formats, with OCR and LLM-ready output.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
130 skills found
High-performance document intelligence library for extracting text, tables, code, and metadata from 91+ file formats, with OCR and LLM-ready output.
Implement production-grade data quality validation using Great Expectations, dbt tests, and data contracts to ensure reliable pipelines.
Provision and manage Railway database services (Postgres, Redis, MySQL, MongoDB) with automated configuration and environment wiring.
Create professional data visualizations with Python using matplotlib, seaborn, and plotly. Includes chart selection guidance, design principles, accessibility standards, and code patterns for publication-quality figures.
Convert natural language queries to safe, optimized SQL. Automates database interactions with schema awareness and parameterized query generation.
A local RAG semantic memory system using Qdrant and Ollama. Ideal for recalling workspace files, notes, project decisions, and user preferences with high-relevance vector search.
Advanced QE reporting, quality dashboards, and predictive analytics for test metrics, code coverage, and deployment readiness to drive data-informed quality decisions.
Proven patterns for extracting, caching, and processing analytics data from GA4 and GSC using MCP servers.
Generates data cleaning pipelines for pandas/polars/PySpark, handling missing values, duplicates, outliers, type conversions, and validation.
AWS DynamoDB engineering assistant for schema design, query optimization, single-table patterns, and infrastructure management using Boto3 and AWS CLI.
Load, validate, and preprocess weekly insurance policy CSV data with intelligent period detection and standardization.
Automate your daily Milan news digest with this Python-based briefing tool. Supports weather, strikes, world/AI/Italian news, and event scraping, featuring deduplication, RSS/API pipeline management, and AI-agent ready scheduling.