polars
High-performance in-memory DataFrame library for Python and Rust. Features lazy evaluation, parallel execution, and an Apache Arrow backend for efficient ETL, data processing, and faster pandas alternatives.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
120 skills found
High-performance in-memory DataFrame library for Python and Rust. Features lazy evaluation, parallel execution, and an Apache Arrow backend for efficient ETL, data processing, and faster pandas alternatives.
Interactive Python graphing library for 40+ chart types, scientific visualizations, statistical analysis, and web dashboards using Plotly Express and Graph Objects.
Generates data cleaning pipelines for pandas/polars/PySpark, handling missing values, duplicates, outliers, type conversions, and validation.
Comprehensive Python healthcare AI toolkit for clinical data processing, medical coding translation, and developing deep learning models like RETAIN and Transformers for EHR, physiological signals, and clinical prediction tasks.
Trace Rspack Rust function calls using LLVM XRay for performance analysis, troubleshooting, and visualization of execution flow.
Load, validate, and preprocess weekly insurance policy CSV data with intelligent period detection and standardization.
Read and analyze any data file (CSV, JSON, Parquet, Avro, Excel, etc.) or remote URL (S3, HTTPS) using DuckDB. Automatically detect file formats and preview/profile datasets.
Python skill for high-performance storage of chunked N-dimensional arrays using Zarr, supporting cloud storage (S3/GCS), parallel I/O, and integration with NumPy, Dask, and Xarray.
A modular data processing tool for cleaning, validating, and analyzing CSV files with support for custom transformations and automated dependency management.
Python library for geospatial vector data analysis. Perform spatial joins, geometric operations, coordinate transformations, and mapping using GeoPandas, shapely, and interactive tools.
Infrastructure for cross-product HealthSim data persistence, entity correlation via SSN, and DuckDB database operations.
A comprehensive PDF toolkit for extracting text/tables, merging, splitting, rotating, and programmatically generating or filling PDF documents using Python and CLI tools.