Extract structured data from unstructured files (PDF, PPTX, DOCX...)
Implement LlamaExtract for robust structured data extraction from PDF, DOCX, and PPTX files using Pydantic schemas.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
High-performance in-memory DataFrame library for Python and Rust. Features lazy evaluation, parallel execution, and an Apache Arrow backend for efficient ETL and data processing as a faster alternative to pandas.
Perform cohort analysis on user engagement data. Identify retention trends, feature adoption rates, churn patterns, and generate actionable research recommendations through quantitative data analysis.
A multi-paradigm ETL pipeline agent supporting batch and streaming data processing, schema inference, and configurable DAG-based transformations for heterogeneous data sources.
Search, retrieve, and manage your KUNGFU.SH bookmarks programmatically to streamline your research and knowledge management workflows.
Automated job search management for the Lofy AI assistant: track applications, tailor resumes, prepare for interviews, manage follow-ups, and analyze career pipelines.
Automate Excel report generation from CSVs, databases, or data structures using pandas and openpyxl. Supports chart creation, custom styling, template-based workflows, and data analysis.
Manage dlt data pipelines and Temporal workflows for the SignalRoom marketing platform. Sync sources like Everflow, Redtrack, and S3 to Postgres, check status, and debug ingestion.
Designs and implements professional, interactive filtering UX for data tables based on column data types.
Infrastructure for cross-product HealthSim data persistence, entity correlation via SSN, and DuckDB database operations.
Manage, sync, and transfer files between local storage and cloud providers like S3, Cloudflare R2, Backblaze B2, Google Drive, and Dropbox using rclone.
Automate GitHub issue triage by analyzing reports against the codebase, verifying technical claims, and providing expert-driven responses to resolve invalid issues.