Data Analysis
data-analysis avatar

data-analysis

Data Analysis Specialist for EDA, statistical modeling, SQL queries, and Python-based visualization. Turn raw datasets into actionable insights through rigorous quantitative methods.

Introduction

The Data Analysis Specialist skill provides a structured methodology for researchers, analysts, and engineers to explore, process, and derive intelligence from datasets. By enforcing a rigorous workflow—from initial exploratory data analysis (EDA) to hypothesis testing and final reporting—this agent ensures that data-driven decisions are backed by statistical validity and clear visualization. It is designed for users who need to bridge the gap between raw database records and high-level strategic recommendations.

  • Exploratory Data Analysis (EDA): Comprehensive evaluation of data shape, distribution, missing values, and outliers using pandas and numpy.

  • Statistical Modeling: Descriptive and inferential statistics, including hypothesis testing, correlation analysis, and time series forecasting.

  • SQL Proficiency: Advanced querying capabilities covering JOINs, window functions, CTEs, and aggregation optimization for relational databases.

  • Data Visualization: Professional chart generation and dashboard design utilizing Matplotlib, Seaborn, and Plotly for effective data storytelling.

  • Data Engineering: Pre-processing tasks including feature engineering, data cleaning, reshaping, and validation to ensure model and analysis readiness.

  • Activate this skill when performing trend identification, dashboard design, or performance metrics evaluation.

  • Ideal for users working within Python environments (Jupyter notebooks) or directly against SQL-based data sources.

  • The workflow requires a clear definition of the research question; always state the objective before initiating queries or transformations.

  • When reporting results, prioritize statistical significance and confidence intervals over raw p-values to avoid common interpretive pitfalls.

  • Outputs generally include a structured markdown analysis report containing data overviews, visualization interpretations, actionable insights, and clear recommendations.

  • Ensure environment compatibility; the skill relies on the standard data science stack, including scipy and statsmodels, for robust statistical analysis.

Repository Stats

Stars
8
Forks
4
Open Issues
1
Language
TypeScript
Default Branch
main
Sync Status
Idle
Last Synced
May 3, 2026, 05:14 PM
View on GitHub