# Retail Analytics with a GPT-4o LangChain Agent

**Exploratory retail analytics powered by a LangChain pandas agent (GPT-4o). Python owns the stats, the agent owns the narrative.**

[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/AntonisXT/retail-analytics-gpt4o-langchain-agent/blob/main/retail_sales_analysis.ipynb)
[![Python 3.10+](https://img.shields.io/badge/Python-3.10%2B-blue?logo=python&logoColor=white)](https://www.python.org/)
[![LangChain 0.3](https://img.shields.io/badge/LangChain-0.3-green?logo=chainlink&logoColor=white)](https://www.langchain.com/)

<br>

<p align="center">
  <img src="plots/01_overview.png" width="85%">
</p>

<br>

## What this project does

Retail data is dense. 45 stores, 81 departments, 3 years of weekly sales enriched with markdown spend, CPI, unemployment, and store metadata. Enough to answer dozens of analytical questions, each of which normally means writing more aggregation code, more statistical boilerplate, more interpretation glue.

This notebook takes a different approach. A 421,570-row dataset gets the full analytical treatment **in Python**: statistical tests, time series decomposition, K-Means segmentation, and hypothesis testing. A **LangChain pandas agent backed by GPT-4o** then interprets every result, synthesises cross-sectional findings, and answers open-ended business questions against the live DataFrame in plain English. Python handles everything that needs to be reproducible. The agent handles everything that benefits from natural language.

<br>

## Why this exists

Most LLM-in-a-notebook demos fall into one of two traps: either the LLM does everything (including unreliable statistics), or it does nothing beyond summarising text. This notebook draws a deliberate line:

| What runs in Python | What runs through the agent |
|---|---|
| Welch t-tests, Mann-Whitney U, ANOVA, Tukey HSD | Interpreting pre-computed test results in business context |
| STL decomposition, ADF stationarity test | Explaining what the decomposition means for forecasting |
| K-Means clustering, PCA, silhouette validation | Naming clusters, identifying where they diverge from official labels |
| Pearson correlations, stratified confounding checks | Synthesising macro, markdown, and holiday effects simultaneously |
| All visualisations (matplotlib, seaborn) | Open-ended Q&A: querying the DataFrame directly via natural language |

Every formal number in the notebook is independently reproducible without the LLM. The agent adds the interpretive layer that makes the analysis readable.
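As a concrete instance of the left column, here is the parametric/non-parametric pairing on synthetic holiday vs non-holiday samples (the means and sample sizes are illustrative, not the dataset's):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
# Hypothetical weekly sales for holiday vs non-holiday weeks (synthetic data)
holiday = rng.normal(loc=22000, scale=5000, size=450)
non_holiday = rng.normal(loc=16000, scale=5000, size=6000)

# Welch t-test: compares means without assuming equal variances
t_stat, p_t = stats.ttest_ind(holiday, non_holiday, equal_var=False)

# Mann-Whitney U: the non-parametric parallel test on ranks
u_stat, p_u = stats.mannwhitneyu(holiday, non_holiday, alternative="two-sided")

print(f"Welch t: t={t_stat:.2f}, p={p_t:.3g}")
print(f"Mann-Whitney U: U={u_stat:.0f}, p={p_u:.3g}")
```

Only the resulting statistics and p-values are handed to the agent for business interpretation.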

<br>

## Key findings

**Type C stores are the most space-efficient format** at $0.24/sqft vs $0.12 for Type A, and the most sales-stable. Bigger is not more efficient.

**Thanksgiving dominates the holiday calendar at +39.8% uplift**, 13x the Super Bowl effect. The Christmas holiday flag marks the wrong week; the actual peak is the unflagged Week 51 ($157.9M).
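An uplift figure like this reduces to a groupby on the holiday flag. A minimal sketch with toy numbers (the `IsHoliday` and `Weekly_Sales` column names follow the Kaggle dataset; the values are made up):

```python
import pandas as pd

# Toy weekly-sales frame mirroring the dataset's holiday flag (synthetic numbers)
df = pd.DataFrame({
    "Week": [44, 45, 46, 47, 48, 51],
    "IsHoliday": [False, False, False, True, False, False],
    "Weekly_Sales": [100.0, 105.0, 110.0, 160.0, 108.0, 150.0],
})

means = df.groupby("IsHoliday")["Weekly_Sales"].mean()
uplift = (means[True] - means[False]) / means[False] * 100
print(f"Holiday uplift: {uplift:+.1f}%")
```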

**MarkDown2 and MarkDown4 deliver the strongest promotional lift** (~9-10%), with clear diminishing returns above the low-spend tercile. The marginal markdown dollar is far less effective than the first.

**Markdowns are deployed to already-strong stores** (r = 0.819 between store markdown spend and average sales), making the observed uplift partially a selection effect. A randomised rollout would be needed to isolate the true promotional lift.

**29 of 81 departments generate 80% of chain revenue.** Dept 72 doubles its sales during holiday weeks, making it the highest-leverage department for seasonal inventory planning.

**Type C stores grow in high-unemployment markets** while Types A and B decline. This resilience pattern is invisible in the official A/B/C store classification.

<br>

## Notebook structure

The notebook is organised into 16 sections. Each analytical section follows the same pattern: Python computes, the agent interprets, and a blue callout box evaluates the agent's output with additional context.

| # | Section | What happens |
|---|---|---|
| 1 | Setup | Pinned dependencies, imports, API key |
| 2 | Data | Three-table merge, feature engineering, quality checks |
| 3 | The LLM Agent | Agent initialisation with full domain context |
| 4 | Exploratory Analysis | Agent queries the DataFrame directly for distributional overview |
| 5 | Seasonality & Sales Patterns | Holiday uplift by event, YoY growth, STL decomposition, ADF test |
| 6 | Store Performance | Sales per sqft, volatility (CV), outlier identification |
| 7 | Markdown Effectiveness | Welch t-tests per type, stratified confounding checks, diminishing returns |
| 8 | Macroeconomic Context | Unemployment and CPI correlations, multicollinearity check |
| 9 | Store Segmentation | K-Means (k=4), PCA, silhouette validation, cluster profiling |
| 10 | Hypothesis Testing | H1-H4 with parametric + non-parametric parallel tests |
| 11 | Department Analysis | Revenue concentration, holiday sensitivity, YoY growth trajectories |
| 12 | Synthesis | Cross-sectional analysis combining all prior findings |
| 13 | Open-Ended Q&A | Agent answers ad-hoc questions against the live DataFrame |
| 14 | Executive Summary | Agent-generated board-ready narrative with verified numbers |
| 15 | Limitations & Caveats | What the analysis can and cannot claim |
| 16 | Reflections | What the agent enabled and what required careful prompt design |
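The segmentation step (Section 9) follows a standard scikit-learn pattern. A self-contained sketch on synthetic store-level features (the feature names and the three planted groups are assumptions for illustration; k=4 matches the notebook):

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.decomposition import PCA
from sklearn.metrics import silhouette_score
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(7)
# Synthetic store features: mean sales, volatility (CV), size, markdown spend
X = np.vstack([
    rng.normal([20, 0.1, 150, 5], 1.0, size=(15, 4)),  # large, stable stores
    rng.normal([12, 0.3, 100, 3], 1.0, size=(15, 4)),  # mid-size stores
    rng.normal([6, 0.2, 40, 1], 1.0, size=(15, 4)),    # small stores
])
X_scaled = StandardScaler().fit_transform(X)

# K-Means with k=4, validated with the silhouette score
labels = KMeans(n_clusters=4, n_init=10, random_state=0).fit_predict(X_scaled)
score = silhouette_score(X_scaled, labels)

# PCA to 2 components for visualising the clusters
coords = PCA(n_components=2).fit_transform(X_scaled)
print(f"silhouette: {score:.2f}, PCA shape: {coords.shape}")
```

In the notebook the agent then takes over the right-column job from the table above: naming the resulting clusters and comparing them to the official A/B/C labels.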

<br>

## Tech stack

| Layer | Tools |
|---|---|
| LLM Agent | LangChain 0.3.23 · langchain_experimental 0.3.4 · OpenAI GPT-4o |
| Statistical tests | scipy · statsmodels (ADF, STL, Tukey HSD) |
| Machine learning | scikit-learn (K-Means, PCA, silhouette scoring) |
| Data & visualisation | pandas · numpy · matplotlib · seaborn |
| Environment | Google Colab (GPU not required) |

<br>

## How to run

**1. Clone the repo**
```bash
git clone https://github.com/AntonisXT/retail-analytics-gpt4o-langchain-agent.git
cd retail-analytics-gpt4o-langchain-agent
```

**2. Get the data.** Download the [Walmart Retail Dataset](https://www.kaggle.com/datasets/manjeetsingh/retaildataset) from Kaggle and place the three CSV files in the `data/` folder.

**3. Set up your OpenAI API key.**
  - **Colab:** Add it via Colab Secrets (🔑 icon in the left sidebar) as `OPENAI_API_KEY`
  - **Local:** Set it as an environment variable: `export OPENAI_API_KEY=your_key` (Windows CMD: use `set`, PowerShell: use `$env:`)

**4. Open the notebook.** Upload to [Google Colab](https://colab.research.google.com) (recommended) or run locally with Jupyter.

**5. Run all cells.** Dependencies install automatically. Expect ~15 minutes and ~$0.20 in API costs (GPT-4o).

> **Running locally?** Python 3.10+ required. Run `pip install -r requirements.txt` before starting.

<br>

## Dataset

[Walmart Retail Dataset](https://www.kaggle.com/datasets/manjeetsingh/retaildataset) from Kaggle. Three CSV files:

| File | Contents | Rows |
|---|---|---|
| `sales data-set.csv` | Store × Dept × Week × Sales | 421,570 |
| `Features data set.csv` | Markdowns, CPI, unemployment, temperature per store/week | 8,190 |
| `stores data-set.csv` | Store type (A/B/C) and size in sqft | 45 |

Place the three files in a `data/` directory at the project root.
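The three-table merge (Section 2) joins features and store metadata onto the sales grain. A minimal sketch using in-memory stand-ins for the CSVs (column names follow the Kaggle dataset; the values are illustrative):

```python
import pandas as pd

# In-memory stand-ins for the three CSVs
sales = pd.DataFrame({"Store": [1, 1, 2], "Dept": [1, 2, 1],
                      "Date": ["05/02/2010"] * 3,
                      "Weekly_Sales": [24924.5, 50605.3, 35034.1],
                      "IsHoliday": [False, False, False]})
features = pd.DataFrame({"Store": [1, 2], "Date": ["05/02/2010"] * 2,
                         "CPI": [211.1, 210.8], "Unemployment": [8.1, 8.3],
                         "IsHoliday": [False, False]})
stores = pd.DataFrame({"Store": [1, 2], "Type": ["A", "B"],
                       "Size": [151315, 202307]})

# Left-join onto the sales grain (Store × Dept × Week); drop the duplicate
# IsHoliday column from features before merging
df = (sales
      .merge(features.drop(columns="IsHoliday"), on=["Store", "Date"], how="left")
      .merge(stores, on="Store", how="left"))
print(df.shape)  # one row per Store × Dept × Week
```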

<br>

## Project structure

```
retail-analytics-gpt4o-langchain-agent/
├── retail_sales_analysis.ipynb
├── README.md
├── requirements.txt
├── .gitignore
├── data/
│   └── .gitkeep
└── plots/                               # Auto-generated visualisations
    ├── 01_overview.png
    ├── 02_seasonality.png
    ├── 03_store_performance.png
    ├── 04_markdowns.png
    ├── 05_clusters.png
    ├── 06_departments.png
    ├── 07_synthesis.png
    ├── stl_decomposition.png
    ├── silhouette.png
    ├── multicollinearity.png
    ├── outliers.png
    └── dept_yoy_growth.png
```

<br>

## Sample visualisations

<p align="center">
  <img src="plots/02_seasonality.png" width="85%"><br>
  <em>Weekly sales heatmap and monthly patterns by store type</em>
</p>

[truncated…]
