Joni

Data Engineering Project

End-to-End Data Pipeline & BI Dashboard

Automated Superstore sales data pipeline from raw CSV ingestion to PostgreSQL analytics and Power BI dashboard reporting.

Sales performance dashboard preview
9,994 records processed
PostgreSQL analytics database
Power BI dashboard output

Business Problem

This project simulates an e-commerce reporting workflow that answers business questions around revenue trends, top revenue-generating products, regional performance, and active customers.

Pipeline Architecture

CSV data source -> Python extract, validate, and transform scripts -> PostgreSQL table -> SQL analysis -> BI dashboard.

CSV Python pandas PostgreSQL SQL Power BI

Workflow

Key Features

Result

The pipeline processed 9,994 Superstore records, generated a cleaned dataset, loaded it into PostgreSQL, and produced dashboard-ready insights including revenue trend, customer count, order volume, regional revenue split, and top-performing products.

Repository

The full source code, SQL queries, dashboard file, dataset folders, and pipeline logs are available on GitHub.

View on GitHub
← Back to Home