This project focuses on analyzing vendor performance, cost efficiency, and inventory optimization using real-world retail and wholesale data.
It combines Python (for data ingestion, transformation, and analysis) with Power BI (for interactive visualization) to deliver actionable insights for management.
The objective is to identify top-performing vendors, inefficient cost patterns, and profitability opportunities to enhance strategic decision-making.
Efficient inventory and vendor management are critical for business profitability.
This project aims to:
- Identify underperforming brands requiring promotional or pricing adjustments.
- Determine top vendors contributing to sales and gross profit.
- Analyze bulk purchasing impact on unit cost and profitability.
- Assess inventory turnover to reduce holding costs and improve efficiency.
- Investigate profitability variance between high- and low-performing vendors.
- Imported six raw datasets into SQLite using a Python ETL pipeline:
purchases.csvpurchase_prices.csvvendor_invoices.csvbegin_inventory.csvend_inventory.csvsales.csv
- Built robust scripts with logging and SQLAlchemy for data reliability and traceability.
- Created a consolidated table
vendor_sales_summaryusing SQL joins and feature engineering. - Added calculated fields for analysis:
GrossProfit = TotalSalesDollars - TotalPurchaseDollarsProfitMargin = (GrossProfit / TotalSalesDollars) * 100StockTurnover = TotalSalesQuantity / TotalPurchaseQuantitySalesToPurchaseRatio = TotalSalesDollars / TotalPurchaseDollars
- Conducted in-depth EDA using Pandas, Matplotlib, and Seaborn to identify trends and vendor performance patterns.
- The notebook helped finalize the dataset for dashboard creation.
- Built two interactive Power BI dashboards to summarize KPIs and insights for management-level decision-making.
-
Total Sales: π° $93.1M
-
Total Purchases: π $73.6M
-
Gross Profit: π $19.5M
-
Top Vendors by Sales & Profit:
-
π₯ DIAGEO NORTH AMERICA INC
-
π₯ MARTIGNETTI COMPANIES
-
π₯ PERNOD RICARD USA
-
-
Freight Cost: 3.6% of Total Sales β relatively stable among high-performing vendors.
-
Profit Margin Trends: Top vendors maintain >25% profit margins.
-
Stock Turnover Ratio: Efficient vendors average ~1.2x inventory cycles per period.
-
Vendors with high freight-to-sales ratios (>5%) show declining profit margins.
-
Bulk purchasing led to an average unit cost reduction of 72%, proving scale efficiency.
-
Underperforming brands identified with low Sales-to-Purchase ratios (<1.0).
-
Inventory Efficiency: Vendors maintaining 1.0β1.5 stock turnover ratios show optimal stock management.
-
Highlighted vendor-wise profitability variance for strategic negotiations.
-
DIAGEO NORTH AMERICA INC and MARTIGNETTI COMPANIES dominate both sales and profit metrics.
-
Freight cost significantly affects vendor profitability, especially for heavy-volume shipments.
-
Vendors with balanced Sales-to-Purchase ratios and high Stock Turnover yield the best ROI.
-
Identified opportunities for pricing optimization and vendor diversification.
| Tool / Technology | Purpose |
|---|---|
| Python | Data ingestion, transformation, and cleaning using Pandas |
| SQLite | Database for structured storage and SQL-based aggregation |
| Power BI | Dashboard creation, DAX calculations, and visualization |
| Power Query | Data cleaning and shaping for BI integration |
| Pandas | Data manipulation, merging, and EDA in Python |
| Matplotlib & Seaborn | Exploratory data visualization and insights |
| SQLAlchemy | Python ORM used for database connectivity and ingestion |
| Logging Module | Tracks execution steps and records data pipeline events |
| Jupyter Notebook | Used for data exploration and performance analysis |
- End-to-end company-level vendor analytics project
- Combines Python, SQL, and Power BI
- Includes full data pipeline, EDA, and BI dashboards
π¦ Vendor_Performance_Analysis
β
βββ Vendor_Performance_Analysis.pdf # Project report containing 2 dashboards
βββ Vendor_Performance_Report.pdf # Final Project Report
βββ README.md # Project documentation
β
βββ data/
β βββ vendor_sales_summary.csv # Final cleaned dataset used in analysis & Power BI
β
βββ scripts/
β βββ ingestion_db.py # Data ingestion and database creation
β βββ get_vendor_summary.py # SQL joins and vendor summary generation
β
βββ notebook/
β βββ Exploratory Data Analysis.ipynb # Initial data exploration
β βββ Vendor Performance Analysis.ipynb # Main analytical notebook
β
βββ dashboard/
β βββ Vendor_Performance_Analysis.pbix # Power BI dashboard file
β
βββ images/
βββ Executive_summary.png # Dashboard 1 preview
βββ Cost_Efficiency_and_Inventory.png # Dashboard 2 preview
This project demonstrates how to combine data engineering, analytics, and business intelligence for decision-making at a company level.
β
Identified top and underperforming vendors
β
Analyzed freight cost efficiency and stock turnover
β
Improved understanding of sales-to-purchase profitability
β
Delivered executive-level dashboards for actionable insights
- Automate data refresh from live databases
- Build predictive models for vendor performance forecasting
- Integrate Power BI service for real-time dashboard updates
π€ Harsh Belekar
π Data Analyst | Python | SQL | Power BI | Excel | Data Visualization
π¬ LinkedIn | π GitHub
π§ [email protected]
β If you liked this project, donβt forget to star the repo and connect with me on LinkedIn!

