A full-stack logistics analytics solution designed to forecast delivery delays, optimize routes, and enhance operational efficiency across regions, modes, and weather conditions. This project analyzes 200,000+ delivery records to empower supply chain teams with predictive insights and interactive dashboards.
๐ GitHub Project Repository
๐ Click to view Smart-Delivery-Optimization-Platform
๐ง Project Overview
Delivery reliability is critical for customer satisfaction and cost control. This project delivers an end-to-end analytics platform that enables:
- ๐ Delay prediction across regions and delivery modes
- ๐ฆ๏ธ Weather impact analysis on delivery performance
- ๐ Route optimization and time deviation tracking
- ๐ Executive dashboards for logistics planning
๐ฏ Key Objectives
- Clean and preprocess delivery data
- Engineer features for delay modeling and dashboarding
- Build classification models to predict delivery status
- Deploy interactive dashboards for stakeholder decision-making
๐ Project Structure
File Name |
Description |
delivery_data.csv |
Raw dataset with 200K+ delivery records |
cleaned_delivery_data.csv |
Preprocessed dataset with engineered features |
delivery_delay_model.pkl |
Trained model for predicting delivery delays |
smart_delivery.sql |
SQL queries for delivery filtering and aggregation |
sqlconnect.py |
Python script for SQL database connection |
app.py |
Streamlit app for model testing and backend logic |
smart_delivery.ipynb |
Jupyter notebook with EDA, modeling, and insights |
Smart Delivery Optimization Platform dashboard.pbix |
Power BI dashboard visualizing delivery KPIs and trends |
๐งน Data Preprocessing
- Converted
start_time
and end_time
to datetime format
- Calculated delivery duration and delay metrics
- Encoded categorical features (
region
, delivery_mode
, weather
, status
)
- Removed duplicates and ensured type consistency
- Normalized
distance_km
and estimated_time_min
for modeling
๐ Exploratory Data Analysis
- ๐ Delay trends by region and delivery mode
- ๐ฆ๏ธ Weather impact on delivery performance
- โฑ๏ธ Distance vs. time deviation analysis
- ๐ Status distribution across delivery types
- ๐
Hourly and daily delivery patterns
๐ค Modeling Approach
- Target Variable:
status
(Delivered, Delayed, Failed)
- Algorithms Used: Random Forest, XGBoost, Logistic Regression
- Evaluation Metrics: Accuracy, F1 Score, Confusion Matrix
- Top Features:
distance_km
, estimated_time_min
, weather
, region
๐ Dashboard Overview
๐ท Power BI Dashboard
Visualizes delivery performance and predictive insights:
- ๐ Regional delivery metrics
- ๐ Mode-wise delay analysis
- ๐ฆ๏ธ Weather impact visualization
- โฑ๏ธ Estimated vs. actual time comparison
- ๐ Status breakdown and forecasting



๐ข Streamlit App
Interactive dashboard for real-time delivery insights:
- ๐ Input delivery attributes to predict status
- ๐ Feature importance visualization
- ๐ Filters by region, mode, and weather
- ๐ Delay forecasting and performance benchmarking




๐ Deployment
- Model serialized with
joblib
as delivery_delay_model.pkl
- Dashboard deployed via Streamlit Cloud
- SQL integration for dynamic delivery querying
- Power BI dashboard built from cleaned dataset
- Git LFS used for large file management
๐ง Business Impact
- Predicts delivery delays for proactive planning
- Improves route and mode selection
- Enhances customer satisfaction through transparency
- Reduces operational inefficiencies and cost overruns
๐ ๏ธ Tech Stack
- Python: Pandas, NumPy, Scikit-learn
- SQL: Data extraction and filtering
- Visualization: Power BI, Matplotlib, Seaborn
- Deployment: Streamlit Cloud, GitHub, Git LFS
๐ Future Enhancements
- Integrate real-time traffic and weather APIs
- Add geospatial mapping for route optimization
- Enable alert system for high-risk deliveries
- Expand dashboard to include fleet performance metrics
๐ค Author
Anesh Raj
๐ GitHub Profile