Delivery Delay Risk & Value Optimization
Quantified where delivery delays truly destroy value and enabled targeted intervention on the small set of orders where action pays off.
Context
Delivery delays at Olist are rare but disproportionately costly, making blanket operational intervention inefficient. This project built a decision system to predict delivery delay risk at order creation and determine whether proactive intervention creates net business value.
Why it mattered: Using 100K+ real marketplace orders, delay risk was shown to be highly concentrated across distance, cross-state logistics, and a small subset of sellers. A ranking-based machine learning model doubled Precision@10% versus heuristic rules, enabling intervention on only the top 10% highest-risk orders. Crucially, model performance was reframed in economic terms, identifying break-even intervention cost thresholds and supporting a defensible operational rollout via an executive dashboard.
Problem Statement
Can delivery delays be predicted early enough to enable targeted intervention — and does acting on those predictions create positive economic value?
- Delivery delays are a rare event (high class imbalance)
- Predictions must rely only on information available at order creation
- Operational teams have limited capacity to intervene
Approach
Hypothesis: Delivery delay risk is driven by observable structural factors such as distance, cross-state logistics, and seller behavior, and can be concentrated into a small subset of orders.
- Constructed a canonical order-level fact table with strict grain control
- Diagnosed structural delay drivers using geography and seller behavior
- Benchmarked simple rule-based heuristics as operational baselines
- Trained ranking-based ML models optimized for Precision@K
- Translated model output into a cost-based decision framework
- Operationalized insights via an executive dashboard and playbook
Analysis
Methods: Order-level ETL and data validation, Geographic and logistics enrichment, Seller concentration (Pareto) analysis, Rule-based baselines vs ML ranking models, Cost-sensitive threshold evaluation
Metrics: Precision@10%, ROC-AUC
Data: 100K+ Brazilian e-commerce orders (2016–2018), enriched with seller, customer, product, review, and geolocation data.
Key Insights
- Delivery delay risk is structural, increasing sharply with shipment distance.
- Cross-state logistics introduce meaningful administrative and operational friction.
- Delay risk is highly concentrated: ~13% of sellers account for ~80% of delays.
- Rule-based heuristics capture some risk but are operationally inefficient.
- A ranking-based model materially improves intervention precision at the margin.
Impact
- Enabled selective intervention instead of blanket operational action
- Improved efficiency by focusing only on the highest-risk orders
- Aligned analytics output directly with economic decision-making
- Precision@10% improved to ~5.4% (≈2× uplift vs heuristic baselines)
- Identified clear break-even intervention cost thresholds
Trade-offs & Limitations
- Optimized for ranking precision rather than binary accuracy
- Selected a Top-10% operating threshold to match capacity
- Prioritized interpretable, business-grounded features
- External logistics variability introduces unavoidable noise
- Economic outcomes are sensitive to intervention cost discipline
- Causal impact requires controlled experimentation
Outcome & Next Steps
Delivered a deployable, economically defensible delivery-delay decision system supported by a six-page executive dashboard and a clear operational playbook.
- Run randomized A/B tests to measure causal intervention impact
- Integrate predictions into routing and seller SLA workflows
- Monitor data drift, seller mix shifts, and economic ROI over time
Discussion & Perspectives
This project is open for analytical discussion—questions on assumptions, methodology, and business interpretation are encouraged.
Join the discussion on GitHub