TDS Archive

An archive of data science, data analytics, data engineering, machine learning, and artificial intelligence writing from the former Towards Data Science Medium publication.

22 Impacting Projects to Start Your Data Science for Supply Chain Journey

A list of Data Science for Supply Chain case studies that can be used to develop your skills and quickly impact your organization.

Samir Saci
TDS Archive
Published in
14 min readApr 9, 2021

--

A flowchart titled “Import Data, Select Objective, Visualize Results” shows a three-step process for supply chain optimization. The process begins with “Import Data” featuring a table icon, followed by “Select Objective” with a target icon. Below, three objectives are listed: minimize waste (factory icon), minimize cost (warehouse icon), and minimize CO2 emissions (truck icon). The final step is “Visualize Results” with icons representing a world map and data charts.
22 Impacting Projects to Start Your Data Science for Supply Chain Journey — (Image by Author)

If you want to impact your organization with data science, supply chain management is the best place to start your data science journey.

Examples of Data Science for Supply Chain Projects: Supply Planning, Sustainability and Transportation
Example of Supply Chain Projects — (Image by Author)

As a Data Scientist, how can I provide value to my organization?

When I was a Supply Chain Solution Designer, my job was to

  • Translate our customer's requirements into actual operational solutions (Retail, E-Commerce, Luxury, FMCG, Automotive)
  • Conduct re-engineering studies to improve warehouse operations and optimize transportation networks.

Leveraging data science, simulate supply chain scenarios to improve the performance and reduce costs.

In this article, you can find 22 major Data Science for Supply Chain case studies that can be applied to your operations by following detailed tutorials.

For each example, I share the Python source code with dummy data on GitHub so you can try to adapt the model for your projects.

I. Supply Chain Optimization with Python
Use data to produce and deliver products efficiently
1. What is Supply Chain Analytics?
2. Optimize production planning with Python
3. Supply Chain Optimization with Python
II. Warehouse Operations Optimization with Python
Support logistics operations to receive, store and prepare goods
1. Improve warehouse picking productivity with Python
2. Optimize Workforce Planning Python
3. Lean Six Sigma with Python using the Kruskal Wallis Test
III. Data Visualization with Python
Design interractive visualization to present results
1. Deploy Interactive Dashboard using Python Flask and javascript D3.js
IV. Inventory Management & Demand Forecasting
Forecasting and rules to minimize storage costs and avoid stock-outs
1. Inventory Management for Retail with Python
2. Machine learning for Retail Sales Forecasting with Python
V. Transportation Optimization with Python
Reduce costs by maximizing the efficiency of your routing
1. Transportation Network Analysis with Python using Graph Theory
2. Transportation Network Visualization using Python
VI. Data Science for Supply Chain Sustainability
Use analytics to reduce the environmental footprint of your company
1. Supply Chain Sustainability Reporting with Python
2. How sustainable is your circular economy?
3. What is Green Inventory Management?
4. Sustainable Supply Chain Optimization App
5. Automate ESG reporting with Python
6. Fight Greenwashing with Python
7. Leverage Data Analytics for Sustainability
VII. Generative AI for Supply Chain
How can we use LLM to improve user experience of analytics products?
1. Leveraging LLMs with LangChain for Supply Chain Analytics
2. Automate supply chain analytics with GPTs

Let us start with analytics for supply chain optimization.

What is Supply Chain Analytics?

A chart illustrating four types of supply chain analytics: Descriptive (“Which events happened?” with “Sales jumped by +20%”), Diagnostic (“Why did these events happen?” with “National Holidays Pushed the Sales”), Predictive (“What can happen?” with “Sales will increase by +10% next week”), and Prescriptive (“What should we do?” with “Replenish the store tomorrow”).
What is Supply Chain Analytics by Samir Saci

Scope: Supply Chain Optimization

Objective: Use data analytics with Python to improve operational efficiency by enabling data-driven diagnostics and decisions at strategic and operational levels.

A Supply Chain can be defined as several parties exchanging flows of material, information or money resources to fulfil a customer request.

Flows of information and goods — (Image by Author)

As information plays an important role, Supply chain Analytics has emerged as the methodologies and tools organizations use to get insights from data associated with all processes included in the value chain.

In this article, you will discover the different types of Supply Chain Analytics with Python and understand their impact on the efficiency of your end-to-end operations so you can start your project.

Optimize production planning with Python

A flowchart illustrating the supply chain process from customer to factory. It begins with a customer placing a purchase order, which is then sent to the factory. The factory determines production quantity and monitors inventory on hand. The flow also includes details on delivery quantity and delivery date, linking back to customer order fulfillment.
Optimize production planning with Python by Samir SACI

Scope: Supply Chain Optimization

Problem Statement: The master production schedule is the main communication tool between the commercial team and production.

Your customers send purchase orders with specific quantities to be delivered at a certain time.

Example of an order — (Image by Author)

Production planning minimizes the total cost of production by finding a balance between minimizing inventory and maximizing the quantity produced per setup.

Objective: In this article, we will implement optimal production planning using the Wagner-Whitin method with Python.

What is Supply Chain Optimization?

A diagram titled “Supply Chain Networks with Monte Carlo Simulation” comparing demand and supply for supply chain optimization. The left side represents demand with “Sales Volumes per Market,” highlighting different markets on a world map. The right side shows supply with “Manufacturing Capabilities per Region,” indicating factory locations and capabilities. Icons below represent key factors: market demand, supply constraints, minimizing costs, factory capacity, fixed costs, and variable costs.
What is Supply Chain Optimization?

Scope: Supply Chain Optimization

Problem Statement: Supply chain optimization uses data analytics to find an optimal combination of factories and distribution centres to meet customers' demands.

The core structure of many software and solutions in the market is a Linear Programming Model.

Some of these models find the right allocation of factories to meet the demand and minimize the costs, assuming a constant demand.

What happens if the demand is fluctuating?

Your network may lose robustness, especially if your demand is very seasonal (e-commerce, cosmetics, fast fashion).

Objective: In this article, we will build a simple methodology to design a Robust Supply Chain Network using Monte Carlo simulation with Python.

Let us move to warehouse operations.

Improve warehouse picking productivity with Python

A warehouse layout diagram illustrating an aisle-based picking system. The layout shows 19 aisles labeled A01 to A19, with 12 rows in each aisle, forming a grid. The diagram begins at a “Start” point in the bottom left corner and follows an order picking path labeled “Order 1: Start.” The x and y axes represent the warehouse dimensions, highlighting the structured organization for optimizing picking routes.
Example of three different picking routes — (Image by Author)

Scope: Warehouse Operations

Problem Statement: In a Distribution Center (DC), walking time from one location to another during the picking route can account for 60% to 70% of the operator’s working time.

Objective: How can you use Data Science to increase Warehouse Operators’ productivity by reducing walking distance?

I have written several articles explaining how to use Order Batching, Spatial Clustering and Pathfinding Algorithms to improve picking productivity.

Concept & Libraries Used:

  • Create Orders Batch using Python’s Pandas, Numpy
  • Spatial Clustering of Picking Location using Python’s Scipy
  • Pathfinding for Picking Route design using Google OR

Results: An increase in the picking productivity will lead to cost reductions

Link to the Articles

Optimize Workforce Planning Python

A diagram titled “Workforce Planning Problem” illustrating factors in workforce planning. At the center is “Workload Forecasting,” which is connected to four surrounding elements: time constraints, task planning, schedule constraints, and workforce planning. The diagram emphasizes the need to consider these factors when determining the minimum number of temporary workers required to meet weekly workload demands while ensuring employee retention.
Optimize Workforce Planning Python by Samir SACI

Scope: Warehouse Operations

Problem Statement: What is the minimum number of temporary workers you need to hire to absorb your weekly workload while ensuring employee retention?

Objective: To meet your manager's productivity targets, you must minimize the number of workers hired to handle the workload.

I have written a medium article on using Linear Programming to find the correct number of workers to hire.

Concept & Libraries Used:

  • Linear Programming with Python’s PuLP

Results: Calculate the minimal number of workers that respects all the constraints.

Lean Six Sigma with Python using the Kruskal Wallis Test

A flowchart titled “Warehouse Operators Training” illustrating the impact of training on operator productivity. It shows two groups: trained and untrained VAS (Value-Added Services) operators. Each group’s productivity is measured after the training, with arrows representing the process from random selection to measuring productivity for both groups.
Lean Six Sigma with Python using the Kruskal Wallis Test by Samir SACI

Scope: Warehouse Operations

Lean Six Sigma (LSS) is based on a stepwise approach to process improvements, following five steps (Define, Measure, Analyze, Improve, and Control).

As a continuous improvement Manager of a Distribution Center (DC) for an iconic Luxury Maison, you want to use this approach to improve the productivity of a specific process.

Does the training have a positive impact on the productivity of operators?

Hypothesis
The training has a positive impact on the productivity of VAS operators.

Experiment
Randomly select operators and measure the time per batch (Time to finish a batch of 30 labels in seconds) to build a sample of 56 records.

Objective: In this article, we will explore how Python can replace Minitab in the Analysis step to test hypotheses and understand what could improve the performance metrics of a specific process.

Do you need to develop interactive visualizations?

Deploy Interactive Dashboard using Python Flask and javascript D3.js

Scope: Visualization & Reporting

Interactive Dashboard with Python Flask and D3.js
Final Rendering of a Dashboard showing Luxury Brands’ Online Sales — (Image by Author)

A simple and fancy visualization can impact more than a very complex model, especially for a non-technical audience.

Therefore, building your visualization skills is essential to your Supply Chain Data Scientist job.

I have written a medium article on designing fancy visualization using D3.js without prior knowledge of JavaScript (or very light).

Concept & Libraries Used:

  • Data Processing with Numpy, Pandas, Flask and D3.js (Link)

Results: A dynamic dashboard that can interact with users to show business insights.

What about inventory management?

Inventory Management for Retail with Python

A dashboard titled “Inventory Management with a Stochastic Demand” shows three graphs: demand (in red), replenishment, and inventory (in green) over time. Demand fluctuates with a normal distribution. Replenishment follows the (s,Q) order point model, where replenishment occurs when inventory falls below a certain level. The inventory graph shows cycles of depletion and restocking. Explanations define demand distribution and replenishment with lead time.
Inventory Management with a Stochastic Demand — (Image by Author)

Scope: Inventory Management

Problem Statement: For most retailers, inventory management systems take a fixed, rule-based approach to forecast and replenishment order management.

Considering the demand distribution, the objective is to build a replenishment policy to minimize your ordering, holding and shortage costs.

Objective: We want to introduce a simple methodology for testing several inventory management rules using a discrete simulation model built with Python.

Machine Learning for Retail Sales Forecasting with Python

Machine Learning for Retail Sales Forecasting
Features Engineering for Machine Learning for Retail Sales Forecasting — (Image by Author)

Scope: Inventory Management & Demand Forecasting

Problem Statement: Based on the feedback of the last Makridakis Forecasting Competitions, machine learning models can reduce forecasting errors by 20% to 60% compared to benchmark statistical models. (M5 Competition)

Their major advantage is the capacity to include external features that heavily impact the variability of your sales.

For example, e-commerce cosmetics sales are driven by special events (promotions) and how you advertise a reference on the website (first page, second page, …).

Feature engineering is based on analytical concepts and business insights to understand what could drive your sales.

Objective: In this article, we will try to understand the impact of several features on a model’s accuracy using the M5 Forecasting competition dataset.

Let us optimize the road transportation!

Transportation Network Analysis with Python using Graph Theory

: A diagram titled “Transportation Network Analysis with Graph Theory” shows a network of stores connected by lines representing road transportation routes. The stores are color-coded into groups, with icons of trucks indicating active transportation between stores. Stores 1, 2, and 5 are connected in blue; stores 3, 4, and 9 in yellow; stores 6, 7, and 8 in orange. The diagram illustrates the use of graph theory to optimize a retail company’s transportation network.
Transportation Network Analysis with Python using Graph Theory by Samir SACI

Scope: Road Transportation

Objective: Build Visuals to support FTL routing optimization.

Problem Statement: For a retailer, road transportation to deliver stores represents a significant part of the logistics costs.

Companies often conduct route planning optimization studies to reduce these costs and improve the efficiency of their network.

It requires collaboration between continuous improvement engineers and the transportation teams managing daily operations.

Objective: In this article, we use Graph Theory to design visual representations of a transportation network to support solution design.

Transportation Network Visualization with Python

A map titled “Road Transportation Network Visualization” shows 14 routes for freight transportation across a region in East Asia. The map highlights delivery points connected by lines, representing routes for full truckload (FTL) shipments. The visual includes details such as deliveries per route, cost per ton, and truck sizes, overlaid on a geographical map with cities and roads marked.
Transportation Network Visualization with Python by Samir SACi

Scope: Transportation Operations

Problem Statement: How do you organize the delivery routes and truck loading to reduce your costs?

Objective: Visualisation and Costing of your Transportation Plan to optimize the loading rate and reduce the costs per ton of your Transportation.

An animated GIF generated with Python visualizing transportation routes in a network across a region in East Asia. The map shows multiple cities connected by lines representing 14 full truckload (FTL) routes. The routes are color-coded and span between key locations, with markers indicating cities and delivery points within the transportation network.
Visualisation of the different routes covered (1 colour = 1 route) — (Image by Author)

I have written a medium article on processing data and preparing visualization to impact the average cost per ton of your deliveries.

Concept & Libraries Used:

  • Data Processing with Numpy, Pandas and visualization with Matplotlib

Results: An Optimized transport plan using larger trucks with a higher loading rate.

Do you want to support the green transformation of your company?

Supply Chain Sustainability Reporting with Python

A flowchart titled “Supply Chain Sustainability Report” showing three transportation routes for estimating CO2 emissions in a supply chain. The first route uses road, air, and road transportation to deliver goods from a distribution center to customers. The second route is a direct road trip of 200 km. The third route uses road, sea, and road transport to reach customers.
Supply Chain Sustainability Reporting with Python by Samir SACI

Scope: Sustainability

Problem Statement: The demand for transparency in sustainable development from investors and customers has grown.

Investors have increasingly emphasised the business’s sustainability when assessing an organisation’s value and resiliency.

A diagram illustrating route 2 for parcel shipments, showing a calculation for total CO2 emissions. The route includes distances covered by road (120 km), air (1450 km), and road again (700 km), with an equation for calculating total emissions based on these distances and weight of shipments.
Example of CO2 emissions measured from warehouse to destination by Samir SACI

Therefore, more organizations are investing resources to build capabilities for sustainability reporting and determine the best strategies for a sustainable supply chain.

Objective: In this article, we will introduce a simple methodology to report the CO2 emissions of your Distribution Network using Python and PowerBI.

How Sustainable is Your Circular Economy?

Scope: Supply Chain Sustainability

Key Skills: Inventory Management, Life Cycle Analysis, CO2 Emissions Calculation, Supply Chain Descriptive Analytics (Data Processing, Visualization and KPI creation)

Problem Statement: A circular economy is an economic model that aims to minimize waste and maximize resource efficiency.

It involves designing products and processes focusing on longevity, reuse, and recycling.

A simplified flowchart illustrating a rental process in a circular economy model. On the left, a robot places items in bins, symbolizing the start of the process. On the right, the timeline shows “Day 1: Start Rental” with a green arrow connecting a store to a customer renting clothing. “Day 14: Return” with a red arrow indicates the return of the rented items to the store.
(Image by Author)

Some companies have implemented a subscription model where customers pay a regular fee to access a product or service for a specific period.

Objective: Use data analytics to simulate the impacts of several circular subscription model scenarios on emissions reductions and water usage of a fast fashion retailer.

What is Green Inventory Management?

Scope: Supply Chain Sustainability

Key Skills: Inventory Management, CO2 EmissionsCalculation, Supply Chain Descriptive Analytics (Data Processing, Visualization and KPI creation)

Problem Statement: Green inventory management can be defined as managing inventory in an environmentally sustainable way.

A distribution network can involve processes and rules to reduce the environmental impact of order transmission, preparation and delivery.

A flowchart showing the six steps of green inventory management. It starts with Step 1: Store Inventory Alert, followed by Step 2: Replenishment Order Creation. Step 3 involves the Order Transmission to Warehouse Management System. Step 4 is Order Preparation in Warehouse, followed by Step 5: Order Delivery to Store, and ends with Step 6: the store receiving the order. The process emphasizes sustainable inventory management and order transmission.
(Image by Author)

What would be the impact on CO2e emissions if we reduce the frequency of store replenishments?

Objective: Use data analytics to simulate the variation of store replenishment frequency and measure the impact on the overall environmental impact.

Sustainable Supply Chain Optimization Application

A flowchart titled “Create a Sustainable Supply Chain Optimization Web App” illustrates a process with three steps. The first step is “Import Data,” represented by a table icon and three factories (Factory 1, Factory 2, Factory 3). The second step is “Select Objective,” with options to minimize waste, cost, or CO2 emissions. The final step is “Visualize Results,” showing a world map with transportation routes and a pie chart displaying outcomes.
Sustainable Supply Chain Optimization Application by Samir SACI

Scope: Sustainability

Objective: Help your organization combine sustainable sourcing and supply chain optimization to curb costs and environmental impacts.

I developed an application using the VIKTOR platform to facilitate data-driven decision-making for supply chain sustainability.

Sustainable Supply Chain Optimization [Link]

For more details,

Automate ESG Reporting with Python

A diagram titled “Automate ESG Reporting with Python” visualizes three aspects of ESG (Environmental, Social, Governance) reporting. At the top is an “ESG” icon represented as a pie chart. Below, three icons depict environmental sustainability (hands holding the Earth), social responsibility (hands supporting people), and governance (a group of people in a meeting setting).
Automate ESG Reporting with Python by Samir SACI

Scope: Sustainability

Objective: Comprehensive and practical environmental, social and governance reporting of a company

Problem Statement: Environmental, Social and Governance (ESG) reporting is a method companies use to disclose their governance structures, societal impacts and ecological footprint.

A diagram breaks down the three pillars of ESG: Environmental, Social, and Governance. The Environmental section lists actions like carbon footprint reduction, waste reduction, and energy efficiency. The Social section covers fair wages, job opportunities, health and safety, and respecting labor laws. The Governance section focuses on corporate governance, risk management, ethical practices, and accounting transparency.

How can you use data analytics to automate this reporting?

As stakeholders increasingly demand corporate social responsibility (CSR), ESG reporting has become critical to companies’ long-term strategies.

This job requires to collect and process data from multiple sources.

In the article, we provide examples of analytics solutions that you can use to support your ESG reporting 👇

Fight Greenwashing with Python

A diagram titled “Fight Greenwashing with Python” illustrates five tactics used in greenwashing: lies, vagueness, proof-less claims, irrelevance, and trade-offs. Each tactic is represented by an icon, such as a person telling a lie, a question mark for vagueness, a document for proof-less claims, a hand with a star for irrelevance, and a hand with money for trade-offs.
Fight Greenwashing with Python by Samir SACI

Scope: Sustainability

Objective: use data analytics to detect greenwashing from large corporations.

Problem Statement: Greenwashing is the practice of making misleading claims about the environmental benefits of a product or a service to communicate a false image of sustainability.

How can we use data to detect frauds?

In this article, we try to understand the different types of greenwashing to explain its manifestations.

We will also introduce data analytics methods to detect and prevent these unethical practices. 👇

Leverage Data Analytics for Sustainability

Sustainability has become a critical aspect of business operations as companies face mounting pressure to address environmental and social issues for their ESG reporting.

A hierarchical diagram illustrating how data analytics can be leveraged for sustainability. At the top, a skull and crossbones icon represents environmental risks. Below, four branches depict key areas: partnerships, analytics, automation, and sustainability. Each area is represented with icons symbolizing collaboration, data charts, robots with data, and people focused on environmental goals.
Use Data Analytics to overcome the challenges companies face in their pursuit of sustainability

Scope: Sustainability

Experts exposed the barriers that companies face in their pursuit of sustainability, focusing on four “hidden enemies”:

  • Structure and Governance: Siloed sustainability limits influence.
  • Processes and Metrics: Unsustainable metrics hinder progress.
  • Culture and Leadership: Old mindsets challenge transformation.
  • Methods and Skills: Traditional tools obstruct change.

Problem Statement: As a data scientist, analyst, or continuous improvement engineer, how can your company boost its green business transformation?

Objective:

Explore how data analytics can help to overcome these challenges by focusing on the four “hidden enemies” of your Supply Chain green transformation.

Have you heard about Generative AI?

Leveraging LLMs with LangChain for Supply Chain Analytics

Leveraging LLMs with LangChain for Supply Chain Analytics by Samir SACI

Scope: Generative AI

Objective: Master Langchain with OpenAI’s GPT models and build the ultimate Supply Chain Control Tower.

Create a LangChain agent connected to a local database to answer operational questions using data.

Automate supply chain analytics with GPTs

A diagram illustrates automating supply chain analytics with GPTs. On the left, an Excel icon represents data inputs, accompanied by question marks symbolizing data queries. In the center, icons depict a magnifying glass over A, B, C options and a gear representing automated processes. On the right, a user icon with a speech bubble shows interaction with GPT for answers. At the top, a support agent icon represents AI assistance in analytics.
Automate supply chain analytics with GPTs

Scope: Generative AI

OpenAI introduced a new feature allowing users to create custom versions of ChatGPT tailored for specific purposes.

This is an opportunity to easily create and deploy an agent to automate Pareto and ABC analyses.

Objective: Introducing “The Supply Chain Analyst”, a custom GPT agent designed to automate supply chain analytics tasks and interact with users using natural language.

Conclusion

These examples can be directly applied to your operations using your data sets to provide insights that will impact your organization quickly.

Feel free to leave questions in the comment section.

You can follow me on Medium for more articles related to Data Science for Supply Chain Optimization.

About Me

Let’s connect on Linkedin and Twitter; I am a Supply Chain Engineer using data analytics to improve logistics operations and reduce costs.

For consulting or advice on analytics and sustainable supply chain transformation, feel free to contact me via Logigreen Consulting.

If you are interested in Data Analytics and Supply Chain, look at my website.

💌 New articles straight in your inbox for free: Newsletter
📘 Your complete guide for Supply Chain Analytics: Analytics Cheat Sheet

References

[0] Samir Saci, My GitHub Portfolio, Link

--

TDS Archive
TDS Archive

Published in TDS Archive

An archive of data science, data analytics, data engineering, machine learning, and artificial intelligence writing from the former Towards Data Science Medium publication.

Samir Saci
Samir Saci

Written by Samir Saci

Top Supply Chain Analytics Writer — Case studies using Data Science for Supply Chain Sustainability 🌳 and Productivity: https://bit.ly/supply-chain-cheat

Responses (4)