TDS Archive

An archive of data science, data analytics, data engineering, machine learning, and artificial…

Data Science for Good: Beyond Profits, Towards a Better World

Samir Saci
TDS Archive
Published in
11 min readAug 25, 2023

--

This image highlights three benefits of data science in a supply chain environment. On the left, a rising clock represents “Productivity Increase.” In the center, a graph with dollar coins and an upward arrow symbolizes “Profit Increase.” On the right, an icon showing a hand holding money next to a calendar signifies “Higher Salaries.” The visual portrays how data-driven improvements can boost productivity, profits, and employee compensation.
Data Science for Good — (Image by Author)

Data science can support business transformations that go far beyond financial gains.

It can improve working conditions, reduce inequalities and promote an inclusive working environment.

As a senior supply chain engineer, I’ve mainly used analytics to improve operational performance and cut costs.

However, an engineer's duty is not just about maximizing profit; she can also help make the world a better place.

As a Data Scientist, how can you improve working conditions of logistic operators?

In this article, I will share examples using data science to improve logistic operators' working conditions (and bonuses).

Productivity & Profitability

Profitability of Logistics Companies

As a former supply chain solution designer, I spent the early years of my career helping logistic companies optimize productivity to boost profit.

I’ve witnessed the immense pressure our customers (Retail, Fashion, Luxury, Cosmetics) face to deliver goods on time while minimizing costs.

Illustration of supply chain processes from upstream (manufacturing) to downstream (distribution). It shows a factory representing production, trucks symbolizing freight, a warehouse for inventory storage, and retail stores as end points for customer delivery.
3PL for Retail Distribution — (Image by Author)

As third-party logistic providers, this pressure was directed to us with a constant fear of having our contracts not renewed.

In this stressful environment, the lure of cutting corners and adopting aggressive management tactics to reduce workforce costs is ever-present.

Let’s find a smarter way to work!

By embracing the power of data science to improve processes, we can avoid transferring this pressure on operators and drive positive change.

Definition of Process Productivity

Let’s take a hypothetical scenario of a major international fashion retailer building its distribution network with logistic companies.

A central warehouse fulfills orders from various factories and distributes them to 100 stores. Represents the role of warehousing and distribution in the supply chain.
Distribution Network to Deliver 100 Stores — (Image by Author)

I&N is looking to outsource its warehousing and transportation operations to deliver its stores in Shanghai.

The logistics team organizes a tender, also called a Request for Proposal (RFP), inviting global logistics companies to submit their solutions.

Visual comparing three logistics companies based on inbound, outbound, and storage costs. Stores, warehouse, and transportation icons represent logistical functions and cost breakdowns for each service, supporting a tender request (RFP) process.
Request For Proposal — (Image by Author)

As part of the RFP, I&N’s logistics team provides

  • Data and process requirements
  • This quotation sheet outlines the different service prices for storage, receiving, box picking, piece picking, loading, and return management.

As the solution design manager, I was responsible for proposing solutions and calculating the price for each service.

The price was determined by considering the equipment and workforce costs driven by operators’ productivity and the margin on sales.

Calculate the price of a handling service — (Image by Author)

P.S: The margin on sales is defined as the percentage of the turnover that represents the margin.

Warehouse and transportation teams would run the operations for three years to preserve this margin.

This could be achieved by either increasing the price (which was nearly impossible) or reducing costs.

Depicts a balance scale between cost drivers (left: storage, transport, energy) and revenue opportunities (right: sales, analytics). The green and red arrows show profit or loss, emphasizing the importance of balancing operational costs with financial success.
P&L Balance — (Image by Author)

The latter option usually came down to one key factor: increasing productivity.

But how do you increase productivity?

This can be done with aggressive management tactics using high individual productivity targets, salary cuts and illegal labour practices.

It often leads to increased employee stress and dissatisfaction which may impact your company’s ESG score.

❓ Do you want to know more about ESG scoring?

The second method is to optimize processes, layouts and goods flows to help operators become more efficient with the same amount of effort.

This approach not only benefits the company but also improves the working conditions.

To illustrate my point, I will share examples that I have implemented in my career for operations in the retail, fast fashion and luxury industries.

🏫 Discover 70+ case studies using data analytics for supply chain sustainability🌳and business optimization 🏪 in this: Cheat Sheet

Data science can be crucial in improving operators' working conditions and wages.

Improving Picking Efficiency

Let’s start with an example from a re-engineering project. I used analytics to help operators improve their picking productivity.

Warehouse Picking can be defined as taking products from the stock to prepare an order (e-commerce, store delivery).

An animated schematic representation of a warehouse layout showing aisles A01 to A19, with the starting point labeled “START” at the bottom left. The image features 12 rows and 19 aisles, each divided into rectangular slots representing storage locations. The layout illustrates a typical order picking route starting from the bottom left corner, highlighting the structure and organization of the warehouse space and the walking route used by warehouse picking operators to prepare orders.
Picking routes in a warehouse — (Image by Author)

In a Distribution Center (DC), walking time from one location to another during the picking route can account for 60% to 70% of the operator’s working time.

Calculate the price of a picking — (Image by Author)

Reducing this walking time can drastically impact its productivity.

Let us assume that operators pick full boxes

  • Productivity will be in (boxes/hour)
  • Price will be in ($/hour)

Warehouse operators have productivity targets that they need to reach to receive bonuses.

The pressure on operators in retail and FMCG operations is immense, as they represent a major part of fixed and variable costs.

What is the best way to help operators improve their productivity?

Maximizing the number of boxes picked per meter walked can help operators improve their productivity while minimizing their efforts.

The image shows four sections. Top left: a warehouse layout (A01-A19) with an optimized picking route marked by dashed lines. Top right: a triangular picking path between three points. Bottom left: a bar chart comparing walking distances for 20,000 orders using three methods, x-axis as orders per wave, y-axis as distance. Bottom right: a TSP optimization graph using OR-Tools, with yellow dots and lines representing the shortest path for efficient picking.
3 methods to improve the ratio of boxes picked per meter walked — (Image by Author)

In several articles, I propose several data-driven methods to reduce this walking distance and maximize operators’ productivity.

  • Order Batching to increase the number of order lines picked per route
  • Clustering of the picking locations to group orders by area
  • Advanced pathfinding using Google-OR libraries to optimize the picking routes

These optimization approaches have been tested using actual order lines and warehouse layouts.

More Boxes Picked + Less Distance = Higher Productivity

Results show a huge decrease in the walking distance for the same scope of orders to pick.

Implementing these algorithms in your Warehouse Management System can boost operators’ productivity without changing any processes.

💡 For more details, you can have a look at these articles,

Parcel Packing Process Design with Queueing Theory

Let’s switch now to e-commerce operations.

The online retail boom has put enormous pressure on fulfilment centres to prepare and ship orders at unmatched speeds.

How many orders can our warehouse ship per day?

In this example, a major problem of continuous improvement engineers is shipping capacity.

3D illustration of an e-commerce fulfillment center packing area. Workers are operating a single conveyor belt with boxes moving through several stations, where items are packed. The background shows shelves with a large inventory of stacked boxes. In the foreground, multiple workstations are prepared for different packing tasks, including items like bags and clothing. The image highlights the outbound logistics process and illustrates a potential bottleneck in the packing operation due to delay
Packing Area in your Warehouse — (Image by Author)

After picking, the orders wait too long to be packed and loaded onto trucks.

Based on on-site observations and productivity analysis our engineer understood that the packing process was the bottleneck.

She would like to redesign the layout and optimize the process to release pressure from packing operators.

The site manager decided to invest in a second packing station.

Therefore, our engineer wants to use the Queueing Theory to find the best layout.

Solution 1: Keep a single line with two parallel stations

A warehouse layout depicting a single conveyor line with two parallel packing stations, where workers process parcels. Solution 1: The conveyor line has two packing operators working in parallel to increase throughput. The objective is to optimize the parcel packing process using the Queueing Theory for an e-commerce fulfillment center.
Solution 1 — (Image by Author)

Solution 2: Add a second line with a dedicated station

A warehouse layout showing two conveyor lines with dedicated packing stations for each line. Workers are positioned along both conveyor lines, processing parcels separately at each station. Solution 2: The addition of a second line with dedicated stations aims to improve the efficiency of the parcel packing process in an e-commerce fulfillment center by distributing the workload across two independent lines.
Solution 1 — (Image by Author)

What is the best solution to minimize the queueing time and reduce bottlenecks?

Using concepts from the Queuing Theory, we can estimate the performance of the two layouts by considering the variability of input flow.

A line graph comparing two solutions (SOL1 and SOL2) based on their waiting times as the coefficient of variation of processing times (CVp) increases. The x-axis represents the coefficient of variation of processing times (CVp), while the y-axis shows the waiting time. SOL1 is shown in blue, and SOL2 is shown in red, with SOL2 experiencing higher waiting times as CVp increases, indicating its inefficiency compared to SOL1 for this scenario.
Simulation Results for the Two Solutions— (Image by Author)

The analysis shows that the second solution is less robust and may reduce the packing station's capacity when faced with volume variability.

Assuming that we have a variability of 1.5,

  • -25% of the queueing time (seconds) in the packing line
  • Higher overall productivity with the same packing speed
A diagram comparing two packing station layouts for reducing queue time and bottlenecks in a warehouse process. In the first layout, a single conveyor belt feeds two packing stations, resulting in efficient load distribution. In the second layout, two conveyor belts independently feed two packing stations, leading to potential inefficiencies when volumes vary. The image suggests the first layout improves productivity by reducing queue time by 25% and maintaining the same packing speed.
Average Productivity by Solution — (Image by Author)

Thanks to this simple modelisation, she has designed a layout enabling operators to be more productive without additional effort.

💡 For more details, you can have a look at this article

Conclusion

You can redesign processes Using advanced analytics tools to help operators be more efficient.

  • Higher bonus for picking and packing operators (👩‍🏭)
  • Less effort to walk or pack fast w/o pressure from managers (🧘)
  • Better profitability for the company (💰)

In the next section, we will discover other opportunities for data-driven improvement focusing on resource allocation.

💡 Follow me on Medium for more articles related to 🏭 Supply Chain Analytics, 🌳 Sustainability and 🕜 Productivity.

Invest in Your Workforce (In a Smart Way)

The previous section focuses on designing an optimal process to maximize an operator's productivity.

However, there is a gap between process design and execution that can be filled with optimization tools.

Workforce Planning for Inbound Management

For our fast-fashion customer I&N, we must plan the workforce to absorb the fluctuating demands of the stores.

A series of four graphs representing different time-series data to illustrate the fluctuation of warehouse workload. The top graph (orange) shows the Orders over time, the second (green) depicts Lines, the third (red) illustrates SKU variability, and the bottom (purple) shows Destinations. The y-axis for all graphs represents frequency or volume, while the x-axis spans the timeline of observation. Peaks and troughs in each graph suggest fluctuating demand and activity in warehouse operations.
Example of Daily Fluctuation for an E-Commerce Warehouse — (Image by Author)

I&N provides volume forecasts, like the one above, and the managers are planning the number of headcounts needed to meet the demand.

A process diagram showing the flow from “Volumes Forecast” (boxes) to “Operators Productivity” (boxes per hour), then moving towards “Workforce Planning” (headcounts). The goal is to link volume predictions with productivity metrics to optimize workforce size, illustrating how forecasting can lead to efficient staffing. Icons such as shopping carts, clocks, and workers visually represent each stage in this workforce planning process for better logistics management.
Workforce Planning with Volumes Forecasts — (Image by Author)

They must minimize the number of temporary workers hired while ensuring employee retention and adhering to local regulations.

In our example, we would like to help the Inbound Manager.

His team responsibilities include

  • Unload Pallets from the Trucks
  • Scan each pallet and record the received quantity in the Warehouse Management System (WMS)
  • Put away these pallets in the Stock Area
A detailed 3D-rendered warehouse floor plan. It shows various sections, including inbound pallet wrapping (green arrow), unloading and receiving areas (yellow arrow), and handling of paper documents like purchase orders and shipping notices (red arrow). The diagram highlights workflow organization within the warehouse, from the receipt of goods through processing, and how materials flow through different operational zones.
Unloading Process at the Warehouse — (Image by Author)

The team has two types of productivity target

  • Number of pallets unloaded per hour for each worker (I)
  • Number of pallets unloaded per hour paid for the whole team (II)

How to Size the Workforce using volumes and productivity?

If the manager recruits too many operators the overall productivity (II) may be greatly reduced.

Based on volume forecasts, he can estimate what resources would be needed each day.

A bar chart depicting staff demand across different days of the week. Each bar represents the required workforce for specific days, with varying heights indicating changes in staffing needs. The y-axis shows the number of staff, and the x-axis lists the days. This graph helps managers plan optimal workforce allocation based on forecasted demand, ensuring adequate staffing while minimizing overstaffing.
Inbound Ressource Needs Forecasts by Day — (Image by Author)

To ensure employee retention, you need to guarantee a minimum of 5 consecutive working days per week.

A table with a weekly shift plan. It has seven rows representing shifts and columns for days of the week (Monday to Sunday). Green cells represent working days, while yellow cells indicate days off. The table is a visual tool for managers to plan shift schedules, ensuring continuous coverage while balancing workload distribution and operator availability, preventing overwork and ensuring compliance with labor regulations.
Planning by shift based on constraints listed above — (Image by Author)

To help our inbound manager balance these constraints and objectives, we can use linear programming with Python.

A bar chart output of the warehouse workforce optimization planning tool designed with Python. This shows staff demand (black bars) and staff supply (red bars) across different days. A blue line represents extra resources required. The graph helps managers visualize the gap between demand and available workforce, indicating where temporary or additional staffing may be needed. The x-axis shows the days, while the y-axis indicates the number of staff, helping to balance operational requirements.
Final Results — (Image by Author)

The results of this optimization tool are satisfying,

  • Except for Friday and Saturday, we do not have resources in excess
  • The supply matches the demand every day

Except for these two days, the overall productivity is not affected by planning issues.

If the operators meet the targets, they will receive a full bonus without impacting operational margin.

💡 For more details, you can have a look at this article

Optimal Incentive Policy for Warehouse Operators

In this last example, let’s imagine you are helping a Regional Director of your logistics company with 22 warehouses in her P&L.

A 3D illustration of a warehouse showing operators working along a conveyor belt system. Boxes are being sorted and placed on the conveyor, which runs between storage racks filled with pallets. Nearby, several packing stations with desks and equipment are visible, representing additional workflow areas. Two warehouse workers are sorting items onto the conveyor, while another operator uses a ladder to reach higher shelves. The image depicts an efficient warehouse layout to improve productivity.
Fulfilment Centers with Picking, VAS and Packing — (CAD Model by Author)

The objective is still to maximize the operators’ picking productivity.

She would like to use financial incentives to motivate operators to increase their outputs per hour paid.

The current incentive program provides 5 euros per day to operators that reach their target. (Daily salary: 62 euros)

However, this could be more efficient as only 20% of operators meet their target.

What should be the minimum daily bonus needed to reach 75% of the target?

The idea is to run a data-driven experiment

  1. Randomly select operators in your 22 warehouses
  2. Implement a daily incentive amount varying between 1 to 20 euros
  3. Verify if the operators reached their target

Using Logistic Regression, we get a probability plot that helps to estimate the probability of reaching the target for each value of the daily incentive.

Fitted line plot of your sample data — (Image by Author)

Based on the trend, with a 15 euro incentive per day, we get a 75% probability of reaching the target.

This result, backed by statistical tools, provides valuable insight for our director to change her incentive policy.

💡 For more details, you can have a look at this article

Conclusion

We have discovered optimization and statistical tools that help a company improve resource allocation to reach key operational targets.

  • Higher overall team productivity and an improved margin (👩‍🏭)
  • More incentives for the operators (💰)
  • Less pressure from managers (🧘)

💡 Follow me on Medium for more articles related to 🏭 Supply Chain Analytics, 🌳 Sustainability and 🕜 Productivity.

Next Steps

Hopefully, these real-world applications inspired you to design and deploy tools supporting financial gains while improving operator working conditions.

A visual representation of the three pillars of ESG. Each pillar is displayed in a separate colored house-shaped section. The “Environmental” pillar covers carbon footprint reduction, climate change strategy, waste reduction, and energy efficiency. The “Social” pillar emphasizes fair living wages, equal job opportunities, health and safety, responsible suppliers, and respecting labor laws. The “Governance” pillar focuses on corporate governance, risk management, compliance, ethical business.
ESG Pillars Presentation — (Image by Author)

As stakeholders increasingly demand corporate social responsibility (CSR) and ESG scoring, working conditions and fair wages will become key parameters of any business transformation.

Environmental, Social and Governance (ESG) is a reporting method used to disclose companies' ecological footprint, societal impacts and governance structures.

Looking at the indicators included in the ESG score, you’ll find additional opportunities to use data science for positive change.

For more details about ESG scoring

In the following article of this series, I will focus on advanced analytics to reduce consumables usage and waste generation.

Stay tuned!

🌍 Curious about the global roadmap for a sustainable future?

Dive into my recent insights about how Data can support the United Nations’ Sustainable Development Goals.

About Me

Let’s connect on Linkedin and Twitter. I am a Supply Chain Engineer who uses data analytics to improve logistics operations and reduce costs.

For consulting or advice on analytics and sustainable supply chain transformation, feel free to contact me via Logigreen Consulting.

If you are interested in Data Analytics and Supply Chain, look at my website.

💌 New articles straight in your inbox for free: Newsletter
📘 Your complete guide for Supply Chain Analytics: Analytics Cheat Sheet

💡 Follow me on Medium for more articles related to 🏭 Supply Chain Analytics, 🌳 Sustainability and 🕜 Productivity.

--

--

TDS Archive

Published in TDS Archive

An archive of data science, data analytics, data engineering, machine learning, and artificial intelligence writing from the former Towards Data Science Medium publication.

Samir Saci

Written by Samir Saci

Top Supply Chain Analytics Writer — Case studies using Data Science for Supply Chain Sustainability 🌳 and Productivity: https://bit.ly/supply-chain-cheat