TDS Archive

An archive of data science, data analytics, data engineering, machine learning, and artificial…

Data Science for Sustainable Sourcing

Samir Saci
TDS Archive
Published in
10 min readMar 7, 2024

--

Three factories compared on three environmental factors: water usage, waste management, and CO2 emissions. Each factory is rated with stars on each factor, with one to five stars used for evaluation. The visual represents sustainable sourcing considerations for supply chain optimization, highlighting differences in supplier environmental performance used for sustainable sourcing with Python.
Data Science for Sustainable Sourcing — (Image by Samir Saci)

Sustainable sourcing is the process of integrating social, ethical and environmental performance factors when selecting suppliers.

This includes assessing and evaluating suppliers based on sustainability criteria such as health and safety, environmental impacts or labour rights.

A breakdown of environmental impact factors for a specific factory, including water usage, waste management, and CO2 emissions. Each factor is rated with stars, ranging from one to five stars, visually representing the factory’s sustainability performance across these dimensions that can be used for sustainable sourcing.
Assessment of Suppliers based on Environmental Factors — (Image by Author)

The aim is to minimize the negative impacts on the environment and maximize the positive social impacts.

As a Data Science Manager, can you develop tools to streamline the process of sustainable sourcing?

Supply Chain Network design tools automatically select the best suppliers that minimize costs and meet the market demand while respecting constraints defined by your sustainability team.

In this article, we will explore how you can use data science in your organization to support the implementation of sustainable sourcing.

Sustainable Sourcing Problem Statement

Scenario: Selection of T-shirts Suppliers

Let us take the example used in the previous articles about Life Cycle Assessment.

You are the Data Science Manager of an international clothing group that has stores all around the world.

The company sources garments, bags, and accessories from suppliers located in Asia.

A flowchart illustrating the circular economy model for a fashion retailer. The forward logistics process includes the delivery of new garments from the factory to the warehouse and then to the stores. The reverse logistics process involves collecting used garments from stores and delivering them back to the warehouse for recycling into new garments. The focus is on recycling and sustainability.
Fast Fashion Company Circular Economy — (Image by Author)

Stores are delivered from local warehouses and directly replenished by suppliers.

Can you automatically select suppliers that minimize your costs while respecting environmental and social constraints?

The sourcing team has decided to implement sustainable sourcing on the limited scope of T-shirt suppliers.

A diagram showcasing the life cycle of a product, from sourcing raw materials (cotton) to manufacturing at a factory, followed by shipping to a warehouse. The diagram highlights key environmental impacts such as CO2 emissions, water, and energy consumption throughout the supply chain, including production, transportation, and warehousing. The diagram emphasizes the environmental footprint of each step.
Life Cycle Assessment of your T-shirt from Raw Materials to Warehouse Delivery — (Image by Author)

They want to review their supplier selection process to support the company’s sustainability roadmap.

However, they are facing several obstacles

  • Suppliers data collection and storage
    There is no central source of information as the data is stored in unstructured Excel files across the organization.
  • Complex decision-making based on many parameters
    Their current system of Excel Scorecard is showing its limits as the number of indicators is increasing.

Therefore, they request your help deploying a product to monitor suppliers’ KPIs and automate the selection.

Let’s see how data analytics can answer this problem.

Sustainable Sourcing Indicators

This article will use a simple example with a limited range of indicators.

  • Environmental Indicators including Carbon Emissions, Energy Consumption, Waste Generation and Water Usage
  • Social Indicators, including labour and human rights

We can quickly review them to understand their definition and how to measure them.

Carbon emissions
The primary indicator that will drive your sustainability roadmap is the supply chain's carbon footprint.

This can be measured by calculating the total emissions of greenhouse gases such as carbon dioxide, methane and nitrous oxide.

💡 What should we consider?

  • Emissions during the cultivation of the cotton used for your t-shirts
  • Emissions from electricity generation and chemicals used for t-shirt production
  • Transportation of the finished goods (t-shirt) from the factory to the central distribution centre by sea freight

The first two metrics should be collected from the suppliers, but your team can calculate the third one.

The formula is quite straightforward

Formula using Emission Factor for Transportation — (Image by Author)
With,
E_CO2: emissions in kilograms of CO2 equivalent (kgCO2eq)
W_goods: weight of the goods (Ton)
D: distance from your warehouse to the final destination(km)
F_mode: emissions factor for each transportation mode (kgCO2eq/t.km)

The data sources are

  • Transportation Management System databases with shipments including (dates, source, destination, shipment quantity)
  • Master Data to convert shipment quantities into weight

If you need more information, I have detailed the process in this article

Energy consumption
Another indicator is the amount of energy the supplier consumes to produce and deliver products.

It can be expressed in Joules used per production unit (functional units in the Life Cycle Assessment Methodology).

💡 What should we consider?

  • Energy consumption for cotton cultivation (kJ)
  • Energy consumption for spinning, weaving and dying (kJ)
  • Energy consumption for transportation (kJ)

The sourcing team should collect this metric.

Water usage
As it becomes a scarce resource, new regulations are pushing companies to redesign their processes for water consumption reduction.

With this indicator, you can focus on suppliers that do not affect the local water supply.

💡 What should we consider?

  • Water consumption for cotton cultivation (L/Unit)
  • Water consumption for spinning, weaving and dying (L/Unit)

Waste generation
Measuring the amount of waste generated during production can help identify areas for adopting more sustainable production methods.

💡 What should we consider?

  • Solid waste generation during production (kg/Unit)

Social indicators
These indicators assess suppliers’ performance in terms of social sustainability.

You can conduct your audits or use third-party assessments.

💡 How to Measure it?
Your sourcing team assigns a weight to each criterion in the scorecard below.

Table showing the evaluation of a supplier based on six criteria: Labor rights, Health and safety, Freedom of association, Discrimination, Environmental impact, and Human Rights. Each criterion is assigned a weight (in percentage) and a score (out of 100). The total weighted score for the supplier is 77/100, with Labor rights and Health and Safety being weighted the highest at 25% and 20%, respectively used for sustainable sourcing.
Example of Score Cards — (Image by Author)

The auditors calculate a score linked to each criterion to represent the supplier’s performance.

The total score is a pondered average of the scores that provide a general supplier assessment.

🏫 Discover 70+ case studies using data analytics for supply chain sustainability🌳and business optimization 🏪 in this: Cheat Sheet

How do we organize data collection and storage?

The first support you can provide to your sourcing team is to streamline the data collection and storage process using data analytics tools.

A data lake can be a central source of unstructured and structured data (databases, excel files, …).

Diagram illustrating the flow of data from various systems such as production management, utility bills, and waste inventory into a centralized data lake. The data includes production output, energy usage, water usage, and waste generation, along with supplier data like cotton purchased and emissions. The diagram also shows how routing data, fuel consumption, and shipment emissions are collected to create life cycle assessment that would feed sustainable sourcing algorithms.
Data Lake for Reporting — (Image by Author)

You can implement automated pipelines to extract, process, and analyze this data to build reports and provide insights.

This can lead to the implementation of data products that can be used for reporting and the optimization models I will present in the next section.

Diagram depicting the resources used and waste generated across a supply chain. The top half shows the resources consumed at each stage, including raw materials, energy, and fuel for transportation. The bottom half highlights the waste generated and carbon dioxide emissions produced at each stage, from production to delivery to stores.
Measure the environmental footprint along your value chain — (Image by Samir Saci)

This article about life cycle assessment provides more information on the best practices and methodologies for automating this exercise.

Sourcing Network Design

Now that we have collected information about suppliers and performed audits with scoring, we can use this data to drive decisions.

The sourcing team’s main obstacle was the problem’s complexity, considering the number of different metrics and suppliers.

A simplified flow of collecting, auditing, and selecting suppliers based on environmental and social performance metrics. Key metrics include CO2 emissions, water and energy usage, and waste generation. The diagram also shows auditing processes and supplier selection criteria based on performance scores used for sustainable sourcing.
Supplier Selection Process — (Image by Author)

They use Excel files with limited automation; this restrains their processing capacity.

Therefore, the idea is to use advanced analytics to automatically design sourcing and logistics networks to deliver your stores.

A global map illustrating the parameters of the sustainable sourcing problem. We have supply capacity for low- and high-capacity manufacturing sites, alongside sustainability metrics such as CO2 emissions, energy usage, and water consumption by country. It highlights the production costs, freight costs, and environmental footprint for different manufacturing locations worldwide.
Example of Sustainable Sourcing Problem — (Image by Author)

I will use an example with dummy data to understand how to implement a data-driven methodology to select suppliers considering environmental factors.

Problem Statement

You want to redefine the Supply Chain Network of your fast fashion retail company for the next five years.

A donut chart representing the distribution of data in percentage. The largest section, in blue, represents 57.2%, followed by a green section at 34.7%, an orange section at 1.8%, a red section at 3.0%, and smaller purple and pink sections at 3.3%. This chart could be part of an analysis showing the breakdown of supply chain factors such as production sites, environmental impact, or supplier contributions within a retail company’s network.

You have stores in 5 different markets worldwide (USA, Germany, Japan, India and Brazil).

In each market, you can select a supplier to produce your t-shirts and choose to source from abroad.

Two world maps. The first map shows sales volumes per market, highlighting five regions (USA, Germany, Japan, India, Brazil) and their demand levels. The second map displays supplier locations in five countries with icons for factory capacity, fixed costs, and variable costs, helping visualize the global supply chain network.
Suppliers Network Design — (Image by Author)

You would like to select the right set of suppliers to minimize the cost of production and delivery considering.

  • The demand of each market (MUnits/Month)
  • Fixed and variable costs of production in each country ($)
  • Transportation costs from the factory to the market by sea freight ($/Unit)
A flowchart illustrating the production and freight costs from two countries (USA and India) to the USA market. It shows fixed and variable production costs and variable sea freight costs, emphasizing the cost of delivering 2.5 million units per month to the USA.
Total cost of production and delivery — (Image by Author)

Your overseas suppliers are cheaper, but you must pay high freight costs to deliver to your market with higher CO2 emissions for transportation.

A flowchart illustrating the production and freight costs from two countries (USA and India) to the USA market. It shows fixed and variable production costs and variable sea freight costs, emphasizing the cost of delivering 2.5 million units per month to the USA.
Fixed & Variable Costs by Country of Production — (Image by Author)

Moreover, the sustainability team has implemented constraints on the maximum environmental impact per T-shirt produced, including water usage, CO2 emissions, energy usage, and waste generated.

The sourcing team needs linear programming support to manage these complex tradeoffs and decide which suppliers to select.

Let me illustrate my point with the example below, in which I will gradually add environmental constraints to see the impact on the footprint.

Initial Solution: Minimize Costs

If you define the objective function to minimize the costs without additional constraints, you can easily guess that it will maximize the volume produced in India and Brazil.

A Sankey diagram showing the distribution of sourcing volumes across different suppliers for a fast-fashion company. The width of the flows between the sources and the market reflects the volume of production for each supplier. The colors correspond to different countries, representing how much each country supplies relative to others in the supply chain network for this solution of sustainable sourcing.
Goods flow from Production to Markets — (Image by Author)

💡 Insights

  • In this case, the increased freight costs of sourcing overseas are compensated by the low production costs of India and Brazil.
A world map displaying supply chain routes for sourcing t-shirts to two markets: one in the USA and one in Europe representating the initial solution of sustainable sourcing. Factories in Brazil, the USA, and Asia are connected by red and green lines representing the shipment paths. The red lines depict the sourcing routes with higher costs, while the green lines show routes with lower costs. A blue line represents a third factory supplying the USA.
Minimal Cost Globalization Solution — (Image by Author)

What if we want to add sustainability constraints?

Implement Sustainable Sourcing Constraints

As your company implements a sustainability roadmap, you must add additional constraints limiting the environmental impact per T-shirt produced and delivered.

Therefore, your procurement team has contacted local suppliers in your predominant markets (USA, Japan).

What would be the impact on your sourcing network?

Test 1: Energy and Water Savings Constraints

We can start by adding constraints on the average energy and water consumption per unit produced.

  • Average water consumption below 3000 L/Unit
  • Average energy usage below 685 MJ/Unit
A bar chart representing the environmental impact of energy and water consumption. It displays various energy and water-saving scenarios using green bars on the left and blue bars on the right. The chart demonstrates the changes in consumption when implementing energy and water-saving constraints as constaints of the sustainable sourcing problem.
Energy and Water Consumption — (Image by Author)

Your suppliers in Germany and Japan heavily invested in modern equipment to reduce the environmental impact of their operations.

A Sankey diagram showing the flow of energy and water consumption across different factories, emphasizing energy-saving measures. The diagram uses colored flows in red, green, yellow, and blue to represent various levels of energy and water consumption, illustrating how the constraints on these resources affect the sourcing network when implementing water and energy constaints in the sustainabe sourcing problem.
Test 1 Solution — (Image by Author)

💡 Insights

  • The model switched production from flow Brazil to Germany to respect the additional constraints.
  • The model decided to keep producing in India because of the competitive costs.
  • Germany and Japan are now localizing their production.

What if we add CO2 emissions constraints?

Test 2: Add CO2 Emissions Constraints

Transportation accounts for a considerable part of the CO2 footprint of your Supply Chain.

Table comparing CO2 emissions between different countries for transportation routes. Each cell represents the emissions value (in kg CO2) associated with shipping goods from one country (listed in rows) to another (listed in columns). This matrix format helps visualize the environmental impact of logistics choices between USA, Germany, Japan, Brazil, and India that will be used by the linear programming model for sustainable sourcing.
CO2 emissions of production and transportation — (Image by Author)

Therefore, you would like to reduce the impact of upstream flows by limiting the CO2 emissions of

  • Cultivation and Production per unit kg CO2e/Unit
  • Transportation by sea freight kg CO2e/Unit
  • Total Emissions must be below 110 kg CO2e/Unit
A Sankey diagram showing the solution of minimizing CO2 by illustrating the distribution of supply chain flows. The different colors represent various sourcing locations, with the width of the flows correlating to the environmental impact of each sourcing decision. The diagram visualizes the shift in supply strategy when CO2 emissions constraints are introduced, redistributing production volumes among the five locations.
Test 2 Solution — (Image by Author)

💡 Insights

  • For the first time, India has the lowest production output.
  • Brazil is a solution to keep manufacturing costs low while limiting CO2 emissions.
  • USA suppliers have not invested enough in sustainable production facilities to be retained for the domestic market supply.

With this simple exercise, you can test several scenarios to determine the potential of these solutions to accelerate the decision-making process.

If you want to try it yourself, I have deployed this solution in a web application hosted on the VIKTOR platform.

Sustainable Supply Chain Optimization App — (Image by Author)

For more details about this solution, have a look at this article introducing the app and its functionalities.

💡 Follow me on Medium for more articles related to 🏭 Supply Chain Analytics, 🌳 Sustainability and 🕜 Productivity.

Conclusion

When implementing a green transformation, companies face many challenges related to change management and complex decision-making processes.

As a data analytics professional, you can support this transformation by facilitating data collection and partially automating decision-making using optimization models.

Sourcing teams can easily add constraints to their supplier selection process by following the guidance of the sustainability team.

A comparison of sourcing routes under three different objective functions: minimizing costs, optimizing energy and water usage, and reducing CO2 emissions. Each map shows sourcing routes from factories in Brazil, India, and Europe to markets in North America and Asia, with Sankey diagrams below showing the flow of goods and cost changes due to sustainability constraints.
Scenarios with Different Objective Functions — (Image by Author)

By showing the financial impact of each scenario, you facilitate the decision-making process by highlighting the balance between sustainability and cost efficiency.

3 maps of sourcing routes illustrating different sourcing strategies. The leftmost image shows the initial solution focused on minimizing costs. The middle image includes energy and water usage constraints, while the right image includes CO2 emissions constraints. Beneath each map, Sankey diagrams illustrate the flow of goods and the relative costs associated with each scenario. Text indicates a 12% and 18% increase in costs due to sustainability constraints to illustrate sustainable sourcing.
Conclusion — (Image by Author)

These results can trigger strategic discussions to find the right balance between sustainable sourcing and cost reduction.

A three-part pyramid displaying the principles of Environmental, Social, and Governance (ESG) in sustainable sourcing. The Environmental section highlights factors like carbon footprint reduction and energy efficiency. The Social section emphasizes fair wages, health and safety, and labor laws. The Governance section includes corporate governance, compliance, and ethical business practices.
ESG Pillars Presentation — (Image by Author)

As stakeholders increasingly demand corporate social responsibility (CSR), ESG-led transformation has become a critical part of companies’ long-term strategies.

A visual representation of ESG principles with icons for Environmental, Social, and Governance. The Environmental icons include CO2 emissions and energy usage, while the Social icons represent fair wages and human rights. Governance icons include corporate accountability and ethical business practices.
Example of Reporting Categories — (Image by Author)

The methodologies (and the tool) introduced in this article can help your organization gain several kilotons of CO2 emissions to reach the targets of 2030 and improve your ESG score.

A similar approach to data collection and data-driven decision-making can also be used for this specific non-financial reporting.

I have published this article for more information on how data analytics can support ESG reporting.

About Me

Let’s connect on Linkedin and Twitter; I am a Supply Chain Engineer using data analytics to improve logistics operations and reduce costs.

For consulting or advice on analytics and sustainable supply chain transformation, feel free to contact me via Logigreen Consulting.

If you are interested in Data Analytics and Supply Chain, look at my website.

💡 Follow me on Medium for more articles related to 🏭 Supply Chain Analytics, 🌳 Sustainability and 🕜 Productivity.

--

--

Published in TDS Archive

An archive of data science, data analytics, data engineering, machine learning, and artificial intelligence writing from the former Towards Data Science Medium publication.

Written by Samir Saci

Top Supply Chain Analytics Writer — Case studies using Data Science for Supply Chain Sustainability 🌳 and Productivity: https://bit.ly/supply-chain-cheat