From Gaming to Genius: Your primer on GPUs that are powering the AI boom

9 min readSep 24, 2024

1. Introduction

Imagine trying to find a friend in a crowded stadium by yourself — it would take ages. But if you had hundreds of people helping you search, you’d find them much faster. This is similar to how Graphics Processing Units (GPUs) have transformed computing, especially in artificial intelligence (AI). Once known only to gamers seeking better graphics, GPUs are now the unsung heroes making smart technologies possible. Let’s dive into the journey of GPUs, how they’ve evolved, who makes them, what they’re made of, and how the global race for GPUs is shaping the future.

2. A Brief History of GPUs

The Early Days: Simple Graphics Helpers

In the late 1970s and 1980s, computers began to have basic graphics capabilities, but these were limited to simple images and text. As video games and graphical user interfaces became more popular, there was a need for better graphics processing.

The Birth of the GPU

In 1999, NVIDIA, a company founded just six years earlier in California, introduced the term “GPU” with its GeForce 256 processor. This was one of the first processors dedicated to handling all the calculations needed for rendering complex graphics, like those in 3D video games.

Evolution Over Time

2000s: GPUs became more powerful, supporting more realistic graphics in games and professional applications like 3D modeling.
2010s: Developers realized GPUs could be used for more than graphics. Their ability to handle many tasks at once made them perfect for scientific calculations and, eventually, AI.
Today: GPUs are central to AI research, powering everything from voice recognition to self-driving cars.

How Far We’ve Come

From rendering simple 2D images to processing complex AI algorithms, GPUs have evolved dramatically. They’ve gone from being an optional component for gamers to an essential tool in modern computing.

3. What Are GPUs and CPUs?

The CPU: The Versatile Sprinter

A Central Processing Unit (CPU) is like a versatile sprinter, excellent at doing many different tasks quickly but one after another. It’s the main brain of your computer, handling everything from running applications to managing files.

The GPU: The Marathon Relay Team

A Graphics Processing Unit (GPU) is like a marathon relay team with thousands of runners, each taking a small part of the race simultaneously. Originally designed to render images and videos by handling many calculations at once, GPUs excel when the same type of operation needs to be performed repeatedly on large amounts of data.

4. Understanding GPU Fundamental Components

Before we explore how GPUs outshine CPUs in AI tasks, let’s break down the key parts of a GPU in simple terms.

Cores (The Workers)

Definition: Think of cores as individual workers in a factory.
GPU Cores: A GPU has thousands of small cores, often called CUDA cores (in NVIDIA GPUs) or stream processors (in AMD GPUs).
Role: Each core handles a tiny piece of the larger task, allowing the GPU to process many operations at the same time.

Memory (The Toolbox)

Definition: Memory in a GPU, often called Video RAM (VRAM), is like a toolbox where workers keep their tools and materials.
Role: It stores the data that the GPU cores need to access quickly, such as textures for images or matrices for AI computations.

Shaders (The Specialists)

Definition: Shaders are specialized programs that tell GPU cores how to process graphics or compute tasks.
Role: In AI, shaders can be programmed to perform specific calculations required by neural networks.

Clock Speed (The Pace Setter)

Definition: Clock speed is like the rhythm at which the workers operate.
Role: A higher clock speed means the GPU cores can perform more operations per second.

Memory Bandwidth (The Highway)

Definition: Memory bandwidth is like the number of lanes on a highway between the GPU cores and memory.
Role: Higher bandwidth allows more data to move quickly between the cores and memory, reducing bottlenecks.

Cooling System (The Air Conditioner)

Definition: GPUs generate heat, so they need cooling systems, like fans or heatsinks, to keep them from overheating.
Role: A good cooling system ensures the GPU runs efficiently without slowing down or getting damaged.

5. Who Makes GPUs and Where Are They Built?

Major GPU Manufacturers

NVIDIA: Founded in 1993 in California, NVIDIA is one of the leading GPU manufacturers. They introduced the term “GPU” and are known for their GeForce and Quadro series.
AMD (Advanced Micro Devices): Based in California, AMD acquired ATI Technologies in 2006, bringing GPUs like the Radeon series under their umbrella.
Intel: Primarily known for CPUs, Intel has entered the GPU market with its Intel Arc series.

Where Are GPUs Built?

While these companies design GPUs, the actual manufacturing (fabrication) is often done by specialized semiconductor foundries:

TSMC (Taiwan Semiconductor Manufacturing Company): Located in Taiwan, TSMC is one of the world’s largest semiconductor manufacturers and produces GPUs for companies like NVIDIA and AMD.
Samsung Electronics: Based in South Korea, Samsung also manufactures chips for various companies, including GPUs.

The Ingredients: What Are GPUs Made Of?

Semiconductor Materials: The primary ingredient is silicon, a semiconductor material that forms the basis of the GPU’s integrated circuits.
Transistors: Billions of tiny switches called transistors are etched onto the silicon wafer. These are the fundamental building blocks that make up the GPU cores and other components.
Copper and Aluminum: Used for wiring within the chip to connect different components.
Substrates and Packaging Materials: These materials protect the delicate silicon chip and allow it to connect to the computer’s motherboard.

The Manufacturing Process

Creating a GPU is a complex process involving:

Photolithography: Using light to etch intricate patterns onto the silicon wafer at a nanometer scale.
Doping: Adding impurities to the silicon to change its electrical properties.
Layering: Building the chip layer by layer to create the complex structures needed for processing.
Quality Control: Rigorous testing ensures each GPU functions correctly before it’s packaged and shipped.

6. A Simple AI Task: Recognizing Handwritten Numbers

Let’s say we want to teach a computer to recognize handwritten numbers — like the ones on mail envelopes. This involves training a neural network by showing it thousands of images so it can learn the patterns that make a “2” look different from a “3.”

7. How a CPU Handles the Task

Sequential Processing

A CPU tackles this by processing each image one after the other. Remember, it’s like a sprinter running multiple races back-to-back.

The Struggle

Processing thousands of images sequentially is time-consuming. It’s as if one person is trying to sort through a massive stack of mail alone — it would take days or even weeks.

8. How a GPU Tackles the Same Task

Parallel Processing Power

A GPU, with its thousands of cores (workers), can process many images at once. Each core handles a small part of the task simultaneously.

Speeding Up Learning

By utilizing its high memory bandwidth (the highway) and VRAM (the toolbox), the GPU quickly feeds data to its cores. The shaders (specialists) guide the cores on how to perform the necessary calculations.

The Result

Training that takes a CPU days can be completed in hours or even minutes on a GPU. It’s like having an entire team sorting the mail together, finishing the job much faster.

9. Why GPUs Are Perfect for AI

Massive Parallelism

The thousands of cores in a GPU allow it to perform many calculations at the same time, which is ideal for the repetitive computations in AI tasks.

Efficient Data Handling

The high memory bandwidth ensures data moves swiftly between the memory and cores, reducing delays.

Optimized Calculations

Shaders can be programmed for specific AI computations, making GPUs more efficient for these tasks than CPUs.

10. Real-World Impact: From Self-Driving Cars to Voice Assistants

Self-Driving Cars

GPUs process data from cameras and sensors in real-time, helping the car make instant decisions like when to stop or turn.

Voice Assistants

When you ask Siri or Alexa a question, GPUs help analyze your voice, understand the request, and find the answer quickly.

11. The Challenges GPUs Face

Energy Consumption

GPUs are powerful but can consume a lot of electricity, much like how a big truck uses more fuel than a small car.

Heat Generation

All that power generates heat. Without proper cooling systems (the air conditioners), GPUs can overheat, which can slow them down or cause damage.

Availability and Cost

Due to high demand, GPUs can be expensive and sometimes hard to find, which is like popular concert tickets selling out quickly.

12. The Race for GPUs: Big Tech and Global Supply

Big Tech Hoarding GPUs

In recent years, technology giants like Google, Amazon, Microsoft, and Meta have been investing heavily in AI research and services. To fuel their AI models and data centers, these companies require massive numbers of GPUs.

Bulk Purchasing: Big tech companies often buy GPUs in bulk directly from manufacturers, securing large portions of the supply.
Custom Solutions: Some companies work with manufacturers to create custom GPUs tailored to their specific needs.

Impact on Supply and Demand

Scarcity for Others: When big tech companies secure large GPU supplies, smaller businesses, researchers, and consumers may find it harder to obtain GPUs.
Price Increases: High demand and limited supply can drive up prices, making GPUs more expensive for everyone else.
Delay in Access: The time it takes for GPUs to become available on the consumer market can increase due to large pre-orders from big tech.

The Geographical Evolution of the Supplier Market

Diversifying Manufacturing Locations

Reducing Dependency: There’s a global push to reduce dependency on a single region for semiconductor manufacturing.
Investment in New Facilities: Companies like TSMC and Samsung are investing in building new fabs (fabrication plants) in different countries, including the United States and Europe.

Geopolitical Factors

Trade Relations: International trade policies and relations can impact where GPUs are manufactured and how they’re distributed.
Government Incentives: Some governments offer incentives to encourage domestic semiconductor manufacturing, aiming to bolster local tech industries.

Emerging Players

China’s Ambitions: Chinese companies are investing heavily in developing their own GPU technologies to become self-reliant.
Startups and Innovators: New companies are entering the GPU market with innovative approaches, potentially increasing competition and supply.

Future Implications

Improved Supply Chains: Diversification may lead to more resilient supply chains, reducing the risk of shortages.
Global Collaboration: International partnerships could accelerate technological advancements and make GPUs more accessible worldwide.
Ethical and Environmental Considerations: As manufacturing spreads, there may be increased focus on ethical sourcing of materials and environmentally friendly production methods.

13. The Future: What’s Next for GPUs and AI

Smarter and More Efficient GPUs

Manufacturers are developing GPUs with even more cores and better cooling systems, making them more powerful and efficient. Innovations like NVIDIA’s Tensor Cores are specifically designed to accelerate AI tasks.

Specialized AI Chips

Competition with GPUs: Companies are creating specialized chips like Google’s Tensor Processing Units (TPUs) that could challenge GPUs in AI tasks.
Complementary Technologies: Rather than replacing GPUs, these new chips might work alongside them to further boost performance.

Greater Accessibility

As technology advances, GPUs may become more affordable and energy-efficient, allowing more people to participate in AI development. Cloud computing services also offer access to powerful GPUs without the need for physical hardware.

14. Conclusion

From Rendering Images to Shaping Intelligence

GPUs have come a long way from their humble beginnings as simple graphics accelerators. Through innovation and the vision of companies like NVIDIA and AMD, they’ve evolved into the powerhouse processors that make modern AI possible.

Navigating the Global Landscape

The increasing demand for GPUs by big tech companies highlights the importance of these processors in our digital future. The evolving geographical landscape of GPU manufacturing reflects a world striving for technological advancement while balancing supply, demand, and geopolitical considerations.

Empowering the Future

By understanding the fundamental components of GPUs, their history, and how global dynamics affect their availability, we can appreciate the incredible role they play in AI and beyond. As GPUs continue to evolve, they hold the key to unlocking innovations we can only imagine.