GPUs and Multicore CPUs and Parallel and Concurrent Processing

Icodewithben
4 min readFeb 1, 2024

--

GPUS and Multicore CPUs process information in Parallel to improve speed and efficiency. The GPU does this by using thousands of smaller cores and Pipelining instructions. Whereas the CPU uses multiple cores and and Parallel Systems, Pipelining in a processor to improve efficiency​. Watch a quick Craig and Dave Summary here 8. OCR A Level (H046-H446) SLR2–1.1 Multi-core & parallel systems (youtube.com)

CPU will work on different tasks independently whereas a GPU will do it in Parallel

Parallel Processing

Parallel processing refers to the method by which a system executes multiple operations simultaneously. CPUs achieve parallelism using multi-core processors where each core executes a different task at the same time.

GPUs take this further with hundreds or thousands of smaller cores, each running separate threads of a larger task, like rendering graphics, making them highly efficient for tasks that can be divided into many similar operations processed at the same time. They are able to Fetch, Decode and Execute thousands of instructions at the same time using pipelining as you can see below:

Pipelining allows GPUs to Fetch, Decode and Execute all at the same time making efficient use of compute cycles.

Concurrent Processing:

Concurrent processing is about dealing with multiple tasks by switching between them quickly, giving the impression that the tasks are being performed simultaneously. This can be managed by a single-core CPU by time-slicing where the CPU rapidly switches between tasks, or by multi-core CPUs where each core handles a different task, allowing different tasks to progress at the same time without necessarily advancing them all in the same instant.

Multi-core and parallel systems

Multi-core CPUs are able to distribute workload across multiple CPU cores, thus achieving significantly higher performance.

The IBM Blue Gene supercomputer has 4,098 processors, allowing 560 Teraflops of processing. Supercomputers are used on problems such as weather forecasting, running climate change models, processing Big Data or sequencing DNA.

Deep blue super computer, 4,098 processors, allowing 560 Teraflops

Many personal computers and mobile devices are dual-core or quad-core, meaning they have two or four processing chips.

The improvement in performance gained by using a multi-core processor is dependent on the software being able to take advantage of the parallel processing capabilities. Maximizing the usage of the computing resources provided by multi-core processors requires adjustments both to the operating system and to existing application software.

Graphics processing unit (GPU)

A GPU is a specialised electronic circuit which is very efficient at manipulating computer graphics and image-processing. Whereas a CPU has a few cores optimised for sequential serial processing, a GPU has a massively parallel architecture consisting of thousands of smaller, more efficient cores designed for handling multiple tasks simultaneously.

Nvidia RTX graphics cards has made it into one of the most valuable companies in the world right now in 2024, Tesla have been developing their own GPU super computer to do AI processing tasks as GPUs are good at Matrix maths that are used to calculate weights in neural networks.

Its highly parallel structure makes it suitable for tasks where the processing of large blocks of visual data is done simultaneously, i.e. in parallel. In a personal computer, a GPU may be present on a graphics card, or embedded on the motherboard. GPUs are now finding generalised uses in computers used for applications such as machine learning, oil exploration, image processing and financial transactions.

Comparison of the models

Part 1: Serial processing ​vs Parallel

This activity shows the difference between serial and parallel processing of instructions.

  1. ​Make 2 groups.
  2. One group has a student who writes down a sentence 10 times on their own.
  3. The OTHER group has a student who is an organiser CPU, who will tell the other students what to do. As a group they are required to write down a statement collectively 10 times.
  4. Time how long it takes them to write down the same statement 10 times, record these times on the board. ​
  5. Discuss the issues with this method, advantages/disadvantages, suitability of the parallel processing group and the serial processing group

--

--