WebGPU vs Pixel Streaming — A View From Afar

Daniel Burger
Published in The Startup · Feb 7, 2021 · 13 min read

Two new technologies for developing modern graphics-focused software are on the rise. WebGPU is the designated successor to WebGL and promises remarkable performance improvements. Pixel streaming, by contrast, goes in a completely different direction and is already actively used by the gaming industry.

In this article, we look ahead to the near future, sketch the top-level architecture of a hypothetical 3D application, and weigh the pros and cons of WebGPU versus pixel streaming from a developer’s perspective.

Figure 1: A computer with dual RADEON 64 graphics cards (Unsplash, 2020).

1. Introduction

A company that wants to develop software must decide at some point in the development process which technologies to use. Future viability is one factor to consider; if company XYZ writes an app in Perl, for example, it will probably struggle in 20 years to find new developers who can keep building on that technology (The HFT Guy, 2019).

The following sections examine two technologies that raise important questions for developing 3D applications over the next few years: In which direction should companies and developers invest? What are the advantages and disadvantages? And, above all, what exactly are these technologies?

2. WebGPU

Before we can talk about WebGPU, let us first have a look at WebGL, and why it no longer has a future.

2.1 The Slow Death of WebGL

Do not get me wrong; WebGL is a fantastic piece of technology! It allows developers to create interactive 2D and 3D graphics, something plain JavaScript alone cannot do. Unlike, say, Flash, it lets developers ship their content on the web without requiring users to install a plugin. WebGL 1.0 and WebGL 2.0 are implemented in nearly all major browsers and even on mobile phones.

Figure 2: WebGL 1.0 browser compatibility (Screenshot, Can I use, 2021).

Since technologies usually do not vanish overnight, WebGL will certainly stay around for many more years. Numerous applications and frameworks rely on WebGL, millions of people use them (Similarweb, n.d.), and hundreds of thousands of developers produce new software with WebGL every day (npm Inc., n.d.). This is entirely understandable, because developers can create remarkable things with a low-level graphics API like WebGL. The most famous example is Google Maps — especially Google Earth (Shankland, 2011). Applications like that were unattainable just a few years ago; today we take it for granted that we can interact with 3D content on a website — even on mobile devices!

WebGL gives developers the JavaScript bindings they need to access the GPU. However, the WebGL API — especially shader programming in the OpenGL Shading Language (GLSL) — is hard to use.
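
To give a sense of how low-level this is, here is a simplified taste of raw WebGL: merely compiling a single fragment shader already takes this much ceremony, and an actual triangle on screen additionally needs a vertex shader, buffer setup, attribute bindings and a draw call.

```javascript
// Minimal raw-WebGL sketch: compiling one fragment shader.
const canvas = document.querySelector('canvas');
const gl = canvas.getContext('webgl');

const fragmentSource = `
  precision mediump float;
  void main() {
    gl_FragColor = vec4(1.0, 0.0, 0.0, 1.0); // plain red
  }
`;

const shader = gl.createShader(gl.FRAGMENT_SHADER);
gl.shaderSource(shader, fragmentSource);
gl.compileShader(shader);
if (!gl.getShaderParameter(shader, gl.COMPILE_STATUS)) {
  throw new Error(gl.getShaderInfoLog(shader));
}
```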

Only a few developers — myself excluded — grasp the complexity of OpenGL and therefore also WebGL.

That is why frameworks like Three.js, Babylon.js and Playcanvas came into existence. These frameworks are essentially abstraction layers: your JavaScript talks to the framework, the framework talks to WebGL, WebGL talks to OpenGL, and OpenGL talks to the graphics card to render 2D/3D graphics in the browser. They are widely established and work great (Wappalyzer, n.d.). However, relying on a 10-year-old piece of technology has its downsides. Since its beginnings in 2011, WebGL has been based on OpenGL ES, a subset of the official OpenGL graphics API that comes with various limitations, such as the lack of 3D textures and the restriction to triangle-based meshes (The Khronos Group Inc., n.d.). Seen from a high level, WebGL is an API built on an understanding of GPU technology from around 15 years ago. Since then, many things have changed. One of the most notable changes is that modern GPUs usually rely on shared memory access to perform parallel computing far more efficiently than before (Cabello and Wallez, 2019) — and that is where WebGPU comes into play.

2.2 Shared Memory

To explain it simply, with my beginner’s understanding: shared memory is conceptually a place where each Arithmetic Logic Unit (ALU) can efficiently access results produced by other ALUs instead of calculating them on its own (Wallez, 2020). I like to think of shared memory as the hashmap in dynamic programming, which can improve the space-time trade-off: when traversing an input array, constant-time lookups give access to results from previous iterations instead of recalculating them each time.
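
To make that analogy concrete — and it really is only an analogy, since GPU shared memory is a hardware feature rather than a JavaScript object — here is a tiny memoisation sketch in which every iteration reads the previous result from a lookup table in constant time instead of recomputing it:

```javascript
// Loose analogy only: a cache ("shared memory") lets each step reuse results
// computed by earlier steps instead of recomputing them from scratch.
function runningAverages(values) {
  const prefixSums = new Map(); // the shared lookup table
  prefixSums.set(-1, 0);
  return values.map((value, i) => {
    // O(1) access to the previous sum instead of re-summing values[0..i-1]
    const sum = prefixSums.get(i - 1) + value;
    prefixSums.set(i, sum);
    return sum / (i + 1);
  });
}

console.log(runningAverages([2, 4, 6])); // [2, 3, 4]
```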

Figure 3: Visualisation of a modern GPU’s architecture.

Modern graphics card APIs such as Vulkan or Metal use concepts like shared memory — and many others — to optimise their performance. I like to think of comparing OpenGL with, say, Vulkan as comparing JavaScript with C: OpenGL — and therefore WebGL too — is slow, because the newer APIs sit closer to the actual hardware and are also optimised per platform (Hruska, 2015). OpenGL is simply not built for these new concepts, and many developers switched to Vulkan, Metal and co. because of them (shared memory among others) and the resulting performance gains. But why can’t we just create a new kind of OpenGL that can talk to all the modern graphics card APIs? Instead of porting an old native technology to the web, why not create a web API whose output all the modern native APIs understand — similar in spirit to how WebAssembly works for CPUs (Mozilla Foundation, 2021)? Well, that is precisely what the W3C intends to do with the new WebGPU browser standard.

2.3 A Group Effort: WebGPU

WebGPU is a group effort by Apple, Google, Microsoft and many more, currently being standardised by the W3C as the replacement for WebGL (World Wide Web Consortium, n.d.). Unlike WebGL, WebGPU is not a port of an existing native graphics API to the web. It is based on concepts from APIs such as Vulkan and aims to deliver high performance on modern GPUs by embracing their optimised architecture (Wallez, 2018). The idea is that developers write WebGPU-specific JavaScript, which can then be mapped onto whichever modern graphics card API the platform provides.
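
A minimal initialisation sketch — based on the draft specification at the time of writing, so the exact API surface may still change — looks roughly like this:

```javascript
// Minimal WebGPU setup sketch (draft API, subject to change).
async function initWebGPU() {
  if (!navigator.gpu) {
    throw new Error('WebGPU is not available in this browser');
  }
  // The adapter represents a physical GPU; the device is our logical handle to it.
  const adapter = await navigator.gpu.requestAdapter();
  const device = await adapter.requestDevice();

  // Work is recorded into command encoders and submitted to a queue,
  // instead of mutating a global state machine as in WebGL.
  const commandEncoder = device.createCommandEncoder();
  device.queue.submit([commandEncoder.finish()]);
  return device;
}
```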

Additionally, WebGPU is stateless and allows command reuse: groups of instructions can be recorded ahead of time, which makes sending them to the GPU far less expensive than with WebGL (Cabello and Wallez, 2019). At runtime, the application can switch between entire instruction groups with a single function call. Put briefly, the current implementation of WebGPU is a lot faster than WebGL — especially with many 3D object instances in a complex scene.
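
One concrete expression of this “record once, replay cheaply” idea in the draft API is the render bundle; the sketch below assumes that `device`, `pipeline` and `vertexBuffer` have already been created elsewhere:

```javascript
// Record a group of draw commands once into a render bundle (draft WebGPU API).
const bundleEncoder = device.createRenderBundleEncoder({
  colorFormats: ['bgra8unorm'],
});
bundleEncoder.setPipeline(pipeline);
bundleEncoder.setVertexBuffer(0, vertexBuffer);
bundleEncoder.draw(3); // e.g. one triangle
const bundle = bundleEncoder.finish();

// Every frame, replay the pre-recorded commands with a single call instead of
// re-issuing each state change and draw individually.
function drawFrame(renderPassEncoder) {
  renderPassEncoder.executeBundles([bundle]);
}
```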

Demo comparison of WebGL vs WebGPU; displaying a scene with 10’000 instances of a 3D tree object (Video, Babylon.js, 2019).

2.4 The Limitation Is the User

All in all, WebGPU sounds almost too perfect. There is just one thing to consider: We ship the full source code to the client without knowing their hardware. What if the user visits a super-advanced science visualisation made with modern WebGPU technologies but has only an old onboard graphics card from 5 years ago? Well, their fault. It is the same as if a user still uses Internet Explorer in 2021; the typical compatibility misery. However, what if users still need even more elaborate 3D graphics and complex visualisations? What if the requirement exceeds the average consumer hardware? It sounds impossible, but a new type of technology allows developers to tackle this issue.

3. Pixel Streaming

Pixel streaming (also called render streaming or remote rendering) makes it possible to stream the audio-visual output of software hosted in the cloud to the client (Antunes, 2020). The client does not need expensive hardware — only a good internet connection.

3.1 How Pixel Streaming Works

Figure 4: Top-down visual explanation of how pixel streaming works (Screenshot, Epic Games, 2019).

Basically, pixel streaming moves the heavy-lifting logic from the client to the server. The software runs on dedicated hardware (e.g. with dedicated GPUs), and its audio-visual output is streamed to the user. A thin layer of client code (such as JavaScript) is still shipped so the user can interact with the server-side software in real time (Epic Games Inc., n.d.). From a developer’s perspective, this makes pixel streaming the less accessible technology compared to WebGL/WebGPU, because it relies heavily on expensive hardware.
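
Conceptually, the client side can be as thin as a WebRTC connection: the rendered frames arrive as an ordinary video stream, and input events travel back over a data channel. Below is a rough sketch with signalling and the server side omitted (the channel name 'input' is made up for illustration):

```javascript
// Conceptual pixel-streaming client: video in, input events out.
const peerConnection = new RTCPeerConnection();

// Attach the server-rendered frames to a <video> element.
peerConnection.ontrack = (event) => {
  document.querySelector('video').srcObject = event.streams[0];
};

// Forward local input events to the rendering server.
const inputChannel = peerConnection.createDataChannel('input');
window.addEventListener('mousemove', (event) => {
  if (inputChannel.readyState === 'open') {
    inputChannel.send(JSON.stringify({ type: 'mousemove', x: event.clientX, y: event.clientY }));
  }
});
```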

Nevertheless, with pixel streaming, nearly everything seems possible! Developers can produce cutting-edge software that only runs on high-end graphics cards that users could never buy — let alone maintain — on their own.

However, that is precisely the problem: scalability. A company or developer needs access to this high-end hardware. Of course, there are Infrastructure as a Service (IaaS) providers like Amazon Web Services (AWS) and Google Cloud Platform (GCP), but there is still one significant limitation: cost (Amazon.com Inc., n.d.). Even with AWS, GCP and co. in the backpack, running software on the required high-end hardware is computationally expensive and consumes a lot of energy. Yes, Google and co. seem to have an almost endless amount of computing power available, but a freely accessible pixel streaming application would push it to a costly limit.
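
A back-of-envelope calculation illustrates the order of magnitude; the figures below are assumptions made purely for the sake of the argument, not quoted AWS or GCP prices:

```javascript
// Illustrative cost estimate only — all numbers are assumptions, not real prices.
const pricePerGpuHour = 1.0;      // assumed cost of one cloud GPU instance per hour
const hoursPerUserPerMonth = 40;  // assumed usage per active user
const activeUsers = 10000;        // assumed user base, one instance per user

const monthlyCost = pricePerGpuHour * hoursPerUserPerMonth * activeUsers;
console.log(`~$${monthlyCost.toLocaleString('en-US')} per month`); // ~$400,000 per month
```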

Figure 5: Project Anywhere (Screenshot, Microsoft Corporation, 2020).

There are working proofs of concept like Project Anywhere from NVIDIA, Microsoft and co., built on streaming-optimised graphics cards (Woodard and Young, 2020), and even shipping products like Google Stadia (Patterson, 2020). The huge price factor, however, means that a company cannot simply give a pixel streaming application away for free on the web; it needs a payment model for access. Otherwise, the company could ruin itself financially (imagine, for example, the cost of a DDoS attack on such a free-to-use application).

4. Hypothetical App: Virtual Offices

Now that we know the technologies and some of their pros and cons, let us imagine we want to build a 3D application in the near future. The application is called Hejm and aims to accommodate people from all over the world in a 3D virtual office. No matter what kind of business they run, Hejm provides them with the tools they need for their remote-only office. Users can enter the office via virtual reality or augmented reality headsets, or via a standard computer screen.

4.1 The Complexity of Pixel Streaming

In a pixel streaming scenario, we would write the Hejm software to run on a cloud GPU instance (e.g. an AWS EC2 GPU instance). We would then create a client side that only ships the interaction JavaScript needed to access the server-side logic via a real-time communication protocol. One thing to be aware of: for every connecting user, we would need to fire up an instance running the software. It is possible to stream the output of one instance to multiple clients (and they can even all interact with it), but that has drawbacks of its own. Multiplayer in this sense can — as of today — only be achieved with multiple instances (Epic Games Inc., n.d.), which pushes up the costs as well. We can, of course, optimise our software to serve more than one user per instance; nonetheless, there is still a vertical limit per GPU.
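
A rough sketch of the bookkeeping this implies — the names and the per-GPU user limit are hypothetical — could look like this:

```javascript
// Hypothetical sketch: each connecting user needs a GPU instance running the
// Hejm renderer, and each instance can only serve a limited number of users.
const MAX_USERS_PER_INSTANCE = 4; // assumed vertical limit per GPU

const instances = []; // { id, users: Set<string> }

function assignUser(userId) {
  let instance = instances.find((i) => i.users.size < MAX_USERS_PER_INSTANCE);
  if (!instance) {
    // In production this would call the cloud provider's API to boot a GPU VM —
    // which is exactly where the cost problem comes from.
    instance = { id: `gpu-instance-${instances.length + 1}`, users: new Set() };
    instances.push(instance);
  }
  instance.users.add(userId);
  return instance.id;
}
```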

Users should also be able to interact with each other in real time, which requires real-time communication between the server instances. The instances also need a single source of truth for updates within the virtual office: a synchronisation module that propagates changes to the scenes running on all the other instances. This means that if, for example, many people move around or do things that are visible to everyone else, even simple functionality can turn into a vast computational task and therefore become costly. On top of this, the virtual office should offer chat, video and audio call features, which are expensive in their own right.
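
A minimal sketch of such a synchronisation module, here using a WebSocket server as the single source of truth (all names are hypothetical, and a real implementation would need authentication, interest management and much more):

```javascript
// Hypothetical sync module: apply each update to a shared state and fan it out
// to every other connected render instance.
import WebSocket, { WebSocketServer } from 'ws';

const worldState = new Map(); // userId -> { x, y, z }
const server = new WebSocketServer({ port: 8080 });

server.on('connection', (socket) => {
  socket.on('message', (raw) => {
    const update = JSON.parse(raw.toString());
    worldState.set(update.userId, update.position);
    // Broadcast so the scenes on all other instances stay consistent.
    for (const client of server.clients) {
      if (client !== socket && client.readyState === WebSocket.OPEN) {
        client.send(JSON.stringify(update));
      }
    }
  });
});
```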

4.2 A Hybrid Solution

In a perfect hypothetical world, WebGPU would have >95% browser compatibility and almost every conventional device would support the new standard — very exciting times for Hejm! Hejm would offer different products, ranging from a free version that runs entirely in the browser via WebGPU to an enterprise version costing around $400/month per user. Companies that need the power of pixel streaming — e.g. for simulating manufacturing CAD models or the like — would probably be willing to pay extra for such a remote-first solution (at the very least they could save on PC hardware and office space for their employees). I would even go further and serve certain rooms in the virtual office exclusively via pixel streaming while the rest runs via WebGPU in the browser — a hybrid solution.
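
The switch between the two modes could boil down to simple feature detection; in the sketch below, `startLocalRenderer` and `startPixelStreamingSession` are hypothetical placeholders for the two rendering paths:

```javascript
// Hybrid entry point sketch: render locally with WebGPU where available,
// otherwise fall back to a pixel-streaming session.
async function startHejm() {
  if ('gpu' in navigator) {
    const adapter = await navigator.gpu.requestAdapter();
    if (adapter) {
      return startLocalRenderer(await adapter.requestDevice()); // hypothetical
    }
  }
  // No (capable) WebGPU available: stream the rendered output instead.
  return startPixelStreamingSession(); // hypothetical
}
```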

5. Conclusion

There is — as with many things — never a simple answer. So which technology should companies and developers use? I would say it depends. For high-end 3D software that can only run on high-end hardware, with limited access and a small user base: develop it with pixel streaming. For an application that needs to be served to many thousands or even millions of users for free, but does not require the most advanced features and visualisations: use WebGPU. If time allows it — and resources for research and development are available — I would encourage companies and developers to create a hybrid that satisfies the requirements of both worlds.

I also personally think that a hybrid solution is an optimal answer in many software-related cases.

I compare it with the discussion around Single-Page Applications (SPAs) versus Multi-Page Applications (MPAs): first, the entire page was created on the server and sent as plain HTML/CSS/JS to the client. Then came Angular, React and co., and the UI logic layer shifted to the client; the server was only there to handle and serve the data. However, this solution was suboptimal: small websites ended up sending megabytes of JavaScript just to display a simple UI (Tsarouva, 2019). This part of the web development industry is currently moving into a hybrid phase as well, with Server-Side Rendering (SSR) and Incremental Static Generation (ISG) — two terms that deserve an article of their own. In short, they pre-render the most critical parts of the software on the server, and everything that needs to be asynchronous is put together in the browser (Szczeciński, 2018).

6. Outlook

The entire topic of front end versus back end is an ongoing discussion in the web development and software industry, and I do not think there will ever be a clear winner. It will be exciting to see which of the two contenders in the graphics world becomes widely embraced — especially whether pixel streaming is here to stay and how it will overcome its high-price drawbacks. Maybe I am right about the hybrid solution, or perhaps an entirely new kind of technology will enter the scene?

Bibliography

Amazon.com Inc. (n.d.) “AWS Pricing Calculator” AWS Pricing Calculator [online] Available at https://calculator.aws/#/createCalculator/amazonEC2 (Accessed 05.02.2021)

Antunes, J. (2020) “Pixel Streaming in UE4: a solution for real-time distributed content” ProVideo Coalition [online] Available at https://www.provideocoalition.com/pixel-streaming-in-unreal-engine-real-time-distributed-content (Accessed 26.01.2021)

Babylon.js (2019) “Babylon.js WebGPU Tech Demo” Babylon.js YouTube Channel [online] Available at https://youtu.be/eYgkDymaNr8 (Accessed 26.01.2021)

Beaufort, F. (2019) “Get started with GPU Compute on the Web” Google Developers [online] Available at https://developers.google.com/web/updates/2019/08/get-started-with-gpu-compute-on-the-web (Accessed 05.02.2021)

Cabello, R., Wallez, C. (2019) “Next-Generation 3D Graphics on the Web (Google I/O’ 19)” Google Chrome Developers YouTube Channel [online] Available at https://youtu.be/K2JzIUIHIhc (Accessed 26.01.2021)

Can I use (n.d.) “Can I use… Support for WebGL” Can I use… Support tables for HTML5, CSS3, etc [online] Available at https://caniuse.com/?search=webgl (Accessed 26.01.2021)

Catuhe, D. (2019) “From WebGL to WebGPU: A perspective from Babylon js by David Catuhe” Seattle JS YouTube Channel [online] Available at https://youtu.be/A2FxeEl4nWw (Accessed 26.01.2021)

Epic Games Inc. (2019) “Unreal Engine 4.24 Release Notes” Unreal Engine Documentation [online] Available at https://docs.unrealengine.com/en-US/WhatsNew/Builds/ReleaseNotes/4_24/index.html (Accessed 05.02.2021)

Epic Games Inc. (n.d.) “Pixel Streaming” Unreal Engine Documentation [online] Available at https://docs.unrealengine.com/en-US/SharingAndReleasing/PixelStreaming/index.html (Accessed 26.01.2021)

Fatahalian, K. (2011) “How a GPU Works” Presentation PDF [online] Available at https://www.cs.cmu.edu/afs/cs/academic/class/15462-f11/www/lec_slides/lec19.pdf (Accessed 26.01.2021)

Gwertzman, J. (2020) “Pixel Streaming and the Growth of 3D outside of Gaming” Game Stack Blog — Microsoft Developer [online] Available at https://developer.microsoft.com/en-us/games/blog/pixel-streaming-and-the-growth-of-3d-outside-of-gaming (Accessed 01.02.2021)

Hruska, J. (2015) “Next-generation Vulkan API could be Valve’s killer advantage in battling Microsoft” ExtremeTech [online] Available at https://www.extremetech.com/gaming/200836-next-generation-vulkan-api-could-be-valves-killer-advantage-in-battling-microsoft (Accessed 05.02.2021)

Maier, F. (2021) “WebGPU-Path-Tracer” GitHub [online] Available at https://github.com/maierfelix/WebGPU-Path-Tracer (Accessed 05.02.2021)

Miglio, S. (2019) “Pixel Streaming: delivering high-quality UE4 content to any device, anywhere” Unreal Engine Blog [online] Available at https://www.unrealengine.com/en-US/blog/pixel-streaming-delivering-high-quality-ue4-content-to-any-device-anywhere (Accessed 05.02.2021)

Mozilla Foundation (2021) “WebAssembly Concepts — WebAssembly” MDN [online] Available at https://developer.mozilla.org/en-US/docs/WebAssembly/Concepts (Accessed 05.02.2021)

npm Inc. (n.d.) “three” npm [online] Available at https://www.npmjs.com/package/three (Accessed 01.02.2021)

Patterson, D. (2020) “Google Stadia: The IT infrastructure behind the gaming service” TechRepublic [online] Available at https://www.techrepublic.com/article/google-stadia-the-it-infrastructure-behind-the-gaming-service (Accessed 05.02.2021)

Shankland, S. (2011) “3D Web hits the big time: Google Maps on WebGL” CNET [online] Available at https://www.cnet.com/news/3d-web-hits-the-big-time-google-maps-on-webgl (Accessed 05.02.2021)

Similarweb (n.d.) “maps.google.com Traffic Statistics” Website Traffic — Check and Analyse any Website | SimilarWeb [online] Available at http://similarweb.com/website/maps.google.com (Accessed 05.02.2021)

Szczeciński, B. (2018) “What’s Server Side Rendering and do I need it?” Medium [online] Available at https://medium.com/@baphemot/whats-server-side-rendering-and-do-i-need-it-cb42dc059b38 (Accessed 05.02.2021)

The HFT Guy (2019) “Perl is dying quick. Could be extinct by 2023” The HFT Guy [online] Available at https://thehftguy.com/2019/10/07/perl-is-dying-quick-could-be-extinct-by-2023 (Accessed 01.02.2021)

The Khronos Group Inc. (n.d.) “OpenGL ES” OpenGL Wiki [online] Available at https://www.khronos.org/opengl/wiki/OpenGL_ES (Accessed 01.02.2021)

Tsarouva, M. (2019) “The Pros & Cons of Single Page Applications (SPAs)” iTech Blog [online] Available at https://www.itechart.com/blog/pros-cons-of-single-page-applications (Accessed 05.02.2021)

Wallez, C. (2018) “Intent to Implement: WebGPU” Google Groups — Chromium [online] Available at https://groups.google.com/a/chromium.org/g/blink-dev/c/dxqWTSvyhDg/m/1UDaFD17AQAJ?pli=1 (Accessed 01.02.2021)

Wallez, C. (2020) “WebGPU: Next-generation 3D graphics on the web (DevFest 2019)” Google Developer Groups YouTube Channel [online] Available at https://youtu.be/EhWvqaRDz5s (Accessed 26.01.2021)

Wappalyzer (n.d.) “Websites using three.js, reviews and alternatives” Wappalyzer: Technology lookup [online] Available at https://www.wappalyzer.com/technologies/javascript-graphics/three-js (Accessed 26.01.2021)

Woodard, T., Young, S. (2020) “Streaming Simulation and Training Applications with Project Anywhere” NVIDIA Developer Blog [online] Available at https://developer.nvidia.com/blog/streaming-simulation-and-training-applications-with-project-anywhere (Accessed 26.01.2021)

World Wide Web Consortium (n.d.) “WebGPU Editor’s Draft” WebGPU — GitHub Pages [online] Available at https://gpuweb.github.io/gpuweb (Accessed 26.01.2021)


Developing neuroinformatics software at the Blue Brain Project and running AI on brain organoids at FinalSpark.