Base64 Encoding Explained: What It Is and Why You Need It

Ahmed Elkourani
7 min read · May 9, 2023


Photo by Alexander Sinn on Unsplash

If you’ve ever worked with binary data such as images, audio, or documents, you may have come across the term “Base64 encoding”. But what is Base64, and why do we need it?

In this story, we’ll explore the basics of Base64 encoding, how it works, and some common use cases.

What is Base64?

Base64 is a binary-to-text encoding scheme that represents binary data in an ASCII string format. It is commonly used in computer systems to transmit data over channels that can handle only text, such as email or HTTP.
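As a quick illustration, here is a minimal sketch of a round trip using Python's standard `base64` module. The sample bytes are an arbitrary choice for this demonstration (the 8-byte signature that starts every PNG file), not data from the image discussed below.

```python
import base64

# Arbitrary binary data: the 8-byte signature that starts every PNG file.
raw = bytes([0x89, 0x50, 0x4E, 0x47, 0x0D, 0x0A, 0x1A, 0x0A])

encoded = base64.b64encode(raw)    # bytes object containing only ASCII characters
text = encoded.decode("ascii")     # plain str, safe for text-only channels
decoded = base64.b64decode(text)   # back to the original bytes

print(text)            # iVBORw0KGgo=
print(decoded == raw)  # True
```

Several of the raw bytes are not printable ASCII, yet the encoded form uses only letters, digits, and the padding character.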

Consider a PNG file that depicts the ‘Linux Tux’ image. Encoded with the Base64 scheme, it becomes a long string of ASCII text, which makes it simple to embed the image into text-based environments such as HTML or CSS code, or into an API request payload.



For instance, you can use the encoded image directly in the src attribute of the <img> element in HTML, eliminating the need for a separate image file. Here’s an example:

<!DOCTYPE html>
<html>
<body>

<h1>Linux Tux</h1>

<img src="data:image/png;base64,..." alt="Linux Tux" width="500" height="600">

</body>
</html>
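To produce the value of such a src attribute yourself, the image bytes are Base64-encoded and prefixed with the `data:` scheme and a MIME type. The sketch below assumes Python; the helper name `to_data_uri` and the placeholder bytes are hypothetical and stand in for a real image file.

```python
import base64

def to_data_uri(data: bytes, mime_type: str = "image/png") -> str:
    """Return a data URI that embeds `data` as Base64 text."""
    payload = base64.b64encode(data).decode("ascii")
    return f"data:{mime_type};base64,{payload}"

# Placeholder bytes standing in for a real image. In practice you would read
# the file instead, e.g. open("tux.png", "rb").read() (hypothetical path).
fake_png = bytes([0x89, 0x50, 0x4E, 0x47, 0x0D, 0x0A, 0x1A, 0x0A])

uri = to_data_uri(fake_png)
img_tag = f'<img src="{uri}" alt="Linux Tux">'
print(img_tag)
# <img src="data:image/png;base64,iVBORw0KGgo=" alt="Linux Tux">
```

For a real image the string is far longer, which is why this approach tends to suit small images best.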

Base64 gets its name from the fact that it uses a set of 64 unique characters to represent binary data: 26 uppercase letters, 26 lowercase letters, 10 digits, and two special characters (‘+’ and ‘/’). The ‘=’ character is not part of this set; it is an extra character used only for padding at the end of the output.
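The alphabet can be written out and checked in a few lines. This is a sketch in Python; the constant name `ALPHABET` is chosen here for illustration.

```python
import base64
import string

# Index 0-25: A-Z, 26-51: a-z, 52-61: 0-9, 62: '+', 63: '/'
ALPHABET = string.ascii_uppercase + string.ascii_lowercase + string.digits + "+/"

print(len(ALPHABET))                                          # 64
print(ALPHABET[0], ALPHABET[26], ALPHABET[52], ALPHABET[63])  # A a 0 /

# Cross-check against the standard library: pack the 6-bit values 0..63
# into 48 bytes and encode them. The result should be the alphabet itself.
packed = 0
for value in range(64):
    packed = (packed << 6) | value
print(base64.b64encode(packed.to_bytes(48, "big")).decode("ascii") == ALPHABET)  # True
```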

Why do we need to encode binary data as text?

There are several reasons why we might need to encode binary data as text:

  1. Transmission: Many communication protocols, such as email and HTTP, are designed to handle text data. If we want to transmit binary data over these protocols, we need to encode it as text first.
  2. Display: Raw binary data such as images or audio can’t be placed directly inside a text-based document such as an HTML page or a CSS file. Once encoded as text, it can be embedded in the document and decoded by the program that renders it, such as a web browser.
  3. Compression: Base64 does not compress data; it makes the data about 33% larger. Once encoded, however, the data can pass through text-oriented pipelines, and general-purpose compression such as gzip applied to the resulting text typically recovers much of that overhead.
  4. Compatibility: Some systems or applications may only accept text data as input. By encoding binary data as text, we can make it compatible with these systems.
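Encoding as text is not free: every 3 bytes of input become 4 characters of output. The following sketch, assuming Python, measures that overhead and then passes the encoded text through `zlib`. The sample data is generated from SHA-256 digests purely to get deterministic, hard-to-compress bytes; it stands in for an already-compressed file such as a JPEG.

```python
import base64
import hashlib
import zlib

# Deterministic, effectively incompressible sample data (1000 * 32 bytes).
raw = b"".join(hashlib.sha256(i.to_bytes(4, "big")).digest() for i in range(1000))

encoded = base64.b64encode(raw)
compressed = zlib.compress(encoded)

print(len(raw))         # 32000
print(len(encoded))     # 42668, i.e. 4 * ceil(32000 / 3)
print(len(compressed))  # smaller than len(encoded)

# The whole chain is lossless.
print(base64.b64decode(zlib.decompress(compressed)) == raw)  # True
```

The compressed size is left unstated on purpose, since it depends on the zlib version and settings. The point is only that the Base64 text shrinks when compressed, because each character carries 6 bits of information in an 8-bit byte.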

How does Base64 encoding work?

Base64 encoding works by converting binary data into a text format that can be easily transmitted or stored as text. The encoding process involves dividing the binary data into 24-bit groups (3 bytes) and then converting each group into a 4-character string. Here’s a step-by-step explanation of how Base64 encoding works:

  1. Divide the binary data into 24-bit groups (3 bytes each). If the length of the data is not a multiple of 3 bytes, the last group is incomplete and is handled in step 6.
  2. For each 24-bit group, break it into four 6-bit units.
  3. Convert each 6-bit unit to its corresponding Base64 character according to the Base64 index table. The specific mapping between 6-bit units and Base64 characters is fixed and defined by the Base64 standard.
  4. Concatenate the four Base64 characters to form a 4-character string.
  5. Repeat steps 2–4 for each 24-bit group in the binary data.
  6. If the length of the binary data is not evenly divisible by 3 bytes, pad the last group with one or two zero bytes before splitting it, then replace the trailing characters that consist only of padding bits with “=”. One missing byte gives one “=” and two missing bytes give two, so the output length is always a multiple of 4.
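The steps above can be sketched as a small from-scratch encoder. This is an illustrative Python version that follows the steps literally, not how production libraries are written; the function name `encode_base64` is hypothetical, and the standard `base64` module is imported only to compare results.

```python
import base64

ALPHABET = "ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789+/"

def encode_base64(data: bytes) -> str:
    """Encode bytes as Base64 by following the six steps literally."""
    out = []
    for i in range(0, len(data), 3):
        chunk = data[i:i + 3]                 # step 1: take up to 3 bytes
        missing = 3 - len(chunk)              # 0, 1, or 2 bytes short
        chunk += b"\x00" * missing            # step 6: pad with zero bytes
        group = int.from_bytes(chunk, "big")  # the 24-bit group as an integer
        # steps 2-3: four 6-bit units, most significant first, mapped to characters
        chars = [ALPHABET[(group >> shift) & 0b111111] for shift in (18, 12, 6, 0)]
        if missing:
            chars[-missing:] = ["="] * missing  # step 6: padding-only units become '='
        out.append("".join(chars))            # step 4: one 4-character string
    return "".join(out)                       # step 5: repeated for every group

for sample in (b"abc", b"ab", b"A"):
    print(sample, encode_base64(sample), base64.b64encode(sample).decode("ascii"))
# b'abc' YWJj YWJj
# b'ab' YWI= YWI=
# b'A' QQ== QQ==
```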

For example, let’s encode the binary data “01100001 01100010 01100011” (which represents the ASCII characters “abc”) in Base64. Here’s how the encoding process works:

  1. Divide the binary data into 24-bit groups:
    01100001 01100010 01100011
  2. Convert each 24-bit group into four 6-bit units:
    011000 010110 001001 100011
  3. Convert each 6-bit unit to its corresponding Base64 character:
    Y W J j
  4. Concatenate the four Base64 characters to form a 4-character string:
    YWJj

The resulting Base64 string for “abc” is “YWJj”.

As a second example, this time showing the padding character ‘=’ in use, let’s encode the binary data “01000001” (which represents the letter “A” in ASCII) in Base64.

  1. Divide the binary data into 24-bit groups. Because the data is only 1 byte long, which is not a multiple of 3 bytes, pad the end of the data with two zero bytes:
    01000001 00000000 00000000
  2. Convert each 24-bit group into four 6-bit units:
    010000 010000 000000 000000
  3. Convert each 6-bit unit to its corresponding Base64 character. The first two units map to “Q”. The last two consist entirely of padding bits, so they are written as ‘=’ rather than ‘A’, one ‘=’ for each zero byte that was added:
    Q Q = =
  4. Concatenate the four Base64 characters to form a 4-character string:
    QQ==

So the Base64-encoded value of “A” is “QQ==”.
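Both worked examples can be reproduced with a short tracing helper that prints the intermediate 6-bit units and table indexes. This is a sketch in Python; `trace_group` is a hypothetical name and handles a single group of 1 to 3 bytes.

```python
ALPHABET = "ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789+/"

def trace_group(data: bytes):
    """Return (6-bit units, table indexes, characters) for one group of 1-3 bytes."""
    if not 1 <= len(data) <= 3:
        raise ValueError("expected 1 to 3 bytes")
    missing = 3 - len(data)
    # Write the zero-padded group as a 24-character string of bits.
    bits = "".join(f"{byte:08b}" for byte in data + b"\x00" * missing)
    units = [bits[i:i + 6] for i in range(0, 24, 6)]
    indexes = [int(unit, 2) for unit in units]
    chars = [ALPHABET[i] for i in indexes]
    if missing:
        chars[-missing:] = ["="] * missing  # padding-only units become '='
    return units, indexes, chars

for sample in (b"abc", b"A"):
    units, indexes, chars = trace_group(sample)
    print(sample, " ".join(units), indexes, "".join(chars))
# b'abc' 011000 010110 001001 100011 [24, 22, 9, 35] YWJj
# b'A' 010000 010000 000000 000000 [16, 16, 0, 0] QQ==
```

Index 0 in the table is the letter “A”, so without the padding rule the second example would come out as “QQAA”, which would decode to three bytes instead of one.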

What are some common use cases for Base64 encoding?

Base64 encoding is commonly used for encoding binary data in a way that can be transmitted over channels that only support ASCII characters, such as email or HTTP headers. Some common use cases for Base64 encoding include:

  1. Email attachments: Email was originally designed to carry 7-bit ASCII text, so binary data such as images, audio, or documents needs to be converted to text before it can be sent as an attachment. Email clients such as Gmail handle the conversion automatically when you attach a file.
  2. Web images: Base64 encoding is commonly used for embedding images into web pages without requiring a separate image file. This is achieved by converting the image into a Base64-encoded string, which can be directly embedded into the HTML code. Since HTML is a text-based format, the image data can be transmitted as text and rendered directly by the web browser.
  3. API requests: APIs typically use text-based data formats such as JSON, which cannot carry raw binary data. To include binary data in such a request, it first needs to be encoded as text, for example with Base64, and placed in the request payload. For example, let’s say you’re building a mobile app that allows users to upload photos to a server using an API. When a user selects a photo to upload, the photo is converted into a Base64-encoded string, which is then included in the API request payload. The server can then decode the Base64 string back into binary data and store the photo.
  4. Data storage: In some cases, binary data may need to be stored in a text-based format, such as a CSV file, which cannot hold raw binary data. For example, let’s say you have an image file that you want to store in a CSV file. You can encode the binary data as Base64 and store the resulting text in a column of the CSV file. When you want to retrieve the image data, you decode the Base64 text back into binary data. This approach can be useful when the data has to live in a file that is opened in a spreadsheet or text editor. However, it’s important to note that Base64 increases the size of the data: every 3 bytes of binary data become 4 characters of text, an increase of about 33%. This can lead to larger file sizes and slower processing times, especially for large amounts of binary data.
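  5. Authentication: Base64 encoding is often used in authentication schemes, such as Basic authentication for HTTP requests, where the username and password are joined with a colon, Base64-encoded, and sent in the Authorization request header. Base64 is an encoding, not encryption, so the credentials are only protected if the connection itself is encrypted.
  6. URL parameters: Some web applications use Base64 to pass binary data in URL parameters, since raw bytes can’t appear in a URL directly. Because the standard ‘+’, ‘/’, and ‘=’ characters have special meaning in URLs, a URL-safe variant that uses ‘-’ and ‘_’ in place of ‘+’ and ‘/’ is often used.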
  7. Encoding data for web services: Some web services may require data to be sent in a specific format, such as XML or JSON, and Base64 encoding may be required to encode binary data within those formats.
  8. Storing data locally: Mobile applications may need to store binary data locally, such as images or audio files, and Base64 encoding can be used to convert the binary data into a format that can be easily stored as text in a database or file.
  9. Image and file transfer: When transferring images or files between a device and a server, Base64 encoding can be used to convert the binary data into an ASCII string that can be easily transmitted over HTTP or other network protocols.
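A few of these use cases can be sketched with Python's standard library. The helper names and the JSON field names (`filename`, `data`) are hypothetical and chosen only for illustration; a real API defines its own payload format.

```python
import base64
import json

def basic_auth_header(username: str, password: str) -> str:
    """Build the value of an HTTP Basic Authorization header."""
    credentials = f"{username}:{password}".encode("utf-8")
    return "Basic " + base64.b64encode(credentials).decode("ascii")

def photo_payload(filename: str, photo: bytes) -> str:
    """Wrap binary data in a JSON document (field names are hypothetical)."""
    return json.dumps({"filename": filename,
                       "data": base64.b64encode(photo).decode("ascii")})

def photo_from_payload(payload: str) -> bytes:
    """Server side: recover the original bytes from the JSON document."""
    return base64.b64decode(json.loads(payload)["data"])

# Authentication (use case 5)
print(basic_auth_header("Aladdin", "open sesame"))
# Basic QWxhZGRpbjpvcGVuIHNlc2FtZQ==

# API request (use case 3): binary -> JSON text -> binary
photo = bytes([0x89, 0x50, 0x4E, 0x47])  # placeholder bytes, not a real photo
print(photo_from_payload(photo_payload("tux.png", photo)) == photo)  # True

# URL parameters (use case 6): the URL-safe variant swaps '+' and '/'
tricky = bytes([0xFB, 0xFF, 0xFE])
print(base64.b64encode(tricky).decode("ascii"))          # +//+
print(base64.urlsafe_b64encode(tricky).decode("ascii"))  # -__-
```

Anyone who sees the Authorization header can reverse it with a single `b64decode` call, which is why Basic authentication is normally used only over HTTPS.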

Conclusion

Base64 encoding is a method for representing binary data as text, which allows it to be transmitted and stored more easily and reliably. It is widely supported by many programming languages and systems, and it is a well-established standard for encoding binary data as text.

That’s all for this story :)
