The Fastest PCRE Compatible Regular Expression IP Core on Xilinx® Alveo™ Accelerator Card

INTRODUCTION

Fast analysis of unstructured textual data, such as system logs, network traffic, social media posts, emails, or news articles, is growing ever more important in technical and business data analytics applications. Nearly 85% of business data is in the form of unstructured textual logs. Rapidly extracting information from these text sources is critical for business decision making. GRegeX is an implementation of standard regular expression algorithm on FPGA chip achieving 12.8 GB/s throughput with a single IP core. Wide range of supported regular expression functions allows developers configure desired rules which can be handled in a chip without reducing the throughput.

SOLUTION OVERVIEW

The solution consists of two parts: Regular Expression IP core on the FPGA side and the drivers in Host side: The data sources of the solution can be the NIC of the server using Linux Kernel or DPDK library, the network interface available directly on the acceleration card or any application running on the Linux environment for feeding the GRegeX Drivers with the data.

Image for post
Image for post
FPGA acceleration architecture

SOLUTION DETAILS

The design occupies about 1/3 of the Xilinx Alveo U200 FPGA card resources making it available for further scaling and achieving greater throughput to support more than one 100G network interfaces with the single chip.

Specification of GRegeX:

Image for post
Image for post
GRegeX specification

The real footprint on the FPGA chip:

Image for post
Image for post
Design footprint on FPGA chip

Configuration file allows developers to customize the settings of the design and tune it for specific workloads at the expense of FPGA resource utilization. Host driver has embedded functionality to check the entered regular expression rules and determine any errors in the rules.

Current design can find the sophisticated, variable length regular expression patterns including: +, * (Kleene operations), |, ?(alternate operation), () -groups and [a..z] (character classes). The design also resolves collisions which arises if two neighboring regex rules have overlapped matching.

KEY BENEFITS

  1. 12.8 GB/s throughput with a single core

RESULTS

GRegeX achieves 12.8 GB/s throughput regardless of the regular expression rule set while software implementation speed decreases when using more complex regex rules such as brackets and repeat symbols.

Image for post
Image for post
Comparison with software implementation

Note: Results shown above are for Xilinx® Alveo™ U200 card

Regular expressions are commonly used functions in the applications such us: DNA analysis, content extraction, packet inspection, security and log text analysis and many more. The nature of the regex algorithm does not allow to parallel its components in a way that powerful GPUs or multi-core processors can benefit. Meanwhile, flexibility of FPGAs in sense of parallelism, pipelining, memory distribution architecture with a well designed algorithm solves text processing problems and turns FPGAs into irreplaceable device for this kind of applications.

Learn more about the product by visiting Xilinx®

Learn more about Grovf, Inc.

Learn more about Xilinx Alveo Accelerator Cards

grovf

Grovf is an application performance acceleration company…

Artavazd Khachatryan

Written by

grovf

grovf

Grovf is an application performance acceleration company through FPGA-CPU pairs, focusing on the development of basic programming algorithms on FPGA and creating the universal offloading platform in the application layer.

Artavazd Khachatryan

Written by

grovf

grovf

Grovf is an application performance acceleration company through FPGA-CPU pairs, focusing on the development of basic programming algorithms on FPGA and creating the universal offloading platform in the application layer.

Medium is an open platform where 170 million readers come to find insightful and dynamic thinking. Here, expert and undiscovered voices alike dive into the heart of any topic and bring new ideas to the surface. Learn more

Follow the writers, publications, and topics that matter to you, and you’ll see them on your homepage and in your inbox. Explore

If you have a story to tell, knowledge to share, or a perspective to offer — welcome home. It’s easy and free to post your thinking on any topic. Write on Medium

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store