Published in


The Life of a Numba Kernel: A Compilation Pipeline Taking User Defined Functions in Python to CUDA Kernels

Numba is the Just-in-time compiler used in RAPIDS cuDF to implement high-performance User-Defined Functions (UDFs) by turning user-supplied Python functions into CUDA kernels — but how does it go from Python code to CUDA kernel? In this post, we’ll take a look at Numba’s compilation pipeline.




RAPIDS is a suite of software libraries for executing end-to-end data science & analytics pipelines entirely on GPUs.

Recommended from Medium

CS373 Spring 2022: Sanchith Shanmuga

When it comes to online businesses, there is no exception

5 Things I’ve Learnt from Being a Software Developer for 15 Years

Final sprint of a web Bootcamp

Phone with text: Eat, sleep, code, repeat

10 reasons, why developers think python is finest for AI?

Link Preview Customizations On Dynamic Websites

How To Connect to Knack Databases from Excel & Power BI

WordPress Co-Founder On Why You Should Open-Source Your Life

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store
Graham Markall

Graham Markall

At NVIDIA, RAPIDS.ai — Numba developer.

More from Medium

The Intuition Behind Graph Convolutions and Message Passing

About Those Transformers for Tabular Data …

A Machine Learning Template based on Pytorch Lightning

The AutoTest Framework Makes the Operator Alignment Task for Deep Learning Frameworks Easy