Lucas de Lima Nogueira in Towards Data Science: Recreating PyTorch from scratch (with GPU support and automatic differentiation). Build your own deep learning framework based on C/C++, CUDA and Python, with GPU support and automatic differentiation! 24 min read · May 14, 2024
Lucas de Lima Nogueira in Towards Data Science: Why Deep Learning Models Run Faster on GPUs: A Brief Introduction to CUDA Programming. For those who want to understand what .to("cuda") does. 15 min read · Apr 17, 2024
Lucas de Lima Nogueira: Scaling Deep Learning Models in Production for millions of users. For those who want to go beyond Flask+Heroku. 16 min read · Jul 22, 2023
Lucas de Lima Nogueira: How to run distributed multinode training in practice. Tutorial for multinode training using PyTorch, Slurm and AWS. 14 min read · May 17, 2023