Huan XuSparse GEMM and Tensor Core’s Structured SparsityThe world of scientific computing and deep neural networks is abuzz with the term sparse general matrix multiplication (spGEMM). But what…Jan 19Jan 19
Huan XuA Trip to Kernels: Understanding PyTorch’s Internal ArchitectureIf you’re here, you know that PyTorch is one of the most popular libraries among deep learning practitioners. It is highly efficient, and…Jul 8, 20232Jul 8, 20232
Huan XuStream OpenAI with FastAPI and Consuming it with React.jsProblem StatementMar 29, 20233Mar 29, 20233
Huan XuinAWS TipWarm a Vercel-hosted Next.js Website with Cloudflare WorkersWhen I hosted a Next.js-based ChatGPT clone on Vercel using the open-source project chatbot-ui for my parents in China, I noticed a…Mar 29, 2023Mar 29, 2023