mithil shahIntroduction to scaling Large Model training and inference using DeepSpeedDeepSpeed is an open source (apache2 license) library that optimizes training and inference for foundation modelsApr 26, 20231Apr 26, 20231
mithil shahUnderstand CLIP (Contrastive Language-Image Pre-Training) — Visual Models from NLPThis article was originally published here.Oct 6, 2022Oct 6, 2022
mithil shahUnderstanding 540 billion parameter NLP Language Model PaLMSummary of how Pathways Language Model (PaLM) achieves SOTA results in NLP tasksApr 22, 2022Apr 22, 2022