Jaideep RayPerf model cardsModel cards are metadata for trained ML models that provide benchmarked evaluation and performance characteristics. It is an effective…Feb 121Feb 121
Jaideep RayCheckpointing for distributed training failuresChallenges in checkpointingDec 16, 2023Dec 16, 2023
Jaideep RayModel quality & compute budgetA scaling law is a mathematical formula that describes how properties of a system change with the size/scale of a system. Recent work in…Oct 16, 2023Oct 16, 2023