Ceshine Lee · [Notes] (Ir)Reproducible Machine Learning: A Case Study · I just read this (draft) paper named “(Ir)Reproducible Machine Learning: A Case Study” (blog post; paper). It reviewed 15 papers that… · 3 min read · Jan 17, 2022
Ceshine Lee · How to Create a Documentation Website for Your Python Package · Use Sphinx to (semi-)automatically generate documentation from docstrings · 4 min read · Dec 23, 2021
Ceshine Lee in Veritable · [Notes] Gradient Checkpointing with BERT · A brief analysis of huggingface’s implementation · 3 min read · Jun 18, 2021
Ceshine Lee in Veritable · [Paper] Adafactor: Adaptive Learning Rates with Sublinear Memory Cost · Essential for fine-tuning T5 v1.1 and mT5 models · 5 min read · Apr 18, 2021
Ceshine Lee in Veritable · [Paper] Rethinking Cooperative Rationalization: Introspective Extraction and Complement Control · Building competitive self-explaining NLP models · 5 min read · Mar 24, 2021
Ceshine Lee in Veritable · Reducing the SentencePiece Vocabulary Size of Pretrained NLP Models · Useful for fine-tuning on a subset of available languages · 4 min read · Feb 22, 2021
Ceshine Lee in Veritable · [PyTorch Lightning] Log Training Losses when Accumulating Gradients · The global step is not what you think it is · 5 min read · Jan 23, 2021
Ceshine Lee in Veritable · Generating Synthetic Tabular Data Using GAN · A case study: detecting credit fraud · 5 min read · Dec 30, 2020
Ceshine Lee in Veritable · [Paper] Are We Really Making Much Progress? · A Worrying Analysis of Recent Neural Recommendation Approaches · 3 min read · Dec 9, 2020
Ceshine Lee in Veritable · Automatic Testing Your SQLite Database with Great Expectations · A great tool for eliminating pipeline debts · 5 min read · Nov 21, 2020