Homepage
Open in app
Sign in
Get started
Veritable
Towards human-centered data science. Consultancy: https://veritable.pw
Deep Learning
Python
R
Reading Lists
Data Analysis
Follow
[Notes] Gradient Checkpointing with BERT
[Notes] Gradient Checkpointing with BERT
A brief analysis of huggingface’s implementation
Ceshine Lee
Jun 18, 2021
[Paper] Adafactor: Adaptive Learning Rates with Sublinear Memory Cost
[Paper] Adafactor: Adaptive Learning Rates with Sublinear Memory Cost
Essential for fine-tuning T5 v1.1 and mT5 models
Ceshine Lee
Apr 17, 2021
[Paper] Rethinking Cooperative Rationalization: Introspective Extraction and Complement Control
[Paper] Rethinking Cooperative Rationalization: Introspective Extra...
Building competitive self-explaining NLP models
Ceshine Lee
Mar 23, 2021
Reducing the SentencePiece Vocabulary Size of Pretrained NLP Models
Reducing the SentencePiece Vocabulary Size of Pretrained NLP Models
Useful for fine-tuning on a subset of available languages
Ceshine Lee
Feb 22, 2021
[PyTorch Lightning] Log Training Losses when Accumulating Gradients
[PyTorch Lightning] Log Training Losses when Accumulating Gradients
The global step is not what you think it is
Ceshine Lee
Jan 23, 2021
About Veritable
Latest Stories
Archive
About Medium
Terms
Privacy
Teams