DeepMind’s Epistemic Neural Networks Enable Large Language Model Fine-Tuning With 50% Less Data
Large pretrained language models (LLMs) have emerged as the state-of-the-art deep learning architecture across a wide range of applications and have demonstrated impressive few-shot learning capabilities when transferred to new tasks. These models however generally require a fine-tuning process that entails costly additional…