Evan Pete Walsh

- Scaling up AllenNLP to 11B Parameter Models (Ai2 Blog, Oct 7, 2021). A deep dive into the challenges of large-scale training and the tools we used to get there.
- Python caching in GitHub Actions (Ai2 Blog, Sep 28, 2020). How to speed up slow Python builds in GitHub Actions with effective caching.
- Tutorial: training on larger batches with less memory in AllenNLP (Ai2 Blog, Sep 8, 2020). This is part of a series of mini-tutorials to help you with various aspects of the AllenNLP library.
- Tutorial: How to train with multiple GPUs in AllenNLP (Ai2 Blog, Aug 24, 2020). This is part of a series of mini-tutorials to help you with various aspects of the AllenNLP library.
- Tutorial: How to upload transformer weights and tokenizers from AllenNLP to HuggingFace (Ai2 Blog, Aug 14, 2020). This is the first of a series of mini-tutorials to help you with various aspects of the AllenNLP library.
- Incorporating a copy mechanism into sequence-to-sequence models (Sep 10, 2019). This post explains the details behind the CopyNet model from Gu et al. (1). If you just want to see the code, you can check out my…
- Sequence-to-sequence models with a dash of reinforcement learning (Structurely Engineering, Mar 6, 2019). Practical training techniques for optimizing sequence-level objectives.