Geek Culture

A new tech publication by Start it up (https://medium.com/swlh).

Understanding Google’s GPT Killer: The Pathways Architecture

Devansh
10 min read · Feb 18, 2023

This ability to explain jokes suggests that PaLM has a deeper grasp of meaning, sentence structure, and the relationships between words. If I had to guess, this is made possible by the attention mechanism combined with the sheer scale of the model.

Why Pathways is a revolution.

Keep in mind that the y-axis shows the improvement over SOTA (state of the art). This is all one model. Insane.

How Pathways takes inspiration from Neuroscience

Multi-Task Training

How is Pathways Training Different from Transfer Learning?

Multiple Senses

Sparse Activation

PaLM also translates code. They really made their model do everything.

Reach out to me
