Contents
Transformer

Overview of the Transformer architecture: parallel compute, attention, masking, and more.
