The Transformer Architecture

Multi-head attention, positional encoding, encoder-decoder, feed-forward layers.

Part of Transformers & NLP on neo-ai.

Browse all neo-ai courses · Back to course overview