Keyboard shortcuts

Press or to navigate between chapters

Press S or / to search in the book

Press ? to show this help

Press Esc to hide this help

The Tokenizer

Neural networks operate on numbers, not words. The tokenizer is the bridge: it assigns every word in our vocabulary a unique integer, then translates sentences back and forth between text and number sequences.