The process by which a model generates text one token at a time, from left to right, with each new token conditioned on the tokens already produced.
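
A minimal sketch of this token-by-token loop follows. The score table `TOY_SCORES`, the start and end markers `<s>` and `</s>`, and the functions `next_token` and `generate` are all hypothetical names invented for illustration. The toy table looks only at the most recent token and always picks the highest-scoring continuation (greedy selection, one of several possible strategies), whereas a real language model would typically score the next token using the whole prefix.

```python
from typing import Dict, List

# Hypothetical toy "model": maps the most recent token to scores for the next one.
# A real model would condition on the entire prefix, not just the last token.
TOY_SCORES: Dict[str, Dict[str, float]] = {
    "<s>": {"the": 0.9, "a": 0.1},
    "the": {"cat": 0.6, "dog": 0.4},
    "cat": {"sat": 0.7, "</s>": 0.3},
    "sat": {"</s>": 1.0},
}


def next_token(prefix: List[str]) -> str:
    """Return the highest-scoring next token for the given prefix (greedy choice)."""
    # Unknown tokens fall back to ending the sequence.
    scores = TOY_SCORES.get(prefix[-1], {"</s>": 1.0})
    return max(scores, key=lambda tok: scores[tok])


def generate(max_tokens: int = 10) -> List[str]:
    """Generate tokens left to right, appending one token per step."""
    tokens = ["<s>"]  # start-of-sequence marker (assumed convention)
    for _ in range(max_tokens):
        tok = next_token(tokens)
        if tok == "</s>":  # end-of-sequence marker stops generation
            break
        tokens.append(tok)  # the new token becomes part of the prefix
    return tokens[1:]  # drop the start marker


if __name__ == "__main__":
    print(" ".join(generate()))
```

Each pass through the loop extends the sequence by exactly one token, and the extended sequence is what the next step sees; generation stops at the end marker or when the `max_tokens` limit is reached.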