
In a new study, Apple researchers present a diffusion model that can write up to 128 times faster than its counterparts. Here’s how it works.
The nerdy bits
Here’s what you need to know for this study: LLMs such as ChatGPT are autoregressive models. They generate text sequentially, one token at a time, taking into account both the user’s prompt and all previously generated tokens.
In contrast to autoregressive models, there are diffusion models. They generate multiple…








