Accelerating Gemma 4: faster inference with multi-token prediction drafters 524 by amrrs | 236 comments on Hacker News.