Monday, October 14, 2024

New best story on Hacker News: The quiet art of attention

The quiet art of attention
741 by billwear | 263 comments on Hacker News.


New best story on Hacker News: Accelerating Gemma 4: faster inference with multi-token prediction drafters

Accelerating Gemma 4: faster inference with multi-token prediction drafters 524 by amrrs | 236 comments on Hacker News.