Accelerating Gemma 4: faster inference with multi-token prediction drafters
by amrrs