👨‍💻
myHN
Top
New
Best
Ask
Show
Job
GateGPT: 56k tokens per second Transformer (KV cache) on FPGA at 80 MHz
by laxmena |
View on Hacker News