High-Fidelity KV Cache Summarization Using Entropy and Low-Rank Reconstruction

by jchandra | View on Hacker News