ThinK: Thinner Key Cache by Query-Driven Pruning

ThinK: Thinner Key Cache by Query-Driven Pruning