Post
Anchor Large Language Models: Up to 99% KV cache reduction!
paper: https://arxiv.org/pdf/2402.07616.pdf
paper: https://arxiv.org/pdf/2402.07616.pdf
Join the community of Machine Learners and AI enthusiasts.
Sign Upthis is actually amazing + very cool/interesting i'm very happy i found the paper and the models.
congratulations on the tencent collaboration , i'm looking forward to the future.