Spaces:
Sleeping
Sleeping
Apply for community grant: Academic project (gpu)
#1
by
valcore
- opened
I'm a PhD student that works on lowering the computation requirements for LLMs by leveraging Early Exit techniques. This demo showcase a model where I trained Early Exit heads in order to do faster inference on easy token. I would like to have a community grant to show how we can accelerate LLMs through this technique. Thank you for considering !
Thank you for the grant ! 🥳