Does this work with vLLM?
#9, opened by nickandbro
I'm hoping to get this running on vLLM so I can make use of its CUDA graphs.