Cost to deploy Llama 3 8B
#192, opened by MartinRojo
I need a cost estimate for a machine that can host Llama 3 8B and serve 10k inferences per second.