Edit model card
README.md exists but content is empty. Use the Edit model card button to edit it.
Downloads last month
11
Inference API
Input a message to start chatting with mit-han-lab/Llama-3-8B-Instruct-QServe-W8A8.
This model can be loaded on Inference API (serverless).