badayvedat
commited on
Commit
•
f248b35
1
Parent(s):
5df3ede
Update recommended configurations
Browse files
app.py
CHANGED
@@ -345,7 +345,7 @@ ONLY WORKS WITH GPU!
|
|
345 |
You can load the model with 8-bit or 4-bit quantization to make it fit in smaller hardwares. Setting the environment variable `bits` to control the quantization.
|
346 |
|
347 |
Recommended configurations:
|
348 |
-
| Hardware | A10G-Large (24G) | T4-
|
349 |
|-------------------|------------------|-----------------|------------------|
|
350 |
| **Bits** | 8 (default) | 4 | 16 |
|
351 |
|
|
|
345 |
You can load the model with 8-bit or 4-bit quantization to make it fit in smaller hardwares. Setting the environment variable `bits` to control the quantization.
|
346 |
|
347 |
Recommended configurations:
|
348 |
+
| Hardware | A10G-Large (24G) | T4-Small (16G) | A100-Large (40G) |
|
349 |
|-------------------|------------------|-----------------|------------------|
|
350 |
| **Bits** | 8 (default) | 4 | 16 |
|
351 |
|