OpenSourceRonin
commited on
Commit
β’
8e9b126
1
Parent(s):
2c33130
Update README.md
Browse files
README.md
CHANGED
@@ -1,12 +1,18 @@
|
|
1 |
---
|
2 |
-
title:
|
3 |
-
emoji:
|
4 |
-
colorFrom:
|
5 |
-
colorTo:
|
6 |
-
sdk:
|
7 |
-
|
|
|
|
|
|
|
|
|
8 |
---
|
9 |
|
|
|
|
|
10 |
# VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models
|
11 |
|
12 |
## TL;DR
|
|
|
1 |
---
|
2 |
+
title: VPTQ demo
|
3 |
+
emoji: π
|
4 |
+
colorFrom: blue
|
5 |
+
colorTo: green
|
6 |
+
sdk: gradio
|
7 |
+
sdk_version: 4.36.1
|
8 |
+
app_file: app.py
|
9 |
+
pinned: true
|
10 |
+
license: mit
|
11 |
+
short_description: Vector Post-Training Quantization (VPTQ) Demo
|
12 |
---
|
13 |
|
14 |
+
An example chatbot using [VPTQ](https://github.com/microsoft/VPTQ), [huggingface community](https://huggingface.co/spaces/VPTQ-community/).
|
15 |
+
|
16 |
# VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models
|
17 |
|
18 |
## TL;DR
|