Update README.md
README.md
CHANGED
@@ -7,13 +7,18 @@ sdk: static
 pinned: false
 ---
 
-##
+## Who are we?
 
-We
+We are a group of hackers from the Stanford NLP group and are interested in LLM interpretability.
 
+`pyvene` is where we started, which stands for *Py*torch model inter*vene*tion.
+
 ## Resources
 
-**
+**Supervised dictionary learning release for Gemma 2 2B and 9B: [`AxBench Collection`](https://huggingface.co/collections/pyvene/axbench-release-6787576a14657bb1fc7a5117).**
 
-
+**Benchmark interpretability methods at scale (AxBench) library: [`AxBench`](https://github.com/stanfordnlp/axbench).**
+
+**Representation finetuning (ReFT) library: [`pyreft`](https://github.com/stanfordnlp/pyreft).**
+
+**Generic model intervention library: [`pyvene`](https://github.com/stanfordnlp/pyvene).**