Update README.md
Browse files
README.md
CHANGED
@@ -25,19 +25,6 @@ We concentrate on understanding the relationship between internal structure in n
|
|
25 |
|
26 |
3. **Geometry of Program Synthesis (GPS)**: Applying SLT to study inductive biases, advancing our understanding of how to predict and measure alignment-relevant risks.
|
27 |
|
28 |
-
## Notable Achievements
|
29 |
-
|
30 |
-
- Established developmental interpretability as a concrete application of SLT to alignment
|
31 |
-
- Developed scalable new measuring tools like the Local Learning Coefficient (LLC)
|
32 |
-
- Validated that SLT can make accurate predictions about real-world AI systems
|
33 |
-
- Popularized SLT within the AI safety community through conferences, workshops, and collaborations
|
34 |
-
|
35 |
-
## Key Publications
|
36 |
-
|
37 |
-
- Quantifying Degeneracy in Singular Models via the learning coefficient (Lau et al. 2023)
|
38 |
-
- Estimating the Local Learning Coefficient at Scale (Furman and Lau 2024)
|
39 |
-
- The Developmental Landscape of In-Context Learning (Hoogland et al. 2024)
|
40 |
-
|
41 |
## Resources
|
42 |
|
43 |
- [DevInterp GitHub Repository](https://github.com/timaeus-research/devinterp)
|
|
|
25 |
|
26 |
3. **Geometry of Program Synthesis (GPS)**: Applying SLT to study inductive biases, advancing our understanding of how to predict and measure alignment-relevant risks.
|
27 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
28 |
## Resources
|
29 |
|
30 |
- [DevInterp GitHub Repository](https://github.com/timaeus-research/devinterp)
|