Inference-Time Intervention (ITI) Models
Collection
A collection of Llama models with Inference-Time Intervention (Li et al.) applied to them. Codebase: https://github.com/likenneth/honest_llama
•
6 items
•
Updated
•
1