Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
FAR AI
non-profit
https://far.ai/
FARAIResearch
AlignmentResearch
Request to join this org
AI & ML interests
Frontier alignment research to ensure the safe development and deployment of advanced AI systems.
Team members
9
spaces
1
Running
22
🔎
Tuned Lens
models
3272
Sort: Recently updated
AlignmentResearch/clf_wl_pythia-160m_s-2_adv_tr_gcg_t-2
Updated
6 days ago
•
99
AlignmentResearch/clf_wl_pythia-160m_s-1_adv_tr_gcg_t-1
Updated
6 days ago
•
107
AlignmentResearch/clf_spam_pythia-410m_s-0_adv_tr_gcg_t-0
Updated
6 days ago
•
138
AlignmentResearch/clf_pm_pythia-160m_s-1_adv_tr_gcg_t-1
Updated
6 days ago
•
125
AlignmentResearch/clf_pm_pythia-160m_s-0_adv_tr_gcg_t-0
Updated
6 days ago
•
127
AlignmentResearch/clf_pm_pythia-14m_s-4_adv_tr_gcg_t-4
Updated
6 days ago
•
120
AlignmentResearch/clf_pm_pythia-14m_s-2_adv_tr_gcg_t-2
Updated
6 days ago
•
117
AlignmentResearch/clf_imdb_pythia-31m_s-4_adv_tr_gcg_t-4
Updated
6 days ago
•
125
AlignmentResearch/clf_pm_pythia-70m_s-4_adv_tr_gcg_t-4
Updated
6 days ago
•
242
AlignmentResearch/clf_pm_pythia-410m_s-0_adv_tr_gcg_t-0
Updated
6 days ago
•
41
Expand 3272 models
datasets
14
Sort: Recently updated
AlignmentResearch/WordLength
Viewer
•
Updated
Aug 7
•
100k
•
6.86k
AlignmentResearch/Harmless
Viewer
•
Updated
Jul 29
•
86.6k
•
1.33k
AlignmentResearch/Helpful
Viewer
•
Updated
Jul 29
•
88.1k
•
2.98k
AlignmentResearch/StrongREJECT
Viewer
•
Updated
Jul 29
•
313
•
5.27k
AlignmentResearch/PasswordMatch
Viewer
•
Updated
Jul 29
•
100k
•
39.1k
AlignmentResearch/IMDB
Viewer
•
Updated
Jul 29
•
97.5k
•
33.7k
AlignmentResearch/EnronSpam
Viewer
•
Updated
Jul 29
•
62.3k
•
6.01k
AlignmentResearch/PasswordMatch-test
Viewer
•
Updated
Jul 26
•
50k
•
4
AlignmentResearch/WordLength-test
Viewer
•
Updated
Jul 26
•
100k
•
4
AlignmentResearch/StrongREJECT-test
Viewer
•
Updated
Jul 26
•
313
•
1.65k
Expand 14 datasets