Hugging Face
12
YigeYuan
1t4chi
AI & ML interests
None yet
Recent Activity
liked a model 30 days ago: allenai/tulu-v2.5-dpo-13b-hh-rlhf
liked a model 30 days ago: allenai/tulu-2-dpo-13b
liked a model about 1 month ago: PKU-Alignment/beaver-7b-v1.0
Organizations
None yet
1t4chi's activity
liked 2 models 30 days ago
allenai/tulu-v2.5-dpo-13b-hh-rlhf • Text Generation • Updated Jun 14 • 36 • 1
allenai/tulu-2-dpo-13b • Text Generation • Updated May 17 • 1.73k • 20
liked a model about 1 month ago
PKU-Alignment/beaver-7b-v1.0 • Reinforcement Learning • Updated May 9 • 151 • 10
liked 3 datasets about 1 month ago
PKU-Alignment/PKU-SafeRLHF • Viewer • Updated Oct 18 • 164k • 3.87k • 120
PKU-Alignment/PKU-SafeRLHF-10K • Viewer • Updated Jul 20, 2023 • 10k • 191 • 62
unalignment/toxic-dpo-v0.2 • Viewer • Updated Jan 9 • 541 • 149 • 117
liked 2 models about 1 month ago
ChenmieNLP/Zephyr-7B-Beta-Helpful • Text Generation • Updated Oct 10 • 21 • 1
OEvortex/HelpingAI-9B • Text Generation • Updated about 1 month ago • 72 • 25
liked a dataset 2 months ago
rngusry/UltraFeedback-honesty-preferences • Viewer • Updated Aug 3 • 251k • 45 • 1
liked a dataset 3 months ago
rngusry/UltraFeedback-truthfulness-preferences • Viewer • Updated Jul 25 • 217k • 31 • 1
updated 3 datasets 3 months ago
1t4chi/ultrafeedback-binarized-processed • Viewer • Updated Sep 13 • 63.1k • 37
1t4chi/hh-rlhf-harmless-processed • Viewer • Updated Sep 13 • 44.8k • 37
1t4chi/hh-rlhf-helpful-processed • Viewer • Updated Sep 13 • 46.2k • 37
liked 2 models 3 months ago
jointpreferences/mistral_7b_sft_helpful • Text Generation • Updated Apr 2 • 489 • 1
GraySwanAI/Mistral-7B-Instruct-RR • Text Generation • Updated Jul 9 • 2.23k • 4
updated 5 models 6 months ago
1t4chi/zephyr-7b-DPOBS48-full • Updated Jun 13
1t4chi/zephyr-7b-DPOBS128-full • Text Generation • Updated Jun 13 • 11
1t4chi/Ber-shift-zephyr-7b-sft-full • Text Generation • Updated Jun 5 • 3
1t4chi/DPO-shift-2-zephyr-7b-sft-full • Text Generation • Updated Jun 5 • 4
1t4chi/DPO-shift-zephyr-7b-sft-full • Text Generation • Updated Jun 5 • 5