-
AmberYifan/llama2-7b-sft-ultrachat-safeRLHF
Text Generation • Updated • 370 -
AmberYifan/mistral-v0.1-7b-sft-ultrachat-safeRLHF
Text Generation • Updated • 279 -
AmberYifan/Mistral-7B-v0.3-sft-ultrachat-safeRLHF
Text Generation • Updated • 73 -
AmberYifan/Gemma-2-9B-sft-ultrachat-safeRLHF
Text Generation • Updated • 74
Yifan Wang
AmberYifan
AI & ML interests
None yet
Recent Activity
updated
a model
30 minutes ago
AmberYifan/Mistral-7B-v0.1-sft-ultrachat-dpo-10k
updated
a model
about 1 hour ago
AmberYifan/Mistral-7B-v0.1-sft-ultrachat-spin-10k
updated
a model
about 4 hours ago
AmberYifan/Mistral-7B-v0.1-sft-ultrachat-gen-dpo-10k
Organizations
Collections
2
This collection contains safetyQA dataset for safe SPIN training and trained models
models
184
AmberYifan/Mistral-7B-v0.1-sft-ultrachat-dpo-10k
Updated
AmberYifan/Mistral-7B-v0.1-sft-ultrachat-spin-10k
Text Generation
•
Updated
AmberYifan/Mistral-7B-v0.1-sft-ultrachat-gen-dpo-10k
Text Generation
•
Updated
AmberYifan/Mistral-7B-v0.3-Mistral-7B-v0.1-mix
Text Generation
•
Updated
•
24
AmberYifan/Mistral-7B-v0.1-Mistral-7B-v0.3-mix
Text Generation
•
Updated
•
27
AmberYifan/Mistral-7B-v0.3-Llama-3.1-8B-mix
Text Generation
•
Updated
•
25
AmberYifan/Llama-3.1-8B-Mistral-7B-v0.3-mix
Text Generation
•
Updated
•
15
AmberYifan/Mistral-7B-v0.1-Llama-3.1-8B-mix
Text Generation
•
Updated
•
26
AmberYifan/Llama-3.1-8B-Mistral-7B-v0.1-mix
Text Generation
•
Updated
•
26
AmberYifan/Mistral-7B-v0.3-Llama-2-7b-mix
Text Generation
•
Updated
•
22
datasets
25
AmberYifan/mistral-v0.1-spin-hhrlhf
Viewer
•
Updated
•
5.5k
•
3
AmberYifan/sft-spin-filter
Updated
•
3
AmberYifan/sft-spin-kcenter-5k
Viewer
•
Updated
•
5.5k
•
7
AmberYifan/gsm8k-sft
Viewer
•
Updated
•
8.79k
•
2
AmberYifan/sft-spin-v
Viewer
•
Updated
•
50.5k
•
10
AmberYifan/safeRLHF-SFT
Viewer
•
Updated
•
83.4k
•
10
AmberYifan/SPIN-trans-DPOformat
Viewer
•
Updated
•
55k
•
2
AmberYifan/spin-v-diverse
Viewer
•
Updated
•
55k
•
5
AmberYifan/dpo-v
Viewer
•
Updated
•
55k
•
3
AmberYifan/spin-v
Viewer
•
Updated
•
55k
•
6