Distilled directly from Llama, then fine-tuned with DPO
Junxiong Wang
JunxiongWang
AI & ML interests
Attention-Free Models / Subquadratic Language Models
Recent Activity
Updated a model about 1 month ago: JunxiongWang/Llama3.2-Mamba-3B-dpo
Updated a model about 1 month ago: JunxiongWang/Llama3.2-Mamba-3B-distill
Updated a model about 1 month ago: JunxiongWang/Llama3.2-Mamba2-3B-distill
Collections: 7
Models: 36
JunxiongWang/Llama3.2-Mamba-3B-dpo • Updated • 31
JunxiongWang/Llama3.2-Mamba-3B-distill • Updated • 59
JunxiongWang/Llama3.2-Mamba2-3B-distill • Updated • 282
JunxiongWang/Llama3.2-Mamba2-3B-dpo • Updated • 14
JunxiongWang/Llama3.1-Mamba2-8B-dpo • Updated • 7
JunxiongWang/Llama3.1-Mamba-8B-dpo • Updated • 7
JunxiongWang/Llama3.1-Mamba2-8B-distill • Updated • 24
JunxiongWang/Llama3.1-Mamba-8B-distill • Updated • 20
JunxiongWang/MambaByte_Stories • Text Generation • Updated • 14 • 1
JunxiongWang/MambaByte_Arxiv • Text Generation • Updated • 9 • 3