Kisu Yang
ksyang
AI & ML interests
None yet
Recent Activity
View all activity
Organizations
ksyang's activity
Some weights of the model checkpoint at /models/DeepSeek-V3_bf16 were not used when initializing DeepseekV3ForCausalLM
3
#62 opened about 1 month ago
by
Bobcuicui
HeaderTooLarge
1
#1 opened 8 months ago
by
hyerong
![](https://cdn-avatars.huggingface.co/v1/production/uploads/665fc78ed2ed9163a3a3e55c/omC7HVfE6LMMnqwveDCo9.jpeg)
Adding `safetensors` variant of this model
#1 opened over 1 year ago
by
SFconvertbot
![](https://cdn-avatars.huggingface.co/v1/production/uploads/635fd4cc14657fb8cff2a081/GDkyDwAcuqDBpaOvQgJuq.png)