Shangming Cai
Cheshire94
AI & ML interests
None yet
Organizations
None yet
Cheshire94's activity
[READ IF YOU DO NOT HAVE ACCESS] Getting access to the model
19
#130 opened about 1 month ago
by
osanseviero
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6032802e1f993496bc14d9e3/w6hr-DEQot4VVkoyRIBiy.png)
update README.md
1
#12 opened 6 months ago
by
Cheshire94
Update README of branch dev_triton.
2
#11 opened 6 months ago
by
Cheshire94
Add ApplyRoPE and RMSNorm kernels written in OpenAI Triton
1
#10 opened 7 months ago
by
Cheshire94
Does Qwen support 16k context, what is the best config for max_new_tokens?
2
#22 opened 11 months ago
by
Cheshire94
Error with dtype=torch.float16.
2
#10 opened 11 months ago
by
Cheshire94
Prompt template
18
#1 opened 11 months ago
by
monuminu