Commit History

support flash attn 2
d762e4c

x54-729 committed on

Create README.md
d619b7f
verified

ZwwWayne committed on

fix: add eoa into eos_token_id in chat to accelerate chat interface
0e5f375

ZwwWayne committed on

use bin instead of safetensors with max shard of 2GB
03da3f2

ZwwWayne committed on

fix(modeling): fix inference code
405ebfe

ZwwWayne committed on

initial commit internlm2-chat-7b model
d64cff5

ZwwWayne committed on

update gitattributes
5714f3f

ZwwWayne committed on

initial commit
5809a98

ZwwWayne committed on