Commit History

fix sampling in chat stream
13750ae

JustinLin610 committed on

fix slow long sequence inference
e326b6c

yangapku committed on

remove unused generation configs
43fe75e

yangapku committed on

remove unused generation configs
3cfc7b7

yangapku committed on

Update modeling_qwen.py
d25b5f8

JustinLin610 committed on

update tokenization and readme
acdaf68

yangapku committed on

deprecate argument stream in model.chat()
ff3a904

yangapku committed on

update support for flash attn
e3edce3

yangapku committed on

fix chat streaming
2db302e

yangapku committed on

add support for flash attn 2
50ea631

yangapku committed on

shard the pytorch_model.bin into parts
5c611a5

yangapku committed on

update readme
68e72e3

yangapku committed on

fix kwargs in generate method and update readme
193987f

yangapku committed on

refactor tokenization and update readme
69bd8ac

yangapku committed on

update config and streaming generation
04df5dd

yangapku committed on

update config about model precision, fix apply_rotary_pos_emb
26fad65

yangapku committed on

revert convert_tokens_to_string
f2e5005

yangapku committed on

update quickusage and example
7f6821c

yangapku committed on

update readme and fix convert_tokens_to_string
53c9efa

yangapku committed on

support cpu inference, format file (#9)
cbf815e

JustinLin610 committed on

fix decoder, and provide an option to remove attack rejection (#8)
f6498e5

JustinLin610 committed on

Update Readme.md, fix typo
62bf1c6

logicwong committed on

Update modeling_qwen.py, fix logn bug
f157e4e

logicwong committed on

Update config.json
4e41480

logicwong committed on

update fast usage
44e46a0

yangapku committed on

update humaneval score
d0db884

yangapku committed on

update fast usage
f59a0a9

yangapku committed on

Upload wanx_colorful_black.png
e03d77d

majx13 committed on

implement _convert_id_to_token
10173d4

yangapku committed on

fix flash-attention usage
405556d

yangapku committed on

Update README.md
1a2571e

yangapku committed on

update tokenization_qwen.py
3f9f12c

yangapku committed on

fix report link
5e7f6a3

yangapku committed on

update demo link
09644a4

yangapku committed on

update tokenization_qwen.py
9882935

yangapku committed on

update react example
8504b36

yangapku committed on

disable hosted inference API
946cbc6

yangapku committed on

update readme
9f47669

yangapku committed on

add resource files
4658aaa

yangapku committed on

upload pytorch_model.bin
498baa4

yangapku committed on

Create README.md
55ab57f

yangapku committed on

initial commit
bfbac77

yangapku committed on