neural-chat-7b-v1-1 / attention.py

Commit History

add int8 model inference
b0e5659

changwangss commited on

add model.
dac7b46

lvkaokao commited on