hf-llm-api / apis /chat_api.py

Commit History

:zap: [Enhance] Add new model yi-1.5-34b, and remove unavailable models
b9a642a

Hansimov commited on

:gem: [Feature] New auth_api_key, and catch Exception and response
97b108f

Hansimov commited on

:zap: [Enhance] Use nous-mixtral-8x7b as default model
214fb7b

Hansimov commited on

:gem: [Feature] ChatAPIApp: Enable chat with pro models
06e3150

Hansimov commited on

:recycle: [Refactor] Rename gpt-3.5 to gpt-3.5-turbo
d36d623

Hansimov commited on

:gem: [Feature] Enable gpt-3.5 in chat_api
cd6b52a

Hansimov commited on

:recycle: [Refactor] Rename message_streamer to huggingface_streamer
3125c87

Hansimov commited on

:recycle: [Refactor] Configure api info with config.json
deca16d

Hansimov commited on

:recycle: [Refactor] Use tclogger, and configure envs better
babcd78

Hansimov commited on

:recycle: [Refactor] Move AVAILABLE_MODELS(_DICTS) to constants
40ba0ea

Hansimov commited on

:gem: [Feature] Support new model: openchat-3.5-0106
9f5d69c

Hansimov commited on

:gem: [Feature] New model enabled: gemma-7b
77b5a47

Hansimov commited on

:gem: [Feature] Add top_p and set 0.95 as default
403b8cf

Hansimov commited on

:gem: [Feature] Add use_cache parameter, and set default temperature to 0.5
e2b245b

Hansimov commited on

:zap: [Enhance] Display schema of /api/v1 for user-friendly compatibility
a2d3414

Hansimov commited on

:gem: [Feature] ChatAPIApp: New /readme route
6e2fad5

Hansimov commited on

:zap: [Enhance] ChatAPIApp: Only include schema for non-prefix route
06a233d

Hansimov commited on

:gem: [Feature] New model enabled: nous-mixtral-8-7b
4a28ca5

Hansimov commited on

:zap: [Enhance] Align /models response with openai format
245d9fd

Hansimov commited on

:zap: [Enhance] add /api/v1 prefix for hf banning and ui compatability
6d3fdc0

Hansimov commited on

:zap: [Enhance] Remove openchat-3.5 from available models
06c8d58

Hansimov commited on

:boom: [Fix] Error 422 when max_tokens is null
a54e7a6

Hansimov commited on

:zap: [Enhance] Auto calculate max_tokens if not set
1b9f698

Hansimov commited on