hf-llm-api / apis /chat_api.py

Commit History

:recycle: [Refactor] Rename gpt-3.5 to gpt-3.5-turbo
d36d623

Hansimov commited on

:gem: [Feature] Enable gpt-3.5 in chat_api
cd6b52a

Hansimov commited on

:recycle: [Refactor] Rename message_streamer to huggingface_streamer
3125c87

Hansimov commited on

:recycle: [Refactor] Configure api info with config.json
deca16d

Hansimov commited on

:recycle: [Refactor] Use tclogger, and configure envs better
babcd78

Hansimov commited on

:recycle: [Refactor] Move AVAILABLE_MODELS(_DICTS) to constants
40ba0ea

Hansimov commited on

:gem: [Feature] Support new model: openchat-3.5-0106
9f5d69c

Hansimov commited on

:gem: [Feature] New model enabled: gemma-7b
77b5a47

Hansimov commited on

:gem: [Feature] Add top_p and set 0.95 as default
403b8cf

Hansimov commited on

:gem: [Feature] Add use_cache parameter, and set default temperature to 0.5
e2b245b

Hansimov commited on

:zap: [Enhance] Display schema of /api/v1 for user-friendly compatibility
a2d3414

Hansimov commited on

:gem: [Feature] ChatAPIApp: New /readme route
6e2fad5

Hansimov commited on

:zap: [Enhance] ChatAPIApp: Only include schema for non-prefix route
06a233d

Hansimov commited on

:gem: [Feature] New model enabled: nous-mixtral-8-7b
4a28ca5

Hansimov commited on

:zap: [Enhance] Align /models response with openai format
245d9fd

Hansimov commited on

:zap: [Enhance] add /api/v1 prefix for hf banning and ui compatability
6d3fdc0

Hansimov commited on

:zap: [Enhance] Remove openchat-3.5 from available models
06c8d58

Hansimov commited on

:boom: [Fix] Error 422 when max_tokens is null
a54e7a6

Hansimov commited on

:zap: [Enhance] Auto calculate max_tokens if not set
1b9f698

Hansimov commited on

:boom: [Fix] Ignore invalid HF Token
8ab8ca6

Hansimov commited on

:zap: [Enhance] Update available models description
c769be6

Hansimov commited on

:zap: [Enhance] Support provide api_key with os env HF_TOKEN
bc384a3

Hansimov commited on

:gem: [Feature] Support call hf api with api_key via HTTP Bearer
2da6968

Hansimov commited on

:gem: [Feature] Support no-stream mode with dict response
d2b20f2

Hansimov commited on

:boom: [Fix] Suppress ping message by increasing ping interval
489b65b

Hansimov commited on

:zap: [Enhance] New models: mistral-7b and openchat-3.5
e916990

Hansimov commited on

:zap: [Enhance] Default models and max_new_tokens
6aa8b86

Hansimov commited on

:gem: [Feature] New ArgParser: Specify server ip, port and running mode
e28221f

Hansimov commited on

:gem: [Feature] New ChatAPIApp: Enable fastapi for openai format api call
3a09006

Hansimov commited on