Commit History

feat(hf_token): set hf token during build
493a5f1

yusufs commited on

fix(hf_token): export HF_TOKEN during build
c6efe6a

yusufs commited on

feat(download-model): add download model at runtime
fc30f26

yusufs commited on

feat(change-model): change to sail/Sailor-4B-Chat 89a866a7041e6ec023dd462adeca8e28dd53c83e
d90e4d6

yusufs commited on

feat(endpoint): add prefix /api on each endpoint
5f3bf21

yusufs commited on

feat(refactor): move the files to root
7935381

yusufs commited on

fix(run.sh): move files
6479dc6

yusufs commited on

feat(run.sh): add script for running openai server
ded2af7

yusufs commited on

fix(python): fix absolute path of python script
d2e0be1

yusufs commited on

fix(cmd): fix 'error: failed to solve: dockerfile parse error on line 19: unknown instruction: "python3",'
de6b236

yusufs commited on

feat(openai): VLLM OpenAI compatible server
147b3a2

yusufs commited on

feat(parse): parse output
b44271e

yusufs commited on

feat(response): should dict only
b41be20

yusufs commited on

feat(one-model): one model at a time
35decf8

yusufs commited on

fix(remove): use_cached_output is not an option
6b1968a

yusufs commited on

feat(max_model_len): reducing max_model_len for T4 support
c41cdb4

yusufs commited on

fix(half-precision): use half precision for T4
d51e450

yusufs commited on

fix(tensor_parallel_size): set to 1
84c6c4a

yusufs commited on

feat(cuda): add cuda information
2457cd7

yusufs commited on

fix(remove-params): Removing max_model_len
0ef012d

yusufs commited on

feat(sailor-chat): add sail/Sailor-4B-Chat with the same context length
586265c

yusufs commited on

feat(reduce-max-length): reduce maximum length
2425953

yusufs commited on

feat(t4-gpu): add t4 gpu capability
4998ce7

yusufs commited on

fix(expose-port): add EXPOSE in Dockerfile
6d19ece

yusufs commited on

fix(module): fix 'error module app' it should be 'main'
6a914f2

yusufs commited on

feat(first-commit): follow examples and tutorials
ae7cfbb

yusufs commited on

initial commit
1a7087e
verified

yusufs commited on