python3: No module named lmdeploy.serve.turbomind.__main__;
#1
by
victorx98
- opened
Hello,
I'm running command: python3 -m lmdeploy.serve.turbomind ./workspace --server_name 127.0.0.1 --server_port 5001
But got the error message:
python3: No module named lmdeploy.serve.turbomind.main; 'lmdeploy.serve.turbomind' is a package and cannot be directly executed
What could be wrong?
Btw, running the chat mode is really fast. Probably around 30 token /s on RTX 3090. Took about 23GB VRAM, despite the model file size only 12GB
Thanks
You are using a wrong script. Where did you get the command?