This model was pretrained on top of meta/Llama-2-13b-chat-hf.
python -u gradio_demo.py \
    --base_model MosesLM-13B-chat \
    --lora_model MosesLM-13B-chat \
    --alpha 1 \
    --post_host 0.0.0.0 \
    --port 7777