tool: automatically download llama.cpp and model files; run chat completions server (win, cuda) 179c68f verified leafspark commited on May 22