llama-cpp-api / Dockerfile

toaster61

it works!

e3396ba 11 months ago

	# Base image. I'm using Debian, but you can use whatever you want.
	FROM python:3.11.5-slim-bookworm

	# Run as root so the installs below have the permissions they need.
	USER root

	# Install the C/C++ build toolchain, then build llama-cpp-python (the main library) with OpenBLAS flags.
	RUN apt-get update && apt-get install -y gcc cmake build-essential
	RUN CMAKE_ARGS="-DLLAMA_BLAS=ON -DLLAMA_BLAS_VENDOR=OpenBLAS" pip install llama-cpp-python==0.1.78
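	# A hedged note on the flags above: they ask CMake to link against OpenBLAS, but this
	# Dockerfile never installs OpenBLAS itself. If the build log reports that BLAS was not
	# found, one thing to try is installing the development package BEFORE the pip step.
	# The package names below are assumptions for Debian bookworm, not part of the original file:
	# RUN apt-get install -y libopenblas-dev pkg-config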

	# Install wget and download the model.
	RUN apt-get install -y wget
	RUN wget -O model.bin https://huggingface.co/OpenBuddy/openbuddy-ggml/resolve/main/openbuddy-openllama-3b-v10-q5_0.bin
	# You can use other models! Visit https://huggingface.co/OpenBuddy/openbuddy-ggml and choose a model that you like.
	# Or you can comment out these two RUNs and include your own model, named "model.bin", in the Space/repo/Docker image.
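	# A minimal sketch of that second option. It assumes a file named model.bin sits next to
	# this Dockerfile and that the app loads the model from /model.bin (the path wget writes to
	# above); check app.py for the real path before relying on it:
	# COPY model.bin /model.bin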

	# Copy the project files into /app and make it the working directory.
	RUN mkdir app
	COPY . /app
	RUN chmod -R 777 /app
	WORKDIR /app

	# Update pip, then install everything from requirements.txt.
	RUN python3 -m pip install -U --no-cache-dir pip setuptools wheel
	RUN pip install --no-cache-dir --upgrade -r /app/requirements.txt

	# Now run the Quart app with uvicorn on port 7860; "app:app" means the `app` object in app.py. (It's faster, trust me.)
	CMD ["uvicorn", "app:app", "--host", "0.0.0.0", "--port", "7860"]
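
	# To try the image outside Spaces, something like the following should work. The image tag
	# "llama-cpp-api" is a name chosen here for illustration, and Docker is assumed to be installed:
	#   docker build -t llama-cpp-api .
	#   docker run --rm -p 7860:7860 llama-cpp-api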