Initial GPTQ model commit
README.md CHANGED
@@ -47,10 +47,12 @@ Multiple GPTQ parameter permutations are provided; see Provided Files below for
 * [2, 3, 4, 5, 6 and 8-bit GGML models for CPU+GPU inference (deprecated)](https://huggingface.co/TheBloke/CodeLlama-34B-Instruct-GGML)
 * [Meta's original unquantised fp16 model in pytorch format, for GPU inference and for further conversions](https://huggingface.co/codellama/CodeLlama-34b-instruct-hf)
 
-## Prompt template:
+## Prompt template: CodeLlama
 
 ```
-
+[INST] Write code to solve the following coding problem that obeys the constraints and passes the example test cases. Please wrap your code answer using ```:
+{prompt}
+[/INST]
 ```
 
 ## Provided files and GPTQ parameters
@@ -159,7 +161,9 @@ model = AutoGPTQForCausalLM.from_quantized(model_name_or_path,
 """
 
 prompt = "Tell me about AI"
-prompt_template=f'''
+prompt_template=f'''[INST] Write code to solve the following coding problem that obeys the constraints and passes the example test cases. Please wrap your code answer using ```:
+{prompt}
+[/INST]
'''
 
 print("\n\n*** Generate:")
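For anyone who wants to try the template this commit introduces, below is a minimal sketch of how it slots into the model card's AutoGPTQ example. The repo id is the one this card describes; the example coding prompt, device, and generation parameters are assumptions for illustration, not part of the commit:

```python
from transformers import AutoTokenizer
from auto_gptq import AutoGPTQForCausalLM

# Repo id for this model card; loading options follow the card's Python example.
model_name_or_path = "TheBloke/CodeLlama-34B-Instruct-GPTQ"

tokenizer = AutoTokenizer.from_pretrained(model_name_or_path, use_fast=True)
model = AutoGPTQForCausalLM.from_quantized(model_name_or_path,
                                           use_safetensors=True,
                                           device="cuda:0")

# Example coding problem (an assumption; the card's placeholder is "Tell me about AI").
prompt = "Write a Python function that reverses a singly linked list."

# The prompt template added by this commit:
prompt_template = f'''[INST] Write code to solve the following coding problem that obeys the constraints and passes the example test cases. Please wrap your code answer using ```:
{prompt}
[/INST]
'''

input_ids = tokenizer(prompt_template, return_tensors="pt").input_ids.to("cuda:0")
output = model.generate(inputs=input_ids, temperature=0.7, max_new_tokens=512)
print(tokenizer.decode(output[0]))
```

Any coding problem can be substituted for `prompt`; the fixed instruction line and the surrounding `[INST] ... [/INST]` markers are what the second hunk bakes into `prompt_template`.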