Update README.md
README.md
CHANGED
@@ -22,7 +22,7 @@ We finetuned this checkpoint on the German Instruction dataset from DiscoResearch
 
 ## How to use
 Llama3_DiscoLeo_Instruct_8B_v0.1 uses the [Llama-3 chat template](https://github.com/meta-llama/llama3?tab=readme-ov-file#instruction-tuned-models), which can be used easily with [transformers' chat templating](https://huggingface.co/docs/transformers/main/en/chat_templating).
-
+See [below](https://huggingface.co/DiscoResearch/Llama3_DiscoLeo_Instruct_8B_v0.1#usage-example) for a usage example.
 
 ## Model Training and Hyperparameters
 The model was full-finetuned with axolotl on [hessian.AI 42](https://hessian.ai) with a context length of 8192, a learning rate of 2e-5, and a batch size of 16.

@@ -52,7 +52,7 @@ We release DiscoLeo-8B in the following configurations:
 5. [Experimental `DARE-TIES` Merge with Llama3-Instruct](https://huggingface.co/DiscoResearch/Llama3_DiscoLeo_8B_DARE_Experimental)
 6. [Collection of Quantized versions](https://huggingface.co/collections/DiscoResearch/discoleo-8b-quants-6651bcf8f72c9a37ce485d42)
 
-##
+## Usage Example
 Here's how to use the model with transformers:
 ```python
 from transformers import AutoModelForCausalLM, AutoTokenizer
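The hunk above is cut off by the diff view right after the import line. As a complement to the "How to use" section, here is a minimal, self-contained sketch of what the Llama-3 chat template the README references actually expands to. The helper `format_llama3_chat` is illustrative only and not part of the model card; in real use `tokenizer.apply_chat_template(...)` produces this prompt string for you. The special tokens follow Meta's published Llama-3 prompt format.

```python
# Illustrative sketch (not from the model card): what the Llama-3 chat
# template renders a message list into. In practice you would call
# tokenizer.apply_chat_template(messages, add_generation_prompt=True)
# and pass the result to model.generate().

def format_llama3_chat(messages, add_generation_prompt=True):
    """Render a list of {"role": ..., "content": ...} dicts in Llama-3 chat format."""
    parts = ["<|begin_of_text|>"]
    for m in messages:
        # Each turn: a role header, a blank line, the content, then <|eot_id|>.
        parts.append(
            f"<|start_header_id|>{m['role']}<|end_header_id|>\n\n{m['content']}<|eot_id|>"
        )
    if add_generation_prompt:
        # Open an assistant turn so the model continues from here.
        parts.append("<|start_header_id|>assistant<|end_header_id|>\n\n")
    return "".join(parts)

messages = [
    {"role": "system", "content": "Du bist ein hilfreicher Assistent."},
    {"role": "user", "content": "Was ist ein Sprachmodell?"},
]
prompt = format_llama3_chat(messages)
```

With `apply_chat_template` the same structure is produced directly from the tokenizer's bundled template, so you never need to hand-assemble these tokens; the sketch is only meant to make the prompt layout visible.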