reach-vb (HF staff) committed
Commit 50ba44f · 1 parent: 6eaee04

Update README.md (#2)


- Update README.md (c12acd3b731d4262960aa7ee10451ca153ab1559)
- Update README.md (99fc7fd2b6c4808fe987eed26893a7f33974b624)
- Update README.md (3390f6c553dccbea8ad97de6b0d4328bf44893be)
- Update README.md (aec22bcf80e5401dec5f7a20e22cb4d3cf51b503)
- Update README.md (3480bebe65c92860af6620687b2133eeb74e3232)
- Update README.md (342b8976c96c02e25fb9734bcb786c4ea79ad4fb)
- Update README.md (e55ace262fa681a0a7c946e41fdc6663de848020)
- Update README.md (1b5e62aefe5f159160c2efd036d4d2457397f683)
- Update README.md (4a8ebd0e571d5aa56c74e4e41d48573218e36563)

Files changed (1): README.md (+47 -0)
README.md CHANGED
@@ -296,6 +296,53 @@ Where to send questions or comments about the model Instructions on how to provi
 
 **<span style="text-decoration:underline;">Note</span>: Llama 3.1 has been trained on a broader collection of languages than the 8 supported languages. Developers may fine-tune Llama 3.1 models for languages beyond the 8 supported languages provided they comply with the Llama 3.1 Community License and the Acceptable Use Policy and in such cases are responsible for ensuring that any uses of Llama 3.1 in additional languages is done in a safe and responsible manner.
 
+ ## How to use
+
+ This repository contains two versions of Meta-Llama-3.1-8B-Instruct, for use with `transformers` and with the original `llama` codebase.
+
+ ### Use with transformers
+
+ With `transformers >= 4.43.0`, you can run conversational inference using the Transformers `pipeline` abstraction or by leveraging the Auto classes with the `generate()` function.
+
+ Make sure to update your transformers installation via `pip install --upgrade transformers`.
+
+ ```python
+ import transformers
+ import torch
+
+ model_id = "meta-llama/Meta-Llama-3.1-8B-Instruct"
+
+ pipeline = transformers.pipeline(
+     "text-generation",
+     model=model_id,
+     model_kwargs={"torch_dtype": torch.bfloat16},
+     device_map="auto",
+ )
+
+ messages = [
+     {"role": "system", "content": "You are a pirate chatbot who always responds in pirate speak!"},
+     {"role": "user", "content": "Who are you?"},
+ ]
+
+ outputs = pipeline(
+     messages,
+     max_new_tokens=256,
+ )
+ print(outputs[0]["generated_text"][-1])
+ ```
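The text above also mentions running inference through the Auto classes with `generate()`. A minimal sketch of that route is below (not part of this commit; it assumes access to the gated checkpoint and a recent `transformers`):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "meta-llama/Meta-Llama-3.1-8B-Instruct"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

messages = [
    {"role": "system", "content": "You are a pirate chatbot who always responds in pirate speak!"},
    {"role": "user", "content": "Who are you?"},
]

# Render the chat with the model's template and move it to the model's device
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(input_ids, max_new_tokens=256)

# Decode only the newly generated tokens, not the prompt
print(tokenizer.decode(outputs[0][input_ids.shape[-1]:], skip_special_tokens=True))
```

Unlike the `pipeline` route, this gives direct access to the token IDs, which is useful when you need custom decoding or batching.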
+
+ Note: You can also find detailed recipes on how to use the model locally, with `torch.compile()`, assisted generation, quantization, and more at [`huggingface-llama-recipes`](https://github.com/huggingface/huggingface-llama-recipes).
+
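As one illustration of the `torch.compile()` route mentioned in the note above, here is a rough sketch based on the Transformers static-cache pattern (not part of this commit; assumes a CUDA GPU and checkpoint access — see the recipes repository for tested versions):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "meta-llama/Meta-Llama-3.1-8B-Instruct"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

# A static KV cache gives torch.compile fixed tensor shapes to specialize on
model.generation_config.cache_implementation = "static"
model.forward = torch.compile(model.forward, mode="reduce-overhead", fullgraph=True)

inputs = tokenizer("The key to life is", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

The first few generations are slow while the graph compiles; subsequent calls with the same shapes reuse the compiled forward.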
+ ### Use with `llama`
+
+ Please follow the instructions in the [repository](https://github.com/meta-llama/llama).
+
+ To download the original checkpoints, see the example command below leveraging `huggingface-cli`:
+
+ ```
+ huggingface-cli download meta-llama/Meta-Llama-3.1-8B-Instruct --include "original/*" --local-dir Meta-Llama-3.1-8B-Instruct
+ ```
 
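If you prefer the Python API over the CLI, the same filtered download can be done with `huggingface_hub.snapshot_download` (a sketch, not part of this commit; assumes `huggingface_hub` is installed and you are logged in to an account with access):

```python
from huggingface_hub import snapshot_download

# Fetch only the original (non-transformers) checkpoint files, mirroring
# the --include "original/*" filter of the CLI command above
snapshot_download(
    repo_id="meta-llama/Meta-Llama-3.1-8B-Instruct",
    allow_patterns=["original/*"],
    local_dir="Meta-Llama-3.1-8B-Instruct",
)
```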
  ## Hardware and Software