PY007 committed
Commit 555570e • Parent(s): 849bf2b

Update README.md

Files changed (1): README.md (+1, -28)
README.md CHANGED
@@ -28,31 +28,4 @@ This is a code LM finetuned(or so-called continue pretrianed) from the 500B Tiny
 
 The HumanEval accuracy is **14**.
 
- #### How to use
- You will need the transformers>=4.31
- Do check the [TinyLlama](https://github.com/jzhang38/TinyLlama) github page for more information.
- ```
- from transformers import AutoTokenizer
- import transformers
- import torch
- model = "PY007/TinyLlama-1.1B-intermediate-step-240k-503b"
- tokenizer = AutoTokenizer.from_pretrained(model)
- pipeline = transformers.pipeline(
-     "text-generation",
-     model=model,
-     torch_dtype=torch.float16,
-     device_map="auto",
- )
-
- sequences = pipeline(
-     'The TinyLlama project aims to pretrain a 1.1B Llama model on 3 trillion tokens. With some proper optimization, we can achieve this within a span of "just" 90 days using 16 A100-40G GPUs 🚀🚀. The training has started on 2023-09-01.',
-     do_sample=True,
-     top_k=10,
-     num_return_sequences=1,
-     repetition_penalty=1.5,
-     eos_token_id=tokenizer.eos_token_id,
-     max_length=500,
- )
- for seq in sequences:
-     print(f"Result: {seq['generated_text']}")
- ```
+ **It can be used as the draft model to speculative-decode larger models such as models in the CodeLlama family**.
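
For reference, here is a minimal sketch of what draft-model usage could look like with transformers' assisted generation (the `assistant_model` argument of `generate`, available in recent transformers releases). The CodeLlama target checkpoint and generation settings below are illustrative assumptions, not part of this commit, and assisted generation requires the draft and target models to share a compatible tokenizer/vocabulary, which should be verified for the pair you choose.

```
# Sketch only: speculative decoding via transformers "assisted generation".
# Assumptions: a recent transformers release (with the `assistant_model` kwarg),
# an illustrative CodeLlama target checkpoint, and draft/target tokenizer
# compatibility -- verify these for the exact pair you use.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

target_name = "codellama/CodeLlama-7b-hf"  # illustrative larger target model
draft_name = "PY007/TinyLlama-1.1B-intermediate-step-240k-503b"  # this repo's model as the draft

tokenizer = AutoTokenizer.from_pretrained(target_name)
target = AutoModelForCausalLM.from_pretrained(
    target_name, torch_dtype=torch.float16, device_map="auto"
)
draft = AutoModelForCausalLM.from_pretrained(
    draft_name, torch_dtype=torch.float16, device_map="auto"
)

inputs = tokenizer("def quicksort(arr):", return_tensors="pt").to(target.device)

# The small draft model proposes candidate tokens; the large target model
# verifies them in a single forward pass, so the output matches target-only
# decoding while usually finishing faster.
outputs = target.generate(**inputs, assistant_model=draft, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```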