GregorZiegltrumAA committed
Commit edc895f · 1 Parent(s): a1c7328
Update README.md
README.md CHANGED
@@ -16,7 +16,7 @@ pipeline_tag: text-generation
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/671a0238b080a748c29b8fea/F1-zbAXF5LGvxpIRrYfU4.png)
 
 
-This Repository holds the model weights for the 7B u-μP models trained at Aleph Alpha Research for 72k steps (300B tokens). Please note, that the released checkpoints are not fully converged models and are intended for research use only.
+This Repository holds the model weights for the 7B u-μP models trained at Aleph Alpha Research, in collaboration with Graphcore, for 72k steps (300B tokens). Please note, that the released checkpoints are not fully converged models and are intended for research use only.
 
 You can find all model weights and their corresponding safetensors conversions at the following links:
 - [umup-research-7b-bf16](https://huggingface.co/Aleph-Alpha/umup-research-7b-bf16)
@@ -42,7 +42,7 @@ ckpt_path = Path("<path_to_repo>/7B_umup_fp8")
 
 model = TransformerInferenceModule.from_checkpoint(ckpt_path)
 
-prompt = "
+prompt = "Yesterday I dreamt of "
 
 output = model.generate(max_tokens=100, input_text=prompt)
 
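For reference, the README lines touched by this commit belong to an inference example that can be assembled into a runnable sketch. The imports of `Path` and `TransformerInferenceModule` are assumptions (the diff only shows the lines around the changed prompt), so the exact import path may differ in the Aleph Alpha Research codebase; the `<path_to_repo>` placeholder is kept from the README.

```python
# Minimal sketch of the README's inference example, assembled from the lines visible in this diff.
# Assumption: TransformerInferenceModule is importable from the Aleph Alpha "scaling" codebase;
# the exact import path below is not shown in the diff and may need adjusting.
from pathlib import Path

from scaling.transformer.inference import TransformerInferenceModule  # assumed import path

# Placeholder path from the README; point it at the downloaded 7B u-μP FP8 checkpoint directory.
ckpt_path = Path("<path_to_repo>/7B_umup_fp8")

# Load the checkpoint into an inference module.
model = TransformerInferenceModule.from_checkpoint(ckpt_path)

# Prompt introduced by this commit.
prompt = "Yesterday I dreamt of "

# Generate up to 100 tokens conditioned on the prompt.
output = model.generate(max_tokens=100, input_text=prompt)
print(output)
```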