File size: 805 Bytes
fab73e3 b1b4ba5 fab73e3 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 |
---
language:
- en
tags:
- openvino
---
# xverse/XVERSE-7B
This is the [xverse/XVERSE-7B](https://huggingface.co/xverse/XVERSE-7B) model converted to [OpenVINO](https://openvino.ai) with INT8 weights compression for accelerated inference.
An example of how to do inference on this model:
```python
from optimum.intel import OVModelForCausalLM
from transformers import AutoTokenizer, pipeline
# model_id should be set to either a local directory or a model available on the HuggingFace hub.
model_id = "helenai/xverse-XVERSE-7B-ov"
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = OVModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)
pipe = pipeline("text-generation", model=model, tokenizer=tokenizer)
result = pipe("hello world")
print(result)
```
|