added colab
Browse files
README.md
CHANGED
@@ -16,6 +16,8 @@ arxiv: 2408.03326
|
|
16 |
|
17 |
data:image/s3,"s3://crabby-images/e83f5/e83f56014bdd677df21c46940f1f6ca61ce1fb8f" alt="image/png"
|
18 |
|
|
|
|
|
19 |
Below is the model card of 7B LLaVA-Onevision model which is copied from the original LLaVA-Onevision model card that you can find [here](https://huggingface.co/lmms-lab/llava-onevision-qwen2-0.5b-si).
|
20 |
|
21 |
|
@@ -53,12 +55,14 @@ The model supports multi-image and multi-prompt generation. Meaning that you can
|
|
53 |
Below we used [`"llava-hf/llava-onevision-qwen2-7b-ov-hf"`](https://huggingface.co/llava-hf/llava-onevision-qwen2-7b-ov-hf) checkpoint.
|
54 |
|
55 |
```python
|
56 |
-
from transformers import pipeline
|
57 |
from PIL import Image
|
58 |
import requests
|
59 |
|
60 |
model_id = "llava-hf/llava-onevision-qwen2-7b-ov-hf"
|
61 |
pipe = pipeline("image-to-text", model=model_id)
|
|
|
|
|
62 |
url = "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/transformers/tasks/ai2d-demo.jpg"
|
63 |
image = Image.open(requests.get(url, stream=True).raw)
|
64 |
|
|
|
16 |
|
17 |
data:image/s3,"s3://crabby-images/e83f5/e83f56014bdd677df21c46940f1f6ca61ce1fb8f" alt="image/png"
|
18 |
|
19 |
+
Check out also the Google Colab demo to run Llava on a free-tier Google Colab instance: [data:image/s3,"s3://crabby-images/e7985/e79852128a5f83c92496b9d734ca52d01e009a39" alt="Open In Colab"](https://colab.research.google.com/drive/1-4AtYjR8UMtCALV0AswU1kiNkWCLTALT?usp=sharing)
|
20 |
+
|
21 |
Below is the model card of 7B LLaVA-Onevision model which is copied from the original LLaVA-Onevision model card that you can find [here](https://huggingface.co/lmms-lab/llava-onevision-qwen2-0.5b-si).
|
22 |
|
23 |
|
|
|
55 |
Below we used [`"llava-hf/llava-onevision-qwen2-7b-ov-hf"`](https://huggingface.co/llava-hf/llava-onevision-qwen2-7b-ov-hf) checkpoint.
|
56 |
|
57 |
```python
|
58 |
+
from transformers import pipeline, AutoProcessor
|
59 |
from PIL import Image
|
60 |
import requests
|
61 |
|
62 |
model_id = "llava-hf/llava-onevision-qwen2-7b-ov-hf"
|
63 |
pipe = pipeline("image-to-text", model=model_id)
|
64 |
+
processor = AutoProcessor.from_pretrained(model_id)
|
65 |
+
|
66 |
url = "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/transformers/tasks/ai2d-demo.jpg"
|
67 |
image = Image.open(requests.get(url, stream=True).raw)
|
68 |
|