Update README.md
Browse files
README.md
CHANGED
@@ -25,7 +25,7 @@ pipeline_tag: image-text-to-text
|
|
25 |
# EraX-VL-7B-V1
|
26 |
## Introduction
|
27 |
|
28 |
-
We are excited to introduce **EraX-VL-7B-v1**, a robust multimodal model for OCR (optical character recognition) and VQA (visual question-answering) that excels in various languages, with a particular focus on Vietnamese. The
|
29 |
|
30 |
**EraX-VL-7B-V1** is a young member of our **EraX's LànhGPT** collection of LLM models.
|
31 |
|
|
|
25 |
# EraX-VL-7B-V1
|
26 |
## Introduction
|
27 |
|
28 |
+
We are excited to introduce **EraX-VL-7B-v1**, a robust multimodal model for OCR (optical character recognition) and VQA (visual question-answering) that excels in various languages, with a particular focus on Vietnamese. The `EraX-VL-7B` model stands out for its precise recognition capabilities across a range of documents, including medical forms, invoices, bills of sale, quotes, and medical records. This functionality is expected to be highly beneficial for hospitals, clinics, insurance companies, and other similar applications. Built on the solid foundation of the [Qwen/Qwen2-VL-7B-Instruct](https://huggingface.co/Qwen/Qwen2-VL-7B-Instruct)[1], which we found to be of high quality and fluent in Vietnamese, EraX-VL-7B has been fine-tuned to enhance its performance. We plan to continue improving and releasing new versions for free, along with sharing performance benchmarks in the near future.
|
29 |
|
30 |
**EraX-VL-7B-V1** is a young member of our **EraX's LànhGPT** collection of LLM models.
|
31 |
|