czczup commited on
Commit
9976c63
β€’
1 Parent(s): 3dc1891

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +7 -7
README.md CHANGED
@@ -33,19 +33,19 @@ To enhance the OCR capability of the model, we have incorporated additional OCR
33
  ### Vision Foundation model
34
  | Model | Date | Download | Note |
35
  | ----------------------- | ---------- | ---------------------------------------------------------------------- | -------------------------------- |
36
- | InternViT-6B-448px-V1.5 | 2024.04.20 | πŸ€— [HF link](https://huggingface.co/OpenGVLab/InternViT-6B-448px-V1-5) | support dynamic resolution, super strong OCR (πŸ”₯new) |
37
- | InternViT-6B-448px-V1.2 | 2024.02.11 | πŸ€— [HF link](https://huggingface.co/OpenGVLab/InternViT-6B-448px-V1-2) | 448 resolution |
38
- | InternViT-6B-448px-V1.0 | 2024.01.30 | πŸ€— [HF link](https://huggingface.co/OpenGVLab/InternViT-6B-448px-V1-0) | 448 resolution |
39
  | InternViT-6B-224px | 2023.12.22 | πŸ€— [HF link](https://huggingface.co/OpenGVLab/InternViT-6B-224px) | vision foundation model |
40
  | InternVL-14B-224px | 2023.12.22 | πŸ€— [HF link](https://huggingface.co/OpenGVLab/InternVL-14B-224px) | vision-language foundation model |
41
 
42
  ### Multimodal Large Language Model (MLLM)
43
  | Model | Date | Download | Note |
44
  | ----------------------- | ---------- | --------------------------------------------------------------------------- | ---------------------------------- |
45
- | InternVL-Chat-V1.5 | 2024.04.18 | πŸ€— [HF link](https://huggingface.co/OpenGVLab/InternVL-Chat-V1-5) | support 4K image; super strong OCR; Approaching the performance of GPT-4V and Gemini Pro on various benchmarks like MMMU, DocVQA, ChartQA, MathVista, etc. (πŸ”₯new)|
46
- | InternVL-Chat-V1.2-Plus | 2024.02.21 | πŸ€— [HF link](https://huggingface.co/OpenGVLab/InternVL-Chat-V1-2-Plus) | more SFT data and stronger |
47
- | InternVL-Chat-V1.2 | 2024.02.11 | πŸ€— [HF link](https://huggingface.co/OpenGVLab/InternVL-Chat-V1-2) | scaling up LLM to 34B |
48
- | InternVL-Chat-V1.1 | 2024.01.24 | πŸ€— [HF link](https://huggingface.co/OpenGVLab/InternVL-Chat-V1-1) | support Chinese and stronger OCR |
49
 
50
  ## Model Usage (Image Embeddings)
51
 
 
33
  ### Vision Foundation model
34
  | Model | Date | Download | Note |
35
  | ----------------------- | ---------- | ---------------------------------------------------------------------- | -------------------------------- |
36
+ | InternViT-6B-448px-V1-5 | 2024.04.20 | πŸ€— [HF link](https://huggingface.co/OpenGVLab/InternViT-6B-448px-V1-5) | support dynamic resolution, super strong OCR (πŸ”₯new) |
37
+ | InternViT-6B-448px-V1-2 | 2024.02.11 | πŸ€— [HF link](https://huggingface.co/OpenGVLab/InternViT-6B-448px-V1-2) | 448 resolution |
38
+ | InternViT-6B-448px-V1-0 | 2024.01.30 | πŸ€— [HF link](https://huggingface.co/OpenGVLab/InternViT-6B-448px-V1-0) | 448 resolution |
39
  | InternViT-6B-224px | 2023.12.22 | πŸ€— [HF link](https://huggingface.co/OpenGVLab/InternViT-6B-224px) | vision foundation model |
40
  | InternVL-14B-224px | 2023.12.22 | πŸ€— [HF link](https://huggingface.co/OpenGVLab/InternVL-14B-224px) | vision-language foundation model |
41
 
42
  ### Multimodal Large Language Model (MLLM)
43
  | Model | Date | Download | Note |
44
  | ----------------------- | ---------- | --------------------------------------------------------------------------- | ---------------------------------- |
45
+ | InternVL-Chat-V1-5 | 2024.04.18 | πŸ€— [HF link](https://huggingface.co/OpenGVLab/InternVL-Chat-V1-5) | support 4K image; super strong OCR; Approaching the performance of GPT-4V and Gemini Pro on various benchmarks like MMMU, DocVQA, ChartQA, MathVista, etc. (πŸ”₯new)|
46
+ | InternVL-Chat-V1-2-Plus | 2024.02.21 | πŸ€— [HF link](https://huggingface.co/OpenGVLab/InternVL-Chat-V1-2-Plus) | more SFT data and stronger |
47
+ | InternVL-Chat-V1-2 | 2024.02.11 | πŸ€— [HF link](https://huggingface.co/OpenGVLab/InternVL-Chat-V1-2) | scaling up LLM to 34B |
48
+ | InternVL-Chat-V1-1 | 2024.01.24 | πŸ€— [HF link](https://huggingface.co/OpenGVLab/InternVL-Chat-V1-1) | support Chinese and stronger OCR |
49
 
50
  ## Model Usage (Image Embeddings)
51