![](https://cdn-avatars.huggingface.co/v1/production/uploads/1677749304221-64006c09330a45b03605bba3.png)
OpenGVLab/InternViT-6B-224px
Image Feature Extraction
•
Updated
•
1.7k
•
17
Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks
Note Relased at 2024.02.21 | 40B parameters | More SFT data and stronger.
Note Released at 2024.02.11 | 40B parameters | scaling up LLM to 34B.
Note Released at 2024.01.24 | 19B parameters | support Chinese and stronger OCR
Note Released at 2024.02.11 | Vision Foundation Model | 448 resolution
Note Released at 2024.01.30 | Vision Foundation Model | 448 resolution
Note CVPR 2024, Oral