InternVL 1.0

updated 2 days ago

Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks


OpenGVLab/InternViT-6B-224px

Image Feature Extraction • Updated about 1 month ago • 1.7k • 17
OpenGVLab/InternVL-14B-224px

Image Feature Extraction • Updated about 1 month ago • 1.74k • 29
OpenGVLab/InternVL-Chat-V1-2-Plus

Visual Question Answering • Updated about 1 month ago • 369 • 31

Note Released at 2024.02.21 | 40B parameters | More SFT data and stronger.
OpenGVLab/InternVL-Chat-V1-2

Visual Question Answering • Updated about 1 month ago • 524 • 12

Note Released at 2024.02.11 | 40B parameters | scaling up LLM to 34B.
OpenGVLab/InternVL-Chat-V1-1

Visual Question Answering • Updated about 1 month ago • 121 • 11

Note Released at 2024.01.24 | 19B parameters | supports Chinese and has stronger OCR
OpenGVLab/InternViT-6B-448px-V1-2

Image Feature Extraction • Updated 30 days ago • 1.07k • 20

Note Released at 2024.02.11 | Vision Foundation Model | 448 resolution
OpenGVLab/InternViT-6B-448px-V1-0

Image Feature Extraction • Updated about 1 month ago • 31 • 6

Note Released at 2024.01.30 | Vision Foundation Model | 448 resolution
OpenGVLab/InternVL-14B-Flickr30K-FT-364px

Feature Extraction • Updated Mar 8 • 26 • 4
OpenGVLab/InternVL-14B-FlickrCN-FT-364px

Updated Mar 8 • 2
OpenGVLab/InternVL-Chat-ViT-6B-Vicuna-7B

Visual Question Answering • Updated Apr 27 • 934 • 7
OpenGVLab/InternVL-Chat-ViT-6B-Vicuna-13B

Visual Question Answering • Updated Apr 27 • 28 • 6
OpenGVLab/InternVL-Chat-ViT-6B-Vicuna-13B-448px

Visual Question Answering • Updated Apr 4 • 15 • 2
OpenGVLab/InternVL

Updated Apr 29 • 16
InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks

Paper • 2312.14238 • Published Dec 21, 2023 • 12

Note CVPR 2024, Oral
