ShareGPT4Video

community

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

Lin-Chen authored a paper 5 days ago

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Wiselnn authored a paper 9 days ago

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

LanguageBind authored a paper 19 days ago

Cycle3D: High-quality and Consistent Image-to-3D Generation via Generation-Reconstruction Cycle

View all activity

ShareGPT4Video's activity

Lin-Chen

authored a paper 5 days ago

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Paper • 2412.09596 • Published 10 days ago • 89

Wiselnn

authored a paper 9 days ago

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Paper • 2412.09596 • Published 10 days ago • 89

LanguageBind

authored 4 papers 19 days ago

Cycle3D: High-quality and Consistent Image-to-3D Generation via Generation-Reconstruction Cycle

Paper • 2407.19548 • Published Jul 28 • 24

OD-VAE: An Omni-dimensional Video Compressor for Improving Latent Video Diffusion Model

Paper • 2409.01199 • Published Sep 2 • 12

WF-VAE: Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model

Paper • 2411.17459 • Published 26 days ago • 10

Open-Sora Plan: Open-Source Large Video Generation Model

Paper • 2412.00131 • Published 24 days ago • 32

Lin-Chen

authored a paper 19 days ago

Open-Sora Plan: Open-Source Large Video Generation Model

Paper • 2412.00131 • Published 24 days ago • 32

Lin-Chen

authored a paper 5 months ago

VLMEvalKit: An Open-Source Toolkit for Evaluating Large Multi-Modality Models

Paper • 2407.11691 • Published Jul 16 • 13

Lin-Chen

updated a dataset 6 months ago

ShareGPT4Video/ShareGPT4Video

Viewer • Updated Jul 8 • 40.2k • 2.79k • 183

Lin-Chen

authored 2 papers 6 months ago

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Paper • 2407.03320 • Published Jul 3 • 93

Prism: A Framework for Decoupling and Assessing the Capabilities of VLMs

Paper • 2406.14544 • Published Jun 20 • 34

Wiselnn

authored a paper 6 months ago

MMDU: A Multi-Turn Multi-Image Dialog Understanding Benchmark and Instruction-Tuning Dataset for LVLMs

Paper • 2406.11833 • Published Jun 17 • 61

Lin-Chen

authored a paper 7 months ago

ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

Paper • 2406.04325 • Published Jun 6 • 72

Wiselnn

authored a paper 7 months ago

ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

Paper • 2406.04325 • Published Jun 6 • 72

Jinsong-Li

authored a paper 7 months ago

ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

Paper • 2406.04325 • Published Jun 6 • 72

LanguageBind

authored a paper 7 months ago

ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

Paper • 2406.04325 • Published Jun 6 • 72

LanguageBind

authored a paper 9 months ago

MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators

Paper • 2404.05014 • Published Apr 7 • 31

Jinsong-Li

authored a paper 9 months ago

Are We on the Right Way for Evaluating Large Vision-Language Models?

Paper • 2403.20330 • Published Mar 29 • 6

Lin-Chen

authored a paper 9 months ago

Are We on the Right Way for Evaluating Large Vision-Language Models?

Paper • 2403.20330 • Published Mar 29 • 6

Wiselnn

authored a paper 11 months ago

InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model

Paper • 2401.16420 • Published Jan 29 • 55

AI & ML interests

Recent Activity

Team members 4

ShareGPT4Video's activity