Peng Jin

Chat-UniVi

https://scholar.google.com/citations?user=HHXLexAAAAAJ&hl=en

Chat-UniVi's activity

authored a paper 1 day ago

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

Paper • 2501.13106 • Published 3 days ago • 64

upvoted a paper 2 days ago

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

Paper • 2501.13106 • Published 3 days ago • 64

liked a model about 1 month ago

Chat-UniVi/Chat-UniVi-7B-v1.5

Video-Text-to-Text • Updated Dec 7, 2024 • 60 • 2

updated 4 models about 2 months ago

liked a Space 2 months ago

ViewCrafter

Space • Running on Zero • 100

authored a paper 2 months ago

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 113

upvoted a paper 3 months ago

Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss

Paper • 2410.17243 • Published Oct 22, 2024 • 89

liked a dataset 3 months ago

Chat-UniVi/Chat-UniVi-Eval

Preview • Updated Nov 23, 2023 • 43 • 4

liked 3 models 3 months ago

Chat-UniVi/MoE-Plus-Plus-7B

Text Generation • Updated Dec 7, 2024 • 14 • 4

Chat-UniVi/MoH-LLaMA3-8B

Text Generation • Updated Dec 7, 2024 • 13 • 3

Chat-UniVi/MoH-DiT-XL-90

Updated Oct 17, 2024 • 3

New activity in Chat-UniVi/Chat-UniVi 3 months ago

Update pipeline tag

#1 opened 3 months ago by nielsr

updated a model 3 months ago

Chat-UniVi/Chat-UniVi

Video-Text-to-Text • Updated Oct 22, 2024 • 69.5k • 13

commented a paper 3 months ago

MoH: Multi-Head Attention as Mixture-of-Head Attention

Paper • 2410.11842 • Published Oct 15, 2024 • 21

authored 3 papers 3 months ago

DiffusionRet: Generative Text-Video Retrieval with Diffusion Model

Paper • 2303.09867 • Published Mar 17, 2023

Multi-granularity Interaction Simulation for Unsupervised Interactive Segmentation

Paper • 2303.13399 • Published Mar 23, 2023

Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning

Paper • 2303.14369 • Published Mar 25, 2023