30 35 53

Lin Chen

Lin-Chen

https://lin-chen.site

AI & ML interests

None yet

Recent Activity

authored a paper 5 days ago

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

liked a model 9 days ago

internlm/internlm-xcomposer2d5-ol-7b

upvoted a paper 9 days ago

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

View all activity

Organizations

Lin-Chen's activity

authored a paper 5 days ago

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Paper • 2412.09596 • Published 10 days ago • 89

liked a model 9 days ago

internlm/internlm-xcomposer2d5-ol-7b

Visual Question Answering • Updated 9 days ago • 40

upvoted a paper 9 days ago

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Paper • 2412.09596 • Published 10 days ago • 89

liked a dataset 12 days ago

Tongyi-ConvAI/MMEvol

Preview • Updated 22 days ago • 1.58k • 8

authored a paper 19 days ago

Open-Sora Plan: Open-Source Large Video Generation Model

Paper • 2412.00131 • Published 24 days ago • 32

upvoted a paper 19 days ago

Open-Sora Plan: Open-Source Large Video Generation Model

Paper • 2412.00131 • Published 24 days ago • 32

liked a Space 19 days ago

Running

265

⚡

Qwen2.5 72B Instruct

upvoted a collection about 1 month ago

Qwen2-VL

Collection

Vision-language model series based on Qwen2 • 16 items • Updated 16 days ago • 180

updated a dataset about 2 months ago

Lin-Chen/Open-LLaVA-NeXT-mix1M

Updated Oct 25 • 93 • 11

upvoted a paper about 2 months ago

MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models

Paper • 2410.17637 • Published Oct 23 • 34

upvoted 2 papers 2 months ago

SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree

Paper • 2410.16268 • Published Oct 21 • 65

PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction

Paper • 2410.17247 • Published Oct 22 • 45

liked 6 datasets 2 months ago

upvoted a paper 2 months ago

Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration Rate

Paper • 2410.07167 • Published Oct 9 • 37

liked a dataset 2 months ago

AIDC-AI/Ovis-dataset

Preview • Updated Sep 16 • 1.09k • 23