liuziwei7 (Ziwei Liu)

upvoted a paper 14 days ago

MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D

Paper • 2411.02336 • Published 15 days ago • 23

upvoted a paper 23 days ago

FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality

Paper • 2410.19355 • Published 26 days ago • 23

upvoted a paper 29 days ago

Disco4D: Disentangled 4D Human Generation and Animation from a Single Image

Paper • 2409.17280 • Published Sep 25 • 9

upvoted 2 collections about 2 months ago

LMMs-Eval-Lite

Collection

Making Lite version of the dataset to accelerate holistic evaluation during model development! • 20 items • Updated Oct 4 • 1

LLaVA-OneVision

Collection

a model good at arbitrary types of visual input • 15 items • Updated Oct 5 • 20

upvoted a paper about 2 months ago

Video Instruction Tuning With Synthetic Data

Paper • 2410.02713 • Published Oct 3 • 37

upvoted a collection about 2 months ago

Oryx

Collection

Oryx: One Multi-Modal LLM for On-Demand Spatial-Temporal Understanding • 5 items • Updated 28 days ago • 14

upvoted a paper about 2 months ago

Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution

Paper • 2409.12961 • Published Sep 19 • 24

upvoted 2 papers 2 months ago

3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive Diffusion

Paper • 2409.12957 • Published Sep 19 • 18

Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion

Paper • 2409.11406 • Published Sep 17 • 25

upvoted 2 papers 3 months ago

ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer

Paper • 2408.03284 • Published Aug 6 • 10

LLaVA-OneVision: Easy Visual Task Transfer

Paper • 2408.03326 • Published Aug 6 • 59

upvoted 3 papers 4 months ago

upvoted a paper 5 months ago

FreeTraj: Tuning-Free Trajectory Control in Video Diffusion Models

Paper • 2406.16863 • Published Jun 24 • 10

upvoted 3 collections 5 months ago

LMMs-Eval

Collection

Dataset Collection of LMMs-Eval • 36 items • Updated Oct 4 • 25

LLaVA-Video

Collection

Models focus on video understanding (previously known as LLaVA-NeXT-Video). • 6 items • Updated Oct 5 • 53

LongVA

Collection

Long Context Transfer From Text To Vision: https://lmms-lab.github.io/posts/longva/ • 5 items • Updated Oct 4 • 12

upvoted a paper 5 months ago

L4GM: Large 4D Gaussian Reconstruction Model

Paper • 2406.10324 • Published Jun 14 • 13

Ziwei Liu

AI & ML interests

Organizations

liuziwei7's activity

MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D

FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality

Disco4D: Disentangled 4D Human Generation and Animation from a Single Image

LMMs-Eval-Lite

LLaVA-OneVision

Video Instruction Tuning With Synthetic Data

Oryx

Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution

3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive Diffusion

Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion

ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer

LLaVA-OneVision: Easy Visual Task Transfer

LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models

CrowdMoGen: Zero-Shot Text-Driven Collective Motion Generation

VEnhancer: Generative Space-Time Enhancement for Video Generation

FreeTraj: Tuning-Free Trajectory Control in Video Diffusion Models

LMMs-Eval

LLaVA-Video

LongVA

L4GM: Large 4D Gaussian Reconstruction Model