7 8 10

Manli Shu

Manli

azshue

AI & ML interests

None yet

Recent Activity

liked a dataset 13 days ago

Salesforce/ProVision-10M

View all activity

Organizations

Manli's activity

liked a dataset 13 days ago

Salesforce/ProVision-10M

Viewer • Updated about 6 hours ago • 24.5M • 1.47k • 11

updated a model 3 months ago

Salesforce/xgen-mm-phi3-mini-instruct-interleave-r-v1.5

Image-Text-to-Text • Updated Sep 20 • 3.51k • 45

New activity in Salesforce/xgen-mm-phi3-mini-instruct-interleave-r-v1.5 3 months ago

Dataset link doesn't work?

#1 opened 4 months ago by

dibmvt

New activity in Salesforce/xgen-mm-phi3-mini-instruct-interleave-r-v1.5 4 months ago

Extremely high GPU requirements for both basic (demo.ipynb) and batch (batch_inference.ipynb) notebooks

#3 opened 4 months ago by

dwb2023

upvoted a paper 4 months ago

xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations

Paper • 2408.12590 • Published Aug 22 • 35

New activity in Salesforce/xgen-mm-phi3-mini-base-r-v1 4 months ago

Link model to paper

#1 opened 4 months ago by

nielsr

New activity in Salesforce/xgen-mm-phi3-mini-instruct-r-v1 4 months ago

Link model to paper

#12 opened 4 months ago by

nielsr

liked 4 models 4 months ago

authored a paper 4 months ago

xGen-MM (BLIP-3): A Family of Open Large Multimodal Models

Paper • 2408.08872 • Published Aug 16 • 98

New activity in Salesforce/xgen-mm-phi3-mini-base-r-v1.5 4 months ago

Upload examples.

#2 opened 4 months ago by

an-yan

Update README.md

#1 opened 4 months ago by

an-yan

upvoted a paper 4 months ago

xGen-MM (BLIP-3): A Family of Open Large Multimodal Models

Paper • 2408.08872 • Published Aug 16 • 98

upvoted a collection 5 months ago

🍃 MINT-1T

Collection

Data for "MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens" • 13 items • Updated Jul 24 • 56

liked 2 datasets 5 months ago

mlfoundations/MINT-1T-HTML

Viewer • Updated Sep 21 • 623M • 177k • 79

TIGER-Lab/VisualWebInstruct

Viewer • Updated about 16 hours ago • 23.6k • 186 • 15

authored 2 papers 6 months ago

Test-Time Prompt Tuning for Zero-Shot Generalization in Vision-Language Models

Paper • 2209.07511 • Published Sep 15, 2022

Coercing LLMs to do and reveal (almost) anything

Paper • 2402.14020 • Published Feb 21 • 12