Salesforce/xgen-mm-vid-phi3-mini-r-v1.5-128tokens-16frames Image-Text-to-Text • Updated 17 days ago • 1 • 2
xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs Paper • 2410.16267 • Published Oct 21, 2024 • 17