- OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding (arXiv:2406.19389, published 4 days ago)
- MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning (arXiv:2406.17770, published 6 days ago)
- MantisScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation (arXiv:2406.15252, published 10 days ago)
- SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models (arXiv:2311.07575, published Nov 13, 2023)
- Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks (arXiv:2311.06242, published Nov 10, 2023)
- MMDU: A Multi-Turn Multi-Image Dialog Understanding Benchmark and Instruction-Tuning Dataset for LVLMs (arXiv:2406.11833, published 14 days ago)
- OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text (arXiv:2406.08418, published 19 days ago)
- XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning (arXiv:2406.08973, published 18 days ago)
- Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of Experts (arXiv:2405.11273, published May 18)
- SimPO: Simple Preference Optimization with a Reference-Free Reward (arXiv:2405.14734, published May 23)
- PaliGemma Release Collection: pretrained and mix checkpoints for PaliGemma (16 items, updated 4 days ago)
- Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models (arXiv:2404.18796, published Apr 29)
- MultiBooth: Towards Generating All Your Concepts in an Image from Text (arXiv:2404.14239, published Apr 22)
- Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models (arXiv:2404.12387, published Apr 18)
- Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models (arXiv:2403.18814, published Mar 27)
- Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference (arXiv:2403.14520, published Mar 21)
- GiT: Towards Generalist Vision Transformer through Universal Language Interface (arXiv:2403.09394, published Mar 14)
- YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information (arXiv:2402.13616, published Feb 21)
- MoE-LLaVA: Mixture of Experts for Large Vision-Language Models (arXiv:2401.15947, published Jan 29)
- UNIMO-G: Unified Image Generation through Multimodal Conditional Diffusion (arXiv:2401.13388, published Jan 24)
- MM-LLMs: Recent Advances in MultiModal Large Language Models (arXiv:2401.13601, published Jan 24)
- Scalable Pre-training of Large Autoregressive Image Models (arXiv:2401.08541, published Jan 16)
- PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding (arXiv:2312.04461, published Dec 7, 2023)
- Retentive Network: A Successor to Transformer for Large Language Models (arXiv:2307.08621, published Jul 17, 2023)
- ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs (arXiv:2311.13600, published Nov 22, 2023)
- PaLI-3 Vision Language Models: Smaller, Faster, Stronger (arXiv:2310.09199, published Oct 13, 2023)