-
2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining
Paper • 2501.00958 • Published • 99 -
CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings
Paper • 2501.01257 • Published • 49 -
Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
Paper • 2501.01423 • Published • 37 -
REDUCIO! Generating 1024times1024 Video within 16 Seconds using Extremely Compressed Motion Latents
Paper • 2411.13552 • Published

Raffaele Salvi
Rufy992
·
AI & ML interests
Interest for research
Recent Activity
updated
a collection
23 days ago
Articoli PHD
updated
a collection
28 days ago
Articoli PHD
updated
a collection
28 days ago
Articoli PHD
Organizations
None yet
Collections
1
models
None public yet
datasets
None public yet