Edoardo Federici

efederici

https://banda-larga.github.io

AI & ML interests

llms, ir, graphs & co

Recent Activity

liked a dataset 2 days ago

GAIR/LIMO

updated a dataset 3 days ago

mii-llm/self-check-5588-dpo4

published a dataset 3 days ago

mii-llm/self-check-5588-dpo4

efederici's activity

upvoted a paper 3 months ago

OminiControl: Minimal and Universal Control for Diffusion Transformer

Paper • 2411.15098 • Published Nov 22, 2024 • 55

upvoted an article 4 months ago

Visually Multilingual: Introducing mcdse-2b

Article • Oct 27, 2024 • 38

upvoted 4 papers 4 months ago

Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs

Paper • 2404.05719 • Published Apr 8, 2024 • 82

Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs

Paper • 2402.14740 • Published Feb 22, 2024 • 13

Addition is All You Need for Energy-efficient Language Models

Paper • 2410.00907 • Published Oct 1, 2024 • 145

Were RNNs All We Needed?

Paper • 2410.01201 • Published Oct 2, 2024 • 51

upvoted 2 papers 5 months ago

Improve Mathematical Reasoning in Language Models by Automated Process Supervision

Paper • 2406.06592 • Published Jun 5, 2024 • 27

MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct

Paper • 2409.05840 • Published Sep 9, 2024 • 47

upvoted an article 5 months ago

Selective fine-tuning of Language Models with Spectrum

Article • Sep 3, 2024 • 32

upvoted a paper 6 months ago

Show-o: One Single Transformer to Unify Multimodal Understanding and Generation

Paper • 2408.12528 • Published Aug 22, 2024 • 51

upvoted a collection 6 months ago

Probably function calling datasets

Collection • Created using the https://huggingface.co/spaces/librarian-bots/dataset-column-search-api Space • 39 items • Updated Jul 17, 2024 • 37

upvoted 2 papers 7 months ago

Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion

Paper • 2407.01392 • Published Jul 1, 2024 • 40

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22, 2024 • 126

upvoted a paper 8 months ago

Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation

Paper • 2406.06525 • Published Jun 10, 2024 • 67

upvoted 4 papers 9 months ago

Matryoshka Multimodal Models

Paper • 2405.17430 • Published May 27, 2024 • 31

An Introduction to Vision-Language Modeling

Paper • 2405.17247 • Published May 27, 2024 • 87

Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization

Paper • 2405.15071 • Published May 23, 2024 • 38

DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data

Paper • 2405.14333 • Published May 23, 2024 • 37

upvoted 2 papers 10 months ago

KAN: Kolmogorov-Arnold Networks

Paper • 2404.19756 • Published Apr 30, 2024 • 109

Iterative Reasoning Preference Optimization

Paper • 2404.19733 • Published Apr 30, 2024 • 48