Léo Hunout's picture

Léo Hunout

hunoutl

·

AI & ML interests

AI Engineer working on Jean Zay supercomputer in France 🇫🇷

Recent Activity

upvoted an article 21 days ago

They Said It Couldn’t Be Done

upvoted a paper 27 days ago

VideoGen-of-Thought: A Collaborative Framework for Multi-Shot Video Generation

upvoted a paper 27 days ago

PaliGemma 2: A Family of Versatile VLMs for Transfer

View all activity

Organizations

hunoutl's activity

upvoted an article 21 days ago

Article

They Said It Couldn’t Be Done

By

•

28 days ago

• 76

upvoted 2 papers 27 days ago

VideoGen-of-Thought: A Collaborative Framework for Multi-Shot Video Generation

Paper • 2412.02259 • Published about 1 month ago • 59

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published 29 days ago • 119

upvoted 17 papers about 2 months ago

Balancing Pipeline Parallelism with Vocabulary Parallelism

Paper • 2411.05288 • Published Nov 8, 2024 • 19

Face Anonymization Made Simple

Paper • 2411.00762 • Published Nov 1, 2024 • 7

NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks

Paper • 2410.20650 • Published Oct 28, 2024 • 16

TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters

Paper • 2410.23168 • Published Oct 30, 2024 • 24

Zipfian Whitening

Paper • 2411.00680 • Published Nov 1, 2024 • 9

GPT or BERT: why not both?

Paper • 2410.24159 • Published Oct 31, 2024 • 14

Sparsing Law: Towards Large Language Models with Greater Activation Sparsity

Paper • 2411.02335 • Published Nov 4, 2024 • 11

Adaptive Caching for Faster Video Generation with Diffusion Transformers

Paper • 2411.02397 • Published Nov 4, 2024 • 23

Self-Consistency Preference Optimization

Paper • 2411.04109 • Published Nov 6, 2024 • 16

Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models

Paper • 2411.03884 • Published Nov 6, 2024 • 26

Animate-X: Universal Character Image Animation with Enhanced Motion Representation

Paper • 2410.10306 • Published Oct 14, 2024 • 54

What Matters in Transformers? Not All Attention is Needed

Paper • 2406.15786 • Published Jun 22, 2024 • 30

Neural Metamorphosis

Paper • 2410.11878 • Published Oct 10, 2024 • 8

Movie Gen: A Cast of Media Foundation Models

Paper • 2410.13720 • Published Oct 17, 2024 • 90

AutoTrain: No-code training for state-of-the-art models

Paper • 2410.15735 • Published Oct 21, 2024 • 59

Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models

Paper • 2411.04996 • Published Nov 7, 2024 • 49

BitNet a4.8: 4-bit Activations for 1-bit LLMs

Paper • 2411.04965 • Published Nov 7, 2024 • 63