Johan Ferret

ferretj

https://ferretj.github.io

ferretj's activity

authored a paper 5 days ago

On Teacher Hacking in Language Model Distillation

Paper • 2502.02671 • Published 7 days ago • 14

authored a paper 4 months ago

Diversity-Rewarded CFG Distillation

Paper • 2410.06084 • Published Oct 8, 2024 • 10

authored a paper 6 months ago

Gemma 2: Improving Open Language Models at a Practical Size

Paper • 2408.00118 • Published Jul 31, 2024 • 76

authored 2 papers 7 months ago

Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning

Paper • 2407.15762 • Published Jul 22, 2024 • 10

BOND: Aligning LLMs with Best-of-N Distillation

Paper • 2407.14622 • Published Jul 19, 2024 • 19

upvoted a paper 8 months ago

WARP: On the Benefits of Weight Averaged Rewarded Policies

Paper • 2406.16768 • Published Jun 24, 2024 • 23

authored a paper 8 months ago

WARP: On the Benefits of Weight Averaged Rewarded Policies

Paper • 2406.16768 • Published Jun 24, 2024 • 23

authored a paper 10 months ago

RecurrentGemma: Moving Past Transformers for Efficient Open Language Models

Paper • 2404.07839 • Published Apr 11, 2024 • 44

authored a paper 11 months ago

Gemma: Open Models Based on Gemini Research and Technology

Paper • 2403.08295 • Published Mar 13, 2024 • 48

authored 3 papers about 1 year ago

Direct Language Model Alignment from Online AI Feedback

Paper • 2402.04792 • Published Feb 7, 2024 • 31

Gemini: A Family of Highly Capable Multimodal Models

Paper • 2312.11805 • Published Dec 19, 2023 • 44

WARM: On the Benefits of Weight Averaged Reward Models

Paper • 2401.12187 • Published Jan 22, 2024 • 18