RLAIF

Enterprise

community

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

alon-albalak authored a paper 28 days ago

Surveying the Effects of Quality, Diversity, and Complexity in Synthetic Data From Large Language Models

nlile authored a paper 2 months ago

Generative Reward Models

alon-albalak authored a paper 2 months ago

Generative Reward Models

View all activity

RLAIF's activity

alon-albalak

authored a paper 28 days ago

Surveying the Effects of Quality, Diversity, and Complexity in Synthetic Data From Large Language Models

Paper • 2412.02980 • Published 29 days ago • 12

nlile

authored a paper 2 months ago

Generative Reward Models

Paper • 2410.12832 • Published Oct 2, 2024 • 6

alon-albalak

authored a paper 2 months ago

Generative Reward Models

Paper • 2410.12832 • Published Oct 2, 2024 • 6

Asap7772

authored a paper 3 months ago

Adaptive Inference-Time Compute: LLMs Can Predict if They Can Do Better, Even Mid-Generation

Paper • 2410.02725 • Published Oct 3, 2024 • 1

Asap7772

authored 4 papers 5 months ago

sea-snell

authored 3 papers 5 months ago

LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models

Paper • 2311.18232 • Published Nov 30, 2023 • 1

The False Promise of Imitating Proprietary LLMs

Paper • 2305.15717 • Published May 25, 2023 • 5

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6, 2024 • 53

rmrafailov

authored a paper 5 months ago

PERSONA: A Reproducible Testbed for Pluralistic Alignment

Paper • 2407.17387 • Published Jul 24, 2024 • 18

nlile

authored a paper 5 months ago

PERSONA: A Reproducible Testbed for Pluralistic Alignment

Paper • 2407.17387 • Published Jul 24, 2024 • 18

LouisCastricato

authored a paper 5 months ago

PERSONA: A Reproducible Testbed for Pluralistic Alignment

Paper • 2407.17387 • Published Jul 24, 2024 • 18

rmrafailov

authored a paper 6 months ago

MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?

Paper • 2407.04842 • Published Jul 5, 2024 • 52

alon-albalak

authored 2 papers 6 months ago

Logic-LM: Empowering Large Language Models with Symbolic Solvers for Faithful Logical Reasoning

Paper • 2305.12295 • Published May 20, 2023

The Responsible Foundation Model Development Cheatsheet: A Review of Tools & Resources

Paper • 2406.16746 • Published Jun 24, 2024

alon-albalak

authored a paper 7 months ago

DataComp-LM: In search of the next generation of training sets for language models

Paper • 2406.11794 • Published Jun 17, 2024 • 50

rmrafailov

authored 2 papers 7 months ago

OpenVLA: An Open-Source Vision-Language-Action Model

Paper • 2406.09246 • Published Jun 13, 2024 • 36

Offline Regularised Reinforcement Learning for Large Language Models Alignment

Paper • 2405.19107 • Published May 29, 2024 • 14

AI & ML interests

Recent Activity

Team members 11

RLAIF's activity