
ICML2023
AI & ML interests
None defined yet.
Recent Activity
View all activity
ICML2023's activity

ameerazam08
posted
an
update
about 1 month ago
Post
1613
R1 is out! And with a lot of other R1 releated models...

hysts
updated
a
Space
about 2 months ago

vwxyzjn
authored
5
papers
2 months ago
The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization
Paper
•
2403.17031
•
Published
•
6
A2C is a special case of PPO
Paper
•
2205.09123
•
Published
Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models
Paper
•
2410.18252
•
Published
•
7
TÜLU 3: Pushing Frontiers in Open Language Model Post-Training
Paper
•
2411.15124
•
Published
•
59
2 OLMo 2 Furious
Paper
•
2501.00656
•
Published
•
16

mbrack
authored
a
paper
3 months ago
Post
12726
Google drops Gemini 2.0 Flash Thinking
a new experimental model that unlocks stronger reasoning capabilities and shows its thoughts. The model plans (with thoughts visible), can solve complex problems with Flash speeds, and more
now available in anychat, try it out: akhaliq/anychat
a new experimental model that unlocks stronger reasoning capabilities and shows its thoughts. The model plans (with thoughts visible), can solve complex problems with Flash speeds, and more
now available in anychat, try it out: akhaliq/anychat

Kameshr
authored
a
paper
3 months ago
Post
13278
QwQ-32B-Preview is now available in anychat
A reasoning model that is competitive with OpenAI o1-mini and o1-preview
try it out: akhaliq/anychat
A reasoning model that is competitive with OpenAI o1-mini and o1-preview
try it out: akhaliq/anychat
Post
4213
Post
3121
anychat
supports chatgpt, gemini, perplexity, claude, meta llama, grok all in one app
try it out there: akhaliq/anychat
supports chatgpt, gemini, perplexity, claude, meta llama, grok all in one app
try it out there: akhaliq/anychat

xzyao
authored
a
paper
4 months ago

Lupin1998
authored
a
paper
5 months ago