
Ganqu Cui

ganqu

cgq15

AI & ML interests

None yet

Recent Activity

published an article 8 days ago

Process Reinforcement through Implicit Rewards

updated a Space 8 days ago

PRIME-RL/README

liked a dataset 8 days ago

PRIME-RL/Eurus-2-RL-Data


Articles

Process Reinforcement through Implicit Rewards

8 days ago • 14


ganqu's activity

published an article 8 days ago

Process Reinforcement through Implicit Rewards

Article • 8 days ago • 14

updated a Space 8 days ago

PRIME-RL/README

Running

liked a dataset 8 days ago

PRIME-RL/Eurus-2-RL-Data

Viewer • Updated 4 days ago • 484k • 149 • 19

liked 2 models 9 days ago

PRIME-RL/Eurus-2-7B-PRIME

Text Generation • Updated 4 days ago • 358 • 48

PRIME-RL/EurusPRM-Stage2

Updated 6 days ago • 78 • 6

updated a model 9 days ago

PRIME-RL/Eurus-2-7B-PRIME

Text Generation • Updated 4 days ago • 358 • 48

authored 3 papers about 1 month ago

RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness

Paper • 2405.17220 • Published May 27, 2024 • 1

UltraMedical: Building Specialized Generalists in Biomedicine

Paper • 2406.03949 • Published Jun 6, 2024

Free Process Rewards without Process Labels

Paper • 2412.01981 • Published Dec 2, 2024 • 30

upvoted a paper about 1 month ago

Free Process Rewards without Process Labels

Paper • 2412.01981 • Published Dec 2, 2024 • 30

updated a dataset 3 months ago

ganqu/openbackdoor

Preview • Updated Oct 23, 2024 • 6

authored 6 papers 9 months ago

Mastering Text, Code and Math Simultaneously via Fusing Highly Specialized Language Models

Paper • 2403.08281 • Published Mar 13, 2024

Advancing LLM Reasoning Generalists with Preference Trees

Paper • 2404.02078 • Published Apr 2, 2024 • 44

MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies

Paper • 2404.06395 • Published Apr 9, 2024 • 22

New activity in openbmb/Eurux-8x22b-nca 9 months ago

Will `Eurux-8x22b-sft` be released too?

#2 opened 9 months ago by jukofyork

liked 2 models 9 months ago

openbmb/Eurux-8x22b-kto

Text Generation • Updated Apr 29, 2024 • 13 • 8

openbmb/Eurux-8x22b-nca

Text Generation • Updated Apr 15, 2024 • 37 • 28