Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1
1
1
Johan Samir Obando Ceron
johanobandoc
Follow
ppzxx's profile picture
AthulSathyapal's profile picture
akhaliq's profile picture
4 followers
·
1 following
johanobandoc
JohanSamir
AI & ML interests
Reinforcement Learning, Deep Learning, LLMs/LVMs.
Recent Activity
commented
on
a paper
4 days ago
Trajectory Balance with Asynchrony: Decoupling Exploration and Learning for Fast, Scalable LLM Post-Training
authored
a paper
17 days ago
Trajectory Balance with Asynchrony: Decoupling Exploration and Learning for Fast, Scalable LLM Post-Training
upvoted
a
paper
17 days ago
Trajectory Balance with Asynchrony: Decoupling Exploration and Learning for Fast, Scalable LLM Post-Training
View all activity
Organizations
Papers
6
arxiv:
2503.18929
arxiv:
2402.12479
arxiv:
2402.08609
arxiv:
2310.03882
Expand 6 papers
models
3
Sort: Recently updated
johanobandoc/rlooG
Text Generation
•
Updated
Nov 24, 2024
•
3
johanobandoc/RM_anthropic_hh_EleutherAI1b
Text Classification
•
Updated
Sep 18, 2024
•
1
johanobandoc/SFT_anthropic_hh_EleutherAI1b
Text Classification
•
Updated
Sep 18, 2024
•
1
datasets
11
Sort: Recently updated
johanobandoc/Natural_sciences
Viewer
•
Updated
Feb 5
•
80
•
10
johanobandoc/Reading_comprehension
Viewer
•
Updated
Feb 5
•
51
•
8
johanobandoc/Driving_test
Viewer
•
Updated
Feb 5
•
495
•
10
johanobandoc/Physics
Viewer
•
Updated
Feb 5
•
256
•
7
johanobandoc/Math
Viewer
•
Updated
Feb 5
•
275
•
9
johanobandoc/gsm8k
Updated
Oct 8, 2024
•
5
johanobandoc/natural_science_spanish_exam
Viewer
•
Updated
Aug 31, 2024
•
831
•
10
johanobandoc/literature_languaje_spanish_exam
Viewer
•
Updated
Aug 28, 2024
•
479
•
5
johanobandoc/philosophy_spanish_exam
Viewer
•
Updated
Aug 27, 2024
•
163
•
5
johanobandoc/economy_spanish_exam
Viewer
•
Updated
Aug 26, 2024
•
215
•
7
Expand 11 datasets