Ber666

SDSB

ber66666

AI & ML interests

None yet

Recent Activity

upvoted a paper 26 days ago

Offline Reinforcement Learning for LLM Multi-Step Reasoning

upvoted a paper about 1 month ago

Training Large Language Models to Reason in a Continuous Latent Space

upvoted a paper 6 months ago

Flow of Reasoning: Efficient Training of LLM Policy with Divergent Thinking

Organizations

None yet

Models

None public yet

Datasets

None public yet