Sainbayar Sukhbaatar's picture

2 1

Sainbayar Sukhbaatar

sainbar

·

https://tesatory.github.io/

AI & ML interests

None yet

Recent Activity

authored a paper about 2 months ago

Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback

authored a paper 3 months ago

Training Large Language Models to Reason in a Continuous Latent Space

authored a paper 4 months ago

Adaptive Decoding via Latent Preference Optimization

View all activity

Organizations

None yet

sainbar's activity

upvoted a paper 8 months ago

Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge

Paper • 2407.19594 • Published Jul 28, 2024 • 20