Miakat

darthfalka

AI & ML interests

None yet

Recent Activity

updated a Space 4 days ago

darthfalka/hf_space

liked a dataset 21 days ago

valhalla/emoji-dataset

liked a Space 29 days ago

yslan/GaussianAnything-AIGC3D

Organizations

None yet

darthfalka's activity

upvoted a collection about 1 month ago

Qwen2.5-Coder

Code-specific model series based on Qwen2.5 • 40 items • Updated Nov 28, 2024 • 258

upvoted a paper about 2 months ago

The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization

Paper • 2403.17031 • Published Mar 24, 2024 • 3

upvoted an article 5 months ago

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Article • Dec 9, 2022 • 122

upvoted an article 8 months ago

The Technology Behind BLOOM Training

Article • Jul 14, 2022 • 24