Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training • Paper 2405.15319 • Published May 24, 2024
DeepSpeed-Chat: Easy, Fast and Affordable RLHF Training of ChatGPT-like Models at All Scales • Paper 2308.01320 • Published Aug 2, 2023
Benchmarking Large Language Model Capabilities for Conditional Generation • Paper 2306.16793 • Published Jun 29, 2023