Jiang Jiwen

jjw0126

AI & ML interests

RL, LLM

Organizations

jjw0126's activity

New activity in nvidia/Llama3-ChatQA-1.5-8B 29 days ago

megatron format to HF format

#19 opened 29 days ago by jjw0126