Chujie Zheng

chujiezheng

AI & ML interests

Large Language Models

Organizations

chujiezheng's activity

New activity in mistralai/Mistral-7B-Instruct-v0.3 24 days ago

no system message?

8
#14 opened about 1 month ago by mclassHF2023
New activity in princeton-nlp/Llama-3-Instruct-8B-SimPO 28 days ago

add chat_template

#3 opened 28 days ago by chujiezheng
New activity in chujiezheng/Llama-3-Instruct-8B-SimPO-ExPO about 1 month ago

Possibly wrong model

2
#1 opened about 1 month ago by ByteBrew23
New activity in chujiezheng/Smaug-Llama-3-70B-Instruct-ExPO about 1 month ago

Update README.md

#3 opened about 1 month ago by chujiezheng
New activity in chujiezheng/Llama3-8B-Chinese-Chat-ExPO about 1 month ago

Update README.md

#2 opened about 1 month ago by chujiezheng
New activity in chujiezheng/Llama3-70B-Chinese-Chat-ExPO about 1 month ago

Create README.md

#1 opened about 1 month ago by chujiezheng
New activity in chujiezheng/Smaug-Llama-3-70B-Instruct-ExPO about 1 month ago

Update README.md

#2 opened about 1 month ago by chujiezheng
New activity in chujiezheng/Llama3-8B-Chinese-Chat-ExPO about 1 month ago

Create README.md

#1 opened about 1 month ago by chujiezheng
New activity in chujiezheng/Smaug-Llama-3-70B-Instruct-ExPO about 1 month ago

Create README.md

#1 opened about 1 month ago by chujiezheng
New activity in chujiezheng/LLaMA3-iterative-DPO-final-ExPO about 1 month ago

Create README.md

#1 opened about 1 month ago by chujiezheng
New activity in chujiezheng/tulu-2-dpo-13b about 1 month ago

Update tokenizer_config.json

#2 opened about 1 month ago by chujiezheng
New activity in allenai/tulu-2-dpo-13b about 1 month ago

Update tokenizer_config.json

2
#4 opened about 1 month ago by chujiezheng
New activity in allenai/tulu-2-13b about 1 month ago

Update tokenizer_config.json

2
#2 opened about 1 month ago by chujiezheng
New activity in chujiezheng/tulu-2-dpo-13b about 2 months ago

Update README.md

#1 opened about 2 months ago by chujiezheng
New activity in chujiezheng/tulu-2-dpo-7b about 2 months ago

Update README.md

#1 opened about 2 months ago by chujiezheng
New activity in allenai/tulu-2-dpo-7b about 2 months ago

add license

1
#3 opened about 2 months ago by chujiezheng
New activity in allenai/tulu-2-dpo-13b about 2 months ago

add license

1
#3 opened about 2 months ago by chujiezheng
New activity in chujiezheng/internlm2-chat-1_8b-ExPO about 2 months ago

Update tokenizer_config.json

#1 opened about 2 months ago by chujiezheng
New activity in chujiezheng/internlm2-chat-7b-ExPO about 2 months ago

Update tokenizer_config.json

#1 opened about 2 months ago by chujiezheng
New activity in chujiezheng/internlm2-chat-20b-ExPO about 2 months ago

Update tokenizer_config.json

#1 opened about 2 months ago by chujiezheng
New activity in internlm/internlm2-chat-20b-sft about 2 months ago

fix `eos_token`

#4 opened about 2 months ago by chujiezheng
New activity in internlm/internlm2-chat-7b-sft about 2 months ago

fix `eos_token`

#3 opened about 2 months ago by chujiezheng
New activity in internlm/internlm2-chat-1_8b-sft about 2 months ago

fix `eos_token`

#1 opened about 2 months ago by chujiezheng
New activity in internlm/internlm2-chat-1_8b about 2 months ago

fix `eos_token`

#3 opened about 2 months ago by chujiezheng
New activity in internlm/internlm2-chat-7b about 2 months ago

fix `eos_token`

#12 opened about 2 months ago by chujiezheng
New activity in internlm/internlm2-chat-20b about 2 months ago

fix `eos_token`

#10 opened about 2 months ago by chujiezheng
New activity in google/gemma-1.1-7b-it 2 months ago
New activity in Nexusflow/Starling-RM-34B 3 months ago
New activity in Nexusflow/Starling-LM-7B-beta 3 months ago
New activity in thu-coai/CharacterGLM-6B 5 months ago

add model

#1 opened 5 months ago by wandz
New activity in LLM360/CrystalChat 5 months ago
New activity in mistralai/Mistral-7B-Instruct-v0.1 6 months ago

System Prompt

3
#41 opened 9 months ago by sakshat98
New activity in meta-llama/LlamaGuard-7b 7 months ago

Does not respect nerd guard

1
#7 opened 7 months ago by userzyzz
New activity in mistralai/Mistral-7B-Instruct-v0.1 7 months ago

When v2?

1
#88 opened 7 months ago by amgadhasan
New activity in lmsys/vicuna-13b-v1.5 8 months ago
New activity in lmsys/vicuna-7b-v1.5 8 months ago
New activity in thu-coai/LongLM-base 11 months ago
New activity in thu-coai/esconv 12 months ago

Train-dev-test splits ?

3
#1 opened 12 months ago by lihVerma
New activity in thu-coai/blenderbot-1B-augesc about 1 year ago
New activity in EleutherAI/pythia-6.9b about 1 year ago

Missing checkpoint at step26000

1
#2 opened about 1 year ago by chujiezheng
New activity in thu-coai/roberta-zh-sensible over 1 year ago
New activity in thu-coai/blenderbot-400M-esconv over 1 year ago
New activity in thu-coai/roberta-base-cold over 1 year ago
New activity in ceggian/bart_post_trained_reddit_batch128 over 1 year ago

About model details

#1 opened about 2 years ago by chujiezheng