Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Ray2333
/
GRM-Llama3.2-3B-rewardmodel-ft
like
2
Text Classification
Safetensors
Skywork/Skywork-Reward-Preference-80K-v0.2
llama
arxiv:
2406.10216
License:
apache-2.0
Model card
Files
Files and versions
Community
1
Train
2780666
GRM-Llama3.2-3B-rewardmodel-ft
Commit History
Update README.md
2780666
verified
Ray2333
commited on
11 days ago
Update README.md
6630d95
verified
Ray2333
commited on
28 days ago
Update README.md
6777dd4
verified
Ray2333
commited on
28 days ago
Update README.md
fff9954
verified
Ray2333
commited on
29 days ago
Update README.md
5f83dd8
verified
Ray2333
commited on
29 days ago
Update README.md
50fb9d4
verified
Ray2333
commited on
29 days ago
Update config.json
e3dc5ec
verified
Ray2333
commited on
29 days ago
Upload tokenizer
9ac3e42
verified
Ray2333
commited on
29 days ago
Upload LlamaForSequenceClassification
0d2e996
verified
Ray2333
commited on
29 days ago
initial commit
241cf11
verified
Ray2333
commited on
29 days ago