Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
lunahr
/
thea-rp-3b-25r
like
1
Text Generation
Transformers
Safetensors
KingNish/reasoning-base-20k
lunahr/thea-name-overrides
English
llama
text-generation-inference
trl
sft
reasoning
llama-3
conversational
Eval Results
Inference Endpoints
License:
llama3.2
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
861d3ec
thea-rp-3b-25r
Commit History
Adding Evaluation Results
861d3ec
verified
leaderboard-pr-bot
commited on
Oct 17, 2024
Fixed README reference
ed4c338
verified
Piotr Zalewski
commited on
Oct 14, 2024
Add reasoning to tokenizer
4ddd7e8
verified
Piotr Zalewski
commited on
Oct 13, 2024
written README
539c687
verified
Piotr Zalewski
commited on
Oct 13, 2024
Upload merged BF16 model
73e5b07
verified
Piotr Zalewski
commited on
Oct 13, 2024
initial commit
e907787
verified
Piotr Zalewski
commited on
Oct 13, 2024