Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
EpistemeAI
/
Fireball-Meta-Llama-3.2-8B-Instruct-agent-003-128k-code-DPO
like
0
Text Generation
Transformers
PyTorch
English
llama
text-generation-inference
unsloth
trl
conversational
Inference Endpoints
arxiv:
2210.03629
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
Fireball-Meta-Llama-3.2-8B-Instruct-agent-003-128k-code-DPO
1 contributor
History:
4 commits
legolasyiu
Update README.md
b3c0fce
verified
10 days ago
.gitattributes
Safe
1.52 kB
initial commit
10 days ago
README.md
8.5 kB
Update README.md
10 days ago
config.json
Safe
1.01 kB
(Trained with Unsloth)
10 days ago
generation_config.json
Safe
234 Bytes
(Trained with Unsloth)
10 days ago
pytorch_model-00001-of-00004.bin
4.98 GB
LFS
(Trained with Unsloth)
10 days ago
pytorch_model-00002-of-00004.bin
5 GB
LFS
(Trained with Unsloth)
10 days ago
pytorch_model-00003-of-00004.bin
4.92 GB
LFS
(Trained with Unsloth)
10 days ago
pytorch_model-00004-of-00004.bin
1.17 GB
LFS
(Trained with Unsloth)
10 days ago
pytorch_model.bin.index.json
Safe
24 kB
(Trained with Unsloth)
10 days ago
special_tokens_map.json
Safe
454 Bytes
(Trained with Unsloth)
10 days ago
tokenizer.json
Safe
9.09 MB
(Trained with Unsloth)
10 days ago
tokenizer_config.json
Safe
51.4 kB
(Trained with Unsloth)
10 days ago