Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Chuanming
/
Mixtral-QLoRA-test
like
0
arxiv:
1909.08593
Model card
Files
Files and versions
Community
main
Mixtral-QLoRA-test
/
examples
/
scripts
1 contributor
History:
1 commit
Chuanming
Upload folder using huggingface_hub
fa4458a
over 1 year ago
ddpo.py
6.17 kB
Upload folder using huggingface_hub
over 1 year ago
dpo.py
7.57 kB
Upload folder using huggingface_hub
over 1 year ago
ppo.py
7.96 kB
Upload folder using huggingface_hub
over 1 year ago
ppo_multi_adapter.py
5.2 kB
Upload folder using huggingface_hub
over 1 year ago
reward_modeling.py
5.6 kB
Upload folder using huggingface_hub
over 1 year ago
sft.py
6.94 kB
Upload folder using huggingface_hub
over 1 year ago