kaizuberbuehler
/

Alpesteibock-Llama-3-8B-Alpha

Text Generation

Inference Endpoints

text-generation-inference

Model card Files Files and versions Community

kaizuberbuehler commited on 20 days ago

Commit

06607c1

•

1 Parent(s): ab54f6e

Update README.md

Files changed (1) hide show

README.md +16 -1

README.md CHANGED Viewed

@@ -12,7 +12,22 @@ license: llama3
 ## Training Details
 ## Limitations

 ## Training Details
+Hardware: 1x RTX 4090
+Duration: ~30 hours in total (~2 hours for first phase and ~28 hours for second phase)
+### Hyperparameters
+Adapter: QLoRA
+Precision: 4 bit
+Optimizer: adamw_bnb_8bit
+LoRA Rank: 256
+LoRA Alpha: 256
+Learning Rate: 1e-5
+Context Length: 4096 tokens
+Batch Size: 1
+Gradient Accumulation Steps: 1
+Sample Packing: Off for first phase, on for second phase
+Epochs: 2
 ## Limitations