BramVanroy
/

llama2-13b-ft-mc4_nl_cleaned_tiny

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

BramVanroy commited on Sep 27, 2023

Commit

b23fe7d

•

1 Parent(s): 92b6946

Update README.md

Files changed (1) hide show

README.md +4 -1

README.md CHANGED Viewed

@@ -41,9 +41,12 @@ Trained with LoRA targetting `["q_proj", "v_proj"]` in 4 bit and merged before u
 The adapters are in the `adapters` branch.
 ### Training hyperparameters
-The following hyperparameters were used during training:
 - learning_rate: 0.0003
 - train_batch_size: 12
 - eval_batch_size: 12

 The adapters are in the `adapters` branch.
+Initial training investigation on the Tier-1 HPC of [Vlaams Supercomputer Centrum (VSC)](https://www.vscentrum.be/) and training on our own research cluster of 4x 3090s.
 ### Training hyperparameters
+The following hyperparameters were used during training in the HPC investigation:
 - learning_rate: 0.0003
 - train_batch_size: 12
 - eval_batch_size: 12