Update README.md
README.md CHANGED

@@ -65,11 +65,11 @@ Example:
 
 # llama-2-13b-tagalog-v0.3 loras (09/01-02/2023)
 * Fine tuned on experimental datasets of ~1k items (Tagalog-focused dataset, based off Tagalog sentences augmented by LLaMA-2-13b base to create a 3-turn dialogue dataset between Human and Assistant)
-* 3 fine-tuned for 1 epoch, rank = 16
-* 3a
+* 3 fine-tuned for 1 epoch, rank = 16, lora alpha = 32
+* 3a with rank = 8
 * 3b for 2 epochs
 * 3c for 1 epoch, lr = 1e-4, warmup steps = 0.1
-* 3d
-* 3e for 2 epochs
+* 3d with lr = 2e-4, rank = 32, lora alpha = 64
+* 3e for 2 epochs
 * From LLaMA-2-13b
 * Trying LLaMA-2-13b chat/other base and curated dataset for next attempts
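For reference, the run configurations documented in the updated section can be summarized as plain Python dicts. This is a sketch transcribed from the notes above; the assumption that each lettered run (3a–3e) inherits the base run's unlisted settings, and the field name `warmup_ratio` for "warmup steps = 0.1", are inferences, since the actual training script is not part of this commit.

```python
# v0.3 base run: 1 epoch, rank = 16, lora alpha = 32 (from the notes above).
base = {"base_model": "LLaMA-2-13b", "epochs": 1, "rank": 16, "lora_alpha": 32}

# Each variant overrides only the hyperparameters the notes call out;
# everything else inheriting from `base` is an assumption.
runs = {
    "3":  dict(base),
    "3a": {**base, "rank": 8},
    "3b": {**base, "epochs": 2},
    "3c": {**base, "lr": 1e-4, "warmup_ratio": 0.1},
    "3d": {**base, "lr": 2e-4, "rank": 32, "lora_alpha": 64},
    "3e": {**base, "epochs": 2},
}

for name, cfg in runs.items():
    print(name, cfg)
```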