KaiserWhoLearns
/

PTvsSFT_OLMo1b

Model card Files Files and versions Community

KaiserWhoLearns commited on Jun 22, 2024

Commit

906f2c5

·

verified ·

1 Parent(s): 7b208dc

Update README.md

Files changed (1) hide show

README.md +22 -3

README.md CHANGED Viewed

@@ -1,3 +1,22 @@
----
-license: apache-2.0
----

+This is the model checkpoint release for Amuro \& Char: Analyzing the Relationship between Pre-Training and Fine-Tuning of Large Language Models.
+All the fine-tuned model checkpoints are released in this repository. The naming convention of the revisions are `olmo1b_hf_{checkpoint}_{train_dataset}_{epoch}_{lr}`.
+To load a specific model checkpoint, use the following command.
+```
+model = AutoModelForCausalLM.from_pretrained(
+                model_name_or_path="KaiserWhoLearns/PTvsSFT_OLMo1b",
+                trust_remote_code=trust_remote_code,
+                revision="your revision"
+            )
+```
+All the checkpoints are fine-tuned based on the checkpoints of [OLMo1b-HF](https://huggingface.co/allenai/OLMo-1B-hf).
+Citation:
+```
+TODO
+```
+---
+license: apache-2.0
+---