KaiserWhoLearns committed on
Commit
906f2c5
1 Parent(s): 7b208dc

Update README.md

Files changed (1)
  1. README.md +22 -3
README.md CHANGED
@@ -1,3 +1,22 @@
1
- ---
2
- license: apache-2.0
3
- ---
1
+ This is the model checkpoint release for Amuro \& Char: Analyzing the Relationship between Pre-Training and Fine-Tuning of Large Language Models.
2
+
3
+ All fine-tuned model checkpoints are released in this repository, one per revision. Revisions are named `olmo1b_hf_{checkpoint}_{train_dataset}_{epoch}_{lr}`.
4
+ To load a specific model checkpoint, pass its revision name to `from_pretrained`:
5
+ ```
6
+ from transformers import AutoModelForCausalLM
+
+ model = AutoModelForCausalLM.from_pretrained(
7
+ pretrained_model_name_or_path="KaiserWhoLearns/PTvsSFT_OLMo1b",
8
+ trust_remote_code=False,  # assumption: the HF-format OLMo weights load without remote code
9
+ revision="your revision"
10
+ )
11
+ ```
12
+
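The naming convention above can be turned into a small helper. The following is a minimal sketch, not part of the original README: the function names `revision_name` and `list_revisions` are hypothetical, the field values in the example are placeholders, and it assumes that each revision is published as a branch of the repository. The exact formatting of each field (for example, how the learning rate is written) is not specified here, so listing the published revisions is the safer way to find a valid name.

```python
def revision_name(checkpoint: str, train_dataset: str, epoch: str, lr: str) -> str:
    """Fill in the documented pattern olmo1b_hf_{checkpoint}_{train_dataset}_{epoch}_{lr}."""
    return f"olmo1b_hf_{checkpoint}_{train_dataset}_{epoch}_{lr}"


def list_revisions(repo_id: str = "KaiserWhoLearns/PTvsSFT_OLMo1b") -> list[str]:
    """Return the branch names of the repository (requires network access).

    Assumption: revisions are stored as branches rather than tags.
    """
    # Imported lazily so the helper above works without huggingface_hub installed.
    from huggingface_hub import list_repo_refs

    refs = list_repo_refs(repo_id)
    return sorted(branch.name for branch in refs.branches)


# Hypothetical field values, for illustration only.
name = revision_name("main", "some_dataset", "epoch3", "2e-5")
print(name)
```

A name returned by `list_revisions()` can be passed directly as the `revision` argument of `from_pretrained`.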
13
+ All checkpoints are fine-tuned from checkpoints of [OLMo1b-HF](https://huggingface.co/allenai/OLMo-1B-hf).
14
+
15
+ Citation:
16
+ ```
17
+ TODO
18
+ ```
19
+
20
+ ---
21
+ license: apache-2.0
22
+ ---