nisten committed on
Commit
48116da
1 Parent(s): 010d91d

Update README.md

  1. README.md +15 -1
README.md CHANGED
@@ -1,10 +1,24 @@
1
  ---
2
  base_model: HuggingFaceTB/SmolLM-135M
 
 
3
  ---
4
 
5
  ### EVEN SMALLER Frankenstein of smolLm-0.13b upped to 0.15b
6
  Use this frankenbase for training.
7
 
8
  Done via semi-automated continuous merging to figure out the recipe.
9
  Model is more coherent.
10
 
@@ -22,4 +36,4 @@ Amazing option for further training. And this is a merge of the base, not the in
22
 
23
  ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6379683a81c1783a4a2ddba8/UK0_mQxy6GOHKxGKBbdhx.png)
24
 
25
- I don't understand how the f a 150mb file can talk but it can
 
1
  ---
2
  base_model: HuggingFaceTB/SmolLM-135M
3
+ datasets:
4
+ - LDJnr/Capybara
5
  ---
6
 
7
  ### EVEN SMALLER Frankenstein of smolLm-0.13b upped to 0.15b
8
  Use this frankenbase for training.
9
 
10
+ If you're here from Twitter and impatient, get the trained checkpoint file.
11
+
12
+ ```bash
13
+ biggie-smollm-checkpoint-twitter-q8_0.gguf
14
+ ```
15
+
16
+ ```bash
17
+ wget https://huggingface.co/nisten/Biggie-SmoLlm-0.15B-Base/resolve/main/biggie-smollm-checkpoint-twitter-q8_0.gguf
18
+
19
+ ./llama-cli -n 1024 -fa -b 512 --min-p 0.3 --top-p 0.85 -ctk q8_0 -ctv q8_0 --keep -1 -p "You are a Nasa jpl engineer teaching the user about space and cats. <|im_start|>User: How to build a city on Mars via calculating Aldrin-Cycler orbits?<|im_end|> /n " -m biggie-smollm-checkpoint-twitter-q8_0.gguf --temp 2 -ngl 0 -t 1 -co -cnv --reverse-prompt "Assistant:"
20
+ ```
21
+
22
  Done via semi-automated continuous merging to figure out the recipe.
23
  Model is more coherent.
24
 
 
36
 
37
  ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6379683a81c1783a4a2ddba8/UK0_mQxy6GOHKxGKBbdhx.png)
38
 
39
+ I don't understand how the f a 150mb file can talk but it can
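
If the `wget` step in the README gets interrupted or saves an error page instead of the model, `llama-cli` will fail to load the file. A quick sanity check is to confirm the download starts with the four-byte GGUF magic (`GGUF`). The helper below is a minimal sketch; `is_gguf` is a hypothetical name and not part of this repo or of llama.cpp, and it assumes a `head` that supports `-c` (GNU and BSD both do).

```bash
# Hypothetical helper (not from the repo): succeeds only if the file
# exists and its first four bytes are the GGUF magic "GGUF".
is_gguf() {
  [ -f "$1" ] && [ "$(head -c 4 "$1")" = "GGUF" ]
}

# Example use after the wget step from the README:
#   is_gguf biggie-smollm-checkpoint-twitter-q8_0.gguf \
#     && echo "looks like a GGUF file" \
#     || echo "not a GGUF file, try downloading again"
```

This only checks the header, so it catches HTML error pages and empty files but not a download that was truncated partway through.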