invisietch commited on
Commit
f34c5eb
·
verified ·
1 Parent(s): 27fce1f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -1
README.md CHANGED
@@ -53,7 +53,7 @@ I'll try resolving it with a light merge ASAP, it seems like the wrong weight is
53
  I appreciate all feedback on any of my models, you can use:
54
 
55
  * [My Discord server](https://discord.gg/AJwZuu7Ncx) - requires Discord.
56
- * [The Community tab](https://huggingface.co/invisietch/MiS-Firefly-v0.1-22B/discussions) - requires HF login.
57
  * Discord DMs to **invisietch**.
58
 
59
  Your feedback is how I improve these models for future versions.
@@ -95,6 +95,8 @@ The first stage of my training was a single epoch at low LR over a 474 million t
95
 
96
  I followed this up with a coherence, decensorship & roleplay finetune over a 172 million token instruct dataset over two epochs.
97
 
 
 
98
  Total training time was about 32hrs on 4x Nvidia A100 80GB.
99
 
100
  <img src="https://raw.githubusercontent.com/axolotl-ai-cloud/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>
 
53
  I appreciate all feedback on any of my models, you can use:
54
 
55
  * [My Discord server](https://discord.gg/AJwZuu7Ncx) - requires Discord.
56
+ * [The Community tab](https://huggingface.co/invisietch/MiS-Firefly-v0.2-22B/discussions) - requires HF login.
57
  * Discord DMs to **invisietch**.
58
 
59
  Your feedback is how I improve these models for future versions.
 
95
 
96
  I followed this up with a coherence, decensorship & roleplay finetune over a 172 million token instruct dataset over two epochs.
97
 
98
+ I did a slerp merge of epoch 1 into epoch 2 at a light weight which resolved the name-spelling issues on quantized versions of Firefly v0.1.
99
+
100
  Total training time was about 32hrs on 4x Nvidia A100 80GB.
101
 
102
  <img src="https://raw.githubusercontent.com/axolotl-ai-cloud/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>