Update README.md
Browse files
README.md
CHANGED
@@ -1,10 +1,25 @@
|
|
1 |
---
|
2 |
base_model: Qwen/Qwen2.5-72B-Instruct
|
3 |
license: mit
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
4 |
---
|
5 |
|
6 |
-
Experimental commander model.
|
7 |
|
8 |
Named it Zelensky in order to troll Uncle Elon on twitter over how bad Grok-2 is.
|
9 |
|
10 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
---
|
2 |
base_model: Qwen/Qwen2.5-72B-Instruct
|
3 |
license: mit
|
4 |
+
datasets:
|
5 |
+
- arcee-ai/EvolKit-75K
|
6 |
+
- SkunkworksAI/reasoning-0.01
|
7 |
+
- berkeley-nest/Nectar
|
8 |
+
- Nexusflow/VirusTotalAgentic
|
9 |
+
- allenai/WildChat-1M-Full
|
10 |
+
- Magpie-Align/Magpie-LlamaCoT-250K
|
11 |
---
|
12 |
|
13 |
+
Experimental commander model V1.
|
14 |
|
15 |
Named it Zelensky in order to troll Uncle Elon on twitter over how bad Grok-2 is.
|
16 |
|
17 |
+
Training process, low 1 epoch learning rate and evolutionary-merged via https://github.com/arcee-ai/EvolKit
|
18 |
+
|
19 |
+
Process on 8x AMD Mi300 192GB gpus.
|
20 |
+
|
21 |
+
Thank you Vultr https://www.vultr.com/register/ for sponsoring the compute.
|
22 |
+
|
23 |
+
|
24 |
+
|
25 |
+
Qwen License applies by default.
|