Undi95 commited on
Commit
146d042
1 Parent(s): 1220186

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +45 -9
README.md CHANGED
@@ -22,15 +22,6 @@ As some people have told us our models are sloppy, Ikari decided to say fuck it
22
 
23
  Our dataset stayed the same since day one, we added data over time, cleaned them, and repeat. After not releasing model for a while because we were never satisfied, we think it's time to come back!
24
 
25
- ## Credits:
26
- - Undi
27
- - IkariDev
28
-
29
- ## Training data used:
30
- We will point out all dataset we used here, please be patient the time we get them all back kek.
31
-
32
- Temporary credit for the following madlads, who contributed to the datasets we have build over time: Gryphe, Caitlyn, Kalomaze, Gifted Gummy Bee, Sao [...]
33
-
34
  ## Prompt template: Llama-3-Instruct
35
 
36
  ```
@@ -43,6 +34,51 @@ Temporary credit for the following madlads, who contributed to the datasets we h
43
  {output}<|eot_id|>
44
  ```
45
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
46
  ## Others
47
 
48
  Undi: If you want to support us, you can [here](https://ko-fi.com/undiai).
 
22
 
23
  Our dataset stayed the same since day one, we added data over time, cleaned them, and repeat. After not releasing model for a while because we were never satisfied, we think it's time to come back!
24
 
 
 
 
 
 
 
 
 
 
25
  ## Prompt template: Llama-3-Instruct
26
 
27
  ```
 
34
  {output}<|eot_id|>
35
  ```
36
 
37
+ ## Credits:
38
+ - Undi
39
+ - IkariDev
40
+
41
+ ## Training data we used to make our dataset:
42
+
43
+ - [Epiculous/Gnosis](https://huggingface.co/Epiculous/Gnosis)
44
+ - [ChaoticNeutrals/Luminous_Opus](https://huggingface.co/datasets/ChaoticNeutrals/Luminous_Opus)
45
+ - [ChaoticNeutrals/Synthetic-Dark-RP](https://huggingface.co/datasets/ChaoticNeutrals/Synthetic-Dark-RP)
46
+ - [ChaoticNeutrals/Synthetic-RP](https://huggingface.co/datasets/ChaoticNeutrals/Synthetic-RP)
47
+ - [Gryphe/Sonnet3.5-SlimOrcaDedupCleaned](https://huggingface.co/datasets/Gryphe/Sonnet3.5-SlimOrcaDedupCleaned)
48
+ - [Gryphe/Opus-WritingPrompts](https://huggingface.co/datasets/Gryphe/Opus-WritingPrompts)
49
+ - [meseca/writing-opus-6k](https://huggingface.co/datasets/meseca/writing-opus-6k)
50
+ - [meseca/opus-instruct-9k](https://huggingface.co/datasets/meseca/opus-instruct-9k)
51
+ - [PJMixers/grimulkan_theory-of-mind-ShareGPT](https://huggingface.co/datasets/PJMixers/grimulkan_theory-of-mind-ShareGPT)
52
+ - [NobodyExistsOnTheInternet/ToxicQAFinal](https://huggingface.co/datasets/NobodyExistsOnTheInternet/ToxicQAFinal)
53
+ - [Undi95/toxic-dpo-v0.1-sharegpt](https://huggingface.co/datasets/Undi95/toxic-dpo-v0.1-sharegpt)
54
+ - [cgato/SlimOrcaDedupCleaned](https://huggingface.co/datasets/cgato/SlimOrcaDedupCleaned)
55
+ - [kalomaze/Opus_Instruct_25k](https://huggingface.co/datasets/kalomaze/Opus_Instruct_25k)
56
+ - [Doctor-Shotgun/no-robots-sharegpt](https://huggingface.co/datasets/Doctor-Shotgun/no-robots-sharegpt)
57
+ - [Norquinal/claude_multiround_chat_30k](https://huggingface.co/datasets/Norquinal/claude_multiround_chat_30k)
58
+ - [nothingiisreal/Claude-3-Opus-Instruct-15K](https://huggingface.co/datasets/nothingiisreal/Claude-3-Opus-Instruct-15K)
59
+ - All the Aesirs dataset, cleaned, unslopped
60
+ - All le luminae dataset, cleaned, unslopped
61
+ - Small part of Airoboros reduced
62
+
63
+ We sadly didn't find the sources of the following, DM us if you recognize your set !
64
+
65
+ - Opus_Instruct-v2-6.5K-Filtered-v2-sharegpt
66
+ - claude_sharegpt_trimmed
67
+ - CapybaraPure_Decontaminated-ShareGPT_reduced
68
+
69
+ ## Datasets credits:
70
+ - Epiculous
71
+ - ChaoticNeutrals
72
+ - Gryphe
73
+ - meseca
74
+ - PJMixers
75
+ - NobodyExistsOnTheInternet
76
+ - cgato
77
+ - kalomaze
78
+ - Doctor-Shotgun
79
+ - Norquinal
80
+ - nothingiisreal
81
+
82
  ## Others
83
 
84
  Undi: If you want to support us, you can [here](https://ko-fi.com/undiai).