CausalLM
/

35b-beta2ep

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

JosephusCheung commited on Apr 13, 2024

Commit

5275e6e

•

1 Parent(s): 7fbd7b5

Update README.md

Files changed (1) hide show

README.md +19 -0

README.md CHANGED Viewed

@@ -5,6 +5,25 @@ language:
 - zh
 - ja
 - de
 ---
 Tokenizer is different from cohere - and chat template is ChatML - fully fine-tuned at 128K+ ~ 30M entries long, web crawl input, GPT-4-32k/3.5-16k output, synthetic dataset - 1 epoch

 - zh
 - ja
 - de
+datasets:
+- JosephusCheung/GuanacoDataset
+- meta-math/MetaMathQA
+- jondurbin/airoboros-3.1
+- WizardLM/WizardLM_evol_instruct_V2_196k
+- RyokoAI/ShareGPT52K
+- RyokoAI/Fandom23K
+- milashkaarshif/MoeGirlPedia_wikitext_raw_archive
+- wikipedia
+- wiki_lingua
+- garage-bAInd/Open-Platypus
+- LDJnr/Puffin
+- BAAI/COIG
+- TigerResearch/tigerbot-zhihu-zh-10k
+- liwu/MNBVC
+- teknium/openhermes
+- CausalLM/Refined-Anime-Text
+- microsoft/orca-math-word-problems-200k
+- m-a-p/CodeFeedback-Filtered-Instruction
 ---
 Tokenizer is different from cohere - and chat template is ChatML - fully fine-tuned at 128K+ ~ 30M entries long, web crawl input, GPT-4-32k/3.5-16k output, synthetic dataset - 1 epoch