Burkov

#30 opened 2 months ago by

anandhperumal

New activity in axolotl-ai-co/gemma-2-9b 2 months ago

Why a separate release?

#1 opened 2 months ago by

Andriy

New activity in Qwen/Qwen2-0.5B-Instruct 3 months ago

add_special_tokens=True doesn't add eos token at the end of the sequence

#4 opened 3 months ago by

Andriy

New activity in microsoft/Phi-3-mini-128k-instruct-onnx 5 months ago

Where is the model? 0 downloads means nobody can use it. Please fix.

10

#1 opened 5 months ago by

Andriy

New activity in mistralai/Mistral-7B-Instruct-v0.2 6 months ago

How does v0.2 manages to support 32k token context without Sliding Window Attention?

4

#85 opened 6 months ago by

Andriy

What is the max. content length of Mistral-7B-Instruct-v0.2?

17

#43 opened 8 months ago by

hanshupe

New activity in 1bitLLM/bitnet_b1_58-3B 6 months ago

Longer inference time

#4 opened 6 months ago by

dittops

New activity in WizardLMTeam/WizardCoder-Python-34B-V1.0 6 months ago

Finetuning dataset

#35 opened 6 months ago by

Andriy

New activity in Qwen/Qwen1.5-MoE-A2.7B-Chat 6 months ago

Instruct-finetuning dataset

#4 opened 6 months ago by

Andriy

New activity in FelixChao/Capricorn-7B 6 months ago

Finetuning dataset

#2 opened 6 months ago by

Andriy

New activity in cloudyu/Yi-34Bx2-MOE-200K 6 months ago

Instruct-finetuning dataset

#1 opened 6 months ago by

Andriy

New activity in touqir/Cyrax-7B 6 months ago

Instruct-finetuning dataset

#3 opened 6 months ago by

Andriy

New activity in Kukedlc/NeuralKrishna-7B-V2-DPO 6 months ago

instruct-finetuning dataset

#2 opened 6 months ago by

Andriy

New activity in FelixChao/Capricorn-7B-DPO 6 months ago

Instruct-finetuning dataset

#2 opened 6 months ago by

Andriy

New activity in MaziyarPanahi/Calme-7B-Instruct-v0.5 6 months ago

Instruct-finetuning dataset

#5 opened 6 months ago by

Andriy

New activity in jan-hq/stealth-v2 6 months ago

Instruct-finetuning dataset

#3 opened 6 months ago by

Andriy

New activity in chihoonlee10/T3Q-EN-DPO-Mistral-7B 6 months ago

Instruct-finetuning dataset

#1 opened 6 months ago by

Andriy

New activity in cloudyu/Yi-34Bx2-MoE-60B-DPO 6 months ago

Instruct-finetuning dataset

#2 opened 6 months ago by

Andriy

New activity in bobofrut/ladybird-base-7B-v8 6 months ago

Instruct-finetuning dataset

#1 opened 6 months ago by

Andriy

New activity in zhengr/MixTAO-7Bx2-MoE-Instruct-v7.0 6 months ago

Instruct-finetuning dataset

#4 opened 6 months ago by

Andriy

New activity in TomGrc/FusionNet_34Bx2_MoE 6 months ago

Instruct-finetuning dataset

#3 opened 6 months ago by

Andriy

New activity in abacusai/Smaug-34B-v0.1 6 months ago

Datasets

#8 opened 6 months ago by

Andriy

New activity in migtissera/Tess-72B-v1.5b 6 months ago

Instruct-finetuning dataset

#6 opened 6 months ago by

Andriy

New activity in mixtao/MixTAO-7Bx2-MoE-v8.1 6 months ago

Instruct-finetuning dataset

#6 opened 6 months ago by

Andriy

New activity in MTSAIR/MultiVerse_70B 6 months ago

Instruct-finetuning dataset

#1 opened 6 months ago by

Andriy

New activity in davidkim205/Rhea-72b-v0.5 6 months ago

Instruct-finetuning dataset

#3 opened 6 months ago by

Andriy

New activity in NousResearch/Hermes-2-Pro-Mistral-7B 6 months ago

Datasets for function calling and JSON

#13 opened 6 months ago by

Andriy

New activity in openchat/openchat-3.5-0106 6 months ago

Instruct-finetuning dataset

#9 opened 6 months ago by

Andriy

New activity in 01-ai/Yi-34B-Chat 6 months ago

What the SFT data?

#7 opened 10 months ago by

Ede-CH

New activity in mistralai/Mixtral-8x7B-Instruct-v0.1 6 months ago

Instruct-finetuning dataset

#189 opened 6 months ago by

Andriy

New activity in WizardLMTeam/WizardLM-70B-V1.0 6 months ago

Instruct-finetuning dataset

#22 opened 6 months ago by

Andriy

New activity in Nexusflow/Starling-LM-7B-beta 6 months ago

Instruct-finetuning dataset

#8 opened 6 months ago by

Andriy

New activity in CohereForAI/c4ai-command-r-v01 6 months ago

Instruct-finetuning dataset

#43 opened 6 months ago by

Andriy

New activity in Qwen/Qwen1.5-72B-Chat 6 months ago

Instruct-finetuning data

#12 opened 6 months ago by

Andriy

New activity in openchat/openchat-3.5-0106-gemma 6 months ago

Instruct dataset

#5 opened 6 months ago by

Andriy

New activity in MaziyarPanahi/Calme-7B-Instruct-v0.2 6 months ago

Dataset

#4 opened 6 months ago by

Andriy

New activity in moreh/MoMo-72B-lora-1.8.7-DPO 6 months ago

DPO dataset

#11 opened 6 months ago by

Andriy

New activity in abacusai/Smaug-72B-v0.1 6 months ago

Dataset

#26 opened 6 months ago by

Andriy

New activity in databricks/dbrx-instruct 6 months ago

Instruct dataset

#23 opened 6 months ago by

Andriy

the license

#8 opened 6 months ago by

Andriy

New activity in migtissera/Tess-72B-v1.5b 7 months ago

Is it QLoRA or a full finetune?

#5 opened 7 months ago by

Andriy

New activity in ibivibiv/alpaca-dragon-72b-v1 7 months ago

Is it QLoRA or a full finetune?

#5 opened 7 months ago by

Andriy

New activity in abacusai/Liberated-Qwen1.5-72B 7 months ago

DeepSpeed ZeRO-3 and full finetune

#5 opened 7 months ago by

Andriy

New activity in abacaj/phi-2-super 7 months ago

Dataset?

#1 opened 7 months ago by

0xbitches

New activity in upstage/SOLAR-10.7B-v1.0 7 months ago

What is the context size of this model?

#11 opened 7 months ago by

Andriy

New activity in abacusai/Smaug-72B-v0.1 7 months ago

Questions about architecture (+ LoRA)

#16 opened 7 months ago by

alex0dd

New activity in NousResearch/Nous-Hermes-Llama2-70b 7 months ago

Finetuning setup

#4 opened 7 months ago by

Andriy

New activity in OpenPipe/mistral-ft-optimized-1218 9 months ago

Can you tell us the original models that you merged to create this model？

#3 opened 9 months ago by

Bruce001

New activity in mistralai/Mistral-7B-v0.1 12 months ago

What was the dataset used to pretrain Mistral-7B?