Burkov
Andriy
Andriy's activity
Issues with FSDP and DeepSpeed During Distributed Training for Gemma
5
#30 opened 2 months ago
by
anandhperumal
Why a separate release?
#1 opened 2 months ago
by
Andriy
add_special_tokens=True doesn't add eos token at the end of the sequence
1
#4 opened 3 months ago
by
Andriy
Where is the model? 0 downloads means nobody can use it. Please fix.
10
#1 opened 5 months ago
by
Andriy
How does v0.2 manage to support 32k token context without Sliding Window Attention?
4
#85 opened 6 months ago
by
Andriy
What is the max. context length of Mistral-7B-Instruct-v0.2?
17
#43 opened 8 months ago
by
hanshupe
Longer inference time
2
#4 opened 6 months ago
by
dittops
Finetuning dataset
#35 opened 6 months ago
by
Andriy
Instruct-finetuning dataset
#4 opened 6 months ago
by
Andriy
Finetuning dataset
#2 opened 6 months ago
by
Andriy
Instruct-finetuning dataset
#1 opened 6 months ago
by
Andriy
Instruct-finetuning dataset
#3 opened 6 months ago
by
Andriy
instruct-finetuning dataset
1
#2 opened 6 months ago
by
Andriy
Instruct-finetuning dataset
#2 opened 6 months ago
by
Andriy
Instruct-finetuning dataset
#5 opened 6 months ago
by
Andriy
Instruct-finetuning dataset
#3 opened 6 months ago
by
Andriy
Instruct-finetuning dataset
#1 opened 6 months ago
by
Andriy
Instruct-finetuning dataset
#2 opened 6 months ago
by
Andriy
Instruct-finetuning dataset
1
#1 opened 6 months ago
by
Andriy
Instruct-finetuning dataset
#4 opened 6 months ago
by
Andriy
Instruct-finetuning dataset
1
#3 opened 6 months ago
by
Andriy
Instruct-finetuning dataset
1
#6 opened 6 months ago
by
Andriy
Instruct-finetuning dataset
#6 opened 6 months ago
by
Andriy
Instruct-finetuning dataset
1
#1 opened 6 months ago
by
Andriy
Instruct-finetuning dataset
1
#3 opened 6 months ago
by
Andriy
Datasets for function calling and JSON
5
#13 opened 6 months ago
by
Andriy
Instruct-finetuning dataset
1
#9 opened 6 months ago
by
Andriy
What is the SFT data?
5
#7 opened 10 months ago
by
Ede-CH
Instruct-finetuning dataset
#189 opened 6 months ago
by
Andriy
Instruct-finetuning dataset
#22 opened 6 months ago
by
Andriy
Instruct-finetuning dataset
1
#8 opened 6 months ago
by
Andriy
Instruct-finetuning dataset
5
#43 opened 6 months ago
by
Andriy
Instruct-finetuning data
#12 opened 6 months ago
by
Andriy
Instruct dataset
#5 opened 6 months ago
by
Andriy
DPO dataset
#11 opened 6 months ago
by
Andriy
Instruct dataset
#23 opened 6 months ago
by
Andriy
the license
2
#8 opened 6 months ago
by
Andriy
Is it QLoRA or a full finetune?
1
#5 opened 7 months ago
by
Andriy
Is it QLoRA or a full finetune?
1
#5 opened 7 months ago
by
Andriy
DeepSpeed ZeRO-3 and full finetune
2
#5 opened 7 months ago
by
Andriy
Dataset?
5
#1 opened 7 months ago
by
0xbitches
What is the context size of this model?
1
#11 opened 7 months ago
by
Andriy
Questions about architecture (+ LoRA)
2
#16 opened 7 months ago
by
alex0dd
Finetuning setup
#4 opened 7 months ago
by
Andriy
Can you tell us the original models that you merged to create this model?
1
#3 opened 9 months ago
by
Bruce001
What was the dataset used to pretrain Mistral-7B?
1
#38 opened 12 months ago
by
Andriy