Align tokenizer with mistral-common
9
#225 opened 3 days ago
by
Rocketknight1
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1660312628256-60ba519750effef3a58beac3.png)
Max Response Length
#219 opened 12 days ago
by
LARRY-YIN
How much RAM is needed in AWS Dedicated Inference Endpoint Deployment?
5
#217 opened 17 days ago
by
ramda1234786
A complex prompt resulted in an empty output.
4
#216 opened 18 days ago
by
gostop
Python script running from VSCODE over WSL distro in Windows not finding this model
1
#215 opened 27 days ago
by
hrighugging
Fine-tuning Mixtral with different types of datasets
#214 opened about 1 month ago
by
ziko11
Model Overloaded
3
#213 opened about 1 month ago
by
ramda1234786
I want to make the chatbot using Mixtral-8x7B-Instruct-v0.1 model, but the inference speed is very slow, so I cannot use it as a chatbot. How can I fix this issue?
#211 opened about 1 month ago
by
rising620
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/RrkbQsG45_dgmQYNKTg8N.png)
Getting OS error.
#209 opened about 2 months ago
by
iffy
Addressing Incomplete Answers Generated from Large Contexts
2
#206 opened 2 months ago
by
pooja03
Model gating?
1
#204 opened 2 months ago
by
gawotik
Upload 2 files
2
#203 opened 2 months ago
by
chakkakrishna
RAG, prompt and memory with Mixtral
8
#201 opened 2 months ago
by
edoyen
Any Update on mistralai/Mixtral-8x7B-Instruct-v0.2 ?
#200 opened 2 months ago
by
bayraktaroglu
Input validation error: `inputs` tokens + `max_new_tokens` must be <= 2048. on Mixtral8x7b 32K token
2
#199 opened 2 months ago
by
sunnykusawa
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6582ba3092b5a9664df73b62/kGR6-jTTlbr8bBJRpsCBE.jpeg)
Warning message for right side padding even after setting padding_side="left"
#198 opened 2 months ago
by
mbismay
Input token size issue, does it realy supports 32k tokens?
1
#197 opened 2 months ago
by
sunnykusawa
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6582ba3092b5a9664df73b62/kGR6-jTTlbr8bBJRpsCBE.jpeg)
infinite carriage returns
#195 opened 2 months ago
by
lowfreak
What is the stop token for this model please
2
#194 opened 2 months ago
by
NigelTheMaker
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1673670043032-62129484581b98bb1ad00a60.jpeg)
[AUTOMATED] Model Memory Requirements
#193 opened 3 months ago
by
model-sizer-bot
Problem while running on multiple GPUs
#192 opened 3 months ago
by
venkilfc
Discrepancy between kv_proj in .safetensors and .pt?
1
#191 opened 3 months ago
by
kolinko
Missing Output problem
4
#190 opened 3 months ago
by
chaydaroglu
Instruct-finetuning dataset
#189 opened 3 months ago
by
Andriy
when I run it in multi-gpus by accelerate, it has an AttributeError
#188 opened 3 months ago
by
waleyWang
What is the actual context size of mistralai/Mixtral-8x7B-Instruct-v0.1 model
3
#186 opened 3 months ago
by
Pradeep1995
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1599822346546-noauth.jpeg)
How to All Utilize all GPU's when device="balanced_low_0" in GPU setting
2
#185 opened 3 months ago
by
kmukeshreddy
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1665881381855-noauth.jpeg)
Update README.md
#184 opened 3 months ago
by
alamati
Is function calling (tools) supported?
1
#183 opened 3 months ago
by
TomerRobusta
Getting cut-off responses with Mixtral 8x7B-Instruct-v0.1 mostly in Date of Birth years
1
#182 opened 3 months ago
by
keskival
How can I run it on multiple GPUs?
11
#181 opened 3 months ago
by
barbery
Where is the mixtral-8x7b's tokenizer encoder? Is there a specific repository or node module?
1
#180 opened 3 months ago
by
RamanSB
What is the max token limit on this model?
2
#179 opened 3 months ago
by
RamanSB
Finetuning Mixtral 8x7B Instruct-v0.1 using Transformers
2
#178 opened 3 months ago
by
Ateeqq
![](https://cdn-avatars.huggingface.co/v1/production/uploads/65b04ef37c11edbf6e39f4bb/26Eb8KkHuFZwvlsyL-Hhc.jpeg)
Update chat template to resemble the prompt as stated in the model card.
4
#176 opened 3 months ago
by
nilsec
max_sequence_length
1
#175 opened 3 months ago
by
Ravnoor1
Awesome. I Got Very Good Responses, However...
#174 opened 4 months ago
by
deleted
How to run the full model ?
2
#171 opened 4 months ago
by
dounykim
Is there a working/quantized/exl2 (etc) version that will fit on a single 24GB video card (4090)
2
#170 opened 4 months ago
by
cleverest
403 error
1
#169 opened 4 months ago
by
minhphan-qbe
Adding Evaluation Results
#168 opened 4 months ago
by
leaderboard-pr-bot
![](https://cdn-avatars.huggingface.co/v1/production/uploads/655506df9dc61e22c5f9c732/IZGvup0FdVlioPPIPnzZv.jpeg)
Rename README.md to RegulusOne
#167 opened 4 months ago
by
Theguy666
Help: CUDA Out of Memory. Hardware requirements.
2
#147 opened 4 months ago
by
zebfreeman
Update README.md
#146 opened 4 months ago
by
frank76rm
Experimental use
#144 opened 4 months ago
by
yassineelkhadiri14
![](https://cdn-avatars.huggingface.co/v1/production/uploads/65da3fb3c3e37ebc672b1d6c/3cURnm-n1YceRk_0K28Xe.jpeg)
TemplateError: Conversation roles must alternate user/assistant/user/assistant/...
4
#143 opened 4 months ago
by
quamer23
Is instruction format necessary
2
#142 opened 4 months ago
by
supercharge19