Has the tokenizer of the base model(Mistral-7B-v0.1) been retrained?
#37 opened 5 days ago
by
LH0521
How did you trained your LatentAttentionLayer?
1
#36 opened 14 days ago
by
juneonetwothree
Why do we need to hardcode self._attn_implementation = "eager"
1
#35 opened 14 days ago
by
shantanuagarwal
Error to load model with HuggingFace API
1
#34 opened 15 days ago
by
hhcloud
Regarding max seq length
1
#33 opened 15 days ago
by
sandeep456
How to fine-tune this model?
#32 opened 16 days ago
by
caochengchen
error with module datasets
2
#31 opened 17 days ago
by
claraadam
Distant resource does not have a Content-Length
#30 opened 17 days ago
by
caochengchen
Best instructions for clustering and semantic similarity
2
#29 opened 18 days ago
by
rmilliere
Dataloader multiprocessing error
1
#28 opened 21 days ago
by
Atsunori
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1615517039409-noauth.jpeg)
Fixing "KeyError: 'NVEmbedConfig'"
8
#27 opened 22 days ago
by
Th3l
Error using multi-gpu support
4
#26 opened 24 days ago
by
bobwhiterabbit
Access to model nvidia/NV-Embed-v1 is restricted. You must be authenticated to access it
6
#25 opened 24 days ago
by
yijiu
Matryoshka Embedding
1
#24 opened 24 days ago
by
XingyanZhang
nvidia/NV-Embed-v1 is not the path to a directory containing a file named config.json.
3
#23 opened 24 days ago
by
XuehangCang
![](https://cdn-avatars.huggingface.co/v1/production/uploads/63fb7edfa3c067e6289097c8/7M2NnBsvc60X7BP5v0iCj.jpeg)
Finetuning guidelines
#21 opened 26 days ago
by
mali404
How much VRAM is needed to run this model? Like for the bare minimum length etc?
3
#20 opened 27 days ago
by
smpa239
Ollama Version
1
#19 opened 27 days ago
by
yangwang825
![](https://cdn-avatars.huggingface.co/v1/production/uploads/60f313f4adf471cbdf8bb66a/5NJFqnldE_0fdE_mEvz9V.jpeg)
Weights are in FP16 (loaded in FP32) but paper mentions BF16
#17 opened 29 days ago
by
AdrienC
ONNX version
1
#16 opened 29 days ago
by
michaelfeil
![](https://cdn-avatars.huggingface.co/v1/production/uploads/644fac0ce1d7a97f3b653ab1/fottSAPFrJdKeMW2UJv_l.jpeg)
Sentence Transformer compatibility
4
#15 opened 29 days ago
by
michaelfeil
![](https://cdn-avatars.huggingface.co/v1/production/uploads/644fac0ce1d7a97f3b653ab1/fottSAPFrJdKeMW2UJv_l.jpeg)
Please provide a 8bit quantified version
#14 opened 29 days ago
by
fukai
How to use for AutoModelForSequenceClassification?
#13 opened 30 days ago
by
deshwalmahesh
Possible to implement `_no_split_modules` attribute?
1
#12 opened 30 days ago
by
ronnybehrens
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1677145697781-noauth.png)
missing citation
3
#11 opened about 1 month ago
by
SeanLee97
![](https://cdn-avatars.huggingface.co/v1/production/uploads/635cc29de7aef2358a9b03ee/SVHL_mTCiOfmBamzSucb0.jpeg)
Multi-Lingual?
2
#10 opened about 1 month ago
by
dejanseo
![](https://cdn-avatars.huggingface.co/v1/production/uploads/64732e7f7be71eb8b1b572a8/bEgqSstenzd6kjkGWjrgd.png)
Getting "KeyError" when loading model
5
#8 opened about 1 month ago
by
tsakaiba
TypeError: MistralDecoderLayer.forward() got an unexpected keyword argument 'is_causal'
3
#7 opened about 1 month ago
by
yxzwayne
![](https://cdn-avatars.huggingface.co/v1/production/uploads/638dc18e8da27a1390bee22f/kM5-5uorjPCvPb1H8uh1O.jpeg)
Is this model active?
1
#5 opened about 1 month ago
by
gsnic
Sharing training data & reproducing training
1
#4 opened about 1 month ago
by
xhluca
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1633380224986-5fa9ff3ea13e063b8b2b60cb.jpeg)