Björn Plüster
bjoernp
AI & ML interests
None yet
Organizations
bjoernp's activity
Can you share how you converted this?
5
#1 opened 14 days ago
by
bjoernp
Hf safetensors version
8
#3 opened 17 days ago
by
ehartford
![](https://cdn-avatars.huggingface.co/v1/production/uploads/63111b2d88942700629f5771/u2a9y-yx6TG0N31OhMSHI.png)
use_flash_attention_2=True
3
#9 opened about 2 months ago
by
TillFetzer
leo-mistral-hessianai-7b-chat for privateGPT
3
#8 opened 2 months ago
by
Dodo124
Update tokenizer_config.json
#1 opened 2 months ago
by
bjoernp
Problems with flash-attention2
1
#13 opened 4 months ago
by
omaer0
Loss function?
1
#10 opened 7 months ago
by
narvind2003
![](https://cdn-avatars.huggingface.co/v1/production/uploads/605cc8ef6ce6cabbb3474b6a/b8yTvn1GFhkA-jdkwat35.png)
No multi GPU inference support?
8
#4 opened 7 months ago
by
dataautogpt3
![](https://cdn-avatars.huggingface.co/v1/production/uploads/64430e32e255a338677fe9a7/qPn5pCkKOdpmyTHWFve_U.png)
Llama2 vs Mistral
1
#2 opened 7 months ago
by
lightningRalf
Add languages
#8 opened 7 months ago
by
lbourdois
![](https://cdn-avatars.huggingface.co/v1/production/uploads/613b0a62a14099d5afed7830/pLuqSIYaNYhUqdjxlNrFn.png)
Missing module/classes: from transformers.cache_utils import Cache, DynamicCache
1
#7 opened 7 months ago
by
panopstor
changed "tokenizer" typo to be the one we create.
#4 opened 7 months ago
by
dyngnosis
Which transformers version is being used here?
2
#6 opened 7 months ago
by
Promptengineering
Flash dependency (locks out non-NVIDIA GPUs)
3
#4 opened 7 months ago
by
Thalesian
Update modeling_moe_mistral.py
#5 opened 7 months ago
by
bjoernp
Trying to quantize. Running into the issue below. Any suggestions?
1
#5 opened 7 months ago
by
BigDeeper
small readme fix
#1 opened 7 months ago
by
jphme
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/1gCpz_Og6-LyCBXDmuOd1.jpeg)
Update modeling_moe_mistral.py
2
#1 opened 7 months ago
by
bjoernp
AWQ-Variante
4
#2 opened 7 months ago
by
SebastianBodza
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6400c075f4ff62c2617023f7/DNrK0LTDuj9ZEpXl0_k_A.jpeg)
Little Mistake :)
1
#1 opened 7 months ago
by
DRXD1000
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6474c16e7d131daf633db8ad/oGw2d8zrtbqRl-Wly3sY_.jpeg)
Can you incorporate madlad400 training data ?
1
#11 opened 7 months ago
by
cmp-nct
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6344a1b0762379fc63017e62/g4VIT8l2lZIj6AoQAwVy7.png)
Is this instruction following model?
1
#1 opened 7 months ago
by
rjmehta
fix vocab size
4
#4 opened 7 months ago
by
jphme
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/1gCpz_Og6-LyCBXDmuOd1.jpeg)
Inconsistency in effective batch size reporting
3
#1 opened 8 months ago
by
bjoernp
Update README.md
1
#2 opened 8 months ago
by
waler4ik28
![](https://cdn-avatars.huggingface.co/v1/production/uploads/65362b6edd926ca2ac0caa44/GieWT4bstWA0n07Z9Uuro.jpeg)
Update READEME.md to include system prompt
1
#3 opened 8 months ago
by
aari1995
![](https://cdn-avatars.huggingface.co/v1/production/uploads/5f3801ab7e583543386217ac/4xMdDV1gws7nxCJrU321H.jpeg)
Quantise this model - missing file
1
#10 opened 8 months ago
by
cuh008
gguf version?
1
#2 opened 9 months ago
by
guido1893
Sentence Transformers
2
#8 opened 9 months ago
by
jdjayakaran
Ambiguity in Language detection
5
#7 opened 9 months ago
by
jdjayakaran
tokenizer.model missing?
2
#2 opened 9 months ago
by
darule
Quantised models by thebloke
#1 opened 9 months ago
by
choltha
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6445c17d1cfc9ae6bb3fbfa2/Q8AkjMZIlKrO1J9GpXv6q.png)
Training code
2
#2 opened 9 months ago
by
robert-h
First sentence of the description wrong?
1
#1 opened 9 months ago
by
h3ndrik
Add a `chat template` to this repository
1
#6 opened 9 months ago
by
LLukas22
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62da57f34be126e22e8bed5f/ghmINp1UDr9XnqZVaf_9G.png)
How to achieve better results with fine-tuning
1
#5 opened 9 months ago
by
jdjayakaran
Some weights of LlamaForCausalLM were not initialized from the model checkpoint
1
#3 opened 9 months ago
by
fcivardi
CUDA out of memory applying to a dataset of texts
3
#4 opened 9 months ago
by
fcivardi
how to prompt
2
#5 opened 9 months ago
by
g58892881
Falsche Ausgabe bei Abfrage von Landeshauptstädten:
1
#4 opened 9 months ago
by
darule
Flash attention NVCC requirements
3
#2 opened 9 months ago
by
jdjayakaran
missing tokenizer.model?
7
#2 opened 9 months ago
by
b0968
Can't get any reasonable output
3
#3 opened 9 months ago
by
Sebbecking
tokenizer.model missing?
1
#1 opened 9 months ago
by
b0968
Commercial use
5
#1 opened 9 months ago
by
BramVanroy
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1594192845975-5e1e17b6fcf41d740b6996a8.jpeg)
Is there a problem with year numbers?
3
#1 opened 9 months ago
by
stelterlab
Fixed typo in FP16 and 8bit examples
1
#4 opened over 1 year ago
by
bjoernp