Björn Plüster's picture

Björn Plüster

bjoernp

·

AI & ML interests

None yet

Recent Activity

liked a dataset about 1 month ago

microsoft/orca-agentinstruct-1M-v1

liked a dataset about 2 months ago

wyu1/Leopard-Instruct

View all activity

Organizations

bjoernp's activity

New activity in failspy/Nemotron-4-340B-Instruct-SafeTensors 7 months ago

Can you share how you converted this?

#1 opened 7 months ago by

New activity in nvidia/Nemotron-4-340B-Base 7 months ago

Hf safetensors version

#3 opened 7 months ago by

New activity in LeoLM/leo-mistral-hessianai-7b-chat 8 months ago

use_flash_attention_2=True

#9 opened 8 months ago by

leo-mistral-hessianai-7b-chat for privateGPT

#8 opened 8 months ago by

New activity in DiscoResearch/Llama3_DiscoLM_German_8b_v0.1_experimental 8 months ago

Update tokenizer_config.json

#1 opened 8 months ago by

New activity in LeoLM/leo-hessianai-13b-chat 10 months ago

Problems with flash-attention2

#13 opened 10 months ago by

New activity in DiscoResearch/mixtral-7b-8expert about 1 year ago

Loss function?

#10 opened about 1 year ago by

New activity in mistralai/Mixtral-8x7B-v0.1 about 1 year ago

No multi GPU inference support?

#4 opened about 1 year ago by

New activity in LeoLM/leo-hessianai-7b about 1 year ago

Llama2 vs Mistral

#2 opened about 1 year ago by

New activity in DiscoResearch/mixtral-7b-8expert about 1 year ago

Add languages

#8 opened about 1 year ago by

Missing module/classes: from transformers.cache_utils import Cache, DynamicCache

#7 opened about 1 year ago by

New activity in DiscoResearch/DiscoLM-mixtral-8x7b-v2 about 1 year ago

changed "tokenizer" typo to be the one we create.

#4 opened about 1 year ago by

Which transformers version is being used here?

#6 opened about 1 year ago by

Promptengineering

New activity in DiscoResearch/mixtral-7b-8expert about 1 year ago

Flash dependency (locks out non-NVIDIA GPUs)

#4 opened about 1 year ago by

Update modeling_moe_mistral.py

#5 opened about 1 year ago by

Really appreciate the work put into this! I have noticed a change in the model output since first release.

#3 opened about 1 year ago by

New activity in DiscoResearch/DiscoLM-mixtral-8x7b-v2 about 1 year ago

Trying to quantize. Running into the issue below. Any suggestions?

#5 opened about 1 year ago by

small readme fix

#1 opened about 1 year ago by

New activity in DiscoResearch/mixtral-7b-8expert about 1 year ago

Update modeling_moe_mistral.py

#1 opened about 1 year ago by

New activity in LeoLM/leo-hessianai-70b-chat about 1 year ago

AWQ-Variante

#2 opened about 1 year ago by