Michael Han

shimmyshimmer

https://unsloth.ai

AI & ML interests

None yet

Recent Activity

new activity 3 days ago

unsloth/Hermes-3-Llama-3.1-70B-bnb-4bit: Model card is 405B and not 70B

new activity 4 days ago

unsloth/Llama-3.3-70B-Instruct-GGUF: RAM requirements for running Llama-3.3-70B-Instruct-Q5_K_M.gguf

liked a model 9 days ago

unsloth/QVQ-72B-Preview-bnb-4bit

shimmyshimmer's activity

New activity in unsloth/Hermes-3-Llama-3.1-70B-bnb-4bit 3 days ago

Model card is 405B and not 70B

#1 opened 3 days ago by

Spestly

New activity in unsloth/Llama-3.3-70B-Instruct-GGUF 4 days ago

RAM requirements for running Llama-3.3-70B-Instruct-Q5_K_M.gguf

#4 opened 5 days ago by

hyadav22

New activity in unsloth/c4ai-command-r-plus-08-2024-bnb-4bit 12 days ago

NameError: name 'CohereLayerNorm' is not defined

#1 opened 14 days ago by

joelniklaus

New activity in unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit 12 days ago

Strange behaviour of Llama3.2-vision - it behaves like text model

#9 opened 14 days ago by

jirkazcech

New activity in unsloth/llama-3-8b-bnb-4bit 12 days ago

'LlamaForCausalLM' object has no attribute 'max_seq_length'

#8 opened 6 months ago by

AronVic

New activity in unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit 14 days ago

Can you post the script that was used to quantize this model please?

#2 opened 3 months ago by

ctranslate2-4you

New activity in unsloth/c4ai-command-r-08-2024-bnb-4bit 14 days ago

c4ai-command-r-v01 in one gguf

#3 opened 15 days ago by

Markobes

New activity in unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit 17 days ago

How to use it in ollama

#8 opened 18 days ago by

vejahetobeu

New activity in unsloth/c4ai-command-r-08-2024-bnb-4bit 18 days ago

Base model

#2 opened 18 days ago by

Spestly

New activity in unsloth/Llama-3.3-70B-Instruct-GGUF 18 days ago

can vllm launch this model?

#2 opened 18 days ago by

chopin1998

New activity in unsloth/Llama-3.3-70B-Instruct-GGUF 20 days ago

It is quant of your own finetuned or original model?

#1 opened 24 days ago by

supercharge19

New activity in unsloth/README 22 days ago

Qwen 2.5 (7B) notebook points to the Gemma 2 9B + Alpaca notebook instead.

#4 opened 23 days ago by

qingy2024

New activity in unsloth/Qwen2.5-Coder-32B-Instruct-128K-GGUF 23 days ago

Q4_K_S Please

#2 opened 24 days ago by

pipilok

New activity in unsloth/Llama-3.2-1B-Instruct-bnb-4bit 24 days ago

fixed "3B" to "8B"

#2 opened 25 days ago by

Solshine

New activity in unsloth/Llama-3.2-11B-Vision-Instruct-bnb-4bit about 1 month ago

Exporting to GGUF

#7 opened about 1 month ago by

krasivayakoshka

New activity in unsloth/Qwen2.5-0.5B about 1 month ago

Invalid script is provided

#1 opened 3 months ago by

antony-pk

New activity in unsloth/mistral-7b-v0.2-bnb-4bit about 1 month ago

full disk on colab

#2 opened 7 months ago by

Dav22

New activity in unsloth/llama-3-8b-Instruct-bnb-4bit about 1 month ago

It is needed to use bnb 4bit?

#3 opened 7 months ago by

bullerwins

New activity in unsloth/Phi-3-medium-4k-instruct-bnb-4bit about 1 month ago

Phi3 or Mistral?

#3 opened about 1 month ago by

csabakecskemeti

New activity in unsloth/Llama-3.2-11B-Vision-Instruct about 1 month ago

Issue Loading Llama-3.2 (11B) Vision Instruct Model in Colab

#3 opened 3 months ago by

Sampas