Michael Han
shimmyshimmer
AI & ML interests
None yet
Recent Activity
new activity
3 days ago
unsloth/Hermes-3-Llama-3.1-70B-bnb-4bit:Model card is 405B and not 70B
liked
a model
9 days ago
unsloth/QVQ-72B-Preview-bnb-4bit
Organizations
shimmyshimmer's activity
Model card is 405B and not 70B
1
#1 opened 3 days ago
by
Spestly
RAM requirements for running Llama-3.3-70B-Instruct-Q5_K_M.gguf
1
#4 opened 5 days ago
by
hyadav22
NameError: name 'CohereLayerNorm' is not defined
2
#1 opened 14 days ago
by
joelniklaus
Strange behaviour of Llama3.2-vision - it behaves like text model
1
#9 opened 14 days ago
by
jirkazcech
'LlamaForCausalLM' object has no attribute 'max_seq_length'
3
#8 opened 6 months ago
by
AronVic
Can you post the script that was used to quantize this model please?
10
#2 opened 3 months ago
by
ctranslate2-4you
c4ai-command-r-v01 in one gguf
2
#3 opened 15 days ago
by
Markobes
How to use it in ollama
1
#8 opened 18 days ago
by
vejahetobeu
Base model
1
#2 opened 18 days ago
by
Spestly
can vllm launch this model?
2
#2 opened 18 days ago
by
chopin1998
It is quant of your own finetuned or original model?
6
#1 opened 24 days ago
by
supercharge19
Qwen 2.5 (7B) notebook points to the Gemma 2 9B + Alpaca notebook instead.
1
#4 opened 23 days ago
by
qingy2024
Q4_K_S Please
1
#2 opened 24 days ago
by
pipilok
fixed "3B" to "8B"
1
#2 opened 25 days ago
by
Solshine
Exporting to GGUF
5
#7 opened about 1 month ago
by
krasivayakoshka
Invalid script is provided
1
#1 opened 3 months ago
by
antony-pk
full disk on colab
3
#2 opened 7 months ago
by
Dav22
It is needed to use bnb 4bit?
1
#3 opened 7 months ago
by
bullerwins
Phi3 or Mistral?
2
#3 opened about 1 month ago
by
csabakecskemeti
Issue Loading Llama-3.2 (11B) Vision Instruct Model in Colab
12
#3 opened 3 months ago
by
Sampas