Angelino Santiago
MrDevolver
AI & ML interests
None yet
Recent Activity
liked
a model
about 1 hour ago
Spestly/Atlas-Pro-7B-Preview-GGUF
new activity
about 4 hours ago
google/gemma-3-12b-it:[System prompt inside] Poor man's R1 based on Gemma 3
Organizations
None yet
MrDevolver's activity
Tell me how do you feel about this model without telling me how do you feel about this model
#5 opened about 1 hour ago
by
MrDevolver

[System prompt inside] Poor man's R1 based on Gemma 3
5
#7 opened 1 day ago
by
MrDevolver

REALLY slow with flash attention and quantized cache.
7
#2 opened 2 days ago
by
Olafangensan
Prompt template
13
#1 opened 2 days ago
by
YearZero
Refining QWQ Model Output: Direct Responses Without Step-by-Step Reasoning
1
#39 opened 7 days ago
by
gslinx
Wowowowow
27
#1 opened 8 days ago
by
owao
Different than Unsloth?
4
#8 opened 5 days ago
by
MrDevolver

This model beats Qwen Max!
4
#33 opened 7 days ago
by
MrDevolver

8GB GPU can run this,10t/s
2
#41 opened 7 days ago
by
wqerrewetw
Sixteen knives
1
#1 opened 6 days ago
by
MrDevolver

Obligatory question about model sizes...
#34 opened 7 days ago
by
MrDevolver

<think> shortening / summarizing
#7 opened 7 days ago
by
MrDevolver

Output repeating
29
#1 opened 16 days ago
by
getfit

Existence of this model is a faux pas, but...
6
#166 opened 20 days ago
by
MrDevolver

Draft model as accelerator for DeepSeek-R1?
4
#174 opened 16 days ago
by
inputout

Model getting stuck
6
#2 opened 17 days ago
by
GhostGate
Very nice!
2
#1 opened 17 days ago
by
MrDevolver

Deploying production ready service with Unsloth GGUF quants on your AWS account. (4 x L40S)
8
#171 opened 17 days ago
by
samagra-tensorfuse
This seems to be a good base model recipe!
2
#1 opened 17 days ago
by
MrDevolver

Good for coding?
3
#1 opened 21 days ago
by
urtuuuu