Angelino Santiago

MrDevolver

AI & ML interests

None yet

Recent Activity

new activity about 1 hour ago

CohereForAI/c4ai-command-a-03-2025:Tell me how do you feel about this model without telling me how do you feel about this model

liked a model about 1 hour ago

Spestly/Atlas-Pro-7B-Preview-GGUF

new activity about 4 hours ago

google/gemma-3-12b-it:[System prompt inside] Poor man's R1 based on Gemma 3

Organizations

None yet

MrDevolver's activity

New activity in CohereForAI/c4ai-command-a-03-2025 about 1 hour ago

Tell me how do you feel about this model without telling me how do you feel about this model

#5 opened about 1 hour ago by

New activity in google/gemma-3-12b-it about 4 hours ago

[System prompt inside] Poor man's R1 based on Gemma 3

#7 opened 1 day ago by

New activity in bartowski/RekaAI_reka-flash-3-GGUF 1 day ago

REALLY slow with flash attention and quantized cache.

#2 opened 2 days ago by

New activity in bartowski/RekaAI_reka-flash-3-GGUF 2 days ago

Prompt template

#1 opened 2 days ago by

New activity in Qwen/QwQ-32B 2 days ago

Refining QWQ Model Output: Direct Responses Without Step-by-Step Reasoning

#39 opened 7 days ago by

New activity in bartowski/Qwen_QwQ-32B-GGUF 5 days ago

Wowowowow

#1 opened 8 days ago by

Different than Unsloth?

#8 opened 5 days ago by

New activity in Qwen/QwQ-32B 6 days ago

This model beats Qwen Max!

#33 opened 7 days ago by

8GB GPU can run this,10t/s

#41 opened 7 days ago by

New activity in DavidAU/Qwen2.5-QwQ-35B-Eureka-Cubed 6 days ago

Sixteen knives

#1 opened 6 days ago by

New activity in Qwen/QwQ-32B 7 days ago

Obligatory question about model sizes...

#34 opened 7 days ago by

New activity in bartowski/Qwen_QwQ-32B-GGUF 7 days ago

<think> shortening / summarizing

#7 opened 7 days ago by

New activity in qihoo360/TinyR1-32B-Preview 8 days ago

Output repeating

#1 opened 16 days ago by

New activity in perplexity-ai/r1-1776 12 days ago

Existence of this model is a faux pas, but...

#166 opened 20 days ago by

New activity in deepseek-ai/DeepSeek-R1 12 days ago

Draft model as accelerator for DeepSeek-R1?

#174 opened 16 days ago by

New activity in OddTheGreat/Apparatus_24B 16 days ago

Model getting stuck

#2 opened 17 days ago by

Very nice!

#1 opened 17 days ago by

New activity in deepseek-ai/DeepSeek-R1 17 days ago

Deploying production ready service with Unsloth GGUF quants on your AWS account. (4 x L40S)

#171 opened 17 days ago by samagra-tensorfuse

New activity in OddTheGreat/Famous_Trio_22B 17 days ago

This seems to be a good base model recipe!

#1 opened 17 days ago by

New activity in arcee-ai/Arcee-Maestro-7B-Preview 20 days ago

Good for coding?

#1 opened 21 days ago by