Saeed's picture

Saeed

MLDataScientist

·

AI & ML interests

None yet

Recent Activity

new activity 5 days ago

MLDataScientist/Mistral-Large-Instruct-2407-GPTQ-3bit:3bit QPTQ quants for Mistral-Large-Instruct-2411

new activity 21 days ago

tomg-group-umd/huginn-0125:Can we quantize the model to GGUF or GPTQ?

new activity 29 days ago

Enturbulate/DeepSeek-v2.5-1210-UD-gguf:Some description with each quant sizes would be nice.

View all activity

Organizations

None yet

MLDataScientist's activity

upvoted a paper about 1 month ago

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published Jan 30 • 56

upvoted a paper about 2 months ago

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Paper • 2501.11873 • Published Jan 21 • 63

upvoted a paper 4 months ago

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18, 2024 • 142

upvoted a paper 6 months ago

Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers

Paper • 2408.06195 • Published Aug 12, 2024 • 70

upvoted a collection 8 months ago

Llama 3.1

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 654

upvoted a collection 9 months ago

Nemotron 4 340B

Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated about 4 hours ago • 162