Awaker2.5-VL: Stably Scaling MLLMs with Parameter-Efficient Mixture of Experts Paper • 2411.10669 • Published Nov 16 • 10
Mini-Omni2: Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities Paper • 2410.11190 • Published Oct 15 • 20
Article How to build a custom text classifier without days of human labeling By sdiazlor • Oct 17 • 55
Llama-3.1-Nemotron-70B Collection SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. • 6 items • Updated Oct 15 • 146
Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models Paper • 2410.07985 • Published Oct 10 • 28
🍓 Ichigo v0.3 Collection An experimental model family designed to train LLMs to understand sound natively. • 6 items • Updated Nov 11 • 17
Addition is All You Need for Energy-efficient Language Models Paper • 2410.00907 • Published Oct 1 • 144
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 releases. • 15 items • Updated 16 days ago • 545
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning Paper • 2409.12183 • Published Sep 18 • 36
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned variants in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated 24 days ago • 436
Moshi v0.1 Release Collection MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18 • 224