13 3 1

Hannibal

Hannibal046

Hannibal046

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

upvoted a paper about 2 months ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

upvoted a paper 2 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

View all activity

Organizations

Hannibal046's activity

upvoted 2 papers about 2 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 346

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14 • 276

upvoted a paper 2 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 263

updated a model 8 months ago

Hannibal046/xrag-v1.1-7b

Text Generation • Updated Jul 13, 2024 • 10

updated a model 9 months ago

Hannibal046/gtr_t5_nq_32_stage2

Updated Jun 29, 2024 • 24 • 1

liked a model 9 months ago

bosonai/Higgs-Llama-3-70B

Text Generation • Updated Aug 20, 2024 • 279 • 220

New activity in meta-llama/Meta-Llama-3-8B-Instruct 11 months ago

The request to access the repo has been sent for several days, why hasn't it passed yet?

#70 opened 11 months ago by

water-cui

request to access is still pending a review

#50 opened 11 months ago by

Hoo1196

updated 2 models 11 months ago

Hannibal046/xrag-moe

Text Generation • Updated Apr 24, 2024 • 16

Hannibal046/xrag-7b

Text Generation • Updated Apr 23, 2024 • 2.28k • 1

New activity in mistralai/Mistral-7B-Instruct-v0.1 about 1 year ago

Which padding side to choose while finetuning

#47 opened over 1 year ago by

parikshit1619

New activity in akariasai/PopQA about 1 year ago

type of `possible_answers` is string, not list

#1 opened about 1 year ago by

Hannibal046

updated 8 models about 1 year ago