wangdi

huayranus

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago

Team-ACE/ToolACE-8B

liked a model about 2 months ago

tablegpt/TableGPT2-7B

liked a model about 2 months ago

tencent/Tencent-Hunyuan-Large

huayranus's activity

upvoted a paper about 2 months ago

UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models

Paper • 2410.14059 • Published Oct 17 • 54

upvoted a paper 2 months ago

Differential Transformer

Paper • 2410.05258 • Published Oct 7 • 167

upvoted a paper 3 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19 • 135

upvoted 2 papers 4 months ago

Sapiens: Foundation for Human Vision Models

Paper • 2408.12569 • Published Aug 22 • 89

LLM Pruning and Distillation in Practice: The Minitron Approach

Paper • 2408.11796 • Published Aug 21 • 57

upvoted an article 4 months ago

Let's talk about LLM evaluation

Article • May 23 • 140

upvoted a collection 5 months ago

"Physics of Language Models" series

Collection • 6 items • Updated Aug 30 • 38

upvoted an article 5 months ago

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

Article • Jul 23 • 224

upvoted 3 papers 5 months ago

SpreadsheetLLM: Encoding Spreadsheets for Large Language Models

Paper • 2407.09025 • Published Jul 12 • 129

Qwen2-Audio Technical Report

Paper • 2407.10759 • Published Jul 15 • 55

ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation

Paper • 2407.06135 • Published Jul 8 • 20

upvoted a collection 7 months ago

Function Calling Dataset

Collection • 7 items • Updated Dec 5, 2023 • 4