LLMReasoner

community

AI & ML interests

None defined yet.

Recent Activity

Viol2000 authored a paper 5 days ago

ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization

Viol2000 authored a paper 5 days ago

When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models

Viol2000 authored a paper 5 days ago

Efficiently Serving LLM Reasoning Programs with Certaindex

View all activity

LLMReasoner's activity

Viol2000

authored 3 papers 5 days ago

ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization

Paper • 2406.05981 • Published Jun 10, 2024 • 12

When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models

Paper • 2406.07368 • Published Jun 11, 2024 • 2

Efficiently Serving LLM Reasoning Programs with Certaindex

Paper • 2412.20993 • Published 8 days ago • 31

zsqzz

authored a paper 5 days ago

Efficiently Serving LLM Reasoning Programs with Certaindex

Paper • 2412.20993 • Published 8 days ago • 31

zsqzz

authored 2 papers 4 months ago

LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

Paper • 2408.07055 • Published Aug 13, 2024 • 66

Efficient LLM Scheduling by Learning to Rank

Paper • 2408.15792 • Published Aug 28, 2024 • 19

Viol2000

authored 2 papers 4 months ago

Efficient LLM Scheduling by Learning to Rank

Paper • 2408.15792 • Published Aug 28, 2024 • 19

Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

Paper • 2402.02057 • Published Feb 3, 2024