- LLM in a flash: Efficient Large Language Model Inference with Limited Memory
  Paper • 2312.11514 • Published • 259
- Magicoder: Source Code Is All You Need
  Paper • 2312.02120 • Published • 82
- Mixtral of Experts
  Paper • 2401.04088 • Published • 158
- Chain-of-Thought Reasoning Without Prompting
  Paper • 2402.10200 • Published • 105
xiepengli
ginobiLi
AI & ML interests
LLM
Recent Activity
liked a Space about 5 hours ago: Qwen/QwQ-32B-Demo
liked a model 3 days ago: microsoft/Phi-4-mini-instruct
liked a model 3 days ago: Qwen/QwQ-32B-GGUF
Organizations
Collections: 1
models: None public yet
datasets: None public yet