Guangxuan Xiao's picture

3 2 3

Guangxuan Xiao

Guangxuan-Xiao

·

http://guangxuanx.com

Guangxuan-Xiao

AI & ML interests

Efficient Machine Learning

Organizations

Guangxuan-Xiao's activity

commented a paper 3 months ago

DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads

Paper • 2410.10819 • Published Oct 14, 2024 • 6 •

New activity in mit-han-lab/opt-13b-smoothquant about 2 years ago

how to load and use model?

#1 opened about 2 years ago by