Haocheng Xi's picture

2 10

Haocheng Xi

Xihc20

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 16 days ago

S*: Test Time Scaling for Code Generation

upvoted a paper 19 days ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

upvoted a paper 2 months ago

SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration

View all activity

Organizations

Xihc20's activity

upvoted a paper 16 days ago

S*: Test Time Scaling for Code Generation

Paper • 2502.14382 • Published 18 days ago • 59

upvoted a paper 19 days ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published 21 days ago • 141

upvoted a paper 2 months ago

SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration

Paper • 2411.10958 • Published Nov 17, 2024 • 53

upvoted a paper 3 months ago

NVILA: Efficient Frontier Visual Language Models

Paper • 2412.04468 • Published Dec 5, 2024 • 58

upvoted an article 4 months ago

Article

Unbelievable! Run 70B LLM Inference on a Single 4GB GPU with This NEW Technique

By

•

Nov 30, 2023

• 34

upvoted 2 papers 4 months ago

T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation

Paper • 2407.14505 • Published Jul 19, 2024 • 26

COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training

Paper • 2410.19313 • Published Oct 25, 2024 • 19

upvoted 2 papers 5 months ago

PUMA: Empowering Unified MLLM with Multi-granular Visual Generation

Paper • 2410.13861 • Published Oct 17, 2024 • 53

SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration

Paper • 2410.02367 • Published Oct 3, 2024 • 48

upvoted a paper over 1 year ago

Training Transformers with 4-bit Integers

Paper • 2306.11987 • Published Jun 21, 2023 • 22