Xijie Huang

ScarletAce

https://huangowen.github.io/

HuangOwen

AI & ML interests

Efficient Deep Learning, Model Compression, Large Language Models (LLMs)

Organizations

None yet

ScarletAce's activity

upvoted a paper 2 months ago

SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training

Paper • 2412.09619 • Published Dec 12, 2024 • 23

upvoted a paper 4 months ago

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22, 2024 • 127

upvoted 2 collections 4 months ago

MobileLLM

Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 9 items • Updated Nov 27, 2024 • 107

Stable Diffusion 3.5

6 items • Updated Jan 9 • 132

upvoted a collection 5 months ago

RoLoRA

[EMNLP2024] RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization • 3 items • Updated Sep 26, 2024 • 3

upvoted a paper 5 months ago

RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization

Paper • 2407.08044 • Published Jul 10, 2024 • 1

upvoted 2 papers over 1 year ago

LLM-FP4: 4-Bit Floating-Point Quantized Transformers

Paper • 2310.16836 • Published Oct 25, 2023 • 14

Llama 2: Open Foundation and Fine-Tuned Chat Models

Paper • 2307.09288 • Published Jul 18, 2023 • 244