ligeng-dev

community

ligeng-dev's activity

Ligeng-Zhu

authored 6 papers 11 days ago

LongVILA: Scaling Long-Context Visual Language Models for Long Videos

Paper • 2408.10188 • Published Aug 19, 2024 • 51

VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation

Paper • 2409.04429 • Published Sep 6, 2024

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers

Paper • 2410.10629 • Published Oct 14, 2024 • 9

COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training

Paper • 2410.19313 • Published Oct 25, 2024 • 19

TinyTL: Reduce Activations, Not Trainable Parameters for Efficient On-Device Learning

Paper • 2007.11622 • Published Jul 22, 2020

NVILA: Efficient Frontier Visual Language Models

Paper • 2412.04468 • Published Dec 5, 2024 • 57

Ligeng-Zhu

authored 2 papers 6 months ago

Wolf: Captioning Everything with a World Summarization Framework

Paper • 2407.18908 • Published Jul 26, 2024 • 32

$VILA^2$: VILA Augmented VILA

Paper • 2407.17453 • Published Jul 24, 2024 • 40

Ligeng-Zhu

authored a paper 8 months ago

HAT: Hardware-Aware Transformers for Efficient Natural Language Processing

Paper • 2005.14187 • Published May 28, 2020 • 2

Ligeng-Zhu

authored 5 papers about 1 year ago

On-Device Training Under 256KB Memory

Paper • 2206.15472 • Published Jun 30, 2022

Deep Leakage from Gradients

Paper • 1906.08935 • Published Jun 21, 2019

ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware

Paper • 1812.00332 • Published Dec 2, 2018

Sparsely Aggregated Convolutional Networks

Paper • 1801.05895 • Published Jan 18, 2018

PockEngine: Sparse and Efficient Fine-tuning in a Pocket

Paper • 2310.17752 • Published Oct 26, 2023 • 12