蓋瑞王's picture

蓋瑞王

gary109

·

AI & ML interests

GAN,Music

Recent Activity

liked a Space 5 days ago

TencentARC/BrushEdit

liked a Space 12 days ago

TencentARC/PhotoMaker-Style

liked a model 13 days ago

memoavatar/memo

View all activity

Organizations

None yet

gary109's activity

upvoted a paper 19 days ago

TAPTRv3: Spatial and Temporal Context Foster Robust Tracking of Any Point in Long Video

Paper • 2411.18671 • Published 25 days ago • 20

upvoted a collection about 1 month ago

LLM2CLIP

LLM2CLIP makes SOTA pretrained CLIP modal more SOTA ever. • 10 items • Updated 11 days ago • 47

upvoted a paper about 1 month ago

StdGEN: Semantic-Decomposed 3D Character Generation from Single Images

Paper • 2411.05738 • Published Nov 8 • 14

upvoted 3 papers 3 months ago

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25 • 60

Seeing Faces in Things: A Model and Dataset for Pareidolia

Paper • 2409.16143 • Published Sep 24 • 15

RACER: Rich Language-Guided Failure Recovery Policies for Imitation Learning

Paper • 2409.14674 • Published Sep 23 • 41

upvoted 14 papers 4 months ago

WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling

Paper • 2408.16532 • Published Aug 29 • 47

Scaling Up Diffusion and Flow-based XGBoost Models

Paper • 2408.16046 • Published Aug 28 • 9

Segment Anything with Multiple Modalities

Paper • 2408.09085 • Published Aug 17 • 21

TurboEdit: Instant text-based image editing

Paper • 2408.08332 • Published Aug 14 • 19

Can Large Language Models Understand Symbolic Graphics Programs?

Paper • 2408.08313 • Published Aug 15 • 7

D5RL: Diverse Datasets for Data-Driven Deep Reinforcement Learning

Paper • 2408.08441 • Published Aug 15 • 7

JPEG-LM: LLMs as Image Generators with Canonical Codec Representations

Paper • 2408.08459 • Published Aug 15 • 45

Automated Design of Agentic Systems

Paper • 2408.08435 • Published Aug 15 • 38

PeriodWave: Multi-Period Flow Matching for High-Fidelity Waveform Generation

Paper • 2408.07547 • Published Aug 14 • 7

Rethinking Open-Vocabulary Segmentation of Radiance Fields in 3D Space

Paper • 2408.07416 • Published Aug 14 • 6

Aquila2 Technical Report

Paper • 2408.07410 • Published Aug 14 • 13

3D Gaussian Editing with A Single Image

Paper • 2408.07540 • Published Aug 14 • 10

InfinityMATH: A Scalable Instruction Tuning Dataset in Programmatic Mathematical Reasoning

Paper • 2408.07089 • Published Aug 9 • 13

Generative Photomontage

Paper • 2408.07116 • Published Aug 13 • 19